An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: data-classification

AvaAvarai/Java_Tabular_Vis_Toolkit

Cross-platform tool for Computational Interactive Visual Learning using lossless General Line Coordinate data visualizations and human-in-the-loop guided classification by eight classifier algorithms to find, test, and boost robust machine learning models with a goal of high case to parameter ratio.

Language: Java - Size: 241 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 3 - Forks: 1

mthh/jenkspy

Compute Natural Breaks in Python (Fisher-Jenks algorithm)

Language: Python - Size: 210 KB - Last synced at: 7 days ago - Pushed at: 2 months ago - Stars: 224 - Forks: 28

gabfr/truck-data-wrangler

ELT (Extract, Load, Transform) process of accelerometer/gyroscope events with Apache Spark (w/ Structured Streaming) and TimescaleDB

Language: Jupyter Notebook - Size: 6.58 MB - Last synced at: 12 days ago - Pushed at: 6 months ago - Stars: 3 - Forks: 1

arpitnarechania/resiliency-app

Resiliency is an ensemble binning method that considers how frequently a geographic entity (e.g., county) falls in a particular bin across multiple comparable data binning methods. This application helps users visualize and interact with the outputs of Resiliency on a variety of datasets.

Language: TypeScript - Size: 1.83 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Monish-Nallagondalla/Universal-Bank

Credit Card Ownership Prediction A machine learning project that predicts credit card ownership using features like age and income, balancing class distributions for improved accuracy.

Language: Jupyter Notebook - Size: 87.9 KB - Last synced at: 16 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Tanguy9862/AI-Powered-FDA-Drug-Scraper

Developed a Python-based web scraper leveraging generative AI with LangChain and GPT-4o-mini to extract and classify FDA drug approval data. Processed over 1,770 records, dynamically categorizing medications and treatment areas using LLMs to simplify complex medical information into actionable insights.

Language: Python - Size: 351 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

arpitnarechania/binguru

BinGuru is an open-source Typescript package to bin/classify data using 18 established binning methods, including a new method, resiliency.

Language: TypeScript - Size: 66.4 KB - Last synced at: 20 days ago - Pushed at: over 1 year ago - Stars: 16 - Forks: 0

AvaAvarai/Dynamic_Coordinates_Vis_System

Build visual machine learning models with multidimensional general line coordinate visualizations by interactive classification and synthetic data generation tools.

Language: Python - Size: 30.4 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 4 - Forks: 2

chgl16/data-mining-algorithm

:bar_chart: 数据挖掘常用算法:关联分析Apriori算法,数据分类决策树算法,数据聚类K-means算法

Language: Python - Size: 8.79 KB - Last synced at: about 24 hours ago - Pushed at: almost 6 years ago - Stars: 24 - Forks: 7

xinglab-ai/genomap

Cartography of Genomic Interactions Enables Deep Analysis of Single-Cell Expression Data (Nature Communications, 2023)

Language: Python - Size: 43.7 MB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 15 - Forks: 3

openraven/mockingbird

A toolset to test data classification engines that generates mock data in various file formats, sizes and data profiles.

Language: Python - Size: 564 KB - Last synced at: 19 days ago - Pushed at: over 1 year ago - Stars: 43 - Forks: 6

rubyyy1118/Machine_Learning_Optimization_Study

The Learning From Data - Assignment in my MSc Business Analytics course

Language: Jupyter Notebook - Size: 1.32 MB - Last synced at: 13 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

SamJoeSilvano/Password_Strength_Prediction_using_NLP

Developed a predictive model to categorize passwords as Strong, Good, or Weak, enhancing security and reducing breach risks. The project involves cleaning and analyzing data from an SQL database, using the TF-IDF technique for transformation, and implementing a Logistic Regression model to achieve accurate classifications.

Language: Jupyter Notebook - Size: 2.16 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

dsadriel/INF01124-CPD-TF

Trabalho final da disciplina Classificação e Pesquisa de Dados, ministrada pelo Prof. Leandro Krug Wives

Language: C - Size: 3.03 MB - Last synced at: 30 days ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

nightfallai/nightfall-python-sdk

Python Data Loss Prevention (DLP) SDK - Nightfall Developer Platform

Language: Python - Size: 5.67 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 25 - Forks: 13

nightfallai/sensitive-data-scanner

Scan directories, exports, and backups for sensitive data (like PII and API keys) with Nightfall's data loss prevention (DLP) APIs. Discover what lives at-rest in your data silos.

Language: Python - Size: 6.84 KB - Last synced at: 6 days ago - Pushed at: about 3 years ago - Stars: 10 - Forks: 2

gulcihanglmz/image-classification-data-split

📂 Splits image datasets into training and testing sets for classification tasks. Useful for preparing data for machine learning models.

Language: Python - Size: 5.86 KB - Last synced at: 8 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

AvaAvarai/DataAnalysisNotebook

This data analysis notebook demonstrates lossless, lossy visualizations techinques, and classification methods. We demonstrate analysis of scientific data on hot-swappable datasets.

Language: Jupyter Notebook - Size: 27 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

ctd12/Predicting-MLB-Hall-of-Fame

Predicting MLB Hall of Fame status based on career statistics and accolades.

Language: R - Size: 401 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

JeffWang0325/Microsoft-DAT275X-Principles-of-Machine-Learning-Python-Edition

In this data science course, you will be given clear explanations of machine learning theory combined with practical scenarios and hands-on experience building, validating, and deploying machine learning models. You will learn how to build and derive insights from these models using Python, and Azure Notebooks.

Language: Jupyter Notebook - Size: 7.4 MB - Last synced at: 12 months ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 1

AvaAvarai/VKD_Tools

Visual Knowledge Discovery tools for interactively visualizing, exploring, and identifying complex n-D data patterns in multivariate CSV data, to visualize machine learning classifier models.

Language: Python - Size: 21.9 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

MelvinMo/ROPAC-Rule-OPtimized-Aggregation-Classifier

Python code for the ROPAC data classification algorithm

Language: Python - Size: 1020 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Abdulix/Techno_Hacks_EduTech

This repository is a Virtual Internship contains impressive projects related to Machine Learning.

Language: Python - Size: 1.61 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

mrseanryan/data-type-predictor

Given the name of a property or attribute like 'BrandName' or 'AmountReceived', try to predict a data type like String, Boolean, Integer...

Language: Python - Size: 29.3 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

CharanKocharla13/OIBSIP

4-Weeks Data Science Internship at Oasis Infobyte

Language: Jupyter Notebook - Size: 1.16 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

dandibaroes/K-Nearest-Neighbor-for-data-classification

Using KNN Classifier for data classification of some dataset

Language: MATLAB - Size: 391 KB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 5 - Forks: 0

dandibaroes/Fuzzy-Logic

Using Fuzzy Logic to Simplify Recruitment Process in Company

Language: MATLAB - Size: 517 KB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 2 - Forks: 0

sanjaalcorps/NepaliDataClassifiers

Nepali Data Classifiers

Size: 20.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

lucylow/ML_adversarial_images

Old ML Project - Create adversarial images to fool a MNIST classifier using TensorFlow.

Language: Jupyter Notebook - Size: 11 MB - Last synced at: 12 months ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 1

debmalya92/telecom-churn-prediction

Two differrent approach to predict Churn customers and finding out important variables that drives churn

Language: Python - Size: 26.7 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 4

debmalya92/credit-card-defaulter-prediction

The model predicts for the next month credit card defaulter based on demographic and last six months behavioral data

Language: Python - Size: 1.41 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 2

lorival/mini-tensor-flow

This project is a simplified version of TensorFlow, which uses a neural network to predict the price of homes in the Boston area

Language: Python - Size: 6.84 KB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

lorival/image-classification-by-cifar-10

This project classify images from the CIFAR-10 dataset. The dataset consists of airplanes, dogs, cats, and other objects.

Language: Jupyter Notebook - Size: 129 KB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 2 - Forks: 1

lorival/sentiment-classification-by-text

This project uses a neural network to classify the sentiment of a review as positive or negative

Language: Jupyter Notebook - Size: 12.1 MB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

qeeqbox/data-classification

Data classification defines and categorizes data according to its type, sensitivity, and value

Size: 91.8 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

mac-aron/AI-sentiment-analysis

Mini research project in ML and sentiment analysis

Language: MATLAB - Size: 890 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

GIT-kiran-kumar/Data-analysis

Here You can get access to all my data analysis projects.

Language: Jupyter Notebook - Size: 66.4 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

IaKee/INF01124-Data-Classification-Search-Algorithms

Repository containing projects and algorithms developed for the INF01124 - Data Classification and Search Algorithms course at UFRGS.

Language: Python - Size: 3.51 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

ahmedsakrs/Data-Classification

Building multi models to classify numerical data of Gamma, Hadron dataset

Language: Jupyter Notebook - Size: 1.11 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

yousefkotp/MAGIC-Gamma-Telescope-Classification

Classification of Gamma and Hadron events by training classifieres and machine learning algorithms on the MAGIC Gamma Telescope dataset.

Language: Jupyter Notebook - Size: 2.1 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

slerpyyy/vscope

Command Line Data Visualizer

Language: C - Size: 13.7 KB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 1

codeforamerica/classifyr

A tool for aggregating and crowd-sourcing the classification emergency call data

Language: Ruby - Size: 1.23 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

Sai-Likhith/Breast_Cancer_Data_Analysis-Using_ANN

Python code for data classification using artificial neural network

Language: Python - Size: 59.6 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 1

Kunika01-dev/Decision_Tree

In this case study, a decision tree is build to predict the income of a given population, which is labelled as <= 50𝐾𝑎𝑛𝑑> 50K on the basis of various attributes (predictors) like age, working class type, marital status, gender, race etc.

Language: Jupyter Notebook - Size: 1.56 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

ElsevierSoftwareX/SOFTX-D-21-00116 Fork of itsoulos/GenClass

GenClass: A portable tool for data classification based on Grammatical Evolution

Size: 2.99 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

bhataparnak/Neural-Network-small-projects

Neural Network Deep learning specialization course offered via Coursera

Language: Jupyter Notebook - Size: 30.5 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

vipul43/perceptron

displaying perceptron algorithm to its core

Language: Jupyter Notebook - Size: 123 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

mthh/classif

Library for one-dimensional data classification and simple statistics in Rust

Language: Rust - Size: 22.5 KB - Last synced at: 27 days ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

Related Keywords
data-classification 48 machine-learning 15 data-visualization 10 data-analysis 9 python 8 data-cleaning 8 data-science 7 neural-network 6 deep-learning 6 visualization 3 pandas 3 artificial-intelligence 3 data 3 tensorflow 3 data-mining 2 classifier 2 predictive-modeling 2 python-3 2 nightfall 2 data-protection 2 data-privacy 2 general-line-coordinates 2 data-loss-prevention 2 neural-networks 2 multidimensional-data 2 synthetic-data-generation 2 ufrgs 2 conda 2 decision-tree 2 jupyter-notebook 2 classification 2 classification-algorithm 2 natural-language-processing 2 logistic-regression 2 tabular-data 2 python3 2 matlab 2 cartography 2 choropleth-map 2 data-binning 2 gis 2 matplotlib 2 decision-tree-classifier 2 scikit-learn 2 ai 1 data-types 1 machine-learning-algorithms 1 rule-based-classifier 1 nlp 1 infosecsimplified 1 sentiment-classification 1 ropac 1 stemming 1 qeeqbox 1 image-classification 1 google-colaboratory 1 convolutional-networks 1 machine-learning-practice 1 credit-card-defaulter-prediction 1 telecom-churn-prediction 1 confusion-matrix 1 knearest-neighbor 1 fuzzy-logic 1 adversarial-attacks 1 adversarial-example 1 tensrflow 1 numpy 1 adversarial-images 1 mnist 1 machine 1 learning 1 adversarial-machine-learning 1 classifer 1 fool 1 sentiment-analysis 1 genetic-algorithm 1 grammatical-evolution 1 stochastic-methods 1 adam-optimizer 1 convolutional-neural-networks 1 face-recognition 1 gradient-checking 1 hyperparameter-optimization 1 image-segmentation 1 l2regularization 1 mobilenetv2 1 neural-style-transfer 1 optimization-methods 1 rnn 1 self-driving-car 1 transfer-learning 1 unet-image-segmentation 1 word-embeddings 1 yolo 1 linearly-separable 1 matplotlib-pyplot 1 numpy-library 1 perceptron-algorithm 1 rust-library 1 statistics 1