GitHub topics: data-classification
AvaAvarai/Java_Tabular_Vis_Toolkit
Cross-platform tool for Computational Interactive Visual Learning using lossless General Line Coordinate data visualizations and human-in-the-loop guided classification by eight classifier algorithms to find, test, and boost robust machine learning models with a goal of high case to parameter ratio.
Language: Java - Size: 241 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 3 - Forks: 1

mthh/jenkspy
Compute Natural Breaks in Python (Fisher-Jenks algorithm)
Language: Python - Size: 210 KB - Last synced at: 7 days ago - Pushed at: 2 months ago - Stars: 224 - Forks: 28
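For readers unfamiliar with natural breaks: the idea is to split sorted one-dimensional values into classes so that the total squared deviation from each class mean is as small as possible. The sketch below illustrates that objective with an exhaustive search over cut positions. It is not jenkspy's implementation or the Fisher-Jenks dynamic program, and the function names `sum_sq_dev` and `natural_breaks_bruteforce` are hypothetical. It is practical only for small inputs.

```python
from itertools import combinations


def sum_sq_dev(values):
    """Sum of squared deviations of the values from their mean."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values)


def natural_breaks_bruteforce(values, n_classes):
    """Split values into n_classes contiguous groups (after sorting)
    that minimize the total within-class squared deviation.

    Illustrative exhaustive search, not the Fisher-Jenks algorithm.
    """
    data = sorted(values)
    n = len(data)
    if not 1 <= n_classes <= n:
        raise ValueError("n_classes must be between 1 and len(values)")

    best_cost = float("inf")
    best_cuts = ()
    # Each candidate is a tuple of cut indices between sorted elements.
    for cuts in combinations(range(1, n), n_classes - 1):
        bounds = (0, *cuts, n)
        cost = sum(sum_sq_dev(data[a:b]) for a, b in zip(bounds, bounds[1:]))
        if cost < best_cost:
            best_cost, best_cuts = cost, cuts

    bounds = (0, *best_cuts, n)
    return [data[a:b] for a, b in zip(bounds, bounds[1:])]
```

Calling `natural_breaks_bruteforce([1, 2, 3, 10, 11, 12, 50, 51], 3)` groups the values into the three visually obvious clusters.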

gabfr/truck-data-wrangler
ELT (Extract, Load, Transform) process of accelerometer/gyroscope events with Apache Spark (w/ Structured Streaming) and TimescaleDB
Language: Jupyter Notebook - Size: 6.58 MB - Last synced at: 12 days ago - Pushed at: 6 months ago - Stars: 3 - Forks: 1

arpitnarechania/resiliency-app
Resiliency is an ensemble binning method that considers how frequently a geographic entity (e.g., county) falls in a particular bin across multiple comparable data binning methods. This application helps users visualize and interact with the outputs of Resiliency on a variety of datasets.
Language: TypeScript - Size: 1.83 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0
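The description above can be illustrated with a rough sketch: bin the same values with several methods, then count how often each item lands in each bin. This is only an assumption-based illustration of the counting step. The two binning methods chosen here, and the names `equal_interval_bins`, `quantile_bins`, and `bin_frequencies`, are hypothetical and are not taken from the Resiliency code; how the real method turns these counts into a final bin is not stated in the listing.

```python
from collections import Counter


def equal_interval_bins(values, k):
    """Assign each value to one of k equal-width bins (0 .. k-1)."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / k or 1.0  # fall back to 1.0 if all values are equal
    return [min(int((v - lo) / width), k - 1) for v in values]


def quantile_bins(values, k):
    """Assign each value to one of k bins holding roughly equal counts."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    bins = [0] * len(values)
    for rank, i in enumerate(order):
        bins[i] = rank * k // len(values)
    return bins


def bin_frequencies(assignments):
    """Given one bin list per method, count for each item how often
    it falls in each bin across the methods."""
    return [Counter(per_item) for per_item in zip(*assignments)]


# Hypothetical data: six entities, two bins, two methods.
values = [1, 2, 3, 4, 100, 101]
freqs = bin_frequencies([
    equal_interval_bins(values, 2),
    quantile_bins(values, 2),
])
```

In this toy example the two methods agree on every item except the value 4, which is the kind of disagreement an ensemble approach would need to resolve.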

Monish-Nallagondalla/Universal-Bank
Credit Card Ownership Prediction: a machine learning project that predicts credit card ownership using features like age and income, balancing class distributions for improved accuracy.
Language: Jupyter Notebook - Size: 87.9 KB - Last synced at: 16 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Tanguy9862/AI-Powered-FDA-Drug-Scraper
Developed a Python-based web scraper leveraging generative AI with LangChain and GPT-4o-mini to extract and classify FDA drug approval data. Processed over 1,770 records, dynamically categorizing medications and treatment areas using LLMs to simplify complex medical information into actionable insights.
Language: Python - Size: 351 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

arpitnarechania/binguru
BinGuru is an open-source TypeScript package to bin/classify data using 18 established binning methods, including a new method, resiliency.
Language: TypeScript - Size: 66.4 KB - Last synced at: 20 days ago - Pushed at: over 1 year ago - Stars: 16 - Forks: 0

AvaAvarai/Dynamic_Coordinates_Vis_System
Build visual machine learning models with multidimensional general line coordinate visualizations by interactive classification and synthetic data generation tools.
Language: Python - Size: 30.4 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 4 - Forks: 2

chgl16/data-mining-algorithm
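:bar_chart: Common data mining algorithms: the Apriori algorithm for association analysis, the decision tree algorithm for data classification, and the K-means algorithm for data clustering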
Language: Python - Size: 8.79 KB - Last synced at: about 24 hours ago - Pushed at: almost 6 years ago - Stars: 24 - Forks: 7

xinglab-ai/genomap
Cartography of Genomic Interactions Enables Deep Analysis of Single-Cell Expression Data (Nature Communications, 2023)
Language: Python - Size: 43.7 MB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 15 - Forks: 3

openraven/mockingbird
A toolset to test data classification engines that generates mock data in various file formats, sizes and data profiles.
Language: Python - Size: 564 KB - Last synced at: 19 days ago - Pushed at: over 1 year ago - Stars: 43 - Forks: 6

rubyyy1118/Machine_Learning_Optimization_Study
The Learning From Data - Assignment in my MSc Business Analytics course
Language: Jupyter Notebook - Size: 1.32 MB - Last synced at: 13 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

SamJoeSilvano/Password_Strength_Prediction_using_NLP
Developed a predictive model to categorize passwords as Strong, Good, or Weak, enhancing security and reducing breach risks. The project involves cleaning and analyzing data from an SQL database, using the TF-IDF technique for transformation, and implementing a Logistic Regression model to achieve accurate classifications.
Language: Jupyter Notebook - Size: 2.16 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

dsadriel/INF01124-CPD-TF
Final project for the course Classificação e Pesquisa de Dados (Data Classification and Search), taught by Prof. Leandro Krug Wives
Language: C - Size: 3.03 MB - Last synced at: 30 days ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

nightfallai/nightfall-python-sdk
Python Data Loss Prevention (DLP) SDK - Nightfall Developer Platform
Language: Python - Size: 5.67 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 25 - Forks: 13

nightfallai/sensitive-data-scanner
Scan directories, exports, and backups for sensitive data (like PII and API keys) with Nightfall's data loss prevention (DLP) APIs. Discover what lives at-rest in your data silos.
Language: Python - Size: 6.84 KB - Last synced at: 6 days ago - Pushed at: about 3 years ago - Stars: 10 - Forks: 2

gulcihanglmz/image-classification-data-split
📂 Splits image datasets into training and testing sets for classification tasks. Useful for preparing data for machine learning models.
Language: Python - Size: 5.86 KB - Last synced at: 8 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

AvaAvarai/DataAnalysisNotebook
This data analysis notebook demonstrates lossless and lossy visualization techniques and classification methods. We demonstrate analysis of scientific data on hot-swappable datasets.
Language: Jupyter Notebook - Size: 27 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

ctd12/Predicting-MLB-Hall-of-Fame
Predicting MLB Hall of Fame status based on career statistics and accolades.
Language: R - Size: 401 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

JeffWang0325/Microsoft-DAT275X-Principles-of-Machine-Learning-Python-Edition
In this data science course, you will be given clear explanations of machine learning theory combined with practical scenarios and hands-on experience building, validating, and deploying machine learning models. You will learn how to build and derive insights from these models using Python and Azure Notebooks.
Language: Jupyter Notebook - Size: 7.4 MB - Last synced at: 12 months ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 1

AvaAvarai/VKD_Tools
Visual Knowledge Discovery tools for interactively visualizing, exploring, and identifying complex n-D data patterns in multivariate CSV data, to visualize machine learning classifier models.
Language: Python - Size: 21.9 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

MelvinMo/ROPAC-Rule-OPtimized-Aggregation-Classifier
Python code for the ROPAC data classification algorithm
Language: Python - Size: 1020 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Abdulix/Techno_Hacks_EduTech
This repository is for a Virtual Internship and contains impressive projects related to Machine Learning.
Language: Python - Size: 1.61 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

mrseanryan/data-type-predictor
Given the name of a property or attribute like 'BrandName' or 'AmountReceived', try to predict a data type like String, Boolean, Integer...
Language: Python - Size: 29.3 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

CharanKocharla13/OIBSIP
4-Weeks Data Science Internship at Oasis Infobyte
Language: Jupyter Notebook - Size: 1.16 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

dandibaroes/K-Nearest-Neighbor-for-data-classification
Using KNN Classifier for data classification of some dataset
Language: MATLAB - Size: 391 KB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 5 - Forks: 0

dandibaroes/Fuzzy-Logic
Using Fuzzy Logic to Simplify Recruitment Process in Company
Language: MATLAB - Size: 517 KB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 2 - Forks: 0

sanjaalcorps/NepaliDataClassifiers
Nepali Data Classifiers
Size: 20.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

lucylow/ML_adversarial_images
Old ML Project - Create adversarial images to fool a MNIST classifier using TensorFlow.
Language: Jupyter Notebook - Size: 11 MB - Last synced at: 12 months ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 1

debmalya92/telecom-churn-prediction
Two different approaches to predicting customer churn and identifying the important variables that drive churn
Language: Python - Size: 26.7 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 4

debmalya92/credit-card-defaulter-prediction
The model predicts next month's credit card defaulters based on demographic data and the last six months of behavioral data
Language: Python - Size: 1.41 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 2

lorival/mini-tensor-flow
This project is a simplified version of TensorFlow, which uses a neural network to predict the price of homes in the Boston area
Language: Python - Size: 6.84 KB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

lorival/image-classification-by-cifar-10
This project classifies images from the CIFAR-10 dataset. The dataset consists of airplanes, dogs, cats, and other objects.
Language: Jupyter Notebook - Size: 129 KB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 2 - Forks: 1

lorival/sentiment-classification-by-text
This project uses a neural network to classify the sentiment of a review as positive or negative
Language: Jupyter Notebook - Size: 12.1 MB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

qeeqbox/data-classification
Data classification defines and categorizes data according to its type, sensitivity, and value
Size: 91.8 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

mac-aron/AI-sentiment-analysis
Mini research project in ML and sentiment analysis
Language: MATLAB - Size: 890 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

GIT-kiran-kumar/Data-analysis
Here you can get access to all my data analysis projects.
Language: Jupyter Notebook - Size: 66.4 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

IaKee/INF01124-Data-Classification-Search-Algorithms
Repository containing projects and algorithms developed for the INF01124 - Data Classification and Search Algorithms course at UFRGS.
Language: Python - Size: 3.51 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

ahmedsakrs/Data-Classification
Building multiple models to classify numerical data from the Gamma/Hadron dataset
Language: Jupyter Notebook - Size: 1.11 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

yousefkotp/MAGIC-Gamma-Telescope-Classification
Classification of Gamma and Hadron events by training classifiers and machine learning algorithms on the MAGIC Gamma Telescope dataset.
Language: Jupyter Notebook - Size: 2.1 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

slerpyyy/vscope
Command Line Data Visualizer
Language: C - Size: 13.7 KB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 1

codeforamerica/classifyr
A tool for aggregating and crowd-sourcing the classification of emergency call data
Language: Ruby - Size: 1.23 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

Sai-Likhith/Breast_Cancer_Data_Analysis-Using_ANN
Python code for data classification using artificial neural network
Language: Python - Size: 59.6 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 1

Kunika01-dev/Decision_Tree
In this case study, a decision tree is built to predict the income of a given population, which is labelled as <= 50K and > 50K on the basis of various attributes (predictors) like age, working class type, marital status, gender, race etc.
Language: Jupyter Notebook - Size: 1.56 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

ElsevierSoftwareX/SOFTX-D-21-00116 Fork of itsoulos/GenClass
GenClass: A portable tool for data classification based on Grammatical Evolution
Size: 2.99 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

bhataparnak/Neural-Network-small-projects
Neural Network Deep learning specialization course offered via Coursera
Language: Jupyter Notebook - Size: 30.5 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

vipul43/perceptron
Demonstrating the perceptron algorithm at its core
Language: Jupyter Notebook - Size: 123 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

mthh/classif
Library for one-dimensional data classification and simple statistics in Rust
Language: Rust - Size: 22.5 KB - Last synced at: 27 days ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0
