An open API service providing repository metadata for many open source software ecosystems.

Topic: "data-classification"

mthh/jenkspy

Compute Natural Breaks in Python (Fisher-Jenks algorithm)

Language: Python - Size: 210 KB - Last synced at: 20 days ago - Pushed at: 4 months ago - Stars: 225 - Forks: 28

openraven/mockingbird

A toolset to test data classification engines that generates mock data in various file formats, sizes and data profiles.

Language: Python - Size: 564 KB - Last synced at: 13 days ago - Pushed at: over 1 year ago - Stars: 43 - Forks: 6

nightfallai/nightfall-python-sdk

Python Data Loss Prevention (DLP) SDK - Nightfall Developer Platform

Language: Python - Size: 5.67 MB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 25 - Forks: 13

chgl16/data-mining-algorithm

:bar_chart: 数据挖掘常用算法:关联分析Apriori算法,数据分类决策树算法,数据聚类K-means算法

Language: Python - Size: 8.79 KB - Last synced at: about 1 month ago - Pushed at: about 6 years ago - Stars: 25 - Forks: 7

xinglab-ai/genomap

Cartography of Genomic Interactions Enables Deep Analysis of Single-Cell Expression Data (Nature Communications, 2023)

Language: Python - Size: 43.7 MB - Last synced at: 29 days ago - Pushed at: about 1 year ago - Stars: 19 - Forks: 3

arpitnarechania/binguru

BinGuru is an open-source Typescript package to bin/classify data using 18 established binning methods, including a new method, resiliency.

Language: TypeScript - Size: 66.4 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 16 - Forks: 0

nightfallai/sensitive-data-scanner

Scan directories, exports, and backups for sensitive data (like PII and API keys) with Nightfall's data loss prevention (DLP) APIs. Discover what lives at-rest in your data silos.

Language: Python - Size: 6.84 KB - Last synced at: 9 days ago - Pushed at: over 3 years ago - Stars: 11 - Forks: 2

dandibaroes/K-Nearest-Neighbor-for-data-classification

Using KNN Classifier for data classification of some dataset

Language: MATLAB - Size: 391 KB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 5 - Forks: 0

AvaAvarai/Dynamic_Coordinates_Vis_System

Build visual machine learning models with multidimensional general line coordinate visualizations by interactive classification and synthetic data generation tools.

Language: Python - Size: 30.4 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 4 - Forks: 2

MelvinMo/ROPAC-Rule-OPtimized-Aggregation-Classifier

Discover ROPAC, a novel rule-based classifier we proposed. Here, you'll find the code, data, and original paper detailing this data classification algorithm.

Language: Python - Size: 1.91 MB - Last synced at: 9 days ago - Pushed at: 6 months ago - Stars: 4 - Forks: 0

AvaAvarai/Java_Tabular_Vis_Toolkit

Cross-platform tool for Computational Interactive Visual Learning using lossless General Line Coordinate data visualizations and human-in-the-loop guided classification by eight classifier algorithms to find, test, and boost robust machine learning models with a goal of high case to parameter ratio.

Language: Java - Size: 241 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 1

gabfr/truck-data-wrangler

ELT (Extract, Load, Transform) process of accelerometer/gyroscope events with Apache Spark (w/ Structured Streaming) and TimescaleDB

Language: Jupyter Notebook - Size: 6.58 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 3 - Forks: 1

AvaAvarai/VKD_Tools

Visual Knowledge Discovery tools for interactively visualizing, exploring, and identifying complex n-D data patterns in multivariate CSV data, to visualize machine learning classifier models.

Language: Python - Size: 21.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

mrseanryan/data-type-predictor

Given the name of a property or attribute like 'BrandName' or 'AmountReceived', try to predict a data type like String, Boolean, Integer...

Language: Python - Size: 29.3 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

debmalya92/telecom-churn-prediction

Two differrent approach to predict Churn customers and finding out important variables that drives churn

Language: Python - Size: 26.7 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 4

slerpyyy/vscope

Command Line Data Visualizer

Language: C - Size: 13.7 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 1

lucylow/ML_adversarial_images

Old ML Project - Create adversarial images to fool a MNIST classifier using TensorFlow.

Language: Jupyter Notebook - Size: 11 MB - Last synced at: about 1 year ago - Pushed at: almost 6 years ago - Stars: 2 - Forks: 1

dandibaroes/Fuzzy-Logic

Using Fuzzy Logic to Simplify Recruitment Process in Company

Language: MATLAB - Size: 517 KB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 2 - Forks: 0

lorival/image-classification-by-cifar-10

This project classify images from the CIFAR-10 dataset. The dataset consists of airplanes, dogs, cats, and other objects.

Language: Jupyter Notebook - Size: 129 KB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 2 - Forks: 1

AvaAvarai/DataAnalysisNotebook

This data analysis notebook demonstrates lossless, lossy visualizations techinques, and classification methods. We demonstrate analysis of scientific data on hot-swappable datasets.

Language: Jupyter Notebook - Size: 27 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

qeeqbox/data-classification

Data classification defines and categorizes data according to its type, sensitivity, and value

Size: 91.8 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

sanjaalcorps/NepaliDataClassifiers

Nepali Data Classifiers

Size: 20.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

arpitnarechania/resiliency-app

Resiliency is an ensemble binning method that considers how frequently a geographic entity (e.g., county) falls in a particular bin across multiple comparable data binning methods. This application helps users visualize and interact with the outputs of Resiliency on a variety of datasets.

Language: TypeScript - Size: 1.83 MB - Last synced at: 4 months ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

JeffWang0325/Microsoft-DAT275X-Principles-of-Machine-Learning-Python-Edition

In this data science course, you will be given clear explanations of machine learning theory combined with practical scenarios and hands-on experience building, validating, and deploying machine learning models. You will learn how to build and derive insights from these models using Python, and Azure Notebooks.

Language: Jupyter Notebook - Size: 7.4 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 1

mthh/classif

Library for one-dimensional data classification and simple statistics in Rust

Language: Rust - Size: 22.5 KB - Last synced at: 3 months ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 0

Monish-Nallagondalla/Universal-Bank

Credit Card Ownership Prediction A machine learning project that predicts credit card ownership using features like age and income, balancing class distributions for improved accuracy.

Language: Jupyter Notebook - Size: 87.9 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Tanguy9862/AI-Powered-FDA-Drug-Scraper

Developed a Python-based web scraper leveraging generative AI with LangChain and GPT-4o-mini to extract and classify FDA drug approval data. Processed over 1,770 records, dynamically categorizing medications and treatment areas using LLMs to simplify complex medical information into actionable insights.

Language: Python - Size: 351 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

rubyyy1118/Machine_Learning_Optimization_Study

The Learning From Data - Assignment in my MSc Business Analytics course

Language: Jupyter Notebook - Size: 1.32 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

SamJoeSilvano/Password_Strength_Prediction_using_NLP

Developed a predictive model to categorize passwords as Strong, Good, or Weak, enhancing security and reducing breach risks. The project involves cleaning and analyzing data from an SQL database, using the TF-IDF technique for transformation, and implementing a Logistic Regression model to achieve accurate classifications.

Language: Jupyter Notebook - Size: 2.16 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

dsadriel/INF01124-CPD-TF

Trabalho final da disciplina Classificação e Pesquisa de Dados, ministrada pelo Prof. Leandro Krug Wives

Language: C - Size: 3.03 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

gulcihanglmz/image-classification-data-split

📂 Splits image datasets into training and testing sets for classification tasks. Useful for preparing data for machine learning models.

Language: Python - Size: 5.86 KB - Last synced at: 9 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

ctd12/Predicting-MLB-Hall-of-Fame

Predicting MLB Hall of Fame status based on career statistics and accolades.

Language: R - Size: 401 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Abdulix/Techno_Hacks_EduTech

This repository is a Virtual Internship contains impressive projects related to Machine Learning.

Language: Python - Size: 1.61 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

codeforamerica/classifyr

A tool for aggregating and crowd-sourcing the classification emergency call data

Language: Ruby - Size: 1.23 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 1

CharanKocharla13/OIBSIP

4-Weeks Data Science Internship at Oasis Infobyte

Language: Jupyter Notebook - Size: 1.16 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

GIT-kiran-kumar/Data-analysis

Here You can get access to all my data analysis projects.

Language: Jupyter Notebook - Size: 66.4 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

mac-aron/AI-sentiment-analysis

Mini research project in ML and sentiment analysis

Language: MATLAB - Size: 890 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

ahmedsakrs/Data-Classification

Building multi models to classify numerical data of Gamma, Hadron dataset

Language: Jupyter Notebook - Size: 1.11 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

IaKee/INF01124-Data-Classification-Search-Algorithms

Repository containing projects and algorithms developed for the INF01124 - Data Classification and Search Algorithms course at UFRGS.

Language: Python - Size: 3.51 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

yousefkotp/MAGIC-Gamma-Telescope-Classification

Classification of Gamma and Hadron events by training classifieres and machine learning algorithms on the MAGIC Gamma Telescope dataset.

Language: Jupyter Notebook - Size: 2.1 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Sai-Likhith/Breast_Cancer_Data_Analysis-Using_ANN

Python code for data classification using artificial neural network

Language: Python - Size: 59.6 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 1

Kunika01-dev/Decision_Tree

In this case study, a decision tree is build to predict the income of a given population, which is labelled as <= 50𝐾𝑎𝑛𝑑> 50K on the basis of various attributes (predictors) like age, working class type, marital status, gender, race etc.

Language: Jupyter Notebook - Size: 1.56 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

ElsevierSoftwareX/SOFTX-D-21-00116 Fork of itsoulos/GenClass

GenClass: A portable tool for data classification based on Grammatical Evolution

Size: 2.99 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

bhataparnak/Neural-Network-small-projects

Neural Network Deep learning specialization course offered via Coursera

Language: Jupyter Notebook - Size: 30.5 MB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

debmalya92/credit-card-defaulter-prediction

The model predicts for the next month credit card defaulter based on demographic and last six months behavioral data

Language: Python - Size: 1.41 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 2

vipul43/perceptron

displaying perceptron algorithm to its core

Language: Jupyter Notebook - Size: 123 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

lorival/mini-tensor-flow

This project is a simplified version of TensorFlow, which uses a neural network to predict the price of homes in the Boston area

Language: Python - Size: 6.84 KB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

lorival/sentiment-classification-by-text

This project uses a neural network to classify the sentiment of a review as positive or negative

Language: Jupyter Notebook - Size: 12.1 MB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

Related Topics
machine-learning 15 data-visualization 10 data-analysis 9 data-cleaning 8 python 8 data-science 7 deep-learning 6 neural-network 6 artificial-intelligence 3 pandas 3 visualization 3 data 3 tensorflow 3 data-protection 2 data-privacy 2 data-loss-prevention 2 neural-networks 2 logistic-regression 2 decision-tree-classifier 2 gis 2 data-mining 2 natural-language-processing 2 matlab 2 matplotlib 2 tabular-data 2 data-binning 2 cartography 2 classifier 2 choropleth-map 2 decision-tree 2 conda 2 multidimensional-data 2 jupyter-notebook 2 scikit-learn 2 general-line-coordinates 2 python-3 2 predictive-modeling 2 synthetic-data-generation 2 python3 2 ufrgs 2 nightfall 2 classification 2 classification-algorithm 2 resiliency 1 open-source 1 classification-models 1 typescript-library 1 geovisualization 1 credit-card-prediction 1 geographical-information-system 1 imbalanced-datasets 1 geospatial-visualization 1 data-types 1 choropleth 1 stream 1 multivariate-data 1 opengl 1 parallel-coordinates 1 toolbox 1 visualizing-embeddings 1 computer-science 1 data-search 1 graduation-project 1 demonstration 1 hashing 1 pyhon 1 search-algorithm 1 sorting-algorithms 1 tkinter 1 automation 1 beautifulsoup 1 data-normalization 1 docker 1 generative-ai 1 gpt-4o-mini 1 langchain 1 portfolio 1 binning 1 nlp 1 stemming 1 apriori-algorithm 1 correlation-analysis 1 data-mining-algorithms 1 k-means-clustering 1 api 1 pii 1 sdk 1 secrets-detection 1 bioinfomatics-pipeline 1 bioinformatics 1 bioinformatics-tool 1 biomarker-discovery 1 biomarkers 1 cell-annotation 1 genomic-data-analysis 1 genomics 1 genomics-visualization 1 multi-omic-integration 1 regression-algorithms 1 single-cell 1