An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: under-sampling

Estaban65/transaction-fraud-detection

Machine Learning pipeline for financial transaction fraud detection. Incorporates SMOTE, ensemble models, and neural networks.

Size: 1000 Bytes - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

sushant1827/Credit-Card-Fraud-Detection

Demonstrates the use of ML for Anomaly Detection for Credit Card Transactions: Identifying Fraudulent Activity using Imbalanced Data

Language: Jupyter Notebook - Size: 11.9 MB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

AK-qr/Multi-Label-Classification

Multi-label classification project

Language: Jupyter Notebook - Size: 4.62 MB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

shwetajoshi601/yeast-multilabel-classifier

Multi-label classification approaches on the Yeast dataset

Language: Jupyter Notebook - Size: 919 KB - Last synced at: 2 months ago - Pushed at: about 5 years ago - Stars: 11 - Forks: 5

MylieMudaliyar/Credit-Card-Fraud-Detection

Credit fraud detection on a highly imbalanced dataset of 280k transactions. Multiple ML algorithms (LogisticReg, ShallowNeuralNetwork, RandomForest, SVM, GradientBoosting) are compared for prediction purposes.

Language: Jupyter Notebook - Size: 305 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

sharmaroshan/Fraud-Detection-in-Online-Transactions

Detecting fraud in online transactions using anomaly detection techniques such as over-sampling and under-sampling. Because the ratio of frauds is less than 0.00005, simply applying a classification algorithm may result in overfitting.

Language: Jupyter Notebook - Size: 300 KB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 56 - Forks: 29
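
To make the ratio in the description above concrete, here is a small sketch. It assumes a hypothetical dataset of 1,000,000 transactions containing 50 frauds (a ratio of 0.00005); these counts are illustrative and not taken from the repository. A trivial model that always predicts "not fraud" reaches very high accuracy while catching no fraud at all, which is why plain accuracy is misleading on such data.

```python
# Hypothetical counts chosen to give a fraud ratio of 0.00005;
# they are not taken from the repository's dataset.
total = 1_000_000
frauds = 50

labels = [1] * frauds + [0] * (total - frauds)
predictions = [0] * total  # trivial model: always predict "not fraud"

# Accuracy: share of transactions labelled correctly.
correct = sum(p == y for p, y in zip(predictions, labels))
accuracy = correct / total

# Recall: share of actual frauds that the model flagged.
caught = sum(1 for p, y in zip(predictions, labels) if p == 1 and y == 1)
recall = caught / frauds

print(f"accuracy={accuracy:.5f}, recall={recall:.1f}")
# prints: accuracy=0.99995, recall=0.0
```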

hanfei1986/Undersampling-of-imbalanced-data-with-RandomUnderSampler-and-others

Imbalanced data are common in the real world, especially in anomaly-detection tasks. Handling the imbalance is important; otherwise the predictions are biased towards the majority class. RandomUnderSampler, ClusterCentroids, CondensedNearestNeighbour, and similar tools are useful for under-sampling, that is, removing data from the majority classes.

Language: Jupyter Notebook - Size: 3.9 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0
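
As a rough illustration of the idea behind random under-sampling, the sketch below uses only the Python standard library. It is not the imbalanced-learn implementation of RandomUnderSampler; the function name `random_undersample`, its parameters, and the toy data are assumptions made for this example.

```python
import random
from collections import Counter


def random_undersample(samples, labels, seed=0):
    """Randomly drop samples so that every class keeps as many
    items as the rarest class (hypothetical helper, stdlib only)."""
    rng = random.Random(seed)

    # Group samples by class label.
    by_class = {}
    for x, y in zip(samples, labels):
        by_class.setdefault(y, []).append(x)

    # Every class is reduced to the size of the smallest class.
    target = min(len(xs) for xs in by_class.values())

    out_x, out_y = [], []
    for y, xs in by_class.items():
        kept = rng.sample(xs, target)  # sampling without replacement
        out_x.extend(kept)
        out_y.extend([y] * target)
    return out_x, out_y


# Toy data: 95 majority-class samples (label 0), 5 minority-class (label 1).
X = list(range(100))
y = [0] * 95 + [1] * 5

X_res, y_res = random_undersample(X, y)
print(Counter(y_res))  # both classes now have 5 samples each
```

The minority class is kept whole and only majority samples are discarded, so the balanced set is much smaller than the original. Methods such as ClusterCentroids or CondensedNearestNeighbour choose which majority samples to keep less arbitrarily.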

swapnita-pandey/Credit-Card-Fraud-Detection

Credit Card Fraud Detection Using Machine Learning

Language: Jupyter Notebook - Size: 14.6 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

ankit-kothari/Credit-Risk-Analysis

Predicting the ability of a borrower to pay back the loan through Traditional Machine Learning Models and comparing to Ensembling Methods

Language: Jupyter Notebook - Size: 768 KB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 4

cbrito3/Credit_Risk_Analysis

Supervised Machine Learning and Credit Risk

Language: Jupyter Notebook - Size: 986 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

schatzederwelt/novosib-rzd

Automatic classifier of railway transport objects

Language: Jupyter Notebook - Size: 2.71 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

abhiram-ds/credit_card_fraud_detection

Credit Card Fraud detection based on anonymized data using multiple classification algorithms

Language: Jupyter Notebook - Size: 1.73 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

jabhinav/Data-Science-and-ML-for-Structured-Data-Classification

Repo contains scripts to perform data analysis on structured data. It also provides a comparison of various ML algorithms at different stages of data preparation.

Language: Jupyter Notebook - Size: 522 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Related Keywords
under-sampling (13), logistic-regression (5), imbalanced-data (4), machine-learning (4), over-sampling (4), machine-learning-algorithms (4), random-forest (3), classification (2), random-forest-classifier (2), data-analysis (2), data-visualization (2), python (2), smote (2), anamoly-detection (2), multi-label-classification (2), data-science (2), scikit-learn (2), query (2), binary-relevance (2), classifier-chains (2), decision-tree-classifier (2), decision-trees (2), nueral-networks (1), smote-sampling (1), ensembling-methods (1), credit-risk-analysis (1), boosting (1), under-fitting (1), skicit-learn (1), pandas-library (1), over-fitting (1), numpy-library (1), numpy (1), model-training (1), model-testing (1), model-evaluation (1), data-preparation (1), cost-sensitive-learning (1), binary-classification (1), xgboost (1), skewness (1), pandas (1), credit-card-fraud (1), class-imbalance (1), multiclass-classification (1), smote-oversampler (1), scikitlearn-machine-learning (1), precision-recall (1), naive-random-oversampler (1), imbalance-learning (1), easy-ensemble-classifier (1), cluster-centroids-undersampling (1), balanced-random-forest (1), ada-boost-classifier (1), xgboost-algorithm (1), dataset (1), precision-recall-curve (1), outlier-removal (1), outlier-detection (1), near-miss (1), knn-classifier (1), gridsearchcv (1), exploratory-data-analysis (1), data-scaling (1), correlation-analysis (1), seaborn (1), neural-network (1), fraud-prevention (1), fraud-detection (1), ethereum (1), data-science-projects (1), data-science-portfolio (1), ai-ml (1), csv-files (1), sampling (1), large-dataset (1), finance (1), deep-learning (1), data-analytics (1), confusion-matrix (1), classification-report (1), auprc (1), support-vector-classifier (1), shallow-neural-network (1), gradient-boosting-classifier (1), yeast-dataset (1), ensemble (1), svc (1), roc-auc-curve (1), roc-auc (1)