GitHub topics: undersampling-technique
Emeraldugbeyide93/Undersampling-and-Oversampling-techniques-for-imbalanced-datasets
Beginner friendly project focusing on dataset imbalances using the oversampling and under sampling techniques
Size: 1.95 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

Dihan07/Undersampling-Comparision
Comparision of different undersampling techniques on a credit card dataset
Language: Jupyter Notebook - Size: 18.6 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

SherineTarek224/Credit_Fraud_Detection
This Project focuses on building a Fraud Detection using highly unbalanced Dataset from Kaggle of 170,884 and 305 frauds only using different machine learning models. Using Logistic Regression ,RandomForest and Neural Networks with preprocessing on Training Dataset and hyperparameters Tuning
Language: Jupyter Notebook - Size: 2.93 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

coderjolly/credit-risk-modelling
The aim of the project is to create a robust machine learning model that predicts the likelihood for a bank's customers to fail on their credit payments for the next month. The dataset used contains information on 24028 customers across 26 variables that includes information regarding whether customer defaulted, credit limits, bill history etc.
Language: Jupyter Notebook - Size: 6.33 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Jesly-Joji/Money-Laundering-Classification
Money Laundering Classification on IBM Transactions
Language: Jupyter Notebook - Size: 412 KB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

Demon-2-Angel/Cereberal-Stroke-Analysis
Cerebral stroke, a critical condition, demands vigilant analysis. Machine learning models, coupled with resampling techniques like SMOTEENN, enhance stroke prediction accuracy by addressing imbalanced datasets.
Language: Jupyter Notebook - Size: 1.25 MB - Last synced at: 3 days ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

mmsaki/credit-risks-ml
Using the imbalanced-learn and Scikit-learn libraries to build and evaluate machine learning models.
Language: Jupyter Notebook - Size: 9.02 MB - Last synced at: 2 months ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

Ezgigunbatar/ToxicityDetection
Toxicity detection on imbalanced social media data. Focused on 2 main topics of toxicity detection: Class imbalance problem and detecting toxic comments.
Language: Jupyter Notebook - Size: 3.13 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

RAVI-CHANDRIKA-05/PREDICTING_PROBABILITY_OF_PAYING_BLIGHT_TICKETS
This repository contains files on Predict probability whether a given blight ticket will be paid
Language: HTML - Size: 20.9 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

vicaaa12/advanced-machine-learning
Advanced Machine Learning
Language: Jupyter Notebook - Size: 1.93 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ClJuRo/Classification-of-Good-Loans-and-Bad-Loans-using-ML-algorithms
Predictive Modeling of Credit Risk Faced by a P2P lending platform
Language: Jupyter Notebook - Size: 8.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

joanitolopo/eval-sampling-methods
🔍 Evaluating Sampling Techniques for Healthcare Insurance Fraud Detection in Imbalanced Dataset.
Language: Python - Size: 188 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

AI-14/lumpy-skin-disease-classification
Binary classification of lumpy skin disease (imbalanced dataset) using ML algorithms in addition to oversampling/undersampling techniques.
Language: Jupyter Notebook - Size: 4.76 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

gabrielecola/Imbalance_classification_problem
Language: TeX - Size: 667 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

loopiiu/DSP2_Endterm
Expresso Churn Prediction Challenge - dealing with imbalanced dataset
Language: Jupyter Notebook - Size: 13.4 MB - Last synced at: 8 months ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

TheThugnomist/BankFraud
Building a machine learning model to check bank frauds
Language: Jupyter Notebook - Size: 300 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

vargovema/ccf_classification
Credit card fraud detection
Language: R - Size: 11 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

lamferzon/Heart-disease-analysis
Statistical analysis in R of a heart disease dataset by using logistic regression and random forest.
Language: R - Size: 3.59 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Godson199/Ecological-Footprint-Predictor
In this project, I worked on a classification problem using an imbalanced dataset which predicts ecological footprints. The aim of the project was not necessarily to build a classification model but to investigate the different methods of correcting an imbalanced dataset in order not to build a biased classifier
Language: Jupyter Notebook - Size: 4.7 MB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nafisa-samia/Credit-Card-Fraud-Detection-Dealing-with-Imbalanced-Data
This project is based on supervised machine learning where you will be predicting whether a credit card transaction is original transaction or fraud transaction based on various parameters. This is a classification problem.
Language: Jupyter Notebook - Size: 1.28 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

ramtiin/Credit-Card-Fraud-Detection
In this project, I explore different methods for detecting credit card fraud transactions; including using the Catboost algorithm with undersampling & oversampling methods, and using an almost new approach, by using deep learning and autoencoder.
Language: Jupyter Notebook - Size: 737 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 0

Soumyapro/Credit-Card-Fraud-Detection
The project was intended to detect fraudulent transactions from a highly imbalanced dataset.To solve the imbalance dataset problem random undersampling techniques were used.
Language: Jupyter Notebook - Size: 21.5 KB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

kuntala-c/A-streamlit-app-to-predict-customer-purchase-using-ML-model Fork of nafisa-samia/A-streamlit-app-to-predict-customer-purchase-using-ML-model
A Machine Learning model that predicts the customer's possibility of purchase using historical data.
Language: Jupyter Notebook - Size: 3.68 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

COOLMudi/Data-Scientist-Spotify-Skip-Action-Prediction-
It's Technocolabs Software Data Scientist Internship Project (1st Dec 2021 - 15th Jan 2022). In this project the team was instructed to analysis big data of Spotify users and to perform Statistical and Exploratory Data Analysis and Model Development for Predicting Listener Behavior.
Language: Jupyter Notebook - Size: 37.5 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

Sophy8281/sms-spam-detection
Spam messages detection model
Language: Jupyter Notebook - Size: 62.5 KB - Last synced at: 3 months ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

vivekyadav07/Credit-Card-Fraud-Detection
ML Project ,XGboost .Logistic Regression as classification,Decision Tree & balancing technique Undersampling & SMOTE.
Language: Jupyter Notebook - Size: 3.95 MB - Last synced at: 4 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 1

aolayeye/Credit_Risk_Analysis
Predicting Credit Risk with undersampling, oversampling, and ensemble methods across various machine learning algorithms.
Language: Jupyter Notebook - Size: 37 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

disha2sinha/Reducing-Imbalanced-dataset-by-Under-sampling-approach-Consensus-Clustering
Under-sampling based consensus clustering is applied on the three best clustering algorithms found after applying several Clustering Algorithms like K-means, K-modes, K-prototypes , K-means++ and fuzzy K-means on the majority class data of the IMBALANCED colon dataset to produce a BALANCED dataset.
Language: Jupyter Notebook - Size: 4.11 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1
