An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: unbalanced-data

SNBQT/Limited-Data-Rolling-Bearing-Fault-Diagnosis-with-Few-shot-Learning

This is the corresponding repository of paper Limited Data Rolling Bearing Fault Diagnosis with Few-shot Learning

Language: Jupyter Notebook - Size: 1.18 MB - Last synced at: 11 days ago - Pushed at: over 2 years ago - Stars: 338 - Forks: 47

SakinaJaffri/British_Airway_Virtual_Internship

Web Scrapping British Airways review to gain company insights. Build a random forest model to predict customer buying behavior.

Language: Jupyter Notebook - Size: 3.03 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 2 - Forks: 0

SherineTarek224/Credit_Fraud_Detection

This Project focuses on building a Fraud Detection using highly unbalanced Dataset from Kaggle of 170,884 and 305 frauds only using different machine learning models. Using Logistic Regression ,RandomForest and Neural Networks with preprocessing on Training Dataset and hyperparameters Tuning

Language: Jupyter Notebook - Size: 2.93 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

RS2002/Label-Unbalance-in-High-Frequency-Trading

Official Repository for The Technical Report, Label Unbalance in High-frequency Trading

Language: Python - Size: 305 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

SaeidRostami/Customer_Churn

Customer churn analysis for a telecommunication company

Language: Jupyter Notebook - Size: 2.24 MB - Last synced at: 12 days ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 4

Maguids/Enhanced_KNN

This project consists on improving KNN to be able to better deal with imbalanced classes. Project for the "Machine Learning" course on the Second Semester of the Second Year of the Bachelor's Degree in Artificial Intelligence and Data Science.

Language: Jupyter Notebook - Size: 6.15 MB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

AnneQuinkenstein/AccidentsInBerlin

Mithilfe von Machine Learning und Open Data zu Unfällen in Berlin (2018-2021) beantworten wir folgende Frage: Was sind die wichtigen Faktoren/Einflüsse auf Unfallgefahr? Und wie gut lässt sich damit die Unfallschwere überhaupt vorhersagen?

Language: HTML - Size: 79.2 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

arianaira/mlp-unbalanced-classification

predicting showing up to doctor's appointment using mlp on imbalance dataset.

Language: Jupyter Notebook - Size: 1.74 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

WojciechMigda/TCO-CustomerChurn

Customer Churn (Drop Off) Modeling

Language: C++ - Size: 132 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Safaa-p/Machine-Failure-Prediction

Predicting Machine failure using Machine learning on a synthetic dataset of an existing milling machine consisting of 10,000 data points

Language: Jupyter Notebook - Size: 4.7 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Leoelsilva/predictiveanalytics

(Python) Proyecto enfocado en la creación de modelos predictivos como Regresión Logistica, Arboles de Decisión, KNN, SVM, Naive Bayes y Ensamblados. Inicialmente el problema consta de un analisis crediticio de clientes buenos/malos. Se utiliza una BBDD de clases desbalanceadas la cual se limpia y procesa para alimentar los modelos

Language: Jupyter Notebook - Size: 3.37 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

garick161/alpha_bank_cup_challenge_final

Разработка алгоритма привлечения новых клиентов банка

Language: Python - Size: 8.31 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

epicure24/Classifier-for-highly-unbalanced-data

This repo represents all the resampling techniques needed to achieve better results in highly unbalanced or skewed data that has 77 % of data in one class and rest in others.

Language: Jupyter Notebook - Size: 152 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Go-MinSeong/Predicting-whether-your-mail-will-be-read

predicting whether you read mail

Language: Jupyter Notebook - Size: 16.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

newsteps8/Term-Deposit-Prediction

Unbalanced Customer Data

Language: Jupyter Notebook - Size: 24.4 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

evanotero/deep-music-genre-classification

🎵 Using Deep Learning to Categorize Music as Time Progresses Through Spectrogram Analysis

Language: Python - Size: 25 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 22 - Forks: 7

pminguez/MachineLearning4UnbalancedData

Language: R - Size: 41 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

Albertsr/Class-Imbalance

Cost-Sensitive Learning / ReSampling / Weighting / Thresholding / BorderlineSMOTE / AdaCost / etc.

Language: Python - Size: 6 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 96 - Forks: 25

LN5user/starbucks

How A/B Testing and Machine Learning can efficiently improve your marketing strategies and saving costs

Language: Jupyter Notebook - Size: 1.51 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

jaymax01/predicting-customer-purchases

classification of online purchasing rates

Language: Jupyter Notebook - Size: 2.24 MB - Last synced at: 15 days ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

celestialtaha/Unbalanced-dataset-Classification

Classification on Unbalanced Datasets using Boost Techniques (AdaBoost M2, SMOTE Boost, RusBoost,..)

Language: Jupyter Notebook - Size: 720 KB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 0

mjuez/ensemble-classification-imbalance-bigdata

⚖⚡ Experimental evaluation of ensemble classifiers for imbalance in Big Data.

Language: Scala - Size: 137 KB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

taisakamisarava/Employee-Salary-prediction

The model aimed at prediction of employee's salary

Language: Jupyter Notebook - Size: 685 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

mustafahakkoz/Advertisement-CTR-Prediction

A submission for HUAWEI - 2020 DIGIX GLOBAL AI CHALLENGE

Language: Jupyter Notebook - Size: 94.7 KB - Last synced at: 3 months ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 4

jCodingStuff/NLPReddit

Multinomial classification tasks in Reddit

Language: HTML - Size: 32 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 2 - Forks: 1

BurakMarangoz/PreProcessing

Preprocessing Analysis

Language: Jupyter Notebook - Size: 3.39 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

ComputingVictor/Datathon_Allianz

Model created by the Team 6 for the I Data Talent Program of Allianz

Language: Jupyter Notebook - Size: 19 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Abletobetable/rails-champ

Цифровой прорыв - чемпионат Новосибирской области - классификация объектов железной дороги

Language: Jupyter Notebook - Size: 870 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

GayatriSharma23/Autism-Prediction

It's a classification model that predict whether an individual will suffer from autism in future or not

Language: Jupyter Notebook - Size: 269 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

ValeriaPineda23/Loan-Eligibility-Predictions

Applying CRISP-DM methodology for predicting Loan Elegibility

Language: Jupyter Notebook - Size: 1.28 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

kpratikin/Credit-Card-Fraud

Identify fraudulent credit card transactions so that customers are not charged for items that they did not purchase. (Python, Logistic Regression Classifier, Unbalanced dataset).

Language: Jupyter Notebook - Size: 359 KB - Last synced at: 9 months ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 4

rematchka/Intended-Sarcasm-Detection-In-English-and-Arabic-for-extremly-unbalanced-datasets

This repo contains work carried out for SemEval 2022 Task 6: iSarcasmEval: Intended Sarcasm Detection In English and Arabic

Language: Jupyter Notebook - Size: 1.5 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

IsabelleContant/P7_dashboard_streamlit

Projet réalisé dans le cadre du parcours diplômant de Data Scientist d'OpenClassrooms (projet n°7)

Language: Jupyter Notebook - Size: 24.3 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

eric-blankinshp/Credit_Risk_Analysis_Supervised_ML

About Six different techniques are employed to train and evaluate models with unbalanced classes. Algorithms are used to predict credit risk. Performance of these different models is compared and recommendations are suggested based on results. Topics

Language: Jupyter Notebook - Size: 18.5 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

burning-river/Assembly_Bias

Language: Jupyter Notebook - Size: 246 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

JCupe17/fake-users

Language: Jupyter Notebook - Size: 1.02 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

agomolka/AutomateDealingWithImbalance

Process of dealing with imbalanced data set and classification

Language: Jupyter Notebook - Size: 354 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

dadavalangege/Classification_Models

Classfying an unbalanced data set of delayed flights with the help of SMOTE and comparing the performance of three different classification models (Decision Tree, Logistic Regression, XGBoost).

Language: Jupyter Notebook - Size: 271 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

gprzy/stroke-prediction

🩺 Machine Learning applied to stroke prediction for unbalanced data

Language: Jupyter Notebook - Size: 2.12 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

vipul-shinde/customer-churn-prediction

Customer Churn Prediction for a Telecom company using ML.

Language: Jupyter Notebook - Size: 3.01 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

hayatMohaAl/project_BAN

unbalanced class classification with deep learning

Size: 9.45 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

GabrielSandoval/plentina

Fraud detection based on 6 million transactions

Language: Jupyter Notebook - Size: 1.87 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

tarun360/Adversarial-Attack-on-3D-U-Net-model-Brain-Tumour-Segmentation.

Adversarial Attack on 3D U-Net model: Brain Tumour Segmentation.

Language: Jupyter Notebook - Size: 51.9 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 6 - Forks: 3

AkashSDas/cassava-leaf-disease-classification

Deep learning solution for Cassava Leaf Disease Classification, a Kaggle's Research Code Competition using Tensorflow.

Language: Jupyter Notebook - Size: 15.4 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 1

burning-river/KaggleCreditCardFraud

Language: Jupyter Notebook - Size: 64.7 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

soroushjavdan/RandomBalanceBoost

Implementation of Random Balance Algorithm

Language: Jupyter Notebook - Size: 37.1 KB - Last synced at: 7 months ago - Pushed at: about 7 years ago - Stars: 5 - Forks: 2

mmaguero/textcat-josa

Train JOSA (Jopara Sentiment Analysis) corpus with traditional machine learning algorithms.

Language: Python - Size: 5.86 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

AkashSDas/predict-hr-stay-or-leave

Sampling unbalanced dataset using SMOTE and creating a classifier to classify if a HR will stay or leave.

Language: Jupyter Notebook - Size: 2.59 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

MaryemSamet/Parkinson-diagnostic

Parkinson diagnostic with supervised and unsupervised machine learning

Language: Jupyter Notebook - Size: 1.36 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

RickFSA/Lending_Club_Default_Prediction

Classify default borrowers from initial loan application for Lending Club

Language: Jupyter Notebook - Size: 57.5 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 1

roronoasins/in-ugr

Repositorio de la asignatura Inteligencia de Negocio cursada en la UGR. curso 2020-21

Language: Jupyter Notebook - Size: 14.9 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

mahaveer-rulaniya/Classification-models

Train different classification models on the unbalanced dataset and applying different evaluation methods to it.

Language: Jupyter Notebook - Size: 90.8 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

AdamRajfer/lending-club

Binary classification with unbalanced tabular data

Language: Jupyter Notebook - Size: 134 KB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

panambY/Human_Activity_Recognition

Predict the activity category of a human.

Language: Jupyter Notebook - Size: 90.2 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

OddExtension5/Credit_Card_Fraud_Detection

Kaggle Project : Anonymized credit card transactions labeled as fraudulent or genuine

Language: Jupyter Notebook - Size: 6.94 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

SimarjotKaur/Customer-Churn-Prediction

To predict whether the customers will subscribe to the system after 1-month free trial or not.

Language: Jupyter Notebook - Size: 218 KB - Last synced at: 20 days ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

vedpbharti/Machine_Learning

Language: Jupyter Notebook - Size: 4.03 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

yuneming/UnbalancedDataLearning

research on unbalanced data problems

Size: 1.11 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

yulguseva/classification_anonimized_data

Prediction of a productional appliance readings based on anonimized data.

Language: Jupyter Notebook - Size: 75.2 KB - Last synced at: 10 months ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

KwokHing/Visualizing-Datasets-with-Facets

Demo on using Facets: An Open Source Visualization Tool for Machine Learning Training Data developed by Google's PAIR Initiative

Language: Jupyter Notebook - Size: 2.51 MB - Last synced at: about 1 month ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 1

Related Keywords
unbalanced-data 60 machine-learning 28 classification 11 python 11 random-forest 11 data-science 7 logistic-regression 7 jupyter-notebook 6 smote 6 oversampling 5 svm-classifier 5 eda 4 deep-learning 4 feature-engineering 4 undersampling 4 visualization 4 classification-algorithm 4 kaggle 4 feature-extraction 3 python3 3 knn-classification 3 scikit-learn 3 cross-validation 3 imbalanced-data 3 imbalanced-classification 3 multiclass-classification 3 data-visualization 3 smote-sampling 3 timeseries 2 binary-classification 2 classification-models 2 resampling-methods 2 preprocessing 2 random-forest-classifier 2 big-data 2 lightgbm 2 kaggle-dataset 2 clustering 2 adaboost-algorithm 2 decision-trees 2 xgboost 2 credit-card-fraud 2 hyperparameter-tuning 2 data-augmentation 2 imbalanced-learning 2 nlp 2 data-analysis 2 decision-tree 2 ensemble-machine-learning 2 feature-selection 2 matplotlib 2 seaborn 2 supervised-learning 2 support-vector-machines 2 knn 2 spark 2 sentiment-analysis 2 api-rest 1 adversarial-attacks 1 docker 1 fastapi 1 brain-tumor 1 brain-tumor-segmentaiton 1 brain-tumour 1 carlini-wagner-attack 1 computer-vision 1 convolutional-neural-networks 1 dice-coefficient 1 enhancing 1 fast-gradient-sign-attack 1 roc-curve 1 stratified-cross-validation 1 fraud-detection 1 normalization 1 bert-models 1 cnn-model 1 emoji 1 irony-detection 1 loss 1 loss-functions 1 pytorch 1 sampling 1 sarcasm-detection 1 sarcastic-tweets 1 tweets 1 shap 1 streamlit 1 balanced-random-forest 1 easy-ensemble-classifier 1 smoteen 1 linear-regression 1 data-pipeline 1 hackathon 1 happy-customers 1 loan 1 comparative-analysis 1 stroke-prediction 1 jupyter 1 telecom 1 embedding 1