Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: smote-sampling

abis330/footbalysis

Player Rating System in Soccer using Machine Learning

Size: 0 Bytes - Last synced: 6 days ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0

fosetorico/pH_level_forecasting

pH Level Forecasting of Well Water Samples in Malawi, Conducted by Leeds Beckett University

Language: Jupyter Notebook - Size: 1.37 MB - Last synced: 5 days ago - Pushed: 6 days ago - Stars: 0 - Forks: 0

razamehar/Predicting-Bank-Customer-Churn

This project aims to predict bank customer churn using a dataset derived from the Bank Customer Churn Prediction dataset available on Kaggle. The dataset for this competition has been generated from a deep learning model trained on the original dataset, with feature distributions being similar but not identical to the original data.

Language: Jupyter Notebook - Size: 9.24 MB - Last synced: 7 days ago - Pushed: 7 days ago - Stars: 0 - Forks: 0

UdbhavSrivastava/Stroke-Predictive-Modeling

This study uses predictive analytics to detect stroke risk factors early, aiming to reduce occurrences. By analyzing risk factors with machine learning, it uncovers patterns and correlations. Models such as Logistic Regression, KNN, Decision Trees, Random Forest, and Neural Network.

Language: Jupyter Notebook - Size: 8.14 MB - Last synced: 14 days ago - Pushed: 14 days ago - Stars: 0 - Forks: 0

fitria-dwi/Credit-Risk-Prediction

This project aims to identify patterns that indicate if a person is unlikely to repay the loan or labeled as bad risk and build a predictive model to predict loan risk from applicants.

Language: Jupyter Notebook - Size: 3.56 MB - Last synced: 22 days ago - Pushed: 23 days ago - Stars: 2 - Forks: 0

tomartushar/Credit-Card-Fraud-Detection

An ensemble of machine learning models for detecting fraudulent credit card transactions, utilizing advanced techniques for feature selection, data imbalance handling, and hyperparameter tuning.

Language: Jupyter Notebook - Size: 248 KB - Last synced: 25 days ago - Pushed: 27 days ago - Stars: 0 - Forks: 0

princessEmilyy/ML-project-diabetes-

Project for applied classical ML course at the Weizmann institute

Language: Python - Size: 183 MB - Last synced: 29 days ago - Pushed: 29 days ago - Stars: 0 - Forks: 0

dardenkyle/Credit-Card-Users-Churn-Prediction

Analyze the data and come up with a predictive model to determine if a customer will leave the credit card services or not and the reason behind it

Language: Jupyter Notebook - Size: 1.79 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

ShyrenMore/sem08

A compilation of codes for SMA, DC, ADS

Language: Python - Size: 1.63 MB - Last synced: about 1 month ago - Pushed: about 1 year ago - Stars: 7 - Forks: 3

vrbabu9000/bank_termDeposit_predictor

Given a highly biased banking dataset of 45k entries and 17 variables wherein I was able to filter out good predictors for modelling and further balance the bias in the dataset using SMOTETomek method. The bias removal was approached in two different ways and the pros and cons of both methods are stated. The end result is a logistic regression model with 81% accuracy and striking a sharp balance between precision and recall with the former having a slight upper hand.

Language: Jupyter Notebook - Size: 2.15 MB - Last synced: 2 months ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0

Srijha09/Detection-Of-Fraudulent-Claims-In-Medical-Insurance

Checking whether the claims are fraud or non-fraud based on various attributes using Medicare Dataset

Language: Jupyter Notebook - Size: 916 KB - Last synced: about 2 months ago - Pushed: over 4 years ago - Stars: 1 - Forks: 1

haniye6776/outlier-detection

Language: Jupyter Notebook - Size: 4.29 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

Akshadachavan02/classification-_algo

As the there are couple of classification algorithms in supervised machine learning so some of them as here..

Language: Jupyter Notebook - Size: 276 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

dsvirenpai/KNN_Forensic_Science_Classifying_GlassType

KNN in Forensic Science: Classifying Glass Evidence for Criminal Investigations

Language: Jupyter Notebook - Size: 533 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

ajitsingh98/E-commerce-product-buyer-session-prediction

Predict whether customer purchase a product or not in a session

Language: Jupyter Notebook - Size: 48.2 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 1 - Forks: 0

Theofilusarifin/Diabetes-Disease-Prediction

Diabetes, characterized by high blood sugar levels due to ineffective insulin production or usage, poses serious health risks if not managed. Deep Learning offers promising avenues for diabetes management.

Language: Jupyter Notebook - Size: 1.32 MB - Last synced: about 2 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

sumitkutty/Respondent-Group-Classification

6-class classification problem on a large dataset using machine learning.

Language: Jupyter Notebook - Size: 2.01 MB - Last synced: 4 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

ggjay9/Application-Flow-Identification

Classify applications using flow features with Random Forest and K-Nearest Neighbor classifiers. Explore augmentation techniques like oversampling, SMOTE, BorderlineSMOTE, and ADASYN for better handling of underrepresented classes. Measure classifier effectiveness for different sampling techniques using accuracy, precision, recall, and F1-score.

Language: Jupyter Notebook - Size: 90.6 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

RampageousRJ/Spam-Email-Detection

Detects if an outgoing mail would be classified as spam or not

Language: Jupyter Notebook - Size: 11.1 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

nickjlupu/Credit-Risk

Supervised scikit-learn machine learning models using several sampling techniques.

Language: Jupyter Notebook - Size: 18.4 MB - Last synced: 5 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

garick161/alpha_bank_cup_challenge_final

Разработка алгоритма привлечения новых клиентов банка

Language: Python - Size: 8.31 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

ceenaa/fraud_detection

Credit Card fraud detection

Language: Jupyter Notebook - Size: 223 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 1 - Forks: 0

epicure24/Classifier-for-highly-unbalanced-data

This repo represents all the resampling techniques needed to achieve better results in highly unbalanced or skewed data that has 77 % of data in one class and rest in others.

Language: Jupyter Notebook - Size: 152 KB - Last synced: 6 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

MsTao-68/Debt-Churn-Data-Analysis

使用比赛方提供的脱敏数据,进行客户信贷流失预测。

Language: Jupyter Notebook - Size: 15 MB - Last synced: 6 months ago - Pushed: about 2 years ago - Stars: 5 - Forks: 0

aryamaansaha/employeeattrition

This repository contains code that was used to predict employee attrition using machine learning methods.

Language: HTML - Size: 1.54 MB - Last synced: 6 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

letstryy/LeadScoring

Predictive Lead Scoring using clickstream data

Language: Jupyter Notebook - Size: 2.17 MB - Last synced: 6 months ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 1

ishreya09/Parkinson-Prediction-Model

We train different models and apply techniques gives us better evaluation metrics, and find out the best model which works the best for Parkinson's Prediction System

Language: Jupyter Notebook - Size: 49.7 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

desininja/Employee-Attrition-analysis

To know the main reasons for attrition of employees.

Language: Jupyter Notebook - Size: 858 KB - Last synced: 7 months ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0

shivamverma26/Credit_Card_Fraud_Detection

This project is dedicated to advanced machine learning techniques for credit card fraud detection, providing a solution to protect financial institutions and their clients by predicting and preventing fraudulent transactions in a highly imbalanced dataset.

Language: Jupyter Notebook - Size: 1.36 MB - Last synced: 5 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

Devanshu0502/Road-Severity-Classification

Language: Jupyter Notebook - Size: 1.84 MB - Last synced: about 2 months ago - Pushed: about 1 year ago - Stars: 1 - Forks: 0

aniass/Spam-detection

Spam detection in SMS messages with BERT model and Machine Learning algorithms

Language: Jupyter Notebook - Size: 608 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 13 - Forks: 6

skinan/Improved-Sampling-and-Feature-Selection-to-Support-Extreme-Gradient-Boosting-For-PCOS-Diagnosis

This project is a part of the research on PolyCystic Ovary Syndrome Diagnosis using patient history datasets through statistical feature selection and multiple machine learning strategies. The aim of this project was to identify the best possible features that strongly classifies PCOS in patients of different age and conditions.

Language: Jupyter Notebook - Size: 516 KB - Last synced: 8 months ago - Pushed: about 2 years ago - Stars: 3 - Forks: 6

Angienoelhaverly/Credit_Risk_Analysis

Perform a Credit Risk Supervised Machin Learning Analysis using scikit-learn and imbalanced-learn libraries.

Language: Jupyter Notebook - Size: 18.4 MB - Last synced: 8 months ago - Pushed: about 3 years ago - Stars: 1 - Forks: 1

manjugovindarajan/RENEWIND-predictive-maintenance-cost-maintenance-usingML

The aim is to decrease maintenance cost of generators used in wind energy production machinery. This is achieved by building various classification models, accounting for class imbalance, tuning on a user defined cost metric (function of true positives, false positives and false negatives predicted) & productionizing model using pipelines

Language: Jupyter Notebook - Size: 6.62 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0

SarangGami/Bank-Marketing-Effectiveness-Prediction-supervised-learning

The main objective is to build a predictive model that predicts whether a new client will subscribe to a term deposit or not, based on data from previous marketing campaigns.

Language: Jupyter Notebook - Size: 7.1 MB - Last synced: 8 months ago - Pushed: about 1 year ago - Stars: 1 - Forks: 0

abhijha3011/Techniques-To-Handle-Imbalanced-Data

Different Techniques to Handle Imbalanced Data Set

Language: Jupyter Notebook - Size: 31.3 KB - Last synced: 8 months ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0

yrnigam/UCI_Credit_Card_Default

This dataset contains information on default payments, demographic factors, credit data, history of payment, and bill statements of credit card clients in Taiwan from April 2005 to September 2005.

Language: Jupyter Notebook - Size: 139 KB - Last synced: 9 months ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0

Safaa-p/Fraudulent-Insurance-Claims-Detection

Different models to detect if a claim is fraudulent or not

Language: Jupyter Notebook - Size: 3.35 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0

HRX101/Employee-satisfaction

Explainable AI for Predictive Analytics on Employee Satisfaction

Language: Jupyter Notebook - Size: 2.42 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

j1bulbul/Term_Deposit_Subscription_Predictor_ML

Supervised Learning, Binary Classification ML problem utilised to determine if an individual would subscribe to a term deposit based on various marketing characteristics

Language: Jupyter Notebook - Size: 2.18 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0

joanitolopo/eval-sampling-methods

🔍 Evaluating Sampling Techniques for Healthcare Insurance Fraud Detection in Imbalanced Dataset.

Language: Python - Size: 188 KB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0

sharmasapna/credit-card-fraud-detection

Code to detect credit card fraud detecton

Language: Jupyter Notebook - Size: 3.05 MB - Last synced: 9 months ago - Pushed: almost 3 years ago - Stars: 1 - Forks: 1

theheisenberg10/Facebook-Post-Sentiment-Analysis-NLP-using-RNNs

Predicting sentiment of Facebook posts (Appreciation, Complaint, or Feedback) using RNNs

Language: Jupyter Notebook - Size: 12.7 KB - Last synced: 9 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

AI-14/lumpy-skin-disease-classification

Binary classification of lumpy skin disease (imbalanced dataset) using ML algorithms in addition to oversampling/undersampling techniques.

Language: Jupyter Notebook - Size: 4.76 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 2 - Forks: 0

Niranjan-stat/Loan-Default-Prediction

This project aims to provide a robust and accurate solution to the loan defaulting problem, helping financial institutions make more informed lending decisions and reduce their risk exposure.

Language: Jupyter Notebook - Size: 8.03 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0

nazgol-nikravesh/NSGRT

Source code for "Cross-project Defect Prediction with An Enhanced Transfer Boosting Algorithm"

Language: Python - Size: 2.01 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0

lucashomuniz/Project-6

Comparing Machine Learning Algorithms for Credit Risk Analysis in Banking

Language: R - Size: 78.1 KB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 1 - Forks: 0

krishna-aditi/credit-card-fraud-detection-imbalanced-dataset-problem

Language: Jupyter Notebook - Size: 47.9 KB - Last synced: 11 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

technisekai/sentiment-analysis-of-new-halal-logo

:round_pushpin: Final project of Telkom Institute of Technology Purwokerto

Language: Jupyter Notebook - Size: 130 KB - Last synced: 11 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

M-Hashemzadeh/RCSMOTE

RCSMOTE: Range-Controlled Synthetic Minority Over-sampling Technique for handling the class imbalance problem

Size: 6.28 MB - Last synced: 12 months ago - Pushed: 12 months ago - Stars: 3 - Forks: 0

rikhuijzer/Resample.jl

An implementation of SMOTE

Language: Julia - Size: 3.73 MB - Last synced: about 23 hours ago - Pushed: 8 months ago - Stars: 8 - Forks: 0

FataiAzeez/stroke_prediction_rf_svm_lr

This repo evaluates Logistic Regression, Random Forest, and Support Vector Machine models for predicting stroke risk. Implemented in Python, the project includes data pre-processing, model training, and performance metric calculations

Language: Jupyter Notebook - Size: 967 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

Palemravichandra/customer-segmentation

customer segmentation of insurance comapany

Language: Jupyter Notebook - Size: 4.18 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

Candida18/ADS_SMA

Language: Jupyter Notebook - Size: 9.25 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

yourssincerely/eurovision

This project provides a comprehensive analysis of the Eurovision Song Contest, with insights derived from both traditional statistical methods and machine learning techniques.

Language: Jupyter Notebook - Size: 28.2 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 2 - Forks: 0

avr2002/credit-card-default-prediction

This a classic Credit Card Default Prediction project where based on customer profile we want to predict whether the borrower is likely to default in the next 2 years or not having a delinquency of more than 3 months.

Language: Jupyter Notebook - Size: 10.4 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0

urmipandya123/Road_Severity_Classification

Language: Jupyter Notebook - Size: 1.85 MB - Last synced: about 2 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

bharatkulmani/Dry-Bean

Project is about predicting Class Of Beans using Supervised Learning Models

Language: Jupyter Notebook - Size: 35.3 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 2 - Forks: 0

sahidul-shaikh/credit-card-fraud-detection

Machine learning model for Credit Card fraud detection

Language: Jupyter Notebook - Size: 1.47 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 37 - Forks: 32

AvinandanBose/Credit-Card-Fraud-Detection-Machine-Learning-

Credit Card Fraud Detection using Python and Machine Learning.

Language: Jupyter Notebook - Size: 77.5 MB - Last synced: over 1 year ago - Pushed: over 2 years ago - Stars: 3 - Forks: 0

rochitasundar/Predictive-maintenance-cost-minimization-using-ML-ReneWind

The aim to decrease the maintenance cost of generators used in wind energy production machinery. This is achieved by building various classification models, accounting for class imbalance, and tuning on a user defined cost metric (function of true positives, false positives and false negatives predicted) & productionising the model using pipelines.

Language: Jupyter Notebook - Size: 15.2 MB - Last synced: over 1 year ago - Pushed: over 2 years ago - Stars: 7 - Forks: 2

rupeshsure/Obstructive-Sleep-Apnea-Project

Obstructive Sleep Apnea classification with help of numerical data set which having the physical body characteristics with the help of machine learing

Language: Jupyter Notebook - Size: 48 MB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 2 - Forks: 1

ankit-kothari/Credit-Risk-Analysis

Predicting the ability of a borrower to pay back the loan through Traditional Machine Learning Models and comparing to Ensembling Methods

Language: Jupyter Notebook - Size: 768 KB - Last synced: over 1 year ago - Pushed: over 3 years ago - Stars: 4 - Forks: 4

MoinDalvs/Assignment_SVM_Forest_Fire_Prediction

Language: Jupyter Notebook - Size: 5.58 MB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 3 - Forks: 0

LokeshSreenathJ/Bankruptcy-Prediction---Analytics

Built XGBoost Classifier using SMOTE technique and Hyper-Parameter Tuning

Language: Jupyter Notebook - Size: 10.1 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

AayushSameerShah/SMOTE

This small repository contains the SMOTE implementation from scratch.

Language: Jupyter Notebook - Size: 1.12 MB - Last synced: over 1 year ago - Pushed: over 2 years ago - Stars: 2 - Forks: 0

TzeLun/SMOTE

A minority oversampling method for imbalance data set

Language: Python - Size: 19.5 KB - Last synced: over 1 year ago - Pushed: almost 4 years ago - Stars: 1 - Forks: 1

rishawsingh/Credit-Card-Fraud-Detection

System to tell apart the transaction was from the real user who owns the credit card or the transaction was from the stolen credit card.

Language: Jupyter Notebook - Size: 124 KB - Last synced: over 1 year ago - Pushed: almost 3 years ago - Stars: 9 - Forks: 0

BibhuPrasadPanda97/Credit-Card-Default-Risk---AmExpert-CodeLab

Competition conducted by American Express on HackerEarth Platform to Predict Credit Card Defaulters by building Machine Learning Models for the given data.

Language: Jupyter Notebook - Size: 2.87 MB - Last synced: over 1 year ago - Pushed: over 2 years ago - Stars: 1 - Forks: 1

chinmaysharmacs10/University_Recommender

A model that recommends University based on details of an applicant.

Language: Jupyter Notebook - Size: 6.73 MB - Last synced: over 1 year ago - Pushed: over 3 years ago - Stars: 4 - Forks: 0

gulabpatel/Handle_Imbalance

Language: Jupyter Notebook - Size: 190 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 7 - Forks: 0

Wamuza1/Credit_Risk_Analysis

Supervised Machin Learning Analysis using scikit-learn and imbalanced-learn libraries.

Language: Jupyter Notebook - Size: 18.4 MB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

yandi-farinango/CreditRiskModel

Training XGBoost ML model to detect credit default risk. Used SMOTE technique for handling unbalanced data. Evaluation of model trained on unbalanced dataset vs SMOTE generated dataset

Language: Jupyter Notebook - Size: 317 KB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

GianRomani/ML_course_homework

Code and reports of the two homework for the Machine Learning course (Winter 2020)

Language: Jupyter Notebook - Size: 40.1 MB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0

Olatohun/campaign-response

Language: Jupyter Notebook - Size: 144 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

shumph10/Credit_Risk_Analysis

Established a supervised machine learning model trained and tested on credit risk data through a variety of methods to establish credit risk based on a number of factor

Language: Jupyter Notebook - Size: 39.6 MB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

tamannanazmin/Datathon

Language: Jupyter Notebook - Size: 4.01 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

sidharth-ds/Credit-Card-Default-prediction

EDA ---> Balancing the Dataset (SMOTE) ---> Feature engineering ---> Modelling with Hyperparameter Tuning

Language: Jupyter Notebook - Size: 1.75 MB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

gaetanoantonicchio/DataMining-2

Repository for "Data Mining - Advanced Topics and Applications" projects exam.

Language: Jupyter Notebook - Size: 233 MB - Last synced: over 1 year ago - Pushed: over 2 years ago - Stars: 3 - Forks: 0

Toshani/Credit-Card-Fraud-Detector

Mini project repository where we have implemented Credit card fraud detection using encoding, SMOTE-ing and KNN.

Language: Jupyter Notebook - Size: 7.42 MB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

kalyaniasthana/CS273A_project_diabetes

Course Project for CS273A: Machine Learning at UCI

Language: Jupyter Notebook - Size: 10 MB - Last synced: 6 months ago - Pushed: over 3 years ago - Stars: 3 - Forks: 0

bkraffa/desafioclassificacao

Desafio de Classificação do curso de Data Science e Machine Learning da Tera. Em um dataset de mais de 6 milhões de operações bancárias tinhamos um objetivo de realizar a previsão de fraudes. Fazendo uso de um processo de feature engineering que acrescentou 20 features ao modelo, combinado com um resampling feito através do método SMOTE. Para o treinamento criamos três modelos: Regressão Logística, Random Forest e XGBoost. Esses dois últimos performaram com precisão e recall superiores a 99%.

Language: Jupyter Notebook - Size: 168 KB - Last synced: over 1 year ago - Pushed: over 2 years ago - Stars: 1 - Forks: 0

akibzaman/Soil-nutrient-web

Prediction of basic soil nutrients (phosphorus, potassium, boron, calcium, magnesium and manganese) using reflectance from Hyperspectral Satellite Images (HSI).

Language: JavaScript - Size: 35.5 MB - Last synced: 11 months ago - Pushed: over 2 years ago - Stars: 1 - Forks: 2

alexberndt/machine-learning-sandbox

Collection of machine learning algorithms ...

Language: Jupyter Notebook - Size: 352 KB - Last synced: over 1 year ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0

rohitdwivedula/enzyme-classification

Predict the enzyme class of a given FASTA sequence using deep learning methods including CNNs, LSTM, BiLSTM, GRU, and attention models along with a host of other ML methods.

Language: Python - Size: 76.8 MB - Last synced: over 1 year ago - Pushed: almost 3 years ago - Stars: 7 - Forks: 1

shaunwang1350/CreditLoans_MachineLearning

Credit risk is an inherently unbalanced classification problem, as the number of good loans easily outnumber the number of risky loans. I employed Machine Learning techniques to train and evaluate models with unbalanced classes. I used imbalanced-learn and scikit-learn libraries to build and evaluate models using resampling. I also evaluated the performance of these models and made a recommendation on whether they should be used to predict credit risk.

Language: Jupyter Notebook - Size: 18.4 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0

izzypatrick21/cuisines

classification of asian and indian cuisines. A good example for resampling imbalance dataset for a classification project using interpolation. I have also included deploying machine learning model using Onnyx.

Language: Jupyter Notebook - Size: 149 KB - Last synced: over 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

lucacarniato/predicting-customer-churn-kaggle-competition

A solution to the Kaggle competition "Predicting Churning customers" (https://www.kaggle.com/sakshigoyal7/credit-card-customers)

Language: Jupyter Notebook - Size: 694 KB - Last synced: over 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

86lekwenshiung/Classification-Modelling-Projects

Classification Projects for balanced and imbalanced datasets

Language: Jupyter Notebook - Size: 11.1 MB - Last synced: 12 months ago - Pushed: over 2 years ago - Stars: 1 - Forks: 0

jharvey09/Risky_Business_Peer_To_Peer_Lending

In this project, I will use credit risk models to assess the credit risk using peer-to-peer lending. Algorithms such as SMOTE, Naive Random Sampling, etc.

Language: Jupyter Notebook - Size: 982 KB - Last synced: over 1 year ago - Pushed: almost 3 years ago - Stars: 0 - Forks: 0

ritwikkanodia/Privacy-Preserving-Machine-Learning

Maintaining the privacy of local server data in a federated learning framework using differential privacy by TensorFlow Privacy Library.

Language: Jupyter Notebook - Size: 2.77 MB - Last synced: over 1 year ago - Pushed: over 3 years ago - Stars: 2 - Forks: 0

WilliamSMendes/multiclass_students_classifier

A model for multiclass calssification, label ech student profile in Saint Paul School for predict the future profiles.

Language: Jupyter Notebook - Size: 505 KB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0

jesussantana/Sampling

Perform Data Sampling with Python

Language: Jupyter Notebook - Size: 5.04 MB - Last synced: over 1 year ago - Pushed: about 3 years ago - Stars: 1 - Forks: 0

NataliaVelasquez18/credit-risk

The purpose of this study is to recommend whether PureLending should use machine learning to predict credit risk. Several machine learning models are built employing different techniques, then they are compared and analyzed to provide the recommendation.

Language: Jupyter Notebook - Size: 18.5 MB - Last synced: over 1 year ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0

Rizwan-Hasan/Improved-Sampling-and-Feature-Selection-to-Support-Extreme-Gradient-Boosting-For-PCOS-Diagnosis Fork of skinan/Improved-Sampling-and-Feature-Selection-to-Support-Extreme-Gradient-Boosting-For-PCOS-Diagnosis

This project is a part of the research on PolyCystic Ovary Syndrome Diagnosis using patient history datasets through statistical feature selection and multiple machine learning strategies. The aim of this project was to identify the best possible features that strongly classifies PCOS in patients of different age and conditions.

Size: 386 KB - Last synced: over 1 year ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0

Serfati/covid_bloodtest

❤️ 🩸 Blood test classifier for infected COVID-19 patients using xgb, catboost, rf and lr

Language: Jupyter Notebook - Size: 6.03 MB - Last synced: over 1 year ago - Pushed: about 3 years ago - Stars: 1 - Forks: 0

AkashSDas/predict-hr-stay-or-leave

Sampling unbalanced dataset using SMOTE and creating a classifier to classify if a HR will stay or leave.

Language: Jupyter Notebook - Size: 2.59 MB - Last synced: over 1 year ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0

jasminedogu/DS4002-Sentiment-Analysis

A sentiment analysis using SPAM/HAM Text Classification data using Support Vector Machines. Utilizes different variations of the Synthetic Minority Oversampling Technique (SMOTE-SVM, SMOTE-KNN).

Language: HTML - Size: 5.57 MB - Last synced: over 1 year ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

rtharungowda/cascade-cup-2020

Cascade Cup Data Science Hackathon, Solve a real-world Data Science Challenge by Trell

Language: Jupyter Notebook - Size: 30.3 KB - Last synced: over 1 year ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

rachellimce/Project-4-West-Niles-Virus

DSI 16 Project 4, Predicting West Niles Virus

Language: Jupyter Notebook - Size: 16.4 MB - Last synced: 6 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

Related Keywords
smote-sampling 103 machine-learning 35 logistic-regression 26 random-forest 25 xgboost 20 python 18 smote 14 data-science 13 oversampling 12 classification 11 undersampling 10 knn-classification 10 scikit-learn 9 hyperparameter-tuning 9 random-forest-classifier 8 outlier-detection 7 svm-classifier 7 feature-engineering 7 exploratory-data-analysis 6 eda 6 decision-trees 6 sklearn 6 pandas 5 machine-learning-algorithms 5 xgboost-classifier 5 pca 5 deep-learning 5 neural-networks 5 neural-network 5 data-visualization 5 classification-model 4 smoteenn 4 python3 4 svm 4 random-under-sampling 4 numpy 4 supervised-machine-learning 4 optuna 4 class-imbalance 4 oversampling-technique 4 credit-risk 4 sampling-methods 4 knn 3 support-vector-machines 3 matplotlib-pyplot 3 random-over-sampling 3 jupyter-notebook 3 cross-validation 3 lightgbm 3 xgboost-model 3 deep-neural-networks 3 data-cleaning 3 adasyn-sampling 3 smoteen 3 credit-card 3 seaborn 3 imbalanced-classification 3 unbalanced-data 3 decision-tree-classifier 3 imbalanced-data 3 fraud-detection 3 multiclass-classification 3 tensorflow 3 data-analytics 2 naive-random-oversampler 2 regularization 2 linear-regression 2 road-safety 2 predictive-modeling 2 tsne 2 classifier 2 keras 2 python-3 2 stratified-sampling 2 stratified-cross-validation 2 naive-bayes-classifier 2 adaboost 2 hypothesis-testing 2 fraudulent-transactions 2 cluster-centroids-undersampling 2 imblearn 2 undersampling-technique 2 ann 2 tree-model 2 autoencoders 2 train-validation-test 2 ml 2 shap 2 lime 2 diabetes 2 binary-classification 2 catboost 2 univariate-analysis 2 recursive-feature-elimination 2 credit-card-fraud 2 randomoversampler 2 bivariate-analysis 2 prediction-model 2 r 2 feature-extraction 2