Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: imblearn

viniciusds2020/ml_balaceamento_allknn

This repository contains machine learning code that uses the AllKNN algorithm from the imblearn package to balance data.

Language: Jupyter Notebook - Size: 5.86 KB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 0 - Forks: 0

shinho123/23.11.10-1st-Korean-Society-of-Industrial-Engineers

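November 2023, Korean Institute of Industrial Engineers (UNIST): Predicting game user churn considering multi-role experience, with a focus on League of Legends (first author)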

Language: Jupyter Notebook - Size: 45 MB - Last synced: 7 days ago - Pushed: 7 days ago - Stars: 1 - Forks: 0

mathewsrc/AWS-Machine-Learning-Engineer-Capstone

This project aims to train three classification models (LogisticRegression, DecisionTree and RandomForest) using AWS SageMaker to classify customer churn from a database obtained from kaggle.com.

Language: Jupyter Notebook - Size: 10.5 MB - Last synced: about 1 month ago - Pushed: about 1 year ago - Stars: 0 - Forks: 1

Vergosss/Decision_Theory_2023_2024_Project

Language: Python - Size: 494 KB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 1 - Forks: 0

christopher-w-murphy/Notes-on-Decision-Trees-and-Random-Forests

These are my notes for the interview prep workshop I led on Random Forests

Language: Jupyter Notebook - Size: 2.21 MB - Last synced: about 1 month ago - Pushed: over 5 years ago - Stars: 0 - Forks: 3

jpcadena/cancer-classification

Breast cancer classification project.

Language: Python - Size: 625 KB - Last synced: about 2 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

pratik-choudhari/Intent-classification-using-python

Imbalanced Intent classification model with deployment

Language: JavaScript - Size: 6.53 MB - Last synced: about 2 months ago - Pushed: over 3 years ago - Stars: 4 - Forks: 1

LaurentVeyssier/Starbucks_case_study_Udacity_Data_Science

Case study from UDACITY Data Scientist Nanodegree

Language: Jupyter Notebook - Size: 1.75 MB - Last synced: 2 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

apethani21/aus-rain-prediction

Binary classification project on trying to predict whether or not it will rain the next day using weather features, for various locations across Australia.

Language: Jupyter Notebook - Size: 24.6 MB - Last synced: 4 months ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0

alaa-aleryani/Credit_Risk_Classification

We used various techniques to train and evaluate a model based on loan risk. We used a dataset of historical lending activity from a peer-to-peer lending services company to build a model that can identify the creditworthiness of borrowers.

Language: Jupyter Notebook - Size: 528 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

Shipra-09/Project-Vehicle-Insurance

This GitHub repository covers cross-selling a vehicle insurance product to health insurance customers. The goal is to build an ML model that predicts whether a customer would be interested in vehicle insurance, and to explore insights and inferences by performing EDA on the project data. Finding the high accuracy

Language: Jupyter Notebook - Size: 10.1 MB - Last synced: 4 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

saurabhwankhede022/Data-Science-With-Project-s

Data Science With Project's

Language: Jupyter Notebook - Size: 10.1 MB - Last synced: 5 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

saikrishnabudi/Random-Forests

Use Random Forest to build a model on fraud data, treating those with taxable income <= 30000 as "Risky" and all others as "Good". In addition, a cloth manufacturing company wants to know which segments or attributes lead to high sales.

Language: Jupyter Notebook - Size: 4.74 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

Dhrumil-Zion/Sentiments-Prediction-Using-NLP

Predicting customer sentiment from Amazon feedback. While exploring NLP and its fundamentals, I applied many data preprocessing techniques. In this repository, I implemented a bag of words using the CountVectorizer class from sklearn and trained a LogisticRegression model on the resulting vectors, which gives approximately 93% accuracy. I identified the top 20 positive and negative feedback words from thousands of feedback entries. I then automated the whole process in a single function so that it can be reused generically with many machine learning algorithms. I also tested DummyClassifier, which gives an accuracy of around 84%. After that, I applied TF-IDF, a well-known NLP technique, and combined it with LogisticRegression, which gives almost 93% accuracy but deeper insights. While working with the data, I also addressed the class imbalance using the RandomOverSampler class from the imblearn library.

Language: Jupyter Notebook - Size: 316 KB - Last synced: 5 months ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0

nickjlupu/Credit-Risk

Supervised scikit-learn machine learning models using several sampling techniques.

Language: Jupyter Notebook - Size: 18.4 MB - Last synced: 5 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

alessandrosocc/Machine-Learning-Project-2022

Final project of the Machine Learning course at the University of Cagliari in 2022. Analysis of a dataset, use of Machine Learning techniques with Oversampling and Undersampling techniques. Final report with the results obtained.

Language: Python - Size: 1.4 MB - Last synced: 4 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0

xinguanca/MLproject_creditcardfraud

My first machine learning project.

Language: Jupyter Notebook - Size: 33.2 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

SzymonWilczewski/bank-client-classification-ai

Project for "Computational intelligence" course

Language: Jupyter Notebook - Size: 690 KB - Last synced: 5 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

egorumaev/2023-cirrhosis-outcomes

Predicting treatment outcomes for patients with liver cirrhosis

Language: Jupyter Notebook - Size: 5.64 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

egorumaev/2023-ods-turnstiles

Identifying a visitor based on the characteristic time at which they enter the organization's premises

Language: Jupyter Notebook - Size: 1.98 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

Emeraldugbeyide93/Undersampling-and-Oversampling-techniques-for-imbalanced-datasets

Beginner-friendly project focusing on dataset imbalance using oversampling and undersampling techniques

Size: 1000 Bytes - Last synced: 6 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

kirankirank/imbalenced-data

imbalanced data

Language: Python - Size: 7.81 KB - Last synced: 6 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

pratiksha2712/Customer-Churn-Analysis-and-Prediction

Data analysis and ML Modelling

Language: Jupyter Notebook - Size: 1.14 MB - Last synced: 7 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0

sharmaroshan/Fraud-Detection-in-Insurace-Claims

This is a very important data science case study, because detecting fraud, analyzing fraudulent behaviour, and finding the reasons behind it are among the prime responsibilities of a data scientist. This branch falls under anomaly detection.

Language: Jupyter Notebook - Size: 2.23 MB - Last synced: 8 months ago - Pushed: almost 5 years ago - Stars: 7 - Forks: 3

GenTaylor/Traffic-Accident-Analysis

Traffic accident analysis using Python machine learning

Language: Jupyter Notebook - Size: 36.4 MB - Last synced: 8 months ago - Pushed: about 3 years ago - Stars: 15 - Forks: 9

alecngo/cervical-cancer-project

Deploy SVM, Random Forest, and the Streamlit package to build a web app for early detection of cervical cancer

Language: Jupyter Notebook - Size: 5.46 MB - Last synced: 8 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

pranshu1921/Personalised-Cancer-Diagnosis

Predict the effect of genetic mutations in cancer tumors and classify them based on text clinical literature.

Language: Jupyter Notebook - Size: 1.37 MB - Last synced: 8 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

ats-tandjoeng7/Credit_Risk_Analysis

Application of various supervised Machine Learning techniques to solve a real-world case study.

Language: Jupyter Notebook - Size: 18.5 MB - Last synced: 9 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

jianninapinto/Bandersnatch (fork of BloomTech-Labs/BandersnatchStarter)

This project implements a machine learning model using Random Forest, XGBoost, and Support Vector Machines algorithms with oversampling and undersampling techniques to handle imbalanced classes for classification tasks in the context of predicting the rarity of monsters.

Language: Jupyter Notebook - Size: 581 KB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0

earlyann/Stock_predictions

Using and comparing Support Vector Machine, Random Branch Forest, and Easy Ensemble algorithms to predict whether a stock will have a positive or negative annual return.

Language: Jupyter Notebook - Size: 1020 KB - Last synced: 10 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

TrilokiDA/Credit-Card-Fraud-Detection

Synthetic Financial Datasets For Fraud Detection

Language: Jupyter Notebook - Size: 79.1 KB - Last synced: 10 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 1

cp-PYFOREST/PYFOREST-ML

Utilizing machine learning to examine deforestation rates in the undeveloped region of Paraguay's Chaco

Language: Jupyter Notebook - Size: 16.2 MB - Last synced: 4 months ago - Pushed: 12 months ago - Stars: 1 - Forks: 2

suyinwb/Credit_Risk_Analysis

Using the credit card credit dataset from LendingClub, a peer-to-peer lending services company, you’ll oversample the data using the RandomOverSampler and SMOTE algorithms, and undersample the data using the ClusterCentroids algorithm. Then, you’ll use a combinatorial approach of over- and undersampling using the SMOTEENN algorithm.

Language: Jupyter Notebook - Size: 720 KB - Last synced: 11 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

nnilayy/Classification-Notebook

Data Science Classification General Notebook

Language: Jupyter Notebook - Size: 6.02 MB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 1 - Forks: 0

Priyanshusinhaa/CreditCardFraudDetection

Notebook represents the process of fraud detection using past data.

Language: Jupyter Notebook - Size: 372 KB - Last synced: 12 months ago - Pushed: 12 months ago - Stars: 0 - Forks: 0

MirrasHue/Intro-to-AI

An assignment from my Introduction to Artificial Intelligence course, in which we had to treat the datasets, train some models for classification and adjust their parameters

Language: Jupyter Notebook - Size: 0 Bytes - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

rewatevijaykumar/sensor-fault-detection

The Air Pressure System (APS) is crucial for heavy-duty vehicles, using compressed air to apply the brake pads and slow down the vehicle. The aim of this binary classification project is to minimize unnecessary repair costs by identifying component failure in the APS, using Python, FastAPI, machine learning algorithms, Docker, and MongoDB.

Language: Jupyter Notebook - Size: 12.1 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

Sajid030/fraud_detection

In this repository I have trained my machine learning model to detect whether there was fraud or not in your online transaction.

Language: Jupyter Notebook - Size: 7.47 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

vsdcarneiro/Projeto-Integrado-PUC

Final course project submitted to the Specialization Course in Artificial Intelligence and Machine Learning, as a partial requirement for obtaining the title of Specialist.

Language: Jupyter Notebook - Size: 20.3 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

shanuhalli/Assignment-Random-Forest

Use Random Forest to build a model on fraud data, treating those with taxable income <= 30000 as "Risky" and all others as "Good". In addition, a cloth manufacturing company wants to know which segments or attributes lead to high sales.

Language: Jupyter Notebook - Size: 2.82 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

binaryvexjuiit/Detecting-Facial-Diseases-Through-Neural-Networks

Facial skin disease detection using Neural Networks

Language: Python - Size: 367 KB - Last synced: 12 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

ChaitanyaC22/Fraud_Analytics_Credit_Card_Fraud_Detection

The aim of this project is to predict fraudulent credit card transactions with the help of different machine learning models.

Language: Jupyter Notebook - Size: 67.3 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 1

shayleaschreurs/Machine-Learning-Trading-Bot

Our goal was to create an ML bot that analyzes real-time trading data to determine the most opportune times to buy and sell stock

Language: Jupyter Notebook - Size: 9.07 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

guilhermedom/pyspark-horsepower-multilinear-regression

PySpark for multiple linear regression on car horsepower using SMOTE for data augmentation.

Language: Jupyter Notebook - Size: 43.9 KB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

digvijaytaunk/aps-fault-detection-with-deployment

Language: Jupyter Notebook - Size: 20.6 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

prohogiy90/ds-model-of-the-behavior-of-the-SberAvtopodpiska-customers

This repository contains my final project for the Data Science course at the Skillbox school, on the topic "Development of a machine learning algorithm to predict the behavior of customers of SberAvtopodpiska"

Language: Jupyter Notebook - Size: 5.36 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

ngfelixxx/Bank-Account-Machine-Learning-Model

Performing feature engineering, hyperparameter tuning, and classification algorithms.

Language: Jupyter Notebook - Size: 149 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

schatzederwelt/toxic_comments_detection

Automatic detection of toxic comments

Language: Jupyter Notebook - Size: 1.86 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

shayleaschreurs/Supervised_Learning_Regression_Model

Module 12 - Using imblearn, I'll use a logistic regression model to compare two versions of a dataset. First, I’ll use the original data. Next, I’ll resample the data by using RandomOverSampler. In both cases, I’ll get the count of the target classes, train a logistic regression classifier, calculate the balanced accuracy score, generate a con

Language: Jupyter Notebook - Size: 914 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

spapicchio/Gamma-Telescope-Analysis

Machine Learning analysis for an imbalanced dataset. Developed as final project for the course "Machine Learning and Intelligent Systems" at Eurecom, Sophia Antipolis

Language: Jupyter Notebook - Size: 48.9 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 2 - Forks: 0

Nveatch/Credit_Risk_Analysis

Credit risk analysis using sklearn and supervised machine learning

Language: Jupyter Notebook - Size: 86.9 KB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

Wrancher123/Credit_Risk_Analysis

In this analysis, I will be using several supervised machine learning models to predict credit risk on loan data.

Language: Jupyter Notebook - Size: 18.9 MB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

AJMnd/Credit_Risk_Analysis

An analysis on credit risk

Language: Jupyter Notebook - Size: 192 KB - Last synced: over 1 year ago - Pushed: almost 2 years ago - Stars: 1 - Forks: 0

ShivamChoudhary17/Data-Science

Machine learning, EDA, feature engineering, plots, transformation of features

Language: Jupyter Notebook - Size: 2.57 MB - Last synced: 5 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

SayamAlt/Fraudulent-Transactions-Prediction

Successfully trained a machine learning model which can predict whether a given transaction is fraud or not.

Language: Python - Size: 32.2 KB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

samirhinojosa/OC-P7-implement-a-scoring-model

Implement a Scoring Model

Language: Jupyter Notebook - Size: 277 MB - Last synced: over 1 year ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

kalyaniasthana/CS273A_project_diabetes

Course Project for CS273A: Machine Learning at UCI

Language: Jupyter Notebook - Size: 10 MB - Last synced: 6 months ago - Pushed: over 3 years ago - Stars: 3 - Forks: 0

zunicd/T2D-Predictions

Predicting health risks for type 2 diabetes based on three A1C levels (no-diabetes, pre-diabetes, diabetes).

Language: Jupyter Notebook - Size: 7.89 MB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 1

ChuaCheowHuan/basic_ML

This repository contains simple usage examples for basic machine learning libraries. These notebooks are tested in Colab.

Language: Jupyter Notebook - Size: 1.92 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

masatakashiwagi/analysis-imbalanced-classification

Over-Sampling and Under-Sampling for Imbalanced Classifications.

Language: Jupyter Notebook - Size: 910 KB - Last synced: over 1 year ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

bwacker1/Machine-Learning-Homework-Columbia-FinTech-Boot-Camp

Columbia FinTech Boot Camp Homework - Programs to utilize resampling and ensemble machine learning models to predict credit risk for retail loans.

Language: Jupyter Notebook - Size: 36.8 MB - Last synced: over 1 year ago - Pushed: about 4 years ago - Stars: 0 - Forks: 0

rajtulluri/Santander-Customer-transaction-prediction

Predicting whether a customer will carry out a transaction or not for Santander group

Language: Jupyter Notebook - Size: 107 KB - Last synced: over 1 year ago - Pushed: about 4 years ago - Stars: 0 - Forks: 0

Ostyk/Wonka-bar

LOOPQ prize competition: detect defective green chocolate bar wrappers at Willy Wonka's company, which produces the golden scrumpalicious candy bar

Language: Jupyter Notebook - Size: 19.5 MB - Last synced: over 1 year ago - Pushed: about 5 years ago - Stars: 0 - Forks: 0

clairempr/spooky-classify

Text classification with scikit-learn, used to make predictions for the Kaggle Spooky Author Identification competition

Language: Python - Size: 62.5 KB - Last synced: over 1 year ago - Pushed: over 6 years ago - Stars: 1 - Forks: 1

Related Keywords
imblearn (64), sklearn (27), python (27), pandas (24), machine-learning (23), numpy (17), scikit-learn (15), imbalanced-data (14), smote (11), matplotlib (11), seaborn (9), random-forest (8), classification (8), jupyter-notebook (8), logistic-regression (8), random-forest-classifier (7), xgboost (6), data-science (6), pipeline (6), oversampling (5), python3 (5), imbalanced-learning (4), matplotlib-pyplot (4), adasyn (4), sklearn-metrics (3), supervised-machine-learning (3), confusion-matrix (3), random-over-sampling (3), data-visualization (3), feature-selection (3), catboost (3), fastapi (3), svm-classifier (3), lightgbm (3), scipy (3), data-analysis (3), decision-tree (2), smotetomek (2), streamlit (2), ensemble (2), extra-trees-classifier (2), smote-sampling (2), knn-classifier (2), multiclass-classification (2), smoteenn (2), nlp-machine-learning (2), scikitlearn-machine-learning (2), sklearn-library (2), standardscaler (2), counter (2), linear-regression (2), exploratory-data-analysis (2), randomoversampler (2), ensemble-learning (2), nlp (2), xgboost-classifier (2), deep-learning (2), supervised-learning (2), tensorflow (2), hyperparameter-optimization (2), pymongo (2), keras (2), mongodb (2), nltk (2), pathlib (2), eda (2), balancedrandomforestclassifier (2), undersampling (2), prediction (2), joblib (1), generalized-linear-model (1), boto3 (1), sber (1), scikit-learn-python (1), histgram-gradient-boosting (1), multilinear-regression (1), pyspark (1), data-augmentation (1), lgbmclassifier (1), ordinary-least-squares (1), geospatial-data (1), geospatial-machine-learning (1), geospatial-predictions (1), land-use-plan (1), paraguay (1), policy-analysis (1), predictive-modeling (1), collections (1), machinelearning-python (1), datapipeline (1), docker (1), flask (1), mlflow (1), assignment-15 (1), grid-search-cv (1), standard-scaler (1), glob (1), banking (1), credit-card-fraud-detection (1), decision-tree-classifier (1)