An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: label-encoding

Sanchemtos/Multi-Label-Emotion-Recognition

This project focuses on detecting multiple emotions from English text using a fine-tuned **BERT** model. It leverages the [GoEmotions](https://huggingface.co/datasets/go_emotions) dataset — a large-scale human-annotated dataset of Reddit comments labeled with 27 emotions + neutral.

Language: Jupyter Notebook - Size: 118 KB - Last synced at: about 18 hours ago - Pushed at: about 19 hours ago - Stars: 0 - Forks: 0

362heavy/Liver_Cirrhosis_Stage_Detection_System

This repository contains a system for detecting the stage of liver cirrhosis using historical patient data. It employs machine learning to analyze key medical indicators and classify patients into three distinct stages. 🦠📈

Language: Jupyter Notebook - Size: 13.9 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

saifalibaig/Multi-Label-Emotion-Recognition

This project focuses on detecting multiple emotions from English text using a fine-tuned **BERT** model. It leverages the [GoEmotions](https://huggingface.co/datasets/go_emotions) dataset — a large-scale human-annotated dataset of Reddit comments labeled with 27 emotions + neutral.

Language: Jupyter Notebook - Size: 122 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

SunnyRao07/stroke-risk-prediction

Predicting stroke risk using machine learning models based on healthcare and demographic data.

Language: Jupyter Notebook - Size: 1.63 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

SunnyRao07/Water-Quality-Analysis

A machine learning project that predicts water potability based on chemical and physical attributes, using models like Logistic Regression, Random Forest, and XGBoost.

Language: Jupyter Notebook - Size: 584 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

imharshag/NIDS-using-ML

This project showcases a Network Intrusion Detection System (NIDS) designed to bolster cybersecurity defenses against evolving threats.

Size: 105 KB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 9 - Forks: 1

Sameer051022/LabelEncoding_Models

"A comprehensive exploration of machine learning models using label encoding to preprocess categorical data, featuring implementations in Python with sklearn."

Language: Jupyter Notebook - Size: 86.9 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0
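
For readers unfamiliar with the topic, label encoding can be sketched as follows. This is a minimal illustration in plain Python, not code from the repository above: the helper names `fit_label_encoder` and `transform` and the sample data are hypothetical. It assigns integers to categories in sorted order, the same ordering sklearn's `LabelEncoder` uses.

```python
def fit_label_encoder(values):
    """Map each distinct category to an integer, assigned in sorted order."""
    classes = sorted(set(values))
    return {label: index for index, label in enumerate(classes)}


def transform(values, mapping):
    """Replace every category with its integer code."""
    return [mapping[value] for value in values]


# Hypothetical sample column, used only for illustration.
colors = ["red", "green", "blue", "green"]
mapping = fit_label_encoder(colors)   # {'blue': 0, 'green': 1, 'red': 2}
encoded = transform(colors, mapping)  # [2, 1, 0, 1]
```

The integer codes imply an order (blue < green < red) that the data may not have, which is why one-hot encoding is often preferred for nominal features.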

maheshvarade/decision-tree-for-classification-of-penguins-species

🐧 Decision Tree for Classification of Penguin Species. This project implements a Decision Tree Classifier to classify different species of penguins using the Palmer Penguins dataset. The dataset contains features such as bill length, bill depth, flipper length, and body mass to predict the species (Adelie, Chinstrap, or Gentoo).

Language: Jupyter Notebook - Size: 1.23 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

JaspreetSingh-exe/Vehicle-Price-Prediction

Vehicle Price Prediction is a machine learning project that estimates vehicle prices using features like make, model, year, mileage, and more. It employs multiple regression models, including Linear Regression, Random Forest, Gradient Boosting, CatBoost, and Stacking Regressor, with GridSearchCV for tuning.

Language: Jupyter Notebook - Size: 1.17 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Shailesh-Padhariya/Human-Resources

This project analyzes employee attrition using machine learning models, including Logistic Regression, Random Forest, and XGBoost. The objective is to identify key factors influencing employee turnover and provide insights to improve retention strategies.

Language: Jupyter Notebook - Size: 1.05 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

LKEthridge/Supervised_Learning

Supervised Learning project from TripleTen

Language: Jupyter Notebook - Size: 324 KB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

kingership321/Esophageal_Cancer_Detection

This is an esophageal cancer detection project that uses a dataset from Kaggle. The dataset is attached to this repo. The repo also contains the code in both Jupyter notebook and PDF format. The code involves techniques such as EDA, feature engineering, label encoding, and one-hot encoding, and the model used is Linear Regression.

Language: Jupyter Notebook - Size: 5.5 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

rubyyy1118/Machine_Learning_Optimization_Study

The Learning From Data - Assignment in my MSc Business Analytics course

Language: Jupyter Notebook - Size: 1.32 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

aneeshmurali-n/Project-ML-Data-Preprocessing

The main objective of this project is to design and implement a robust data preprocessing system that addresses common challenges such as missing values, outliers, inconsistent formatting, and noise. By performing effective data preprocessing, the project aims to enhance the quality, reliability, and usefulness of the data for machine learning.

Language: Jupyter Notebook - Size: 174 KB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

abinashsahoo007/Project-Resume-Classification

The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human intervention.

Language: Jupyter Notebook - Size: 11.6 MB - Last synced at: 2 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

areeba0/Image-Classification-using-Gaussian-Naive-Bayes-SVM-and-KNN

The code uses KNN, Gaussian Naive Bayes, and SVM to classify images. It preprocesses and normalizes the data, applies PCA, and computes accuracy, precision, etc. It evaluates k-NN using Euclidean distance and cosine similarity, visualizing results with line plots, 3D scatter plots, and confusion matrices to demonstrate classifier performance.

Language: Jupyter Notebook - Size: 1.81 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0
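
The two k-NN comparison measures named in the description above, Euclidean distance and cosine similarity, can be sketched in plain Python. This is an illustrative sketch, not the repository's code; the function names and sample vectors are assumptions.

```python
import math


def euclidean_distance(a, b):
    """Straight-line distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors (undefined for zero vectors)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Hypothetical vectors pointing in the same direction with different lengths.
u = [1.0, 0.0]
v = [3.0, 0.0]
dist = euclidean_distance(u, v)  # 2.0: the points are far apart
sim = cosine_similarity(u, v)    # 1.0: the directions are identical
```

The example shows why the two measures can rank neighbours differently: Euclidean distance is sensitive to magnitude, while cosine similarity only compares direction.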

CRIPAC-DIG/hetgsl

[CIKM 2021] Code and dataset for "Label-informed Graph Structure Learning for Node Classification"

Language: Jupyter Notebook - Size: 152 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 1

SannketNikam/Credit-Risk-Analysis

Credito - Credit Risk Analysis using XGBoost Classifier with RandomizedSearchCV for loan approval decisions.

Language: Jupyter Notebook - Size: 6.47 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 1

sorodocosmin/feedbackHHC

This project focuses on analyzing patient feedback regarding the treatment provided by home healthcare service agencies.

Language: Python - Size: 5.52 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

shikhararyan/Text-Classification-Transformer-Model-

This sentiment analysis model utilizes a Transformer architecture to classify text sentiment into positive, negative, or neutral categories with high accuracy. It preprocesses text data, trains the model on the IMDB dataset, and effectively predicts sentiment based on user input.

Language: Jupyter Notebook - Size: 3.39 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Ninad077/Machine_Learning-Logistic_Regression

Content: Machine Learning, Logistic regression steps, Probability matrix, Confusion matrix, Accuracy score, Recall value, Data preprocessing, Label encoding, Scaling the data, Splitting train test data, Running Logistic Regression, Y prediction on test data, Class imbalance, Type 1 & Type 2 errors.

Language: Jupyter Notebook - Size: 898 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

damaniayesh/Covid_data_Prediction

This project classifies patient data to check whether a patient is COVID-positive or COVID-negative.

Language: Jupyter Notebook - Size: 68.4 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Ishikaaa/ML-Data-Scientist-job-market-in-the-US

Language: Jupyter Notebook - Size: 82 KB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 1

Gopika-S-Nair/Machine-Learning---Classification

This repository contains implementations of Machine Learning classification algorithms.

Language: Jupyter Notebook - Size: 456 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

REAtes/Directing-App-Customers-to-Subscription

The primary goal of this project is to convert free users of a financial tracking app into paid members. This conversion will be achieved by building a model that identifies users who are unlikely to enroll in the paid version of the app.

Language: Python - Size: 1.84 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

RimTouny/Credit-Card-Fraud-Detection

Focused on advancing credit card fraud detection, this project employs machine learning algorithms, including neural networks and decision trees, to enhance fraud prevention in the banking sector. It serves as the final project for a Data Science course at the University of Ottawa in 2023.

Language: Jupyter Notebook - Size: 6.25 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

katreparitosh/Water-Quality-Analysis

An analytical study that applies parameter optimization and regression with gradient descent to predict water quality levels across Indian waters.

Language: Jupyter Notebook - Size: 10.6 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 13 - Forks: 4

Deepshikha05/Customer_Behavior_Analysis

Language: Jupyter Notebook - Size: 185 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

sayo2rule/bank-churning-with-machine-learning

This repository covers my code using regression models to predict whether a customer will exit a bank. It also covers the use of classification models to classify whether a customer has left the bank or not (binary classification).

Language: Jupyter Notebook - Size: 1.47 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Himanshuachwaan/Diamonds

Diamond price prediction using Polars and scikit-learn

Language: HTML - Size: 252 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

m-enesyilmaz/Feature_Engineering_with_Python

📶 In this repository, we will do feature engineering with Python.

Language: Python - Size: 364 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

chiragn888/Data-Science-Projects

This repository consists of various projects based on Machine Learning and NLP.

Language: Jupyter Notebook - Size: 52.7 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

RazHoshia/unofficial_encoders

Unofficial but extremely useful Label and One Hot encoders.

Language: Python - Size: 80.1 KB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

EmamulHossen/Feature_Engineering

Feature engineering or feature extraction or feature discovery is the process of extracting features from raw data.

Language: Jupyter Notebook - Size: 57.6 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

NirmalKanagasabai/Reddit_TopicClassification__RandomForest_SGDClassifier

Supervised Machine Learning methods (Random Forest and SGD Classifier) to classify short conversations extracted from Reddit

Language: Python - Size: 242 KB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 1

prasadposture/Data-Preparation

There are a lot of things that need to be done to a dataset before we feed it to the machine; these things come under data preprocessing. In this repository I have tried to explain them with some examples.

Language: Jupyter Notebook - Size: 420 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 1

maggieezzat/Visual-History-of-Nobel-Prize-Winners

This is an implementation for a DataCamp project: A Visual History of Nobel Prize Laureates. We try to answer the proposed questions and visualize the results.

Language: Jupyter Notebook - Size: 2.94 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

deepakrameshgowda/CREDIT-CARD-FRAUD-DETECTION

Building predictive models to detect and prevent fraudulent transactions on credit cards and debit cards. Implementation of second-factor authentication for safe and secure transactions.

Language: Jupyter Notebook - Size: 7.71 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

S-CHAN11/Insurance-Claim-Prediction

In this project, I use 3 machine learning models (CART, Random Forest and ANN) to predict the claim frequency for a travel insurance firm. I also evaluate which of the three models is most suitable for our dataset.

Language: Jupyter Notebook - Size: 1.91 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Manjari-99/MLP_Perceptron

Implements the internal workings of a perceptron and tests its accuracy on the train and test datasets.

Language: Jupyter Notebook - Size: 580 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

NavindaFernando/Feature-Extraction

Heart Risk Level Predicting Regression Model & Web using Feature Engineering and Data Preprocessing 🐤

Language: Jupyter Notebook - Size: 68.4 KB - Last synced at: 14 days ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

bharatkulmani/Dry-Bean

This project predicts the class of beans using supervised learning models.

Language: Jupyter Notebook - Size: 35.3 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

shanuhalli/Assignment-Decision-Trees

Use decision trees to build a model on fraud data, treating those with taxable income <= 30000 as "Risky" and all others as "Good". Separately, a cloth manufacturing company wants to know which segments or attributes drive high sales.

Language: Jupyter Notebook - Size: 3.83 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0
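
The labelling rule stated in the description above (taxable income <= 30000 is "Risky", otherwise "Good") can be sketched as a target-preparation step. This is a minimal illustration with made-up income values, not the repository's code; the function name `risk_label` is hypothetical.

```python
RISK_THRESHOLD = 30000  # threshold taken from the description above


def risk_label(taxable_income):
    """Return the class label used as the decision tree's target."""
    return "Risky" if taxable_income <= RISK_THRESHOLD else "Good"


# Hypothetical incomes, including the boundary value itself.
incomes = [25000, 30000, 30001]
labels = [risk_label(income) for income in incomes]  # ['Risky', 'Risky', 'Good']
```

Because the rule uses <=, an income of exactly 30000 falls in the "Risky" class.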

MZayed47/CS-GO_KD-Classifier

This classification task is specifically dependent on a video dataset that includes video clips of kill and death scenes from the first-person shooting game “CS Go”. I have used the ResNet-50 model for image classification and then turned it into a more accurate video classifier by employing the rolling averaging method.

Language: Jupyter Notebook - Size: 636 MB - Last synced at: 8 months ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0
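
One common way to apply rolling averaging to per-frame predictions, as mentioned in the description above, is to average the class probabilities over the last few frames before picking a label. The sketch below assumes that approach; it is not taken from the repository, and the function name, window size, and probabilities are illustrative.

```python
from collections import deque


def rolling_average_labels(frame_probs, window=3):
    """Label each frame from the mean of the last `window` probability vectors."""
    recent = deque(maxlen=window)  # oldest frame drops out automatically
    labels = []
    for probs in frame_probs:
        recent.append(probs)
        n_classes = len(probs)
        mean = [sum(p[c] for p in recent) / len(recent) for c in range(n_classes)]
        labels.append(max(range(n_classes), key=mean.__getitem__))
    return labels


# Hypothetical per-frame probabilities for two classes; frame 2 is a one-off flicker.
frame_probs = [[0.9, 0.1], [0.8, 0.2], [0.4, 0.6], [0.9, 0.1], [0.7, 0.3]]
raw = [max(range(2), key=p.__getitem__) for p in frame_probs]  # [0, 0, 1, 0, 0]
smoothed = rolling_average_labels(frame_probs, window=3)       # [0, 0, 0, 0, 0]
```

Averaging suppresses the single-frame flip at index 2, which is the kind of label flicker that per-frame image classifiers tend to produce on video.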

sayo2rule/car_sharing_services

Database management and data analytics from a car-sharing dataset. The dataset contains information about the customers' demand rate between January 2017 and August 2018.

Language: Python - Size: 572 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

sauhard2701/Fraud-Transaction-Detection

INSAID Assignment to create an ML model to detect fraudulent transactions for a financial company.

Language: Jupyter Notebook - Size: 861 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 2

MoinDalvs/Learn_Feature_Engineering

Data Set: House Prices: Advanced Regression Techniques. Feature Engineering with 80+ Features.

Language: Jupyter Notebook - Size: 630 KB - Last synced at: about 20 hours ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

nani757/Ordinal-Encoding---Label-Encoding

Ordinal Encoding - Label Encoding

Language: Jupyter Notebook - Size: 4.88 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

copev313/Chatbot-Using-Deep-Learning

We build a chatbot by implementing machine learning and natural language processing.

Language: Jupyter Notebook - Size: 368 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

Csengupta1101/Data-Is-Good-Exam---September

This repository contains the problems and solutions for the exam conducted in September 2021.

Language: Jupyter Notebook - Size: 1.87 MB - Last synced at: 19 days ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

winleexz/XGBoost_Regression

A Kaggle project to predict house prices using XGBoost modeling

Language: Jupyter Notebook - Size: 9.77 KB - Last synced at: 12 months ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

saminens/Women-in-Data-Science-2020

WiDS Datathon 2020 on patient health through data from MIT’s GOSSIS (Global Open Source Severity of Illness Score) initiative.

Language: Jupyter Notebook - Size: 929 KB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 1

SubhamChoudhury/Feature_Engineering-and-Selection

This repository contains all the resources made and used in the session for Feature Engineering

Language: Jupyter Notebook - Size: 1000 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

ft-airbnb-price-01/backend-model

Repo houses the predictive NN model and its associated .py modules

Language: Jupyter Notebook - Size: 183 MB - Last synced at: 4 months ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

Zauverer/exce_Linear_Discriminant_Analysis

Linear Discriminant Analysis

Language: Jupyter Notebook - Size: 293 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

Related Keywords
label-encoding 55
one-hot-encoding 14
feature-engineering 14
machine-learning 13
python 13
random-forest 9
logistic-regression 9
pandas 9
data-cleaning 9
data-preprocessing 8
matplotlib 7
random-forest-classifier 7
numpy 7
decision-tree-classifier 7
feature-scaling 6
confusion-matrix 6
sklearn 6
data-visualization 6
missing-values 5
standard-scaler 5
hyperparameter-tuning 5
seaborn 4
outliers 4
decision-trees 4
smote 4
xgboost 4
exploratory-data-analysis 4
linear-regression 4
regression-models 4
principal-component-analysis 3
outlier-removal 3
outlier-detection 3
model-evaluation 3
hyperparameter-optimization 3
ordinal-encoding 3
data-science 3
eda 3
python3 3
feature-selection 3
artificial-intelligence 3
xgboost-classifier 3
jupyter-notebook 3
accuracy-score 3
fraud-detection 2
encoder 2
svm 2
classification 2
decision-tree 2
tensorflow 2
gradient-descent 2
transformation 2
scaling 2
handling-missing-value 2
sigmoid-function 2
shapely 2
huggingface 2
standardscaler 2
classification-report 2
machine-learning-algorithms 2
scikit-learn 2
tokenization 2
sgd-classifier 2
ai 2
text-preprocessing 2
class-imbalance 2
linear-discriminant-analysis 2
predictive-modeling 2
bert-model 2
gaussian-naive-bayes 2
knn 2
neural-network 2
knn-classification 2
joblib 2
quantile-transformer 1
classifier 1
auc 1
artificial-neural-networks 1
quadratic-discriminant-analysis 1
cart 1
data-cleaning-and-preprocessing 1
polynomial-features 1
gridsearchcv 1
multivariate-analysis 1
neural-network-classifier 1
html5 1
handling-outlier 1
predictive-analysis 1
univariate-analysis 1
variable-importance 1
keras-tensorflow 1
perceptron-learning-algorithm 1
flask 1
categorical-encoding 1
data-binning 1
topic-classification 1
sgd 1
scikitlearn-machine-learning 1
reddit 1
missing-data 1
sklearn-compatible 1