An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: categorical-encoding

Nouran246/House-Pricing-Prediction Fork of Yahia-Elshobokshy/Task1-Machine-Learning

Housing Prices Prediction using Machine Learning Developed a regression model to predict housing prices using data preprocessing, feature engineering, and various regression algorithms. Tuned hyperparameters and evaluated performance with key metrics (RMSE, MAE, R²).

Language: Jupyter Notebook - Size: 2.66 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

rahulvictor12/German-Bank-Loan-Defaulter-Prediction

A machine learning project to predict loan defaults in a German bank's customer base. Using the German Credit Risk dataset, it explores key factors contributing to defaults and trains models like Random Forest, GBM, and XGBoost. Includes EDA, data processing, hyperparameter tuning, and model evaluation.

Language: Jupyter Notebook - Size: 1.02 MB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

NiharJani2002/kaggle-Intermediate-Machine-Learning

Intermediate Machine Learning Course By Kaggle

Language: Jupyter Notebook - Size: 108 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

Davityak03/Random-Forest-Ensemble-Technique

Language: Jupyter Notebook - Size: 32.2 KB - Last synced at: 2 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

venkat-a/Text_Processing_RNN_LSTM

Text Processing RNN leverages RNN and LSTM models for advanced text processing. It features deep learning techniques for NLP tasks, utilizing GloVe for word embeddings, aimed at both educational and practical applications.

Language: Jupyter Notebook - Size: 15.6 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

dixitamol/ML_code_templates_R

Code templates for different ML algorithms

Language: R - Size: 15.6 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1

arunsinghbabal/Controlled-Automation-for-Data-Processing

Perform semi automated exploratory data analysis, feature engineering and feature selection on provided dataset by visualizing every possibilities on each step and assisting the user to make a meaningful decision to achieve a low-bias and low-variance model.

Language: Jupyter Notebook - Size: 79.7 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0

SayamAlt/E-Commerce-Text-Classification

Successfully established a machine learning model that can accurately classify an e-commerce product into one of four categories, namely "Books", "Clothing & Accessories", "Household" and "Electronics", based on the product's description.

Language: Jupyter Notebook - Size: 10.6 MB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ecvsgl/MLproject-DrivenData-PumpItUp

Exploratory data analysis and model preparation for DrivenData contest: PumpItUp!

Language: Jupyter Notebook - Size: 12.9 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

leffff/waveml

Open source machine learning library with various machine learning tools

Language: Python - Size: 70.3 KB - Last synced at: about 1 month ago - Pushed at: almost 3 years ago - Stars: 12 - Forks: 1

NavindaFernando/Feature-Extraction

Heart Risk Level Predicting Regression Model & Web using Feature Engineering and Data Preprocessing :baby_chick:

Language: Jupyter Notebook - Size: 68.4 KB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

cpa-analytics/embedding-encoder

Scikit-Learn compatible transformer that turns categorical variables into dense entity embeddings.

Language: Jupyter Notebook - Size: 758 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 34 - Forks: 6

albertusk95/weight-of-evidence-spark

Weight of Evidence Encoding & Information Value

Language: Python - Size: 24.4 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 9 - Forks: 4

Yashasvi863/Machine-Learning-Classification

Customer Churn Analysis

Language: Jupyter Notebook - Size: 729 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

AshiniAnantharaman/Image_classification_of_CIFAR10_dataset

The project uses Artificial Neural Network and Convolutional Neural Network to classify images into 10 different categories.

Language: Jupyter Notebook - Size: 2.17 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

NaquibAlam/Categorical_Encoding_Experimentation

This repo contains code for experimenting with categorical encoding - WoE, Catboost, Target encoder, and many more.

Size: 177 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

subhayuroy/KickStarter

🎬This KickStarter project is about some🎞 foreign films🎥 and music videos🎶. This is an analysis 📽of their 'goal currency' and release time.🎦

Language: Jupyter Notebook - Size: 113 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Related Keywords
categorical-encoding 17 machine-learning 9 xgboost 4 random-forest 4 exploratory-data-analysis 4 feature-selection 4 scikit-learn 3 feature-engineering 3 model-validation 2 python 2 data-preprocessing 2 pandas 2 classification 2 numpy 2 hyperparameter-tuning 2 confusion-matrix 2 logistic-regression 2 gridsearchcv 2 tensorflow 2 classification-model 1 drivendata 1 stacked-predictions 1 gradient-boosting-classifier 1 knn-classification 1 linear-transformations 1 ensemble 1 feature-transformation 1 text-vectorization 1 text-preprocessing 1 text-classification 1 model-training-and-evaluation 1 model-deployment 1 hyperparameter-optimization 1 cross-validation 1 data-visualization 1 data-transformation 1 data-science 1 auto-feature-selection 1 auto-feature-engineering 1 feature-generation 1 baseline-model 1 target-encoding 1 high-cardinality-encoding 1 sklearn-library 1 convolutional-neural-networks 1 artificial-neural-networks 1 feature-extraction 1 weight-of-evidence 1 information-value 1 neural-networks 1 keras 1 embeddings 1 deep-learning 1 categorical-features 1 scaling 1 quantile-transformer 1 polynomial-features 1 label-encoding 1 joblib 1 html5 1 handling-outlier 1 flask 1 weighted-averages 1 stacking 1 auto-eda 1 lstm-neural-networks 1 embedding-vectors 1 deep-neural-networks 1 ensemble-model 1 pipeline-automation 1 model-optimization 1 gradient-boosting 1 feature-leakage 1 data-imputation 1 advanced-machine-learning 1 recall 1 randomsearch-cv 1 precision 1 modelevaluation 1 missing-value-handling 1 gbm 1 f1-score 1 data-processing 1 bagging 1 ada-boost-classifier 1 accuracy 1 regression-algorithms 1 model-evaluation 1 learning-curve-analysis 1 feature-scaling 1 upper-confidence-bounds 1 svm 1 reinforcement-learning 1 regression 1 pca 1 nlp 1 naive-bayes-classifier 1 lda 1 forward-selection 1 eclat-algorithm 1