An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: cross-validation

Zulqarnain-10/Behavioral-Risk-Factor-By-Tobacco-Use

This project analyzes tobacco use risk factors using machine learning to predict 'High Confidence Limit' for various demographics. It includes data preprocessing, EDA, and multiple ML models like Linear Regression, Random Forest, and SVM. Detailed results and code are available in the repository.

Language: Jupyter Notebook - Size: 1.8 MB - Last synced at: about 6 hours ago - Pushed at: about 7 hours ago - Stars: 0 - Forks: 0

supcode1234/Twitter-Sentiment-Analysis

🔍 Analyze Twitter sentiment by classifying tweets into Positive, Negative, Neutral, and Irrelevant categories using machine learning models.

Language: Jupyter Notebook - Size: 1.59 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

vicky999jeevi/AI-Code-Explainer-Optimizer

🧩 Optimize and explain code effortlessly with our AI-driven multi-agent system, using LangGraph and Gemini for efficient solutions.

Language: Python - Size: 7.01 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

precioussak/scikit-learn-is-what-you-dont-need

🔍 Discover why scikit-learn may not meet your needs and explore better alternatives for your machine learning projects.

Size: 1.95 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

JavedFazlulahF/Customer-Churn-Prediction

📊 Predict customer churn in telecom using machine learning to enhance retention strategies and drive better business outcomes.

Language: Jupyter Notebook - Size: 2.86 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

subash12345679-png/RentalPrice-ML-Modeling

📊 Predict apartment rental prices in Tel Aviv using machine learning models, featuring Elastic Net and Decision Trees for accurate forecasting.

Language: Jupyter Notebook - Size: 1.43 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1 - Forks: 0

Shivaay8055/Bank-Marketing-Data

Los datos se relacionan con campañas de marketing directo (llamadas telefónicas) de una entidad bancaria portuguesa. El objetivo de la clasificación es predecir si el cliente suscribirá un depósito a plazo (variable y).

Size: 2.44 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0

tutkufurkan/Machine-Learning---Advanced-Topics

Advanced ML tutorial: NLP gender classification, PCA visualization, hyperparameter optimization with Grid Search, and collaborative filtering recommendations using real-world datasets

Language: Jupyter Notebook - Size: 10.6 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0

ChokZB/college-regression-analysis

Regression analysis of the ISLR2 College dataset using R. Includes data exploration, model fitting, and diagnostics.

Language: R - Size: 1.84 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

johnnydelsage/Car-Price-Prediction

🚗 Predict used car prices accurately with machine learning models using various regression techniques and market data analysis for fair valuation.

Language: Jupyter Notebook - Size: 1.35 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

MUSA-Zhanchao/MUSA5000-OLS

Spatial Statistics Assignment1

Language: HTML - Size: 16.5 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

stan-dev/loo

loo R package for approximate leave-one-out cross-validation (LOO-CV) and Pareto smoothed importance sampling (PSIS)

Language: R - Size: 131 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 155 - Forks: 37

dhaneshbb/BikeRental

Daily bike rental demand prediction using Ridge Regression on Capital Bikeshare data (2011-2012). Addresses multicollinearity, zero-inflated features, and non-normal distributions. Test R²=0.832, CV R²=0.815±0.032. Includes statistical analysis, VIF removal, and comparison of 10 regression algorithms.

Language: Jupyter Notebook - Size: 20.2 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

0xPutri/mlflow_project

Machine Learning model optimization project using MLflow — includes Grid Search, Random Search, Bayesian Optimization, and K-Fold Cross Validation for efficient experiment tracking.

Language: Python - Size: 144 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

dhaneshbb/AutoPricePred

Machine learning regression model predicting 1985 automobile prices. Lasso model achieves 91.7% R² with superior generalization over XGBoost. Handles extreme multicollinearity (VIF 16,676→8.36), data leakage detection, and outlier treatment through PCA and domain-driven feature engineering.

Language: Jupyter Notebook - Size: 33.4 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

trent-b/iterative-stratification

scikit-learn cross validators for iterative stratification of multilabel data

Language: Python - Size: 45.9 KB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 881 - Forks: 74

Princemurchale/-Predictive-Maintenance-for-Renewable-Energy-Sources

The aim to decrease the maintenance cost of generators used in wind energy production machinery. This is achieved by building various classification models, accounting for class imbalance, and tuning on a user defined cost metric (function of true positives, false positives and false negatives predicted) & productionising the model using pipelines.

Size: 14.6 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

dbouget/validation_metrics_computation

Validation and metrics computation over 3D medical volumes (backend for Raidionics)

Language: Python - Size: 627 KB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 6 - Forks: 5

donishadsmith/vswift

A R package for evaluating ML classification models.

Language: R - Size: 1.02 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 1 - Forks: 0

p-dirac/python-keras-cnn

This application demonstrates the implementation details of a convolutional network programmed with Python class structures.

Language: Python - Size: 7.58 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

danielemonteverdi02/Progetto-Titanic-regressione-logistica-

Analisi e predizione della sopravvivenza dei passeggeri del Titanic. Pulizia dei dati, imputazione dei valori mancanti (imputazione singola e con MICE), feature engineering, codifica one-hot e scalatura numerica. Addestramento di una regressione logistica con cross-validation e valutazione su validation set. Predizione finale sul test set.

Language: Jupyter Notebook - Size: 838 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 1 - Forks: 0

PhilBoileau/cvCovEst

An R package for nonparametric covariance matrix estimation in high dimensions

Language: R - Size: 4.88 MB - Last synced at: 7 days ago - Pushed at: almost 2 years ago - Stars: 14 - Forks: 4

statalasso/lassopack

LASSOPACK: Stata module for lasso, square-root lasso, elastic net, ridge, adaptive lasso estimation and cross-validation

Language: Stata - Size: 1.38 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 6 - Forks: 4

Davide011/ML_project_South_African_Heart_Disease

Public Repository: Machine Learning & Data Mining project using the South African Heart Disease dataset. Applied PCA, Regularized Linear Regression, ANN, Logistic Regression, and Decision Trees with cross-validation for regression and classification. Includes feature scaling, EDA, and statistical tests.

Size: 1.32 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

autoreject/autoreject

Automated rejection and repair of bad trials/sensors in M/EEG

Language: Python - Size: 704 KB - Last synced at: 9 days ago - Pushed at: 2 months ago - Stars: 147 - Forks: 59

sahildev23/predictive-modeling

ML pipeline with automated preprocessing, cross-validation, and performance visualization. Achieved 28% accuracy improvement on 100k+ records

Language: Python - Size: 10.7 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

AnderCruz/Text-Classification-Generation-Project-CNN

This project explores natural language processing techniques using TensorFlow and Keras, focusing on challenges faced by a news portal. The portal requires solutions for classifying articles based on their titles and descriptions, and for assisting writers in maintaining a consistent style by suggesting subsequent words.

Size: 9.77 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

nf-core/drugresponseeval

Pipeline for testing drug response prediction models in a statistically and biologically sound way.

Language: Nextflow - Size: 7.68 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 24 - Forks: 6

4Freye/panelsplit

A tool for performing cross-validation with panel data

Language: Python - Size: 8.57 MB - Last synced at: 24 days ago - Pushed at: 26 days ago - Stars: 21 - Forks: 2

distcomp/SvF

Implementation of SvF-technology of balanced identification of mathematical models by experimental data

Language: Python - Size: 38.8 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 1 - Forks: 0

ctesta01/nadir

Super learning with flexible formulas

Language: R - Size: 20.1 MB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 5 - Forks: 1

chonzadaniel/chonzadaniel

Skilled Data Scientist with hands on experience in ML, NLP, Deep Learning & GenAI building clean, modular projects with real-world problems solutions: text classification, class imbalance, RAG systems, and PEFT. Developed impactful AI tools powered by AWS, Streamlit, Slack, & vector DBs.

Size: 29.3 KB - Last synced at: 30 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

mdabros/SharpLearning

Machine learning for C# .Net

Language: C# - Size: 4.04 MB - Last synced at: 28 days ago - Pushed at: 8 months ago - Stars: 400 - Forks: 86

TheMrityunjayPathak/SpaceshipTitanicClassification

Spaceship Titanic Classification

Language: Jupyter Notebook - Size: 5.27 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

NadyaKameelaa1/klasifikasi_lemon

Ulangan Praktik Mapel Pilihan (AI) - Machine Learning, StandardScaler, OneHotEncoder, OrdinalEncoder, Logistic Regression, dan lain sebagainya.

Language: Jupyter Notebook - Size: 84 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

FBruzzesi/timebasedcv

Time based splits for cross validation

Language: Python - Size: 1.7 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 39 - Forks: 1

alopezmoreira1989/Remote-Work-in-Data-Science

This repository contains my data science project developed in Kaggle. It includes Jupyter notebooks, datasets, and analysis focused on remote work in data science. The goal is to explore insights, apply machine learning techniques, and showcase reproducible workflows.

Language: Jupyter Notebook - Size: 9 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

GDSL-UL/san

Spatial Modelling for Data Scientists

Language: TeX - Size: 523 MB - Last synced at: 18 days ago - Pushed at: over 1 year ago - Stars: 86 - Forks: 19

rvalavi/blockCV

The blockCV package creates spatially or environmentally separated training and testing folds for cross-validation to provide a robust error estimation in spatially structured environments. See

Language: R - Size: 51.4 MB - Last synced at: 28 days ago - Pushed at: 2 months ago - Stars: 114 - Forks: 23

Umer-Farooq-CS/California-Housing-Regression

Linear Regression experiments on the California Housing dataset across five phases, using NumPy and scikit-learn only (no pandas). Includes EDA, polynomial features, SGD with scaling, residuals, 5-fold CV, and an LNCS-style report with figures.

Language: Jupyter Notebook - Size: 6.25 MB - Last synced at: 23 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

RubixML/CIFAR-10

Use the famous CIFAR-10 dataset to train a multi-layer neural network to recognize images of cats, dogs, and other things.

Language: PHP - Size: 163 MB - Last synced at: 25 days ago - Pushed at: 4 months ago - Stars: 57 - Forks: 9

Not-Buddy/FerricSort

Rust-based CNN for garbage classification into 6 categories with high accuracy on GPU.

Language: Rust - Size: 9.05 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

shahzadmustafa15/credit-card-fraud-detection

Credit card fraud detection using Random Forest with Stratified K-Fold cross-validation and F1-score evaluation.

Language: Jupyter Notebook - Size: 46.9 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

StarlangSoftware/Sampling-CPP

Data sampling library

Language: C++ - Size: 223 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 1

0aub/vision-cls-master

Deep learning and machine learning framework for image classification with 60+ pretrained models, 15+ attention mechanisms, cross-validation support, and Docker deployment. Built with PyTorch.

Language: Python - Size: 56.6 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

EshanSugeesh/Amex-Machine-Learning-Project

Predictive modeling on American Express dataset using advanced ML algorithms, extensive feature engineering (300+ features), and cloud-based data science workflows.

Language: Jupyter Notebook - Size: 8.79 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Jonghwan-dev/Awesome-Segmentation-in-Medical

Welcome, Awesome-segmentation-in-medical project. This repo is robust, reproducible, and fair benchmarking framework for breast ultrasound (BUS) image segmentation. Features 10+ models (UNet, UNet++, TransUnet, SwinUnet etc.), 5+ datasets, and strict data separation to prevent test set leakage.

Language: Python - Size: 1.24 MB - Last synced at: 1 day ago - Pushed at: 4 months ago - Stars: 6 - Forks: 0

mljs/libsvm

LIBSVM for the browser and nodejs :fire:

Language: JavaScript - Size: 1.54 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 87 - Forks: 14

shramitamaheshwari/Bus-demand-forecasting

End-to-end Bus Demand Forecasting project: performed data preprocessing, feature engineering, and trained models (LightGBM) to predict seat demand. Includes cross-validation, evaluation (RMSE), and generates submission files for competition use.

Language: Python - Size: 4.88 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

kaladabrio2020/machine-learning

Times Series, Classification-label/image, Regression

Language: Jupyter Notebook - Size: 281 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

knickodem/kfa

k-fold cross validation for factor analysis

Language: HTML - Size: 9.39 MB - Last synced at: 28 days ago - Pushed at: 2 months ago - Stars: 7 - Forks: 1

RubixML/HAR

Recognize one of six human activities such as standing, sitting, and walking using a Softmax Classifier trained on mobile phone sensor data.

Language: PHP - Size: 23.9 MB - Last synced at: 25 days ago - Pushed at: 4 months ago - Stars: 24 - Forks: 7

RubixML/Colors

Demonstrating unsupervised clustering using the K Means algorithm and synthetic color data.

Language: PHP - Size: 257 KB - Last synced at: 25 days ago - Pushed at: 4 months ago - Stars: 18 - Forks: 3

RubixML/Credit

An example project that predicts risk of credit card default using a Logistic Regression classifier and a 30,000 sample dataset.

Language: PHP - Size: 1.76 MB - Last synced at: 25 days ago - Pushed at: 4 months ago - Stars: 36 - Forks: 12

RubixML/MNIST

Handwritten digit recognizer using a feed-forward neural network and the MNIST dataset of 70,000 human-labeled handwritten digits.

Language: PHP - Size: 19.9 MB - Last synced at: 25 days ago - Pushed at: 4 months ago - Stars: 39 - Forks: 9

nirmal2i43a5/Automobile_Price_Prediction_System

This project aims to predict vehicle pricing based on various attributes related to design, performance, market conditions, and temporal factors.

Language: Jupyter Notebook - Size: 3.92 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

alexkychen/assignPOP

Population Assignment using Genetic, Non-genetic or Integrated Data in a Machine-learning Framework. Methods in Ecology and Evolution. 2018;9:439–446.

Language: R - Size: 8.81 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 16 - Forks: 4

BCG-X-Official/sklearndf

DataFrame support for scikit-learn.

Language: Python - Size: 19.7 MB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 63 - Forks: 8

karinneaiello/Student-Exam-Scores-Prediction

Python

Language: Jupyter Notebook - Size: 8.6 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

RubixML/Divorce

Use the K Nearest Neighbors algorithm to predict the probability of a divorce with high accuracy.

Language: PHP - Size: 87.9 KB - Last synced at: 25 days ago - Pushed at: over 2 years ago - Stars: 16 - Forks: 11

WalidGharianiEAGLE/spatial-kfold

spatial resampling for more robust cross validation in spatial studies

Language: Python - Size: 4.33 MB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 16 - Forks: 0

adirbella37/RentalPrice-ML-Modeling

A machine learning project to predict apartment rental prices in Tel Aviv using Elastic Net and Decision Trees. It includes data preprocessing, feature engineering, model training, and performance evaluation.

Language: Jupyter Notebook - Size: 185 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

ialwayslikedgrime/airbnb-milan-price-prediction

End-to-end machine learning pipeline for predicting Airbnb listing prices in Milan using geospatial analysis, nested cross-validation, and feature engineering. Integrates multiple data sources with Milan's neighborhood boundaries to identify pricing drivers and market opportunities. XGBoost model R² = 0.587 with comprehensive SHAP interpretability

Language: Jupyter Notebook - Size: 22.6 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

mlr-org/mlr3spatiotempcv

Spatiotemporal resampling methods for mlr3

Language: TeX - Size: 454 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 53 - Forks: 10

peter-ehmann/cross-validation

10-fold cross-validation simulation to identify optimal lambda for ridge regression on n=1000 observations of p=10000 Rademacher random variables.

Language: R - Size: 1.59 MB - Last synced at: 2 months ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

kapsner/mlexperiments

An extensible framework for reproducible machine learning experiments

Language: R - Size: 698 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 5 - Forks: 2

unnatii14/Predicting-Health-Insurance-OLS-Model-Feature-Engineering-and-Selection

OLS regression model that identifies key factors influencing health insurance costs, while ensuring the predictors are meaningful and not highly correlated.

Language: Jupyter Notebook - Size: 246 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

PacktWorkshops/The-Deep-Learning-with-Keras-Workshop

An Interactive Approach to Understanding Deep Learning with Keras

Language: Jupyter Notebook - Size: 229 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 51 - Forks: 91

m-clark/book-of-models

Spells for everyday living, also a book -- Models Demystified -- coming out in 2025.

Language: Python - Size: 152 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 78 - Forks: 18

Yosri-Ben-Halima/cpcv-train-test-data-split-module

A Python module for time series cross-validation using Combinatorial Purged Cross-Validation (CPCV) with embargo to prevent data leakage.

Language: Python - Size: 10.7 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

business-science/modeltime.resample

Resampling Tools for Time Series Forecasting with Modeltime

Language: R - Size: 21.3 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 21 - Forks: 6

EEMnz/Assessment-thermal-models

Python codes for Assessment of thermal mode-based kinetic models via stratified cross-validation and TPE optimization

Language: Python - Size: 460 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

ramgarhiahere0/CTune-MLX

🚀 Simplify ML model training on Apple Silicon with CTune-MLX, overcoming unsloth issues and enabling seamless format conversion.

Language: Shell - Size: 24.4 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

bth-dipt-research/SVAR

This repository contains code developed for the SVAR project (Trafikverket 2023-2025).

Language: Python - Size: 9.27 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

tstran155/Optimization-of-building-energy-consumption

This repo demonstrates how to build a surrogate (proxy) model by multivariate regressing building energy consumption data (univariate and multivariate) and use (1) Bayesian framework, (2) Pyomo package, (3) Genetic algorithm with local search, and (4) Pymoo package to find optimum design parameters and minimum energy consumption.

Language: Jupyter Notebook - Size: 5.31 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 31 - Forks: 8

MirumeYato/mugen

mirror of my main git repo

Language: Python - Size: 76.2 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Emriss0/Tech-Tweet

TechTweet is a microblogging platform for tech enthusiasts, allowing users to share short tech messages and engage in discussions. Join the community, post your thoughts, and connect with others! 🐙💻

Language: HTML - Size: 26.4 KB - Last synced at: 21 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

SadmanSakib93/Stratified-k-fold-cross-validation-Image-classification-keras

This python program demonstrates image classification with stratified k-fold cross validation technique.

Language: Jupyter Notebook - Size: 315 KB - Last synced at: about 2 months ago - Pushed at: almost 3 years ago - Stars: 37 - Forks: 14

timothyckl/sbss

similarity-based stratified k-fold cross validation

Language: Python - Size: 61.5 KB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 0

tlverse/hal9001

🤠 📿 The Highly Adaptive Lasso

Language: R - Size: 13.1 MB - Last synced at: 23 days ago - Pushed at: 12 months ago - Stars: 49 - Forks: 14

aufahuhs/Advanced-Machine-Learning-Personal-Project

This project explores ML techniques across classification and regression. It includes penguin species classification, breast cancer prediction, and baseball performance prediction using regularization. After, I will develop an XGBoost model for hotel cancellation prediction, analyzing key booking factors and optimizing performance. (In Progress)

Size: 1.95 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

v-sundaresan/truenet

DL tool for white matter hyperintensities segmentation

Language: Python - Size: 94.4 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 15 - Forks: 7

cissagatto/CrossValidationMultiLabel

A code to execute and save cross-validation in multilabel classification

Language: R - Size: 29.9 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

DidierRLopes/timeseries-cv

Time-Series Cross-Validation Module

Language: Jupyter Notebook - Size: 454 KB - Last synced at: 3 months ago - Pushed at: almost 4 years ago - Stars: 46 - Forks: 10

msmbuilder/osprey

🦅Hyperparameter optimization for machine learning pipelines 🦅

Language: Python - Size: 974 KB - Last synced at: about 9 hours ago - Pushed at: almost 5 years ago - Stars: 73 - Forks: 26

gershonc/octopus-ml

A collection of handy ML and data visualization and validation tools. Go ahead and train, evaluate and validate your ML models and data with minimal effort.

Language: Jupyter Notebook - Size: 21.4 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 22 - Forks: 5

elyase/do-we-need-crossvalidation

Language: Python - Size: 1.65 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

francescopiocirillo/linear-regression-from-scratch-R

Hands-on regression analysis project in R using a dataset with 30 predictors. Includes manual OLS implementation without lm(), p-value computation, and comparison with built-in functions. Applies stepwise selection (AIC/BIC), Ridge, and Lasso to minimize test error and identify key predictors.

Language: R - Size: 219 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

jishen-harilal/analytical-models-in-excel

A curated Excel workbook showcasing core data analysis techniques - including regression, classification, dimensionality reduction, and cross-validation - implemented entirely within spreadsheets. Ideal for demonstrating manual model logic, clean formatting, and advanced Excel proficiency without code.

Size: 785 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

francescopiocirillo/regression-model-comparison-R

This project explores linear regression model selection in R using Best Subset Selection (BIC), stepwise methods with cross-validation, Ridge, and Lasso. Includes MSE evaluation on test data, multicollinearity analysis (VIF), and correlation insights for variable selection.

Language: R - Size: 930 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

supremkc05/FoodandBeverage_Data_Analytics

This project analyzes a dataset of food and beverage shops, performing data cleaning, exploratory data analysis (EDA), visualizations, and predictive modeling using machine learning.

Language: HTML - Size: 1.9 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

ankita14-p/Diabetes-Prediction-ML

Diabetes prediction model using various ML algorithms with data preprocessing, SMOTE, and model evaluation. Includes ablation study for key techniques.

Language: Jupyter Notebook - Size: 4.87 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

Devdes-BossYod/iris-classifier

Classify the Iris dataset using a Multi-layer Perceptron. This project includes data preprocessing, hyperparameter tuning, and model evaluation. 🌱🌍

Size: 1.95 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

shraddha-r0/pgp-ml-ai-portfolio

A series of six hands-on projects completed during my PGP ML and AI academic training with UT Austin and Great Learning

Language: Jupyter Notebook - Size: 9.73 MB - Last synced at: 28 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

WenjieZ/TSCV

Time Series Cross-Validation -- an extension for scikit-learn

Language: Python - Size: 224 KB - Last synced at: 28 days ago - Pushed at: almost 3 years ago - Stars: 258 - Forks: 42

arjsabbir88/LifeSure-Server

LifeSure Server is the backend API for the LifeSure application—a platform designed to manage and support health, insurance, or related services (customize this description based on your app's purpose). This server provides RESTful endpoints, handles data persistence, authentication

Language: JavaScript - Size: 51.8 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

arjsabbir88/LifeSure

LifeSure is a modern MERN stack web application designed to simplify life insurance management for customers, agents, and admins. Built for a tech-driven insurance startup, the platform offers a seamless, secure, and fully digital

Language: JavaScript - Size: 758 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Prabhakar200216/ml-project-2-customer-churn

A machine learning project that predicts customer churn using decision trees, random forest, and XGBoost.

Size: 148 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

RubixML/Iris

The original lightweight introduction to machine learning in Rubix ML using the famous Iris dataset and the K Nearest Neighbors classifier.

Language: PHP - Size: 417 KB - Last synced at: 25 days ago - Pushed at: 4 months ago - Stars: 34 - Forks: 9

RubixML/Dota2

Build a classifier to predict the outcome of Dota 2 games with the Naive Bayes algorithm and results from 102,944 sample games.

Language: PHP - Size: 4.89 MB - Last synced at: 25 days ago - Pushed at: 4 months ago - Stars: 15 - Forks: 2

Related Keywords
cross-validation 980 machine-learning 428 python 227 random-forest 140 classification 130 data-science 130 logistic-regression 123 linear-regression 112 feature-engineering 97 scikit-learn 97 pandas 89 regression 85 hyperparameter-tuning 80 data-visualization 78 sklearn 70 decision-trees 70 r 68 xgboost 63 numpy 63 exploratory-data-analysis 59 deep-learning 58 matplotlib 55 svm 54 knn 53 eda 50 gridsearchcv 47 seaborn 45 machine-learning-algorithms 42 hyperparameter-optimization 41 python3 40 feature-selection 39 supervised-learning 39 data-analysis 37 jupyter-notebook 36 model-evaluation 36 pca 36 decision-tree 33 statistics 33 ridge-regression 32 regression-models 32 grid-search 31 data-preprocessing 31 knn-classification 31 model-selection 30 regularization 30 neural-networks 30 support-vector-machines 30 pipeline 28 neural-network 28 svm-classifier 27 prediction 27 data-cleaning 27 keras 26 random-forest-classifier 25 naive-bayes-classifier 24 tensorflow 24 lasso-regression 23 confusion-matrix 23 gradient-boosting 22 predictive-modeling 22 naive-bayes 22 feature-extraction 21 pytorch 21 kfold-cross-validation 21 decision-tree-classifier 20 k-fold 19 binary-classification 19 visualization 18 principal-component-analysis 18 natural-language-processing 18 clustering 18 bootstrap 17 statistical-analysis 17 dimensionality-reduction 17 smote 16 image-classification 16 computer-vision 16 time-series 16 classification-algorithm 16 support-vector-machine 16 model-deployment 15 convolutional-neural-networks 15 kaggle 15 k-nearest-neighbors 15 preprocessing 15 nlp 14 model-training-and-evaluation 14 optimization 14 ml 14 evaluation-metrics 14 outlier-detection 14 accuracy 13 supervised-machine-learning 13 ensemble-learning 13 polynomial-regression 13 k-nearest-neighbours 13 imbalanced-data 12 adaboost 12 artificial-intelligence 12 dataset 12