GitHub topics: cross-validation
Zulqarnain-10/Behavioral-Risk-Factor-By-Tobacco-Use
This project analyzes tobacco use risk factors using machine learning to predict 'High Confidence Limit' for various demographics. It includes data preprocessing, EDA, and multiple ML models like Linear Regression, Random Forest, and SVM. Detailed results and code are available in the repository.
Language: Jupyter Notebook - Size: 1.8 MB - Last synced at: about 6 hours ago - Pushed at: about 7 hours ago - Stars: 0 - Forks: 0
supcode1234/Twitter-Sentiment-Analysis
🔍 Analyze Twitter sentiment by classifying tweets into Positive, Negative, Neutral, and Irrelevant categories using machine learning models.
Language: Jupyter Notebook - Size: 1.59 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0
vicky999jeevi/AI-Code-Explainer-Optimizer
🧩 Optimize and explain code effortlessly with our AI-driven multi-agent system, using LangGraph and Gemini for efficient solutions.
Language: Python - Size: 7.01 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0
precioussak/scikit-learn-is-what-you-dont-need
🔍 Discover why scikit-learn may not meet your needs and explore better alternatives for your machine learning projects.
Size: 1.95 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0
JavedFazlulahF/Customer-Churn-Prediction
📊 Predict customer churn in telecom using machine learning to enhance retention strategies and drive better business outcomes.
Language: Jupyter Notebook - Size: 2.86 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0
subash12345679-png/RentalPrice-ML-Modeling
📊 Predict apartment rental prices in Tel Aviv using machine learning models, featuring Elastic Net and Decision Trees for accurate forecasting.
Language: Jupyter Notebook - Size: 1.43 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1 - Forks: 0
Shivaay8055/Bank-Marketing-Data
Los datos se relacionan con campañas de marketing directo (llamadas telefónicas) de una entidad bancaria portuguesa. El objetivo de la clasificación es predecir si el cliente suscribirá un depósito a plazo (variable y).
Size: 2.44 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0
tutkufurkan/Machine-Learning---Advanced-Topics
Advanced ML tutorial: NLP gender classification, PCA visualization, hyperparameter optimization with Grid Search, and collaborative filtering recommendations using real-world datasets
Language: Jupyter Notebook - Size: 10.6 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0
ChokZB/college-regression-analysis
Regression analysis of the ISLR2 College dataset using R. Includes data exploration, model fitting, and diagnostics.
Language: R - Size: 1.84 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0
johnnydelsage/Car-Price-Prediction
🚗 Predict used car prices accurately with machine learning models using various regression techniques and market data analysis for fair valuation.
Language: Jupyter Notebook - Size: 1.35 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0
MUSA-Zhanchao/MUSA5000-OLS
Spatial Statistics Assignment1
Language: HTML - Size: 16.5 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0
stan-dev/loo
loo R package for approximate leave-one-out cross-validation (LOO-CV) and Pareto smoothed importance sampling (PSIS)
Language: R - Size: 131 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 155 - Forks: 37
dhaneshbb/BikeRental
Daily bike rental demand prediction using Ridge Regression on Capital Bikeshare data (2011-2012). Addresses multicollinearity, zero-inflated features, and non-normal distributions. Test R²=0.832, CV R²=0.815±0.032. Includes statistical analysis, VIF removal, and comparison of 10 regression algorithms.
Language: Jupyter Notebook - Size: 20.2 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0
0xPutri/mlflow_project
Machine Learning model optimization project using MLflow — includes Grid Search, Random Search, Bayesian Optimization, and K-Fold Cross Validation for efficient experiment tracking.
Language: Python - Size: 144 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0
dhaneshbb/AutoPricePred
Machine learning regression model predicting 1985 automobile prices. Lasso model achieves 91.7% R² with superior generalization over XGBoost. Handles extreme multicollinearity (VIF 16,676→8.36), data leakage detection, and outlier treatment through PCA and domain-driven feature engineering.
Language: Jupyter Notebook - Size: 33.4 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0
trent-b/iterative-stratification
scikit-learn cross validators for iterative stratification of multilabel data
Language: Python - Size: 45.9 KB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 881 - Forks: 74
Princemurchale/-Predictive-Maintenance-for-Renewable-Energy-Sources
The aim to decrease the maintenance cost of generators used in wind energy production machinery. This is achieved by building various classification models, accounting for class imbalance, and tuning on a user defined cost metric (function of true positives, false positives and false negatives predicted) & productionising the model using pipelines.
Size: 14.6 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0
dbouget/validation_metrics_computation
Validation and metrics computation over 3D medical volumes (backend for Raidionics)
Language: Python - Size: 627 KB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 6 - Forks: 5
donishadsmith/vswift
A R package for evaluating ML classification models.
Language: R - Size: 1.02 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 1 - Forks: 0
p-dirac/python-keras-cnn
This application demonstrates the implementation details of a convolutional network programmed with Python class structures.
Language: Python - Size: 7.58 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0
danielemonteverdi02/Progetto-Titanic-regressione-logistica-
Analisi e predizione della sopravvivenza dei passeggeri del Titanic. Pulizia dei dati, imputazione dei valori mancanti (imputazione singola e con MICE), feature engineering, codifica one-hot e scalatura numerica. Addestramento di una regressione logistica con cross-validation e valutazione su validation set. Predizione finale sul test set.
Language: Jupyter Notebook - Size: 838 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 1 - Forks: 0
PhilBoileau/cvCovEst
An R package for nonparametric covariance matrix estimation in high dimensions
Language: R - Size: 4.88 MB - Last synced at: 7 days ago - Pushed at: almost 2 years ago - Stars: 14 - Forks: 4
statalasso/lassopack
LASSOPACK: Stata module for lasso, square-root lasso, elastic net, ridge, adaptive lasso estimation and cross-validation
Language: Stata - Size: 1.38 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 6 - Forks: 4
Davide011/ML_project_South_African_Heart_Disease
Public Repository: Machine Learning & Data Mining project using the South African Heart Disease dataset. Applied PCA, Regularized Linear Regression, ANN, Logistic Regression, and Decision Trees with cross-validation for regression and classification. Includes feature scaling, EDA, and statistical tests.
Size: 1.32 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0
autoreject/autoreject
Automated rejection and repair of bad trials/sensors in M/EEG
Language: Python - Size: 704 KB - Last synced at: 9 days ago - Pushed at: 2 months ago - Stars: 147 - Forks: 59
sahildev23/predictive-modeling
ML pipeline with automated preprocessing, cross-validation, and performance visualization. Achieved 28% accuracy improvement on 100k+ records
Language: Python - Size: 10.7 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0
AnderCruz/Text-Classification-Generation-Project-CNN
This project explores natural language processing techniques using TensorFlow and Keras, focusing on challenges faced by a news portal. The portal requires solutions for classifying articles based on their titles and descriptions, and for assisting writers in maintaining a consistent style by suggesting subsequent words.
Size: 9.77 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0
nf-core/drugresponseeval
Pipeline for testing drug response prediction models in a statistically and biologically sound way.
Language: Nextflow - Size: 7.68 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 24 - Forks: 6
4Freye/panelsplit
A tool for performing cross-validation with panel data
Language: Python - Size: 8.57 MB - Last synced at: 24 days ago - Pushed at: 26 days ago - Stars: 21 - Forks: 2
distcomp/SvF
Implementation of SvF-technology of balanced identification of mathematical models by experimental data
Language: Python - Size: 38.8 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 1 - Forks: 0
ctesta01/nadir
Super learning with flexible formulas
Language: R - Size: 20.1 MB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 5 - Forks: 1
chonzadaniel/chonzadaniel
Skilled Data Scientist with hands on experience in ML, NLP, Deep Learning & GenAI building clean, modular projects with real-world problems solutions: text classification, class imbalance, RAG systems, and PEFT. Developed impactful AI tools powered by AWS, Streamlit, Slack, & vector DBs.
Size: 29.3 KB - Last synced at: 30 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0
mdabros/SharpLearning
Machine learning for C# .Net
Language: C# - Size: 4.04 MB - Last synced at: 28 days ago - Pushed at: 8 months ago - Stars: 400 - Forks: 86
TheMrityunjayPathak/SpaceshipTitanicClassification
Spaceship Titanic Classification
Language: Jupyter Notebook - Size: 5.27 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0
NadyaKameelaa1/klasifikasi_lemon
Ulangan Praktik Mapel Pilihan (AI) - Machine Learning, StandardScaler, OneHotEncoder, OrdinalEncoder, Logistic Regression, dan lain sebagainya.
Language: Jupyter Notebook - Size: 84 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0
FBruzzesi/timebasedcv
Time based splits for cross validation
Language: Python - Size: 1.7 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 39 - Forks: 1
alopezmoreira1989/Remote-Work-in-Data-Science
This repository contains my data science project developed in Kaggle. It includes Jupyter notebooks, datasets, and analysis focused on remote work in data science. The goal is to explore insights, apply machine learning techniques, and showcase reproducible workflows.
Language: Jupyter Notebook - Size: 9 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0
GDSL-UL/san
Spatial Modelling for Data Scientists
Language: TeX - Size: 523 MB - Last synced at: 18 days ago - Pushed at: over 1 year ago - Stars: 86 - Forks: 19
rvalavi/blockCV
The blockCV package creates spatially or environmentally separated training and testing folds for cross-validation to provide a robust error estimation in spatially structured environments. See
Language: R - Size: 51.4 MB - Last synced at: 28 days ago - Pushed at: 2 months ago - Stars: 114 - Forks: 23
Umer-Farooq-CS/California-Housing-Regression
Linear Regression experiments on the California Housing dataset across five phases, using NumPy and scikit-learn only (no pandas). Includes EDA, polynomial features, SGD with scaling, residuals, 5-fold CV, and an LNCS-style report with figures.
Language: Jupyter Notebook - Size: 6.25 MB - Last synced at: 23 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0
RubixML/CIFAR-10
Use the famous CIFAR-10 dataset to train a multi-layer neural network to recognize images of cats, dogs, and other things.
Language: PHP - Size: 163 MB - Last synced at: 25 days ago - Pushed at: 4 months ago - Stars: 57 - Forks: 9
Not-Buddy/FerricSort
Rust-based CNN for garbage classification into 6 categories with high accuracy on GPU.
Language: Rust - Size: 9.05 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0
shahzadmustafa15/credit-card-fraud-detection
Credit card fraud detection using Random Forest with Stratified K-Fold cross-validation and F1-score evaluation.
Language: Jupyter Notebook - Size: 46.9 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0
StarlangSoftware/Sampling-CPP
Data sampling library
Language: C++ - Size: 223 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 1
0aub/vision-cls-master
Deep learning and machine learning framework for image classification with 60+ pretrained models, 15+ attention mechanisms, cross-validation support, and Docker deployment. Built with PyTorch.
Language: Python - Size: 56.6 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0
EshanSugeesh/Amex-Machine-Learning-Project
Predictive modeling on American Express dataset using advanced ML algorithms, extensive feature engineering (300+ features), and cloud-based data science workflows.
Language: Jupyter Notebook - Size: 8.79 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0
Jonghwan-dev/Awesome-Segmentation-in-Medical
Welcome, Awesome-segmentation-in-medical project. This repo is robust, reproducible, and fair benchmarking framework for breast ultrasound (BUS) image segmentation. Features 10+ models (UNet, UNet++, TransUnet, SwinUnet etc.), 5+ datasets, and strict data separation to prevent test set leakage.
Language: Python - Size: 1.24 MB - Last synced at: 1 day ago - Pushed at: 4 months ago - Stars: 6 - Forks: 0
mljs/libsvm
LIBSVM for the browser and nodejs :fire:
Language: JavaScript - Size: 1.54 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 87 - Forks: 14
shramitamaheshwari/Bus-demand-forecasting
End-to-end Bus Demand Forecasting project: performed data preprocessing, feature engineering, and trained models (LightGBM) to predict seat demand. Includes cross-validation, evaluation (RMSE), and generates submission files for competition use.
Language: Python - Size: 4.88 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0
kaladabrio2020/machine-learning
Times Series, Classification-label/image, Regression
Language: Jupyter Notebook - Size: 281 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0
knickodem/kfa
k-fold cross validation for factor analysis
Language: HTML - Size: 9.39 MB - Last synced at: 28 days ago - Pushed at: 2 months ago - Stars: 7 - Forks: 1
RubixML/HAR
Recognize one of six human activities such as standing, sitting, and walking using a Softmax Classifier trained on mobile phone sensor data.
Language: PHP - Size: 23.9 MB - Last synced at: 25 days ago - Pushed at: 4 months ago - Stars: 24 - Forks: 7
RubixML/Colors
Demonstrating unsupervised clustering using the K Means algorithm and synthetic color data.
Language: PHP - Size: 257 KB - Last synced at: 25 days ago - Pushed at: 4 months ago - Stars: 18 - Forks: 3
RubixML/Credit
An example project that predicts risk of credit card default using a Logistic Regression classifier and a 30,000 sample dataset.
Language: PHP - Size: 1.76 MB - Last synced at: 25 days ago - Pushed at: 4 months ago - Stars: 36 - Forks: 12
RubixML/MNIST
Handwritten digit recognizer using a feed-forward neural network and the MNIST dataset of 70,000 human-labeled handwritten digits.
Language: PHP - Size: 19.9 MB - Last synced at: 25 days ago - Pushed at: 4 months ago - Stars: 39 - Forks: 9
nirmal2i43a5/Automobile_Price_Prediction_System
This project aims to predict vehicle pricing based on various attributes related to design, performance, market conditions, and temporal factors.
Language: Jupyter Notebook - Size: 3.92 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0
alexkychen/assignPOP
Population Assignment using Genetic, Non-genetic or Integrated Data in a Machine-learning Framework. Methods in Ecology and Evolution. 2018;9:439–446.
Language: R - Size: 8.81 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 16 - Forks: 4
BCG-X-Official/sklearndf
DataFrame support for scikit-learn.
Language: Python - Size: 19.7 MB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 63 - Forks: 8
karinneaiello/Student-Exam-Scores-Prediction
Python
Language: Jupyter Notebook - Size: 8.6 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0
RubixML/Divorce
Use the K Nearest Neighbors algorithm to predict the probability of a divorce with high accuracy.
Language: PHP - Size: 87.9 KB - Last synced at: 25 days ago - Pushed at: over 2 years ago - Stars: 16 - Forks: 11
WalidGharianiEAGLE/spatial-kfold
spatial resampling for more robust cross validation in spatial studies
Language: Python - Size: 4.33 MB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 16 - Forks: 0
adirbella37/RentalPrice-ML-Modeling
A machine learning project to predict apartment rental prices in Tel Aviv using Elastic Net and Decision Trees. It includes data preprocessing, feature engineering, model training, and performance evaluation.
Language: Jupyter Notebook - Size: 185 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0
ialwayslikedgrime/airbnb-milan-price-prediction
End-to-end machine learning pipeline for predicting Airbnb listing prices in Milan using geospatial analysis, nested cross-validation, and feature engineering. Integrates multiple data sources with Milan's neighborhood boundaries to identify pricing drivers and market opportunities. XGBoost model R² = 0.587 with comprehensive SHAP interpretability
Language: Jupyter Notebook - Size: 22.6 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0
mlr-org/mlr3spatiotempcv
Spatiotemporal resampling methods for mlr3
Language: TeX - Size: 454 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 53 - Forks: 10
peter-ehmann/cross-validation
10-fold cross-validation simulation to identify optimal lambda for ridge regression on n=1000 observations of p=10000 Rademacher random variables.
Language: R - Size: 1.59 MB - Last synced at: 2 months ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0
kapsner/mlexperiments
An extensible framework for reproducible machine learning experiments
Language: R - Size: 698 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 5 - Forks: 2
unnatii14/Predicting-Health-Insurance-OLS-Model-Feature-Engineering-and-Selection
OLS regression model that identifies key factors influencing health insurance costs, while ensuring the predictors are meaningful and not highly correlated.
Language: Jupyter Notebook - Size: 246 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0
PacktWorkshops/The-Deep-Learning-with-Keras-Workshop
An Interactive Approach to Understanding Deep Learning with Keras
Language: Jupyter Notebook - Size: 229 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 51 - Forks: 91
m-clark/book-of-models
Spells for everyday living, also a book -- Models Demystified -- coming out in 2025.
Language: Python - Size: 152 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 78 - Forks: 18
Yosri-Ben-Halima/cpcv-train-test-data-split-module
A Python module for time series cross-validation using Combinatorial Purged Cross-Validation (CPCV) with embargo to prevent data leakage.
Language: Python - Size: 10.7 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0
business-science/modeltime.resample
Resampling Tools for Time Series Forecasting with Modeltime
Language: R - Size: 21.3 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 21 - Forks: 6
EEMnz/Assessment-thermal-models
Python codes for Assessment of thermal mode-based kinetic models via stratified cross-validation and TPE optimization
Language: Python - Size: 460 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0
ramgarhiahere0/CTune-MLX
🚀 Simplify ML model training on Apple Silicon with CTune-MLX, overcoming unsloth issues and enabling seamless format conversion.
Language: Shell - Size: 24.4 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0
bth-dipt-research/SVAR
This repository contains code developed for the SVAR project (Trafikverket 2023-2025).
Language: Python - Size: 9.27 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0
tstran155/Optimization-of-building-energy-consumption
This repo demonstrates how to build a surrogate (proxy) model by multivariate regressing building energy consumption data (univariate and multivariate) and use (1) Bayesian framework, (2) Pyomo package, (3) Genetic algorithm with local search, and (4) Pymoo package to find optimum design parameters and minimum energy consumption.
Language: Jupyter Notebook - Size: 5.31 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 31 - Forks: 8
MirumeYato/mugen
mirror of my main git repo
Language: Python - Size: 76.2 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0
Emriss0/Tech-Tweet
TechTweet is a microblogging platform for tech enthusiasts, allowing users to share short tech messages and engage in discussions. Join the community, post your thoughts, and connect with others! 🐙💻
Language: HTML - Size: 26.4 KB - Last synced at: 21 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0
SadmanSakib93/Stratified-k-fold-cross-validation-Image-classification-keras
This python program demonstrates image classification with stratified k-fold cross validation technique.
Language: Jupyter Notebook - Size: 315 KB - Last synced at: about 2 months ago - Pushed at: almost 3 years ago - Stars: 37 - Forks: 14
timothyckl/sbss
similarity-based stratified k-fold cross validation
Language: Python - Size: 61.5 KB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 0
tlverse/hal9001
🤠 📿 The Highly Adaptive Lasso
Language: R - Size: 13.1 MB - Last synced at: 23 days ago - Pushed at: 12 months ago - Stars: 49 - Forks: 14
aufahuhs/Advanced-Machine-Learning-Personal-Project
This project explores ML techniques across classification and regression. It includes penguin species classification, breast cancer prediction, and baseball performance prediction using regularization. After, I will develop an XGBoost model for hotel cancellation prediction, analyzing key booking factors and optimizing performance. (In Progress)
Size: 1.95 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0
v-sundaresan/truenet
DL tool for white matter hyperintensities segmentation
Language: Python - Size: 94.4 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 15 - Forks: 7
cissagatto/CrossValidationMultiLabel
A code to execute and save cross-validation in multilabel classification
Language: R - Size: 29.9 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0
DidierRLopes/timeseries-cv
Time-Series Cross-Validation Module
Language: Jupyter Notebook - Size: 454 KB - Last synced at: 3 months ago - Pushed at: almost 4 years ago - Stars: 46 - Forks: 10
msmbuilder/osprey
🦅Hyperparameter optimization for machine learning pipelines 🦅
Language: Python - Size: 974 KB - Last synced at: about 9 hours ago - Pushed at: almost 5 years ago - Stars: 73 - Forks: 26
gershonc/octopus-ml
A collection of handy ML and data visualization and validation tools. Go ahead and train, evaluate and validate your ML models and data with minimal effort.
Language: Jupyter Notebook - Size: 21.4 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 22 - Forks: 5
elyase/do-we-need-crossvalidation
Language: Python - Size: 1.65 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0
francescopiocirillo/linear-regression-from-scratch-R
Hands-on regression analysis project in R using a dataset with 30 predictors. Includes manual OLS implementation without lm(), p-value computation, and comparison with built-in functions. Applies stepwise selection (AIC/BIC), Ridge, and Lasso to minimize test error and identify key predictors.
Language: R - Size: 219 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0
jishen-harilal/analytical-models-in-excel
A curated Excel workbook showcasing core data analysis techniques - including regression, classification, dimensionality reduction, and cross-validation - implemented entirely within spreadsheets. Ideal for demonstrating manual model logic, clean formatting, and advanced Excel proficiency without code.
Size: 785 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0
francescopiocirillo/regression-model-comparison-R
This project explores linear regression model selection in R using Best Subset Selection (BIC), stepwise methods with cross-validation, Ridge, and Lasso. Includes MSE evaluation on test data, multicollinearity analysis (VIF), and correlation insights for variable selection.
Language: R - Size: 930 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0
supremkc05/FoodandBeverage_Data_Analytics
This project analyzes a dataset of food and beverage shops, performing data cleaning, exploratory data analysis (EDA), visualizations, and predictive modeling using machine learning.
Language: HTML - Size: 1.9 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0
ankita14-p/Diabetes-Prediction-ML
Diabetes prediction model using various ML algorithms with data preprocessing, SMOTE, and model evaluation. Includes ablation study for key techniques.
Language: Jupyter Notebook - Size: 4.87 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0
Devdes-BossYod/iris-classifier
Classify the Iris dataset using a Multi-layer Perceptron. This project includes data preprocessing, hyperparameter tuning, and model evaluation. 🌱🌍
Size: 1.95 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0
shraddha-r0/pgp-ml-ai-portfolio
A series of six hands-on projects completed during my PGP ML and AI academic training with UT Austin and Great Learning
Language: Jupyter Notebook - Size: 9.73 MB - Last synced at: 28 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0
WenjieZ/TSCV
Time Series Cross-Validation -- an extension for scikit-learn
Language: Python - Size: 224 KB - Last synced at: 28 days ago - Pushed at: almost 3 years ago - Stars: 258 - Forks: 42
arjsabbir88/LifeSure-Server
LifeSure Server is the backend API for the LifeSure application—a platform designed to manage and support health, insurance, or related services (customize this description based on your app's purpose). This server provides RESTful endpoints, handles data persistence, authentication
Language: JavaScript - Size: 51.8 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0
arjsabbir88/LifeSure
LifeSure is a modern MERN stack web application designed to simplify life insurance management for customers, agents, and admins. Built for a tech-driven insurance startup, the platform offers a seamless, secure, and fully digital
Language: JavaScript - Size: 758 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0
Prabhakar200216/ml-project-2-customer-churn
A machine learning project that predicts customer churn using decision trees, random forest, and XGBoost.
Size: 148 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0
RubixML/Iris
The original lightweight introduction to machine learning in Rubix ML using the famous Iris dataset and the K Nearest Neighbors classifier.
Language: PHP - Size: 417 KB - Last synced at: 25 days ago - Pushed at: 4 months ago - Stars: 34 - Forks: 9
RubixML/Dota2
Build a classifier to predict the outcome of Dota 2 games with the Naive Bayes algorithm and results from 102,944 sample games.
Language: PHP - Size: 4.89 MB - Last synced at: 25 days ago - Pushed at: 4 months ago - Stars: 15 - Forks: 2