An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: multicollinearity

yizenglistat/glmcs

Generalized Linear Models with Confidence Sets

Language: R - Size: 4.79 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

francescopiocirillo/linear-regression-from-scratch-R

Hands-on regression analysis project in R using a dataset with 30 predictors. Includes manual OLS implementation without lm(), p-value computation, and comparison with built-in functions. Applies stepwise selection (AIC/BIC), Ridge, and Lasso to minimize test error and identify key predictors.

Language: R - Size: 219 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

0xnu/multicollinearity_llm 📦

A multicollinearity-based compression program written in C that identifies and removes highly correlated weights in neural networks, thereby reducing redundancy.

Language: C - Size: 223 KB - Last synced at: 1 day ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

subu53/Ames-Housing-Price-Prediction-Ml-Regression

House price prediction using regression machine learning models

Language: Jupyter Notebook - Size: 1.16 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

jibbs1703/King-County-House-Prices

This repository contains a price prediction model for King County, Washington. The model results and analysis would prove invaluable to investors and stakeholders in the King County, WA housing market.

Language: Jupyter Notebook - Size: 859 KB - Last synced at: 16 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Morano-git/WDBC-ML-Classification-Assignment

Fundamentals of Machine Learning Assignment Repository

Language: Jupyter Notebook - Size: 5.09 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

FedeGambe/Master_s_thesis_Data_science

This repository contains the code and materials for the master's thesis, with a focus on statistical and predictive analysis. It includes tools and methods for exploring and modeling data, with advanced statistical techniques such as logistic regression and clustering analysis, and ML and DL methods for prediction and classification.

Language: Jupyter Notebook - Size: 34.3 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

BlasBenito/collinear

R package to manage multicollinearity in modeling data frames.

Language: R - Size: 20.2 MB - Last synced at: about 8 hours ago - Pushed at: about 9 hours ago - Stars: 12 - Forks: 1

favstats/multicol_sim

Analyzing Multicollinearity with a little simulation

Language: HTML - Size: 23.6 MB - Last synced at: 5 months ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

Edanur-Y/Variable-Analysis-of-Banks-Ratio-Data

Testing variables for multicollinearity, multivariate normality and analyzing outliers and missing values. ⭕SPSS 🔵R

Size: 1.16 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

HMesghali/Biogas-Production-Machine-Learning-Analysis

Machine learning approach for feature selection and uncertainty analysis in wastewater treatment plant biogas production. Explores advanced ML techniques for optimizing renewable energy processes.

Language: Jupyter Notebook - Size: 354 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

VivekSagarSingh/Probability-of-Credit-card-Default

Classification problem using multiple ML Algorithms

Language: Jupyter Notebook - Size: 26.5 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 3 - Forks: 1

zohrabgulushev/Data-Science-Project

Language: Jupyter Notebook - Size: 1.08 MB - Last synced at: 2 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

prneidhardt/Supervised-Learning-Classification

INN Hotels Project

Language: Jupyter Notebook - Size: 3.77 MB - Last synced at: 4 months ago - Pushed at: 10 months ago - Stars: 2 - Forks: 2

Akashash01/Akash_Linear-regression

This is a linear machine learning model used to predict the value of a dependent variable based on other (independent) variables.

Language: Python - Size: 30.3 KB - Last synced at: 5 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

manjugovindarajan/INNHOTELS-supervisedlearningclassifications-

Analyze INN Hotels data to find which factors have a high influence on booking cancellations, build a predictive model to predict which booking is going to be canceled in advance, and help in formulating profitable policies for cancellations and refunds.

Language: Jupyter Notebook - Size: 3.01 MB - Last synced at: 12 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

BNTechie/Regression_analysis

House price prediction, comparison of ML algorithms, logistic regression, multicollinearity, multivariate regression analysis, linear model with random effects, robust regression

Language: Jupyter Notebook - Size: 3.84 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

c0ra/RRSARMMI

This repository contains the code and data necessary to reproduce the results presented in the paper "Ridge Regularization for Spatial Auto-regressive Models with Multicollinearity Issues" submitted to Advances in Statistical Analysis (AStA).

Language: HTML - Size: 4.27 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

aayushi-droid/Multicollinearity-in-Regression-Analysis

Multicollinearity in Regression Analysis

Language: Jupyter Notebook - Size: 12.7 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 1

Avinash793/regression-analysis-examples

Detailed implementation of various regression analysis models and concepts on real dataset.

Language: Jupyter Notebook - Size: 3.55 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 1

gepapago/Empirical-Research

Basic methodologies of Empirical Research applied on various case studies (R language)

Size: 46.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

oliviergimenez/p2cr

Fit principal component capture-recapture model to snow petrel data

Size: 2.83 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

UzoigweC/INNHotels

Analyze the data of INN Hotels to find which factors have a high influence on booking cancellations, build a predictive model that can predict which booking is going to be canceled in advance, and help in formulating profitable policies for cancellations and refunds

Language: HTML - Size: 8.07 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Vaneeza-7/Bayesian-Statistics-Regression-Models-in-pymc3 📦

Bayesian Statistics: Linear Regression and multi-linear models and related concepts (multicollinearity, correlation coefficient etc) on iris dataset in pymc3

Language: Jupyter Notebook - Size: 188 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

eeshwarib23/Airbnb-regression-analysis

ML | Regression Analysis | Random Forest | XGBoost | Gradient Boost | EDA | Feature Engineering | Feature Selection

Size: 4.27 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

jmersinger/EPI-vs-GDP-Data-Analysis-Visualization-Paper

Research, Analysis, and Final Paper for my Intro to Econometrics class taken in Fall 2023

Language: R - Size: 11.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

OmBaval/Airline-Customer-Satisfaction

This project uses a dataset of 103,904 entries with 25 features and an XGBoost classifier. The workflow involves data fetching, feature selection, preprocessing, correlation analysis, best feature selection, data rescaling, train-test split, and target balancing. The model predicts whether a customer will be satisfied with a flight.

Language: Jupyter Notebook - Size: 3.84 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

venkatesh-eranti/Housing_case-study

A real estate company has a dataset containing the prices of properties in the Delhi region. It wishes to use the data to optimise the sale prices of the properties based on important factors such as area, bedrooms, parking, etc.

Language: Jupyter Notebook - Size: 1020 KB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

rojaff/dredge_mc

Assess multicollinearity between predictors when running the dredge function (MuMIn - R)

Language: R - Size: 16.6 KB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 2 - Forks: 2

ashishyadav24092000/Multicollinearity-in-Regression

Shows how to identify multicollinearity in a regression problem using OLS (ordinary least squares) and a correlation chart, and finally how to eliminate it.

Language: Jupyter Notebook - Size: 8.79 KB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

bhattbhavesh91/multicollinearity_detection

Small example on how you can detect multicollinearity

Language: Jupyter Notebook - Size: 49.8 KB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 13 - Forks: 10

NicholasDominic/Stochastic-VIF-ID-Rice-SNPs

By leveraging ensemble learning, this program can be used to analyze the Linkage Disequilibrium between SNPs in each Indonesian rice chromosome. Developed using Python 3.9.12.

Language: Jupyter Notebook - Size: 2.71 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

esmailza/Housing_Price

A simple neural network model to predict housing prices from house features such as bedrooms, area, etc. We are using the Kaggle Housing Prices Dataset. The data has multicollinearity prob

Language: Jupyter Notebook - Size: 933 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

SarangGami/TED-Talks-Views-Prediction-Supervised-learning

This project aims to build a regression model that predicts the number of views for TED Talks videos on the TED website.

Language: Jupyter Notebook - Size: 21 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

SarangGami/Bank-Marketing-Effectiveness-Prediction-supervised-learning

The main objective is to build a predictive model that predicts whether a new client will subscribe to a term deposit or not, based on data from previous marketing campaigns.

Language: Jupyter Notebook - Size: 7.1 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

olesyamba/ICvsML

Usual linear regression or XGBoost? Combo! Or how I was investigating the impact of intellectual capital on NASDAQ-100 capitalization during 2 years.

Size: 31.3 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

alef-s/INN_hotels

Analyze the data of INN Hotels to find which factors have a high influence on booking cancellations, build a predictive model that can predict which booking is going to be canceled in advance, and help in formulating profitable policies for cancellations and refunds.

Language: Jupyter Notebook - Size: 3.38 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

mjkim1001/AS21

Applied Statistics I, 2021, UNC at Chapel Hill: linear regression

Size: 729 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Niranjan-stat/Regression-Analysis-on-Drinking-Data

Linear regression, VIF, autocorrelation.

Size: 601 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

WuCandice/-Statistical-Analysis-of-Economic-Variables-and-the-Mortgage-Rate-in-the-United-States

This project uses linear regression to examine the relationship between various economic variables and the mortgage rate in the United States.

Language: R - Size: 3.98 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

mamomen1996/Python_CS_01

Traditional Regression problem project in Python

Language: HTML - Size: 1.27 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

babusarath05/multi_corr

multi_corr helps to identify multicollinearity in a simple and straightforward manner.

Language: Python - Size: 9.77 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Yemi-Ak/Supervised-Learning-Booking-Cancellations-INNHotels

The aim is to analyze the data of INN Hotels to find which factors have a high influence on booking cancellations, build predictive models (logistic regression, decision trees) that can predict which booking is going to be canceled in advance, and help in formulating profitable policies for cancellations and refunds.

Language: Jupyter Notebook - Size: 3.58 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

sachelsout/effect-of-collinear-features-on-linear-models

This repository shows how linear models behave when the features of the dataset are collinear. Support Vector Machine (SVM) and Logistic Regression (LR) algorithms are used as the linear models. Weights and accuracy scores are recorded in different scenarios.

Language: Jupyter Notebook - Size: 54.7 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

bhattbhavesh91/pca-multicollinearity

A simple example to show how Principal Component Analysis can be used to Address Multicollinearity

Language: Jupyter Notebook - Size: 339 KB - Last synced at: 5 months ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 4

Lori10/BostonHousing-Using-LinearRegreesion-Ridge-Lasso-ElasticNet

In this repo I have implemented a machine learning project which predicts house prices in Boston. I have covered these topics: exploratory data analysis, feature engineering (including feature scaling and transformation into normally distributed data), multicollinearity, and feature selection. I have trained models on the dataset using Linear Regression, Ridge, Lasso, and Elastic Net Regression.

Language: Jupyter Notebook - Size: 294 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

sduxbury/vif-ergm

R function to detect multicollinearity in ERGM

Language: R - Size: 14.5 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0

uweremer/regression_diagnostics

Script for the video series on regression diagnostics in R

Language: R - Size: 312 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

sachin17git/Malware-detection-ML

Android malware detection using machine learning.

Language: Jupyter Notebook - Size: 8.81 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

being-aerys/Data_Processing_and_Feature_Engineering_in_Machine_Learning

This is an attempt to summarize feature engineering methods that I have learned over the course of my graduate school.

Language: Jupyter Notebook - Size: 550 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 10 - Forks: 0

bhattbhavesh91/lasso-regression-python

This repository shows how Lasso Regression selects correlated predictors

Language: Jupyter Notebook - Size: 50.8 KB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 7

petermchale/predict_customer_response

Machine-learning models to predict whether customers respond to a marketing campaign

Language: Jupyter Notebook - Size: 1.28 MB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 5 - Forks: 3

dayanacavalcante/Multicollinearity_VIF

Quantify multicollinearity by VIF

Language: Python - Size: 6.84 KB - Last synced at: 5 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

chinmayeeguru/Bike-Sharing-Linear-Regression-Model

To model the demand for shared bikes with the available independent variables

Language: Jupyter Notebook - Size: 1.08 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

devosmitachatterjee2018/Linear_Statistical_Models

The project involves the multivariate regression analysis of a dataset.

Language: R - Size: 1.72 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

ahmed-arafaath/Ad_clicks_predictor

An ML model to predict Ad viewers based on various factors.

Language: Jupyter Notebook - Size: 112 KB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

swarnava-96/Linear-Ridge-and-Lasso-Regression

Language: Jupyter Notebook - Size: 43.9 KB - Last synced at: 2 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

daniel-furman/RecFeatureSelect

Feature selection functions (1) using the multi-collinearity matrix and recursively proceeding to a Spearman threshold and (2) using Forward Stepwise Selection running on an ensemble sklearner (with options for HPO).

Language: Python - Size: 1.84 MB - Last synced at: about 2 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

bhattbhavesh91/regression-excercise-ols-ridge

A Regression Exercise covering OLS & Ridge Regression

Language: Jupyter Notebook - Size: 753 KB - Last synced at: 6 months ago - Pushed at: almost 6 years ago - Stars: 2 - Forks: 1

AnFrBo/suicidal_actions

Analysis of Influencing Factors Leading to Suicidal Actions via Linear Regression and Regularization Methods

Language: R - Size: 2.22 MB - Last synced at: 4 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

govardhan26/Linear-regression

Linear regression on numerical attributes

Language: HTML - Size: 1.7 MB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 3 - Forks: 3

amkatrutsa/QPFeatureSelection

Quadratic programming feature selection

Language: Matlab - Size: 926 KB - Last synced at: over 2 years ago - Pushed at: over 8 years ago - Stars: 5 - Forks: 3

ankitbit/Linear_and_Generalized_Linear_Models

This repository has scripts that are part of the programming assignments of the course Linear and Generalized Linear Models taught at FME, UPC Barcelonatech.

Language: R - Size: 159 KB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 2

ireneliu521/BOPS-Strategy-Analysis_Project_R

Evaluate the Buy Online Pick-up in Store (BOPS) strategy with a real-world dataset

Size: 10.1 MB - Last synced at: 3 months ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

Related Keywords
multicollinearity (64), linear-regression (17), machine-learning (17), logistic-regression (10), python (9), feature-selection (8), feature-engineering (7), heteroscedasticity (7), regression-analysis (6), regression-models (6), r (6), data-science (6), exploratory-data-analysis (6), ridge-regression (5), pruning (5), machine-learning-algorithms (5), auc-roc-curve (5), autocorrelation (4), lasso-regression (4), statistics (4), data-cleaning (4), pandas (4), pca (4), python3 (3), outlier-detection (3), decision-trees (3), data-preprocessing (3), numpy (3), ols-regression (3), decision-tree (3), model-selection (3), regression (3), variance-inflation-factor (3), data-analysis (3), data-visualization (3), scikit-learn (3), boosting-algorithms (2), statsmodels (2), collinearity-diagnostics (2), diagnostics (2), residuals (2), vif (2), linear-models (2), simple-linear-regression (2), correlation (2), principal-component-analysis (2), gridsearchcv (2), heteroskedasticity (2), data-wrangling (2), variable-selection (2), cross-validation (2), predictive-modeling (2), feature-extraction (2), data (2), multiple-linear-regression (2), ols-regression-model (2), supervised-learning (2), prediction-model (2), regression-diagnostics (2), xgboost-classifier (2), multivariate-regression (2), recursive-feature-elimination (2), evaluation-metrics (2), anova-test (2), probit (1), elasticnetregression (1), machinelearning (1), social-networks (1), catboost-classifier (1), ergm (1), bayesian-optimization (1), bayesian-inference (1), exponential-random-graph-models (1), assumptions-in-regression (1), social-network-analysis (1), network-analysis (1), address-multicollinearity (1), box-cox-transformation (1), threshold (1), xgboost-model (1), rstudio (1), panel-data (1), p-value (1), nasdaq100 (1), intellectual-capital (1), fixed-effects-model (1), smote-sampling (1), pre-processing (1), imbalanced-classes (1), bank-marketing-analysis (1), views-prediction (1), ted-talks (1), ml-algortihms (1), svm (1), noise (1), gaussian (1), collinearity (1), datapreprocessing (1), correlation-coefficient (1), sklearn (1)