An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: hypothesis-testing

ANSQ1/Statistical-Analysis-of-Advertising-Dataset

This repository contains a thorough analysis of an advertising dataset using statistical methods. 📊 Explore Reports 1 and 2 for insights on effective advertising strategies and visualizations. 💻

Language: Jupyter Notebook - Size: 39.4 MB - Last synced at: about 21 hours ago - Pushed at: about 22 hours ago - Stars: 0 - Forks: 0

minhtungonep/android-traffic-analysis

Android malware detection project analyzing network traffic patterns in a telecommunications context. Uses statistical hypothesis testing and data visualization to evaluate traffic features like DNS query times, TCP packets, and volume bytes for distinguishing between benign and malicious Android applications.

Language: Python - Size: 2.86 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

statsmodels/statsmodels

Statsmodels: statistical modeling and econometrics in Python

Language: Python - Size: 52.4 MB - Last synced at: 2 days ago - Pushed at: 15 days ago - Stars: 10,706 - Forks: 3,268

NathanP23/Principles-and-Applications-in-Stat-Analysis-52221

Content from the course "Principles and Applications in Statistical Analysis (52221)" at The Hebrew University of Jerusalem, in the Department of Statistics and Data Science.

Language: Jupyter Notebook - Size: 992 KB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 1 - Forks: 0

NathanP23/Regression-and-Statistical-Models-52571

Content of the course "Regression and Statistical Models (52571)" at The Hebrew University of Jerusalem, in the Department of Statistics and Data Science.

Language: Jupyter Notebook - Size: 23.1 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 1 - Forks: 0

NathanP23/Data-Analysis-with-R-52414

Labs and Final assignments from the course "Data Analysis with R (52414)" at The Hebrew University of Jerusalem, in the Department of Statistics and Data Science.

Size: 167 KB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 1 - Forks: 0

unionai-oss/pandera

A light-weight, flexible, and expressive statistical data testing library

Language: Python - Size: 4.23 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3,835 - Forks: 339

LouaiMuhammed/telecom-churn-prediction

End-to-end telecom customer churn analysis and prediction project, involving data cleaning, exploratory data analysis, and machine learning model development. This project was developed as part of the DEPI Internship.

Language: Jupyter Notebook - Size: 9.7 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

JetBrains-Research/bioinf-commons

Bioinformatics library in Kotlin

Language: Kotlin - Size: 2.44 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 32 - Forks: 3

JuliaStats/HypothesisTests.jl

Hypothesis tests for Julia

Language: Julia - Size: 2.15 MB - Last synced at: about 16 hours ago - Pushed at: about 2 months ago - Stars: 310 - Forks: 88

erdogant/distfit

distfit is a python library for probability density fitting.

Language: Jupyter Notebook - Size: 15.5 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 388 - Forks: 29

majianthu/symmetry

Code for the paper "Testing symmetry with copula entropy based two-sample test"

Language: R - Size: 10.7 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

bindugayatri02/Insights_in_Boston_real_estate-Statistical_analysis

This project involves predicting Boston housing prices using statistical analysis and machine learning in Python. As a Data Scientist for a Boston-based housing agency, I analyzed historical housing data from the U.S. Census Service. I generated key statistics and visualizations to uncover trends and insights in the housing market.

Language: Jupyter Notebook - Size: 356 KB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

ssinix/crag-to-competition

Climbing Performance Analysis: Does outdoor sport climbing success (8a.nu, The Crag) translate to IFSC competition results? Scrapes and analyzes data to explore correlations between outdoor grades and competitive boulder/lead rankings.

Language: Jupyter Notebook - Size: 1.8 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 1 - Forks: 0

danielvartan/bootstrap

🥾 Illustration of the Bootstrap Method

Language: R - Size: 3.12 MB - Last synced at: 6 days ago - Pushed at: 17 days ago - Stars: 1 - Forks: 0

ArnauGarciaGRBIO/GofCens

Repository with the Gofcens R package. The package contains Goodness-of-Fit Methods for Complete and Right-Censored Data.

Language: R - Size: 2.67 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 1

PragyanTiwari/Hypothesis-Testing-Medical-Insurance-Data

Testing Hypothesis on Medical Insurance dataset to analyze & gain insights using Normality Test, T-Test, Chi-Square Test etc. Along with Power Analysis to check the statistical significance.

Language: Jupyter Notebook - Size: 1.79 MB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 3 - Forks: 0

fannie1208/FactTest

[ICML2025] "FactTest: Factuality Testing in Large Language Models with Finite-Sample and Distribution-Free Guarantees"

Language: Python - Size: 43.9 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 2 - Forks: 0

pmichaillat/p-hacking

Code and data for the paper "Critical Values Robust to P-hacking"

Language: MATLAB - Size: 149 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 6 - Forks: 1

fgcz/prolfqua

Differential Expression Analysis tool box R lang package for omics data

Language: R - Size: 771 MB - Last synced at: 7 days ago - Pushed at: 11 days ago - Stars: 45 - Forks: 9

IndrajeetPatil/ggstatsplot

Enhancing {ggplot2} plots with statistical analysis 📊📣

Language: R - Size: 2.21 GB - Last synced at: 6 days ago - Pushed at: 19 days ago - Stars: 2,098 - Forks: 200

JuliaDynamics/TimeseriesSurrogates.jl

A Julia package for generating timeseries surrogates

Language: Julia - Size: 259 MB - Last synced at: 2 days ago - Pushed at: 18 days ago - Stars: 54 - Forks: 9

djeada/Statistics-Notes

This repository contains notes, explanations, and code snippets related to essential statistics concepts and techniques. The materials cover a range of topics, from basic probability and descriptive statistics to more advanced concepts like hypothesis testing and confidence intervals.

Language: Jupyter Notebook - Size: 4.01 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 5 - Forks: 2

R-Shing/waze-user-retention

Exploratory Data Analysis and Modeling of Waze User Churn and Retention Rates

Language: Jupyter Notebook - Size: 5.18 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

SuryaVamsi-P/YouTube-User-Behavior-Analysis

This project uncovers audience behavior patterns by analyzing YouTube video engagement metrics using Python. From 360° EDA to interactive dashboards, it breaks down how views, likes, dislikes, and comments reveal user sentiment and content performance, built with NumPy, Pandas, Seaborn, Dash, and hypothesis testing to produce real time analytics.

Language: Python - Size: 3.18 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

Vinit2244/Statistical-Analysis-of-Advertising-Dataset

This project analyzes advertising dataset using statistical methods such as descriptive statistics, visualisations, and inferential analysis. The project first visualises the dataset then applies hypothesis testing techniques to identify the most effective advertising platforms and strategies.

Language: Jupyter Notebook - Size: 39.4 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

Jabulente/T-Test-Python-Implementation

A Python-based implementation of one-sample, two-sample, and paired t-tests for statistical analysis and hypothesis testing.

Language: Jupyter Notebook - Size: 437 KB - Last synced at: 7 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

nane-khachatryan21/stats-project

Statistical analysis of Armenia's gender wage gap. Includes R scripts for data cleaning & hypothesis testing, LaTeX report with final results, interactive visualizations and maps.

Language: HTML - Size: 49.6 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

Alcampopiano/hypothesize

Robust statistics in Python

Language: Python - Size: 5.2 MB - Last synced at: 6 days ago - Pushed at: almost 2 years ago - Stars: 67 - Forks: 4

ganava4/smsets

A collection of simple parameter estimation and significance tests for the comparison of multivariate means and variation, covered in Chapters 4 and 5 of the book Multivariate Statistical Methods. A Primer. 5th edition.

Language: R - Size: 156 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 1 - Forks: 0

vusaverse/vvdoctor

R Shiny app / package to automate statistical testing

Language: R - Size: 1.11 MB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 1

jchristopherson/fstats

A modern Fortran statistical library.

Language: Fortran - Size: 5.23 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 15 - Forks: 1

joshbker/android-traffic-analysis

Android malware detection project analyzing network traffic patterns in a telecommunications context. Uses statistical hypothesis testing and data visualization to evaluate traffic features like DNS query times, TCP packets, and volume bytes for distinguishing between benign and malicious Android applications.

Language: Python - Size: 2.87 MB - Last synced at: 18 days ago - Pushed at: 19 days ago - Stars: 1 - Forks: 0

moderndive/ModernDive_book

Statistical Inference via Data Science: A ModernDive into R and the Tidyverse

Language: HTML - Size: 1.35 GB - Last synced at: 18 days ago - Pushed at: 19 days ago - Stars: 776 - Forks: 506

0todd0000/rft1d

One-Dimensional Random Field Theory in Python

Language: Python - Size: 14.2 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 3 - Forks: 3

shenxiangzhuang/mppt

A Modern Python Package Template

Language: Python - Size: 2.06 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 26 - Forks: 1

CSwebD/Weight-Change-Analysis

Analysis of weight-change effects in mice and rats using simulation, visualization, hypothesis testing and distribution fitting in R.

Language: R - Size: 0 Bytes - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

John-sam1983/John_Ndaa_Samson_Data_Science_Portfolio

This repository is a compilation of all the data science and in particular Machine Learning projects I have successfully carried out.

Language: Jupyter Notebook - Size: 8.24 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

Marco-Congedo/PermutationTests.jl

Univariate and multiple comparisons statistical hypothesis testing by data permutation

Language: Julia - Size: 6.78 MB - Last synced at: 16 days ago - Pushed at: 11 months ago - Stars: 7 - Forks: 0

NagrajMG/No-More-Circles-Escaping-the-VSM-loop

Search Engine based on Cranfield dataset

Language: Jupyter Notebook - Size: 12.6 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 1 - Forks: 0

Dharmeshgadhiya161/Netflix-Movies-and-TV-Shows-Clustering-Unsupervised-ML

Netflix Movies and TV Shows Clustering Unsupervised ML

Language: Jupyter Notebook - Size: 10.4 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

SamarthKolge-Analyst/Yulu_MicroMobility_Hypothesis_Testing

Analyzing factors affecting shared electric cycle demand for Yulu using hypothesis testing methods like t-test, ANOVA and Chi-square.

Language: Jupyter Notebook - Size: 15.2 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

SamarthKolge-Analyst/Walmart-Purchase-Analysis-Confidence-Interval-CLT

Analyze Walmart Black Friday purchase data to understand customer spending patterns by gender, marital status, and age using Confidence Intervals and the Central Limit Theorem (CLT). Includes EDA, visualization, hypothesis testing, and actionable business insights.

Language: Jupyter Notebook - Size: 22.8 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

NimanthaSupun/statistical_analysis_techniques

application of key statistical techniques—hypothesis testing, linear regression, and time series analysis—on real-world datasets

Language: Python - Size: 1.64 MB - Last synced at: 2 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

vadimtyuryaev/ANOVA

Welcome to an R repository featuring from-scratch implementations of one-way and two-way ANOVA, along with Tukey's HSD test.

Language: R - Size: 570 KB - Last synced at: 22 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

claire-1125/Foodie_Express_Analysis

사이드 프로젝트로 진행한 음식 배달 앱 로그 데이터 분석 프로젝트입니다.

Language: Jupyter Notebook - Size: 1.21 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

neurodata/hyppo

Python package for multivariate hypothesis testing

Language: Python - Size: 33.5 MB - Last synced at: 16 days ago - Pushed at: 23 days ago - Stars: 225 - Forks: 93

tirthajyoti/Stats-Maths-with-Python

General statistics, mathematical programming, and numerical/scientific computing scripts and notebooks in Python

Language: Jupyter Notebook - Size: 70.1 MB - Last synced at: 14 days ago - Pushed at: over 2 years ago - Stars: 926 - Forks: 381

InduwaraRathnayake/CS3121-OnlinePurchaseIntention

This project explores the factors influencing consumers' intentions to make online purchases during crises in Sri Lanka. Using survey data (836 responses), we perform data preprocessing, exploratory analysis, hypothesis testing, and rule mining to derive actionable insights for Wolt's marketing strategies.

Language: Jupyter Notebook - Size: 11.8 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 1 - Forks: 0

hoangsonww/Amazon-Reviews-Analysis

🧐 This project analyzes Amazon Fine Food Reviews to investigate whether negative reviews are more emotionally intense and lexically repetitive than positive ones. Using R, we apply sentiment analysis and lexical diversity metrics to uncover patterns in consumer review language.

Language: R - Size: 209 KB - Last synced at: 25 days ago - Pushed at: 27 days ago - Stars: 17 - Forks: 12

julian0112/EDA-and-Hypothesis-Testing-of-Feline-Dataset

EDA and Hypothesis Testing of Feline Behavior and Personality Survey Data

Language: Python - Size: 3.48 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

pb319/Segment-Stream

A repository dedicated to the analysis of customer segmentation. This project aims to implement and evaluate various segmentation methodologies, drawing inspiration and techniques from current research in the field.

Language: Jupyter Notebook - Size: 19.5 MB - Last synced at: 7 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

spatstat/spatstat.explore

Sub-package of spatstat providing functions for exploratory and nonparametric data analysis

Language: R - Size: 1.44 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 1 - Forks: 1

akshayyewle/Statistical-Analysis

Language: Jupyter Notebook - Size: 24.8 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 0 - Forks: 0

yeager20001118/AdapTesting

Tool box for Data Adaptive Hypothesis Testing

Language: Python - Size: 28 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 3 - Forks: 0

PamelaPairo/maestria_DM

Materiales de las clases prácticas de AID y Aprendizaje Automático

Language: HTML - Size: 26.6 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 1 - Forks: 1

ashrithssreddy/statistics-toolkit

My personal toolkit for clean, end-to-end Applied Statistics workflows from formal training — built for how I approach A/B testing, hypothesis testing and distributions day to day.

Language: HTML - Size: 25.4 MB - Last synced at: 1 day ago - Pushed at: 28 days ago - Stars: 0 - Forks: 0

xrobin/pROC

Display and analyze ROC curves in R and S+

Language: R - Size: 2.26 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 127 - Forks: 31

muthuganeshece/Business-Case-Study

This repository contains a collection of my work on business case studies of various industries, including e-commerce, logistics, retail, media etc.,

Language: Jupyter Notebook - Size: 123 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

snap-stanford/POPPER

Automated Hypothesis Testing with Agentic Sequential Falsifications

Language: Python - Size: 26.8 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 182 - Forks: 17

AishwaryaGade02/Loan-Funnel-Optimization-Analysis

Tracks how loan applications move through each stage, helps spot where people drop off, and gives clear insights to improve approval strategies and overall performance.

Language: Python - Size: 1.43 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

namans-git/statistical_inference

report on stat inference

Language: Jupyter Notebook - Size: 1.1 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

mirkobunse/critdd

Critical difference diagrams with Python and Tikz

Language: Python - Size: 2.85 MB - Last synced at: 29 days ago - Pushed at: 8 months ago - Stars: 33 - Forks: 3

Wilfred/propcheck

Quickcheck/hypothesis style testing for elisp

Language: Emacs Lisp - Size: 106 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 59 - Forks: 2

steviecurran/two-sample

The confidence interval for a two sample test.

Language: Python - Size: 56.6 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

mohd-faizy/Stats-with-Data

This repository is a resource for learning and applying statistics in data science. It contains code examples and explanations for many common statistical concepts, from descriptive statistics through regression and time series analysis.

Language: Jupyter Notebook - Size: 7.49 MB - Last synced at: 2 days ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

yug95/MachineLearning

Machine learning for beginner(Data Science enthusiast)

Language: Jupyter Notebook - Size: 187 MB - Last synced at: 14 days ago - Pushed at: 2 months ago - Stars: 115 - Forks: 131

lesleyzhao/Airbnb_Price_Prediction

Use machine learning to forecast prices based on property features and amenities.

Language: Jupyter Notebook - Size: 1.86 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

AishwaryaGade02/California-Wildfire-Data-Analysis

Data analysis on California wildfire data from 2014 to 2025. This data analysis includes feature engineering, exploratory data analysis, correlation analysis, hypothesis testing and documenting every finding

Language: Jupyter Notebook - Size: 569 KB - Last synced at: 21 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

TG-SHIVAM/hypothesis-testing-with-mens-and-womens-soccer-matches

a data-driven exploration of international men's and women's football (soccer) match results using Python

Language: Jupyter Notebook - Size: 1.44 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Kaleidophon/deep-significance

Enabling easy statistical significance testing for deep neural networks.

Language: Python - Size: 5.58 MB - Last synced at: 14 days ago - Pushed at: 11 months ago - Stars: 335 - Forks: 19

Yang-Weichao/L2test

Code for "Score function-based tests for ultrahigh-dimensional linear models"

Language: R - Size: 3.91 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

FabianaCampanari/PracticalStats-PUCSP-2024

Statistical Measures in Python - Age and Salary Analysis

Language: Jupyter Notebook - Size: 61.6 MB - Last synced at: 26 days ago - Pushed at: 6 months ago - Stars: 3 - Forks: 0

alextimans/max-rank

Code repository for the paper "Max-Rank: Efficient Multiple Testing for Conformal Prediction" @ AISTATS 2025

Language: Python - Size: 17.6 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 1

AkshyaKC/Hypothesis-Testing

This project analyzes the relationship between the "Man of the Match" (MoM) award and the match outcome in IPL cricket matches (from season 2008 to 2017). The analysis employs statistical methods to test if MoM awards are significantly associated with the winning team status.

Language: Jupyter Notebook - Size: 255 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

tejaswirupa/United-Airlines-Flight-Gain-Analysis

Analyzed 30K+ United Airlines flights to evaluate time gained or lost during flight. Used hypothesis testing to compare on-time vs. late departures, identifying routes with the highest average time gain.

Size: 236 KB - Last synced at: 6 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

majianthu/pycopent

Estimating Copula Entropy (Mutual Information), Transfer Entropy (Conditional Mutual Information), and the statistics for multivariate normality test and two-sample test, and change point detection in Python

Language: Python - Size: 1.42 MB - Last synced at: 20 days ago - Pushed at: 8 months ago - Stars: 163 - Forks: 33

Wb-az/timeseries-sensor-anomaly-detection

Unsupervised anomaly detection in vibration signal using PyCaret vs BiLSTM

Language: Jupyter Notebook - Size: 42.9 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 10 - Forks: 2

ariqlubis/hypolens

a Python module that serves as a wrapper to simplify hypothesis testing using the Pingouin statistical library. With Hypolens, you only need to specify which column to group by and which column to test — no more writing repetitive boilerplate code.

Language: Python - Size: 27.3 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Sree-git4/Personal-Project-Sree

Language: Jupyter Notebook - Size: 1.21 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

lckh24/Multidimensional-Analysis-Course

Project using PCA to analysis

Language: HTML - Size: 5.18 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

TimKong21/Medical-Appointment-No-Show-Prediction

A machine learning solution predicting patient no-shows in healthcare appointments. This project integrates EDA, data processing, feature engineering, and XGBoost modeling, with a workflow spanning from Snowflake data retrieval to AWS deployment (S3, SageMaker, Lambda, API Gateway), aiming to enhance appointment management in medical ERP systems.

Language: Jupyter Notebook - Size: 35.4 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

Pegah-Ardehkhani/Statistics-and-Probability-in-Python

A comprehensive exploration of Statistics and Probability Theory concepts, with practical implementations in Python

Language: Jupyter Notebook - Size: 6.66 MB - Last synced at: 13 days ago - Pushed at: 3 months ago - Stars: 140 - Forks: 38

mlatinov/-Wild-Blueberry-Yield-Prediction-

This dataset, sourced from Kaggle, provides insights into agricultural productivity by analyzing conditions that impact blueberry growth. Our goal is to develop a predictive model that helps farmers and researchers make data-driven decisions for improving crop yield.

Language: R - Size: 737 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

ankit-verma2000/Business-Case-Yulu

This project analyzed factors affecting the demand for shared electric cycles in the Indian market. Using EDA and hypothesis testing, I found no significant effect of "working day" on rental count but confirmed that seasonality influences demand. The insights provide valuable guidance for optimizing shared cycle availability.

Language: Jupyter Notebook - Size: 1.54 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

caesaredia/chicago-taxi-data-insights

Exploratory data analysis and hypothesis testing on Chicago taxi trip data to uncover patterns in demand and the effects of rainy weather on travel time.

Language: Jupyter Notebook - Size: 341 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

bhaskatripathi/HypothesisHub

An AI Tool for Automated Research Question and Hypothesis Generation from a given Scientific Literature

Language: Jupyter Notebook - Size: 599 KB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 35 - Forks: 2

cescalara/icecube_tools

Python tools for working with the IceCube public data.

Language: Python - Size: 34.8 MB - Last synced at: 24 days ago - Pushed at: 7 months ago - Stars: 14 - Forks: 8

0DmytroPoliak0/pix2pix3d-my-version-tests-only

Quality Assurance Testing of Image Generation Models using pix2pix3D

Language: Python - Size: 274 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

shervinea/stanford-cme-106-probability-and-statistics

VIP cheatsheets for Stanford's CME 106 Probability and Statistics for Engineers

Size: 1.27 MB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 729 - Forks: 222

djsutherland/opt-mmd

Learning kernels to maximize the power of MMD tests

Language: Python - Size: 97.7 KB - Last synced at: about 2 months ago - Pushed at: over 7 years ago - Stars: 210 - Forks: 73

RishiMdvrm/Comprehensive-analysis-of-EPL-match-data

This project analyzes EPL matches using machine learning and statistical methods, focusing on factors like team formations and home advantage to predict match outcomes and provide actionable insights.

Language: Jupyter Notebook - Size: 6.86 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 0 - Forks: 1

edcr09/Mobile_carrier_tariff_analysis

DA-4 Proyecto de análisis de tarifas de operador movil

Language: Jupyter Notebook - Size: 145 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

albertfrancajosuacosta/An_Error_Approximation_Approach_Based_on_Active_Learning_for_Concept_Drift_Detection

The datasets and software generated or analyzed in the paper 'An Error Approximate Approach Based on Active Learning for Concept Drift Detection' are available in this repository.

Size: 156 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Abhi1697/BankLoanDefaultRiskAnalysis

This repo provides insights to the key financial factors that influence the Loan Default likelihood. An extensive data cleaning and transformation, followed by exploratory data analysis and statistical hypothesis testing was done

Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

ligurio/lark-grammars

Grammars suitable for lark parser and Hypothesis

Language: Python - Size: 161 KB - Last synced at: 24 days ago - Pushed at: about 1 year ago - Stars: 46 - Forks: 5

Kong-WayneState/HDMANOVA

High-Dimensional MANOVA

Language: R - Size: 47.9 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Fabio-Vieira/EBF

An R package to compute Empirical Bayes Factors from random-effect estimates. This test can be used in the mixed-effect model context to determine whether an effect should be fixed or random.

Language: R - Size: 12.7 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

mjteran/correlation_hypotheses_EDA

This project uses correlation analysis to explore relationships between health and lifestyle factors. It involves Exploratory Data Analysis EDA, Hypothesis testing, and various Correlation tests (Pearson, Point-Biserial, Phi Coefficient, Kendall’s Tau) to identify significant correlations.

Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

ederricho/Probability-Distributions

Null Distributions for Nonparametric Tests

Language: R - Size: 24.4 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Related Keywords
hypothesis-testing 1,089 python 306 statistics 253 machine-learning 148 pandas 141 data-science 141 data-analysis 131 data-visualization 128 numpy 107 r 102 exploratory-data-analysis 101 statistical-analysis 98 seaborn 77 linear-regression 75 matplotlib 70 probability 66 regression 64 ab-testing 64 scipy 62 eda 61 confidence-intervals 56 jupyter-notebook 51 python3 50 logistic-regression 49 visualization 47 t-test 42 descriptive-statistics 39 inferential-statistics 38 anova 38 p-value 37 feature-engineering 37 scipy-stats 37 anova-test 37 regression-analysis 35 statistical-inference 34 data-cleaning 33 significance-testing 33 statsmodels 32 stats 31 sql 30 random-forest 29 chi-square-test 28 probability-distribution 25 data 25 ttest 24 matplotlib-pyplot 24 r-programming 23 hypothesis-tests 22 rstudio 21 analysis 21 clustering 20 data-analytics 20 null-hypothesis 20 scikit-learn 19 statistical-tests 19 classification 19 excel 19 deep-learning 19 abtesting 18 regression-models 18 hypothesis 17 normal-distribution 17 chi2-contingency 17 z-test 16 tableau 16 kernel-methods 16 alternate-hypothesis 15 plotly 15 time-series-analysis 15 central-limit-theorem 15 data-preprocessing 15 ggplot2 15 dataanalysis 14 sklearn 14 multiple-linear-regression 14 machine-learning-algorithms 14 pca 13 analytics 13 a-b-testing 13 time-series 13 nlp 13 decision-trees 13 bootstrap 13 statistical-models 13 pandas-dataframe 12 hyperparameter-tuning 12 confidence-interval 12 correlation-analysis 12 datacleaning 11 data-wrangling 11 sampling-distribution 11 experimental-design 11 sampling 11 feature-selection 11 streamlit 11 frequentist-statistics 10 r-studio 10 estimation 10 bayesian-statistics 10 hypothesis-test 10