GitHub topics: explanatory-data-analysis
ahmad-prasetyo-hermanto/explanatory-data-analysis
Background problem Pada bagian ini silahkan bisa menjelaskan background problem yang ada pada project anda. Jelaskan Situation (problem2nya) dan juga berkaitan dengan Task (tujuan, goals, yang mau dicapai dan dikerjai).
Size: 2.93 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

mhmmdrzkya2000/DigitalSkillFair38_Data_Science_2025
Titanic EDA - Explanatory Data Analysis Repository ini merupakan hasil pelatihan selama 1 minggu dari DigitalSkillFair38 Data Science yang saya ikuti bersama Dibimbing.id berfokus materi tentang proses Explanatory Data Analysis (EDA) terhadap datasheet Titanic
Language: Jupyter Notebook - Size: 8.4 MB - Last synced at: 23 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

harisfariyano/EDA-Python-Titanic-Dataset
Titanic EDA - Explanatory Data Analysis Repository ini merupakan hasil dari mini bootcamp yang saya ikuti bersama Dibimbing.id, yang berfokus pada proses Explanatory Data Analysis (EDA) terhadap dataset legendaris Titanic
Language: Jupyter Notebook - Size: 93.8 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

sof1a03/Explainable_AI
Explainable AI (INFOMXAI) - utrecht University. Group 4
Language: Jupyter Notebook - Size: 13.9 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

roma-glushko/kaggle-house-prices
🏘 Ames house dataset modelled and explained
Language: Jupyter Notebook - Size: 39.1 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 0

zenahmad06/binary-classification-rainfall
Binary classification rainfall using Logistic regression and XGBoost
Language: HTML - Size: 623 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

kivanc57/explaratory_analysis
Exploratory and Descriptive Data Analysis on Indonesian data using R. This project involves reading data, feature analysis, correlation analysis, logistic regression, PCA, MDS, and clustering. Visualizations include boxplots, scatter plots, corrgrams, and dendrograms. Comprehensive report available in report.docx.
Language: R - Size: 1.71 MB - Last synced at: 6 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

kivanc57/feature_comparison
This project explores the relationship between features and diagnosis in cancer data. Using methods like boxplots, scatterplots, PCA, k-means clustering, and logistic regression, we analyze and visualize data to understand health indicators.
Language: R - Size: 3.33 MB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

FrienDotJava/air-quality-analysis
Data Analysis for Air Quality in different District in Beijing.
Language: Jupyter Notebook - Size: 17.8 MB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

ngangawairimu/regression-model-for-predicting-house-prices
This project focuses on applying statistical modeling techniques to predict house prices in Melbourne using the Melbourne House Price dataset. It involves data cleaning, exploratory data analysis (EDA), feature selection, and fitting a regression model to predict the target variable, which is the house price.
Language: Jupyter Notebook - Size: 1.66 MB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Milanpeter-77/Coursework-Yacht-Pricing
Language: R - Size: 14.6 KB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

sondosaabed/Prosper-Loan-Analysis
A Comprehensive exploratory data analysis (EDA) on a loan dataset to uncover key trends, patterns, and relationships among various loan attributes. By visualizing and analyzing the data, we aim to gain insights into loan performance, borrower characteristics, and market dynamics. 🪙🏦
Language: HTML - Size: 26.1 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 5 - Forks: 0

mzfarhan/Analyzing-eCommerce-Business-Performance-with-SQL
This project was created to solve an E-Commerce business case provided by Rakamin Academy.
Language: Jupyter Notebook - Size: 726 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

idilersudas/BTC-NewsSentiments
Bitcoin price fluctuation prediction model using headline sentiment scores from top newpaper articles. This is the repository that includes all the data and python scripts used while creating the project.
Language: Jupyter Notebook - Size: 8.94 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

SalonenAntti/LSTM-Modeling
Predicting stock performance with LSTM
Language: Jupyter Notebook - Size: 7.32 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

wambugu71/auto_eda_dsail
Automating process of EDA (Explaratory Data Analysis) with Generative AI and opensource python tools.
Size: 27.3 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

gkansdine/Analyze-medical-appointment-data
The purpose of this report is to analyze medical appointment data to identify factors influencing no-show rates
Language: HTML - Size: 3.34 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

mbithesss/IBM-HR-Analytics-Employee-Attrition-and-Perfomance
This project analyzes a dataset of nearly 1500 IBM employees to explore factors contributing to employee attrition.
Language: Jupyter Notebook - Size: 1.26 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

sandhya-0310/Worldwide-Mortality-Analysis-2021
Worldwide-Mortality-Analysis-2021 examines COVID-19's impact on global mortality rates and national responses, revealing significant age-related effects and highlighting disparities linked to institutional trust rather than income inequality.
Language: Jupyter Notebook - Size: 490 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

Shoh96/ALX-Data-Analyst
This repository contains 3 projects that were carried out and submitted for my ALX Udacity Data Analyst Course
Language: HTML - Size: 9.18 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

Briankim254/Explaratory-data-analysis
This repository demonstrates the use of Pandas Profiling library for Exploratory Data Analysis (EDA) within a Jupyter Notebook. By automating much of the EDA process, the library generates comprehensive and interactive reports, complete with insightful visualizations to facilitate data understanding.
Language: Jupyter Notebook - Size: 544 KB - Last synced at: 7 months ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Hide-A-Pumpkin/Obesity-Data-Analysis
This is the final community contribution for EDAV Fall 2023, Columbia University. Author: Xinyi Zhao, Jean Law
Language: Jupyter Notebook - Size: 29.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 1

saboye/EDA-Canadian-Income-Distribution
A project is to make a simple Exploratory Data Analysis to find if there is a direct relationship between income and the level of education in Canada.
Language: R - Size: 81.1 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

fadiyahsutopo/AnalisisDataPython
Proyek Akhir kelas Belajar Analisis Data dengan Python dari Dicoding Indonesia
Language: Python - Size: 439 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

nadineamin/pisa_data_analysis
# PISA 2012 Data ## by Nadine Amin ## Dataset > PISA is a survey of students' skills and knowledge as they approach the end of compulsory education. It focuses on examining how well prepared the students are for life beyond school. > Around 510,000 students in 65 economies took part in the PISA 2012 assessment of reading, mathematics and science representing about 28 million 15-year-olds globally. Of those economies, 44 took part in an assessment of creative problem solving and 18 in an assessment of financial literacy. ## Summary of Findings > Before starting this study, I thought the features that would affect the total scores the most were the teachers' influences, the students' immigration status, the class size, and the parents' highest schooling. However, almost none of my assumptions were correct once I started to see the relationships of the variables with the total scores and with other variables. > The number of cellphones, TVs, computers & books, the parents' schooling & jobs, and the homework study time were the variables that affected the total scores. > The higher the number of cellphones, TVs, computers and books, the higher the chances of getting a better total score. This could be because the family's social status was better, and therefore provided better support for the students. > As long as the parents' schooling was level 3A or higher, there is a good chance that the students would get higher grades. Furthermore, parents who had full-time jobs resulted in their children getting higher scores. This could be because having role models to look up to will make you work harder and believe in yourself more. > Finally, students who studied for longer hours had a higher chance of scoring better. ## Key Insights for Presentation > In the presentation, I will show the plots that had an effect on the total score the most. Those include the bivariate plots of the variables mentioned above against the total score. I will also include the multivariate plot of the father and mother's jobs vs. the number of cellphones vs. the total score.
Language: HTML - Size: 19.6 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 2

VaderSame/Iris-Dataset
This GitHub repository contains a comprehensive analysis of the popular Iris dataset using various machine learning algorithms, including Logistic Regression, Support Vector Machines (SVM), and Random Forest. Additionally, it explores the impact of different data split ratios (80-10-10 vs. 60-20-20) on model performance.
Language: Jupyter Notebook - Size: 632 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

kae4ka/anime-recommendations
Explanatory data analysis of anime ratings dataset 🕵🏻♀️
Language: Jupyter Notebook - Size: 60.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

OrNixz/case-study-dicoding-collection
This is a part of the exercise project provided by Dicoding in "Learn Data Analytics with Python" course.
Language: Jupyter Notebook - Size: 436 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

kk289/Stock_Price_Prediction
Stock Price Prediction of APPLE Using Python
Language: Python - Size: 1.03 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 1

ArkadiuszSlowik/Data-analysis-and-Storytelling
Exploratory and explanatory analysis of a small, generated dataset. pandas + missingno + seaborn.
Language: Jupyter Notebook - Size: 1.78 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

sakshi-shende/Brooklyn-housing-prices
Using linear regression to explain housing prices in Brooklyn, NY from 2016-2020 and estimate how prices changed from quarter 3 and quarter 4 of 2020
Language: R - Size: 2.99 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Andrew2077/102Flowers-EDA-Classification
FellowshipAi project
Language: Python - Size: 46.7 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

alchemine/spatio-temporal Fork of Inha-Competition-Team/spatio-temporal
2022 인하 인공지능 챌린지
Language: Jupyter Notebook - Size: 312 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

madandahal/Auto-Mpg--Linear-Regression
Auto_MPG (Linear Regression)
Language: Jupyter Notebook - Size: 451 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 1

morikaglobal/waterpipe_breakage_data_analysis
Data Analysis of potential factors affecting water pipe breakage
Language: Jupyter Notebook - Size: 21.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

ahujaya/Wrangle-and-Analyze-Twitter-Data-Python
The dataset that I will be wrangling, analyzing and visualizing is the tweet archive of Twitter user @dog_rates, also known as WeRateDogs. WeRateDogs is a Twitter account that rates people's dogs with a humorous comment about the dog. These ratings almost always have a denominator of 10. The numerators, though? Almost always greater than 10. 11/10, 12/10, 13/10, etc. Why? Because "they're good dogs Brent." WeRateDogs has over 4 million followers and has received international media coverage.
Language: Jupyter Notebook - Size: 25.2 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

ahujaya/Communicate-Data-Findings-Python
Bay Wheels is a public bicycle sharing system in the San Francisco Bay Area, California. I'm most interested in exploring the bike trips' duration and rental events occurrance patterns in terms of time of day, day of the week, along with how these relate to the riders' characteristics, i.e. their user type, gender, age, etc. to get a sense of what and how people are using the bike sharing service for.
Language: HTML - Size: 2.83 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

aeglon97/Communicate-Data-Findings
A presentation of my exploratory data visualization results with univariate, bivariate, and multivariate features.
Language: HTML - Size: 31.1 MB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

yabiola/udacity-data-analyst-projects
This repository contains 3 projects that were carried out and submitted for my ALX Udacity Data Analyst Course
Language: HTML - Size: 25.8 MB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

antran28/House-Price-Prediction
Predict house price using linear regression model
Language: Jupyter Notebook - Size: 278 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

Abanoub8yuossef/Prosper-Loan_data-analysis-project
making analysis using Exploratory and Explanatory visualizations through Univariate, Bivariate, Multivariate Exploration
Language: Jupyter Notebook - Size: 2.97 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

Ch3ekiBr3eki/popular_videogames
EDA analysis and a couple of models from classical machine learning on actual data as of 12.07.2023 about video games. Dataset from kaggle link in readme.
Language: Jupyter Notebook - Size: 619 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

alchemine/analysis-tools
Analysis tools for machine learning projects
Language: Python - Size: 10.4 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Paul-Asamoah-Boadu/Prosper-Loan-Data
This data set contains 113,937 loans with 81 variables on each loan, including loan amount, borrower rate (or interest rate), current loan status, borrower income, and many others. The analysis explore the factors and patterns in the creditworthiness of borrowers and the borrowing trend of Prosper Loan Business.
Language: HTML - Size: 2.67 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

collins-kimotho/Wrangle-and-analyze-project
Data Wrangling and Analysis Project: Analyzing WeRateDogs Twitter Account Data
Language: HTML - Size: 3.78 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

NouranHany/Instacart-Market-Basket-Analysis
A Recommender system that predicts your next order based on your previous purchases. Also, it discuss the association between product purchases.
Language: Jupyter Notebook - Size: 32.5 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 11 - Forks: 0

hellofromtheothersky/Laptop-price-analysis-and-prediction
Crawl data, process data, visualize, and create ML model for laptop price prediction
Language: Jupyter Notebook - Size: 987 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 1

Usama-Tariq/Udacity_Communicat-Data-Findings_Project-5_DAND
Performed an exploratory data analysis using python and presented explanatory plots that convey insights of data.
Language: HTML - Size: 1.21 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

Sadiq-marcelo/investigate-FordGoBike-tripdata
Investigate Ford GoBike Project
Language: HTML - Size: 7.55 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

iamsj2022/Job-a-ThonNOVAV
Analytic Vidhya's Problem Statement to Predict or Forecast the Future Energy Demand For Next Three Years.
Language: Jupyter Notebook - Size: 1.68 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

dilsadkut/World_Happiness_Analysis
World Happiness Analysis
Language: Jupyter Notebook - Size: 879 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

dorothy-nguyen/prosper-loan-exploration
This project is conducted as a part of Udacity Data Analyst Nanodegree. The purpose of this project is to perform exploratory data analysis, then create a presentation with explanatory charts that conveys findings and insights from the data set provided.
Language: HTML - Size: 3.18 MB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

apriantoni/DokumentasiTesis2022
Language: Jupyter Notebook - Size: 11.2 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

chunlinli/travelers
2017 Travelers Case Competition Minnesconsin Insurance Company Modeling Problem.
Language: TeX - Size: 262 KB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

DSKunth/Communicate-Data-Findings
This project has two parts that demonstrate the importance and value of data visualization techniques in the data analysis process: Exploratory and Explanatory Data Visualization.
Language: HTML - Size: 4.4 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

amige9/Loan_Data_from_Prosper_Visual_Analysis
Exploratory and Explanatory Visualization on Prosper Loan Data
Language: HTML - Size: 2.76 MB - Last synced at: 7 months ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

TomiJames/communicate-data-findings
This repo contains notebooks as well as other files where the Ford GoBike System data was analyzed using exploratory and explanatory visualization techniques.
Language: HTML - Size: 15.3 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

alinasahoo/python-data-science-essentials-2
This repository contains my learning path of python for data-science essential training(part-2). Here, I have included chapter-wise topics and my practice problems. Also, feel free to checkout for better understanding.
Language: Jupyter Notebook - Size: 416 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 1

mhezarei/divar-data-analyst-summercamp-entrance-task
Divar's 2021 Data Analyst summer camp entrance task.
Language: Jupyter Notebook - Size: 9.86 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 7 - Forks: 0

alinasahoo/titanic-kaggle-eda
This repository was just for my practice. Here, I have performed explanatory data analysis on the famous titanic dataset from kaggle.
Language: Jupyter Notebook - Size: 250 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

EnkiDoctor/The_TMDB_data_analysis
The analysis and prediction of TMDB dataset
Language: Jupyter Notebook - Size: 18.9 MB - Last synced at: about 2 months ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0

morikaglobal/EDA_starwars_survey
EDA with Python (Pandas and Matplotlib)
Language: Jupyter Notebook - Size: 198 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

nachovazquez98/multiple-regression-with-continuous-variables
The dataset is property of Ares Materials Inc
Language: Jupyter Notebook - Size: 6.65 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 1

canaytore/boun-data_analysis-group_work Fork of pjournal/boun01g-data-mine-r-s
performed end-to-end reproducible data analyzes in the IE48A - Essentials of Data Analysis course taught at Boğaziçi University
Size: 26.2 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

canaytore/boun-data_analysis Fork of pjournal/boun01-canaytore
performed end-to-end reproducible data analyzes in the IE48A - Essentials of Data Analysis course taught at Boğaziçi University
Language: HTML - Size: 3.67 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

Paras-Singh7/InstaCart
Language: Jupyter Notebook - Size: 30.3 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

Paras-Singh7/Uber-Data-Analysis
Analysing the data of uber using R
Language: Jupyter Notebook - Size: 103 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

RosanaFSS/Tableau-journey
Gaining familiarity with Tableau
Size: 1000 KB - Last synced at: 3 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

shikhargupta-in/Predicting-House-Prices
Predicting House Prices using Linear Regression Model
Language: Jupyter Notebook - Size: 1.03 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

roma-glushko/kaggle-wine-quality
🍷 Quality analysis of red and white variants of the Portuguese "Vinho Verde" wine
Language: Jupyter Notebook - Size: 64.2 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

RohitMidha23/Explained
Basics of ML libraries Explained through Jupyter Notebooks
Language: Jupyter Notebook - Size: 3.83 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 13 - Forks: 6

ZSoumia/US-flights-data-story-
This project was the last project of my data analyst nanodegree : Creating a data story with Tableau
Size: 1.44 MB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 0

ZSoumia/EDA-for-Monica-dataset
This is the 6th project in my data analysis nanodegree and it focuses on prforming exploratory data analysis ( or EDA for short ) in R
Language: R - Size: 1.2 MB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 0

praxitelisk/DAND-P6-Create-A-Tableau-Story
In this data analysis project, I have explored the Prosper dataset and used Tableau to create my visualizations. Prosper is a peer-to-peer platform that lends money and its goal is to connect people who need money with those people who have the money to invest.
Size: 31.5 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0
