An open API service providing repository metadata for many open source software ecosystems.

Topic: "data-cleaning-and-preprocessing"

venkat-0706/Black-Friday

Black Friday Sales Analysis explores customer demographics, purchasing behaviors, and product trends to uncover insights and patterns driving sales during Black Friday events.

Language: Jupyter Notebook - Size: 480 KB - Last synced at: 14 days ago - Pushed at: 4 months ago - Stars: 9 - Forks: 0

aliiimaher/Laptop-Price-Prediction

This is an AI model for predicting laptop price, trained on about 1200 data.

Language: Python - Size: 9.46 MB - Last synced at: 19 days ago - Pushed at: 9 months ago - Stars: 8 - Forks: 1

Pratiikpy/Data-science-cheatsheet

Welcome to my data science repository! Here you will find a collection of resources and examples for exploring, analyzing, and manipulating data using Python. The repository includes code templates, case studies, and exercises to help you learn and practice data science concepts and techniques. The topics covered include data exploration, data visu

Size: 32.2 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

Mindful-AI-Assistants/SP2024-Election-Analysis

๐Ÿ“Š An analysis of voting patterns in Sรฃo Paulo's 2024 elections, focusing on voter behavior, absenteeism, and geographic trends.

Language: HTML - Size: 80.1 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 4 - Forks: 2

Daniel-Andarge/AiML-ethiopian-medical-biz-datawarehouse

The Ethiopian Medical Business Data Warehouse & Analytics Platform is a comprehensive data solution tailored to enhance the efficiency and efficacy of Ethiopia's healthcare and medical sectors.

Language: Jupyter Notebook - Size: 9.97 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 4 - Forks: 0

PatilNi3/PROJECT_POWER_BI

Global Superstore BI Dashboard

Size: 2.9 MB - Last synced at: 20 days ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 1

harshita2234/Potato-Prices-Prediction

Project aims to forecast potato prices in India using LSTM, KNN, and Random Forest Regression, integrating historical data on prices, regional stats, and rainfall patterns. Targeting agricultural stakeholders for informed decision-making.

Language: Python - Size: 862 KB - Last synced at: 29 days ago - Pushed at: 11 months ago - Stars: 3 - Forks: 0

Vidhi1290/Robust-yield-prediction-

"Predicting a Greener Future ๐ŸŒพ๐Ÿ“Š Delve into the world of agriculture and data science with our Yield Prediction project. We harness machine learning and weather data to forecast crop yields accurately. Join us in cultivating smarter farming practices for a sustainable tomorrow."

Language: Jupyter Notebook - Size: 278 KB - Last synced at: 25 days ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

iresil/FowlFlightForensics

A Kafka-based CSV parser for bird-related airplane accidents

Language: Java - Size: 5.23 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 2 - Forks: 0

MAHMOUD2ABDALLAH/Bike-Sales

It was a competition on KAGGLE for prediction on the most sales products on bikes via their features

Language: Python - Size: 358 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

karlyndiary/Global-Electronics-Retailer-Sales-and-Customer-Insights

Developed an analysis using Python, SQL, and Excel to examine sales and customer demographics for a Global Electronics Retailer. The findings aim to enhance business strategies and improve overall performance.

Language: Jupyter Notebook - Size: 52.5 MB - Last synced at: 22 days ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

RandomGamingDev/grabcraft-to-schema

A Python library and its cli for converting grabcraft to schema (more specifically litematica schematic) files

Language: Python - Size: 56.4 MB - Last synced at: 24 days ago - Pushed at: 5 months ago - Stars: 2 - Forks: 1

madhurimarawat/Data-Warehousing

This repository contains practical examples of data warehousing concepts, including star schema and ETL processes, all implemented using MySQL.

Language: Jupyter Notebook - Size: 10.5 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

sbrzt/castelli

Data management for Emilia-Romagna Castles project.

Language: Jupyter Notebook - Size: 8.13 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 1 - Forks: 0

utkarsh-284/Cyclistic-Case-Study

This repository is for "Cyclistic" Case Study

Language: Jupyter Notebook - Size: 76.2 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

MAHMOUD2ABDALLAH/family-members-segmentation

Machine learning Classification for Family Determination for various generations by their age, height, weight, etc...

Language: Python - Size: 629 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

hari255/Visual-Story-Telling

Developed Interactive visualizations and a Shiny Dashboard using using R from a complicated and in-complete time-series dataset.

Language: HTML - Size: 17.8 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

Adesh1214/Banking-And-Finance-Analysis-PowerBI

This project is a Banking and Financial Transaction Analysis Dashboard created using Power BI. The primary goal is to provide actionable insights into customer demographics, branch performance, and financial transactions for effective decision-making.

Size: 10.9 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

shrawans007/hotel_customers_sentiments

Sentiment Analysis for a Hotel Based on Customer's Reviews

Size: 14.6 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Anarya22/Tata-Data-Visualization-Empowering-Business-with-Effective-Insights-Job-Simulation-on-Forage

Completed a simulation involving creating data visualizations for Tata Consultancy Services. Created visuals for data analysis to help executives with effective decision making.

Size: 21.8 MB - Last synced at: 25 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

SudarshanaSRao/From-Data-to-Gold--My-Journey-Creating-an-Olympic-Tableau-Dashboard

Developed an interactive dashboard using Tableau with Kaggleโ€™s Olympic dataset.

Language: Jupyter Notebook - Size: 269 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

girish119628/CodSoft

Data Enthusiast | Predictive Modeler | Turning Insights into Strategies

Language: Jupyter Notebook - Size: 1.16 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

mayankyadav23/T20I-World-Cup-2024-Analysis

Explore my Jupyter Notebook ๐Ÿ“Š featuring comprehensive datasets and visualizations from the 2024 T20 World Cup analysis. Discover key insights into player performances ๐Ÿ, match statistics ๐Ÿ“ˆ, and team dynamics, making it a valuable resource for cricket enthusiasts and analysts alike! ๐ŸŒŸet enthusiasts and analysts alike!

Language: HTML - Size: 2.73 MB - Last synced at: 17 days ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

Hamada-khairi/PFDA-Hamada

A comprehensive R-based data analysis project that examines housing rental patterns across multiple cities, utilizing statistical methods and visualization techniques to analyze 4,746 properties' data points including rent prices, locations, and amenities. The project employs various R libraries to clean, process, and visualize rental market trends

Language: R - Size: 3.68 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

sanasayyed2001/Enhancing-Human-Resource-Operations-Through-Tableau-Visualizations

This project analyzes HR data using Tableau to uncover insights that optimize Human Resource operations. By visualizing key metrics such as staffing, salary distribution, gender balance, and performance trends, the dashboard supports data-driven decisions in areas like employee retention, salary management, and workforce diversity.

Size: 290 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

DA-Atharv/Movie_Rental_Analysis

This is a Capstone Project which provides an in-depth analysis of the Shakila DVD Rental Store, utilizing Excel, SQL, and Power BI to deliver actionable insights and dynamic dashboard visualization for enhancing business operations and customer experiences in the movie rental industry.

Size: 12.8 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

ShrishtiHore/AB-Testing-and-Predictive-Analysis-for-New-Menu-Launch-using-Alteryx

The coffee restaurant will test a new menu in Denver and Chicago using TV ads to see if it boosts profits by at least 18%, justifying the marketing costs, and needs an analysis to decide on a wider rollout.

Size: 22.6 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

shwetaagrey/SharkTankDataDive

Explore Shark Tank investments with SQL. Uncover insights, success rates, and industry preferences.

Size: 3.91 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

sumitdeole/data-cleaning-project

E-commerce use case: This project conducts a comprehensive data cleaning exercise on the eCommerce data.

Language: Jupyter Notebook - Size: 1.24 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

Shwethapoojary-20/Data-analysis-with-python

Data cleaning , analysis and visualization of 4 different sector's data

Language: Jupyter Notebook - Size: 10.7 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

MSnellgrove/Springboard

Projects and case studies from my time studying Data Science with USF

Language: Jupyter Notebook - Size: 26 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Rama-Mwenda/Indian_Ecosystem_Analysis

A project analyzing the Indian startup ecosystem between 2018 and 2021.

Language: Jupyter Notebook - Size: 176 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

rajesh9943/Market-Domination-and-Exploring-Trends-in-2023-Indian-Car-Sales-with-Exploratory-Data-Analysis

The report also reveals the dominance of the Top 25 best-selling car models. These top sellers captured a substantial portion of the market, accounting for more than 75% of the total cars sold in April 2023 (likely a typo, referring to May 2023). This suggests a preference for established and popular car models among Indian consumers.

Language: Jupyter Notebook - Size: 1.24 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

TharunRacharla/Sports_Celebrity_Image_Classification-project

Language: Jupyter Notebook - Size: 108 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

doshiharmish/NYC-Green-Taxi-Trip-Analysis

Explore NYC Green Taxi data, predicting fares and optimizing pickup locations using machine learning. Regression models uncover travel patterns and enhance taxi services for an efficient urban transport experience.

Language: Jupyter Notebook - Size: 1.05 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

epmanu185/Restaurant_order_analysis

Analyze order data to identify the most and least popular menu items and types of cuisine

Size: 139 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

arbaazkhaan/FIFA-Dataset-Refinement

Welcome to the FIFA Dataset Data Cleaning and Transformation project! This initiative focuses on refining and enhancing the FIFA dataset to ensure it is well-prepared for in-depth analysis. The project involves a comprehensive data cleaning process and transformation of key features to improve data quality and usability.

Language: Jupyter Notebook - Size: 11.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

arbaazkhaan/Developer-Insights-Project

This project delves into comprehensive insights extracted from the Stack Overflow Developer Survey 2022. The dataset provides a rich source of information about developers' demographics, coding experience, compensation, and various aspects of their professional lives.

Language: Jupyter Notebook - Size: 2.63 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

nikhilbordekar/EDA-of-Play-Store-Data

This project is an EDA endeavor that delves into the world of Google Play Store data. This project uncovers valuable trends, patterns, and statistics within the Play Store ecosystem. From app categories and user reviews to pricing and app sizes, the project offers a comprehensive analysis of this dynamic and ever-expanding marketplace.

Language: Jupyter Notebook - Size: 966 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Eoinmark/Python-Projects

This repository is a compilation of my academic and personal projects accomplished using Python. The most common libraries used in these projects include NumPy, Pandas, Scipy, and Matplotlib. Also contained in this repository are the certificates I gained by upskilling in pursuit of my passion and eagerness for Data Science.

Language: Jupyter Notebook - Size: 5.86 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

bartek410/Capstone-Project

This is my BrainStation capstone project on music genre classification. I use supervised and unsupervised learning to classify songs into genre based on specific attributes from two Spotify datasets..

Language: Jupyter Notebook - Size: 41.3 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

morikaglobal/waterpipe_breakage_data_analysis

Data Analysis of potential factors affecting water pipe breakage

Language: Jupyter Notebook - Size: 21.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

VivekAgrawl/food-app-analysis

In this project, a real-world dataset from Zomato, one of the most widely used food ordering platforms, was worked on.

Language: Python - Size: 7.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

VivekAgrawl/medical-appointments-analysis

This project involves analyzing real-world medical appointment data through Time Series Analysis. The tasks include dataset cleaning, comprehensive analysis, and extracting insights using Python and MySQL.

Language: Python - Size: 2.38 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

RajveerAlive/Data-analytics

Data analyst trainee

Size: 19.6 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

Benazir023/BookReviewAnalysis_efficient_workflow

This is a Dataquest project that focuses on creating an efficient workflow

Size: 318 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

ericsun153/Illuminating_US_Outage_Landscape

Power Outage Data Analysis in USA

Language: Jupyter Notebook - Size: 26.5 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 1

KiranMallik/Sales-vs-Customer-Ratings-Analysis-in-Power-BI

The goal is to understand whether higher customer ratings drive more sales and how businesses can optimize their strategies using data-driven insights to identify key sales patterns

Size: 287 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

Calebtheman116/hotel_customers_sentiments

Sentiment Analysis for a Hotel Based on Customer's Reviews

Size: 1.95 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

RKodah/Correlation-One-Datafolio

Submission requirement for Correlation One graduation. The repo contains every step of the process as well as the live dashboard and Google Colab for the python code.

Language: Jupyter Notebook - Size: 5.1 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

Nikol2004/Databases-and-Big-Data-2024

Designed a relational database from scratch using Spotify's most-streamed tracks dataset. Cleaned and normalized data, built queries to explore musical trends, and analyzed song features across platforms.

Language: Python - Size: 106 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

Manishdebnath99/Build_Week_Project

SAL_BW_Project_1 โ€“ Analyzing Job description data to derive meaningful insights.

Language: Jupyter Notebook - Size: 33.7 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

m-hussain-x199/data-science

Projects related Data Visualisation, Cleaning and Preprocessing.

Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

jrili/ibm-project-world-largest-banks

Web scraping + ETL project: Extract and compile information about the 10 largest banks in the world. From IBM via Coursera.

Language: Python - Size: 4.88 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

jrili/data-engineer-portfolio

Jessa Rili-Migriรฑo's Data Engineer Portfolio

Size: 10.7 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

ailasheri83/data-science-portfolio

A portfolio showcasing my data science and analysis projects.

Language: HTML - Size: 990 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

NikitaPatil7/Netflix-dataset-Data-Cleaning-and-Preprocessing-

Data cleaning and exploratory data analysis (EDA) on the Netflix Titles dataset. This project covers preprocessing, handling missing values, standardization, and visual insights using Python and Pandas.

Language: Jupyter Notebook - Size: 1.66 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

jrili/datacamp-cleaning-bank-marketing

Data cleaning project on bank marketing campaign data from Datacamp

Language: Jupyter Notebook - Size: 507 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

omari-kd/TransBorder-Freight-Data-Analysis

This project analyses transportation data from the Bureau of Transportation Statistics (BTS) to uncover insights into cross-border freight's efficiency, safety and environmental impacts across road, rail, air and water modes.

Language: R - Size: 346 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 0 - Forks: 0

jensoto/MPS-DataAnalytics

Academic portfolio of course work for my Master's in Data Analytics.

Size: 116 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

anupamsaha18/Fintech-Capstone-Project---Loan-Default-Analysis

The Loan Default Analysis project aims to identify key factors contributing to loan defaults by analyzing borrower profiles, financial data, and credit risk indicators. Using statistical methods, visualizations, and predictive modeling, the project provides insights to mitigate risks and improve lending strategies.

Size: 1.95 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

crazy-dot/Zomato-Data-Analysis

This project analyzes 50k Bengaluru restaurants from Zomato, focusing on 17 features like location and ratings. It cleans, explores, and visualizes data to improve services. Key visualizations include delivery, booking, location, and cost. The goal is to provide insights for better customer experiences.

Language: Jupyter Notebook - Size: 14.6 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

anthonypham26/ClassicalMachineLearning01

This repository contains a beginner-friendly introduction to Machine Learning, covering essential concepts such as data preprocessing, feature engineering, data visualization, and ML fundamentals.

Size: 379 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

angelicaakwarteng/Python-projects

This portfolio contains projects done using Python programming language to work on real-life data to gain insights. The individual projects cover random topics like defining my own functions, creating my own classes, exploratory data analysis and even predictive modelling using Jupyter notebook.

Language: Jupyter Notebook - Size: 3.2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

milestxne/oibsip

This repository contains different data analysis projects under my Oasis Infobyte Internship.

Language: Jupyter Notebook - Size: 20.5 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

gunagvn/Guna_ML_model

My First basic Machine Learning model

Language: Jupyter Notebook - Size: 214 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

MuhammadAdnan1998/Growth_Mindset_Challenge_Web_App_with_Streamlit

Data Sweeper is a Streamlit-based tool for CSV to Excel conversion, data cleaning, and visualization. Easily remove duplicates, fill missing values, select columns, and generate interactive charts. Perfect for data analysts and business professionals. ๐Ÿš€

Language: Python - Size: 2.93 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Wilfrida-Were/Retail-Sales-EDA-in-Python

Exploratory Data Analysis on Retail Sales Data in Python

Language: Jupyter Notebook - Size: 672 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

TanmayBorse/Institionistic_fuzzy_approx_space

This model introduces a hybrid approach that utilizes rough sets on intuitionistic fuzzy approximation spaces for pre-processing and soft sets for post-processing, resulting in an effective decision-making solution.

Language: Jupyter Notebook - Size: 5.08 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

k4karann/healthdata_analysis_using_r

HealthCare Data analysis using R Lang includes data cleaning, data processing, and visualizations.

Language: R - Size: 506 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

sumedhsp/Exploring-311-Service-Requests

311 Service Requests: Resource Utilization & Resolution Time Prediction - Foundations of Data Science - Final Project (CS-GY 6053)

Language: Jupyter Notebook - Size: 3.52 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

kathisnehith/Medicare-IP-hospital-Analysis

In-depth Data analysis and visualization of Medicare inpatient hospital data.

Size: 9.77 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

cudjoejosephine/Advanced-Excel

This repository contains all my Data Analytics projects in Excel

Size: 461 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Abhi-Pat/PowerBI_Sales-and-Transaction-Data-Analysis

Data Preparation: Imported and cleaned large datasets (Customer, Transaction, Product, and Sales) using DAX queries for column creation and Power Query for transformation. Visualization: Designed tailored dashboards for desktop and mobile, integrating Smart Narrative and AI visuals for actionable business insights.

Size: 1.32 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

who-else-but-arjun/Convolve

This repository contains the projects developed for the Convolve PAN IIT AI-ML Hackathon, conducted by IDFC Bank. Predicting Credit Card Defaulters โ€“ A deep learning-based model to assess the risk of credit card default. Optimizing Email Engagement Time Slots โ€“ A machine learning model to determine the best time slots for personalised emails.

Language: Jupyter Notebook - Size: 7.19 MB - Last synced at: 14 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 1

aninditaws/Questionnaire-Exploratory-Data-Analysis

A comprehensive EDA project for analyzing questionnaire results. Includes data cleaning, descriptive statistics, and visualizations to identify trends and patterns in survey responses.

Language: HTML - Size: 1.07 MB - Last synced at: 28 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

LKEthridge/Integrated_Project_2

Integrated Project 2 from TripleTen

Language: Jupyter Notebook - Size: 15 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Chiugo-Nsoke/Student-Performance-Analysis

An analysis of student performance factors using Python, featuring data cleaning, EDA, and machine learning for prediction.

Language: Jupyter Notebook - Size: 777 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

LegallyNotBlonde/Tableau_Citi_Bike

Analyzed Citi Bike data to uncover trends in ride duration, peak usage, and station demand, offering recommendations to optimize bike availability

Language: Jupyter Notebook - Size: 23.7 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

JohnnySolo/Data-Analysis-Project---Spotify-Hit-Songs

That's my 1st year project in the course "Introduction to Statistics"

Size: 62.5 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

anwarraif/contoh-dsf-porto

EDA is Exploratory and Explanatory Data Analysis. I'm using Global Super Store from Kaggle.com . Superstores industry comprises of companies that operateby having large size spaces which store and supply large amounts of goods.

Language: Jupyter Notebook - Size: 10 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Wilfrida-Were/Tech-Layoffs-Data-Cleaning-in-SQL

Tech Layoffs Data cleaning in SQL - Data Visualisation in Tableau

Language: Jupyter Notebook - Size: 461 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

eshambr/Corpay-Cross-Sell-Strategy-Enhancement-and-Model-Building

This project refines Corpay's Cross-Sell Program by analyzing customer performance data and using predictive modeling to identify high-creditworthiness profiles, optimizing customer selection and profitability.

Language: R - Size: 0 Bytes - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

vamsi-krishnakOO7/Research-on-VIDWAN-Authors-Collaboration-Network

The following repository contains the files and data relevant to the study I did about the Evolution of Research Themes and Trends in India Across the Years via the VIDWAN Author's Collaboration Network

Language: Jupyter Notebook - Size: 7.02 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Anu0408/House-Price-Prediction-MachineLearning-Application

A real-time, end-to-end machine learning application built with Flask and integrated with MLflow for tracking and model management. The application predicts house prices based on user input, leveraging trained regression models and providing a web interface for seamless interaction.

Language: Python - Size: 462 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

AsuquoAA/Ann_Arbor_Weather_Analysis_2005-2015

This project analyzes historical weather data from Ann Arbor, Michigan, collected by the National Centers for Environmental Information (NCEI) Global Historical Climatology Network daily (GHCNd).

Language: Jupyter Notebook - Size: 2.96 MB - Last synced at: 20 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

AsuquoAA/Big_4_Sports_Teams_and_City_Population_Analysis-2018-

Analysis of sports teams' win/loss ratios vs. metro area populations across NFL, NBA, MLB, and NHL.

Language: HTML - Size: 86.9 KB - Last synced at: 26 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Pra5etya/Covid-in-Indonesia

Analysis of the spread of COVID-19 in Indonesia, based on daily cases, death and recovery rates, and regional categories based on WHO. This repository includes informative visualization data, risk level classifications, and data-driven analysis to understand the impact of the pandemic in different regions.

Language: Jupyter Notebook - Size: 55.7 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Krisanth-21/T20-World-Cup-Top-11-Player-Analysis

This project analyzes the 2022 T20 World Cup data to determine the top 11 players based on their performance. I used ParseHub to collect data from the ESPNcricinfo website, then cleaned and transformed it with NumPy and pandas, Python libraries, and created dashboards in Power BI.

Language: Jupyter Notebook - Size: 474 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

AnnaAnastasy/Mushroom-Binary-Classification-EDA-ML

Explored and modeled a competition dataset of mushroom species, focusing on data cleaning, exploratory data analysis, and building machine learning models for accurate classification of edible and poisonous mushrooms.

Language: Jupyter Notebook - Size: 4.06 MB - Last synced at: 25 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

lmizner/CS249_DataScienceFundamentals

Course work from UCLA's CS249 - Data Science Fundamentals

Language: Jupyter Notebook - Size: 2.45 MB - Last synced at: 20 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Aqila-Farahmand/MasterThesis

Identifying Files and Workflows Contributing to TD Using Data Mining and NLP Techniques

Language: Jupyter Notebook - Size: 2.56 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

2020101/Healthcare-data---Python-Data-Prepartion-Data-Cleaning-Data-Exploration

Jupyter notebook templates in Python for Preparation, Data Cleaning, and EDA

Language: Jupyter Notebook - Size: 9.8 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

udhaya2823/CarDheko-Used_Car_Price_Prediction

๐Ÿš— Car Dheko - Used Car Price Prediction This project enhances Car Dheko's customer experience by deploying an ML model that predicts used car prices accurately. Using a multi-city dataset, we perform data cleaning, feature engineering, and model optimization. The final model is hosted on a Streamlit app, providing instant price prediction.

Language: Jupyter Notebook - Size: 8.14 MB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

RampageousRJ/Network-Traffic-Analysis

Analysis report pattern based on Wireshark captures to resolve anomalies and enhance the efficiency of the network.

Language: HTML - Size: 17.6 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

AvanishVerma1703/PRODIGY_DS_01

This repository contains my submission for Task-1 of the Data Science Internship at Prodigy Infotech.

Language: Jupyter Notebook - Size: 748 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

kente001/Excelerate-EDA-Project

This project involved analyzing data from an online platform where users sign up and apply for various programs. The dataset included demographic details, program applications, sign-up dates, badge statuses, rewards, and skill points. The objective was to identify insights that could improve the platformโ€™s usability and user experience.

Language: Python - Size: 2.8 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Patangaysheetal/DataAnalyticsPersonalProjects

Size: 29.9 MB - Last synced at: 24 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

yemifatodu/SQL-QUERY_MERI-SKILL-SALES-DATA

sql queries used for meri skill internship sales project

Size: 7.81 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Srujana-Konga/Customer-Segmentation

This project segments customers based on purchasing behavior and demographics to enhance marketing strategies and improve customer satisfaction. A Decision Tree classifier is utilized to predict customer responses to marketing campaigns using various customer attributes.

Language: Jupyter Notebook - Size: 811 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Related Topics
data-visualization 82 data-analysis 65 python 51 exploratory-data-analysis 35 pandas 26 sql 25 data-science 25 excel 18 machine-learning 17 jupyter-notebook 17 feature-engineering 15 data-analytics 14 matplotlib 14 data-cleaning 13 numpy 12 powerbi 12 tableau 11 python3 10 data-transformation 9 eda 9 etl 8 machine-learning-algorithms 8 linear-regression 7 seaborn 7 data-analysis-python 6 hyperparameter-tuning 6 cross-validation 6 sentiment-analysis 6 pivot-tables 6 data-mining 6 kaggle-dataset 5 random-forest 5 model-deployment 5 data-wrangling 5 tableau-dashboards 5 tableau-public 4 data 4 regression 4 model-training-and-evaluation 4 sklearn 4 supervised-learning 4 decision-tree-classifier 4 random-forest-classifier 4 business-intelligence 4 mysql 4 visualization 4 dashboard 4 microsoft-excel 4 matplotlib-pyplot 4 r 4 data-engineering 4 pandas-dataframe 3 predictive-analytics 3 machine-learning-models 3 pandas-library 3 time-series-analysis 3 data-integration 3 descriptive-statistics 3 nlp-machine-learning 3 streamlit 3 logistic-regression 3 natural-language-processing 3 supervised-machine-learning 3 statistical-analysis 3 model-evaluation 3 data-manipulation 3 data-analyst 3 predictive-modeling 3 retail 3 data-visualisation 3 exploratory-data-visualizations 3 business-analytics 3 docker 3 pandas-python 3 regression-models 3 classification 3 data-visualization-project 3 data-visualization-dashboard 3 deep-learning 2 insights 2 dashboards 2 data-warehousing 2 anomaly-detection 2 data-vizualisation 2 etl-pipeline 2 data-mining-python 2 database 2 reporting 2 ai 2 customer-behavior-analysis 2 scikit-learn 2 data-manipulation-with-pandas 2 gridsearchcv 2 powerquery 2 lstm 2 report 2 neural-networks 2 data-quality-assessment 2 dimensionality-reduction 2 silhouette-score 2