An open API service providing repository metadata for many open source software ecosystems.

Topic: "data-cleaning-and-preprocessing"

venkat-0706/Black-Friday

Black Friday Sales Analysis explores customer demographics, purchasing behaviors, and product trends to uncover insights and patterns driving sales during Black Friday events.

Language: Jupyter Notebook - Size: 480 KB - Last synced at: 4 months ago - Pushed at: 7 months ago - Stars: 9 - Forks: 0

aliiimaher/Laptop-Price-Prediction

This is an AI model for predicting laptop price, trained on about 1200 data.

Language: Python - Size: 9.46 MB - Last synced at: 4 months ago - Pushed at: 12 months ago - Stars: 8 - Forks: 1

Mindful-AI-Assistants/SP2024-Election-Analysis

📊 An analysis of voting patterns in São Paulo's 2024 elections, focusing on voter behavior, absenteeism, and geographic trends.

Language: HTML - Size: 86.6 MB - Last synced at: 9 days ago - Pushed at: 13 days ago - Stars: 7 - Forks: 3

Pratiikpy/Data-science-cheatsheet

Welcome to my data science repository! Here you will find a collection of resources and examples for exploring, analyzing, and manipulating data using Python. The repository includes code templates, case studies, and exercises to help you learn and practice data science concepts and techniques. The topics covered include data exploration, data visu

Size: 32.2 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

Daniel-Andarge/AiML-ethiopian-medical-biz-datawarehouse

The Ethiopian Medical Business Data Warehouse & Analytics Platform is a comprehensive data solution tailored to enhance the efficiency and efficacy of Ethiopia's healthcare and medical sectors.

Language: Jupyter Notebook - Size: 9.97 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 4 - Forks: 0

PatilNi3/PROJECT_POWER_BI

Global Superstore BI Dashboard

Size: 2.9 MB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 1

omari-kd/TransBorder-Freight-Data-Analysis

This project analyses transportation data from the Bureau of Transportation Statistics (BTS) to uncover insights into cross-border freight's efficiency, safety and environmental impacts across road, rail, air and water modes.

Language: R - Size: 346 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

MAGICS-LAB/SMUTF Fork of fireindark707/Python-Schema-Matching

[Information System] SMUTF: Schema Matching Using Generative Tags and Hybrid Features

Language: Python - Size: 21.1 MB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 3 - Forks: 2

harshita2234/Potato-Prices-Prediction

Project aims to forecast potato prices in India using LSTM, KNN, and Random Forest Regression, integrating historical data on prices, regional stats, and rainfall patterns. Targeting agricultural stakeholders for informed decision-making.

Language: Python - Size: 862 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

Vidhi1290/Robust-yield-prediction-

"Predicting a Greener Future 🌾📊 Delve into the world of agriculture and data science with our Yield Prediction project. We harness machine learning and weather data to forecast crop yields accurately. Join us in cultivating smarter farming practices for a sustainable tomorrow."

Language: Jupyter Notebook - Size: 278 KB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

iresil/FowlFlightForensics

A Kafka-based CSV parser for bird-related airplane accidents

Language: Java - Size: 5.23 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

MAHMOUD2ABDALLAH/Bike-Sales

It was a competition on KAGGLE for prediction on the most sales products on bikes via their features

Language: Python - Size: 358 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

karlyndiary/Global-Electronics-Retailer-Sales-and-Customer-Insights

Developed an analysis using Python, SQL, and Excel to examine sales and customer demographics for a Global Electronics Retailer. The findings aim to enhance business strategies and improve overall performance.

Language: Jupyter Notebook - Size: 52.5 MB - Last synced at: 29 days ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

RandomGamingDev/grabcraft-to-schema

A Python library and its cli for converting grabcraft to schema (more specifically litematica schematic) files

Language: Python - Size: 56.4 MB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 1

M-Z-5474/swiggy_sales_predictor

A machine learning-powered Streamlit web application that predicts product sales based on item and outlet features. Built using Python, XGBoost, and deployed on Streamlit Cloud, the app also features interactive EDA and model insights for real-world business use cases.

Language: Jupyter Notebook - Size: 1.73 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1 - Forks: 0

hinmfm/Top-CV-BA-Job-Analysis

Dự án thực hiện nhằm phân tích tình hình thị trường công việc BA tại Hà Nội dựa trên dữ liệu cào về từ TopCV

Language: Jupyter Notebook - Size: 4.67 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

jana6604/COVID-19-Exploratory-Data-Analysis-EDA-Project

Language: Jupyter Notebook - Size: 2.54 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 1 - Forks: 0

nithyaak7/Google-Play-Store-Analysis

Understanding patterns and relationships in the Google Playstore Dataset from Kaggle and Visualizing the findings.

Language: Jupyter Notebook - Size: 9.69 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 1 - Forks: 0

TharunRacharla/Sports_Celebrity_Image_Classification-project

Language: Jupyter Notebook - Size: 109 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 1 - Forks: 0

sbrzt/castelli

Data management for Emilia-Romagna Castles project.

Language: Jupyter Notebook - Size: 9.35 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

Seif-Elkerdany/Attendance_System

This is our project for image processing course in our university (AIU)

Language: Python - Size: 15.5 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 1

gurjeevanmalhi/Netflix-Analysis

Analyzing Netflix data

Size: 8.79 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

WhereisHussain/Data-Science

Projects related Data Visualisation, Cleaning, Preprocessing, Machine Learning, Deep Learning, ANN and CNN Projects and Model Training and Model Evaluation

Language: Jupyter Notebook - Size: 451 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Seyigate/Sales-vs-Customer-Ratings-Analysis-in-Power-BI

The goal is to understand whether higher customer ratings drive more sales and how businesses can optimize their strategies using data-driven insights to identify key sales patterns

Size: 301 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

madhurimarawat/Data-Warehousing

This repository contains practical examples of data warehousing concepts, including star schema and ETL processes, all implemented using MySQL.

Language: Jupyter Notebook - Size: 10.5 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

utkarsh-284/Cyclistic-Case-Study

This repository is for "Cyclistic" Case Study

Language: Jupyter Notebook - Size: 76.2 MB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

MAHMOUD2ABDALLAH/family-members-segmentation

Machine learning Classification for Family Determination for various generations by their age, height, weight, etc...

Language: Python - Size: 629 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

hari255/Visual-Story-Telling

Developed Interactive visualizations and a Shiny Dashboard using using R from a complicated and in-complete time-series dataset.

Language: HTML - Size: 17.8 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

Adesh1214/Banking-And-Finance-Analysis-PowerBI

This project is a Banking and Financial Transaction Analysis Dashboard created using Power BI. The primary goal is to provide actionable insights into customer demographics, branch performance, and financial transactions for effective decision-making.

Size: 10.9 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

shrawans007/hotel_customers_sentiments

Sentiment Analysis for a Hotel Based on Customer's Reviews

Size: 14.6 KB - Last synced at: 4 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

Anarya22/Tata-Data-Visualization-Empowering-Business-with-Effective-Insights-Job-Simulation-on-Forage

Completed a simulation involving creating data visualizations for Tata Consultancy Services. Created visuals for data analysis to help executives with effective decision making.

Size: 21.8 MB - Last synced at: 4 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

SudarshanaSRao/From-Data-to-Gold--My-Journey-Creating-an-Olympic-Tableau-Dashboard

Developed an interactive dashboard using Tableau with Kaggle’s Olympic dataset.

Language: Jupyter Notebook - Size: 269 KB - Last synced at: 5 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

girish119628/CodSoft

Data Enthusiast | Predictive Modeler | Turning Insights into Strategies

Language: Jupyter Notebook - Size: 1.16 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

mayankyadav23/T20I-World-Cup-2024-Analysis

Explore my Jupyter Notebook 📊 featuring comprehensive datasets and visualizations from the 2024 T20 World Cup analysis. Discover key insights into player performances 🏏, match statistics 📈, and team dynamics, making it a valuable resource for cricket enthusiasts and analysts alike! 🌟et enthusiasts and analysts alike!

Language: HTML - Size: 2.73 MB - Last synced at: 4 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

Hamada-khairi/PFDA-Hamada

A comprehensive R-based data analysis project that examines housing rental patterns across multiple cities, utilizing statistical methods and visualization techniques to analyze 4,746 properties' data points including rent prices, locations, and amenities. The project employs various R libraries to clean, process, and visualize rental market trends

Language: R - Size: 3.68 MB - Last synced at: 4 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

sanasayyed2001/Enhancing-Human-Resource-Operations-Through-Tableau-Visualizations

This project analyzes HR data using Tableau to uncover insights that optimize Human Resource operations. By visualizing key metrics such as staffing, salary distribution, gender balance, and performance trends, the dashboard supports data-driven decisions in areas like employee retention, salary management, and workforce diversity.

Size: 290 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

DA-Atharv/Movie_Rental_Analysis

This is a Capstone Project which provides an in-depth analysis of the Shakila DVD Rental Store, utilizing Excel, SQL, and Power BI to deliver actionable insights and dynamic dashboard visualization for enhancing business operations and customer experiences in the movie rental industry.

Size: 12.8 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

ShrishtiHore/AB-Testing-and-Predictive-Analysis-for-New-Menu-Launch-using-Alteryx

The coffee restaurant will test a new menu in Denver and Chicago using TV ads to see if it boosts profits by at least 18%, justifying the marketing costs, and needs an analysis to decide on a wider rollout.

Size: 22.6 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

shwetaagrey/SharkTankDataDive

Explore Shark Tank investments with SQL. Uncover insights, success rates, and industry preferences.

Size: 3.91 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

sumitdeole/data-cleaning-project

E-commerce use case: This project conducts a comprehensive data cleaning exercise on the eCommerce data.

Language: Jupyter Notebook - Size: 1.24 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Shwethapoojary-20/Data-analysis-with-python

Data cleaning , analysis and visualization of 4 different sector's data

Language: Jupyter Notebook - Size: 10.7 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

MSnellgrove/Springboard

Projects and case studies from my time studying Data Science with USF

Language: Jupyter Notebook - Size: 26 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Rama-Mwenda/Indian_Ecosystem_Analysis

A project analyzing the Indian startup ecosystem between 2018 and 2021.

Language: Jupyter Notebook - Size: 176 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

rajesh9943/Market-Domination-and-Exploring-Trends-in-2023-Indian-Car-Sales-with-Exploratory-Data-Analysis

The report also reveals the dominance of the Top 25 best-selling car models. These top sellers captured a substantial portion of the market, accounting for more than 75% of the total cars sold in April 2023 (likely a typo, referring to May 2023). This suggests a preference for established and popular car models among Indian consumers.

Language: Jupyter Notebook - Size: 1.24 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

doshiharmish/NYC-Green-Taxi-Trip-Analysis

Explore NYC Green Taxi data, predicting fares and optimizing pickup locations using machine learning. Regression models uncover travel patterns and enhance taxi services for an efficient urban transport experience.

Language: Jupyter Notebook - Size: 1.05 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

epmanu185/Restaurant_order_analysis

Analyze order data to identify the most and least popular menu items and types of cuisine

Size: 139 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

arbaazkhaan/FIFA-Dataset-Refinement

Welcome to the FIFA Dataset Data Cleaning and Transformation project! This initiative focuses on refining and enhancing the FIFA dataset to ensure it is well-prepared for in-depth analysis. The project involves a comprehensive data cleaning process and transformation of key features to improve data quality and usability.

Language: Jupyter Notebook - Size: 11.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

arbaazkhaan/Developer-Insights-Project

This project delves into comprehensive insights extracted from the Stack Overflow Developer Survey 2022. The dataset provides a rich source of information about developers' demographics, coding experience, compensation, and various aspects of their professional lives.

Language: Jupyter Notebook - Size: 2.63 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

nikhilbordekar/EDA-of-Play-Store-Data

This project is an EDA endeavor that delves into the world of Google Play Store data. This project uncovers valuable trends, patterns, and statistics within the Play Store ecosystem. From app categories and user reviews to pricing and app sizes, the project offers a comprehensive analysis of this dynamic and ever-expanding marketplace.

Language: Jupyter Notebook - Size: 966 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Eoinmark/Python-Projects

This repository is a compilation of my academic and personal projects accomplished using Python. The most common libraries used in these projects include NumPy, Pandas, Scipy, and Matplotlib. Also contained in this repository are the certificates I gained by upskilling in pursuit of my passion and eagerness for Data Science.

Language: Jupyter Notebook - Size: 5.86 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

bartek410/Capstone-Project

This is my BrainStation capstone project on music genre classification. I use supervised and unsupervised learning to classify songs into genre based on specific attributes from two Spotify datasets..

Language: Jupyter Notebook - Size: 41.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

morikaglobal/waterpipe_breakage_data_analysis

Data Analysis of potential factors affecting water pipe breakage

Language: Jupyter Notebook - Size: 21.3 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

VivekAgrawl/food-app-analysis

In this project, a real-world dataset from Zomato, one of the most widely used food ordering platforms, was worked on.

Language: Python - Size: 7.6 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

VivekAgrawl/medical-appointments-analysis

This project involves analyzing real-world medical appointment data through Time Series Analysis. The tasks include dataset cleaning, comprehensive analysis, and extracting insights using Python and MySQL.

Language: Python - Size: 2.38 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

RajveerAlive/Data-analytics

Data analyst trainee

Size: 19.6 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

Benazir023/BookReviewAnalysis_efficient_workflow

This is a Dataquest project that focuses on creating an efficient workflow

Size: 318 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

ericsun153/Illuminating_US_Outage_Landscape

Power Outage Data Analysis in USA

Language: Jupyter Notebook - Size: 26.5 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 1

Anna-Ann11/big4-risk-compliance-excel

Excel dashboard analyzing risk and compliance trends among Big Four accounting firms (Deloitte, PwC, EY, KPMG). Includes firm-wise audit data, compliance scores, and KPI visualizations across multiple years.

Size: 186 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

ArthurdxvMods/laptop_price_prediction

Predict laptop prices using machine learning with a Random Forest model. Explore data, build a pipeline, and interact through a Streamlit app. 🖥️📊

Language: Jupyter Notebook - Size: 3.89 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

Aritgon/Real-Estate-DWH

This project showcases an end-to-end real estate data pipeline using Python, SQL, and Power BI. It involves cleaning raw housing data, building a star-schema data warehouse in PostgreSQL, and visualizing insights through an interactive Power BI dashboard. The goal is to simulate a real-world property analytics system for smarter decision-making.

Language: Jupyter Notebook - Size: 65.7 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

SyedaNimraFatima/Coffee-Shop-Sales-Analysis-SQL-PowerBI

A dynamic Power BI dashboard for analyzing sales, product trends, and customer behavior across NYC coffee shops. Built using Excel, DAX, and custom visuals to support business intelligence and decision-making.

Size: 8.33 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

Calebtheman116/hotel_customers_sentiments

Sentiment Analysis for a Hotel Based on Customer's Reviews

Size: 1.95 KB - Last synced at: 6 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

ameerahrazali/superstore-analysis

Interactive Excel dashboard project analyzing Superstore sales and product performance using Power Query, Power Pivot, and derived KPIs & as a foundational layout for future Power BI modeling.

Size: 397 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

KiranMallik/Sales-vs-Customer-Ratings-Analysis-in-Power-BI

The goal is to understand whether higher customer ratings drive more sales and how businesses can optimize their strategies using data-driven insights to identify key sales patterns

Size: 287 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

ahmedhaha657/COVID-19-Exploratory-Data-Analysis-EDA-Project

Explore global COVID-19 trends with this EDA project using Python and Power BI. Analyze data, visualize insights, and understand the pandemic's impact. 🦠📊

Language: Jupyter Notebook - Size: 1.88 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

protham02/FIFA-23-Analysis

Cleaned, analyzed, and visualized the FIFA 23 dataset using Python and Power BI to extract key player insights and performance trends.

Language: Jupyter Notebook - Size: 10.9 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

Partym32/Top-CV-BA-Job-Analysis

Analyze BA job market trends in Hanoi using data scraped from Top CV. Explore insights for job seekers and employers. 🖥️📊

Language: Jupyter Notebook - Size: 188 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

Manishdebnath99/job-posting-analysis

Data-Driven Analysis of Job Descriptions

Language: Jupyter Notebook - Size: 56.6 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 1

dharmender12/quotes-scraping-sql-eda

This is build week project of masai.

Language: Jupyter Notebook - Size: 2.49 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

DatabyRose/my-data-portfolio

A growing collection of data analytics projects by me—a former Business Analyst turned data storyteller. From SQL to Python, dashboards to predictions, each case study reflects real-world impact, curiosity, and a passion for turning data into decisions.

Size: 2.93 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

kshitija-chilbule01/Stack-Overflow-survey-dataset-cleaning-and-analysis

Stack Overflow Annual Developer Survey Dataset Cleaning and Analysis with Python and Power BI. The data comprises the results of the survey conducted by Stack Overflow in the year 2024.

Size: 17.6 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

MahammedRehman/2021-Online-Sales-Dashboard-Excel-vs-AI

In this project, I compared two dashboards created from the same dataset -- one manually built in Excel and the other auto-generated using Quadratic AI. The comparison focused on visual clarity, insights quality, and build efficiency.

Size: 215 KB - Last synced at: 13 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

hossein-rahmati/Airbnb-property-dataset

This project explores, cleans, and analyzes an Airbnb property dataset to uncover insights related to listings, pricing, and availability. The goal is to better understand patterns in Airbnb listings, detect outliers, and prepare data for potential machine learning models or business insights.

Language: Jupyter Notebook - Size: 3.13 MB - Last synced at: 9 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

AARYAN-O/CineStack-360

CineFlow 360 is an end-to-end movie data engineering pipeline powered by Databricks, featuring real-time streaming, robust ETL processes, user validation, ABC and DQ frameworks, automatic data cleaning between stages, and AI-generated reports using Gemini.

Language: Python - Size: 5.79 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

ssolik/everdoor_emporium_project

This project was designed to explore and analyze customer engagement and marketing performance for Everdoor Emporium in 2024 across both digital and physical channels.

Size: 10.4 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 0 - Forks: 0

Pankajjoshi11/walmart_retail_analysis

A complete data-driven project that analyzes Walmart-style retail transactions, extracts actionable insights, and predicts customer satisfaction levels using machine learning. The project includes exploratory data analysis, feature engineering, data cleaning, class imbalance handling, and model building with a robust ML pipeline.

Language: Jupyter Notebook - Size: 354 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 0 - Forks: 0

hassanfarhan777/code2prompt-llm-fine-tuning

Efficient, reproducible dataset curation for LLM fine-tuning: scripts and best practices for preparing code datasets without repository bloat.

Language: Jupyter Notebook - Size: 37.1 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Kshitija-Chilbule15/Python-Data-Cleaning-FE-and-EDA-Projects

Python🐍: Data Cleaning, Feature Engineering, and EDA Projects

Language: Jupyter Notebook - Size: 2.86 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

dtbkhanh/Data-Analytics-and-Reports

Collection of data analysis projects and interactive dashboards for various datasets.

Language: Jupyter Notebook - Size: 5.56 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

MallikaUppuganti/TATA_Data_Visualisation

TATA Data Visualisation: Empowering Business with Effective Insights is a virtual intership programme where we analyze the data that would inform leadership decisions across the relevant Stakeholders.

Size: 28.2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

RushiChinagounolla/layoffs-data-cleaning-sql

SQL-based project focused on cleaning and analyzing real-world layoffs data.

Size: 0 Bytes - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

MostafaGmalFouda/Heart-Disease-Predictor

this is my graduation project from the Egypt Digital Pioneers Initiative, in which we worked on a model for predicting healthcare.

Language: Python - Size: 235 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

MostafaGmalFouda/HeartDisease-DEPI-project

This is my graduation project from the Egypt Digital Pioneers Initiative, in which we worked on a model for predicting healthcare.

Language: Jupyter Notebook - Size: 1.14 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

bensbehChaimae/car-data-intermediate-preprocessing

This repository contains intermediate-level data preprocessing scripts to clean, transform, and prepare car dataset for machine learning models.

Language: Jupyter Notebook - Size: 5.39 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

AbubakarPungiwale/Machine-Learning-Models

A GitHub repository featuring diverse machine learning models implemented with Python. It includes algorithms like Support Vector Machine (SVM), Random Forest, Decision Tree, K-Nearest Neighbors (KNN), Linear Regression, Logistic Regression, Naive Bayes, and Gradient Boosting. The repository covers data preprocessing,

Language: Jupyter Notebook - Size: 4.91 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Kaustubhsutar/PowerBI-Sales-Insight-Atliq-Hardware

An end-to-end Power BI project featuring data cleaning, star schema modeling, DAX-driven KPIs, and three interactive dashboards—delivering actionable insights on sales performance, profitability, and market trends.

Size: 5.62 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

jrili/data-engineer-portfolio

Jessa Rili-Migriño's Data Engineer Portfolio

Size: 24.4 KB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

R-Shing/waze-user-retention

Exploratory Data Analysis and Modeling of Waze User Churn and Retention Rates

Language: Jupyter Notebook - Size: 5.18 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

aruppatra04/End-to-End-Data_Warehouse-Pipeline

Building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.

Language: TSQL - Size: 2.17 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

EvaSamoilenko/Monster.com-jobs

Проект по обработке заранее не обработанного Monster.com jobs датасета о вакансиях для многостороннего изучения данных в дальнейшем.

Language: Jupyter Notebook - Size: 167 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

gwentzel26/BCT_Dashboards

This project aimed to uncover meaningful insights into rider behavior and transit system performance using visual dashboards and spatial analysis tools

Size: 18.6 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

focustechnomedia/Student_performance_ML_Project

Smart Student Performance Prediction App using ML and Django A web platform that predicts student outcomes using academic and behavioral data. It features data cleaning, EDA, feature engineering, and a Random Forest model. Includes dashboards for students, teachers, and admins with personalized stats, alerts, and PDF reports.

Language: Jupyter Notebook - Size: 3.02 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Abhijeet107/Final-project

Final project summation INTERNSHIP PROJECTS (2 WEEKS)

Language: Jupyter Notebook - Size: 6.89 MB - Last synced at: 27 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

jrili/ibm-etl-car-dealership

ETL project on car dealership data taken from IBM Python project for Data Engineering on Coursera.

Language: Jupyter Notebook - Size: 29.3 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

jrili/ibm-project-world-largest-banks

Web scraping + ETL project: Extract and compile information about the 10 largest banks in the world. From IBM via Coursera.

Language: Jupyter Notebook - Size: 698 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

jrili/datacamp-cleaning-bank-marketing

Data cleaning project on bank marketing campaign data from Datacamp

Language: Jupyter Notebook - Size: 521 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Samuelson777/Dominos-Predictive-Purchase-Order-System

Optimize ingredient ordering for Dominos with our Predictive Purchase Order System. This project uses historical sales data to forecast demand, ensuring optimal stock levels, reducing waste, and preventing stockouts. Join us in enhancing efficiency!

Language: Jupyter Notebook - Size: 969 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Samuelson777/CarDheko-UsedCarPricePrediction

This project leverages machine learning to enhance customer experience in the used car market by predicting car prices based on features like make, model, and year. The model is integrated into an interactive Streamlit web application for user-friendly access.

Language: Jupyter Notebook - Size: 13.9 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Samuelson777/DataSpark-Illuminating-Insights-for-Global-Electronics

Explore our EDA project aimed at uncovering insights from Global Electronics' data. Discover actionable recommendations to enhance customer satisfaction, optimize operations, and drive business growth. Join us on this data-driven journey!

Language: Jupyter Notebook - Size: 1.04 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

ailasheri83/data-science-portfolio

A portfolio showcasing my data science and analysis projects.

Language: HTML - Size: 1.54 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Related Topics
data-visualization 105 data-analysis 83 python 66 exploratory-data-analysis 45 data-science 34 sql 32 pandas 32 machine-learning 28 excel 23 jupyter-notebook 20 feature-engineering 19 data-analytics 19 powerbi 18 data-cleaning 18 eda 16 matplotlib 15 numpy 15 tableau 15 python3 12 etl 10 data-transformation 10 machine-learning-algorithms 10 business-intelligence 9 seaborn 8 tableau-public 7 data-mining 7 data 7 tableau-dashboards 7 linear-regression 7 visualization 7 data-analysis-python 7 pivot-tables 7 sentiment-analysis 7 dashboards 7 regression 6 model-deployment 6 hyperparameter-tuning 6 random-forest 6 predictive-modeling 6 cross-validation 6 kaggle-dataset 6 matplotlib-pyplot 6 mysql 5 sklearn 5 dashboard 5 microsoft-excel 5 r 5 model-training-and-evaluation 5 natural-language-processing 5 data-wrangling 5 data-visualisation 4 machine-learning-models 4 business-analytics 4 model-evaluation 4 neural-networks 4 random-forest-classifier 4 decision-tree-classifier 4 supervised-learning 4 database 4 data-engineering 4 statistical-analysis 4 power-bi 4 web-scraping 4 predictive-analytics 4 ai 4 github 3 pandas-python 3 retail 3 reporting 3 data-collection 3 mysql-database 3 deep-learning 3 data-analyst 3 data-warehousing 3 logistic-regression 3 regression-models 3 pandas-library 3 nlp-machine-learning 3 streamlit 3 descriptive-statistics 3 data-integration 3 time-series-analysis 3 sql-server 3 regression-analysis 3 pandas-dataframe 3 supervised-machine-learning 3 scikit-learn 3 docker 3 data-manipulation 3 exploratory-data-visualizations 3 business-insights 3 classification 3 data-visualization-project 3 data-visualization-dashboard 3 pyspark 2 github-config 2 random-forest-regression 2 data-correlation 2 dashboard-application 2 price-prediction-model 2