An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: data-cleaning-and-preprocessing

TharunRacharla/Sports_Celebrity_Image_Classification-project

Language: Jupyter Notebook - Size: 109 MB - Last synced at: about 12 hours ago - Pushed at: about 14 hours ago - Stars: 1 - Forks: 0

AARYAN-O/CineStack-360

CineFlow 360 is an end-to-end movie data engineering pipeline powered by Databricks, featuring real-time streaming, robust ETL processes, user validation, ABC and DQ frameworks, automatic data cleaning between stages, and AI-generated reports using Gemini.

Language: Python - Size: 5.79 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

Calebtheman116/hotel_customers_sentiments

Sentiment Analysis for a Hotel Based on Customer's Reviews

Size: 1.95 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

KiranMallik/Sales-vs-Customer-Ratings-Analysis-in-Power-BI

The goal is to understand whether higher customer ratings drive more sales and how businesses can optimize their strategies using data-driven insights to identify key sales patterns

Size: 287 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

ssolik/everdoor_emporium_project

This project was designed to explore and analyze customer engagement and marketing performance for Everdoor Emporium in 2024 across both digital and physical channels.

Size: 10.4 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

Pankajjoshi11/walmart_retail_analysis

A complete data-driven project that analyzes Walmart-style retail transactions, extracts actionable insights, and predicts customer satisfaction levels using machine learning. The project includes exploratory data analysis, feature engineering, data cleaning, class imbalance handling, and model building with a robust ML pipeline.

Language: Jupyter Notebook - Size: 354 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

Mindful-AI-Assistants/SP2024-Election-Analysis

📊 An analysis of voting patterns in São Paulo's 2024 elections, focusing on voter behavior, absenteeism, and geographic trends.

Language: HTML - Size: 86.6 MB - Last synced at: 5 days ago - Pushed at: 7 days ago - Stars: 7 - Forks: 3

hassanfarhan777/code2prompt-llm-fine-tuning

Efficient, reproducible dataset curation for LLM fine-tuning: scripts and best practices for preparing code datasets without repository bloat.

Language: Jupyter Notebook - Size: 37.1 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

Kshitija-Chilbule15/Python-Data-Cleaning-FE-and-EDA-Projects

Python🐍: Data Cleaning, Feature Engineering, and EDA Projects

Language: Jupyter Notebook - Size: 2.86 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

dtbkhanh/Data-Analytics-and-Reports

Collection of data analysis projects and interactive dashboards for various datasets.

Language: Jupyter Notebook - Size: 5.56 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

MallikaUppuganti/TATA_Data_Visualisation

TATA Data Visualisation: Empowering Business with Effective Insights is a virtual internship programme in which we analyze data to inform leadership decisions across the relevant stakeholders.

Size: 28.2 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

RushiChinagounolla/layoffs-data-cleaning-sql

SQL-based project focused on cleaning and analyzing real-world layoffs data.

Size: 0 Bytes - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

MostafaGmalFouda/Heart-Disease-Predictor

This is my graduation project from the Egypt Digital Pioneers Initiative, in which we worked on a model for predicting healthcare.

Language: Python - Size: 235 KB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 0

MostafaGmalFouda/HeartDisease-DEPI-project

This is my graduation project from the Egypt Digital Pioneers Initiative, in which we worked on a model for predicting healthcare.

Language: Jupyter Notebook - Size: 1.14 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 0

bensbehChaimae/car-data-intermediate-preprocessing

This repository contains intermediate-level data preprocessing scripts to clean, transform, and prepare a car dataset for machine learning models.

Language: Jupyter Notebook - Size: 5.39 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

WhereisHussain/Data-Science

Projects related to Data Visualisation, Cleaning, Preprocessing, Machine Learning, Deep Learning, ANN and CNN, Model Training, and Model Evaluation

Language: Jupyter Notebook - Size: 451 KB - Last synced at: 11 days ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

AbubakarPungiwale/Machine-Learning-Models

A GitHub repository featuring diverse machine learning models implemented with Python. It includes algorithms like Support Vector Machine (SVM), Random Forest, Decision Tree, K-Nearest Neighbors (KNN), Linear Regression, Logistic Regression, Naive Bayes, and Gradient Boosting. The repository covers data preprocessing,

Language: Jupyter Notebook - Size: 4.91 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

omari-kd/TransBorder-Freight-Data-Analysis

This project analyses transportation data from the Bureau of Transportation Statistics (BTS) to uncover insights into cross-border freight's efficiency, safety and environmental impacts across road, rail, air and water modes.

Language: R - Size: 346 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 0

Kaustubhsutar/PowerBI-Sales-Insight-Atliq-Hardware

An end-to-end Power BI project featuring data cleaning, star schema modeling, DAX-driven KPIs, and three interactive dashboards—delivering actionable insights on sales performance, profitability, and market trends.

Size: 5.62 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

jrili/data-engineer-portfolio

Jessa Rili-Migriño's Data Engineer Portfolio

Size: 24.4 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

R-Shing/waze-user-retention

Exploratory Data Analysis and Modeling of Waze User Churn and Retention Rates

Language: Jupyter Notebook - Size: 5.18 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

aruppatra04/End-to-End-Data_Warehouse-Pipeline

Building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.

Language: TSQL - Size: 2.17 MB - Last synced at: 16 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

EvaSamoilenko/Monster.com-jobs

A project for processing the previously unprocessed Monster.com jobs dataset of job postings, so that the data can later be studied from multiple angles.

Language: Jupyter Notebook - Size: 167 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

sbrzt/castelli

Data management for Emilia-Romagna Castles project.

Language: Jupyter Notebook - Size: 9.35 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

gwentzel26/BCT_Dashboards

This project aimed to uncover meaningful insights into rider behavior and transit system performance using visual dashboards and spatial analysis tools

Size: 18.6 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

focustechnomedia/Student_performance_ML_Project

Smart Student Performance Prediction App using ML and Django. A web platform that predicts student outcomes using academic and behavioral data. It features data cleaning, EDA, feature engineering, and a Random Forest model. Includes dashboards for students, teachers, and admins with personalized stats, alerts, and PDF reports.

Language: Jupyter Notebook - Size: 3.02 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

gurjeevanmalhi/Netflix-Analysis

Analyzing Netflix data

Size: 8.79 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

Seif-Elkerdany/Attendance_System

This is our project for the image processing course at our university (AIU)

Language: Python - Size: 15.5 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 1

MAGICS-LAB/SMUTF (fork of fireindark707/Python-Schema-Matching)

[Information System] SMUTF: Schema Matching Using Generative Tags and Hybrid Features

Language: Python - Size: 21.1 MB - Last synced at: about 8 hours ago - Pushed at: 2 months ago - Stars: 3 - Forks: 2

Abhijeet107/Final-project

Final project summation INTERNSHIP PROJECTS (2 WEEKS)

Language: Jupyter Notebook - Size: 6.89 MB - Last synced at: 5 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

jrili/ibm-etl-car-dealership

ETL project on car dealership data taken from IBM Python project for Data Engineering on Coursera.

Language: Jupyter Notebook - Size: 29.3 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

AishwaryaGade02/Analyzing-Animated-Movie-Release-Date-Patterns-and-its-Effect-on-Revenue

Predicting the release date for the anime movie to maximize the revenue of the movie

Language: Jupyter Notebook - Size: 439 KB - Last synced at: 24 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

jrili/ibm-project-world-largest-banks

Web scraping + ETL project: Extract and compile information about the 10 largest banks in the world. From IBM via Coursera.

Language: Jupyter Notebook - Size: 698 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

jrili/datacamp-cleaning-bank-marketing

Data cleaning project on bank marketing campaign data from Datacamp

Language: Jupyter Notebook - Size: 521 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Samuelson777/Dominos-Predictive-Purchase-Order-System

Optimize ingredient ordering for Dominos with our Predictive Purchase Order System. This project uses historical sales data to forecast demand, ensuring optimal stock levels, reducing waste, and preventing stockouts. Join us in enhancing efficiency!

Language: Jupyter Notebook - Size: 969 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Samuelson777/CarDheko-UsedCarPricePrediction

This project leverages machine learning to enhance customer experience in the used car market by predicting car prices based on features like make, model, and year. The model is integrated into an interactive Streamlit web application for user-friendly access.

Language: Jupyter Notebook - Size: 13.9 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Samuelson777/DataSpark-Illuminating-Insights-for-Global-Electronics

Explore our EDA project aimed at uncovering insights from Global Electronics' data. Discover actionable recommendations to enhance customer satisfaction, optimize operations, and drive business growth. Join us on this data-driven journey!

Language: Jupyter Notebook - Size: 1.04 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Seyigate/Sales-vs-Customer-Ratings-Analysis-in-Power-BI

The goal is to understand whether higher customer ratings drive more sales and how businesses can optimize their strategies using data-driven insights to identify key sales patterns

Size: 301 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

ailasheri83/data-science-portfolio

A portfolio showcasing my data science and analysis projects.

Language: HTML - Size: 1.54 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

zodrickjohn/Data-Analysis-Power-BI-Project----Sales-Data-Analysis

This project presents a complete Sales Dashboard built in Power BI, analyzing product performance, revenue trends, and region-wise sales. It leverages DAX for custom KPIs and provides a business-friendly interface for data-driven decisions.

Size: 2.44 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

RKodah/Correlation-One-Datafolio

Submission requirement for Correlation One graduation. The repo contains every step of the process as well as the live dashboard and Google Colab for the python code.

Language: Jupyter Notebook - Size: 5.1 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

madhurimarawat/Data-Warehousing

This repository contains practical examples of data warehousing concepts, including star schema and ETL processes, all implemented using MySQL.

Language: Jupyter Notebook - Size: 10.5 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Nikol2004/Databases-and-Big-Data-2024

Designed a relational database from scratch using Spotify's most-streamed tracks dataset. Cleaned and normalized data, built queries to explore musical trends, and analyzed song features across platforms.

Language: Python - Size: 106 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Manishdebnath99/Build_Week_Project

SAL_BW_Project_1 – Analyzing Job description data to derive meaningful insights.

Language: Jupyter Notebook - Size: 33.7 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

iresil/FowlFlightForensics

A Kafka-based CSV parser for bird-related airplane accidents

Language: Java - Size: 5.23 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

NikitaPatil7/Netflix-dataset-Data-Cleaning-and-Preprocessing-

Data cleaning and exploratory data analysis (EDA) on the Netflix Titles dataset. This project covers preprocessing, handling missing values, standardization, and visual insights using Python and Pandas.

Language: Jupyter Notebook - Size: 1.66 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Daniel-Andarge/AiML-ethiopian-medical-biz-datawarehouse

The Ethiopian Medical Business Data Warehouse & Analytics Platform is a comprehensive data solution tailored to enhance the efficiency and efficacy of Ethiopia's healthcare and medical sectors.

Language: Jupyter Notebook - Size: 9.97 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 0

who-else-but-arjun/Convolve

This repository contains the projects developed for the Convolve PAN IIT AI-ML Hackathon, conducted by IDFC Bank. Predicting Credit Card Defaulters – A deep learning-based model to assess the risk of credit card default. Optimizing Email Engagement Time Slots – A machine learning model to determine the best time slots for personalised emails.

Language: Jupyter Notebook - Size: 7.19 MB - Last synced at: 7 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 1

jensoto/MPS-DataAnalytics

Academic portfolio of course work for my Master's in Data Analytics.

Size: 116 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

anupamsaha18/Fintech-Capstone-Project---Loan-Default-Analysis

The Loan Default Analysis project aims to identify key factors contributing to loan defaults by analyzing borrower profiles, financial data, and credit risk indicators. Using statistical methods, visualizations, and predictive modeling, the project provides insights to mitigate risks and improve lending strategies.

Size: 1.95 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

crazy-dot/Zomato-Data-Analysis

This project analyzes 50k Bengaluru restaurants from Zomato, focusing on 17 features like location and ratings. It cleans, explores, and visualizes data to improve services. Key visualizations include delivery, booking, location, and cost. The goal is to provide insights for better customer experiences.

Language: Jupyter Notebook - Size: 14.6 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

anthonypham26/ClassicalMachineLearning01

This repository contains a beginner-friendly introduction to Machine Learning, covering essential concepts such as data preprocessing, feature engineering, data visualization, and ML fundamentals.

Size: 379 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

shubhamgoyal575/Tableau-Visualization-Dashboard

This repository features interactive Tableau dashboards for sales performance and healthcare analysis. It includes insights on revenue trends, regional sales, patient demographics, and hospital occupancy for data-driven decision-making. 🚀

Size: 1.81 MB - Last synced at: 11 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

shubhamgoyal575/Sentiment-Analysis-NLP-

This project uses machine learning to classify text sentiment as positive, negative, or neutral. It includes data preprocessing, feature extraction, and models like Logistic Regression, SVM, and Random Forest. Built with Python and Scikit-Learn.

Language: Jupyter Notebook - Size: 1.9 MB - Last synced at: 11 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Patangaysheetal/DataAnalyticsPersonalProjects

Size: 29.9 MB - Last synced at: 11 days ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

angelicaakwarteng/Python-projects

This portfolio contains projects done using Python programming language to work on real-life data to gain insights. The individual projects cover random topics like defining my own functions, creating my own classes, exploratory data analysis and even predictive modelling using Jupyter notebook.

Language: Jupyter Notebook - Size: 3.2 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

karlyndiary/Global-Electronics-Retailer-Sales-and-Customer-Insights

Developed an analysis using Python, SQL, and Excel to examine sales and customer demographics for a Global Electronics Retailer. The findings aim to enhance business strategies and improve overall performance.

Language: Jupyter Notebook - Size: 52.5 MB - Last synced at: 7 days ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

venkat-0706/Black-Friday

Black Friday Sales Analysis explores customer demographics, purchasing behaviors, and product trends to uncover insights and patterns driving sales during Black Friday events.

Language: Jupyter Notebook - Size: 480 KB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 9 - Forks: 0

milestxne/oibsip

This repository contains different data analysis projects under my Oasis Infobyte Internship.

Language: Jupyter Notebook - Size: 20.5 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

gunagvn/Guna_ML_model

My First basic Machine Learning model

Language: Jupyter Notebook - Size: 214 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

MuhammadAdnan1998/Growth_Mindset_Challenge_Web_App_with_Streamlit

Data Sweeper is a Streamlit-based tool for CSV to Excel conversion, data cleaning, and visualization. Easily remove duplicates, fill missing values, select columns, and generate interactive charts. Perfect for data analysts and business professionals. 🚀

Language: Python - Size: 2.93 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

utkarsh-284/Cyclistic-Case-Study

This repository is for "Cyclistic" Case Study

Language: Jupyter Notebook - Size: 76.2 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

Wilfrida-Were/Retail-Sales-EDA-in-Python

Exploratory Data Analysis on Retail Sales Data in Python

Language: Jupyter Notebook - Size: 672 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

MAHMOUD2ABDALLAH/family-members-segmentation

Machine learning Classification for Family Determination for various generations by their age, height, weight, etc...

Language: Python - Size: 629 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

MAHMOUD2ABDALLAH/Bike-Sales

A Kaggle competition entry predicting the best-selling bike products from their features

Language: Python - Size: 358 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

TanmayBorse/Institionistic_fuzzy_approx_space

This model introduces a hybrid approach that utilizes rough sets on intuitionistic fuzzy approximation spaces for pre-processing and soft sets for post-processing, resulting in an effective decision-making solution.

Language: Jupyter Notebook - Size: 5.08 MB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

k4karann/healthdata_analysis_using_r

HealthCare Data analysis using R Lang includes data cleaning, data processing, and visualizations.

Language: R - Size: 506 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

rajesh9943/Market-Domination-and-Exploring-Trends-in-2023-Indian-Car-Sales-with-Exploratory-Data-Analysis

The report also reveals the dominance of the Top 25 best-selling car models. These top sellers captured a substantial portion of the market, accounting for more than 75% of the total cars sold in April 2023 (likely a typo, referring to May 2023). This suggests a preference for established and popular car models among Indian consumers.

Language: Jupyter Notebook - Size: 1.24 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

sumedhsp/Exploring-311-Service-Requests

311 Service Requests: Resource Utilization & Resolution Time Prediction - Foundations of Data Science - Final Project (CS-GY 6053)

Language: Jupyter Notebook - Size: 3.52 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

hari255/Visual-Story-Telling

Developed interactive visualizations and a Shiny dashboard using R from a complicated and incomplete time-series dataset.

Language: HTML - Size: 17.8 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

kathisnehith/Medicare-IP-hospital-Analysis

In-depth Data analysis and visualization of Medicare inpatient hospital data.

Size: 9.77 KB - Last synced at: 8 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

cudjoejosephine/Advanced-Excel

This repository contains all my Data Analytics projects in Excel

Size: 461 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Adesh1214/Banking-And-Finance-Analysis-PowerBI

This project is a Banking and Financial Transaction Analysis Dashboard created using Power BI. The primary goal is to provide actionable insights into customer demographics, branch performance, and financial transactions for effective decision-making.

Size: 10.9 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

Abhi-Pat/PowerBI_Sales-and-Transaction-Data-Analysis

Data Preparation: Imported and cleaned large datasets (Customer, Transaction, Product, and Sales) using DAX queries for column creation and Power Query for transformation. Visualization: Designed tailored dashboards for desktop and mobile, integrating Smart Narrative and AI visuals for actionable business insights.

Size: 1.32 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

shrawans007/hotel_customers_sentiments

Sentiment Analysis for a Hotel Based on Customer's Reviews

Size: 14.6 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

aninditaws/Questionnaire-Exploratory-Data-Analysis

A comprehensive EDA project for analyzing questionnaire results. Includes data cleaning, descriptive statistics, and visualizations to identify trends and patterns in survey responses.

Language: HTML - Size: 1.07 MB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Hamada-khairi/PFDA-Hamada

A comprehensive R-based data analysis project that examines housing rental patterns across multiple cities, utilizing statistical methods and visualization techniques to analyze 4,746 properties' data points including rent prices, locations, and amenities. The project employs various R libraries to clean, process, and visualize rental market trends

Language: R - Size: 3.68 MB - Last synced at: 4 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

LKEthridge/Integrated_Project_2

Integrated Project 2 from TripleTen

Language: Jupyter Notebook - Size: 15 MB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Daivik-Gangappa/SocialMediaInfluenceAnalysis

This project analyzes Twitter data to assess public sentiment on U.S. President Joe Biden before and after elections. Using machine learning models (Decision Trees, Random Forests, Naive Bayes, Logistic Regression), it predicts sentiment trends and their potential impact on future elections.

Language: Jupyter Notebook - Size: 1.67 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Anarya22/Tata-Data-Visualization-Empowering-Business-with-Effective-Insights-Job-Simulation-on-Forage

Completed a simulation involving creating data visualizations for Tata Consultancy Services. Created visuals for data analysis to help executives with effective decision making.

Size: 21.8 MB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

Chiugo-Nsoke/Student-Performance-Analysis

An analysis of student performance factors using Python, featuring data cleaning, EDA, and machine learning for prediction.

Language: Jupyter Notebook - Size: 777 KB - Last synced at: 4 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

LegallyNotBlonde/Tableau_Citi_Bike

Analyzed Citi Bike data to uncover trends in ride duration, peak usage, and station demand, offering recommendations to optimize bike availability

Language: Jupyter Notebook - Size: 23.7 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

SudarshanaSRao/From-Data-to-Gold--My-Journey-Creating-an-Olympic-Tableau-Dashboard

Developed an interactive dashboard using Tableau with Kaggle’s Olympic dataset.

Language: Jupyter Notebook - Size: 269 KB - Last synced at: 4 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

JohnnySolo/Data-Analysis-Project---Spotify-Hit-Songs

This is my first-year project for the course "Introduction to Statistics"

Size: 62.5 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

eshambr/Corpay-Cross-Sell-Strategy-Enhancement-and-Model-Building

This project refines Corpay's Cross-Sell Program by analyzing customer performance data and using predictive modeling to identify high-creditworthiness profiles, optimizing customer selection and profitability.

Language: R - Size: 736 KB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

anwarraif/contoh-dsf-porto

EDA here means Exploratory and Explanatory Data Analysis. I'm using the Global Super Store dataset from Kaggle.com. The superstore industry comprises companies that operate large spaces which store and supply large amounts of goods.

Language: Jupyter Notebook - Size: 10 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

Wilfrida-Were/Tech-Layoffs-Data-Cleaning-in-SQL

Tech Layoffs Data cleaning in SQL - Data Visualisation in Tableau

Language: Jupyter Notebook - Size: 461 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

aliiimaher/Laptop-Price-Prediction

This is an AI model for predicting laptop prices, trained on about 1200 data points.

Language: Python - Size: 9.46 MB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 8 - Forks: 1

harshita2234/Potato-Prices-Prediction

Project aims to forecast potato prices in India using LSTM, KNN, and Random Forest Regression, integrating historical data on prices, regional stats, and rainfall patterns. Targeting agricultural stakeholders for informed decision-making.

Language: Python - Size: 862 KB - Last synced at: 28 days ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

PatilNi3/PROJECT_POWER_BI

Global Superstore BI Dashboard

Size: 2.9 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 1

vamsi-krishnakOO7/Research-on-VIDWAN-Authors-Collaboration-Network

The following repository contains the files and data relevant to the study I did about the Evolution of Research Themes and Trends in India Across the Years via the VIDWAN Author's Collaboration Network

Language: Jupyter Notebook - Size: 7.02 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

Anu0408/House-Price-Prediction-MachineLearning-Application

A real-time, end-to-end machine learning application built with Flask and integrated with MLflow for tracking and model management. The application predicts house prices based on user input, leveraging trained regression models and providing a web interface for seamless interaction.

Language: Python - Size: 462 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

AsuquoAA/Ann_Arbor_Weather_Analysis_2005-2015

This project analyzes historical weather data from Ann Arbor, Michigan, collected by the National Centers for Environmental Information (NCEI) Global Historical Climatology Network daily (GHCNd).

Language: Jupyter Notebook - Size: 2.96 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

AsuquoAA/Big_4_Sports_Teams_and_City_Population_Analysis-2018-

Analysis of sports teams' win/loss ratios vs. metro area populations across NFL, NBA, MLB, and NHL.

Language: HTML - Size: 86.9 KB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

Pra5etya/Covid-in-Indonesia

Analysis of the spread of COVID-19 in Indonesia, based on daily cases, death and recovery rates, and regional categories based on WHO. This repository includes informative visualization data, risk level classifications, and data-driven analysis to understand the impact of the pandemic in different regions.

Language: Jupyter Notebook - Size: 55.7 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

Krisanth-21/T20-World-Cup-Top-11-Player-Analysis

This project analyzes the 2022 T20 World Cup data to determine the top 11 players based on their performance. I used ParseHub to collect data from the ESPNcricinfo website, then cleaned and transformed it with NumPy and pandas, Python libraries, and created dashboards in Power BI.

Language: Jupyter Notebook - Size: 474 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

RandomGamingDev/grabcraft-to-schema

A Python library and its CLI for converting grabcraft to schema (more specifically litematica schematic) files

Language: Python - Size: 56.4 MB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 2 - Forks: 1

AnnaAnastasy/Mushroom-Binary-Classification-EDA-ML

Explored and modeled a competition dataset of mushroom species, focusing on data cleaning, exploratory data analysis, and building machine learning models for accurate classification of edible and poisonous mushrooms.

Language: Jupyter Notebook - Size: 4.06 MB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

lmizner/CS249_DataScienceFundamentals

Course work from UCLA's CS249 - Data Science Fundamentals

Language: Jupyter Notebook - Size: 2.45 MB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

girish119628/CodSoft

Data Enthusiast | Predictive Modeler | Turning Insights into Strategies

Language: Jupyter Notebook - Size: 1.16 MB - Last synced at: 10 days ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

Related Keywords
data-cleaning-and-preprocessing (224), data-visualization (96), data-analysis (78), python (60), exploratory-data-analysis (43), sql (31), data-science (31), pandas (27), machine-learning (25), excel (21), jupyter-notebook (19), data-cleaning (18), feature-engineering (17), data-analytics (15), matplotlib (14), powerbi (14), tableau (14), numpy (12), eda (12), python3 (11), machine-learning-algorithms (10), data-transformation (10), etl (9), seaborn (7), tableau-dashboards (7), data-mining (7), linear-regression (7), tableau-public (7), sentiment-analysis (7), data (6), visualization (6), random-forest (6), data-analysis-python (6), business-intelligence (6), pivot-tables (6), model-deployment (6), cross-validation (6), kaggle-dataset (6), hyperparameter-tuning (6), predictive-modeling (5), natural-language-processing (5), model-training-and-evaluation (5), r (5), mysql (5), regression (5), dashboards (5), data-wrangling (5), microsoft-excel (5), sklearn (4), supervised-learning (4), random-forest-classifier (4), business-analytics (4), decision-tree-classifier (4), machine-learning-models (4), statistical-analysis (4), neural-networks (4), dashboard (4), matplotlib-pyplot (4), data-engineering (4), database (4), pandas-dataframe (3), reporting (3), data-manipulation (3), data-analyst (3), regression-models (3), model-evaluation (3), pandas-library (3), logistic-regression (3), docker (3), data-visualisation (3), pandas-python (3), predictive-analytics (3), regression-analysis (3), sql-server (3), nlp-machine-learning (3), data-integration (3), mysql-database (3), deep-learning (3), exploratory-data-visualizations (3), data-collection (3), streamlit (3), business-insights (3), classification (3), retail (3), descriptive-statistics (3), scikit-learn (3), time-series-analysis (3), data-visualization-dashboard (3), power-bi (3), data-visualization-project (3), supervised-machine-learning (3), github (3), etl-pipeline (2), data-warehousing (2), power-bi-dashboard (2), computer-vision (2), dimensionality-reduction (2), knn-regression (2), clustering (2), streamlit-application-development (2)