An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: cleaning-data

pyjanitor-devs/pyjanitor

Clean APIs for data cleaning. Python implementation of R package Janitor

Language: Python - Size: 11.3 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 1,431 - Forks: 173

laayad/Coffee-Sales-Insights

Explore coffee sales trends through data analysis and visualization. Discover insights using Excel and Tableau. ☕📊

Size: 1.71 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

steviecurran/wrangling-lecture

A lecture on introductory data wrangling for my 3rd year Physics and Space Sciene students.

Size: 2.93 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

Mohammed061/Transportation-and-logistics-Challenge

Analyzing logistics data to optimize shipment efficiency, reduce delays, and enhance supply chain visibility using Power BI. Insights include top routes, delays, supplier trends, and peak shipments.

Language: Jupyter Notebook - Size: 3.36 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

araafroyall/Cleaner-Royall

🚀 𝗔 𝗠𝗼𝘀𝘁 𝗔𝗱𝘃𝗮𝗻𝗰𝗲 𝗖𝗹𝗲𝗮𝗻𝗲𝗿 𝗙𝗼𝗿 𝗔𝗻𝗱𝗿𝗼𝗶𝗱 [Root]

Language: Java - Size: 11.2 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 151 - Forks: 8

tannerearsley/vehicle-sales-sql

Vehicle Sales Analysis: SQL Data Cleaning and Manipulation

Size: 113 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

Gebrehiwot-Tesfaye/customer-experiance-analysis

App Review Scraper & Analyzer

Language: Python - Size: 37.7 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

engineering87/SharpSanitizer

A .NET library for sanitizing and validating object properties using customizable rules to ensure clean and secure data

Language: C# - Size: 49.8 KB - Last synced at: 30 days ago - Pushed at: 30 days ago - Stars: 4 - Forks: 0

CyberCRI/refinedoc

python library for post-extraction refinement of text that may be derived from PDF extraction.

Language: Python - Size: 19.5 KB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 7 - Forks: 2

Nicetink/Effinitum-X

"System optimization tool created with WPF"

Language: C# - Size: 3.72 MB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

Erdincuzunlu/python-data-cleaning-cheatsheet

A practical 11-step Python cheatsheet for data cleaning tasks in data science.

Language: Python - Size: 2.93 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

prasanthg3/cleantext

An open-source package for python to clean raw text data

Language: Python - Size: 27.3 KB - Last synced at: 15 days ago - Pushed at: almost 2 years ago - Stars: 70 - Forks: 11

JuanParias29/BigDataProcessingProject

Este repositorio contiene un proyecto de análisis y procesamiento de datos a gran escala basado en la metodología CRISP-DM, enfocado en resolver preguntas de negocio dentro del ámbito educativo.

Language: Jupyter Notebook - Size: 4.3 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

IRedDragonICY/booruprompt

A simple web application built with NextJS to extract tags from booru websites. Just paste the URL of a booru post, and this tool will fetch and display the associated tags, ready for you to copy.

Language: TypeScript - Size: 774 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

idemio/tiny-clean

light-weight high-performance sanitizers for common use cases

Language: Rust - Size: 124 KB - Last synced at: 26 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

shellynagar27/Mobile-Sales-Analysis

Analyzed 2024 mobile sales data to uncover product trends, customer behavior, and regional insights using Power BI dashboards and structured data modeling.

Size: 5.86 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

mfakhriazhar/data-cleaning-sql

Therefore, this project focuses on the data cleaning process using MySQL to ensure the data used is clean, valid, and ready for analysis. With proper data cleaning, we can minimize the risk of errors and improve the quality of the insights generated.

Size: 0 Bytes - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

WillianMonteiro23/projetos-sql

Esta repositório contém projetos que utilizam SQL para análise de dados. Cada projeto explora conjuntos de dados, realizando consultas para extrair insights, transformar dados e visualizar resultados, demonstrando habilidades em gerenciamento de banco de dados e otimização de consultas.

Language: TSQL - Size: 60.8 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

AdhamAyoub/Breast-Cancer-Wisconsin-Diagnostic-

Training data using logistic regression to predict whether the patient is Diagnosed as (malignant or benign)

Language: Jupyter Notebook - Size: 120 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

Prathameshv07/CodeX-Drink-Insights

Solved CodeBasics Resume Project Challenge #6 leveraging PowerBI.

Size: 2.09 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Prathameshv07/Atliq-Mart-Analysis

Solved CodeBasics Resume Project Challenge #9 with the help of Python Notebook

Language: Jupyter Notebook - Size: 2.44 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

nikhiljsk/preprocess_nlp

A fast framework for pre-processing (Cleaning text, Reduction of vocabulary, Feature extraction and Vectorization). Implemented with parallel processing using custom number of processes.

Language: Python - Size: 58.6 KB - Last synced at: 27 days ago - Pushed at: about 3 years ago - Stars: 10 - Forks: 4

ElaWajdzik/SQL_Challenge_Case_Study_2---Pizza-Runner

This project 🍕 explores the Pizza Runner business using SQL Server. It involves data cleaning, analysis of pizza metrics, customer experience, and pricing optimization, aiming to improve business efficiency and decision-making.

Language: TSQL - Size: 25.4 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

gamal1osama/Bank-Customer-Churn

This project predicts bank customer churn using machine learning. It covers data cleaning, EDA, feature engineering, model benchmarking, and hyperparameter tuning. The final model is a tuned XGBoost classifier with strong performance.

Language: Jupyter Notebook - Size: 8.01 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

mehtadigisha/Clean-Visualize-Analyze

Clean Visualize Analyze

Language: Jupyter Notebook - Size: 533 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

SergeiNikolenko/LPCE

The LPCE project is designed to purify and process PDB structures to extract and filter ligands and remove unwanted components such as water molecules and junk ligands.

Language: Jupyter Notebook - Size: 17.8 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 1 - Forks: 0

jfernando-work/Cafe_SQLDataClean

This project focuses on cleaning and preparing raw transactional data from a cafe using SQL. The dataset includes item purchases (e.g. coffee, cake, cookies) with details such as quantity, pricing, location, and payment method.

Size: 13.7 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

WillianMonteiro23/projetos-excel

Repositório dedicado à análise de dados utilizando Excel. Aborda desde fórmulas básicas e avançadas até tabelas dinâmicas, gráficos, filtros e manipulação de dados. Focado na solução de problemas reais de negócios através do Excel

Size: 2.3 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

sowmyamaddali/Virginia-Air-Quality

Built an end-to-end air quality dashboard in Tableau Public using EPA AQS data, showcasing PM2.5 trends across time and region. Provided insights on daily spikes, weekday variations, and county-level pollution hotspots in Virginia.

Size: 0 Bytes - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

prakashpandey16/sql_data_warehouse_project

Building a modern data warehouse with SQL Server, including ETL Processes, data modeling, and analytics.

Language: TSQL - Size: 9.66 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

longNguyen010203/Youtube-Recommend-Master-ETL-Pipeline

A Data Engineering Project that implements an ETL data pipeline using Dagster, Apache Spark, Streamlit, MinIO, Metabase, Dbt, Polars, Docker. Data from kaggle and youtube-api

Language: Jupyter Notebook - Size: 701 KB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 21 - Forks: 2

Aman-Satone/Sales-Insights

India based Hardware company Sales Insights - A Data Analysis Project performed on Power Bi& SQL

Size: 1.93 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

albiagro/students-grade-prediction

This project focuses on data analysis and grade prediction for students enrolled in a Portuguese language course. Using the Student Alcohol Consumption dataset, we aim to understand how personal, social, and academic features influence students' final grades.

Language: Jupyter Notebook - Size: 605 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

nisha854/CodeX-Beverage-Marketing-Analysis

This project analyzes CodeX Beverage Marketing. Here I find problems and offer solutions, study consumer preferences, and improve brand awareness using competition analysis, marketing channels, and optimization in focus cities. Also examined purchase behavior, identify product development opportunities, strategize pricing and promotions.

Size: 4.58 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

jubileeamechi/PythonProjects

This repository contains a collection of Python projects focused on data analysis, machine learning, and automation. Each project showcases practical applications such as sentiment analysis, predictive modeling, and more, helping to enhance Python programming skills.

Language: Jupyter Notebook - Size: 124 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

notesjor/corpusexplorer2.0

Korpuslinguistik war noch nie so einfach...

Language: C# - Size: 32.5 MB - Last synced at: 11 days ago - Pushed at: 4 months ago - Stars: 23 - Forks: 3

hyslan/banco_facfar

Insert/Update de dados ao banco FACFAR

Language: R - Size: 31.4 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Meteor-Community-Packages/meteor-simple-schema

Meteor integration package for simpl-schema

Language: JavaScript - Size: 1.59 MB - Last synced at: 28 days ago - Pushed at: 3 months ago - Stars: 917 - Forks: 164

shellynagar27/Provide-Insights-for-a-Strategic-Merger-in-the-OTT-Domain

LioCinema and Jotstar's merger aims to create India's leading OTT platform. This project analyzes subscriber trends, content performance, engagement, and revenue to drive data-backed strategic decisions.

Size: 2.93 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

EmadBeltaje/gaza_flutter_cleaner

Clean all your flutter projects with one command line and save your disk space 🚀

Language: Dart - Size: 213 KB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 7 - Forks: 0

ndomah1/Learning-Microsoft-Excel

This repository offers a comprehensive tutorial on Microsoft Excel, covering pivot tables, essential formulas, lookup functions, conditional formatting, chart creation, and data cleaning techniques to empower users with effective data analysis skills.

Size: 10.7 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

shellynagar27/Transportation-and-logistics-Challenge

Analyzing logistics data to optimize shipment efficiency, reduce delays, and enhance supply chain visibility using Power BI. Insights include top routes, delays, supplier trends, and peak shipments.

Language: Jupyter Notebook - Size: 3.38 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

stephendotgg/vanisher 📦

Discord bot to bulk remove someone's messages in a server.

Language: Java - Size: 56.6 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

SPARTANX21/SQL-Data-Analysis-Healthcare-Project

SQL - Healthcare Dataset Analysis

Size: 545 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 14 - Forks: 4

VikkiezDev/AI-Global-Index-Analysis

This project analyzes the AI readiness of 62 countries using key indicators like government strategy, commercial activity, research, development, and infrastructure. Through data cleaning, EDA, and visualization, it identifies key drivers of AI adoption and competitiveness.

Language: Python - Size: 1.05 MB - Last synced at: 25 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

LiakosData/Stocks_Analysis

This project analyzes stock market data for six major stocks (AAPL, AMZN, NVDA, TSLA, GOOG, SPY) using Python and Power BI. It explores trends, volatility, and correlations.

Language: Jupyter Notebook - Size: 8.77 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

CodePurge/CodePurgeKit

Swift package providing shared data models, utilities, and SwiftUI components to manage and organize purgable items in Xcode environments.

Language: Swift - Size: 46.9 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

kevinwood15/Python_ML_KMeans_Project

This project uses the KMeans ML algorithm to identify segments of the broader population that form the core customer base of a company.

Language: Jupyter Notebook - Size: 275 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

kevinwood15/Python_ML_Classification_Modeling

This project uses GaussianNB, Random Forest, and AdaBoost Classification Models to predict the income category of individuals with US Census Data

Language: Jupyter Notebook - Size: 156 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Kingflow-23/wikipedia-topic-clustering

This project scrapes Wikipedia pages on various topics, processes the text using TF-IDF vectorization, and clusters the topics using KMeans. The results are visualized in a 2D plot using UMAP, providing insights into the relationships and groupings of different Wikipedia topics based on their content.

Language: Jupyter Notebook - Size: 1.86 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

kevinwood15/Python_Twitter_DataWrangling_Project

The main objectives of this project is to wrangle (clean) and analyze twitter data. I deal with some messy data, clean it, then plot some visualizations of the data to analyze it.

Language: Jupyter Notebook - Size: 150 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Yusuf-Abol/debris-in-the-matrix

In this project, I embark on the journey of cleaning messy data, comparing it to washing and tidying up a room.

Language: Jupyter Notebook - Size: 212 KB - Last synced at: 16 days ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

Sahil-113/Predictive-Analysis

Language: Jupyter Notebook - Size: 157 KB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

kingabzpro/Annual-Recycled-Energy-Saved-in-Singapore

Learn how much Singapore is saving energy per years by recycling plastics, paper, glass, ferrous and non-ferrous metal

Language: Jupyter Notebook - Size: 600 KB - Last synced at: 6 days ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 1

LinuxProgramador/Cleaning

Tool that allows you to safely delete multimedia files, without the possibility of recovering the content of the file.

Language: Python - Size: 18.6 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

Madhuresh2011/Gen-Z-project-Using-SQL

Gen-Z career aspiration response data analysis ,It aims to provide actionable strategies for businesses targeting this influential generation.

Size: 4.46 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

jackmnob/Python-Tableau-EDA-StockDash

Data cleaning, preparation, and manipulation (EDA) for an interactive stock market dashboard with Tableau - using pandas (Python) via JupyterLab

Language: Jupyter Notebook - Size: 503 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

sadidul/scrape_flight_data

Flight Data Scraping: Analysis and Visualizations in Tableau

Language: Jupyter Notebook - Size: 89.8 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

miftahulhadii/miftahulhadii.github.io

A portfolio website for sharing project created by Miftahul Hadi.

Language: HTML - Size: 32 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Pranathi0408/Sample-Superstore-Profit-Report

Power BI Project: Superstore profit report 📌 Project Overview This Power BI project provides interactive visualizations and data insights for sales analysis. The dashboard enables users to explore key performance metrics, identify trends, and make data-driven decisions.

Size: 10.7 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

AlbertBagdos256/Loan-Applications-Analytics

The comprehensive analytics of the data collections of client's loan applications.

Size: 1000 Bytes - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Dawid-switaj/Excel-Projects

In this Repository i share my projects done in Excel.

Size: 5.56 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

gre1wy/MathMod

KPI IPT course, 5 semester

Language: Jupyter Notebook - Size: 2.04 MB - Last synced at: 23 days ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

ganeshreddyt/DataVisualization-World

Skilled in creating clear and impactful visualizations using Excel and Tableau. Proficient in Excel's advanced charts, pivot tables, and formatting to analyze data effectively. Experienced in building interactive Tableau dashboards to present insights, track KPIs, and support data-driven decisions.

Size: 31.7 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

elkronos/helper_py

Helper functions in python

Language: Python - Size: 98.6 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Kisaa-Fatima/Amazon-Fashion-Products-Data-Processing-and-Database-Schema

Contains work on preprocessing, cleaning, and storing Amazon fashion product data in a MySQL database. The project builds on data collected in a previous assignment, focusing on ensuring data quality, handling image data, and designing a normalized database schema.

Language: Jupyter Notebook - Size: 11.9 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

Nitya-34/Lego-Dataset-Analysis

Language: Jupyter Notebook - Size: 259 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

ddzikri/mini-project

Mini Project Data Engineer at Alterra Academy

Language: Jupyter Notebook - Size: 10.2 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Anarya22/E-Commerce_Analysis

E-Commerce_Analysis is a data analysis project performed on the Superstore_USA dataset. It explores various aspects of e-commerce performance, including sales trends, customer demographics, product categories, and regional performance. The analysis includes data cleaning, visualizations, and insights on factors influencing sales and profitability.

Language: Jupyter Notebook - Size: 918 KB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

saob007/Modelado_retencion_personal_proyecto

Construcción de un modelo de aprendizaje automático que permite predecir si un empleado desertará o no de una empresa industrial de desarrollo automotriz

Language: Jupyter Notebook - Size: 18.2 MB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

AksharaBhavitha/COVID19-Analysis

This repository contains the analysis of COVID19 and Visualisations including the CSV file used.

Language: Jupyter Notebook - Size: 454 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Madhuresh2011/Amazon-Sales-Report-Analysis-Using-Python

This project focuses on analyzing Amazon sales data using Python to uncover insights into sales performance, customer behavior, and product trends

Language: Jupyter Notebook - Size: 4.23 MB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

andradejoaoantonio/PortfolioProjects

Hello everyone! I have created this repository to showcase my skills, share knowledge and track my progress in the Data Analytics and Science fields.

Language: Jupyter Notebook - Size: 77.2 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

DimitrisKatos/Retail_Sales_Amount

Language: Jupyter Notebook - Size: 1.3 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

Eman288/ML-With-Streamlit

This repository contains a collection of machine learning models deployed using Streamlit, a Python-based framework for building interactive web applications. The project demonstrates how to effectively integrate machine learning workflows with a user-friendly interface,making it easier for end users to interact with and understand machine learning

Language: Python - Size: 155 MB - Last synced at: 18 days ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

Nagar2nd/ML-RegressionModel---CarDekho-Price-Prediction

This repository features a machine learning model for predicting used car prices using data from CarDekho.com. The project leverages exploratory data analysis and regression techniques to empower sellers and buyers with actionable insights in the Indian used car market.

Language: Jupyter Notebook - Size: 573 KB - Last synced at: 17 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

gre1wy/MTAD

KPI IPT course, 5 semester

Language: Rich Text Format - Size: 30.9 MB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

thimyxuan/speed-dating-analysis

A speed dating analysis

Language: Jupyter Notebook - Size: 3.22 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

alvaro-concha/animal-behavior-preprocessing

animal-behavior-preprocessing is a Python repository to preprocess animal behavior data. It works on the output spreadsheets from video-tracking of animal body parts with LEAP or DeepLabCut. It applies a Median Filter, an Ensemble Kalman Filter, transforms data to joint angles and computes their Morlet Wavelet Spectra.

Language: Python - Size: 251 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

brunofsbravo/US-Household-Income

Processo de limpeza e exploração de dados da Renda de Famílias disponibilizados pelo governo dos Estados Unidos.

Size: 1.72 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

maxxhvo/Modelling_Climate_Change_Via_Indicators

Statistical Modelling and Data Visualization of a Climate Change Dataset (January 1984 to December 2008 ) Sourced from Kaggle

Language: HTML - Size: 15.1 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

IstinNew/Enaic-s-Discount-Strategy-Analysis

**(Open to Collaboration):** This project evaluates the impact of discounts on sales and customer retention for Eniac. It includes data cleaning, visualization, storytelling, and strategic insights to optimize discount strategies while maintaining brand reputation. 📊🛍️✨

Size: 7.62 MB - Last synced at: 20 days ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

nadaabdelmalek97/Supplier-Quality-Analysis-

Analysis for a real Data Set aims to improve manufacturing quality by identifying key causes of downtime and defects, with vendors and material performance.

Language: Jupyter Notebook - Size: 2.59 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

nadaabdelmalek97/-Deforestation-SQL-project-

“ Deforestation SQL project “ 📌 This project was a comprehensive journey that encompassed rigorous data cleaning ,building schema, and provide insights with regard to the forestation trend between 1990 - 2016. Our main goal was to explore and analysed the dataset by writing simple SQL queries 📝

Size: 389 KB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

LisaKey/datacamp-data-analyst-python-sql-projects

Several projects completed during my Data Analyst 📊 training on the DataCamp platform with Python 🐍 and SQL 🗃️. Each project addresses real-world challenges using modern analytical tools and techniques.

Language: Jupyter Notebook - Size: 20.7 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

AlonResearch/MEB-9600Sanitizer

A small scrip to split the output file of the Nihon Codhen MEB-9600 in to different files with each of the sweeps for post data analysis.

Language: Python - Size: 14.6 KB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

BadrAnalyst/Nashville-Housing-SQL-Data-Cleaning

This project focuses on cleaning and preparing the Nashville Housing dataset for analysis using SQL. It involves identifying and rectifying inconsistencies, handling missing values, and optimizing the dataset for further exploration. The cleaned data is essential for accurate insights into housing trends and patterns in Nashville.

Size: 5.64 MB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

beery4010/Wrangle-and-Analyze-Data

Udacity Data Analyst Nanodegree - Project IV

Language: HTML - Size: 3.14 MB - Last synced at: 27 days ago - Pushed at: almost 5 years ago - Stars: 7 - Forks: 12

ELHoussineT/AutoDataCleaner

Simple and automatic data cleaning in one line of code! It performs one-hot encoding, date & time casting to datetime dtype, detects binary columns, safely convert non-numeric columns to numeric dtypes, cleaning dirty/empty values, normalizing values and removing unwanted columns all in one line of code. Get your data ready for model training and fitting quickly.

Language: Python - Size: 647 KB - Last synced at: 5 days ago - Pushed at: about 4 years ago - Stars: 19 - Forks: 4

BadrAnalyst/Data-Cleaning-and-Exploratory-Data-Analysis-Project

This project uses SQL to clean and analyze a layoffs dataset. Data cleaning tasks include removing duplicates, standardizing values, and handling missing data. Exploratory analysis is performed to identify trends in layoffs across companies, industries, and time periods.

Size: 52.7 KB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

tanyagarg25/Uber_vs_Lyft_Price_Analysis

A comprehensive analysis comparing Uber and Lyft ride prices and service performance. The project explores key factors such as distance, surge pricing, and weather conditions affecting fares. Data cleaning, visualizations, and predictive modeling were used to provide insights into pricing strategies and market positioning.

Size: 2.55 MB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

tanyagarg25/Local_Store_Performance_Analysis

Analyzing local store performance using sales data to identify trends, inefficiencies, and opportunities for growth. This project includes data cleaning, descriptive statistics, and interactive visualizations using Tableau and Excel

Size: 1.09 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

Xuehong-pdx/Health-Data-Analysis

This project aims to derive insights from health related data

Language: Jupyter Notebook - Size: 1.64 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1

Naveen-Sharma220722/Employee_Attrition_Analysis

A Dashboard made in Excel showing the trend of how employees left the organization.

Size: 9.19 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

sehgal-vishal/Citi-Bike-Data-Analysis

The goal of this project is to analyze the usage patterns of Citi Bike in New York City

Size: 256 KB - Last synced at: 22 days ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

abudiab/Tech-layoffs-data-cleaning-and-exploration-using-MySQL

Cleaning and Exploring the Tech layoffs dataset from COVID 2019 to 2023

Size: 4.32 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

albvieiraa/physical_activity_google_project_2024

Projeto da certificação do Google Data Analysis

Language: Jupyter Notebook - Size: 229 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

theo-liang/SQL-and-Tableau-Project-Analysis-for-Rockbuster-Stealth

This project involved analyzing data for Rockbuster Stealth LLC, a fictional movie rental company transitioning to an online video rental service.

Size: 2.19 MB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

dnadar881/First-projects

Data Analyst

Language: Jupyter Notebook - Size: 22.2 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

christopherBelter/qvr_processing

A function to process QVR data using R

Language: R - Size: 39.1 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0