An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: cleaning-data-in-python

OlayinkaJames01/DataFestAfrica-ML-Data-Hackathon Fork of mamakay2212/DataFestAfrica-ML-Data-Hackathon

A machine learning datathon by DataFest Africa

Language: Jupyter Notebook - Size: 841 KB - Last synced at: 16 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

kyrollossarwat/Cleaning-for-Machine-learning

The main things we do in EDA or cleaning the data are: removing or modifying null values, removing duplicates, handling outliers, encoding and scaling.

Language: Jupyter Notebook - Size: 169 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

eshayalagi/CODE_DHELHI_INTERNSHIP

Welcome to CodeBook – Your Data Science Internship Begins!

Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Saneachka/Regression-of-rental-apartment-prices

Regression of apartment rental prices

Language: Jupyter Notebook - Size: 4.1 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Hadelockeuse/Rental_Housing_in_Berlin

Personal projet to learn web scraping, Power BI and revive my Python syntax knowledge

Language: Jupyter Notebook - Size: 1.49 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Amulya20010418/Zomato_data_analysis

Analyze Zomato restaurant data all the world and find the insights by using Python libaries and also visualize the dataset by using Power-bi

Language: Python - Size: 1.54 MB - Last synced at: 17 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

IstinNew/Enaic-s-Discount-Strategy-Analysis

**(Open to Collaboration):** This project evaluates the impact of discounts on sales and customer retention for Eniac. It includes data cleaning, visualization, storytelling, and strategic insights to optimize discount strategies while maintaining brand reputation. 📊🛍️✨

Size: 7.62 MB - Last synced at: 11 days ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

riyadhussain123/Lloyds_Data_Sci_internship_files

Lloyds Banking Group Data Science Virtual Internship Project

Language: Jupyter Notebook - Size: 1.9 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

gittsed/Miles_per_Gallon

Exploratory Data Analysis of the mpg dataset is examining and understanding the characteristics of the data to uncover patterns and relationships within the dataset.

Language: Jupyter Notebook - Size: 46.9 KB - Last synced at: 10 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

gautam2000/Wine-Reviews-Data-EDA-Project

Analyze Wine Reviews Data and find the insights by using python libraries

Language: Jupyter Notebook - Size: 334 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

DidierKOUADIO/Data-science

Language: R - Size: 40 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

celestearia/Fun_Project

An Interactive Map of Festivals and Trails in France

Language: Python - Size: 7.82 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

devanshijbhatt/Exercise-Exploratory-Analysis-With-SQL

An exploratory analysis on various exercises. This analysis explores factors such as Difficulty Level, Target Muscle Group, Primary Equipment, Posture, and Body Region. for helping the development of Home Workout Application

Size: 267 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

Azie88/Beginner-Data-Analysis

Beginner Data Analysis and Power BI Dashboarding with Indian Startup Dataset (2018-2021)

Language: Jupyter Notebook - Size: 3.47 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 1

ajupeter23/Exploratory-Data-Analysis

Data Cleaning and Analysis - A part of Assessment 2 -GD604 Data Collection and Analysis New Zealand School of Education College (NZSE)

Language: HTML - Size: 1.91 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

dariga-sm/Exploring-the-Bitcoin-Cryptocurrency-Market

explore the market capitalization of different cryptocurrencies.

Language: Jupyter Notebook - Size: 172 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

Shoh96/ALX-Data-Analyst

This repository contains 3 projects that were carried out and submitted for my ALX Udacity Data Analyst Course

Language: HTML - Size: 9.18 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

krithisudhir/Color-Survey

Understanding the 5 Stages of Data Analysis Using a Simple Online Survey About Favourite Colors

Language: Jupyter Notebook - Size: 56.6 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

marcogarganigo/WindowsCleaner

Python script to automatically clean temporary files

Language: Python - Size: 1000 Bytes - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

SumanKOLKATA/Workforce-analysis-

Workforce status analysis

Language: Jupyter Notebook - Size: 1.06 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

datatalking/Portfolio-datatalking

My portfolio website

Language: HTML - Size: 12.1 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

JuliaIron/Business_Analytics_of_e_commerce Fork of plumeris/Mid-bootcamp-project

Defining and creating dashboards with KPIs of an e-commerce. Working with SQL, Python, Tableau.

Language: Jupyter Notebook - Size: 141 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

Fondzenyuy/SURP-2022-Astro

Summer Undergraduate Research Project @UniversityofToronto Astronomy : Removing foregrounds from the CMB (Cosmic Microwave Background) radiation maps of the universe from the PLANCK and WMAP satellite missions.

Language: Jupyter Notebook - Size: 237 MB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

labrijisaad/exploratory-data-analysis-in-Python

In this project, we will see in a hands-on training jupyter notebook how to effectively diagnose and deal with missing data in Python.

Language: Jupyter Notebook - Size: 5.19 MB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 7 - Forks: 2

ritikaga/Festive-Season-Sales-Analysis

Analyze Diwali Sales data using Pandas, NumPy, Matplotlib, and Seaborn Libraries to Improve customer experience and also sales.

Language: Python - Size: 514 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

the-ogre/TransplantWaitTimePrediction

Ongoing Project: "Transplant wait time prediction" IPBA, IIM Indore.

Language: Jupyter Notebook - Size: 8.79 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

tezwithdata/CleaningData

This repository consist with my cleaning data project. You can check .ipynb version for documentation. If you want to practice, click the link for the source that i used:

Language: Jupyter Notebook - Size: 8.79 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

yabiola/udacity-data-analyst-projects

This repository contains 3 projects that were carried out and submitted for my ALX Udacity Data Analyst Course

Language: HTML - Size: 25.8 MB - Last synced at: 6 months ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

AlbertHunduza/Smartphone-Project

Analyzing and Visualizing the evolution of Smartphone specifications on a web-scraped dataset.

Language: Jupyter Notebook - Size: 26 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

ritikaga/Zomato-Analysis-with-Python-and-visualization-with-Power-BI

Analyze Zomato restaurant data all the world and find the insights by using Python libaries and also visualize the dataset by using Power-bi

Language: Python - Size: 1.54 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

f-ssemwanga/pandas-numpy-repo

This repo has extensive work I have done on Pandas and NumPy Modules during the advanced programming Module

Language: Jupyter Notebook - Size: 1.61 MB - Last synced at: 13 days ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

sbettid/GPSClean

An application to correct a GPS trace using machine learning techniques.

Language: Python - Size: 9.1 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 9 - Forks: 0

natigl/DA-bootcamp-eda-limpieza-etl

This repository contains the exercises I did in the second module of my Data Analytics bootcamp at Adalab.

Language: Jupyter Notebook - Size: 6.04 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

sengourav/Web_Scraping_of_IMDB_Webpage

This repo contains web scraping of IMDB site and data cleaning of collected data along with EDA on the cleaned data.

Language: Jupyter Notebook - Size: 862 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

babiotg/E-Commerce-Amazon-EDA

Exploratory Data Analysis and Data Cleaning on a Amazon E-Commerce Dataset

Language: Jupyter Notebook - Size: 15 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

HebaHossam68/Heart-Disease-

Machine learning project

Language: Jupyter Notebook - Size: 553 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

shreyas-singhal/Lead-Scoring-Case-Study

X Education Organization wants to identify if a customer registered on their website for enquiry is a potential customer or not. Using past data to build a machine learning algorithm

Language: Jupyter Notebook - Size: 1.38 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Janice-Afi/Wikipedia

An exploratory analysis on Wikipedia page visits

Language: HTML - Size: 13.1 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

andrewmuhoro/DataC-TChallenges

Using Python Pandas to clean and transform an anonymized version of a dataset.

Language: Jupyter Notebook - Size: 652 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

Dalia-Mahmoud-ElSayes/eT3-Task

DataScience Task for final acceptance stage for eT3 Internship

Language: Jupyter Notebook - Size: 335 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

dbeteta-w/linguistic_data_treatment

Language: Python - Size: 59.6 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

zkrzn/DataScienceProjects

Extraction Cleaning Manipulation Visualization Machine Learning & More

Language: Jupyter Notebook - Size: 15.4 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

mcastelli5/Customer-Segmentation

Unsupervised kMeans clustering model analysis for e-commerce transaction data to uncover unique customer segments.

Language: Jupyter Notebook - Size: 9.14 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

Smcgb/Udacity_Data_Wrangle

This excercise takes a large amount of data from the Kaggle WeRateDogs and a provided Udacity dataset to show data wrangling and cleaning with various Python tools.

Language: HTML - Size: 2.64 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

Related Keywords
cleaning-data-in-python 44 python 17 data 12 exploratory-data-analysis 11 visualization 9 data-visualization 8 numpy 7 cleaning-data 6 eda 6 analysis 5 data-science 5 pandas 5 machine-learning 5 seaborn 4 jupyter-notebook 4 matplotlib 4 machine-learning-algorithms 3 data-analysis 3 twitter-api 3 data-cleaning 2 dataset 2 business-analytics 2 pandas-dataframe 2 datascience 2 dataanalytics 2 explanatory-data-analysis 2 gathering-data 2 presentation-slides 2 reporting 2 wrangling-data 2 visualization-dashboard 2 powerbi 2 matplotlib-pyplot 2 python3 2 dashboard 2 outlier-analysis 1 deployment 1 outlier-treatment 1 decsion-tree 1 accuracy-metrics 1 missingno 1 matplotlib-figures 1 scraping-websites 1 beautifulsoup4 1 etl-pipeline 1 gps-data 1 univariate-analysis 1 manipulation 1 model-building 1 logistic-regression-algorithm 1 joblib 1 dummy-variables 1 bivariate-analysis 1 streamlit 1 random-forest 1 knn-classifier 1 preprocessing 1 logistic-regression 1 pandas-library 1 outlier-detection 1 numby 1 requests 1 vscode 1 unsupervised-machine-learning 1 segmentation 1 pipeline 1 kmeans-clustering 1 kaggle 1 etl 1 elbow-method 1 ecommerce 1 distribution 1 customer-segmentation 1 customer 1 analysis-framework 1 algotithms 1 vizualisation 1 sql 1 postgresql 1 machinelearning 1 extraction 1 datamanipulation 1 dataanalysis 1 business-intelligence 1 big-data 1 ab-testing 1 open-source 1 nlp 1 exercises 1 dataframe 1 regex 1 manipulating-dataframes-with-pandas 1 power-bi 1 workout-tracker 1 workout-generator 1 gym-management 1 database 1 data-mining 1 r 1 cleaning-data-in-r 1