An open API service providing repository metadata for many open source software ecosystems.

Topic: "wrangling-cleaning"

jayronsoares/automated_data_engineering

Simple ETL pipeline that extracts information from CSV, LOG, and JSON files and loads it into a MySQL database using Python and SQL.

Language: Python - Size: 535 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 2

PelumiAdeboye/multinational-retail-data-centralisation

Language: Python - Size: 20.5 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

ujunwa-DS/SPACEX-FALCON-9-CAPSTONE-PROJECT

A total package of what data science is all about: from dashboard building to data wrangling, SQL, data collection, visualization, and web scraping to presentation.

Language: Jupyter Notebook - Size: 8.12 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

Daniel-Elston/Hum-Whistle-Song-Recognition-Software

Machine learning and signal processing pipeline used to identify a song name from user input (hum/whistle to song).

Language: Jupyter Notebook - Size: 4.11 MB - Last synced at: 4 months ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

Amr-YA/tmdb_analysis

Movies data analysis to produce visuals and insights about the data-set of 10,000 movies.

Language: Jupyter Notebook - Size: 233 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

dirkkadijk/analyse-and-wrangle-WeRateDogs-data

Project in the Udacity Data Analyst Nanodegree. This project focused on advanced data gathering (several sources, including the Twitter API), wrangling, and cleaning of data, plus two reports.

Language: Jupyter Notebook - Size: 1.54 MB - Last synced at: 5 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

Adongo/HR-Employee-Attrition

Exploratory Data Analysis to uncover factors that lead to employee attrition.

Language: Jupyter Notebook - Size: 433 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

dirkkadijk/Ford-GoBike-Data-Exploration-and-Communication-of-insights

Capstone project of Udacity Data Analyst Nanodegree. Focus on advanced visualizations to explore data and to communicate insights and patterns. Final slide deck is made with Jupyter notebook with interactive HTML slides (based on reveal.js).

Language: HTML - Size: 8.04 MB - Last synced at: 5 months ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 2

Wb-az/ML-airbnb-paris-analytics-and-price-prediction

Airbnb Paris - analytics and accommodation price prediction

Language: Jupyter Notebook - Size: 36.8 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Nmension/Fcc_medical_data_visualizer

Project (3/5) from the FCC course: 'Data Analysis with Python'

Language: Python - Size: 882 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

DemolisherAA/Data_Wrangling

This repository contains two Jupyter Notebooks focusing on tasks related to data preprocessing and cleaning: Task 1, "Data Cleaning and Preprocessing", and Task 2, "Data Loading and Cleaning Workflow".

Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Sadiq-marcelo/investigate-FordGoBike-tripdata

Investigate Ford GoBike Project

Language: HTML - Size: 7.55 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

AjmalSarwary/BRENT-Model

Predictive Model for BRENT price movements

Language: Jupyter Notebook - Size: 106 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

c13mora/traffic_accidents_prediction

Machine learning models to predict if a given traffic accident will end up in casualties

Language: Jupyter Notebook - Size: 1.51 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Ken-Vu/Event-Category-Disparities-in-Elm-City-Stories-

A data analysis project for the American Statistical Association's DataFest competition that won "Best in Show" in 2022

Size: 7.81 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

dbmurray/preppin_data_rstats_chapter

This is a repository of #rstats solutions to the Preppin' Data challenges published at preppindata.com

Language: R - Size: 361 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

shiraen/sea_countries_debt_analysis

Analysis on external debt of select South East Asian Countries for the past 10 years.

Language: Jupyter Notebook - Size: 12.1 MB - Last synced at: 19 days ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nilooy5/Covid19-Data-Analysis

Covid19 data analysis and forecasting

Language: R - Size: 109 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

s-njeru/Data-Wrangling

Data Wrangling Project from the Udacity Data Analytics Nano Degree

Language: HTML - Size: 2.61 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

s-njeru/Data-Analysis-Process

This project is from the Data Analysis Nano Degree from Udacity. The intent is to walk through the data analysis process on a medical patient show/no-show dataset, identifying the relationship between various independent variables and the dependent variable: whether a patient will show up or not.

Language: HTML - Size: 4.19 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

akhmadtaufik/project-wrangling-pacmann

Medium Article

Language: Jupyter Notebook - Size: 2.6 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

I-Sobe/absenteeism

My First Machine Learning Project (A 365 careers project)

Language: Jupyter Notebook - Size: 976 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

brianmaleek/project_workspace_2_tweepy

Wrangling and analyzing the WeRateDogs Twitter account, which rates people's dogs with a humorous comment about the dog.

Language: Jupyter Notebook - Size: 2.57 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

AbiolaBajo10/Prosper-Loan-Data

The Loan Data from Prosper dataset is a financial dataset covering loans, borrowers, interest rates, etc. Prosper, or Prosper Marketplace Inc., is a San Francisco, California-based company specializing in low-interest loans to borrowers. We use the Prosper dataset for exploratory data analysis. The dataset comprises 81 variables and contains 113937 entries.

Language: HTML - Size: 870 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

codeninja2020/Telecomdata

Data analysis project for a telecoms company

Language: Jupyter Notebook - Size: 9.62 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

AmMoPy/WeRateDogsAnalytics

Analysing the Twitter user dog_rates, AKA WeRateDogs.

Language: Jupyter Notebook - Size: 2.8 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

Daniel-Elston/Credit-Card-Default-Prediction-Algorithm

Algorithm used to predict whether a bank customer will default on given credit cards, using a bank telemarketing dataset.

Language: Jupyter Notebook - Size: 1010 KB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

LucasDeMatheo/DataScienceProject_Titanic

This project, carried out in Jupyter Notebook, aims to explore the main data analysis techniques with Python tools. Pandas, NumPy, Seaborn, Matplotlib, Plotly and sklearn are used. The work is divided into three notebooks that separate the data cleaning, data analysis, and machine learning parts. For more details and goals, see the README.

Language: Jupyter Notebook - Size: 4.92 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

Tola-adelase/Wrangle-and-Analyze-Data-UdacityProject

Udacity Nanodegree Project 4 (Wrangle and Analyze Data)

Language: Jupyter Notebook - Size: 1.79 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

Tola-adelase/data-visualization-udacityproject

Udacity Nanodegree Project 5 (Communicate Data Findings)

Language: HTML - Size: 21.7 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 1

anna-ringwood/cleaning-deduplicating-donor-data

This repository holds the code files used in an undergraduate data wrangling project from March - August 2021.

Size: 40 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

giuliabrambilla/star-wars-analysis

🌌 Data analysis to answer questions about one of the most successful movie franchises of all time: Star Wars.

Language: Jupyter Notebook - Size: 520 KB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

NeilFabiao/Factors-that-affect-manufacturing-GDP-SA-perspective

Factors that affect manufacturing GDP SA perspective

Language: Jupyter Notebook - Size: 2.79 MB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

moriahtaylor1/national_parks_shiny

Shiny Contest 2021 Submission - Biodiversity in U.S. National Parks

Language: R - Size: 324 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

yehia55/IMDb_exploratory_analysis

Data wrangling; asking questions about data; data visualization; answering data questions; writing a final report that shows the data limitations and analysis results.

Language: Jupyter Notebook - Size: 752 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Doumham-Armah/DataWranglingTutorial

Taking a dataset about obesity rates that looks messy and converting it to a nice and clean format.

Language: HTML - Size: 210 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

AbbaTek-Group/Spatial_Analysis_Dashboard

Visualizing B.C. Protected and Conserved Areas with static and interactive maps to summarize the status and distribution of different types of conserved and protected areas with the updated CPCAD data.

Language: HTML - Size: 817 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1

Adongo/Telco-Customer-Churn-Analysis

Data Wrangling, Data Visualization, Data Cleaning

Language: Jupyter Notebook - Size: 746 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

MSjoia/project-4_udacity_Wrangle-and-Analye-Data

data wrangling

Language: Jupyter Notebook - Size: 3.48 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

siawayforward/ten-years-of-nba-data

Using ten seasons of NBA data from 2010-2019 to practice web scraping, data cleaning, and wrangling. The scraped information includes teams, coaches, champions, and players. Some of this data is going to be used for a finals winner prediction project. This repository will therefore highlight how to clean data and make notes about why certain decisions about missing data were made.

Language: Jupyter Notebook - Size: 2.95 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

Related Topics
python 13 data 10 data-visualization 10 data-analysis 9 data-science 8 exploratory-data-analysis 8 wrangling-data 7 pandas 7 machine-learning 6 visualization 6 seaborn 4 wrangling 3 algorithms 3 analytics 3 eda 3 matplotlib 3 twitter-api 3 cleaning-data 2 jupyter-notebook 2 numpy 2 shiny-apps 2 exploratory-data-visualizations 2 ggplot2 2 datawrangling 2 mysql 2 machine-learning-algorithms 2 udacity-nanodegree 2 explanatory-data-analysis 1 time-series-analysis 1 principal-component-analysis-pca 1 price-prediction 1 python-3 1 pre-processing 1 plotly-express 1 market-data 1 latent-variable-models 1 colab-notebooks 1 artificial-neural-networks 1 forward-selection 1 r 1 nonprofit 1 geocoding 1 donor 1 donations 1 deduplication 1 pca-analysis 1 classification 1 banking-applications 1 datavisualization-project 1 signal-processing 1 data-engineering 1 artificial-intelligence 1 movie-database 1 anlysis 1 wrangle-and-analyse 1 data-wrangling 1 dataset 1 datacleaning 1 tutorial 1 numpy-tutorial 1 weratedogs 1 twitter 1 tweepy-api 1 gathering-data 1 cleaning-dataset 1 assessing-data 1 dataanalyst 1 sql 1 python-language 1 matplotlib-pyplot 1 spatial-data-analysis 1 rstats 1 analysis 1 tidyverse 1 tidy 1 shinyapps 1 shiny-r 1 shiny 1 titanic-kaggle 1 python3 1 pyhton 1 predictions 1 imputation 1 munging 1 visualization-libraries 1 uni-bi-multivariate-exploration 1 reporting 1 json 1 gathering 1 imbalanced-data 1 classification-model 1 nba-data 1 data-cleaning 1 deduplicate 1 cleaning 1 tableau 1 logistic-regression 1 pandas-python 1 matplo 1 we-rate-dogs 1