Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: wrangling-data

sondosaabed/Advanced-Data-Wrangling-Project

Language: Jupyter Notebook - Size: 9.77 KB - Last synced: 8 days ago - Pushed: 9 days ago - Stars: 0 - Forks: 0

mejbass/Open-source-Data-Science-couch

A structured 3-year curriculum for data science, covering foundational, intermediate, and advanced topics

Size: 17.6 KB - Last synced: 10 days ago - Pushed: 11 days ago - Stars: 0 - Forks: 0

PiotrTymoszuk/trafo

Transformation Toolset for Vectors, Matrices, Lists and Data Frames

Language: R - Size: 86.9 KB - Last synced: 22 days ago - Pushed: 22 days ago - Stars: 0 - Forks: 0

tien-duong115/Twitter_doggo_page_analysis

In this data wrangling project, the goal is to clean up data quality and tidiness issues using both visual and programmatic assessments.

Language: Jupyter Notebook - Size: 1.73 MB - Last synced: 25 days ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

surajhari/JPMorgan_Excel_VirtualCaseExperience

Projects and certifications related to the JPMorgan Chase & Co. Excel Skills Virtual Case Experience, a job simulation program offered by JPMorgan on the Forage platform.

Size: 117 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

Shoh96/ALX-Data-Analyst

This repository contains 3 projects that were carried out and submitted for my ALX Udacity Data Analyst Course

Language: HTML - Size: 9.18 MB - Last synced: about 2 months ago - Pushed: over 1 year ago - Stars: 3 - Forks: 0

hawkfish/textform

A data transformation pipeline library based on Potter's Wheel.

Language: Python - Size: 465 KB - Last synced: about 15 hours ago - Pushed: over 2 years ago - Stars: 7 - Forks: 0

MdTanvirHossainTusher/Sales-Prediction

A linear regression model to predict sales based on advertising costs

Language: Jupyter Notebook - Size: 299 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

mjchimbadzwa/ML-Classification-project

A Simplilearn class project practicing machine learning classification algorithms.

Language: Jupyter Notebook - Size: 3.23 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

anoushirazi/IBM-Data-Science-Capstone-Project

The SpaceX project analysis involves evaluating SpaceX launch data to identify trends and insights related to launch success rates, payload capacities, and the impact of different variables on mission outcomes.

Language: Jupyter Notebook - Size: 63.8 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

vijayasaravana/Customer_Sales_Analysis_dashboard

Power BI

Size: 2.83 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 1 - Forks: 0

imukoki/Wrangle-and-Analyze-Data

Use Python to perform Data Wrangling (Gathering, Assessing, Cleaning) of the WeRateDogs Twitter account and archive, followed by storing, analyzing and visualizing the wrangled data.

Language: Jupyter Notebook - Size: 1.38 MB - Last synced: 6 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

lolivera0409/Proyecto_Final---Data_Science---CoderHouse2023

Final project for the Data Science 2023 course

Language: Jupyter Notebook - Size: 1.81 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

divyansh1195/Halliburton-Landmark-Learning-ML-with-Python

Machine Learning with Python: Halliburton Landmark Learning

Language: Jupyter Notebook - Size: 12.5 MB - Last synced: 8 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

nafisalawalidris/SpaceX-Falcon-9-first-stage-Landing-Prediction

The project revolves around predicting the successful landing of the Falcon 9 first stage during SpaceX rocket launches. By leveraging the concepts and techniques learned in the specialization, we aim to develop a predictive model that can determine the likelihood of a successful landing.

Language: Jupyter Notebook - Size: 945 KB - Last synced: 8 months ago - Pushed: about 1 year ago - Stars: 1 - Forks: 0

nourhenehanana/Time-series-analysis

Working with a time series of energy data, we’ll see how techniques such as time-based indexing, resampling, and rolling windows can help us explore variations in electricity demand and renewable energy supply over time.

Language: R - Size: 363 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

awojidetola/Udacity-Data-Analysis-ND

This repository holds all the projects for the ALXT Data Analysis Udacity Nanodegree Program

Language: HTML - Size: 6.23 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0

harshdeepkalita/Data-Analyst-Mini-Projects

Data Analysis Mini Projects

Language: Jupyter Notebook - Size: 18.1 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

NdAbdulsalaam/Movie_success_determinants

Performed descriptive analysis of a movie data set, evaluated trends, established facts supported by the data, and advised on best practices for releasing a new movie.

Language: Jupyter Notebook - Size: 3.08 MB - Last synced: 10 months ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

yabiola/udacity-data-analyst-projects

This repository contains 3 projects that were carried out and submitted for my ALX Udacity Data Analyst Course

Language: HTML - Size: 25.8 MB - Last synced: 10 months ago - Pushed: almost 2 years ago - Stars: 1 - Forks: 0

GustavoFacincani/My-Code

Scripts that I've used during grad school for data collection, analysis, visualization, cleaning, wrangling, etc., for classes, project reports, and manuscripts.

Language: R - Size: 376 KB - Last synced: 4 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

anna-ringwood/cleaning-deduplicating-donor-data

This repository holds the code files used in an undergraduate data wrangling project from March to August 2021.

Size: 40 KB - Last synced: 4 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

dell-datascience/Data_Analytics

Albert Dellor - Data Analyst Project Portfolio

Language: Jupyter Notebook - Size: 113 MB - Last synced: 4 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 1

lethuyngocan/EDA-Projects

I showcase 3 distinct projects that apply EDA techniques to serve diverse business objectives.

Language: Jupyter Notebook - Size: 1.77 MB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 1 - Forks: 0

UdaykiranEstari/DataPreparation-Experimental

Python-based data cleaning and wrangling project showcasing preprocessing steps. Handles missing values, transforms data, removes duplicates, treats outliers, and performs feature engineering for enhanced analysis. Validates data quality and consistency. Dataset from YouTube channel. Size: 76,378 rows, 9 columns.

Language: Jupyter Notebook - Size: 9.1 MB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0

gebenner1/CIS5450BigDataProject

Data wrangling, analysis, and prediction for Amazon book reviews.

Language: Jupyter Notebook - Size: 481 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

marbel89/AI-incidents_Wrangling

Wrangles an AI incident data set for easier visualization purposes

Language: Jupyter Notebook - Size: 5.36 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 1 - Forks: 0

Benazir023/Cyclistic

Analyzing Capstone Project (Cyclistic)

Language: R - Size: 617 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 1 - Forks: 0

taricov/Python_Wrangling_Messy_Data

I came across a challenge introduced by Shashank Kalanithi. It's a data set in the form of a CSV file with an interesting block-like structure. The challenge was to transform this data into the regular form of long, thin tables.

Language: Python - Size: 156 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

Sheminho/TMDb_movie_data_analysis

Analyzing data from the TMDb movie website: exploring the dataset to answer some questions and presenting the data in an interactive way.

Language: Jupyter Notebook - Size: 1.44 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

pandeyankitg/CapstoneAlmabetterEDA

EDA Capstone Project (Almabetter)

Language: Jupyter Notebook - Size: 7.87 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 1 - Forks: 0

rkadey/data-wrangling-in-R

Language: R - Size: 3.91 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

SedemBuabs/Udacity-Project-Investigate-a-Dataset

Udacity Data Analyst Nanodegree Project : Investigate a Dataset

Language: HTML - Size: 5.49 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

s-njeru/Data-Wrangling

Data Wrangling Project from the Udacity Data Analytics Nanodegree

Language: HTML - Size: 2.61 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

brianmaleek/project_workspace_2_tweepy

Wrangling and analyzing the WeRateDogs Twitter account, which rates people's dogs with a humorous comment about the dog.

Language: Jupyter Notebook - Size: 2.57 MB - Last synced: about 2 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

andrewmuhoro/WeRateDogs

WeRateDogs is a Twitter account that shares dog images and writes a brief panegyric about each dog, then lets its followers rate the dog by favoriting the tweet. The goal is to go through the whole data analysis process (collecting, cleaning, analyzing, and finally visualizing the data), with emphasis on data wrangling.

Language: Jupyter Notebook - Size: 6.45 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

kfrawee/WeRateDogs

Gathering data from a variety of sources and in a variety of formats, assessing its quality and tidiness, then cleaning it. Showcase wrangling efforts through analysis and visualizations.

Language: Jupyter Notebook - Size: 3.84 MB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 1 - Forks: 4

jarsonX/Data_Analysis

Tasks and small projects related to data analysis. Mostly automation and data wrangling.

Language: Jupyter Notebook - Size: 5.51 MB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 1

Irene-arch/TMDB-Movies-Dataset

This project is part of the projects in the Data Analyst NanoDegree Program from Udacity

Language: HTML - Size: 3.25 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

comsavvy/WeRateDogs-Wrangling-and-Visualization-Analysis

Analysis on WeRateDogs tweets

Language: HTML - Size: 4.33 MB - Last synced: 11 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

sl-solution/InMemoryDatasetsTutorial

A tutorial for working with InMemoryDatasets.jl.

Language: Jupyter Notebook - Size: 8.25 MB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 6 - Forks: 0

Tola-adelase/Wrangle-and-Analyze-Data-UdacityProject

Udacity Nanodegree Project 4 (Wrangle and Analyze Data)

Language: Jupyter Notebook - Size: 1.79 MB - Last synced: over 1 year ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

yuliianikolaenko/text-network-analysis

Part of the Data Science course project dedicated to Digital Ethics concepts mapping

Language: Jupyter Notebook - Size: 3.23 MB - Last synced: over 1 year ago - Pushed: over 3 years ago - Stars: 0 - Forks: 1

Krishnkumar542/IBM_Data_Science_Professional_Certification

This repository contains all the resources of the final Capstone Project which is a part of the IBM Data Science Professional Certification.

Language: Jupyter Notebook - Size: 3.24 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

dbmurray/preppin_data_rstats_chapter

This is a repository of #rstats solutions to the Preppin' Data challenges published at preppindata.com

Language: R - Size: 361 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

moriahtaylor1/national_parks_shiny

Shiny Contest 2021 Submission - Biodiversity in U.S. National Parks

Language: R - Size: 324 KB - Last synced: over 1 year ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0

Related Keywords
wrangling-data (46), python (17), data-science (11), data-visualization (10), data (10), matplotlib (10), data-analysis (9), exploratory-data-analysis (9), numpy (9), pandas (9), cleaning-data (8), visualization (8), seaborn (7), python3 (6), wrangling-cleaning (6), gathering-data (5), jupyter-notebook (5), machine-learning (5), twitter-api (5), r (4), wrangling (4), dataanalytics (3), assessing-data (3), statistical-analysis (3), sql (3), weratedogs (3), cleaning-data-in-python (2), explanatory-data-analysis (2), tweepy-api (2), dataset (2), presentation-slides (2), reporting (2), cleaning-dataset (2), eda (2), tidyverse (2), time-series-analysis (2), udacity-nanodegree (2), analysis (2), statistics (2), pandas-dataframe (2), pandas-python (2), merge (2), python-files (1), classification (1), datawrangling (1), datascience (1), data-cleaning (1), cyclistic (1), bigquery (1), random-forest (1), natural-language-processing (1), logistic-regression (1), merging-data (1), downscaling (1), energy-data (1), hydrology (1), netcdf-files (1), plotting-in-r (1), sankey-diagram (1), spatial-analysis (1), timeseries (1), violinplot (1), wrangle-and-analyse (1), cleaning (1), deduplicate (1), deduplication (1), donations (1), donor (1), geocoding (1), nonprofit (1), etl (1), visualisation (1), sale (1), turnover-analysis (1), big-data (1), ipython-notebook (1), flight-data-analysis (1), inmemorydatasets (1), julia (1), manipulate-data (1), tutorial (1), dataanalyst (1), network-analysis (1), text-mining (1), final-project (1), ibm-data-science-professional (1), machine-learning-coursera (1), spacex-api (1), webscrapping (1), rstats (1), ggplot2 (1), shiny (1), shiny-apps (1), shiny-r (1), shinyapps (1), tidy (1), jypyternotebook (1), google-data-studio (1), almabetter (1), colab-notebook (1)