An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: wrangling-data

bindugayatri02/Real-Estate-Price-Prediction-Project

To import data from multiple sources, clean and wrangle data, perform exploratory data analysis (EDA), and create meaningful data visualizations. I will then predict future trends from data by developing linear, multiple, polynomial regression models & pipelines and learn how to analyzethem.

Language: Jupyter Notebook - Size: 67.4 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

ohspc89/Better_Call_Jin

A repository containing mentoring materials for a Ph.D. student in Neuroscience

Language: MATLAB - Size: 12.6 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Suchi25Sathavara/Data-Wrangling-with-R

Analyzing Road Accidents in Victoria, Australia

Size: 582 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

kevinwood15/Python_ML_KMeans_Project

This project uses the KMeans ML algorithm to identify segments of the broader population that form the core customer base of a company.

Language: Jupyter Notebook - Size: 275 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

DemolisherAA/Data_Wrangling

This repository contains two Jupyter Notebooks focusing on tasks related to data preprocessing and cleaning: Task 1: "Data Cleaning and Preprocessing" Task 2: "Data Loading and Cleaning Workflow"

Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

kevinwood15/Python_ML_Classification_Modeling

This project uses GaussianNB, Random Forest, and AdaBoost Classification Models to predict the income category of individuals with US Census Data

Language: Jupyter Notebook - Size: 156 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

kevinwood15/Python_Twitter_DataWrangling_Project

The main objectives of this project is to wrangle (clean) and analyze twitter data. I deal with some messy data, clean it, then plot some visualizations of the data to analyze it.

Language: Jupyter Notebook - Size: 150 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Shivanikanodia/House-Sales-in-King-Country

Language: Jupyter Notebook - Size: 186 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

rishabhj29/-SWEETGUARD-Unveiling-Your-Diabetes-Destiny-

SWEETGUARD ๐Ÿ›ก๐Ÿ” โ€“ A data-driven diabetes risk assessment tool that leverages machine learning and public health datasets to predict individualized diabetes risk scores. Using Python ๐Ÿ, Power BI ๐Ÿ“Š, and statistical analysis, this project identifies key lifestyle factors and empowers individuals with personalized health insights.

Language: HTML - Size: 19.2 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

PiotrTymoszuk/trafo

Transformation Toolset for Vectors, Matrices, Lists and Data Frames

Language: R - Size: 86.9 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

comsavvy/WeRateDogs-Wrangling-and-Visualization-Analysis

Analysis on WeRateDogs tweets

Language: HTML - Size: 4.33 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

CaritoRamos/predictive-classification-model-in-r

HOTEL RESERVATION CANCELLATION

Language: HTML - Size: 1.86 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

nafisalawalidris/SpaceX-Falcon-9-first-stage-Landing-Prediction

The project revolves around predicting the successful landing of the Falcon 9 first stage during SpaceX rocket launches. By leveraging the concepts and techniques learned in the specialization, we aim to develop a predictive model that can determine the likelihood of a successful landing.

Language: Jupyter Notebook - Size: 945 KB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 1

sondosaabed/Oil-vs-BigTech-stock-investigation

๐Ÿ’น๐Ÿ“ˆInvestigating the oils market prices in addition to the stock market prices between the start of 2001 to the end of 2023. ๐Ÿ’ฐ๐Ÿ“‰

Language: Jupyter Notebook - Size: 10.2 MB - Last synced at: 2 months ago - Pushed at: 11 months ago - Stars: 5 - Forks: 0

theo-liang/Python-Project-Analysis-for-Instacart

This project involved analyzing Instacart's sales data to understand customer purchasing behaviors and optimize marketing strategies.

Language: Jupyter Notebook - Size: 1.73 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

altamashajaz/Applied-Statistics

This project analyzes customer purchasing behavior using descriptive statistics. It includes data preprocessing, exploratory data analysis, and statistical analysis to uncover patterns and trends. The goal is to optimize marketing strategies and improve offer acceptance rates.

Language: Jupyter Notebook - Size: 1.38 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

GustavoFacincani/My-Code

Scripts that I've used during grad school for data collection, analysis, visualization, cleaning, wrangling, etc., for classes, project reports, and manuscripts.

Language: R - Size: 389 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

mjchimbadzwa/ML-Classification-project

A Simplilearn class project practicing machine learning classification algorithms.

Language: Jupyter Notebook - Size: 3.23 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

Bassamejlaoui/Open-source-Data-Science-couch

A structured 3-year curriculum for data science, covering foundational, intermediate, and advanced topics

Size: 17.6 KB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

tien-duong115/Twitter_doggo_page_analysis

Introduction In this data wrangling project, the goal is to clean up the data quality and tidiness issues using both visual and programmatic assessments

Language: Jupyter Notebook - Size: 1.73 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

surajhari/JPMorgan_Excel_VirtualCaseExperience

Projects and Certifications related to JPMorgan Chase & Co. Excel Skills Virtual Case Experience, a job simulation program offered by JPMorgan on Forage platform.

Size: 117 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Shoh96/ALX-Data-Analyst

This repository contains 3 projects that were carried out and submitted for my ALX Udacity Data Analyst Course

Language: HTML - Size: 9.18 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

hawkfish/textform

A data transformation pipeline library based on Potter's Wheel.

Language: Python - Size: 465 KB - Last synced at: 4 months ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 0

MdTanvirHossainTusher/Sales-Prediction

A linear regression model to predict sales based on advertising costs

Language: Jupyter Notebook - Size: 299 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

anoushirazi/IBM-Data-Science-Capstone-Project

The SpaceX project analysis involves evaluating SpaceX launch data to identify trends and insights related to launch success rates, payload capacities, and the impact of different variables on mission outcomes.

Language: Jupyter Notebook - Size: 63.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

vijayasaravana/Customer_Sales_Analysis_dashboard

Power BI

Size: 2.83 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

imukoki/Wrangle-and-Analyze-Data

Use Python to perform Data Wrangling (Gathering, Assessing, Cleaning) of the WeRateDogs Twitter account and archive, followed by storing, analyzing and visualizing the wrangled data.

Language: Jupyter Notebook - Size: 1.38 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

lolivera0409/Proyecto_Final---Data_Science---CoderHouse2023

Proyecto final del curso Data Science 2023

Language: Jupyter Notebook - Size: 1.81 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

divyansh1195/Halliburton-Landmark-Learning-ML-with-Python

Machine Learning with Python: Halliburton Landmark Learning

Language: Jupyter Notebook - Size: 12.5 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

nourhenehanana/Time-series-analysis

Working with a time series of energy data weโ€™ll see how techniques such as time-based indexing, resampling, and rolling windows can help us explore variations in electricity demand and renewable energy supply over time.

Language: R - Size: 363 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

awojidetola/Udacity-Data-Analysis-ND

This repository holds all the projects for the ALXT Data Analysis Udacity Nanodegree Program

Language: HTML - Size: 6.23 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

harshdeepkalita/Data-Analyst-Mini-Projects

Data Analysis Mini Projects

Language: Jupyter Notebook - Size: 18.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

NdAbdulsalaam/Movie_success_determinants

Performed descriptive analysis movie data set, evaluated trends, established facts said by the data and advised on best practices when a new movie is to be released

Language: Jupyter Notebook - Size: 3.08 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

yabiola/udacity-data-analyst-projects

This repository contains 3 projects that were carried out and submitted for my ALX Udacity Data Analyst Course

Language: HTML - Size: 25.8 MB - Last synced at: 6 months ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

anna-ringwood/cleaning-deduplicating-donor-data

This repository holds the code files used in an undergraduate data wrangling project from March - August 2021.

Size: 40 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

dell-datascience/Data_Analytics

Albert Dellor - Data Analyst Project Portfolio

Language: Jupyter Notebook - Size: 113 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 1

lethuyngocan/EDA-Projects

I showcase 3 distinct projects that apply EDA techniques to serve diverse business objectives.

Language: Jupyter Notebook - Size: 1.77 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

UdaykiranEstari/DataPreparation-Experimental

Python-based data cleaning and wrangling project showcasing preprocessing steps. Handles missing values, transforms data, removes duplicates, treats outliers, and performs feature engineering for enhanced analysis. Validates data quality and consistency. Dataset from YouTube channel. Size: 76,378 rows, 9 columns.

Language: Jupyter Notebook - Size: 9.1 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

gebenner1/CIS5450BigDataProject

Data wrangling, analysis, and prediction for Amazon book reviews.

Language: Jupyter Notebook - Size: 481 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

marbel89/AI-incidents_Wrangling

Wrangles an AI incident data set for easier visualization purposes

Language: Jupyter Notebook - Size: 5.36 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

Benazir023/Cyclistic

Analyzing Capstone Project (Cyclistic)

Language: R - Size: 617 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

taricov/Python_Wrangling_Messy_Data

I came across a challenge introduced by Shashank Kalanithi. it's a data set in form of a CSV file with an interesting block-like structure. The challenge was to transform this data into the regular form of long thin tables.

Language: Python - Size: 156 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Sheminho/TMDb_movie_data_analysis

Analyzing TMDb_movie website to explore the dataset to answer some questions and represent the data in an interactive way.

Language: Jupyter Notebook - Size: 1.44 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

pandeyankitg/CapstoneAlmabetterEDA

EDA Capstone Project(Almabetter)

Language: Jupyter Notebook - Size: 7.87 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

rkadey/data-wrangling-in-R

Language: R - Size: 3.91 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

SedemBuabs/Udacity-Project-Investigate-a-Dataset

Udacity Data Analyst Nanodegree Project : Investigate a Dataset

Language: HTML - Size: 5.49 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

s-njeru/Data-Wrangling

Data Wrangling Project from the Udacity Data Analytics Nano Degree

Language: HTML - Size: 2.61 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

dorothy-nguyen/WeRateDogs-Twitter-Data-Wrangling

This project corresponds to data wrangling project within Udacity Data Analyst Nanodegree.

Language: HTML - Size: 2.61 MB - Last synced at: 8 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

brianmaleek/project_workspace_2_tweepy

Wrangling and analyzing we rate dogs twitter account which rates people's dogs with a humorous comment about the dog.

Language: Jupyter Notebook - Size: 2.57 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

andrewmuhoro/WeRateDogs

WeRateDogs is a twitter account which share dog images and write a brief panegyric about the dog, then they let their followers to rate it by favoriting it. The goal is to go through the whole data analysis process โ€” collecting the data, cleaning the data, analyzing the data and finally visualizing the data with emphasis on data wrangling.

Language: Jupyter Notebook - Size: 6.45 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

kfrawee/WeRateDogs

Gathering data from a variety of sources and in a variety of formats, assessing its quality and tidiness, then cleaning it. Showcase wrangling efforts through analysis and visualizations.

Language: Jupyter Notebook - Size: 3.84 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 4

jarsonX/Data_Analysis

Tasks and small projects related to data analysis. Mostly automatisation and data wrangling.

Language: Jupyter Notebook - Size: 5.51 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

Irene-arch/TMDB-Movies-Dataset

This project is part of the projects in the Data Analyst NanoDegree Program from Udacity

Language: HTML - Size: 3.25 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

Tola-adelase/Wrangle-and-Analyze-Data-UdacityProject

Udacity Nano degree Project 4. (Wrangle and Analyze Data)

Language: Jupyter Notebook - Size: 1.79 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

yuliianikolaenko/text-network-analysis

Part of the Data Science course project dedicated to Digital Ethics concepts mapping

Language: Jupyter Notebook - Size: 3.23 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1

Krishnkumar542/IBM_Data_Science_Professional_Certification

This repository contains all the resources of the final Capstone Project which is a part of the IBM Data Science Professional Certification.

Language: Jupyter Notebook - Size: 3.24 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

dbmurray/preppin_data_rstats_chapter

This is a repository of #rstats solutions to the Preppin' Data challenges published at preppindata.com

Language: R - Size: 361 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

moriahtaylor1/national_parks_shiny

Shiny Contest 2021 Submission - Biodiversity in U.S. National Parks

Language: R - Size: 324 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

Related Keywords
wrangling-data 58 python 22 data-science 15 data-visualization 13 data 12 cleaning-data 11 numpy 10 exploratory-data-analysis 10 matplotlib 10 pandas 9 data-analysis 9 visualization 9 seaborn 7 python3 7 wrangling-cleaning 7 r 6 twitter-api 6 machine-learning 5 gathering-data 5 jupyter-notebook 4 wrangling 4 statistical-analysis 4 merging-data 3 pandas-dataframe 3 analysis 3 assessing-data 3 machine-learning-algorithms 3 weratedogs 3 eda 3 reporting 3 dataanalytics 3 sql 3 predictive-modeling 3 tidyverse 2 cleaning-dataset 2 logistic-regression 2 tweepy-api 2 statistics 2 random-forest 2 cleaning-data-in-python 2 powerbi 2 explanatory-data-analysis 2 excel 2 presentation-slides 2 udacity-nanodegree 2 datacleaning 2 time-series-analysis 2 pandas-python 2 merge 2 feature-engineering 2 college-assignment 1 data-analysis-python 1 ab-testing 1 tweepy 1 dax 1 hypothesis-testing 1 mini-project 1 udacity-data-analyst-nanodegree 1 analytics 1 tmdb-movie 1 cleaning 1 deduplicate 1 deduplication 1 donations 1 forcasting 1 metrics 1 pivot-tables 1 relation-extraction 1 trend-analysis 1 vizualisation 1 clean-data 1 extract 1 extract-data 1 json 1 storing-data 1 web-scraping 1 dataset 1 arrays 1 exploratory-data-visualizations 1 joins 1 reshaping-datasets 1 postgresql 1 tableau 1 json-api 1 python-library 1 twitter 1 dataanalysis 1 html 1 regular-expressions 1 tweet-data 1 twitter-archive 1 python-beginners 1 python-data-analysis 1 python-exercise 1 python-files 1 dataanalyst 1 network-analysis 1 text-mining 1 final-project 1 ibm-data-science-professional 1