GitHub topics: wrangling-data
bindugayatri02/Real-Estate-Price-Prediction-Project
To import data from multiple sources, clean and wrangle data, perform exploratory data analysis (EDA), and create meaningful data visualizations. I will then predict future trends from data by developing linear, multiple, polynomial regression models & pipelines and learn how to analyzethem.
Language: Jupyter Notebook - Size: 67.4 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

ohspc89/Better_Call_Jin
A repository containing mentoring materials for a Ph.D. student in Neuroscience
Language: MATLAB - Size: 12.6 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Suchi25Sathavara/Data-Wrangling-with-R
Analyzing Road Accidents in Victoria, Australia
Size: 582 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

kevinwood15/Python_ML_KMeans_Project
This project uses the KMeans ML algorithm to identify segments of the broader population that form the core customer base of a company.
Language: Jupyter Notebook - Size: 275 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

DemolisherAA/Data_Wrangling
This repository contains two Jupyter Notebooks focusing on tasks related to data preprocessing and cleaning: Task 1: "Data Cleaning and Preprocessing" Task 2: "Data Loading and Cleaning Workflow"
Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

kevinwood15/Python_ML_Classification_Modeling
This project uses GaussianNB, Random Forest, and AdaBoost Classification Models to predict the income category of individuals with US Census Data
Language: Jupyter Notebook - Size: 156 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

kevinwood15/Python_Twitter_DataWrangling_Project
The main objectives of this project is to wrangle (clean) and analyze twitter data. I deal with some messy data, clean it, then plot some visualizations of the data to analyze it.
Language: Jupyter Notebook - Size: 150 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Shivanikanodia/House-Sales-in-King-Country
Language: Jupyter Notebook - Size: 186 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

rishabhj29/-SWEETGUARD-Unveiling-Your-Diabetes-Destiny-
SWEETGUARD ๐ก๐ โ A data-driven diabetes risk assessment tool that leverages machine learning and public health datasets to predict individualized diabetes risk scores. Using Python ๐, Power BI ๐, and statistical analysis, this project identifies key lifestyle factors and empowers individuals with personalized health insights.
Language: HTML - Size: 19.2 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

PiotrTymoszuk/trafo
Transformation Toolset for Vectors, Matrices, Lists and Data Frames
Language: R - Size: 86.9 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

comsavvy/WeRateDogs-Wrangling-and-Visualization-Analysis
Analysis on WeRateDogs tweets
Language: HTML - Size: 4.33 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

CaritoRamos/predictive-classification-model-in-r
HOTEL RESERVATION CANCELLATION
Language: HTML - Size: 1.86 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

nafisalawalidris/SpaceX-Falcon-9-first-stage-Landing-Prediction
The project revolves around predicting the successful landing of the Falcon 9 first stage during SpaceX rocket launches. By leveraging the concepts and techniques learned in the specialization, we aim to develop a predictive model that can determine the likelihood of a successful landing.
Language: Jupyter Notebook - Size: 945 KB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 1

sondosaabed/Oil-vs-BigTech-stock-investigation
๐น๐Investigating the oils market prices in addition to the stock market prices between the start of 2001 to the end of 2023. ๐ฐ๐
Language: Jupyter Notebook - Size: 10.2 MB - Last synced at: 2 months ago - Pushed at: 11 months ago - Stars: 5 - Forks: 0

theo-liang/Python-Project-Analysis-for-Instacart
This project involved analyzing Instacart's sales data to understand customer purchasing behaviors and optimize marketing strategies.
Language: Jupyter Notebook - Size: 1.73 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

altamashajaz/Applied-Statistics
This project analyzes customer purchasing behavior using descriptive statistics. It includes data preprocessing, exploratory data analysis, and statistical analysis to uncover patterns and trends. The goal is to optimize marketing strategies and improve offer acceptance rates.
Language: Jupyter Notebook - Size: 1.38 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

GustavoFacincani/My-Code
Scripts that I've used during grad school for data collection, analysis, visualization, cleaning, wrangling, etc., for classes, project reports, and manuscripts.
Language: R - Size: 389 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

mjchimbadzwa/ML-Classification-project
A Simplilearn class project practicing machine learning classification algorithms.
Language: Jupyter Notebook - Size: 3.23 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

Bassamejlaoui/Open-source-Data-Science-couch
A structured 3-year curriculum for data science, covering foundational, intermediate, and advanced topics
Size: 17.6 KB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

tien-duong115/Twitter_doggo_page_analysis
Introduction In this data wrangling project, the goal is to clean up the data quality and tidiness issues using both visual and programmatic assessments
Language: Jupyter Notebook - Size: 1.73 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

surajhari/JPMorgan_Excel_VirtualCaseExperience
Projects and Certifications related to JPMorgan Chase & Co. Excel Skills Virtual Case Experience, a job simulation program offered by JPMorgan on Forage platform.
Size: 117 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Shoh96/ALX-Data-Analyst
This repository contains 3 projects that were carried out and submitted for my ALX Udacity Data Analyst Course
Language: HTML - Size: 9.18 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

hawkfish/textform
A data transformation pipeline library based on Potter's Wheel.
Language: Python - Size: 465 KB - Last synced at: 4 months ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 0

MdTanvirHossainTusher/Sales-Prediction
A linear regression model to predict sales based on advertising costs
Language: Jupyter Notebook - Size: 299 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

anoushirazi/IBM-Data-Science-Capstone-Project
The SpaceX project analysis involves evaluating SpaceX launch data to identify trends and insights related to launch success rates, payload capacities, and the impact of different variables on mission outcomes.
Language: Jupyter Notebook - Size: 63.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

vijayasaravana/Customer_Sales_Analysis_dashboard
Power BI
Size: 2.83 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

imukoki/Wrangle-and-Analyze-Data
Use Python to perform Data Wrangling (Gathering, Assessing, Cleaning) of the WeRateDogs Twitter account and archive, followed by storing, analyzing and visualizing the wrangled data.
Language: Jupyter Notebook - Size: 1.38 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

lolivera0409/Proyecto_Final---Data_Science---CoderHouse2023
Proyecto final del curso Data Science 2023
Language: Jupyter Notebook - Size: 1.81 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

divyansh1195/Halliburton-Landmark-Learning-ML-with-Python
Machine Learning with Python: Halliburton Landmark Learning
Language: Jupyter Notebook - Size: 12.5 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

nourhenehanana/Time-series-analysis
Working with a time series of energy data weโll see how techniques such as time-based indexing, resampling, and rolling windows can help us explore variations in electricity demand and renewable energy supply over time.
Language: R - Size: 363 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

awojidetola/Udacity-Data-Analysis-ND
This repository holds all the projects for the ALXT Data Analysis Udacity Nanodegree Program
Language: HTML - Size: 6.23 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

harshdeepkalita/Data-Analyst-Mini-Projects
Data Analysis Mini Projects
Language: Jupyter Notebook - Size: 18.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

NdAbdulsalaam/Movie_success_determinants
Performed descriptive analysis movie data set, evaluated trends, established facts said by the data and advised on best practices when a new movie is to be released
Language: Jupyter Notebook - Size: 3.08 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

yabiola/udacity-data-analyst-projects
This repository contains 3 projects that were carried out and submitted for my ALX Udacity Data Analyst Course
Language: HTML - Size: 25.8 MB - Last synced at: 6 months ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

anna-ringwood/cleaning-deduplicating-donor-data
This repository holds the code files used in an undergraduate data wrangling project from March - August 2021.
Size: 40 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

dell-datascience/Data_Analytics
Albert Dellor - Data Analyst Project Portfolio
Language: Jupyter Notebook - Size: 113 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 1

lethuyngocan/EDA-Projects
I showcase 3 distinct projects that apply EDA techniques to serve diverse business objectives.
Language: Jupyter Notebook - Size: 1.77 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

UdaykiranEstari/DataPreparation-Experimental
Python-based data cleaning and wrangling project showcasing preprocessing steps. Handles missing values, transforms data, removes duplicates, treats outliers, and performs feature engineering for enhanced analysis. Validates data quality and consistency. Dataset from YouTube channel. Size: 76,378 rows, 9 columns.
Language: Jupyter Notebook - Size: 9.1 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

gebenner1/CIS5450BigDataProject
Data wrangling, analysis, and prediction for Amazon book reviews.
Language: Jupyter Notebook - Size: 481 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

marbel89/AI-incidents_Wrangling
Wrangles an AI incident data set for easier visualization purposes
Language: Jupyter Notebook - Size: 5.36 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

Benazir023/Cyclistic
Analyzing Capstone Project (Cyclistic)
Language: R - Size: 617 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

taricov/Python_Wrangling_Messy_Data
I came across a challenge introduced by Shashank Kalanithi. it's a data set in form of a CSV file with an interesting block-like structure. The challenge was to transform this data into the regular form of long thin tables.
Language: Python - Size: 156 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Sheminho/TMDb_movie_data_analysis
Analyzing TMDb_movie website to explore the dataset to answer some questions and represent the data in an interactive way.
Language: Jupyter Notebook - Size: 1.44 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

pandeyankitg/CapstoneAlmabetterEDA
EDA Capstone Project(Almabetter)
Language: Jupyter Notebook - Size: 7.87 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

rkadey/data-wrangling-in-R
Language: R - Size: 3.91 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

SedemBuabs/Udacity-Project-Investigate-a-Dataset
Udacity Data Analyst Nanodegree Project : Investigate a Dataset
Language: HTML - Size: 5.49 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

s-njeru/Data-Wrangling
Data Wrangling Project from the Udacity Data Analytics Nano Degree
Language: HTML - Size: 2.61 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

dorothy-nguyen/WeRateDogs-Twitter-Data-Wrangling
This project corresponds to data wrangling project within Udacity Data Analyst Nanodegree.
Language: HTML - Size: 2.61 MB - Last synced at: 8 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

brianmaleek/project_workspace_2_tweepy
Wrangling and analyzing we rate dogs twitter account which rates people's dogs with a humorous comment about the dog.
Language: Jupyter Notebook - Size: 2.57 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

andrewmuhoro/WeRateDogs
WeRateDogs is a twitter account which share dog images and write a brief panegyric about the dog, then they let their followers to rate it by favoriting it. The goal is to go through the whole data analysis process โ collecting the data, cleaning the data, analyzing the data and finally visualizing the data with emphasis on data wrangling.
Language: Jupyter Notebook - Size: 6.45 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

kfrawee/WeRateDogs
Gathering data from a variety of sources and in a variety of formats, assessing its quality and tidiness, then cleaning it. Showcase wrangling efforts through analysis and visualizations.
Language: Jupyter Notebook - Size: 3.84 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 4

jarsonX/Data_Analysis
Tasks and small projects related to data analysis. Mostly automatisation and data wrangling.
Language: Jupyter Notebook - Size: 5.51 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

Irene-arch/TMDB-Movies-Dataset
This project is part of the projects in the Data Analyst NanoDegree Program from Udacity
Language: HTML - Size: 3.25 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

Tola-adelase/Wrangle-and-Analyze-Data-UdacityProject
Udacity Nano degree Project 4. (Wrangle and Analyze Data)
Language: Jupyter Notebook - Size: 1.79 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

yuliianikolaenko/text-network-analysis
Part of the Data Science course project dedicated to Digital Ethics concepts mapping
Language: Jupyter Notebook - Size: 3.23 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1

Krishnkumar542/IBM_Data_Science_Professional_Certification
This repository contains all the resources of the final Capstone Project which is a part of the IBM Data Science Professional Certification.
Language: Jupyter Notebook - Size: 3.24 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

dbmurray/preppin_data_rstats_chapter
This is a repository of #rstats solutions to the Preppin' Data challenges published at preppindata.com
Language: R - Size: 361 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

moriahtaylor1/national_parks_shiny
Shiny Contest 2021 Submission - Biodiversity in U.S. National Parks
Language: R - Size: 324 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0
