Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: datawrangling

saulpw/visidata

A terminal spreadsheet multitool for discovering and arranging data

Language: Python - Size: 51.5 MB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 7,462 - Forks: 272

sondosaabed/Data-Analyst-Nanodegree

I aquired a full scholarship from Google Launchpad. Advanced data wrangling skills to work with messy, complex real-world datasets. Highly customized visualizations using the Matplotlib Python library

Language: Jupyter Notebook - Size: 11 MB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 1 - Forks: 0

emmatoms/Study_Projects

This include all my data analysis study works and assignments.

Language: Jupyter Notebook - Size: 1020 KB - Last synced: 12 days ago - Pushed: 12 days ago - Stars: 0 - Forks: 0

AKSHATNAREDI/HR-Data-Analytics

Designed a dashboard to track employee data for the HR team including attendence, working hours and leaves. This dashboard can streamline the HR processes and also can save the HR team 3-4 hours of work daily.

Size: 868 KB - Last synced: 20 days ago - Pushed: 20 days ago - Stars: 1 - Forks: 0

chauhanhimanc9/Social-Buzz-Suad

Power BI Project

Size: 4.39 MB - Last synced: 23 days ago - Pushed: 23 days ago - Stars: 0 - Forks: 0

FlorenciaBezmalinovich/Proyectos_ML

Aquí compartiré y documentaré mi aprendizaje en el análisis predictivo utilizando ML y prácticas de Python. Variando desde simples ejercicios hasta proyectos prácticos. Cada proyecto incluye archivos de soporte y notebooks con código y análisis y las prácticas sus enunciados comentado al inicio.

Language: Jupyter Notebook - Size: 31.8 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

Chandramanoj/Terros-real-estate-agency

The objective of this study is to map all the relevant features for the properties along with the information related to the geography around it, to estimate the value of a particular property/house.

Size: 359 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

Chandramanoj/HBFC-bank-personal-loan

The aim of this project is to improve of taking personal loans by using EDA and statistical measures

Size: 1.77 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

codotype/codotype-mongodb-scripts-generator

:leaves: Codotype generator for scripts to populate MongoDB with CSV and JSON files

Language: JavaScript - Size: 14.6 KB - Last synced: about 1 month ago - Pushed: over 5 years ago - Stars: 0 - Forks: 1

yessasvini23/IBM_Data_Science_-Capstone-Project-Wining-Space-Race.ipnyb

IBM Data Science Capstone Project from Coursera

Language: Jupyter Notebook - Size: 6.2 MB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

amiegirl/WeRateDogs

Analyzing and visualizing is the tweet archive of Twitter user @dog_rates, also known as WeRateDogs.

Language: Jupyter Notebook - Size: 438 KB - Last synced: 21 days ago - Pushed: 9 months ago - Stars: 0 - Forks: 0

kosson/sva21

Acest repo conține materiale, seturi de date și soluții care au fost folosite în cadrul Școlii de vară Astra, prima ediție, 2021

Size: 3.57 MB - Last synced: 2 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

nzsaurabh/aggregatingdata

Aggregate data in R using simple SQL commands

Language: R - Size: 5.57 MB - Last synced: 2 months ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0

MishraCo/FinanceDataViz-StockPrices-

Data Wrangling and Visualizing Stock Data from Alpha Vantage API - Apple Stocks

Language: Jupyter Notebook - Size: 59.6 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 0 - Forks: 0

msamhz/Kaggle_WorldMortalityRates

Visualising World Mortality Rates

Language: Jupyter Notebook - Size: 2.29 MB - Last synced: 2 months ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

jl3392/Data-Wrangling-practice

Data wrangling using python and SQL

Language: Jupyter Notebook - Size: 599 KB - Last synced: 2 months ago - Pushed: almost 7 years ago - Stars: 0 - Forks: 0

tiangenglu/utilities

data wrangling examples

Language: Python - Size: 22.5 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

hzoscar/ds_challenge_2_Data_Insider

The primary objective was to produce a detailed report to objectively understand the behaviour of corporations by using mainly the previous nine editions of Forbes 2000. This involved merging datasets, analyzing discrepancies, and providing insights through data visualization techniques.

Language: Jupyter Notebook - Size: 24.2 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

nischalshrestha/Unravel

A fluent code explorer for R. 🔍

Language: R - Size: 6.57 MB - Last synced: 3 months ago - Pushed: 10 months ago - Stars: 100 - Forks: 3

ShabnaNasser/Capstone_Two

Healthcare Provider Fraud Detection Analysis and Prediction

Language: Jupyter Notebook - Size: 10.2 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

PramodRawat157/Data-Analysis-with-Python---IBM-Data-Science

To import data from multiple sources, clean and wrangle data, perform exploratory data analysis (EDA), and create meaningful data visualizations. I will then predict future trends from data by developing linear, multiple, polynomial regression models & pipelines and learn how to evaluate them.

Language: Jupyter Notebook - Size: 9.02 MB - Last synced: 4 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 1

mansenfranzen/pywrangler

Advanced data wrangling for python

Language: Python - Size: 521 KB - Last synced: 2 months ago - Pushed: 9 months ago - Stars: 11 - Forks: 4

AbdelRahman-AboulEla/Analyze-AB-Test-Results

Language: Jupyter Notebook - Size: 5.86 MB - Last synced: 4 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

simran08udhani/RLabBasedProject

Explore a sophisticated R Shiny app, the "Global Economic Dashboard," analyzing 20 countries' economic data (2012–2022) with advanced 2D/3D visualization and a predictive Regression ML model for Education Expenditure (%GDP). Do check PowerBI Dashboard for the same!

Language: R - Size: 1.16 MB - Last synced: 4 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0

Kush-Trivedi/Logistic-Regression-K-NN-for-Heart-Attack

Various predictor factors to try to generate a forecast about heart disease patients and Logistic regression and K-Nearest Neighbor to develop a model to predict whether the patients have heart disease or not for the analysis, Finally Some basic visualizations.

Language: R - Size: 7.81 KB - Last synced: 5 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

abs711/pyspark_stuff

Language: Jupyter Notebook - Size: 1.49 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

wernerdassuncao/data_wrangling

Data Science course exercises, textbook chapter 20 and forth

Language: R - Size: 1.53 MB - Last synced: 5 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

3liud/bike-sharing-analysis

A data analysis project to explore bike sharing, and what factors impact bike sharing in Washington, D.C., USA, for the period between January 1, 2011, and December 31, 2012. Data source: https://archive.ics.uci.edu/ml/datasets/Bike+Sharing+Dataset#

Language: Jupyter Notebook - Size: 5.3 MB - Last synced: 5 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

algorithmwatch/dataskop-scrapers 📦

Scrapers, parsers, data wrangling and utilities for TikTok and YouTube

Language: TypeScript - Size: 2.97 MB - Last synced: about 1 month ago - Pushed: about 1 year ago - Stars: 3 - Forks: 0

parthalalit/Israeli-Palestinian-Conflict

Extensively worked on data cleaning and data wrangling to analyze the number of deaths between years 2000-2023 with the help of pie charts, bar charts, word cloud and heat maps. Used time series analysis to check if the events of death follow stationarity and forecasted the trends.

Language: R - Size: 2.26 MB - Last synced: 4 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

Ashish25/Wrangling

This project is aimed to use data munging techniques, such as assessing the quality of the data for validity, accuracy, completeness, consistency and uniformity, to clean the OpenStreetMap data for a part of the world.

Language: HTML - Size: 353 KB - Last synced: 6 months ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0

MahbubHossainFaisal/Metahuman_Clash_Consultant

A content based recommendation system project based on Metahumans

Language: Jupyter Notebook - Size: 40.3 MB - Last synced: 4 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

iamkd99/Data-Analysis-Projects

Projects related to Data Analysis

Language: Jupyter Notebook - Size: 13.2 MB - Last synced: 6 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

thai22011/NLP_Hotel_Review

Natural Language Processing with Hotel Reviews on Booking.com

Language: Jupyter Notebook - Size: 49.1 MB - Last synced: 6 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

beingbvh/Singapore_Resale_Flat_Prices_Predicting

The objective of this project is to develop a machine learning model and deploy it as a user-friendly web application that predicts the resale prices of flats in Singapore.

Language: Jupyter Notebook - Size: 804 KB - Last synced: 4 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

AnAriz101/Portfolio

Versatile data analyst skilled in extracting actionable insights from complex datasets. Proficient in statistical analysis, data visualization, and trend identification. Proven track record in transforming raw data into strategic business recommendations.

Language: Jupyter Notebook - Size: 4.02 MB - Last synced: 4 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

lf-bf/IBM-Data-Science-Capstone-SpaceX

In this project, we predicted if the Falcon 9 first stage will land successfully by following the data science methodology.

Language: Jupyter Notebook - Size: 452 KB - Last synced: 4 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

raghavendranhp/Attrition-Alchemy

This project uses machine learning to predict and analyze employee attrition in Company.By developing three predictive models,it identifies key factors influencing turnover,providing actionable insights to mitigate attrition challenges.The analysis focuses on enhancing job satisfaction,work-life balance and career growth opportunities.

Language: Jupyter Notebook - Size: 15.7 MB - Last synced: 4 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

haojing9058/MS-Thesis--Explaining-patterns-of-child-malnutrition-in-Uganda

Size: 4.13 MB - Last synced: 7 months ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0

haojing9058/Comparison-of-Regression-Machine-Learning-Algorithms-of-Explaining-Students-Academic-Performance

Language: Jupyter Notebook - Size: 1.58 MB - Last synced: 7 months ago - Pushed: over 6 years ago - Stars: 4 - Forks: 5

patmendoza330/annotationwrangling_python

Datawrangling in Python using pandas

Language: Python - Size: 3.73 MB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

patmendoza330/animelistclean

Clean Data from MyAnimeList API

Size: 8.79 KB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

Elaine-AL/Arifu-housing_trainings

Data wrangling and analytics of housing trainings dataset from Arifu.

Language: HTML - Size: 19.9 MB - Last synced: 7 months ago - Pushed: about 5 years ago - Stars: 0 - Forks: 0

patmendoza330/geneontologyconversion

Oftentimes, we come across data that isn't in the form that we need to make joins, when that happens, we can convert those using simple python scripts I use the gene ontology OBO format and convert it into tabular format for making joins with other tables using this python script.

Language: Python - Size: 4.21 MB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

patmendoza330/annotationwrangling

Converting and integrating data from multiple sources is often tricky business. Luckily there are some great tools available that make this a breeze. I use a genetic annotation file (Brachypodium) and incorporate gene ontology definitions. This Uses dplyr and tidyr to do the data wrangling.

Size: 3.73 MB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

NikhilaThota/CapstoneProject_House_Prices_Prediction

Understand the relationships between various features in relation with the sale price of a house using exploratory data analysis and statistical analysis. Applied ML algorithms such as Multiple Linear Regression, Ridge Regression and Lasso Regression in combination with cross validation. Performed parameter tuning, compared the test scores and suggested a best model to predict the final sale price of a house. Seaborn is used to plot graphs and scikit learn package is used for statistical analysis.

Language: Jupyter Notebook - Size: 7.91 MB - Last synced: 7 months ago - Pushed: over 6 years ago - Stars: 18 - Forks: 13

MezbanS/Customer-Service-Requests-Analysis.

Perform data analysis of service request (311) calls from New York City. I have utilized data wrangling techniques to understand the pattern in the data and visualize the major types of complaints.

Language: HTML - Size: 554 KB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

JESUSC1/IMDb-Data-Analysis-Exercise-Part-1

Utilized Python tools to delve into IMDb data, uncovering trends in title releases and viewer preferences. Visualized patterns in genres and title runtimes, offering insights into evolving media consumption. Applied regression models like linear, polynomial, and random forest to predict title ratings, revealing factors impacting viewer choices.

Language: Jupyter Notebook - Size: 1.07 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

JESUSC1/Chicago-Public-Libraries-Data-Analysis

Utilized Python tools to map Chicago libraries and identify post-2020 visitation declines, likely impacted by COVID-19. Conducted statistical tests to validate these trends and provided data-driven insights for enhancing user engagement. Offered a comprehensive view of library dynamics, aiding stakeholders in future planning.

Language: HTML - Size: 4.71 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

SQLicious/Alteryx-Project-Ecommerce-Transaction-Cohort-Analysis

Exploring E-commerce Transactions: Data Cleaning, Cohort Analysis, and Business Insights generation using Alteryx Designer Core.

Size: 1.74 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0

pranshu1921/Reducing-Traffic-Mortality-USA

Reducing Traffic Mortality rates in USA

Language: Jupyter Notebook - Size: 102 KB - Last synced: 8 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 3

gaju45/Diwali-Sales-Analysis

Explore Diwali Sales Data Analysis: a project dissecting 11,251 records with 15 columns to uncover customer demographics and buying trends during the festive season. The objective? Understand customer behavior, identify key demographics and product categories driving sales. Utilizing Python libraries like Pandas, NumPy, Matplotlib, and Seaborn, thi

Language: Jupyter Notebook - Size: 480 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0

gaju45/housing-data-EDA-project

Explore the Housing Data EDA Project, an in-depth analysis of a housing dataset. Discover trends, correlations, and factors influencing housing prices. Gain valuable insights for individuals and real estate professionals.

Language: Jupyter Notebook - Size: 1.07 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0

JonathanCristovao/Computer-Vision-Python-OpenCV

This repository presents image manipulation techniques with the OpenCV library, from basic to advanced.

Language: Jupyter Notebook - Size: 1.08 MB - Last synced: 9 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

MahbubHossainFaisal/Exploratory_Data_Analysis_on_Superheroes

Exploratory data analysis on superheroes

Language: Jupyter Notebook - Size: 168 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0

Nzavo/Data-Science-Capstone-Project

In this project, we predicted if the Falcon 9 first stage will land successfully by following the data science methodology.

Language: Jupyter Notebook - Size: 1.35 MB - Last synced: 9 months ago - Pushed: about 1 year ago - Stars: 2 - Forks: 0

weihan07/Modified-Retail-Food-Environment-Index Fork of AbrahamLimBingSern/Modified-Retail-Food-Environment-Index

Data Wrangling Group Project

Language: Jupyter Notebook - Size: 3.31 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0

NMangera/Udacity_DataAnalysis_Nanodegree

Language: Jupyter Notebook - Size: 9.58 MB - Last synced: 9 months ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 0

siglimumuni/my_projects

A portfolio of projects spanning data wrangling, exploratory data analysis, insight analytics and machine learning in Python and R.

Language: Jupyter Notebook - Size: 10.1 MB - Last synced: 9 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

sanithps98/Automobile-Dataset-Analysis

This project analyzes and visualizes the Used Car Prices from the Automobile dataset in order to predict the most probable car price

Language: Jupyter Notebook - Size: 805 KB - Last synced: 8 months ago - Pushed: almost 3 years ago - Stars: 40 - Forks: 34

adamcorren/horse_racing_project_report

Applying Exploratory Analysis to Outperform Sports Exchanges

Size: 8.03 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0

deivanand/Next-Word-Prediction

Next Word Prediction Model built deployed using FLASK API and was implemented by Bi-LSTM model with attention layer.

Language: Jupyter Notebook - Size: 50.1 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0

ujunwa-DS/Fifa-women

To draw insight from the FIFA women's championship. Total number of goals, total number of games, average goals per game, assists, penalties, red and yellow card, goals per 90 mins, top scoring countries was determined.

Size: 8.73 MB - Last synced: 9 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

SindiAI/WeRateDogs

This repository provide an overview of the data wrangling process used for the WeRateDogs Twitter account dataset. The data wrangling process included data gathering, assessment, and cleaning to ensure the dataset was free of quality and tidiness issues.

Language: Jupyter Notebook - Size: 52.7 KB - Last synced: 10 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

iban121/PotentialTalent

Develop a ranked and re-ranked database of potential candidates based on user input.

Language: Jupyter Notebook - Size: 980 KB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0

UBC-MDS/DSCI525_Group14

Web and Cloud Computing

Language: HTML - Size: 2.04 MB - Last synced: 10 months ago - Pushed: about 3 years ago - Stars: 1 - Forks: 1

Aisha-Ojey/WeRateDogs-Data-Wrangling-and-Analysis-Project

Explore the journey of data wrangling and analysis in the WeRateDogs project. Using Python, gather, assess, and clean data from the Twitter archive of @dog_rates. Unveil insights through visualizations and uncover trends like decreasing retweets over time. For details, refer to wrangle_act.ipynb and wrangle_report.pdf.

Language: Jupyter Notebook - Size: 2.29 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 1 - Forks: 0

Aisha-Ojey/TMDB-Movie-Dataset-Analysis

This analysis examines a dataset of 10,000 movies from a movie database, revealing insights and trends in the industry. Notably, drama is the most popular genre, and factors like budget and popularity impact revenue. However, limited data, replaced null values, outliers, and correlation-causation considerations call for cautious interpretation.

Language: Jupyter Notebook - Size: 3.68 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 1 - Forks: 0

SouRitra01/CCE-IIT-Madras-DSAI

Data Science and Artificial Intelligence advanced certification course led by the IIT Madras & Intellipaat

Language: Jupyter Notebook - Size: 82.8 MB - Last synced: 10 months ago - Pushed: over 1 year ago - Stars: 6 - Forks: 7

theShmoo/DataWranglingScripts

Scripts for data wrangling operations

Language: Python - Size: 7.81 KB - Last synced: 10 months ago - Pushed: almost 7 years ago - Stars: 0 - Forks: 0

alasdairgm/dirty_data_project

A collection of 'dirty' datasets that I cleaned and analysed

Language: HTML - Size: 14 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0

imRishabhGupta/Data-Wrangling

This repo contains the code to download data and then extract it, if needed, and store it in a pickle file.

Language: Python - Size: 4.88 KB - Last synced: 10 months ago - Pushed: about 7 years ago - Stars: 0 - Forks: 0

bpriantti/data_wralling_python_pandas

Repositório visa abordar os principais metodos de Data Wralling com Pandas utilizando a linguagem python.

Language: Jupyter Notebook - Size: 98.6 KB - Last synced: 10 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

EmanueleCannizzaro/udacity_data_wrangling_mongodb Fork of udacity/ud032

Data Wrangling with MongoDB class code

Language: Jupyter Notebook - Size: 20.3 MB - Last synced: 10 months ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0

marvin-rubia/Automating-Basic-Data-Wrangling-with-a-Python-Function

This repository provides the Python function I created to automate basic dataset cleaning. With one line of code, a given dataframe can be cleaned by the said function.

Language: Jupyter Notebook - Size: 123 KB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0

weismanm12/finances-database

Personal finance database creation, SQL analysis, and Power BI dashboard

Language: Jupyter Notebook - Size: 1.48 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0

AliValiyev/CEIT418-Data-Science

These are my solutions for the labs and assignments for Data Science course.

Language: Jupyter Notebook - Size: 16.7 MB - Last synced: 10 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

adamcorren/fantasy_football_project_report

Using Machine Learning to Mitigate Human Bias: Fantasy Football

Language: Jupyter Notebook - Size: 14.7 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 1

adamcorren/fantasy_football_data_wrangling_pandas

FPL 1 - Wrangling data to create a data set ready for machine learning

Language: Jupyter Notebook - Size: 1 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0

WMF07/WalmartSalesAnalysis---MySQL

Data analysis to gain insight into the sales data of Walmart to understand the different factors that affect sales of the different branches.

Size: 2.93 KB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0

hassanmujtaba7/DataAnalytics_PortfolioProject

SQL & Tableau - Portfolio Project

Size: 6.56 MB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0

NAVEENDATAANALYST/Medical-Appointment-No-Show

You can find the dataset in kaggle

Language: Jupyter Notebook - Size: 3.2 MB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0

bvnarayanan/Reddit-Weep_Sentiment-Analysis_Report

We analyzed sentiment towards US private and public universities using VADER Sentiment Analysis and Reddit data. Explored common words used by students to understand topics discussed. Compared sentiment changes pre-pandemic and during the pandemic, examining differences between public and private universities.

Language: Jupyter Notebook - Size: 892 KB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0

Jayplect/Pymaceuticals

The purpose of this project was to compare the performance of Pymaceuticals’ drug of interest, Capomulin, against the other treatment regimens.

Language: Jupyter Notebook - Size: 479 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 1 - Forks: 0

Benazir023/NYC_schools_perceptions_analysis

The analysis seeks to understand how the perceptions of schools affect performance and demographics and vice versa

Size: 15.8 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

mattlomba/italian_emigrants22

geographical data visualization showing italians moving abroad in the year 2022.

Size: 1.67 MB - Last synced: 4 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

PramodRawat157/IBM_APPLIED_DATA_SCIENCE_CAPSTONE_PROJECT-

IBM APPLIED DATA SCIENCE CAPSTONE PROJECT

Language: Jupyter Notebook - Size: 9.51 MB - Last synced: 4 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

ShrishtiHore/Quantium_Virtual_Experience_Program

This repository is a collection of all the solutions of tasks that were assigned to me during my Data Analytics Virtual Internship Experience Program at Quantium. 💻📚📊

Language: Jupyter Notebook - Size: 18.6 MB - Last synced: about 1 year ago - Pushed: about 3 years ago - Stars: 3 - Forks: 13

taricov/Python_Wrangling_Box_Office_Data

This is a pandas test for a data science job. The solution here is in form of a notebook beside the main Python file.

Language: Python - Size: 23.4 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

taricov/Python_Wrangling_Messy_Data

I came across a challenge introduced by Shashank Kalanithi. it's a data set in form of a CSV file with an interesting block-like structure. The challenge was to transform this data into the regular form of long thin tables.

Language: Python - Size: 156 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

NikithaKaredla/Opinion-ming-on-sales-prediction-using-web-scraping

Language: HTML - Size: 329 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

waveform80/structa

A small utility for analyzing data structures (e.g. JSON files)

Language: Python - Size: 495 KB - Last synced: about 2 months ago - Pushed: about 1 year ago - Stars: 4 - Forks: 1

Didilish/WorldQuant-University-Data-Science-projects

I have successfully completed a 16-week and 8 end-to-end, applied data science projects of the Applied Data Science Lab module at WorldQuant University. The mini-projects included scientific computing, data wrangling, machine learning and natural language processing with Python.

Size: 0 Bytes - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

adaobionyeakagbu/we_rate_dogs_wrangling

This project involves wrangling, analyzing and visualizing the tweet archive of Twitter user @dog_rates, also known as WeRateDogs. WeRateDogs is a Twitter account that rates people's dogs with a humorous comment about the dog. These ratings almost always have a denominator of 10.

Language: Jupyter Notebook - Size: 1.83 MB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 0

samchak18/Playstore-App-Review-Analysis-EDA

AlmaBetter Capstone Project -Exploratory data analysis (EDA) is used by data scientists to analyze and investigate data sets and summarize their main characteristics, often employing data visualization methods.

Language: Jupyter Notebook - Size: 10.5 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

HugoTokairim/IBM-Data-Analyst-Capstone-Project

I was given a real-world business problem to analyze and had to utilize my skills to tackle it. The experience allowed me to put my knowledge into practice.

Language: Jupyter Notebook - Size: 363 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

kumod007/Data-Profilling

DATA PROFILING is a process of examining, analyzing, and creating useful summaries of data. The process yields a high-level overview which aids in the discovery of data quality issues, risks, and overall trends.

Language: Jupyter Notebook - Size: 43 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

bamidele42/Udacity_Nanodegree

This repo contain the projects for Udacity Nanodegree program.

Language: Jupyter Notebook - Size: 1.44 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

iPhiliph/Data-Wrangling-OpenStreetMap

通过Python对OpenStreetMap的数据集进行整理和清洗。

Language: Jupyter Notebook - Size: 50.5 MB - Last synced: about 1 year ago - Pushed: over 6 years ago - Stars: 5 - Forks: 3

PsyTeachR/quant-fun-v2

Fundamentals of Quantitative Analysis

Language: TeX - Size: 24.8 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 4 - Forks: 8