Topic: "datacleaning"
OpenRefine/OpenRefine
OpenRefine is a free, open source power tool for working with messy data and improving it
Language: Java - Size: 388 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 11,657 - Forks: 2,108
great-expectations/great_expectations
Always know what to expect from your data.
Language: Python - Size: 230 MB - Last synced at: 4 days ago - Pushed at: 6 days ago - Stars: 11,013 - Forks: 1,653
sfu-db/dataprep
Open-source low-code data preparation library in Python. Collect, clean, and visualize your data in Python with a few lines of code.
Language: Python - Size: 214 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 2,203 - Forks: 219
yobulkdev/yobulkdev
🔥 🔥 🔥 Open Source & AI-driven Data Onboarding Platform: free flatfile.com alternative
Language: JavaScript - Size: 973 KB - Last synced at: 8 months ago - Pushed at: over 2 years ago - Stars: 895 - Forks: 45
DataCanvasIO/HyperGBM
A full pipeline AutoML tool for tabular data
Language: Python - Size: 11 MB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 355 - Forks: 47
sharmaroshan/Twitter-Sentiment-Analysis
A natural language processing project in which sentiment analysis is performed by classifying positive and negative tweets with machine learning models, covering classification, text mining, text analysis, data analysis, and data visualization.
Language: Jupyter Notebook - Size: 2.77 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 168 - Forks: 114
DataKitchen/data-observability-installer
Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team is the first to know and the first to solve with visibility across and down your data estate. Save time with simple, fast data quality test generation and execution. Trust your data, tools, and systems end to end.
Language: Python - Size: 237 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 128 - Forks: 12
imdevskp/covid_19_jhu_data_web_scrap_and_cleaning
This repository contains data and code used to get and clean data from https://github.com/CSSEGISandData/COVID-19 and https://www.worldometers.info/coronavirus/
Language: Jupyter Notebook - Size: 123 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 91 - Forks: 94
prasanthg3/cleantext
An open-source package for python to clean raw text data
Language: Python - Size: 27.3 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 72 - Forks: 11
benchopt/benchmark_bilevel
Benchmark for bi-level optimization solvers
Language: Python - Size: 382 KB - Last synced at: 5 months ago - Pushed at: 7 months ago - Stars: 48 - Forks: 10
imdevskp/covid-19-india-data
Data and code for scraping and cleaning data on COVID-19 in India from https://www.mohfw.gov.in/ and https://www.covid19india.org/
Language: Jupyter Notebook - Size: 21.1 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 38 - Forks: 80
data-cleaning/validatedb
Validate on a table in a DB, using dbplyr
Language: R - Size: 623 KB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 33 - Forks: 3
RonKG/Machine-Learning-Projects-2
Language: HTML - Size: 12.1 MB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 24 - Forks: 18
DemonDamon/tongdaxin-futures-data-clearing-database-operation
Deduplicates and cleans Tongdaxin data and stores it in MongoDB for later research
Language: Python - Size: 854 KB - Last synced at: almost 3 years ago - Pushed at: over 7 years ago - Stars: 19 - Forks: 15
nirala96/Bangalore-House-Prediction-App
Predicts home prices of Bangalore. Used Flutter, Flask and Jupyter Notebook.
Language: Jupyter Notebook - Size: 423 KB - Last synced at: 7 months ago - Pushed at: over 4 years ago - Stars: 18 - Forks: 0
mne-tools/mne-denoise
mne-denoise provides narrow-band artefact removal tailored to MNE-Python workflows. It wraps harmonic regression techniques to suppress power-line noise and other oscillatory contaminants while preserving signal rank and interpretability.
Language: Python - Size: 27.2 MB - Last synced at: 7 days ago - Pushed at: 2 months ago - Stars: 17 - Forks: 4
theodi/OpenRefine-WS
Code to enable OpenRefine to run as an authenticated web service
Language: JavaScript - Size: 272 KB - Last synced at: about 1 year ago - Pushed at: over 11 years ago - Stars: 16 - Forks: 2
ahmadjaved97/ImageClusterViz
A tool for clustering images using deep learning features and visualizing the results in organized grids.
Language: Python - Size: 22.2 MB - Last synced at: 16 days ago - Pushed at: 19 days ago - Stars: 15 - Forks: 0
sayaliwalke30/Kaggle-Projects
This repo contains 4 different projects. Built various machine learning models for Kaggle competitions. Also carried out Exploratory Data Analysis, Data Cleaning, Data Visualization, Data Munging, Feature Selection etc
Language: Jupyter Notebook - Size: 4.03 MB - Last synced at: almost 3 years ago - Pushed at: over 4 years ago - Stars: 14 - Forks: 11
EastTower16/LLMDataDistill
distill large scale web page text
Language: C++ - Size: 1.5 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 12 - Forks: 1
ropensci/excluder
Checks for Exclusion Criteria in Online Data
Language: R - Size: 947 KB - Last synced at: 7 days ago - Pushed at: 6 months ago - Stars: 9 - Forks: 5
ironmussa/Optimus-examples 📦
Examples for Optimus, a data cleansing library for big data.
Size: 925 KB - Last synced at: almost 2 years ago - Pushed at: about 8 years ago - Stars: 9 - Forks: 4
ShrishtiHore/Weapons-Detection-in-Real-Time-Surveillance-Videos
This project aims to minimize the police response time by detecting weapons through a live CCTV camera feed. So it alerts the police as soon as it detects any sort of weapons. In our project we are focusing on guns primarily. 🔫💣💻🎥
Language: Jupyter Notebook - Size: 43.1 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 7 - Forks: 1
Ronlee12355/kaggle-with-R
All kaggle datasets and the R codes
Language: HTML - Size: 59.8 MB - Last synced at: 5 months ago - Pushed at: about 5 years ago - Stars: 7 - Forks: 6
allenlsj/Spark-lean Fork of qltf8/1004_LPL_project
Spark-lean, an interactive PySpark-based Data Cleaning Library
Language: Python - Size: 1.97 MB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 7 - Forks: 0
rojaAchary/Data_Preprocessing_Techniques
⚒️ Data preprocessing is the process of transforming raw data into an understandable format. It is also an important step in data mining as we cannot work with raw data. The quality of the data should be checked before applying machine learning or data mining algorithms
Language: Jupyter Notebook - Size: 88.9 KB - Last synced at: almost 3 years ago - Pushed at: about 4 years ago - Stars: 6 - Forks: 2
kkverma/Twitter-Sentiment-Analysis
A basic machine learning model built in a Python Jupyter notebook to classify a set of tweets into two categories: racist/sexist and non-racist/sexist.
Language: Jupyter Notebook - Size: 6.34 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 6 - Forks: 2
Nelson-Gon/mde
mde: Missing Data Explorer
Language: R - Size: 1.37 MB - Last synced at: 16 days ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 4
Salaah01/pandas-data-cleaner
A package to aid with data cleaning using pandas.
Language: Python - Size: 27.3 KB - Last synced at: 5 months ago - Pushed at: almost 4 years ago - Stars: 5 - Forks: 1
cybergeekgyan/ResumeScreening
Resume Screening using Machine Learning and Python
Language: Python - Size: 576 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 1
easonlai/Samples_for_Azure_Databricks_Orientation
Samples for Azure Databricks Orientation
Language: HTML - Size: 6.78 MB - Last synced at: 8 months ago - Pushed at: about 5 years ago - Stars: 5 - Forks: 2
imdevskp/sars-2003-outbreak-data-webscraping-code
Repository contains complete WHO data on the 2003 outbreak, with the code used for web scraping, data munging, and cleaning
Language: Jupyter Notebook - Size: 56.6 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 2
NhanAZ/DataCleaner
Clean up unnecessary data inside plugin_data folder
Language: PHP - Size: 80.1 KB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 3
project1A1B/Big-Mart-Sales-Prediction
Building Big Mart Sales Prediction model
Language: Jupyter Notebook - Size: 1.68 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 4 - Forks: 0
d3mastermind/Google_Play-and-App-Store-Analysis
My Analysis on Profitable App Profiles for the App Store and Google Play Markets
Language: Jupyter Notebook - Size: 718 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 4 - Forks: 0
RawatMeghna/KPMG-Data-Analytics-Virtual-Internship
In this online program, I completed tasks similar to those that KPMG graduates do in the company. I learned what it is like to work in one of the world’s best data analytics teams, and built the skills required to excel as an analytics consultant.
Size: 9.87 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 1
aysbt/DataScienceCourse
The course material from multiple sources
Language: Jupyter Notebook - Size: 3.47 MB - Last synced at: almost 3 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 5
ManojKumarMaruthi/Regression
Metro Interstate Traffic Volume
Language: Jupyter Notebook - Size: 1.36 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 7
Nozomi-Takemura/24-hour-McKinsey-Analytics-Online-Hackathon-Healthcare-Analytic
Language: Jupyter Notebook - Size: 884 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 1
Ricco1010/ricco
A personal toolset built over time by Ricco
Language: Python - Size: 5.38 MB - Last synced at: 10 days ago - Pushed at: 6 months ago - Stars: 3 - Forks: 1
karlyndiary/IMDb-Data-Analysis
Data Analysis on the IMDb Dataset using Python & Power BI.
Language: Jupyter Notebook - Size: 13.6 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0
cybergeekgyan/Data-Science-Portfolio
Resources and Portfolio of my Data Science projects for academics, and self learning journey
Language: Jupyter Notebook - Size: 262 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0
jomuzhi/ukhls_cleanpool
using STATA to clean and pool UKHLS data (1991-Now)
Language: Stata - Size: 28.3 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0
root-yash/Hotstar-Disney-Plus-Scraper
Hotstar Disney + movie and tv show web Scraper made using pyppeteer, request and beautiful soup.
Language: Python - Size: 2.04 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0
siddh34/DSML-Project
Regression
Language: Jupyter Notebook - Size: 6.58 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0
mennamamdouh/Analysis-of-Video-Games
This repository is for a data analytics project using SQL. The project analyzes video game sales and user and critic reviews to extract insights.
Language: SQL - Size: 1020 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0
cc59chong/Cleaning-Data-with-PySpark
Language: Jupyter Notebook - Size: 6.48 MB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 2
mchenryspagg/Google-Play-Store-Apps-Analysis-Visualization
An analysis and visualization of scraped Google Play Store app data for the period 2010-2018. This project cleans and analyzes the dataset to extract useful insights, and visualizes the data to make trends and categories easier to understand.
Language: Jupyter Notebook - Size: 5.51 MB - Last synced at: 7 months ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0
maqboolkazmii/Google_Data_Analyst_Projects
This repository contains the capstone projects and tasks I completed during the Google Data Analytics course on Coursera.
Language: TSQL - Size: 13.8 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0
divithraju/divith-raju-Immigration-Data-Engineering
A Capstone Project that covers several aspects of Data Engineering (Data Exploration, Cleaning, Modeling, Pipelining, Processing)
Language: Jupyter Notebook - Size: 2.5 MB - Last synced at: 7 months ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 0
Rose-njeru/Data-Cleaning-in-SQL
Size: 2.43 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 0
vijishmadhavan/PARSE-CLIP
A simple CLIP based project for combining images from multiple datasets.
Language: Jupyter Notebook - Size: 4.12 MB - Last synced at: 3 months ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 0
skupriienko/PDF-and-URL-parser
python module for parsing PDF and scraping URLs
Language: Python - Size: 6.12 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0
ShrishtiHore/Quantium_Virtual_Experience_Program
This repository is a collection of all the solutions of tasks that were assigned to me during my Data Analytics Virtual Internship Experience Program at Quantium. 💻📚📊
Language: Jupyter Notebook - Size: 18.6 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 13
YuehHanChen/Video_Game_Sales_Analysis
Analyze sales data from more than 16,500 games.
Language: HTML - Size: 1.3 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 1
PasinduAnthony/API-Download
Downloads the data, prints it to the console, cleans it, and saves it as a CSV file
Language: Java - Size: 125 KB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 3 - Forks: 0
RamEppala/imbalanceddatasetproject
Machine Learning Project on Imbalanced Data in R
Language: R - Size: 5.69 MB - Last synced at: 7 months ago - Pushed at: almost 8 years ago - Stars: 3 - Forks: 3
kaushalshetty/Preprocessing
Text Preprocessing
Language: Python - Size: 1.95 KB - Last synced at: almost 3 years ago - Pushed at: about 8 years ago - Stars: 3 - Forks: 0
JohnTocci/Nullaxe
Nullaxe is a powerful and user-friendly Python library designed for cleaning and preprocessing data. It works seamlessly with both pandas and polars DataFrames, making it a versatile tool for data scientists and developers.
Language: Python - Size: 201 KB - Last synced at: 10 days ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0
AnjaliRai24/Loksabha-Election-2024-Analysis-Through-Power-BI
This repository hosts interactive dashboards and detailed data visualizations that provide insights into the 2024 Indian parliamentary elections. Utilizing Power BI, we've analyzed voter demographics, electoral results, constituency-wise trends, and more, offering a comprehensive view of the election dynamics.
Size: 194 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0
jahnavigupta06/SQL-SERVER-CREDIT-CARD-ANALYSIS
🔍 Credit Card Transactions Analysis Project explores a real-world dataset of 26,000+ credit card transactions from 2013–2015 using SQL Server. It focuses on customer spending behavior, card type usage, and expense patterns. Includes performance-optimized queries using CTEs, execution plans, indexing, and advanced SQL logic to extract meaningful
Size: 1.07 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0
Arzan101/EV--Car-Data-Analysis
This Power BI dashboard provides an interactive and data-driven overview of the electric vehicle (EV) landscape. It visualizes key insights across various dimensions including sales trends, model performance, manufacturer comparisons, and market growth. The purpose of the dashboard is to enable stakeholders to explore and analyze development
Size: 360 KB - Last synced at: 6 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0
2KRISHNAYADAV/DATSH.AI
Size: 7.81 KB - Last synced at: 7 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0
vishrut-b/clustering-analysis-of-online-retail-data
This project leverages machine learning techniques to analyze online retail data through customer segmentation. It uses KMeans clustering to identify key customer groups and proposes tailored business strategies based on their purchasing behaviors.
Language: Jupyter Notebook - Size: 1.92 MB - Last synced at: 8 months ago - Pushed at: 12 months ago - Stars: 2 - Forks: 0
pavankethavath/Car_dekho_car_price_prediction
A Streamlit web app utilizing Python, scikit-learn, and pandas for used car price prediction. Features data preprocessing (scaling, encoding), Random Forest model optimization with GridSearchCV, and interactive user input handling. Achieves high accuracy (R² score: 0.9028), showcasing skills in machine learning, data engineering, and deployment.
Language: Python - Size: 182 MB - Last synced at: 7 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0
girish119628/girish119628
Data Enthusiast | Predictive Modeler | Turning Insights into Strategies
Size: 8.79 KB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0
ngambip/Top-uk-Youtubers-2024.githu.io
This project involves a comprehensive analysis to determine the top YouTubers in the UK for 2024, using Excel, SQL, and Power BI.
Size: 2.38 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0
ngambip/Diabetes_factors_2024
Exploring BMI Categories and Health Factors.
Size: 6.39 MB - Last synced at: 10 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0
FaezehAbedi2023/Optimizing-Credit-Card-Fraud-Detection-in-Banking-with-Ensemble-Learning-Techniques
This research advances credit card fraud detection by integrating machine learning and deep learning techniques. Key findings include improved model adaptability through hyperparameter tuning.
Language: Jupyter Notebook - Size: 62.7 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0
IbadDE/SQL_projects
Language: Jupyter Notebook - Size: 913 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0
saahen-sriyan-mishra/PyAnalytics-Python_Data_Analysis
Here I conducted EDA on diverse datasets, including movies, sales, and gaming data. I performed data cleaning, visualization, and interpretation using libraries like pandas, NumPy, Matplotlib, and Seaborn to extract actionable insights for informed decision-making.
Language: Jupyter Notebook - Size: 149 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0
ArtemyiMelehin/DataCleaningProject
Preprocessing ML by cleaning data
Language: Jupyter Notebook - Size: 6.6 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0
saahen-sriyan-mishra/InsightStream-Power_BI_Visualizations
Using Power BI to visualize data across different domains, showing insights, trends, and forecasts from sources such as Excel, CSV, and SQL.
Size: 7.18 MB - Last synced at: 8 months ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0
yogeshkasar778/PWC_task_2-Customer_Churn_Retension_dashboard
Size: 1.25 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 13
prijall/FeatureEngineering
This repository will contain all my work related to feature engineering
Language: Jupyter Notebook - Size: 3.84 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0
80396-B2/Credit_Score_Prediction
Given a person’s credit-related information, I am building a Machine/Deep learning model that can classify the credit score.
Language: Jupyter Notebook - Size: 30 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0
YamanAlBochi/GoogleAppsAnalysis
Analyzing various apps found on the Google Play Store with the help of different Python libraries. The dataset is from Kaggle. It is web-scraped data on 10k Play Store apps for analyzing the Android market. It consists of a total of 10841 rows and 13 columns.
Language: Jupyter Notebook - Size: 226 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0
yamovan/datascience
This project aims to demonstrate the basic principles of data analysis, transformation, cleaning, and visualization
Language: HTML - Size: 36.7 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0
ritikaga/Superstore-Sales-Analysis-and-Time-series-Forecasting
Analyzes sales data to gain insights, creates an interactive sales dashboard, and forecasts future sales using Power BI.
Size: 2.59 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0
SurajGusain0007/FAASO_SQL_ANALYSIS_PROJECT
Rebel Foods, formerly known as Faasos, is a renowned food delivery company in India. It originated as a restaurant chain but transitioned into an online platform. Rebel Foods is known for providing affordable meals, a wide variety of cuisines, and prompt delivery services. They operate numerous brands and leverage technology to enhance the overall
Size: 647 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0
baranylcn/RuleBasedCustomerSegmentation_with_GezinomiDataset
Size: 2.82 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0
ibtassam1/Airbnb
Using Python to perform exploratory data analysis on a scraped Airbnb dataset. This multi-faceted analysis was selected by the university's Master's program director for the data portal (link in Readme.md).
Language: Jupyter Notebook - Size: 7.87 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0
RAMSON-OFON/datascience
This repo consists of projects on data wrangling, data visualization, and machine learning
Language: HTML - Size: 6.78 MB - Last synced at: almost 3 years ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0
Anubhavchandil/RESEARCH-INTERN
Worked on a dataset of high-entropy alloys used to design materials for additive manufacturing. Responsible for performing data analysis and building machine learning models, including neural networks and gradient boosting, to make predictions useful for advanced material invention.
Language: Jupyter Notebook - Size: 17.5 MB - Last synced at: almost 3 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1
behdadmansouri/Datamining_HW1_Data_Cleaning
Preprocessing of the Iris flower dataset
Language: Python - Size: 40 KB - Last synced at: almost 3 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0
sonu275981/Big-Mart-Sales-Prediction
Using Machine Learning Algorithms for Regression Analysis to predict the sales pattern and Using Data Analysis and Data Visualizations to Support it.
Language: Jupyter Notebook - Size: 339 KB - Last synced at: 5 months ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 3
binamify/66DaysOfData
The #66Days of Data is an initiative started by Ken Jee to help people develop better data science habits!
Language: Jupyter Notebook - Size: 813 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0
elliotastern/sternclean
Clean your data frame in one readable function
Language: R - Size: 27.3 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0
charleswu52/BitcoinAnalysis
Bitcoin Transaction Data Analysis System.
Language: Scala - Size: 2.1 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0
thebadcoder96/Eminem_NLP
Extracting lyrics from Genius API and conducting NLP analysis on Eminem's lyrics
Language: Jupyter Notebook - Size: 7.67 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0
esvs2202/Car-Price-prediction_-hackathon-
Language: Jupyter Notebook - Size: 1.4 MB - Last synced at: 6 months ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 1
rahul3687/Exploratory-data-analysis-python
Exploratory data analysis. Loan amount prediction on the basis of credit score.
Language: Jupyter Notebook - Size: 4.76 MB - Last synced at: 11 months ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 2
pragatimehra21/Data_Analyst_Udacity
Udacity Data Analyst Projects
Language: HTML - Size: 2.22 MB - Last synced at: almost 3 years ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 0
imdevskp/flights-crash-data-web-scraping-cleaning
Repository contains notebooks and datasets on the number of flight departures, passengers flown, flights crashed, etc.
Language: Jupyter Notebook - Size: 3.1 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 0
imdevskp/ebola_outbreak_dataset
The repository contains data and code for scraping and cleaning data on the Ebola outbreak in 2014
Language: Jupyter Notebook - Size: 105 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 3
Kunal1198/Data-Cleaning-with-Pandas
The main aim is to clean the data with the pandas library.
Language: Jupyter Notebook - Size: 43.9 KB - Last synced at: 4 months ago - Pushed at: almost 6 years ago - Stars: 2 - Forks: 0
sbavon/Kaggle-NYC-Taxi-Fare-Prediction
My solution for Kaggle NYC Taxi Fare Prediction (ranked 21st/1463)
Language: Python - Size: 984 KB - Last synced at: 5 months ago - Pushed at: about 7 years ago - Stars: 2 - Forks: 2
Mindful-AI-Assistants/8-social-buzz-ai-Project-Social-Pulse-A-Machine-Learning-Approach-to-Academic-Performance-Modeling
🪐 8 - Social Buzz: Extension Project – Social Pulse. A machine-learning project analyzing anonymized student performance data through cleaning, exploration, feature engineering, and predictive modeling.
Language: Jupyter Notebook - Size: 23.8 MB - Last synced at: 8 days ago - Pushed at: 25 days ago - Stars: 1 - Forks: 0
MariaEgbuna/Road-Accidents
Analyzing a road accidents dataset using Python.
Language: Jupyter Notebook - Size: 17.1 MB - Last synced at: 28 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0
dikshabagul/Cloud-Cost-Monitoring-
A Power BI project to monitor and optimize cloud cost and usage across multiple environments.
Size: 459 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0