An open API service providing repository metadata for many open source software ecosystems.

Topic: "datacleaning"

OpenRefine/OpenRefine

OpenRefine is a free, open source power tool for working with messy data and improving it

Language: Java - Size: 388 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 11,657 - Forks: 2,108

great-expectations/great_expectations

Always know what to expect from your data.

Language: Python - Size: 230 MB - Last synced at: 4 days ago - Pushed at: 6 days ago - Stars: 11,013 - Forks: 1,653

sfu-db/dataprep

Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.

Language: Python - Size: 214 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 2,203 - Forks: 219

yobulkdev/yobulkdev

🔥 🔥 🔥Open Source & AI driven Data Onboarding Platform:Free flatfile.com alternative

Language: JavaScript - Size: 973 KB - Last synced at: 8 months ago - Pushed at: over 2 years ago - Stars: 895 - Forks: 45

DataCanvasIO/HyperGBM

A full pipeline AutoML tool for tabular data

Language: Python - Size: 11 MB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 355 - Forks: 47

sharmaroshan/Twitter-Sentiment-Analysis

It is a Natural Language Processing Problem where Sentiment Analysis is done by Classifying the Positive tweets from negative tweets by machine learning models for classification, text mining, text analysis, data analysis and data visualization

Language: Jupyter Notebook - Size: 2.77 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 168 - Forks: 114

DataKitchen/data-observability-installer

Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team is the first to know and the first to solve with visibility across and down your data estate. Save time with simple, fast data quality test generation and execution. Trust your data, tools, and systems end to end.

Language: Python - Size: 237 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 128 - Forks: 12

imdevskp/covid_19_jhu_data_web_scrap_and_cleaning

This repository contains data and code used to get and clean data from https://github.com/CSSEGISandData/COVID-19 and https://www.worldometers.info/coronavirus/

Language: Jupyter Notebook - Size: 123 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 91 - Forks: 94

prasanthg3/cleantext

An open-source package for python to clean raw text data

Language: Python - Size: 27.3 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 72 - Forks: 11

benchopt/benchmark_bilevel

Benchmark for bi-level optimization solvers

Language: Python - Size: 382 KB - Last synced at: 5 months ago - Pushed at: 7 months ago - Stars: 48 - Forks: 10

imdevskp/covid-19-india-data

data and code for scrapping and cleaning data on covid-19 in India from https://www.mohfw.gov.in/ and https://www.covid19india.org/

Language: Jupyter Notebook - Size: 21.1 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 38 - Forks: 80

data-cleaning/validatedb

Validate on a table in a DB, using dbplyr

Language: R - Size: 623 KB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 33 - Forks: 3

RonKG/Machine-Learning-Projects-2

Language: HTML - Size: 12.1 MB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 24 - Forks: 18

DemonDamon/tongdaxin-futures-data-clearing-database-operation

对通达信数据进行去重和清洗处理,并将数据存入MongoDB,方便往后研究

Language: Python - Size: 854 KB - Last synced at: almost 3 years ago - Pushed at: over 7 years ago - Stars: 19 - Forks: 15

nirala96/Bangalore-House-Prediction-App

Predicts home prices of Bangalore. Used Flutter, Flask and Jupyter Notebook.

Language: Jupyter Notebook - Size: 423 KB - Last synced at: 7 months ago - Pushed at: over 4 years ago - Stars: 18 - Forks: 0

mne-tools/mne-denoise

mne-denoise provides narrow-band artefact removal tailored to MNE-Python workflows. It wraps harmonic regression techniques to suppress power-line noise and other oscillatory contaminants while preserving signal rank and interpretability.

Language: Python - Size: 27.2 MB - Last synced at: 7 days ago - Pushed at: 2 months ago - Stars: 17 - Forks: 4

theodi/OpenRefine-WS

Code to enable OpenRefine to run as an authenticated web service

Language: JavaScript - Size: 272 KB - Last synced at: about 1 year ago - Pushed at: over 11 years ago - Stars: 16 - Forks: 2

ahmadjaved97/ImageClusterViz

A tool for clustering images using deep learning features and visualizing the results in organized grids.

Language: Python - Size: 22.2 MB - Last synced at: 16 days ago - Pushed at: 19 days ago - Stars: 15 - Forks: 0

sayaliwalke30/Kaggle-Projects

This repo contains 4 different projects. Built various machine learning models for Kaggle competitions. Also carried out Exploratory Data Analysis, Data Cleaning, Data Visualization, Data Munging, Feature Selection etc

Language: Jupyter Notebook - Size: 4.03 MB - Last synced at: almost 3 years ago - Pushed at: over 4 years ago - Stars: 14 - Forks: 11

EastTower16/LLMDataDistill

distill large scale web page text

Language: C++ - Size: 1.5 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 12 - Forks: 1

ropensci/excluder

Checks for Exclusion Criteria in Online Data

Language: R - Size: 947 KB - Last synced at: 7 days ago - Pushed at: 6 months ago - Stars: 9 - Forks: 5

ironmussa/Optimus-examples 📦

Examples for Optimus a Data Cleansing Library for Big Data.

Size: 925 KB - Last synced at: almost 2 years ago - Pushed at: about 8 years ago - Stars: 9 - Forks: 4

ShrishtiHore/Weapons-Detection-in-Real-Time-Surveillance-Videos

This project aims to minimize the police response time by detecting weapons through a live CCTV camera feed. So it alerts the police as soon as it detects any sort of weapons. In our project we are focusing on guns primarily. 🔫💣💻🎥

Language: Jupyter Notebook - Size: 43.1 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 7 - Forks: 1

Ronlee12355/kaggle-with-R

All kaggle datasets and the R codes

Language: HTML - Size: 59.8 MB - Last synced at: 5 months ago - Pushed at: about 5 years ago - Stars: 7 - Forks: 6

allenlsj/Spark-lean Fork of qltf8/1004_LPL_project

Spark-lean, an interactive PySpark-based Data Cleaning Library

Language: Python - Size: 1.97 MB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 7 - Forks: 0

rojaAchary/Data_Preprocessing_Techniques

⚒️ Data preprocessing is the process of transforming raw data into an understandable format. It is also an important step in data mining as we cannot work with raw data. The quality of the data should be checked before applying machine learning or data mining algorithms

Language: Jupyter Notebook - Size: 88.9 KB - Last synced at: almost 3 years ago - Pushed at: about 4 years ago - Stars: 6 - Forks: 2

kkverma/Twitter-Sentiment-Analysis

A basic machine learning model built in python jupyter notebook to classify whether a set of tweets into two categories: racist/sexist non-racist/sexist.

Language: Jupyter Notebook - Size: 6.34 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 6 - Forks: 2

Nelson-Gon/mde

mde: Missing Data Explorer

Language: R - Size: 1.37 MB - Last synced at: 16 days ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 4

Salaah01/pandas-data-cleaner

A package to aid with data cleaning using pandas.

Language: Python - Size: 27.3 KB - Last synced at: 5 months ago - Pushed at: almost 4 years ago - Stars: 5 - Forks: 1

cybergeekgyan/ResumeScreening

Resume Screening using Machine Learning and Python

Language: Python - Size: 576 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 1

easonlai/Samples_for_Azure_Databricks_Orientation

Samples for Azure Databricks Orientation

Language: HTML - Size: 6.78 MB - Last synced at: 8 months ago - Pushed at: about 5 years ago - Stars: 5 - Forks: 2

imdevskp/sars-2003-outbreak-data-webscraping-code

repository contains complete WHO data of 2003 outbreak with code used to web scrap, data mung and cleaning

Language: Jupyter Notebook - Size: 56.6 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 2

NhanAZ/DataCleaner

Clean up unnecessary data inside plugin_data folder

Language: PHP - Size: 80.1 KB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 3

project1A1B/Big-Mart-Sales-Prediction

Building Big Mart Sales Prediction model

Language: Jupyter Notebook - Size: 1.68 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 4 - Forks: 0

d3mastermind/Google_Play-and-App-Store-Analysis

My Analysis on Profitable App Profiles for the App Store and Google Play Markets

Language: Jupyter Notebook - Size: 718 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 4 - Forks: 0

RawatMeghna/KPMG-Data-Analytics-Virtual-Internship

In this online program, I completed similar tasks that KPMG Graduates do in the company. I learned what it is like working at one of the world’s best data analytics team, and built skills required to excel as a analytics consultant.

Size: 9.87 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 1

aysbt/DataScienceCourse

The course material from multiple sources

Language: Jupyter Notebook - Size: 3.47 MB - Last synced at: almost 3 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 5

ManojKumarMaruthi/Regression

Metro Interstate Traffic Volume

Language: Jupyter Notebook - Size: 1.36 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 7

Nozomi-Takemura/24-hour-McKinsey-Analytics-Online-Hackathon-Healthcare-Analytic

Language: Jupyter Notebook - Size: 884 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 1

Ricco1010/ricco

A personal toolset built over time by Ricco

Language: Python - Size: 5.38 MB - Last synced at: 10 days ago - Pushed at: 6 months ago - Stars: 3 - Forks: 1

karlyndiary/IMDb-Data-Analysis

Data Analysis on the IMDb Dataset using Python & Power BI.

Language: Jupyter Notebook - Size: 13.6 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

cybergeekgyan/Data-Science-Portfolio

Resources and Portfolio of my Data Science projects for academics, and self learning journey

Language: Jupyter Notebook - Size: 262 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

jomuzhi/ukhls_cleanpool

using STATA to clean and pool UKHLS data (1991-Now)

Language: Stata - Size: 28.3 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

root-yash/Hotstar-Disney-Plus-Scraper

Hotstar Disney + movie and tv show web Scraper made using pyppeteer, request and beautiful soup.

Language: Python - Size: 2.04 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

siddh34/DSML-Project

Regression

Language: Jupyter Notebook - Size: 6.58 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

mennamamdouh/Analysis-of-Video-Games

This repository is for a data analytics project using SQL. The project is about analyzing and getting insights about video games sales, and users and critics reviews.

Language: SQL - Size: 1020 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

cc59chong/Cleaning-Data-with-PySpark

Language: Jupyter Notebook - Size: 6.48 MB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 2

mchenryspagg/Google-Play-Store-Apps-Analysis-Visualization

An analysis and visualization of google play store apps scraped data for the period of 2010 - 2018 . This project aims at cleaning the dataset, analyzing the given dataset, and mining informational quality insights. This project also involves visualizing the data to better and easily understand trends and different categories.

Language: Jupyter Notebook - Size: 5.51 MB - Last synced at: 7 months ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0

maqboolkazmii/Google_Data_Analyst_Projects

This repositery have Capston projects and Task which i have done During Goolge Data Analytic Course on Coursera.

Language: TSQL - Size: 13.8 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0

divithraju/divith-raju-Immigration-Data-Engineering

A Capstone Project that covers several aspects of Data Engineering (Data Exploration, Cleaning, Modeling, Pipelining, Processing)

Language: Jupyter Notebook - Size: 2.5 MB - Last synced at: 7 months ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 0

Rose-njeru/Data-Cleaning-in-SQL

Size: 2.43 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 0

vijishmadhavan/PARSE-CLIP

A simple CLIP based project for combining images from multiple datasets.

Language: Jupyter Notebook - Size: 4.12 MB - Last synced at: 3 months ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 0

skupriienko/PDF-and-URL-parser

python module for parsing PDF and scraping URLs

Language: Python - Size: 6.12 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

ShrishtiHore/Quantium_Virtual_Experience_Program

This repository is a collection of all the solutions of tasks that were assigned to me during my Data Analytics Virtual Internship Experience Program at Quantium. 💻📚📊

Language: Jupyter Notebook - Size: 18.6 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 13

YuehHanChen/Video_Game_Sales_Analysis

Analyze sales data from more than 16,500 games.

Language: HTML - Size: 1.3 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 1

PasinduAnthony/API-Download

It Downloads, outputs in the console, Cleans the data and saves it as a csv file

Language: Java - Size: 125 KB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 3 - Forks: 0

RamEppala/imbalanceddatasetproject

Machine Learning Project on Imbalanced Data in R

Language: R - Size: 5.69 MB - Last synced at: 7 months ago - Pushed at: almost 8 years ago - Stars: 3 - Forks: 3

kaushalshetty/Preprocessing

Text Preprocessing

Language: Python - Size: 1.95 KB - Last synced at: almost 3 years ago - Pushed at: about 8 years ago - Stars: 3 - Forks: 0

JohnTocci/Nullaxe

Nullaxe is a powerful and user-friendly Python library designed for cleaning and preprocessing data. It works seamlessly with both pandas and polars DataFrames, making it a versatile tool for data scientists and developers.

Language: Python - Size: 201 KB - Last synced at: 10 days ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

AnjaliRai24/Loksabha-Election-2024-Analysis-Through-Power-BI

This repository hosts interactive dashboards and detailed data visualizations that provide insights into the 2024 Indian parliamentary elections. Utilizing Power BI, we've analyzed voter demographics, electoral results, constituency-wise trends, and more, offering a comprehensive view of the election dynamics.

Size: 194 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

jahnavigupta06/SQL-SERVER-CREDIT-CARD-ANALYSIS

🔍 Credit Card Transactions Analysis Project explores a real-world dataset of 26,000+ credit card transactions from 2013–2015 using SQL Server. It focuses on customer spending behavior, card type usage, and expense patterns. Includes performance-optimized queries using CTEs, execution plans, indexing and advance SQL logic to extract meaningful

Size: 1.07 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

Arzan101/EV--Car-Data-Analysis

This Power BI dashboard provides an interactive and data-driven overview of the electric vehicle (EV) landscape. It visualizes key insights across various dimensions including sales trends, model performance, manufacturer comparisons, and market growth. The purpose of the dashboard is to enable stakeholders to explore and analyze development

Size: 360 KB - Last synced at: 6 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

2KRISHNAYADAV/DATSH.AI

Size: 7.81 KB - Last synced at: 7 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

vishrut-b/clustering-analysis-of-online-retail-data

This project leverages machine learning techniques to analyze online retail data through customer segmentation. It uses KMeans clustering to identify key customer groups and proposes tailored business strategies based on their purchasing behaviors.

Language: Jupyter Notebook - Size: 1.92 MB - Last synced at: 8 months ago - Pushed at: 12 months ago - Stars: 2 - Forks: 0

pavankethavath/Car_dekho_car_price_prediction

A Streamlit web app utilizing Python, scikit-learn, and pandas for used car price prediction. Features data preprocessing (scaling, encoding), Random Forest model optimization with GridSearchCV, and interactive user input handling. Achieves high accuracy (R² score: 0.9028), showcasing skills in machine learning, data engineering, and deployment.

Language: Python - Size: 182 MB - Last synced at: 7 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

girish119628/girish119628

Data Enthusiast | Predictive Modeler | Turning Insights into Strategies

Size: 8.79 KB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

ngambip/Top-uk-Youtubers-2024.githu.io

This project involves a comprehensive analysis to determine the top YouTubers in the UK for 2024, Using Excel, SQL and Power BI.

Size: 2.38 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

ngambip/Diabetes_factors_2024

Exploring BMI Categories and Health Factors.

Size: 6.39 MB - Last synced at: 10 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

FaezehAbedi2023/Optimizing-Credit-Card-Fraud-Detection-in-Banking-with-Ensemble-Learning-Techniques

This research advances credit card fraud detection by integrating machine learning and deep learning techniques. Key findings include improved model adaptability through hyperparameter tuning.

Language: Jupyter Notebook - Size: 62.7 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

IbadDE/SQL_projects

Language: Jupyter Notebook - Size: 913 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

saahen-sriyan-mishra/PyAnalytics-Python_Data_Analysis

Here I conducted EDA on a diverse datasets, including movies, sales, and gaming data. Did data cleaning, visualization, and interpretation using libraries like pandas, NumPy, Matplotlib, and Seaborn to extract actionable insights for informed decision-making processes.

Language: Jupyter Notebook - Size: 149 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

ArtemyiMelehin/DataCleaningProject

Preprocessing ML by cleaning data

Language: Jupyter Notebook - Size: 6.6 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

saahen-sriyan-mishra/InsightStream-Power_BI_Visualizations

Using POWER BI to perform data visualization in different domains to show insights, trends and forecasts by using different datasets like excel, csv and SQL.

Size: 7.18 MB - Last synced at: 8 months ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

yogeshkasar778/PWC_task_2-Customer_Churn_Retension_dashboard

Size: 1.25 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 13

prijall/FeatureEngineering

This repository will contain all my work related to feature engineering

Language: Jupyter Notebook - Size: 3.84 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

80396-B2/Credit_Score_Prediction

Given a person’s credit-related information, I am building a Machine/Deep learning model that can classify the credit score.

Language: Jupyter Notebook - Size: 30 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

YamanAlBochi/GoogleAppsAnalysis

Analyzing various apps found on the Google Play Store with the help of different python libraries. The dataset is chosen from Kaggle. It is the web scraped data of 10k Play Store apps for analyzing the Android market. It consists of in total of 10841 rows and 13 columns.

Language: Jupyter Notebook - Size: 226 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

yamovan/datascience

Данный проект направлен на демонстрацию основных принципов анализа, преобразования, очистки и визуализации данных

Language: HTML - Size: 36.7 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

ritikaga/Superstore-Sales-Analysis-and-Time-series-Forecasting

Analysis Sales data to gain insights and create Interactive Sales Dashboard and also predict /Forecast the next sales with the use of Power-Bi.

Size: 2.59 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

SurajGusain0007/FAASO_SQL_ANALYSIS_PROJECT

Rebel Foods, formerly known as Faasos, is a renowned food delivery company in India. It originated as a restaurant chain but transitioned into an online platform. Rebel Foods is known for providing affordable meals, a wide variety of cuisines, and prompt delivery services. They operate numerous brands and leverage technology to enhance the overall

Size: 647 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

baranylcn/RuleBasedCustomerSegmentation_with_GezinomiDataset

Size: 2.82 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

ibtassam1/Airbnb

Using Python to perform exploratory data analysis on scraped Airbnb dataset. This multi-faceted analysis was selected by University Master program director for the data portal (link in Readme.md).

Language: Jupyter Notebook - Size: 7.87 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

RAMSON-OFON/datascience

This repo consist of projects on: Data Wrangling, Data Visualization and Machine Learning

Language: HTML - Size: 6.78 MB - Last synced at: almost 3 years ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0

Anubhavchandil/RESEARCH-INTERN

Worked on a dataset of high entropy alloys which is used to design materials for additive manufacturing. Being responsible for Performing Data Analysis and constructing Machine learning algorithms, including neural networks, Gradient boosting for carrying predictions useful for advanced material invention.

Language: Jupyter Notebook - Size: 17.5 MB - Last synced at: almost 3 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

behdadmansouri/Datamining_HW1_Data_Cleaning

preprocessing on a flower iris dataset

Language: Python - Size: 40 KB - Last synced at: almost 3 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

sonu275981/Big-Mart-Sales-Prediction

Using Machine Learning Algorithms for Regression Analysis to predict the sales pattern and Using Data Analysis and Data Visualizations to Support it.

Language: Jupyter Notebook - Size: 339 KB - Last synced at: 5 months ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 3

binamify/66DaysOfData

The #66Days of Data is a initiative started by Ken Jee started to help people develop better data science habits!

Language: Jupyter Notebook - Size: 813 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

elliotastern/sternclean

Clean your data frame in one readable function

Language: R - Size: 27.3 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

charleswu52/BitcoinAnalysis

Bitcoin Transaction Data Analysis System.

Language: Scala - Size: 2.1 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

thebadcoder96/Eminem_NLP

Extracting lyrics from Genius API and conducting NLP analysis on Eminem's lyrics

Language: Jupyter Notebook - Size: 7.67 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

esvs2202/Car-Price-prediction_-hackathon-

Language: Jupyter Notebook - Size: 1.4 MB - Last synced at: 6 months ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 1

rahul3687/Exploratory-data-analysis-python

Exploratory data analysis. loan amount prediction on the basis of credit score.

Language: Jupyter Notebook - Size: 4.76 MB - Last synced at: 11 months ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 2

pragatimehra21/Data_Analyst_Udacity

Udacity Data Analyst Projects

Language: HTML - Size: 2.22 MB - Last synced at: almost 3 years ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 0

imdevskp/flights-crash-data-web-scraping-cleaning

Repository contains notebooks and datasets on no. of flights departures, passengers flew, flights crashed etc.

Language: Jupyter Notebook - Size: 3.1 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 0

imdevskp/ebola_outbreak_dataset

The repository contains data and code for scrapping and cleaning data on Ebola outbreak in 2014

Language: Jupyter Notebook - Size: 105 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 3

Kunal1198/Data-Cleaning-with-Pandas

The main aim is to clean the data with pandas library.

Language: Jupyter Notebook - Size: 43.9 KB - Last synced at: 4 months ago - Pushed at: almost 6 years ago - Stars: 2 - Forks: 0

sbavon/Kaggle-NYC-Taxi-Fare-Prediction

My solution for Kaggle NYC Taxi Fare Prediction ( ranked 21st/1463)

Language: Python - Size: 984 KB - Last synced at: 5 months ago - Pushed at: about 7 years ago - Stars: 2 - Forks: 2

Mindful-AI-Assistants/8-social-buzz-ai-Project-Social-Pulse-A-Machine-Learning-Approach-to-Academic-Performance-Modeling

🪐 8- Social Buss: Extension Project – Social Pulse
A machine-learning project analyzing anonymized student performance data through cleaning, exploration, feature engineering, and predictive modeling.

Language: Jupyter Notebook - Size: 23.8 MB - Last synced at: 8 days ago - Pushed at: 25 days ago - Stars: 1 - Forks: 0

MariaEgbuna/Road-Accidents

Analyzing a road accidents dataset using Python.

Language: Jupyter Notebook - Size: 17.1 MB - Last synced at: 28 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

dikshabagul/Cloud-Cost-Monitoring-

A Power BI project to monitor and optimize cloud cost and usage across multiple environments.

Size: 459 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

Related Topics
python 267 data-visualization 226 data 191 dataanalysis 176 powerbi 170 data-science 160 datavisualization 155 pandas 150 sql 149 data-analysis 149 machine-learning 126 excel 114 eda 97 exploratory-data-analysis 86 numpy 84 jupyter-notebook 79 matplotlib 72 dashboard 71 visualization 68 seaborn 67 datapreprocessing 52 dataanalytics 52 python3 51 datamodeling 48 mysql 46 powerquery 45 tableau 43 feature-engineering 41 datascience 40 analysis 36 datatransformation 36 webscraping 35 dataexploration 32 database 31 machine-learning-algorithms 30 dataset 29 postgresql 28 etl 28 pivot-tables 27 dax 26 datawrangling 26 r 25 datamanipulation 24 pandas-dataframe 24 insights 24 linear-regression 24 matplotlib-pyplot 21 analytics 20 regression 20 streamlit 20 dataprocessing 19 nlp 18 logistic-regression 18 preprocessing 18 statistics 17 seaborn-plots 16 dataengineering 16 scikit-learn 16 predictive-modeling 15 dax-query 15 random-forest 15 powerbidashboard 15 kmeans-clustering 15 datapreparation 15 etl-pipeline 14 dashboards 14 plotly 14 clustering 14 powerbi-visuals 14 dataanalyst 13 deep-learning 13 machinelearning 13 dax-expression 13 business-intelligence 12 msexcel 12 powerbi-report 12 classification 12 sklearn 12 charts 11 mysql-database 11 datanalysis 11 pandas-python 11 kaggle 10 numpy-library 10 modelevaluation 10 prediction 10 statistical-analysis 10 hypothesis-testing 10 data-mining 9 tableau-dashboards 9 presentation 9 datacleansing 9 xgboost 9 random-forest-classifier 9 supervised-learning 9 dataextraction 9 datavisualization-project 9 artificial-intelligence 8 nlp-machine-learning 8 streamlit-webapp 8