An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: data-preparation

MeghnathReddy/R-Sales-Prediction

Data wrangling, regression modeling and analysis.

Language: R - Size: 6.39 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

marizombie/bing-images-downloader

Simple Python app for downloading Bing images with the help of the Images Search API and Visual Search API; can be used for preparing datasets.

Language: Python - Size: 19.5 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1

ilyak93/Election-Prediction-using-ML

Language: Python - Size: 19.9 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Promeos/classification-exercises

ML Classification exercises to develop skills for Data Science while at Codeup.

Language: Jupyter Notebook - Size: 12 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

rkarwayun/MSCI-641-Final-Project

93.91% score on FNC Challenge.

Language: Jupyter Notebook - Size: 43 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

nabeel-gulzar/crop-and-resize

A script that center-crops a set of images and then resizes each cropped image to a given resolution.

Language: Python - Size: 1.95 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

stivenramireza/social-media-ml

Data science project on social media using supervised, unsupervised, and reinforcement machine learning algorithms.

Language: Jupyter Notebook - Size: 15.8 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

gaoisbest/Pytorch_notes_and_projects

PyTorch notes and projects

Language: Jupyter Notebook - Size: 101 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 0

Develop-Packt/Exploring-the-Online-Retail-Dataset

Search for and deal with missing values, outliers, and anomalies in an online retail dataset. Create new columns from existing data and design visualizations to demonstrate your findings

Language: Jupyter Notebook - Size: 14.7 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 1

Develop-Packt/Discovering-the-Building-Blocks-of-Neural-Networks-with-PyTorch

Discover the main building blocks of neural networks and understand the three main neural network architectures. Explore the process of solving a regression data problem

Language: Jupyter Notebook - Size: 22.8 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 1

chaitanyacsss/SceneClassification

Scene classification using PyTorch; dataset created from frames of YouTube videos.

Language: Python - Size: 127 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

rai-harshit/simple_image_labeler

Labeling tool for Image Classification tasks.

Language: Python - Size: 3.91 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 0

juliensiebert/data-preparation

Tutorial notebooks on data preparation

Language: Jupyter Notebook - Size: 2.99 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

dyscord-lab/SA_Analysis

Functions for data preparation for the SA protocol in the dyscord lab

Language: R - Size: 12.2 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

PierreKieffer/DataPreprocessing

Custom data preprocessing library made for machine learning

Language: Python - Size: 4.88 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

AbhishekRS4/cityscapes

Data preparation and augmentation scripts

Language: Python - Size: 4.88 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

NGG-Group-CS320/Database

Database files and info for MF-DAT

Language: Python - Size: 15.6 KB - Last synced at: over 1 year ago - Pushed at: over 8 years ago - Stars: 1 - Forks: 0

sabyasachi2505/Zeus

Class built on top of PySpark DataFrames that automates data preparation and preprocessing

Language: Jupyter Notebook - Size: 43 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

sayanm-10/Basic-Data-Cleanup

Given a CSV file, pre-process and clean the data.

Language: R - Size: 3.91 KB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

Related Keywords
data-preparation (319), python (80), machine-learning (79), data-preprocessing (73), data-science (68), data-analysis (68), data-visualization (60), data-cleaning (52), pandas (34), exploratory-data-analysis (30), deep-learning (22), feature-engineering (22), classification (19), data (19), numpy (18), data-wrangling (17), sql (16), matplotlib (16), python3 (15), data-processing (15), seaborn (14), r (14), logistic-regression (14), eda (13), scikit-learn (12), machine-learning-algorithms (11), random-forest (10), tableau (10), jupyter-notebook (9), clustering (9), tensorflow (9), linear-regression (9), regression (9), predictive-modeling (9), data-manipulation (8), data-mining (8), statistics (8), nlp (8), data-analytics (8), dataset (7), image-processing (7), neural-network (7), visualization (7), feature-selection (7), excel (7), data-cleansing (7), feature-extraction (6), data-collection (6), neural-networks (6), data-engineering (6), opencv (6), data-transformation (6), artificial-intelligence (6), statistical-analysis (6), text-processing (6), data-quality (5), datasets (5), data-exploration (5), docker (5), supervised-learning (5), pca (5), data-visualisation (5), plotly (5), preprocessing (5), time-series-analysis (5), keras (5), dashboard (5), svm-classifier (4), pytorch (4), sentiment-analysis (4), missing-values (4), decision-tree-classifier (4), model-training-and-evaluation (4), natural-language-processing (4), random-forest-classifier (4), named-entity-recognition (4), large-language-models (4), analytics (4), train-test-split (4), data-normalization (4), mysql (4), streamlit (4), pipeline (4), decission-tree (4), powerbi (4), computer-vision (4), ml (4), web-scraping (4), deep-neural-networks (4), analysis (4), sklearn (4), data-prep (4), image-classification (4), hypothesis-testing (4), data-modeling (4), data-structures (3), data-understanding (3), algorithms (3), modeling (3), datascience (3)