An open API service providing repository metadata for many open source software ecosystems.

Topic: "dirty-data"

sfirke/janitor

simple tools for data cleaning in R

Language: R - Size: 8.2 MB - Last synced at: 10 days ago - Pushed at: 5 months ago - Stars: 1,411 - Forks: 132

skrub-data/skrub

Machine learning with dataframes

Language: Python - Size: 12.4 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,399 - Forks: 131

dirty-data-science/python

Tutorial material on machine learning with dirty data in Python

Language: Python - Size: 27.1 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 61 - Forks: 8

raamana/missingdata

missing data handing: visualize and impute

Language: Python - Size: 1.52 MB - Last synced at: 14 days ago - Pushed at: almost 6 years ago - Stars: 18 - Forks: 1

Patrick-Frisella/Cleaning-NIH-Chest-Xray-Dataset

Cleaning the NIH chest x-ray dataset using an image classifier.

Language: Jupyter Notebook - Size: 1.78 GB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

ThisIsJohnnyLau/dirty_data_project

Dirty data project

Language: R - Size: 21.6 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 1

d3rty/json

Flexible JSON decoding for Go — gracefully handling schema variations and forgiving mistakes.

Language: Go - Size: 2.18 MB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 0 - Forks: 0

jbn/vaquero

A Python library for iterative and interactive data wrangling at laptop-scale.

Language: Python - Size: 137 KB - Last synced at: 3 days ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

jl3392/Data-Wrangling-practice

Data wrangling using python and SQL

Language: Jupyter Notebook - Size: 599 KB - Last synced at: about 1 year ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0