An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: munging

whythawk/whyqd

data wrangling simplicity, complete audit transparency, and at speed

Language: Python - Size: 14 MB - Last synced at: 10 days ago - Pushed at: about 1 month ago - Stars: 34 - Forks: 1

jceresearch/pydit

Library of data wrangling functions that an internal auditor typically needs (for my own use and learning, if you wish to use or collaborate pls get in touch, or use at your own peril).

Language: Python - Size: 2.16 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

davorg/dmp

Data Munging with Perl

Language: HTML - Size: 3.75 MB - Last synced at: 18 days ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

andy-j-block/pandas_exercises

Copy of GitHub user guipsamora's pandas_exercises repo (https://github.com/guipsamora/pandas_exercises). This repository is simply a showcase of my munging skills using pandas.

Language: Jupyter Notebook - Size: 17.5 MB - Last synced at: 12 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Adongo/HR-Employee-Attrition

Exploratory Data Analysis to uncover factors data lead to employee attrition.

Language: Jupyter Notebook - Size: 433 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

okarch/xlsutil

Language: Java - Size: 1.23 MB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

okarch/dimpsy

Language: Java - Size: 144 KB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

nrrb/Yelp-Challenge-Dataset Fork of vc1492a/Yelp-Challenge-Dataset

Munging the data from the Yelp Academic Dataset 2017.

Language: Python - Size: 7.94 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 6 - Forks: 1

bmgrayb/reddit_data_viz

Repository to hold code associated with the Reddit r/DataIsBeautiful Monthly DataViz Battles.

Language: R - Size: 280 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

NoetherEmmy/intransigentms-tools

Tools (mostly scripts) auxiliary to the IntransigentMS server, usable for MS private servers generally

Language: Python - Size: 61.5 KB - Last synced at: over 1 year ago - Pushed at: about 8 years ago - Stars: 2 - Forks: 2

mackenziedg/collegecostpredictor

Provides insights to potential college students on their long-term prospects based off of their choice of school.

Language: HTML - Size: 9.54 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

RMHogervorst/unicorns_on_unicycles

A dataset of 'historical' data, useful for munging/ cleaning practice

Language: R - Size: 71.3 KB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 9 - Forks: 9

QGoGithub/Big-Data-Implementations---Quant_Research

Big Data Implementations - Quantitative_Research

Language: R - Size: 14.6 KB - Last synced at: 9 months ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 0

4DtyU/R-scripting-samples_AnnaBird

Example code supporting immunology research

Language: HTML - Size: 11 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

bnarath/python-challenge

Python scripting challenge

Language: Python - Size: 22.4 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

EmmaMuhleman1/emmamuhlemantest1.github.io

Language: HTML - Size: 2.93 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

BriceWolfgang/DupeLines

Draws lines between features that have a duplicate field value

Language: Python - Size: 7.81 KB - Last synced at: about 1 year ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0