An open API service providing repository metadata for many open source software ecosystems.

GitHub / datasets 114 Repositories

Important, commonly-used datasets in high quality, easy-to-use & open form as data packages

datasets/media-types

List of MIME types, subtypes, and file name extensions.

Language: Python - Size: 174 KB - Last synced at: about 20 hours ago - Pushed at: about 21 hours ago - Stars: 33 - Forks: 12

datasets/nyse-other-listings

Data package for NYSE listings

Language: Python - Size: 1.58 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 30 - Forks: 19

datasets/finance-vix

CBOE Volatility Index (VIX) time-series dataset including daily open, close, high and low.

Language: Makefile - Size: 467 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 75 - Forks: 36

datasets/commodity-prices

Monthly Prices of 53 commodities and 10 indexes from 1980 to 2016.

Language: Python - Size: 926 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 27 - Forks: 10

datasets/football-datasets

Major Europe leagues data (England, Spain, Italy, Germany and France)

Language: Python - Size: 1.4 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 68 - Forks: 26

datasets/emojis

Unicode Emoji as UTS #51 specification

Language: Python - Size: 463 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 32 - Forks: 18

datasets/oil-prices

Brent crude and WTI oil prices from US EIA

Language: Python - Size: 1.19 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 102 - Forks: 62

datasets/geoip2-ipv4

GeoIP2 - free IP geolocation database.

Language: Python - Size: 24.4 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 65 - Forks: 20

datasets/ppp

Purchasing power parity (PPP)

Language: Python - Size: 690 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 42 - Forks: 15

datasets/investor-flow-of-funds-us

Monthly net new cash flow into various mutual fund investment classes (equities, bonds etc).

Language: Python - Size: 133 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 33 - Forks: 21

datasets/house-prices-us

US House Price Indices (Case-Shiller)

Language: Python - Size: 1.17 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 25 - Forks: 22

datasets/top-level-domain-names

The delegation details of top-level domains

Language: Python - Size: 58.6 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 40 - Forks: 42

datasets/bond-yields-us-10y

10 year nominal yields on US government bonds from the Federal Reserve

Language: Python - Size: 69.3 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 18 - Forks: 10

datasets/natural-gas

Natural Gas Prices including Henry Hub

Language: Python - Size: 280 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 27 - Forks: 16

datasets/cpi-gb

Consumer Price Index (and hence inflation) for the UK from 1850 to the present (monthly since June 1947).

Language: Python - Size: 39.1 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 9 - Forks: 7

datasets/euribor

Euribor rates by year and granularity.

Language: Python - Size: 287 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 11 - Forks: 13

datasets/house-prices-uk

UK house prices dataset

Language: Python - Size: 393 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 14 - Forks: 46

datasets/gini-index

Repository of the GINI index official repository.

Language: Python - Size: 142 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 16 - Forks: 9

datasets/cpi

Annual consumer price index datapackage for most countries in the world

Language: Python - Size: 384 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 15 - Forks: 10

datasets/world-cities

List of major cities of the world as a datapackage

Language: Python - Size: 819 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 254 - Forks: 202

datasets/nasdaq-listings

Data package for Nasdaq listings

Language: Python - Size: 775 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 63 - Forks: 56

datasets/land-matrix

land-matrix

Language: Python - Size: 1.92 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 7 - Forks: 4

datasets/gold-prices

Gold prices data package

Language: Python - Size: 1.04 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 51 - Forks: 42

datasets/co2-ppm

CO2 PPM - Trends in Atmospheric Carbon Dioxide

Language: Shell - Size: 207 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 21 - Forks: 15

datasets/membership-to-copyright-treaties

Membership to Copyright Treaties

Language: Python - Size: 78.1 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 3 - Forks: 6

datasets/gdp

Country, regional and world GDP in current US Dollars ($)

Language: Python - Size: 765 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 80 - Forks: 58

datasets/bond-yields-gov-long-term

Long term government bond yields

Language: Python - Size: 16.6 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 7 - Forks: 4

datasets/awesome-data

Curated list of quality open datasets

Size: 190 KB - Last synced at: 7 days ago - Pushed at: 2 months ago - Stars: 854 - Forks: 114

datasets/co2-ppm-daily

Carbon Dioxide levels in the atmosphere (ppm on a daily basis)

Language: Python - Size: 427 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 14 - Forks: 11

datasets/publicbodies

A database of public bodies such as government departments, ministries etc.

Language: Less - Size: 13.5 MB - Last synced at: 11 days ago - Pushed at: 4 months ago - Stars: 68 - Forks: 28

datasets/un-locode

United Nations Codes for Trade and Transport Locations (UN/LOCODE) and Country Codes

Language: Python - Size: 28.6 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 154 - Forks: 58

datasets/geo-countries

Country polygons as GeoJSON in a datapackage

Language: Makefile - Size: 10.7 MB - Last synced at: 27 days ago - Pushed at: about 1 month ago - Stars: 497 - Forks: 139

datasets/covid-19

Novel Coronavirus 2019 time series data on cases

Language: Python - Size: 4.93 GB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 1,167 - Forks: 603

datasets/cpi-us

Us Consumer Price Index (DataHub Data Package)

Language: Python - Size: 80.1 KB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 16 - Forks: 11

datasets/employment-us

US Employment and Unemployment rates since 1940 from Bureau of Labor Statistics

Language: Python - Size: 36.1 KB - Last synced at: 30 days ago - Pushed at: about 1 month ago - Stars: 16 - Forks: 11

datasets/imf-weo

IMF World Economic Outlook Database Data

Language: Python - Size: 3.08 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 39 - Forks: 16

datasets/language-codes

ISO Language Codes (639-1 and 639-2)

Language: Shell - Size: 56.3 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 101 - Forks: 61

datasets/country-codes

Comprehensive country code information, including ISO 3166 codes, ITU dialing codes, ISO 4217 currency codes, and many others

Language: Python - Size: 784 KB - Last synced at: 28 days ago - Pushed at: about 1 month ago - Stars: 926 - Forks: 574

datasets/smdg-master-terminal-facilities-list

List mantained by the SMDG Secretariat to specify the port terminal facilities in UN/EDIFACT messages.

Language: Python - Size: 203 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 6 - Forks: 11

datasets/london-median-housing-affordability

Language: Python - Size: 15.6 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 4

datasets/currency-codes

ISO 4217 List of Currencies and Currency Codes

Language: Shell - Size: 77.1 KB - Last synced at: 28 days ago - Pushed at: about 1 month ago - Stars: 156 - Forks: 179

datasets/cash-surplus-deficit

Cash Surplus/Deficit (% of GDP), from 1990 to 2013

Language: Python - Size: 381 KB - Last synced at: 30 days ago - Pushed at: about 1 month ago - Stars: 6 - Forks: 4

datasets/gdp-us

Gross Domestic Product of the United States (US GDP)

Language: Python - Size: 46.9 KB - Last synced at: 30 days ago - Pushed at: about 1 month ago - Stars: 10 - Forks: 13

datasets/population

Population figures for countries, regions (e.g. Asia) and the world.

Language: Python - Size: 718 KB - Last synced at: 29 days ago - Pushed at: about 1 month ago - Stars: 103 - Forks: 149

datasets/bond-yields-uk-10y

Long-term (10 year) UK Government Bond Yields

Language: Python - Size: 54.7 KB - Last synced at: 30 days ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 11

datasets/inflation

Annual Inflation, GDP deflator and consumer prices

Language: Python - Size: 1.09 MB - Last synced at: 30 days ago - Pushed at: about 1 month ago - Stars: 22 - Forks: 5

datasets/harmonized-system

HS Code as a datapackage

Language: Python - Size: 329 KB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 132 - Forks: 46

datasets/exchange-rates

Foreign exchange rates from US Federal Reserve.

Language: Python - Size: 3.15 MB - Last synced at: 30 days ago - Pushed at: 7 months ago - Stars: 25 - Forks: 20

datasets/breast-cancer

Breast cancer occurrences.

Language: Python - Size: 10.7 KB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 24 - Forks: 70

datasets/country-list

List of all countries in the world with their ISO 2 digit codes (ISO 3166-1) as CSV and JSON

Size: 40 KB - Last synced at: 10 days ago - Pushed at: 4 months ago - Stars: 167 - Forks: 185

datasets/socrata-opendata

This repo contains scripts for generating datasets from socrata-opendata

Language: Python - Size: 17.6 KB - Last synced at: 30 days ago - Pushed at: 7 months ago - Stars: 3 - Forks: 4

datasets/clinical-trials-us

Official US clinical trial outcomes from the FDA

Language: JavaScript - Size: 11.7 KB - Last synced at: 30 days ago - Pushed at: 7 months ago - Stars: 32 - Forks: 9

datasets/s-and-p-500

S&P 500 index data (aka Standard and Poor's index of 500 major US stocks)

Language: Python - Size: 2.85 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 289 - Forks: 49

datasets/dermatology

Patients with dermatology illnesses.

Language: Python - Size: 11.7 KB - Last synced at: 30 days ago - Pushed at: 7 months ago - Stars: 7 - Forks: 11

datasets/eeg-eye-state

EEG measurements where the output is whether eye was open or not

Language: Python - Size: 401 KB - Last synced at: 30 days ago - Pushed at: 7 months ago - Stars: 6 - Forks: 5

datasets/core-datasets

DataHub.io awesome datasets - curated collections of high quality dataset organized by topic

Language: JavaScript - Size: 162 KB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 60 - Forks: 15

datasets/five-thirty-eight-datasets

Over 100 datasets scraped from FiveThirtyEight

Language: Python - Size: 39.7 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 38 - Forks: 33

datasets/exchange-rates-usd

Exchange Rates Data Package

Language: Python - Size: 1.95 KB - Last synced at: 30 days ago - Pushed at: 7 months ago - Stars: 9 - Forks: 7

datasets/browser-stats

Web browser usage statistics

Language: Python - Size: 20.5 KB - Last synced at: 30 days ago - Pushed at: 6 months ago - Stars: 20 - Forks: 12

datasets/threatened-species

Dataset covering IUCN Red List of Threatened Animal Species

Language: Python - Size: 4.03 MB - Last synced at: 7 days ago - Pushed at: 6 months ago - Stars: 4 - Forks: 3

datasets/usa-education-budget-analysis

United States of America Education budget to GDP analysis

Language: Python - Size: 396 KB - Last synced at: 30 days ago - Pushed at: 5 months ago - Stars: 4 - Forks: 5

datasets/fips-10-4

List of FIPS (Federal Information Processing Standards) region codes

Language: Python - Size: 164 KB - Last synced at: 30 days ago - Pushed at: 4 months ago - Stars: 10 - Forks: 8

datasets/pharmaceutical-drug-spending

Pharmaceutical Drug Spending by countries

Language: Python - Size: 2.56 MB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 8 - Forks: 8

datasets/population-reference-bureau

Collect datasets from Population Reference Bureau about demographic and health

Language: Python - Size: 7.41 MB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 4 - Forks: 5

datasets/london-transport

Language: Python - Size: 41 KB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 4 - Forks: 5

datasets/genome-sequencing-costs

Costs associated with DNA sequencing since 2001

Language: Python - Size: 61.5 KB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 6 - Forks: 9

datasets/household-income-us-historical

Income Limits for Each Fifth and Top 5 Percent of All Households: 1967 to 2016

Language: Python - Size: 27.3 KB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 6 - Forks: 6

datasets/house-prices-global

Residential property price statistics from different countries (from bis.org)

Language: Python - Size: 1.08 MB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 6 - Forks: 11

datasets/cpi-change

Annual Consumer Price Index Percent Change 1974-2016

Language: Python - Size: 24.4 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 4 - Forks: 4

datasets/population-city

City population yearly timeseries for female and male, and for both sexes, collected by the United Nations Statistics Division and published by UNData.

Language: Python - Size: 2.08 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 27 - Forks: 9

datasets/interest-rates-gb

Interest Rate since 1694 from Bank of England.

Language: Python - Size: 24.4 KB - Last synced at: 30 days ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 5

datasets/cofog

Classifications of Functions of Government

Language: Python - Size: 68.4 KB - Last synced at: 30 days ago - Pushed at: about 2 months ago - Stars: 9 - Forks: 6

datasets/dac-and-crs-code-lists

Machine readable DAC CRS codelists

Language: Python - Size: 3.86 MB - Last synced at: 30 days ago - Pushed at: about 1 month ago - Stars: 12 - Forks: 9

datasets/eu-emissions-trading-system

Data about the EU emission trading system (ETS)

Language: Python - Size: 1.28 MB - Last synced at: 30 days ago - Pushed at: 7 months ago - Stars: 16 - Forks: 9

datasets/datacatalogs.org

Data from DataCatalogs.org

Language: Python - Size: 4.88 KB - Last synced at: 30 days ago - Pushed at: 7 months ago - Stars: 9 - Forks: 4

datasets/geo-ne-admin1

Test of a datapackage for Natural Earth admin1

Language: Python - Size: 9.96 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 7 - Forks: 5

datasets/unece-units-of-measure

Standardised codes from Recommendation 20, mantained by UNECE.

Language: Java - Size: 636 KB - Last synced at: 30 days ago - Pushed at: 7 months ago - Stars: 14 - Forks: 10

datasets/gcat-artificial-space-objects

General Catalog of Artificial Space Objects. Jonathan's Space Report.

Size: 1.95 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

datasets/race-and-ethnicity-codes-us

US Race and Ethnicity Codes

Size: 4.88 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 4

datasets/global-temp-anomalies

Data about global annual anomalies

Language: Python - Size: 40 KB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 6 - Forks: 5

datasets/s-and-p-500-companies-financials

List of companies in the S&P 500 (Standard and Poor's 500).

Language: HTML - Size: 1.83 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 66 - Forks: 85

datasets/london-life-expectancy

Language: Python - Size: 70.3 KB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 3 - Forks: 4

datasets/london-unemployment

Language: Python - Size: 24.4 KB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 2 - Forks: 6

datasets/london-population

Population of London - CSV'd and Data Package'd

Language: Python - Size: 39.1 KB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 3 - Forks: 7

datasets/gdp-uk

UK GDP

Language: Shell - Size: 31.3 KB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 4 - Forks: 5

datasets/london-underground-report

Language: Python - Size: 23.4 KB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 3 - Forks: 3

datasets/cervical-cancer

Cervical cancer occurrences

Language: Python - Size: 20.5 KB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 6 - Forks: 9

datasets/ICC-Incoterms

International Commercial Terms (‘Incoterms’) are internationally recognised standard trade terms used in sales contracts.

Size: 13.7 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 6 - Forks: 12

datasets/sea-level-rise

Global Mean Sea Level Rise

Language: Python - Size: 419 KB - Last synced at: 30 days ago - Pushed at: 4 months ago - Stars: 22 - Forks: 28

datasets/glwd

Global Lakes and Wetlands Database Levels 1 and 2 Polygons as GeoJSON (.geojson/.topojson) with original format (.shp)

Size: 141 MB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 12 - Forks: 7

datasets/IMO-IMDG-Codes

Official IMDG Codes for use in transport of dangerous goods as described by the IMO

Size: 12.7 KB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 12 - Forks: 10

datasets/edgar

Securities and Exchange Commission (SEC) EDGAR database which contains regulatory filings from publicly-traded US corporations.

Language: HTML - Size: 26.4 KB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 330 - Forks: 68

datasets/ISO-Container-Codes

Coded list of ISO 6346 shipping containers, used in international trade and electronic shipping messages.

Size: 29.3 KB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 39 - Forks: 24

datasets/continent-codes

List of continents with two letter code

Size: 15.6 KB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 20 - Forks: 15

datasets/global-temp

Global Temperature Time Series

Language: Python - Size: 187 KB - Last synced at: 30 days ago - Pushed at: 6 months ago - Stars: 22 - Forks: 18

datasets/geo-nuts-administrative-boundaries

Datapackage for NUTS admin levels 1, 2 and 3 edition 2010

Language: Python - Size: 11.9 MB - Last synced at: 30 days ago - Pushed at: 5 months ago - Stars: 11 - Forks: 7

datasets/primary-tumor

Primary tumors in people

Language: Python - Size: 7.81 KB - Last synced at: 30 days ago - Pushed at: 7 months ago - Stars: 5 - Forks: 3

datasets/co2-fossil-by-nation

Annual info about co2 emissions per nation

Language: Python - Size: 1.86 MB - Last synced at: 30 days ago - Pushed at: 5 months ago - Stars: 14 - Forks: 16

datasets/world-religion-projections

Word Religion Projections (2010-2050)

Size: 45.9 KB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 14 - Forks: 9

datasets/glacier-mass-balance

Average cumulative mass balance of "reference" Glaciers worldwide

Language: Python - Size: 30.3 KB - Last synced at: 30 days ago - Pushed at: 6 months ago - Stars: 9 - Forks: 18