GitHub / datasets 114 Repositories
Important, commonly-used datasets in high quality, easy-to-use & open form as data packages
datasets/media-types
List of MIME types, subtypes, and file name extensions.
Language: Python - Size: 174 KB - Last synced at: about 20 hours ago - Pushed at: about 21 hours ago - Stars: 33 - Forks: 12

datasets/nyse-other-listings
Data package for NYSE listings
Language: Python - Size: 1.58 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 30 - Forks: 19

datasets/finance-vix
CBOE Volatility Index (VIX) time-series dataset including daily open, close, high and low.
Language: Makefile - Size: 467 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 75 - Forks: 36

datasets/commodity-prices
Monthly Prices of 53 commodities and 10 indexes from 1980 to 2016.
Language: Python - Size: 926 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 27 - Forks: 10

datasets/football-datasets
Major Europe leagues data (England, Spain, Italy, Germany and France)
Language: Python - Size: 1.4 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 68 - Forks: 26

datasets/emojis
Unicode Emoji as UTS #51 specification
Language: Python - Size: 463 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 32 - Forks: 18

datasets/oil-prices
Brent crude and WTI oil prices from US EIA
Language: Python - Size: 1.19 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 102 - Forks: 62

datasets/geoip2-ipv4
GeoIP2 - free IP geolocation database.
Language: Python - Size: 24.4 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 65 - Forks: 20

datasets/ppp
Purchasing power parity (PPP)
Language: Python - Size: 690 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 42 - Forks: 15

datasets/investor-flow-of-funds-us
Monthly net new cash flow into various mutual fund investment classes (equities, bonds etc).
Language: Python - Size: 133 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 33 - Forks: 21

datasets/house-prices-us
US House Price Indices (Case-Shiller)
Language: Python - Size: 1.17 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 25 - Forks: 22

datasets/top-level-domain-names
The delegation details of top-level domains
Language: Python - Size: 58.6 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 40 - Forks: 42

datasets/bond-yields-us-10y
10 year nominal yields on US government bonds from the Federal Reserve
Language: Python - Size: 69.3 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 18 - Forks: 10

datasets/natural-gas
Natural Gas Prices including Henry Hub
Language: Python - Size: 280 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 27 - Forks: 16

datasets/cpi-gb
Consumer Price Index (and hence inflation) for the UK from 1850 to the present (monthly since June 1947).
Language: Python - Size: 39.1 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 9 - Forks: 7

datasets/euribor
Euribor rates by year and granularity.
Language: Python - Size: 287 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 11 - Forks: 13

datasets/house-prices-uk
UK house prices dataset
Language: Python - Size: 393 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 14 - Forks: 46

datasets/gini-index
Repository of the GINI index official repository.
Language: Python - Size: 142 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 16 - Forks: 9

datasets/cpi
Annual consumer price index datapackage for most countries in the world
Language: Python - Size: 384 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 15 - Forks: 10

datasets/world-cities
List of major cities of the world as a datapackage
Language: Python - Size: 819 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 254 - Forks: 202

datasets/nasdaq-listings
Data package for Nasdaq listings
Language: Python - Size: 775 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 63 - Forks: 56

datasets/land-matrix
land-matrix
Language: Python - Size: 1.92 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 7 - Forks: 4

datasets/gold-prices
Gold prices data package
Language: Python - Size: 1.04 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 51 - Forks: 42

datasets/co2-ppm
CO2 PPM - Trends in Atmospheric Carbon Dioxide
Language: Shell - Size: 207 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 21 - Forks: 15

datasets/membership-to-copyright-treaties
Membership to Copyright Treaties
Language: Python - Size: 78.1 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 3 - Forks: 6

datasets/gdp
Country, regional and world GDP in current US Dollars ($)
Language: Python - Size: 765 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 80 - Forks: 58

datasets/bond-yields-gov-long-term
Long term government bond yields
Language: Python - Size: 16.6 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 7 - Forks: 4

datasets/awesome-data
Curated list of quality open datasets
Size: 190 KB - Last synced at: 7 days ago - Pushed at: 2 months ago - Stars: 854 - Forks: 114

datasets/co2-ppm-daily
Carbon Dioxide levels in the atmosphere (ppm on a daily basis)
Language: Python - Size: 427 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 14 - Forks: 11

datasets/publicbodies
A database of public bodies such as government departments, ministries etc.
Language: Less - Size: 13.5 MB - Last synced at: 11 days ago - Pushed at: 4 months ago - Stars: 68 - Forks: 28

datasets/un-locode
United Nations Codes for Trade and Transport Locations (UN/LOCODE) and Country Codes
Language: Python - Size: 28.6 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 154 - Forks: 58

datasets/geo-countries
Country polygons as GeoJSON in a datapackage
Language: Makefile - Size: 10.7 MB - Last synced at: 27 days ago - Pushed at: about 1 month ago - Stars: 497 - Forks: 139

datasets/covid-19
Novel Coronavirus 2019 time series data on cases
Language: Python - Size: 4.93 GB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 1,167 - Forks: 603

datasets/cpi-us
Us Consumer Price Index (DataHub Data Package)
Language: Python - Size: 80.1 KB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 16 - Forks: 11

datasets/employment-us
US Employment and Unemployment rates since 1940 from Bureau of Labor Statistics
Language: Python - Size: 36.1 KB - Last synced at: 30 days ago - Pushed at: about 1 month ago - Stars: 16 - Forks: 11

datasets/imf-weo
IMF World Economic Outlook Database Data
Language: Python - Size: 3.08 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 39 - Forks: 16

datasets/language-codes
ISO Language Codes (639-1 and 639-2)
Language: Shell - Size: 56.3 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 101 - Forks: 61

datasets/country-codes
Comprehensive country code information, including ISO 3166 codes, ITU dialing codes, ISO 4217 currency codes, and many others
Language: Python - Size: 784 KB - Last synced at: 28 days ago - Pushed at: about 1 month ago - Stars: 926 - Forks: 574

datasets/smdg-master-terminal-facilities-list
List mantained by the SMDG Secretariat to specify the port terminal facilities in UN/EDIFACT messages.
Language: Python - Size: 203 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 6 - Forks: 11

datasets/london-median-housing-affordability
Language: Python - Size: 15.6 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 4

datasets/currency-codes
ISO 4217 List of Currencies and Currency Codes
Language: Shell - Size: 77.1 KB - Last synced at: 28 days ago - Pushed at: about 1 month ago - Stars: 156 - Forks: 179

datasets/cash-surplus-deficit
Cash Surplus/Deficit (% of GDP), from 1990 to 2013
Language: Python - Size: 381 KB - Last synced at: 30 days ago - Pushed at: about 1 month ago - Stars: 6 - Forks: 4

datasets/gdp-us
Gross Domestic Product of the United States (US GDP)
Language: Python - Size: 46.9 KB - Last synced at: 30 days ago - Pushed at: about 1 month ago - Stars: 10 - Forks: 13

datasets/population
Population figures for countries, regions (e.g. Asia) and the world.
Language: Python - Size: 718 KB - Last synced at: 29 days ago - Pushed at: about 1 month ago - Stars: 103 - Forks: 149

datasets/bond-yields-uk-10y
Long-term (10 year) UK Government Bond Yields
Language: Python - Size: 54.7 KB - Last synced at: 30 days ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 11

datasets/inflation
Annual Inflation, GDP deflator and consumer prices
Language: Python - Size: 1.09 MB - Last synced at: 30 days ago - Pushed at: about 1 month ago - Stars: 22 - Forks: 5

datasets/harmonized-system
HS Code as a datapackage
Language: Python - Size: 329 KB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 132 - Forks: 46

datasets/exchange-rates
Foreign exchange rates from US Federal Reserve.
Language: Python - Size: 3.15 MB - Last synced at: 30 days ago - Pushed at: 7 months ago - Stars: 25 - Forks: 20

datasets/breast-cancer
Breast cancer occurrences.
Language: Python - Size: 10.7 KB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 24 - Forks: 70

datasets/country-list
List of all countries in the world with their ISO 2 digit codes (ISO 3166-1) as CSV and JSON
Size: 40 KB - Last synced at: 10 days ago - Pushed at: 4 months ago - Stars: 167 - Forks: 185

datasets/socrata-opendata
This repo contains scripts for generating datasets from socrata-opendata
Language: Python - Size: 17.6 KB - Last synced at: 30 days ago - Pushed at: 7 months ago - Stars: 3 - Forks: 4

datasets/clinical-trials-us
Official US clinical trial outcomes from the FDA
Language: JavaScript - Size: 11.7 KB - Last synced at: 30 days ago - Pushed at: 7 months ago - Stars: 32 - Forks: 9

datasets/s-and-p-500
S&P 500 index data (aka Standard and Poor's index of 500 major US stocks)
Language: Python - Size: 2.85 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 289 - Forks: 49

datasets/dermatology
Patients with dermatology illnesses.
Language: Python - Size: 11.7 KB - Last synced at: 30 days ago - Pushed at: 7 months ago - Stars: 7 - Forks: 11

datasets/eeg-eye-state
EEG measurements where the output is whether eye was open or not
Language: Python - Size: 401 KB - Last synced at: 30 days ago - Pushed at: 7 months ago - Stars: 6 - Forks: 5

datasets/core-datasets
DataHub.io awesome datasets - curated collections of high quality dataset organized by topic
Language: JavaScript - Size: 162 KB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 60 - Forks: 15

datasets/five-thirty-eight-datasets
Over 100 datasets scraped from FiveThirtyEight
Language: Python - Size: 39.7 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 38 - Forks: 33

datasets/exchange-rates-usd
Exchange Rates Data Package
Language: Python - Size: 1.95 KB - Last synced at: 30 days ago - Pushed at: 7 months ago - Stars: 9 - Forks: 7

datasets/browser-stats
Web browser usage statistics
Language: Python - Size: 20.5 KB - Last synced at: 30 days ago - Pushed at: 6 months ago - Stars: 20 - Forks: 12

datasets/threatened-species
Dataset covering IUCN Red List of Threatened Animal Species
Language: Python - Size: 4.03 MB - Last synced at: 7 days ago - Pushed at: 6 months ago - Stars: 4 - Forks: 3

datasets/usa-education-budget-analysis
United States of America Education budget to GDP analysis
Language: Python - Size: 396 KB - Last synced at: 30 days ago - Pushed at: 5 months ago - Stars: 4 - Forks: 5

datasets/fips-10-4
List of FIPS (Federal Information Processing Standards) region codes
Language: Python - Size: 164 KB - Last synced at: 30 days ago - Pushed at: 4 months ago - Stars: 10 - Forks: 8

datasets/pharmaceutical-drug-spending
Pharmaceutical Drug Spending by countries
Language: Python - Size: 2.56 MB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 8 - Forks: 8

datasets/population-reference-bureau
Collect datasets from Population Reference Bureau about demographic and health
Language: Python - Size: 7.41 MB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 4 - Forks: 5

datasets/london-transport
Language: Python - Size: 41 KB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 4 - Forks: 5

datasets/genome-sequencing-costs
Costs associated with DNA sequencing since 2001
Language: Python - Size: 61.5 KB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 6 - Forks: 9

datasets/household-income-us-historical
Income Limits for Each Fifth and Top 5 Percent of All Households: 1967 to 2016
Language: Python - Size: 27.3 KB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 6 - Forks: 6

datasets/house-prices-global
Residential property price statistics from different countries (from bis.org)
Language: Python - Size: 1.08 MB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 6 - Forks: 11

datasets/cpi-change
Annual Consumer Price Index Percent Change 1974-2016
Language: Python - Size: 24.4 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 4 - Forks: 4

datasets/population-city
City population yearly timeseries for female and male, and for both sexes, collected by the United Nations Statistics Division and published by UNData.
Language: Python - Size: 2.08 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 27 - Forks: 9

datasets/interest-rates-gb
Interest Rate since 1694 from Bank of England.
Language: Python - Size: 24.4 KB - Last synced at: 30 days ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 5

datasets/cofog
Classifications of Functions of Government
Language: Python - Size: 68.4 KB - Last synced at: 30 days ago - Pushed at: about 2 months ago - Stars: 9 - Forks: 6

datasets/dac-and-crs-code-lists
Machine readable DAC CRS codelists
Language: Python - Size: 3.86 MB - Last synced at: 30 days ago - Pushed at: about 1 month ago - Stars: 12 - Forks: 9

datasets/eu-emissions-trading-system
Data about the EU emission trading system (ETS)
Language: Python - Size: 1.28 MB - Last synced at: 30 days ago - Pushed at: 7 months ago - Stars: 16 - Forks: 9

datasets/datacatalogs.org
Data from DataCatalogs.org
Language: Python - Size: 4.88 KB - Last synced at: 30 days ago - Pushed at: 7 months ago - Stars: 9 - Forks: 4

datasets/geo-ne-admin1
Test of a datapackage for Natural Earth admin1
Language: Python - Size: 9.96 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 7 - Forks: 5

datasets/unece-units-of-measure
Standardised codes from Recommendation 20, mantained by UNECE.
Language: Java - Size: 636 KB - Last synced at: 30 days ago - Pushed at: 7 months ago - Stars: 14 - Forks: 10

datasets/gcat-artificial-space-objects
General Catalog of Artificial Space Objects. Jonathan's Space Report.
Size: 1.95 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

datasets/race-and-ethnicity-codes-us
US Race and Ethnicity Codes
Size: 4.88 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 4

datasets/global-temp-anomalies
Data about global annual anomalies
Language: Python - Size: 40 KB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 6 - Forks: 5

datasets/s-and-p-500-companies-financials
List of companies in the S&P 500 (Standard and Poor's 500).
Language: HTML - Size: 1.83 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 66 - Forks: 85

datasets/london-life-expectancy
Language: Python - Size: 70.3 KB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 3 - Forks: 4

datasets/london-unemployment
Language: Python - Size: 24.4 KB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 2 - Forks: 6

datasets/london-population
Population of London - CSV'd and Data Package'd
Language: Python - Size: 39.1 KB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 3 - Forks: 7

datasets/gdp-uk
UK GDP
Language: Shell - Size: 31.3 KB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 4 - Forks: 5

datasets/london-underground-report
Language: Python - Size: 23.4 KB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 3 - Forks: 3

datasets/cervical-cancer
Cervical cancer occurrences
Language: Python - Size: 20.5 KB - Last synced at: 30 days ago - Pushed at: 3 months ago - Stars: 6 - Forks: 9

datasets/ICC-Incoterms
International Commercial Terms (‘Incoterms’) are internationally recognised standard trade terms used in sales contracts.
Size: 13.7 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 6 - Forks: 12

datasets/sea-level-rise
Global Mean Sea Level Rise
Language: Python - Size: 419 KB - Last synced at: 30 days ago - Pushed at: 4 months ago - Stars: 22 - Forks: 28

datasets/glwd
Global Lakes and Wetlands Database Levels 1 and 2 Polygons as GeoJSON (.geojson/.topojson) with original format (.shp)
Size: 141 MB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 12 - Forks: 7

datasets/IMO-IMDG-Codes
Official IMDG Codes for use in transport of dangerous goods as described by the IMO
Size: 12.7 KB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 12 - Forks: 10

datasets/edgar
Securities and Exchange Commission (SEC) EDGAR database which contains regulatory filings from publicly-traded US corporations.
Language: HTML - Size: 26.4 KB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 330 - Forks: 68

datasets/ISO-Container-Codes
Coded list of ISO 6346 shipping containers, used in international trade and electronic shipping messages.
Size: 29.3 KB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 39 - Forks: 24

datasets/continent-codes
List of continents with two letter code
Size: 15.6 KB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 20 - Forks: 15

datasets/global-temp
Global Temperature Time Series
Language: Python - Size: 187 KB - Last synced at: 30 days ago - Pushed at: 6 months ago - Stars: 22 - Forks: 18

datasets/geo-nuts-administrative-boundaries
Datapackage for NUTS admin levels 1, 2 and 3 edition 2010
Language: Python - Size: 11.9 MB - Last synced at: 30 days ago - Pushed at: 5 months ago - Stars: 11 - Forks: 7

datasets/primary-tumor
Primary tumors in people
Language: Python - Size: 7.81 KB - Last synced at: 30 days ago - Pushed at: 7 months ago - Stars: 5 - Forks: 3

datasets/co2-fossil-by-nation
Annual info about co2 emissions per nation
Language: Python - Size: 1.86 MB - Last synced at: 30 days ago - Pushed at: 5 months ago - Stars: 14 - Forks: 16

datasets/world-religion-projections
Word Religion Projections (2010-2050)
Size: 45.9 KB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 14 - Forks: 9

datasets/glacier-mass-balance
Average cumulative mass balance of "reference" Glaciers worldwide
Language: Python - Size: 30.3 KB - Last synced at: 30 days ago - Pushed at: 6 months ago - Stars: 9 - Forks: 18
