An open API service providing repository metadata for many open source software ecosystems.

GitHub / airscholar / FootballDataEngineering

An end-to-end data engineering pipeline that fetches data from Wikipedia, cleans and transforms it with Apache Airflow and saves it on Azure Data Lake. Other processing takes place on Azure Data Factory, Azure Synapse and Tableau.

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/airscholar%2FFootballDataEngineering
PURL: pkg:github/airscholar/FootballDataEngineering

Stars: 23
Forks: 19
Open issues: 0

License: None
Language: Python
Size: 469 KB
Dependencies parsed at: Pending

Created at: almost 2 years ago
Updated at: 4 months ago
Pushed at: almost 2 years ago
Last synced at: 4 months ago

Topics: apache-airflow, azure-data-factory, azure-data-lake-gen2, azure-databricks, azure-synapse-analytics, data-engineering, dataengineering

    Loading...