An open API service providing repository metadata for many open source software ecosystems.

GitHub / imsanjoykb / ETL-Project

The goal of this project is to illustrate Extract Transform Load (ETL) using Python and SQL. ETL is a process commonly done in computing, which takes raw data, cleans it and stores it for later use. The extraction phase targets and retrieves the data. Transform manipulates and cleans the data. Then load stores the data, typically in a data warehouse.

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/imsanjoykb%2FETL-Project
PURL: pkg:github/imsanjoykb/ETL-Project

Stars: 22
Forks: 9
Open issues: 0

License: mit
Language: Jupyter Notebook
Size: 285 KB
Dependencies parsed at: Pending

Created at: almost 4 years ago
Updated at: 5 months ago
Pushed at: almost 4 years ago
Last synced at: 4 months ago

Topics: data-engineering, database, datalake, datawarehouse, etl, etl-automation, etl-pipeline, etl-solutions

    Loading...