GitHub / imsanjoykb / ETL-Project
The goal of this project is to illustrate Extract Transform Load (ETL) using Python and SQL. ETL is a process commonly done in computing, which takes raw data, cleans it and stores it for later use. The extraction phase targets and retrieves the data. Transform manipulates and cleans the data. Then load stores the data, typically in a data warehouse.
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/imsanjoykb%2FETL-Project
PURL: pkg:github/imsanjoykb/ETL-Project
Stars: 22
Forks: 9
Open issues: 0
License: mit
Language: Jupyter Notebook
Size: 285 KB
Dependencies parsed at: Pending
Created at: almost 4 years ago
Updated at: 5 months ago
Pushed at: almost 4 years ago
Last synced at: 4 months ago
Topics: data-engineering, database, datalake, datawarehouse, etl, etl-automation, etl-pipeline, etl-solutions