GitHub / vaxdata22 / City-Weather-and-S3File-RDS-S3-BigQuery-ETL-by-Airflow-on-EC2
This is my third AWS Cloud ETL project. This data pipeline orchestration uses Apache Airflow on AWS EC2. It demonstrates how to build an ETL data pipeline that would perform data extraction to a database in parallel to a loading process into the same database, join the tables, copy joined data to S3 and finally copy the S3 file to BigQuery DW.
Stars: 0
Forks: 0
Open issues: 0
License: None
Language: Jupyter Notebook
Size: 4.44 MB
Dependencies parsed at: Pending
Created at: 3 months ago
Updated at: 2 months ago
Pushed at: 2 months ago
Last synced at: 2 months ago
Topics: apache-airflow, aws-ec2, aws-rds-postgres, aws-s3, bigquery, business-intelligence, dags, data-warehousing, etl-pipeline, openweathermap-api, orchestration, python3, sql