Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub / jashshah-dev / AWS-Big-Data-Pipeline-orchestrated-with-Airflow

A robust data pipeline leveraging Amazon EMR and PySpark, orchestrated seamlessly with Apache Airflow for efficient batch processing

JSON API: https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jashshah-dev%2FAWS-Big-Data-Pipeline-orchestrated-with-Airflow

Stars: 0
Forks: 0
Open Issues: 0

License: None
Language: Python
Repo Size: 16.6 KB
Dependencies: pending

Created: 5 months ago
Updated: 5 months ago
Last pushed: 5 months ago
Last synced: 5 months ago

Topics: airflow-dags, amazon-s3, distributed-computing, emr-cluster, pyspark, snowflake, transient-cluster

Files
    Loading...
    Readme
    Loading...