An open API service providing repository metadata for many open source software ecosystems.

GitHub / shahidmalik4 / aws-glue-stepfunctions-etl

This project automates an ETL pipeline using AWS Glue, S3, Athena, and Step Functions to transform raw Airbnb data. It cleanses, enriches, and organizes the data into separate raw and transformed databases, enabling efficient querying and analysis via Athena, with automated notifications through SNS.

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/shahidmalik4%2Faws-glue-stepfunctions-etl

Stars: 0
Forks: 0
Open issues: 0

License: None
Language: Python
Size: 3.47 MB
Dependencies parsed at: Pending

Created at: 6 months ago
Updated at: 6 months ago
Pushed at: 6 months ago
Last synced at: about 2 months ago

Topics: aws, aws-athena, aws-glue, aws-glue-crawler, aws-s3, aws-sns, aws-step-functions, etl-pipeline, pyspark

    Loading...