GitHub / branesh2k / AWS-emr-project
AWS EMR-based ETL pipeline using PySpark and S3. Executed using SSH spark-submit.
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/branesh2k%2FAWS-emr-project
PURL: pkg:github/branesh2k/AWS-emr-project
Stars: 0
Forks: 0
Open issues: 0
License: None
Language: Python
Size: 1.29 MB
Dependencies parsed at: Pending
Created at: 4 months ago
Updated at: 4 months ago
Pushed at: 4 months ago
Last synced at: 4 months ago
Topics: aws-emr, big-data, pyspark, s3-etl, spark-submit, ssh