GitHub / Wittline / pyspark-on-aws-emr
The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on writing pyspark code.
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Wittline%2Fpyspark-on-aws-emr
Stars: 27
Forks: 13
Open issues: 0
License: apache-2.0
Language: Python
Size: 3.61 MB
Dependencies parsed at: Pending
Created at: over 4 years ago
Updated at: 2 months ago
Pushed at: almost 3 years ago
Last synced at: about 2 months ago
Topics: aws, aws-emr, big-data, big-data-analytics, dataengineering, ec2-spot, ec2-spot-instances, emr-cluster, pyspark, python, spark, wordcloud-generator
Funding Links https://github.com/sponsors/Wittline