An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: spark-worker

longNguyen010203/Spark-Processing-AWS

👷🌇 Set up and build a big data processing pipeline with Apache Spark, 📦 AWS services (S3, EMR, EC2, IAM, VPC, Redshift) Terraform to setup the infrastructure and Integration Airflow to automate workflows🥊

Language: Python - Size: 1010 KB - Last synced at: about 2 months ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

JBris/docker-spark-sparklyr

Docker setup for Apache Spark and the R sparklyr package

Language: Dockerfile - Size: 18.6 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

indy-3rdman/docker-dotnet-spark

A .NET for Apache Spark docker image (3rdman/dotnet-spark)

Language: Shell - Size: 15 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 18 - Forks: 10