An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: spark-dataset

JBris/docker-spark-sparklyr

Docker setup for Apache Spark and the R sparklyr package

Language: Dockerfile - Size: 18.6 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

NashTech-Labs/Sparkathon

A library having Java and Scala examples for Spark 2.x

Language: Java - Size: 113 MB - Last synced at: 4 months ago - Pushed at: over 8 years ago - Stars: 7 - Forks: 9

ayse-ok/ApacheSpark

Apache Spark read file, map transformation, spark-sql api, spark-dataset api examples with java

Language: Java - Size: 96.7 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

fpopic/wt-interview-challenge

(Interview) WT Data Engineer Interview Challenge

Language: Scala - Size: 74.2 KB - Last synced at: 5 months ago - Pushed at: almost 8 years ago - Stars: 4 - Forks: 0

amanjeetsahu/Apache-Spark-using-Scala

This repo contains my learnings and practices Zepplin notebooks on Spark using Scala. All the notebooks in the repo can be used as template code for most of the ML algorithms and can be built upon it for more complex problems.

Size: 20.9 MB - Last synced at: 9 days ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

SevakAvet/spark-session-enricher

Calculate user sessions & stats on top of them for imaginary ecom site using Spark sql & aggregations

Language: Scala - Size: 10.7 KB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0