GitHub topics: spark-dataset
JBris/docker-spark-sparklyr
Docker setup for Apache Spark and the R sparklyr package
Language: Dockerfile - Size: 18.6 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

NashTech-Labs/Sparkathon
A library of Java and Scala examples for Spark 2.x
Language: Java - Size: 113 MB - Last synced at: 4 months ago - Pushed at: over 8 years ago - Stars: 7 - Forks: 9

ayse-ok/ApacheSpark
Apache Spark examples in Java: reading files, map transformations, the Spark SQL API, and the Dataset API
Language: Java - Size: 96.7 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

fpopic/wt-interview-challenge
(Interview) WT Data Engineer Interview Challenge
Language: Scala - Size: 74.2 KB - Last synced at: 5 months ago - Pushed at: almost 8 years ago - Stars: 4 - Forks: 0

amanjeetsahu/Apache-Spark-using-Scala
This repo contains my learnings and practice Zeppelin notebooks on Spark using Scala. The notebooks can be used as template code for most ML algorithms and built upon for more complex problems.
Size: 20.9 MB - Last synced at: 9 days ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

SevakAvet/spark-session-enricher
Calculates user sessions and statistics on top of them for an imaginary e-commerce site using Spark SQL and aggregations
Language: Scala - Size: 10.7 KB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0
