GitHub / SA01 / spark-data-stats-tutorial
Contains the code and examples for my article on Medium, which explains how to optimize computing data statistics in Apache Spark jobs using the Observations feature.
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SA01%2Fspark-data-stats-tutorial
PURL: pkg:github/SA01/spark-data-stats-tutorial
Stars: 0
Forks: 0
Open issues: 0
License: None
Language: Python
Size: 4.88 KB
Dependencies parsed at: Pending
Created at: about 2 years ago
Updated at: 8 months ago
Pushed at: 8 months ago
Last synced at: 8 months ago
Topics: analytics, apache-spark, big-data, data-engineering, pyspark, python, spark, spark-monitor, spark-sql