An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: spark-session

codam-coding-college/spark-sessions

Spark sessions help beginning students dissect the first larger projects of the curriculum.

Language: Shell - Size: 5.85 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 15 - Forks: 1

Lefteris-Souflas/Spark-Movies-Analytics

Utilizing Apache Spark & PySpark to analyze a movie dataset. Tasks include data exploration, identifying top-rated movies, training a linear regression model, and experimenting with Airflow.

Language: Jupyter Notebook - Size: 289 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

tohid-yousefi/Customer_Segmentation_with_pyspark_on_Flo_Dataset

In this section, we will perform customer segmentation using pyspark in the Flo dataset.

Language: Jupyter Notebook - Size: 12.7 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0