Topic: "pysaprk"
riju18/apache-iceberg-kickstart
Size: 70.3 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

miltiadiss/CEID_NE4348-Big-Data-Management-Systems
This project implements a real-time data pipeline with Kafka, Spark, and MongoDB. It generates vehicle data using UXSIM, streams it to a Kafka broker, processes it with Spark, and stores raw and processed data in MongoDB. Queries analyze vehicle counts, speeds, and routes over specified periods.
Language: Python - Size: 4.88 MB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

LalitSharma7/F1-Data-Analysis
Project based on application of azure databricks
Language: Python - Size: 28.3 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

victorlifan/Sparkify--Pyspark-Big-Data-Project
This project performed data wrangling, analysis, visualization as well as machine learning prediction on a hypothetical music app's user churn with pyspark.
Language: Jupyter Notebook - Size: 33.9 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

Munanga/Pyspark-Analysis
Sample analysis done using pyspark on parking violations issued for fiscal year 2017 using the databricks platform
Language: HTML - Size: 49.8 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

yhskgo/pyspark_deep_learning
Language: Jupyter Notebook - Size: 141 KB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

hd-zhao-uu/1TD169_Project
Language: Jupyter Notebook - Size: 1.36 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

johngodoi/learning_pyspark
Language: Jupyter Notebook - Size: 11.6 MB - Last synced at: 5 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

adharangaonkar/ETL-Pipelines
A repository concentrating on using High end parallel pipelines to perform ETL across various data sources
Language: Jupyter Notebook - Size: 672 KB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

jpoberhauser/dist_comp_final
NBA shot predictions with PySpark and SparkML
Language: Jupyter Notebook - Size: 4.06 MB - Last synced at: 10 months ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0
