pyspark-maestro

This repo contains implementations of PySpark for real-world use cases for batch data processing, streaming data processing sourced from Kafka, sockets, etc., spark optimizations, business specific bigdata processing scenario solutions, and machine learning use cases.

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/DebanjanSarkar%2Fpyspark-maestro
PURL: pkg:github/DebanjanSarkar/pyspark-maestro

Stars: 2
Forks: 1
Open issues: 0

License: None
Language: Jupyter Notebook
Size: 66.1 MB
Dependencies parsed at: Pending

Created at: about 1 year ago
Updated at: 9 months ago
Pushed at: about 1 year ago
Last synced at: 4 months ago

Commit Stats

Commits: 6
Authors: 2
Mean commits per author: 3.0
Development Distribution Score: 0.333
More commit stats: https://commits.ecosyste.ms/hosts/GitHub/repositories/DebanjanSarkar/pyspark-maestro

Topics: json, kafka, kafka-python, kafka-streams, pyspark, pyspark-api, pyspark-machine-learning, pyspark-mllib, pyspark-streaming, python3, spark, spark-mllib, spark-sql, spark-streaming

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos

GitHub / DebanjanSarkar / pyspark-maestro

Commit Stats