An open API service providing repository metadata for many open source software ecosystems.

GitHub / rupeshtiwari / kafka-spark-streaming-avro-in-python

Streaming kafka events using Spark in avro format and saving the events in parquet format

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/rupeshtiwari%2Fkafka-spark-streaming-avro-in-python
PURL: pkg:github/rupeshtiwari/kafka-spark-streaming-avro-in-python

Stars: 5
Forks: 1
Open issues: 0

License: None
Language: Python
Size: 38.1 KB
Dependencies parsed at: Pending

Created at: over 3 years ago
Updated at: about 1 year ago
Pushed at: over 3 years ago
Last synced at: 13 days ago

Topics: avro, aws, confluent-kafka, java, kafka, msk, parquet, parquet-files, pyspark, python, python3, real-time-streaming, scala, spark

Funding Links https://github.com/sponsors/rupeshtiwari

    Loading...