An open API service providing repository metadata for many open source software ecosystems.

GitHub / ManikHossain08 / Realtime-ETL-DataPipeline-Using-Avro_Schema_Registry-Spark-Kafka-HDFS-Hive-Scala

Bigdata processing (Realtime ETL DataPipeline) using Avro Schema Registry, Spark, Kafka, HDFS, Hive, Scala, docker, spark-streaming

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ManikHossain08%2FRealtime-ETL-DataPipeline-Using-Avro_Schema_Registry-Spark-Kafka-HDFS-Hive-Scala
PURL: pkg:github/ManikHossain08/Realtime-ETL-DataPipeline-Using-Avro_Schema_Registry-Spark-Kafka-HDFS-Hive-Scala

Stars: 2
Forks: 0
Open issues: 0

License: None
Language: Scala
Size: 1.36 MB
Dependencies parsed at: Pending

Created at: over 3 years ago
Updated at: over 2 years ago
Pushed at: over 3 years ago
Last synced at: about 2 years ago

Topics: avro, avro-schema-registry, big-data, docker-image, etl-pipeline, hdfs, hive, kafka, kafka-broker, kafka-consumer, kafka-container, kafka-producer, kafka-streams, parquet, scala, spark, spark-sql, spark-streaming

    Loading...