Kafka-Spark-Redshift-Streaming-Data-Ingestion-Project

This project is a real-time data pipeline designed for ingesting, processing, and storing telecom call records. It integrates Apache Kafka, Apache Spark Streaming, and AWS Redshift to handle large volumes of streaming data in near real-time. The pipeline is containerized with Docker Compose, enabling easy deployment, scalability, and modularity.

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/BhawnaMehbubani%2FKafka-Spark-Redshift-Streaming-Data-Ingestion-Project

Stars: 0
Forks: 0
Open issues: 0

License: None
Language: Python
Size: 952 KB
Dependencies parsed at: Pending

Created at: 4 months ago
Updated at: 4 months ago
Pushed at: 4 months ago
Last synced at: about 2 months ago

Topics: apache-kafka, apache-spark, aws-redshift, docker, spark-streaming

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos

GitHub / BhawnaMehbubani / Kafka-Spark-Redshift-Streaming-Data-Ingestion-Project