GitHub / BhawnaMehbubani / Kafka-Spark-Redshift-Streaming-Data-Ingestion-Project
This project is a near-real-time data pipeline that ingests, processes, and stores telecom call records. It combines Apache Kafka, Apache Spark Streaming, and AWS Redshift to handle large volumes of streaming data. The pipeline is containerized with Docker Compose, which keeps the components modular and simplifies deployment and scaling.
Stars: 0
Forks: 0
Open issues: 0
License: None
Language: Python
Size: 952 KB
Created at: 4 months ago
Updated at: 4 months ago
Pushed at: 4 months ago
Last synced at: about 2 months ago
Topics: apache-kafka, apache-spark, aws-redshift, docker, spark-streaming
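The ingest, process, and store flow described above can be illustrated with a minimal sketch in plain Python. This is not the repository's code: it uses only the standard library, does not connect to Kafka, Spark, or Redshift, and the field names (`caller`, `callee`, `start_time`, `end_time`) are hypothetical, since the listing does not give the call-record schema. It only shows the shape of the per-message work such a pipeline typically does: decode a message payload, drop malformed records, derive a call duration, and group valid records into micro-batches ready for a bulk load.

```python
import json
from dataclasses import dataclass
from datetime import datetime
from typing import Iterable, Iterator, List, Optional


@dataclass(frozen=True)
class CallRecord:
    # Hypothetical schema: the real project's fields are not documented here.
    caller: str
    callee: str
    start: datetime
    duration_s: int


def parse_record(raw: bytes) -> Optional[CallRecord]:
    """Decode one message payload; return None if it is malformed."""
    try:
        doc = json.loads(raw)
        start = datetime.fromisoformat(doc["start_time"])
        end = datetime.fromisoformat(doc["end_time"])
        duration = int((end - start).total_seconds())
        if duration < 0:
            return None  # end before start: treat as invalid
        return CallRecord(str(doc["caller"]), str(doc["callee"]), start, duration)
    except (ValueError, KeyError, TypeError):
        # Covers invalid JSON, missing fields, and wrongly typed values.
        return None


def micro_batches(messages: Iterable[bytes], size: int) -> Iterator[List[CallRecord]]:
    """Group valid records into lists of at most `size` items."""
    batch: List[CallRecord] = []
    for raw in messages:
        record = parse_record(raw)
        if record is None:
            continue  # skip malformed messages instead of failing the stream
        batch.append(record)
        if len(batch) == size:
            yield batch
            batch = []
    if batch:
        yield batch  # flush the final partial batch
```

In a real deployment the `messages` iterable would be replaced by a Kafka consumer or a Spark streaming source, and each yielded batch would be written to the warehouse in bulk rather than row by row.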