GitHub / faizpuad / DataEngineeringProject-DocumentStreamingWithData
The core objective of this project is to build an end-to-end data streaming pipeline that processes this dataset in real-time. By leveraging modern data engineering tools and techniques, we aim to connect, buffer, process, store, and visualize streaming data. This allows for better understanding of data flows, handling of large-scale real-time data
Stars: 1
Forks: 1
Open issues: 0
License: None
Language: Jupyter Notebook
Size: 1.8 MB
Dependencies parsed at: Pending
Created at: 6 months ago
Updated at: 5 months ago
Pushed at: 6 months ago
Last synced at: 17 days ago
Topics: data-engineering, document-streaming, fastapi, kafka, mon, spark-streaming-kafka, streamlit