An open API service providing repository metadata for many open source software ecosystems.

GitHub / faizpuad / DataEngineeringProject-DocumentStreamingWithData

The core objective of this project is to build an end-to-end data streaming pipeline that processes this dataset in real-time. By leveraging modern data engineering tools and techniques, we aim to connect, buffer, process, store, and visualize streaming data. This allows for better understanding of data flows, handling of large-scale real-time data

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/faizpuad%2FDataEngineeringProject-DocumentStreamingWithData

Stars: 1
Forks: 1
Open issues: 0

License: None
Language: Jupyter Notebook
Size: 1.8 MB
Dependencies parsed at: Pending

Created at: 6 months ago
Updated at: 5 months ago
Pushed at: 6 months ago
Last synced at: 17 days ago

Topics: data-engineering, document-streaming, fastapi, kafka, mon, spark-streaming-kafka, streamlit

    Loading...