An open API service providing repository metadata for many open source software ecosystems.

GitHub / hoangsonww / End-to-End-Data-Pipeline

📈 A scalable, production-ready data pipeline for real-time streaming & batch processing, integrating Kafka, Spark, Airflow, AWS, Kubernetes, and MLflow. Supports end-to-end data ingestion, transformation, storage, monitoring, and AI/ML serving with CI/CD automation using Terraform & GitHub Actions.

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/hoangsonww%2FEnd-to-End-Data-Pipeline
PURL: pkg:github/hoangsonww/End-to-End-Data-Pipeline

Stars: 41
Forks: 24
Open issues: 0

License: mit
Language: Python
Size: 31.1 MB
Dependencies parsed at: Pending

Created at: 5 months ago
Updated at: about 1 month ago
Pushed at: about 1 month ago
Last synced at: about 1 month ago

Topics: airflow, apache, docker, elasticsearch, flink, grafana, great-expectations, hadoop, influxdb, kafka, kubernetes, looker, minio, mlflow, postgresql, prometheus, python, spark, sql, terraform

    Loading...