An open API service providing repository metadata for many open source software ecosystems.

Topic: "data-orchestration"

kestra-io/kestra

:zap: Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 600+ plugins. Alternative to Airflow, n8n, Rundeck, VMware vRA, Zapier ...

Language: Java - Size: 55.9 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 17,577 - Forks: 1,476

Alluxio/alluxio

Alluxio, data orchestration for analytics and machine learning in the cloud

Language: Java - Size: 196 MB - Last synced at: 3 days ago - Pushed at: 9 days ago - Stars: 6,981 - Forks: 2,941

cubefs/cubefs

cloud-native distributed storage

Language: Go - Size: 243 MB - Last synced at: about 13 hours ago - Pushed at: 1 day ago - Stars: 5,051 - Forks: 659

apache/incubator-graphar

An open source, standard data file format for graph data storage and retrieval.

Language: C++ - Size: 6.84 MB - Last synced at: 4 days ago - Pushed at: 11 days ago - Stars: 274 - Forks: 66

iam-mhaseeb/Skytrax-Data-Warehouse 📦

A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for cloud data warehouse and Metabase to serve the needs of data visualizations such as analytical dashboards.

Language: Python - Size: 1.34 MB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 133 - Forks: 28

jonathanneo/data-aware-orchestration

Data-aware orchestration with dagster, dbt, and airbyte

Language: Python - Size: 1.36 MB - Last synced at: 3 days ago - Pushed at: over 2 years ago - Stars: 31 - Forks: 0

kestra-io/examples

Best practices for data workflows, integrations with the Modern Data Stack (MDS), Infrastructure as Code (IaC), Cloud Provider Services

Language: HCL - Size: 3.28 MB - Last synced at: 17 days ago - Pushed at: about 1 month ago - Stars: 25 - Forks: 9

ozkary/data-engineering-mta-turnstile

Data Engineering - Metropolitan Transportation Authority (MTA) Subway Data Analysis

Language: Jupyter Notebook - Size: 12.9 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 25 - Forks: 4

SAP-samples/btp-data-to-value-workshop

This repo contains a dataset, exercises, and sample code for an end-to-end SAP BTP data-to-value bootcamp covering SAP HANA Cloud, SAP Data Warehouse Cloud, SAP Data Intelligence Cloud, and SAP Analytics Cloud.

Language: Jupyter Notebook - Size: 167 MB - Last synced at: 25 days ago - Pushed at: about 2 months ago - Stars: 24 - Forks: 24

astronomer/airflow-provider-fivetran-async

A new Airflow Provider for Fivetran, maintained by Astronomer and Fivetran

Language: Python - Size: 196 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 22 - Forks: 9

anna-geller/kestra-ci-cd

CI/CD repository template to automate deployments of your production flows

Language: HCL - Size: 104 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 12 - Forks: 5

Alluxio/k8s-operator

An operator for managing Alluxio system on Kubernetes cluster

Language: Go - Size: 270 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 9

dagster-io/dagster-quickstart 📦

Get started with Dagster ASAP

Language: Python - Size: 20.5 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 11 - Forks: 122

anna-geller/kestra-terraform-examples

Bring Infrastructure as Code best practices to your data workflows with Kestra and Terraform

Language: HCL - Size: 746 KB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 0

kestra-io/data-engineering-zoomcamp

Code for the Data Engineering Zoomcamp course

Size: 470 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 3 - Forks: 1

longNguyen010203/Finance-Data-Ingestion-Pipeline-with-Kafka

Develop a real-time data ingestion pipeline using Kafka and Spark. Collect minute-level stock data from Yahoo Finance, ingest it into Kafka, and process it with Spark Streaming, storing the results in Cassandra. Orchestrated the workflow using Airflow deployed on Docker.

Language: Python - Size: 250 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 3 - Forks: 1

taquynhnga2001/proptech-dagster

Build an ELT pipeline with dagster and dbt to schedule loading HDB resale transactions in Singapore into Google BigQuery data warehouse, then create Power BI dashboard to enhance insight exploration.

Language: Python - Size: 3.42 MB - Last synced at: 24 days ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

zpencerguy/superdoppler

Data orchestration repo with Docker deployment

Language: Python - Size: 38.1 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

jasontanx/prefect-learning

Prefect - Data orchestration tool practice & learning

Language: Python - Size: 314 KB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

stemitom/postgres-pipeline

A simple pipeline infrastructure with ETL pipeline contained in a Docker environment on Apache Airflow for orchestration and Postgres for data warehousing

Language: Python - Size: 217 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 3

Annielytix/azure-data-factory-data-vault

Working with SCD Type (Change Data Capture) and need a Data Vault model to test Azure Data Factory v2? - This Code with Help!

Size: 2.75 MB - Last synced at: about 1 year ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

GADES-DATAENG/webinar

Code, scripts, and resources for the Data Engineering Fundamentals Course Webinar, covering Python, data pipelines, Apache Airflow, and more.

Language: Python - Size: 26.9 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Wireforce-LLC/m3

☕ Data Orchestrator. Without abstractions

Language: TypeScript - Size: 139 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

kingabzpro/5-Airflow-Alternatives-for-Data-Orchestration-Tutorial

Code examples of Luigi, Prefect, Kedro, Dagster, and MageAI

Language: Python - Size: 40 KB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

philiporlando/dagster_university

I created this repo to follow along with the examples in the Dagster University Essentials course.

Language: Python - Size: 90.8 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

ddeutils/data-orchestra

❌ Full-Stack Data Orchestration config by Yaml template with Flask & HTMX

Language: Python - Size: 3.81 MB - Last synced at: about 20 hours ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

MostafaNabilll/end2end_pipeline

End to End data engineering project

Language: Python - Size: 3.87 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

jacquessham/airflow_notes

Repository to store scripts and notes on Airflow

Language: Python - Size: 186 KB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0