Topic: "data-pipelines"
apache/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Language: Python - Size: 415 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 41,829 - Forks: 15,504

pathwaycom/pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Language: Python - Size: 132 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 33,513 - Forks: 962

dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.
Language: Python - Size: 1.29 GB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 13,946 - Forks: 1,800

apache/dolphinscheduler
Apache DolphinScheduler is the modern data orchestration platform. Agile creation of high-performance workflows with low code.
Language: Java - Size: 210 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 13,790 - Forks: 4,875

Unstructured-IO/unstructured
Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise-grade Platform product for production-grade workflows, partitioning, enrichments, chunking and embedding.
Language: HTML - Size: 193 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 12,544 - Forks: 1,030

mage-ai/mage-ai
🧙 Build, run, and manage data pipelines for integrating and transforming data.
Language: Python - Size: 232 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 8,460 - Forks: 862

infinyon/fluvio
🦀 event stream processing for developers to collect and transform data in motion to power responsive data intensive applications.
Language: Rust - Size: 34.1 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 4,981 - Forks: 516

StructuredLabs/preswald
Preswald is a WASM packager for Python-based interactive data apps: bundle full complex data workflows, particularly visualizations, into single files, runnable completely in-browser, using Pyodide, DuckDB, Pandas, Plotly, Matplotlib, etc. Build dashboards, reports, and notebooks that run offline, load fast, and share like a document.
Language: Python - Size: 97.2 MB - Last synced at: 19 days ago - Pushed at: about 2 months ago - Stars: 4,320 - Forks: 665

orchest/orchest
Build data pipelines, the easy way 🛠️
Language: TypeScript - Size: 27.2 MB - Last synced at: 6 days ago - Pushed at: over 2 years ago - Stars: 4,141 - Forks: 263

Netflix/maestro
Maestro: Netflix’s Workflow Orchestrator
Language: Java - Size: 1.78 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 3,515 - Forks: 231

ucbepic/docetl
A system for agentic LLM-powered data processing and ETL
Language: Python - Size: 61.5 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2,794 - Forks: 303

meltano/meltano
Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.
Language: Python - Size: 141 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 2,187 - Forks: 181

elementary-data/elementary
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Language: HTML - Size: 208 MB - Last synced at: about 2 hours ago - Pushed at: about 3 hours ago - Stars: 2,146 - Forks: 195

data-engineering-community/data-engineering-wiki
The best place to learn data engineering. Built and maintained by the data engineering community.
Language: CSS - Size: 7.84 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1,718 - Forks: 201

feldera/feldera
The Feldera Incremental Computation Engine
Language: Rust - Size: 165 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1,585 - Forks: 75

combust/mleap
MLeap: Deploy ML Pipelines to Production
Language: Scala - Size: 3.4 MB - Last synced at: 21 days ago - Pushed at: 9 months ago - Stars: 1,520 - Forks: 313

pyper-dev/pyper
Concurrent Python made simple
Language: Python - Size: 462 KB - Last synced at: 16 days ago - Pushed at: 7 months ago - Stars: 1,462 - Forks: 30

opendatadiscovery/odd-platform
The first open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.
Language: Java - Size: 27.9 MB - Last synced at: 4 months ago - Pushed at: 7 months ago - Stars: 1,317 - Forks: 123

fmind/mlops-python-package
Kickstart your MLOps initiative with a flexible, robust, and productive Python package.
Language: Jupyter Notebook - Size: 3.24 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1,273 - Forks: 191

yobix-ai/extractous
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
Language: Rust - Size: 2.88 MB - Last synced at: 15 days ago - Pushed at: 9 months ago - Stars: 1,217 - Forks: 56

OpenDCAI/DataFlow
Easy Data Preparation with latest LLMs-based Operators and Pipelines.
Language: Python - Size: 74.4 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1,178 - Forks: 77

amphi-ai/amphi-etl
Visual Data Preparation and Transformation. Low-Code Python-based ETL.
Language: TypeScript - Size: 2.86 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 1,095 - Forks: 72

bruin-data/bruin
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
Language: Go - Size: 153 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 976 - Forks: 45

dataform-co/dataform
Dataform is a framework for managing SQL based data operations in BigQuery
Language: TypeScript - Size: 16.6 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 918 - Forks: 183

raystack/optimus
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
Language: Go - Size: 12.2 MB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 748 - Forks: 154

artie-labs/transfer
Database replication platform that leverages change data capture. Stream production data from databases to your data warehouse (Snowflake, BigQuery, Redshift, Databricks) in real-time.
Language: Go - Size: 4.45 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 667 - Forks: 38

elementary-data/dbt-data-reliability
dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Language: Python - Size: 7.75 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 459 - Forks: 110

vmware/versatile-data-kit
One framework to develop, deploy and operate data workflows with Python and SQL.
Language: Python - Size: 110 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 456 - Forks: 59

gabledata/recap
Work with your web service, database, and streaming schemas in a single format.
Language: Python - Size: 1.54 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 344 - Forks: 26

dataflint/spark
Drop-in replacement for Apache Spark UI
Language: TypeScript - Size: 18.6 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 299 - Forks: 39

tuva-health/tuva
Main repo including core data model, data marts, data quality tests, and terminology sets.
Language: HTML - Size: 46.5 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 272 - Forks: 95

dataplane-app/dataplane
Dataplane is an Airflow inspired unified data platform with additional data mesh and RPA capability to automate, schedule and design data pipelines and workflows. Dataplane is written in Golang with a React front end.
Language: JavaScript - Size: 281 MB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 226 - Forks: 33

terrytangyuan/awesome-kubeflow
A curated list of awesome projects and resources related to Kubeflow (a CNCF incubating project)
Size: 199 KB - Last synced at: 9 days ago - Pushed at: about 1 month ago - Stars: 214 - Forks: 18

kevin-hanselman/dud
A lightweight CLI tool for versioning data alongside source code and building data pipelines.
Language: Go - Size: 3.42 MB - Last synced at: 9 days ago - Pushed at: about 1 month ago - Stars: 212 - Forks: 9

datajoint/datajoint-python
Relational data pipelines for the science lab
Language: Python - Size: 20.3 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 183 - Forks: 90

koolreport/core
An open-source PHP reporting framework that helps you write perfect data reports or construct awesome dashboards in PHP. Works with all PHP versions from 5.6 to the latest 8.0. Fully compatible with all kinds of MVC frameworks like Laravel, CodeIgniter, Symfony.
Language: PHP - Size: 2.66 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 176 - Forks: 34

realize-engineering/pipebird
Pipebird is open source infrastructure for securely sharing data with customers.
Language: TypeScript - Size: 1.91 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 168 - Forks: 7

GoogleCloudPlatform/public-datasets-pipelines
Cloud-native, data onboarding architecture for Google Cloud Datasets
Language: Python - Size: 6.66 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 162 - Forks: 69

mitdbg/palimpzest
A System for Optimized Semantic Computation
Language: Python - Size: 375 MB - Last synced at: 6 days ago - Pushed at: 11 days ago - Stars: 137 - Forks: 25

smart-data-lake/smart-data-lake
Smart Automation Tool for building modern Data Lakes and Data Pipelines
Language: Scala - Size: 45.4 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 124 - Forks: 24

DidactHQ/didact
The open core .NET job orchestrator that we've been missing
Language: C# - Size: 353 KB - Last synced at: 6 days ago - Pushed at: 20 days ago - Stars: 119 - Forks: 1

linkedin/Hoptimator
Multi-hop declarative data pipelines
Language: Java - Size: 1010 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 118 - Forks: 13

Burla-Cloud/burla
The simplest way to run Python on lots of computers.
Language: TypeScript - Size: 2.41 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 114 - Forks: 3

patterns-app/patterns-devkit
Data pipelines from re-usable components
Language: Python - Size: 1.75 MB - Last synced at: 5 days ago - Pushed at: over 2 years ago - Stars: 107 - Forks: 5

mycelial/mycelial
Move your data with ease.
Language: Rust - Size: 2.21 MB - Last synced at: 5 months ago - Pushed at: 11 months ago - Stars: 106 - Forks: 9

shravan-kuchkula/udacity-data-eng-proj-1
Developed a data pipeline to automate data warehouse ETL by building custom Airflow operators that handle the extraction, transformation, validation and loading of data from S3 -> Redshift -> S3
Language: Python - Size: 3.47 MB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 88 - Forks: 58

beneath-hq/beneath 📦
Beneath is a serverless real-time data platform ⚡️
Language: Go - Size: 11 MB - Last synced at: about 4 hours ago - Pushed at: over 3 years ago - Stars: 84 - Forks: 10

conductor-oss/python-sdk
Conductor OSS SDK for Python programming language
Language: Python - Size: 3.45 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 81 - Forks: 36

DataCater/datacater 📦
The developer-friendly ETL platform for transforming data in real-time. Based on Apache Kafka® and Kubernetes®.
Language: JavaScript - Size: 4.08 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 81 - Forks: 3

DidactHQ/didact-engine
The REST API and execution engine for the Didact Platform.
Language: C# - Size: 261 KB - Last synced at: 6 days ago - Pushed at: 5 months ago - Stars: 78 - Forks: 3

minhadona/data_engineer_interview_challenges
Found a data engineering challenge or participated in a selection process? Share with us!
Language: Python - Size: 7.35 MB - Last synced at: 8 days ago - Pushed at: almost 3 years ago - Stars: 65 - Forks: 12

immu0001/Udacity-Data-Engineer-nanodegree
Classwork projects and homework completed for the Udacity Data Engineering Nanodegree
Language: Jupyter Notebook - Size: 101 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 64 - Forks: 71

iesahin/xvc
A robust (🐢) and fast (🐇) MLOps tool for managing data and pipelines in Rust (🦀)
Language: Rust - Size: 6.7 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 57 - Forks: 1

exospherehost/exospherehost
Mono repo for exosphere.host to simplify infrastructure for AI agents
Language: Python - Size: 29.4 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 56 - Forks: 19

DrDroidLab/kenobi
Easiest way to monitor asynchronous data pipelines
Language: Python - Size: 2.47 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 55 - Forks: 4

CogStack/CogStack-NiFi
Building data processing pipelines for document processing with NLP using Apache NiFi and related services
Language: Python - Size: 281 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 53 - Forks: 19

siyul-park/uniflow
A high-performance, extremely flexible, and easily extensible universal workflow engine.
Language: Go - Size: 3.15 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 53 - Forks: 5

KentHsu/Udacity-Data-Engineering-Nanodgree
Udacity Data Engineering Nanodegree Program
Language: Jupyter Notebook - Size: 2.12 MB - Last synced at: 5 months ago - Pushed at: over 4 years ago - Stars: 52 - Forks: 59

bakdata/streams-explorer
Explore Apache Kafka data pipelines in Kubernetes.
Language: Python - Size: 3.63 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 46 - Forks: 5

DanilBaibak/ml-in-production
The practical use-cases of how to make your Machine Learning Pipelines robust and reliable using Apache Airflow.
Language: Python - Size: 143 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 43 - Forks: 19

flipkart-incubator/spark-transformers
Spark-Transformers: Library for exporting Apache Spark MLLIB models to use them in any Java application with no other dependencies.
Language: Java - Size: 609 KB - Last synced at: 2 months ago - Pushed at: over 7 years ago - Stars: 42 - Forks: 29

confluentinc/learn-kafka-courses
Learn the basics of Apache Kafka® from leaders in the Kafka community with these video courses covering the Kafka ecosystem and hands-on exercises.
Language: Shell - Size: 41 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 38 - Forks: 89

Galileo-Galilei/kedro-pandera
A kedro plugin to use pandera in your kedro projects
Language: Python - Size: 208 KB - Last synced at: 8 days ago - Pushed at: 11 months ago - Stars: 36 - Forks: 4

tabsdata/tabsdata
A Pub/Sub for Tables based data integration platform, to discover, publish, modify and consume data effortlessly.
Language: Rust - Size: 11.8 MB - Last synced at: 24 days ago - Pushed at: about 1 month ago - Stars: 35 - Forks: 0

mdh266/AirflowDataPipeline
Example of an ETL Pipeline using Airflow
Language: Python - Size: 14.6 KB - Last synced at: about 1 month ago - Pushed at: about 8 years ago - Stars: 35 - Forks: 20

Tanguy9862/Space-App
A Dash application visualizing humanity's journey into space with data from over 7,000 launches and key milestones, from Sputnik to Mars rovers. Built on scalable data pipelines and deployed on GCP, the app offers real-time updates and interactive insights into space exploration history.
Language: Python - Size: 802 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 29 - Forks: 7

montara-io/dbt-command-center
Never sift through endless dbt™ logs again. dbt Command Center is a free, open-source, local web application that provides a user-friendly interface to monitor and manage dbt runs.
Language: TypeScript - Size: 3.55 MB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 28 - Forks: 0

tuva-health/FHIR_inferno
Connector that loads FHIR r4 USCDIv3 JSON data from local file storage into the Tuva common data model in Snowflake.
Language: Python - Size: 201 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 27 - Forks: 9

giacbrd/SmartPipeline
A framework for rapid development of robust data pipelines following a simple design pattern
Language: Python - Size: 393 KB - Last synced at: 21 days ago - Pushed at: over 1 year ago - Stars: 27 - Forks: 3

arakat-community/arakat 📦
ARAKAT - Big Data Analysis and Business Intelligence Application Development Platform
Language: Python - Size: 31.6 MB - Last synced at: 4 months ago - Pushed at: about 4 years ago - Stars: 27 - Forks: 21

electronick1/stepist 📦
Framework for data processing
Language: Python - Size: 865 KB - Last synced at: 13 days ago - Pushed at: almost 6 years ago - Stars: 27 - Forks: 5

tuva-health/demo
A starter dbt project and synthetic claims dataset for trying out the Tuva Project.
Size: 1.87 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 26 - Forks: 36

kestra-io/examples
Best practices for data workflows, integrations with the Modern Data Stack (MDS), Infrastructure as Code (IaC), Cloud Provider Services
Language: HCL - Size: 3.28 MB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 25 - Forks: 9

DidactHQ/didact-ui
The web dashboard for the Didact Platform.
Language: C# - Size: 664 KB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 22 - Forks: 1

pachyderm/neon-workshop
A Pachyderm deep learning tutorial for conference workshops
Language: Python - Size: 56.6 KB - Last synced at: 24 days ago - Pushed at: about 8 years ago - Stars: 19 - Forks: 6

RiveryIO/rivery_cli
Rivery CLI
Language: Python - Size: 626 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 18 - Forks: 2

tsdat/tsdat
Framework for standardizing, transforming, and applying quality checks to time series data.
Language: Jupyter Notebook - Size: 147 MB - Last synced at: 15 days ago - Pushed at: about 1 month ago - Stars: 17 - Forks: 8

larribas/dagger
Define sophisticated data pipelines with Python and run them on different distributed systems (such as Argo Workflows).
Language: Python - Size: 9.97 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 17 - Forks: 7

adilkhash/apache-airflow-intro
Language: Python - Size: 9.77 KB - Last synced at: 21 days ago - Pushed at: over 2 years ago - Stars: 17 - Forks: 4

ipeluffo/airflow-on-kubernetes
Source code for guide to run Apache Airflow on Kubernetes
Language: Python - Size: 7.81 KB - Last synced at: 7 days ago - Pushed at: over 5 years ago - Stars: 17 - Forks: 13

jrlasak/databricks_apparel_streaming
Databricks DLT Apparel Pipeline Project: Learn medallion architecture, streaming, and data engineering with Delta Live Tables. Includes synthetic data, step-by-step guide, and certification prep.
Language: Python - Size: 1.22 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 16 - Forks: 9

marcio-azevedo/fsharp-data-processing-pipeline
Provides an extensible solution for creating Data Processing Pipelines in F#.
Language: F# - Size: 352 KB - Last synced at: 20 days ago - Pushed at: over 7 years ago - Stars: 15 - Forks: 1

apicrafter/datacrafter
NoSQL extract, transform, load (ETL) toolkit with Python
Language: Python - Size: 470 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 13 - Forks: 3

tuva-health/medicare_cclf_connector
This connector is a dbt project that maps Medicare CCLF claims data to the Tuva Input Layer.
Size: 1.02 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 13 - Forks: 16

anna-geller/kestra-ci-cd
CI/CD repository template to automate deployments of your production flows
Language: HCL - Size: 104 KB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 12 - Forks: 5

ketgo/marshmallow-pyspark
Marshmallow serializer integration with pyspark
Language: Python - Size: 63.5 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 4

dushyantkhosla/airflow4ds
Using Apache Airflow to author, run and monitor complex data pipelines.
Language: Jupyter Notebook - Size: 22.5 MB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 12 - Forks: 2

brunocampos01/data-engineering
Language: Python - Size: 165 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 11 - Forks: 2

alireza-heidarii/Real-Time-Data-Cleaning-Pipeline-for-Medical-and-Healthcare-Data
A real-time data cleaning pipeline for medical and healthcare data using Apache Spark, SparkNLP, Spark Streaming, and Kafka.
Language: Python - Size: 11.7 KB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 11 - Forks: 0

kiwicom/terraform-provider-montecarlo
This open-source Terraform provider enables users to seamlessly integrate the Monte Carlo data reliability platform into their infrastructure as code (IaC) workflows.
Language: Go - Size: 284 KB - Last synced at: 5 days ago - Pushed at: 23 days ago - Stars: 10 - Forks: 7

bitroot/coflux
Open-source workflow engine. Orchestrate and observe computational workflows defined in plain Python. Suitable for data pipelines, background tasks, etc.
Language: TypeScript - Size: 5.79 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 10 - Forks: 1

tuva-health/medicare_lds_connector
Maps Medicare LDS claims data to the Tuva Input Layer so you can easily run the Tuva Project.
Size: 688 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 10 - Forks: 5

MattTriano/analytics_data_where_house
An analytics engineering sandbox focusing on real estate prices in Cook County, IL
Language: Python - Size: 17.3 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 9 - Forks: 0

TextQLLabs/dbt-documentor
✍️ dbt doc generator for advanced data teams
Language: F# - Size: 187 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 1

ImperialCollegeLondon/prefect-managedfiletransfer
Point and click upload and download files between local/SFTP/Cloud with rclone support. Built on/for Prefect.IO. Managed file transfer appliance - Copy/Move/Overwrite/Unzip/Checksumming etc.
Language: Python - Size: 47.4 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 8 - Forks: 0

mackelab/epiphyte
Python toolkit for working with high-dimensional neural data recorded during naturalistic, continuous stimuli @a-darcher @rachrapp
Language: Jupyter Notebook - Size: 190 MB - Last synced at: 3 days ago - Pushed at: 12 months ago - Stars: 8 - Forks: 1

AnthonyByansi/Airflow-Data-Pipeline-Automation
Automate your data pipelines using Apache Airflow with this ready-to-use DAG for data integration, ETL and workflow automation.
Size: 60 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 8 - Forks: 0

AnthonyByansi/Rust-Exploratorium
🚀 Master Rust programming with this comprehensive roadmap! Explore fundamental and advanced concepts, code examples, and resources.
Language: Rust - Size: 38.1 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 8 - Forks: 0

goto/optimus (fork of raystack/optimus)
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
Language: Go - Size: 16.2 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 7 - Forks: 4

metaheed/kolle
Business model representation automation
Language: Shell - Size: 154 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 7 - Forks: 1
