An open API service providing repository metadata for many open source software ecosystems.

Topic: "data-pipelines"

apache/airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Language: Python - Size: 415 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 41,829 - Forks: 15,504

pathwaycom/pathway

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

Language: Python - Size: 132 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 33,513 - Forks: 962

dagster-io/dagster

An orchestration platform for the development, production, and observation of data assets.

Language: Python - Size: 1.29 GB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 13,946 - Forks: 1,800

apache/dolphinscheduler

Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code

Language: Java - Size: 210 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 13,790 - Forks: 4,875

Unstructured-IO/unstructured

Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.

Language: HTML - Size: 193 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 12,544 - Forks: 1,030

mage-ai/mage-ai

🧙 Build, run, and manage data pipelines for integrating and transforming data.

Language: Python - Size: 232 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 8,460 - Forks: 862

infinyon/fluvio

🦀 event stream processing for developers to collect and transform data in motion to power responsive data intensive applications.

Language: Rust - Size: 34.1 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 4,981 - Forks: 516

StructuredLabs/preswald

Preswald is a WASM packager for Python-based interactive data apps: bundle full complex data workflows, particularly visualizations, into single files, runnable completely in-browser, using Pyodide, DuckDB, Pandas, and Plotly, Matplotlib, etc. Build dashboards, reports, and notebooks that run offline, load fast, and share like a document.

Language: Python - Size: 97.2 MB - Last synced at: 19 days ago - Pushed at: about 2 months ago - Stars: 4,320 - Forks: 665

orchest/orchest

Build data pipelines, the easy way 🛠️

Language: TypeScript - Size: 27.2 MB - Last synced at: 6 days ago - Pushed at: over 2 years ago - Stars: 4,141 - Forks: 263

Netflix/maestro

Maestro: Netflix’s Workflow Orchestrator

Language: Java - Size: 1.78 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 3,515 - Forks: 231

ucbepic/docetl

A system for agentic LLM-powered data processing and ETL

Language: Python - Size: 61.5 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2,794 - Forks: 303

meltano/meltano

Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.

Language: Python - Size: 141 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 2,187 - Forks: 181

elementary-data/elementary

The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

Language: HTML - Size: 208 MB - Last synced at: about 2 hours ago - Pushed at: about 3 hours ago - Stars: 2,146 - Forks: 195

data-engineering-community/data-engineering-wiki

The best place to learn data engineering. Built and maintained by the data engineering community.

Language: CSS - Size: 7.84 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1,718 - Forks: 201

feldera/feldera

The Feldera Incremental Computation Engine

Language: Rust - Size: 165 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1,585 - Forks: 75

combust/mleap

MLeap: Deploy ML Pipelines to Production

Language: Scala - Size: 3.4 MB - Last synced at: 21 days ago - Pushed at: 9 months ago - Stars: 1,520 - Forks: 313

pyper-dev/pyper

Concurrent Python made simple

Language: Python - Size: 462 KB - Last synced at: 16 days ago - Pushed at: 7 months ago - Stars: 1,462 - Forks: 30

opendatadiscovery/odd-platform

First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business.

Language: Java - Size: 27.9 MB - Last synced at: 4 months ago - Pushed at: 7 months ago - Stars: 1,317 - Forks: 123

fmind/mlops-python-package

Kickstart your MLOps initiative with a flexible, robust, and productive Python package.

Language: Jupyter Notebook - Size: 3.24 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1,273 - Forks: 191

yobix-ai/extractous

Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.

Language: Rust - Size: 2.88 MB - Last synced at: 15 days ago - Pushed at: 9 months ago - Stars: 1,217 - Forks: 56

OpenDCAI/DataFlow

Easy Data Preparation with latest LLMs-based Operators and Pipelines.

Language: Python - Size: 74.4 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1,178 - Forks: 77

amphi-ai/amphi-etl

Visual Data Preparation and Transformation. Low-Code Python-based ETL.

Language: TypeScript - Size: 2.86 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 1,095 - Forks: 72

bruin-data/bruin

Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.

Language: Go - Size: 153 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 976 - Forks: 45

dataform-co/dataform

Dataform is a framework for managing SQL based data operations in BigQuery

Language: TypeScript - Size: 16.6 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 918 - Forks: 183

raystack/optimus

Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.

Language: Go - Size: 12.2 MB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 748 - Forks: 154

artie-labs/transfer

Database replication platform that leverages change data capture. Stream production data from databases to your data warehouse (Snowflake, BigQuery, Redshift, Databricks) in real-time.

Language: Go - Size: 4.45 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 667 - Forks: 38

elementary-data/dbt-data-reliability

dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

Language: Python - Size: 7.75 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 459 - Forks: 110

vmware/versatile-data-kit

One framework to develop, deploy and operate data workflows with Python and SQL.

Language: Python - Size: 110 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 456 - Forks: 59

gabledata/recap

Work with your web service, database, and streaming schemas in a single format.

Language: Python - Size: 1.54 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 344 - Forks: 26

dataflint/spark

Drop-in replacement for Apache Spark UI

Language: TypeScript - Size: 18.6 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 299 - Forks: 39

tuva-health/tuva

Main repo including core data model, data marts, data quality tests, and terminology sets.

Language: HTML - Size: 46.5 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 272 - Forks: 95

dataplane-app/dataplane

Dataplane is an Airflow inspired unified data platform with additional data mesh and RPA capability to automate, schedule and design data pipelines and workflows. Dataplane is written in Golang with a React front end.

Language: JavaScript - Size: 281 MB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 226 - Forks: 33

terrytangyuan/awesome-kubeflow

A curated list of awesome projects and resources related to Kubeflow (a CNCF incubating project)

Size: 199 KB - Last synced at: 9 days ago - Pushed at: about 1 month ago - Stars: 214 - Forks: 18

kevin-hanselman/dud

A lightweight CLI tool for versioning data alongside source code and building data pipelines.

Language: Go - Size: 3.42 MB - Last synced at: 9 days ago - Pushed at: about 1 month ago - Stars: 212 - Forks: 9

datajoint/datajoint-python

Relational data pipelines for the science lab

Language: Python - Size: 20.3 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 183 - Forks: 90

koolreport/core

An Open Source PHP Reporting Framework that helps you to write perfect data reports or to construct awesome dashboards in PHP. Working great with all PHP versions from 5.6 to latest 8.0. Fully compatible with all kinds of MVC frameworks like Laravel, CodeIgniter, Symfony.

Language: PHP - Size: 2.66 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 176 - Forks: 34

realize-engineering/pipebird

Pipebird is open source infrastructure for securely sharing data with customers.

Language: TypeScript - Size: 1.91 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 168 - Forks: 7

GoogleCloudPlatform/public-datasets-pipelines

Cloud-native, data onboarding architecture for Google Cloud Datasets

Language: Python - Size: 6.66 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 162 - Forks: 69

mitdbg/palimpzest

A System for Optimized Semantic Computation

Language: Python - Size: 375 MB - Last synced at: 6 days ago - Pushed at: 11 days ago - Stars: 137 - Forks: 25

smart-data-lake/smart-data-lake

Smart Automation Tool for building modern Data Lakes and Data Pipelines

Language: Scala - Size: 45.4 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 124 - Forks: 24

DidactHQ/didact

The open core .NET job orchestrator that we've been missing

Language: C# - Size: 353 KB - Last synced at: 6 days ago - Pushed at: 20 days ago - Stars: 119 - Forks: 1

linkedin/Hoptimator

Multi-hop declarative data pipelines

Language: Java - Size: 1010 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 118 - Forks: 13

Burla-Cloud/burla

The simplest way to run Python on lot's of computers.

Language: TypeScript - Size: 2.41 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 114 - Forks: 3

patterns-app/patterns-devkit

Data pipelines from re-usable components

Language: Python - Size: 1.75 MB - Last synced at: 5 days ago - Pushed at: over 2 years ago - Stars: 107 - Forks: 5

mycelial/mycelial

Move your data with ease.

Language: Rust - Size: 2.21 MB - Last synced at: 5 months ago - Pushed at: 11 months ago - Stars: 106 - Forks: 9

shravan-kuchkula/udacity-data-eng-proj-1

Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation, validation and loading of data from S3 -> Redshift -> S3

Language: Python - Size: 3.47 MB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 88 - Forks: 58

beneath-hq/beneath 📦

Beneath is a serverless real-time data platform ⚡️

Language: Go - Size: 11 MB - Last synced at: about 4 hours ago - Pushed at: over 3 years ago - Stars: 84 - Forks: 10

conductor-oss/python-sdk

Conductor OSS SDK for Python programming language

Language: Python - Size: 3.45 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 81 - Forks: 36

DataCater/datacater 📦

The developer-friendly ETL platform for transforming data in real-time. Based on Apache Kafka® and Kubernetes®.

Language: JavaScript - Size: 4.08 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 81 - Forks: 3

DidactHQ/didact-engine

The REST API and execution engine for the Didact Platform.

Language: C# - Size: 261 KB - Last synced at: 6 days ago - Pushed at: 5 months ago - Stars: 78 - Forks: 3

minhadona/data_engineer_interview_challenges

Found a data engineering challenge or participated in a selection process ? Share with us!

Language: Python - Size: 7.35 MB - Last synced at: 8 days ago - Pushed at: almost 3 years ago - Stars: 65 - Forks: 12

immu0001/Udacity-Data-Engineer-nanodegree

Classwork projects and home works done through Udacity data engineering nano degree

Language: Jupyter Notebook - Size: 101 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 64 - Forks: 71

iesahin/xvc

A robust (🐢) and fast (🐇) MLOps tool for managing data and pipelines in Rust (🦀)

Language: Rust - Size: 6.7 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 57 - Forks: 1

exospherehost/exospherehost

Mono repo for exosphere.host to simplify infrastructure for AI agents

Language: Python - Size: 29.4 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 56 - Forks: 19

DrDroidLab/kenobi

Easiest way to monitor asynchronous data pipelines

Language: Python - Size: 2.47 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 55 - Forks: 4

CogStack/CogStack-NiFi

Building data processing pipelines for documents processing with NLP using Apache NiFi and related services

Language: Python - Size: 281 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 53 - Forks: 19

siyul-park/uniflow

A high-performance, extremely flexible, and easily extensible universal workflow engine.

Language: Go - Size: 3.15 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 53 - Forks: 5

KentHsu/Udacity-Data-Engineering-Nanodgree

Udacity Data Engineering Nanodegree Program

Language: Jupyter Notebook - Size: 2.12 MB - Last synced at: 5 months ago - Pushed at: over 4 years ago - Stars: 52 - Forks: 59

bakdata/streams-explorer

Explore Apache Kafka data pipelines in Kubernetes.

Language: Python - Size: 3.63 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 46 - Forks: 5

DanilBaibak/ml-in-production

The practical use-cases of how to make your Machine Learning Pipelines robust and reliable using Apache Airflow.

Language: Python - Size: 143 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 43 - Forks: 19

flipkart-incubator/spark-transformers

Spark-Transformers: Library for exporting Apache Spark MLLIB models to use them in any Java application with no other dependencies.

Language: Java - Size: 609 KB - Last synced at: 2 months ago - Pushed at: over 7 years ago - Stars: 42 - Forks: 29

confluentinc/learn-kafka-courses

Learn the basics of Apache Kafka® from leaders in the Kafka community with these video courses covering the Kafka ecosystem and hands-on exercises.

Language: Shell - Size: 41 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 38 - Forks: 89

Galileo-Galilei/kedro-pandera

A kedro plugin to use pandera in your kedro projects

Language: Python - Size: 208 KB - Last synced at: 8 days ago - Pushed at: 11 months ago - Stars: 36 - Forks: 4

tabsdata/tabsdata

A Pub/Sub for Tables based data integration platform, to discover, publish, modify and consume data effortlessly.

Language: Rust - Size: 11.8 MB - Last synced at: 24 days ago - Pushed at: about 1 month ago - Stars: 35 - Forks: 0

mdh266/AirflowDataPipeline

Example of an ETL Pipeline using Airflow

Language: Python - Size: 14.6 KB - Last synced at: about 1 month ago - Pushed at: about 8 years ago - Stars: 35 - Forks: 20

Tanguy9862/Space-App

A Dash application visualizing humanity's journey into space with data from over 7,000 launches and key milestones, from Sputnik to Mars rovers. Built on scalable data pipelines and deployed on GCP, the app offers real-time updates and interactive insights into space exploration history.

Language: Python - Size: 802 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 29 - Forks: 7

montara-io/dbt-command-center

Never sift through endless dbt™ logs again. dbt Command Center is a free, open-source, local web application that provides a user-friendly interface to monitor and manage dbt runs.

Language: TypeScript - Size: 3.55 MB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 28 - Forks: 0

tuva-health/FHIR_inferno

Connector that loads FHIR r4 USCDIv3 JSON data from local file storage into the Tuva common data model in Snowflake.

Language: Python - Size: 201 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 27 - Forks: 9

giacbrd/SmartPipeline

A framework for rapid development of robust data pipelines following a simple design pattern

Language: Python - Size: 393 KB - Last synced at: 21 days ago - Pushed at: over 1 year ago - Stars: 27 - Forks: 3

arakat-community/arakat 📦

ARAKAT - Big Data Analysis and Business Intelligence Application Development Platform

Language: Python - Size: 31.6 MB - Last synced at: 4 months ago - Pushed at: about 4 years ago - Stars: 27 - Forks: 21

electronick1/stepist 📦

Framework for data processing

Language: Python - Size: 865 KB - Last synced at: 13 days ago - Pushed at: almost 6 years ago - Stars: 27 - Forks: 5

tuva-health/demo

A starter dbt project and synthetic claims dataset for trying out the Tuva Project.

Size: 1.87 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 26 - Forks: 36

kestra-io/examples

Best practices for data workflows, integrations with the Modern Data Stack (MDS), Infrastructure as Code (IaC), Cloud Provider Services

Language: HCL - Size: 3.28 MB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 25 - Forks: 9

DidactHQ/didact-ui

The web dashboard for the Didact Platform.

Language: C# - Size: 664 KB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 22 - Forks: 1

pachyderm/neon-workshop

A Pachyderm deep learning tutorial for conference workshops

Language: Python - Size: 56.6 KB - Last synced at: 24 days ago - Pushed at: about 8 years ago - Stars: 19 - Forks: 6

RiveryIO/rivery_cli

Rivery CLI

Language: Python - Size: 626 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 18 - Forks: 2

tsdat/tsdat

Framework for standardizing, transforming, and applying quality checks to time series data.

Language: Jupyter Notebook - Size: 147 MB - Last synced at: 15 days ago - Pushed at: about 1 month ago - Stars: 17 - Forks: 8

larribas/dagger

Define sophisticated data pipelines with Python and run them on different distributed systems (such as Argo Workflows).

Language: Python - Size: 9.97 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 17 - Forks: 7

adilkhash/apache-airflow-intro

Language: Python - Size: 9.77 KB - Last synced at: 21 days ago - Pushed at: over 2 years ago - Stars: 17 - Forks: 4

ipeluffo/airflow-on-kubernetes

Source code for guide to run Apache Airflow on Kubernetes

Language: Python - Size: 7.81 KB - Last synced at: 7 days ago - Pushed at: over 5 years ago - Stars: 17 - Forks: 13

jrlasak/databricks_apparel_streaming

Databricks DLT Apparel Pipeline Project: Learn medallion architecture, streaming, and data engineering with Delta Live Tables. Includes synthetic data, step-by-step guide, and certification prep.

Language: Python - Size: 1.22 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 16 - Forks: 9

marcio-azevedo/fsharp-data-processing-pipeline

Provides an extensible solution for creating Data Processing Pipelines in F#.

Language: F# - Size: 352 KB - Last synced at: 20 days ago - Pushed at: over 7 years ago - Stars: 15 - Forks: 1

apicrafter/datacrafter

NoSQL extract, transform, load (ETL) toolkit with Python

Language: Python - Size: 470 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 13 - Forks: 3

tuva-health/medicare_cclf_connector

This connector is a dbt project that maps Medicare CCLF claims data to the Tuva Input Layer.

Size: 1.02 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 13 - Forks: 16

anna-geller/kestra-ci-cd

CI/CD repository template to automate deployments of your production flows

Language: HCL - Size: 104 KB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 12 - Forks: 5

ketgo/marshmallow-pyspark

Marshmallow serializer integration with pyspark

Language: Python - Size: 63.5 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 4

dushyantkhosla/airflow4ds

Using Apache Airflow to author, run and monitor complex data pipelines.

Language: Jupyter Notebook - Size: 22.5 MB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 12 - Forks: 2

brunocampos01/data-engineering

Language: Python - Size: 165 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 11 - Forks: 2

alireza-heidarii/Real-Time-Data-Cleaning-Pipeline-for-Medical-and-Healthcare-Data

A real-time data cleaning pipeline for medical and healthcare data using Apache Spark, SparkNLP, Spark Streaming, and Kafka.

Language: Python - Size: 11.7 KB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 11 - Forks: 0

kiwicom/terraform-provider-montecarlo

This open-source Terraform provider enables users to seamlessly integrate the Monte Carlo data reliabillity platform into their infrastructure as a code (IaC) workflows.

Language: Go - Size: 284 KB - Last synced at: 5 days ago - Pushed at: 23 days ago - Stars: 10 - Forks: 7

bitroot/coflux

Open-source workflow engine. Orchestrate and observe computational workflows defined in plain Python. Suitable for data pipelines, background tasks, etc.

Language: TypeScript - Size: 5.79 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 10 - Forks: 1

tuva-health/medicare_lds_connector

Maps Medicare LDS claims data to the Tuva Input Layer so you can easily run the Tuva Project.

Size: 688 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 10 - Forks: 5

MattTriano/analytics_data_where_house

An analytics engineering sandbox focusing on real estates prices in Cook County, IL

Language: Python - Size: 17.3 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 9 - Forks: 0

TextQLLabs/dbt-documentor

✍️ dbt doc generator for advanced data teams

Language: F# - Size: 187 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 1

ImperialCollegeLondon/prefect-managedfiletransfer

Point and click upload and download files between local/SFTP/Cloud with rclone support. Built on/for Prefect.IO. Managed file transfer appliance - Copy/Move/Overwrite/Unzip/Checksumming etc.

Language: Python - Size: 47.4 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 8 - Forks: 0

mackelab/epiphyte

Python toolkit for working with high-dimensional neural data recorded during naturalistic, continuous stimuli @a-darcher @rachrapp

Language: Jupyter Notebook - Size: 190 MB - Last synced at: 3 days ago - Pushed at: 12 months ago - Stars: 8 - Forks: 1

AnthonyByansi/Airflow-Data-Pipeline-Automation

Automate your data pipelines using Apache Airflow with this ready-to-use DAG for data integration, ETL and workflow automation.

Size: 60 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 8 - Forks: 0

AnthonyByansi/Rust-Exploratorium

🚀 Master Rust programming with this comprehensive roadmap! Explore fundamental and advanced concepts, code examples, and resources.

Language: Rust - Size: 38.1 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 8 - Forks: 0

goto/optimus Fork of raystack/optimus

Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.

Language: Go - Size: 16.2 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 7 - Forks: 4

metaheed/kolle

Business model representation automation

Language: Shell - Size: 154 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 7 - Forks: 1