An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: airflow

airflow-laminar/airflow-config

A Configuration System for Airflow

Language: Python - Size: 2.27 MB - Last synced at: 22 minutes ago - Pushed at: about 2 hours ago - Stars: 13 - Forks: 3

MauriceKuenicke/Confluence-RAG

Building a RAG System connected to Confluence using Airflow, FastAPI and Qdrant

Language: Python - Size: 43 KB - Last synced at: about 6 hours ago - Pushed at: about 8 hours ago - Stars: 1 - Forks: 0

falklast4/mlops-proj-demo

🚀 Build a production-grade ML inference service with a robust MLOps pipeline for seamless deployment and management.

Size: 1.43 MB - Last synced at: about 11 hours ago - Pushed at: about 13 hours ago - Stars: 0 - Forks: 0

dilambestall/ORDER_TO_DELIVERY_ANALYSTICS_PIPELINE

Data Engineering pipeline project using Olist Kaggle dataset (Order-to-Delivery Analytics)

Size: 269 KB - Last synced at: about 16 hours ago - Pushed at: about 16 hours ago - Stars: 0 - Forks: 0

SalimM21/Plateforme-de-Scoring-Automatiser-avec-MLOps-Complet

La plateforme de scoring automatisé avec MLOps vise à fournir un système intelligent, scalable et traçable pour : Évaluer le risque de crédit des prospects, Détecter anomalies et fraudes dans les transactions, Anticiper les besoins logistiques (prévisions d’approvisionnement), Automatiser les rapports de conformité (KYC, AML).

Language: Python - Size: 7.85 MB - Last synced at: about 20 hours ago - Pushed at: about 21 hours ago - Stars: 1 - Forks: 0

Dayan8554/House-Price-Model

🏠 Predict house prices using machine learning with data cleaning, feature engineering, and regression modeling for accurate results.

Language: Jupyter Notebook - Size: 2.18 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

tdbsilva/aws-data-pipeline

Pipeline de dados completo em AWS (S3, Glue, Athena, Redshift e Spark)

Language: Python - Size: 25.4 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

YousefMostafa24/daily-revenue-pipeline

Airflow DAG to compute, process, and visualize daily sales revenue from a PostgreSQL database using Python, Pandas, and Matplotlib

Language: Python - Size: 315 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

devopscorner/terraform-infra

Production Grade Terraform for Provisioning Infrastructure

Language: HCL - Size: 37.1 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 25 - Forks: 4

mwasifanwar/YOLO-Anywhere

Detect ANY object by name (no training) using YOLO-World - state-of-the-art zero-shot detection.

Language: Python - Size: 20.5 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

rexleimo/agno-Go

Agno-Go: A High-Performance Multi-Agent System Framework Based on Golang. Inheriting the Agno design philosophy, it leverages Golang's concurrency model and per

Language: Go - Size: 1.28 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 31 - Forks: 5

kestra-io/plugin-airflow

Kestra plugin integrating with Apache Airflow

Language: Java - Size: 241 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 4

100472261/Weather-Data-Pipeline

🔎 Enter this repository to discover more.

Language: Python - Size: 154 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

mrmonkeyman68/-E-Commerce_Sales_Analysis

This project dives deep into the sales, delivery, and customer feedback data of major grocery delivery platforms – Blinkit, Swiggy Instamart, and JioMart. It is designed to showcase my ability to clean, analyze, and visualize data using Microsoft Excel.

Size: 11.7 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 2 - Forks: 0

PoSungKim/development_study

open source based development related contents

Language: Java - Size: 6.41 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 4 - Forks: 0

airflow-laminar/airflow-ha

High Availability (HA) DAG Utility

Language: Python - Size: 2.25 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 9 - Forks: 1

PavelGrigoryevDS/wwi-data-pipeline-dashboard

🌐 End-to-end data pipeline and interactive dashboard for Wide World Importers. Features ETL process, star schema data warehouse, and business performance analytics.

Language: Jupyter Notebook - Size: 29.8 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1 - Forks: 0

astronomer/dag-factory

Construct Apache Airflow DAGs Declaratively via YAML configuration files

Language: Python - Size: 11.4 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,362 - Forks: 213

haider123768/dbt-core

🔄 Transform your data easily with dbt, allowing analysts to create tables and views in data warehouses using simple select statements.

Language: Python - Size: 22.3 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

nguyennam05/AI.TUTOR

AI Tutor is a chatbot-based web app that answers syllabus-specific queries using Google Gemini API. It integrates Google Drive for eBook storage, MongoDB for chat history, and Clerk for user authentication, ensuring accurate, secure, and curriculum-aligned responses to students.

Language: JavaScript - Size: 7.06 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2 - Forks: 1

Hippaho/Sparkify

A music streaming company, Sparkify, has decided that it is time to introduce more automation and monitoring to their data warehouse ETL pipelines and come to the conclusion that the best tool to achieve this is Apache Airflow.

Language: Python - Size: 17.6 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

Xxmadkillerx10/data-engineering-zoomcamp

The Data Engineering Zoomcamp covers essential skills in containerization, workflow orchestration, data warehousing, analytics engineering, batch, and streaming processing. It includes tools like Docker, Terraform, BigQuery, dbt, Spark, Kafka, Kestra, Postgres, Google Data Studio, and Metabase.

Size: 1.95 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

shuldeshoff/doc-mlops-pipeline

MLOps platform for intelligent document processing and validation. Includes OCR, data pipelines, model training, MLflow tracking, Airflow orchestration, and model serving via Seldon Core. Designed for scalable document recognition and classification in enterprise environments

Language: Python - Size: 117 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

beraldinho10/Dolphin

📄 Parse document images effectively with Dolphin, using heterogeneous anchor prompting to enhance accuracy and streamline processing.

Language: HTML - Size: 11.1 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

MobileTeleSystems/data-rentgen

NextGen DataMotion Lineage

Language: Python - Size: 16.1 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 11 - Forks: 0

minyeamer/linkmerce

E-commerce API integration management

Language: Python - Size: 559 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3 - Forks: 0

rdeak/neo4j-schema-enforcment-in-etl

About Basic example how Graphql API can enforce schema in neo4j in ETL process executed with Airflow

Language: Python - Size: 60.5 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

elyra-ai/elyra

Elyra extends JupyterLab with an AI centric approach.

Language: Python - Size: 115 MB - Last synced at: 2 days ago - Pushed at: 17 days ago - Stars: 1,967 - Forks: 360

Lowkey144/ai-data-engineering-ecosystem-guide

📊 Explore the AI, Machine Learning, Data Science, and Data Engineering landscape with categorized tools, libraries, and workflows for effective implementation.

Size: 1.29 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

coursementor/ifood-data-governance-pipeline

🐙 iFood Data Governance Pipeline oferece governança de dados corporativa para o domínio de delivery, com rastreabilidade, qualidade automatizada e conformidade LGPD.

Language: Python - Size: 111 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

ahmedd38/dataengineer-portfolio

📊 End-to-end ETL pipelines, Airflow DAGs, notebook-driven analytics & data warehousing

Size: 7.81 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

argoproj/argo-workflows

Workflow Engine for Kubernetes

Language: Go - Size: 154 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 16,138 - Forks: 3,397

kaueX3/HW_10_prod

🛡️ Build a production-ready ML system for fraud detection with auto-scaling, monitoring, and orchestration using Kubernetes on Yandex Cloud.

Language: Python - Size: 1.49 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

Silkyalisa/ai-cli

🚀 Enhance your coding workflow with AI-CLI, a Zellij layout for efficient AI programming using Claude and advanced monitoring tools.

Language: Shell - Size: 1.94 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

apache/airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Language: Python - Size: 468 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 42,968 - Forks: 15,853

aBandicootCalledSmashes/airflow-logs-cleanup

Clean up old Airflow log files with a script or Airflow DAG. Frees disk space by deleting rotated logs, removing old files, and cleaning up empty directories.

Language: Python - Size: 4.88 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

mame-1213/Car-Rental-System-Django

This is a Car Rental Management System built using Django. The system allows customers to book cars online, manage bookings, and view car availability. Admins can manage car listings, booking statuses, and customer information.

Language: Python - Size: 2.04 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

apache/dolphinscheduler

Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code

Language: Java - Size: 211 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 13,904 - Forks: 4,923

airflow-laminar/airflow-pydantic

Pydantic models for Apache Airflow

Language: Python - Size: 3.36 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 8 - Forks: 1

apache/airflow-site

Apache Airflow Website

Language: HTML - Size: 792 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 155 - Forks: 398

mozilla/telemetry-airflow

Airflow configuration for Telemetry

Language: Python - Size: 3.35 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 196 - Forks: 97

devopscorner/iac-terraform-emr

AWS Summit 2022 ASEAN --- COM203 Using IaC with Terraform to provision Big Data Platform on Amazon EMR

Language: HCL - Size: 25.3 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 8 - Forks: 1

status-im/airflow-dags

Status BI python DAGs for Airflow

Language: Python - Size: 310 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 2

astronomer/airflow-provider-fivetran-async

A new Airflow Provider for Fivetran, maintained by Astronomer and Fivetran

Language: Python - Size: 235 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 22 - Forks: 10

MatthewLawrencel/Ecommerce-Lakehouse

E-commerce Data Lakehouse Pipeline Automated ETL pipeline for e-commerce data using Airflow. Data flows from Bronze → Silver → Gold layers, transforming raw CSVs into clean, partitioned, and aggregated datasets for analytics.

Language: Python - Size: 196 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

campolloram/my-notes

My personal notes on different topics I'm studying

Size: 43 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 6 - Forks: 0

airflow-laminar/airflow-priority

Priority Tags for Airflow Dags

Language: Python - Size: 2.26 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 7 - Forks: 1

astronomer/templates

Language: Python - Size: 194 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 15 - Forks: 5

MTES-MCT/sparte

Mon Diagnostic Artificialisation aide les collectivités à analyser et maitriser la consommation d'espaces et l'artificialisation des sols de leur territoire

Language: Python - Size: 98.7 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 8 - Forks: 2

Ombraloose/Airbnb-Data-Pipeline

Language: Jupyter Notebook - Size: 1.03 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

GoogleCloudPlatform/public-datasets-pipelines

Cloud-native, data onboarding architecture for Google Cloud Datasets

Language: Python - Size: 6.68 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 165 - Forks: 69

MobileTeleSystems/evacuator

Catch exception and exit with specific exit code

Language: Python - Size: 209 KB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

astronomer/airflow-chart

A Helm chart to install Apache Airflow on Kubernetes

Language: Python - Size: 4.9 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 290 - Forks: 94

fireure/Build-AI-Now

🤖 Build your AI skills through hands-on projects and a structured roadmap designed for beginners to experienced developers.

Size: 1.29 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

choval/devairflow

Simple local development airflow image

Size: 241 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0

alvgonfri/consumption-prediction-spark

Automated system for predicting energy consumption in households, using a pipeline orchestrated with Apache Airflow and Apache Spark.

Language: Jupyter Notebook - Size: 1.94 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

Phantom9888/RetailETL-Store-Data-Pipeline

📊 Streamline retail store data processing and enhance reporting with this efficient ETL pipeline.

Size: 1.29 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

rifqyirfanto21/module3-capstone_project

Purwadhika Data Engineer Path: Module 3 Capstone Project

Language: Python - Size: 28.3 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

Junwu0615/Airflow-End-To-End-Dev

Airflow End-To-End 開發流程

Size: 89.8 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0

MrXujiang/react-flow

react-flow中文文档.

Language: TypeScript - Size: 524 KB - Last synced at: 5 days ago - Pushed at: 11 months ago - Stars: 61 - Forks: 8

EbEmad/Chat-with-Database-pipeline

Language: Jupyter Notebook - Size: 1.49 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

michaelconan/personal-reporting-pipelines

Application infrastructure, configuration, and workflow definitions for personal dlt pipelines and dbt models

Language: Python - Size: 6.99 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

GustavoNav/exchange_extractor

Aplicação para criar um ambiente e executar pipelines de dados.

Language: Python - Size: 1.41 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

qdeli187/apache-airflow-providers-apache-datafusion-ballista

🍃 Run Apache Datafusion Ballista workflows within Airflow

Language: Python - Size: 108 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

Ochecodes/Lamba-AI-Powered-Misinformation-detector-Browser-Extention

Realtime sentiment analysis tool that lives in your browser and helps you make better choices on the contents you choose to consume on the web. Help newsrooms and media practitioners score Misinformation and mental health implication of their publication on the enduser.

Language: Python - Size: 24.4 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

andreax79/airflow-code-editor

A plugin for Apache Airflow that allows you to edit DAGs in browser

Language: Vue - Size: 15.5 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 450 - Forks: 55

salimt/Transfermarkt-ETL-and-LIVE-Scores

asyncIO, Github Actions, GCP, dbt, Terraform, Docker

Language: Python - Size: 121 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 6 - Forks: 0

Dpbm/qcop

An AI model to predict the output of a quantum cirucit

Language: Python - Size: 830 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

shivam1423/Real-Time-Grid-Monitoring-System

Real-Time Grid Monitoring System is an end-to-end data pipeline and analytics platform that enables live monitoring, analysis, and visualization of electrical grid performance.

Language: Python - Size: 1.66 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

fideldalmasso/data_engineering_stack_demo

A modular and extensible demo stack for Data Engineering workflows, using open-source tools.

Language: Jupyter Notebook - Size: 1.7 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 1 - Forks: 0

meta-pytorch/torchx

TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.

Language: Python - Size: 36.6 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 395 - Forks: 146

jass228/streamlytics

ETL pipeline project designed to analyze and explore the Netflix catalogue using data from the TMDB API.

Language: Python - Size: 32.8 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

vinitg96/elt-data-lakehouse

Data Lakehouse moderno com MinIO, DuckDB, dbt, metabase e airflow

Language: Python - Size: 12.4 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1 - Forks: 0

apache/airflow-client-python

Apache Airflow - OpenApi Client for Python

Language: Python - Size: 1.86 MB - Last synced at: 4 days ago - Pushed at: 9 days ago - Stars: 434 - Forks: 62

holland-reece/holland-tunnel

Multilayer data warehouse for NPI physician registry data (BigQuery, Airflow, dbt, BI)

Language: Python - Size: 25.4 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

santosr2/airflow-community-chart Fork of airflow-helm/charts

The User-Community Airflow Helm Chart is the standard way to deploy Apache Airflow on Kubernetes with Helm. Originally created in 2017, it has since helped thousands of companies create production-ready deployments of Airflow on Kubernetes.

Language: Shell - Size: 1000 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

mohidmakhdoomi/DataPipe

Data pipeline --> transactional DB, CDC, streaming, real time analytics ||| cloud infrastructure, data lake, distributed processing, transformations, data warehouse ||| orchestration, containerization

Language: Shell - Size: 5.11 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1 - Forks: 0

buun-ch/buun-stack

A remotely accessible Kubernetes home lab with OIDC authentication. Build a modern development environment with integrated data analytics and AI capabilities. Includes an open data stack for data ingestion, transformation, serving, and orchestration.

Language: Just - Size: 682 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1 - Forks: 0

astronomer/astronomer-cosmos

Run your dbt Core or dbt Fusion projects as Apache Airflow DAGs and Task Groups with a few lines of code

Language: Python - Size: 19.3 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1,065 - Forks: 241

gestaogovbr/FastETL

Airflow plugins for implementing data pipelines. | Plugins do Airflow para implementação de pipelines de dados.

Language: Python - Size: 5 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 48 - Forks: 9

goto/optimus Fork of raystack/optimus

Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.

Language: Go - Size: 19 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 8 - Forks: 4

tomasfarias/airflow-dbt-python

A collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow.

Language: Python - Size: 9.72 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 208 - Forks: 40

itinycheng/flink-platform-backend

Define and schedule workflow, support Flink Jar/SQL, ClickHouse/Hive/Mysql SQL, Shell, etc.

Language: Java - Size: 11.7 MB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 17 - Forks: 6

Muhammad-Hamza-Khan-03/NexusDrive

Real-Time Delivery ETA Prediction and Delay Risk Analytics

Language: Jupyter Notebook - Size: 9.43 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

AtlasOfLivingAustralia/pipelines-airflow

About Airflow DAGs and supporting files for running pipelines on Apache Airflow with Elastic Map Reduce.

Language: Python - Size: 349 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 1

stellar/stellar-etl-airflow

Airflow DAGs for the Stellar ETL project

Language: Python - Size: 3.68 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 38 - Forks: 19

ZhaoJackson/Survey_Panel_Analytics

Interactive survey panel analytics dashboard with automated quality control pipeline. Built with R Shiny for demographic analysis, response density modeling, and real-time quality monitoring across 100+ survey projects.

Language: HTML - Size: 15.9 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

GregoryKogan/crypto-trading-data-pipeline

Real-time crypto trading data pipeline using Apache Spark, Kafka, and Airflow. Containerized microservices architecture for streaming analytics.

Language: Python - Size: 21.5 KB - Last synced at: 5 days ago - Pushed at: 12 days ago - Stars: 2 - Forks: 0

bigfoot-5/JobPostings_Analysis

This repository contains the code to implement a Data Pipeline which scrapes data from LinkedIn and uses Llama3 to extract the key skills from the job description of each job

Language: Python - Size: 3.25 MB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

tier940/docker-repo

Language: Dockerfile - Size: 410 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 1 - Forks: 1

alexandre-cameron-borges/datastream

Ce projet dans le cadre du Sorbonne Data Analytics 2025-2026 démontre la mise en place d'un pipeline de bout en bout : ingestion des données avec Kafka, orchestration des tâches avec Airflow, stockage et interrogation sur Google Cloud (GCS, BigQuery), et application d'un modèle de Machine Learning (KMeans) pour la segmentation clients.

Language: Python - Size: 88.9 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

mikeroyal/Apache-Airflow-Guide

Apache Airflow Guide

Language: Python - Size: 279 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 30 - Forks: 14

calbergs/spotify-api

Pipeline that extracts data from the Spotify API to build a more detailed version of Spotify Wrapped

Language: Python - Size: 3.11 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 46 - Forks: 1

airflow-laminar/airflow-balancer

Utilities for tracking hosts and ports and load balancing DAGs

Language: Python - Size: 1.68 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 4 - Forks: 0

jghoman/awesome-apache-airflow

Curated list of resources about Apache Airflow

Language: Shell - Size: 550 KB - Last synced at: 13 days ago - Pushed at: about 1 year ago - Stars: 3,847 - Forks: 498

subhamay-bhattacharyya/apache-airflow-template

📄🎯 GitHub Repository Template for Apache Airflow

Size: 2.93 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

duxstein/agent-framework

Build-Your-Own AI Agent Framework — An enterprise-grade, modular orchestration engine for defining, executing, and monitoring intelligent agentic workflows. Designed to power complex multi-agent systems with memory, guardrails, and observability — using Apache Kafka, Airflow, and Intel® OpenVINO™ for optimized performance.

Language: Python - Size: 239 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

grilled-swampert/crypto-data-pipeline

Automated ETL pipeline built for extracting, transforming, and loading cryptocurrency market data.

Language: Python - Size: 6.1 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

masabai/sql_real_estate_etl_dbt_dag

Real Estate Data SQL ETL Pipeline using Dbt core on Airflow Postgres Database

Language: Python - Size: 6.45 MB - Last synced at: 13 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

MarkPhamm/consumer_complaint_pipeline

An automated data pipeline for extracting, loading, and analyzing consumer complaint data from the Consumer Financial Protection Bureau (CFPB) database. This production-ready solution enables financial institutions and analysts to monitor complaint trends, identify issues, and drive data-driven decisions.

Language: Python - Size: 24.2 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0