GitHub topics: airflow
airflow-laminar/airflow-config
A Configuration System for Airflow
Language: Python - Size: 2.27 MB - Last synced at: 22 minutes ago - Pushed at: about 2 hours ago - Stars: 13 - Forks: 3
MauriceKuenicke/Confluence-RAG
Building a RAG System connected to Confluence using Airflow, FastAPI and Qdrant
Language: Python - Size: 43 KB - Last synced at: about 6 hours ago - Pushed at: about 8 hours ago - Stars: 1 - Forks: 0
falklast4/mlops-proj-demo
🚀 Build a production-grade ML inference service with a robust MLOps pipeline for seamless deployment and management.
Size: 1.43 MB - Last synced at: about 11 hours ago - Pushed at: about 13 hours ago - Stars: 0 - Forks: 0
dilambestall/ORDER_TO_DELIVERY_ANALYSTICS_PIPELINE
Data Engineering pipeline project using Olist Kaggle dataset (Order-to-Delivery Analytics)
Size: 269 KB - Last synced at: about 16 hours ago - Pushed at: about 16 hours ago - Stars: 0 - Forks: 0
SalimM21/Plateforme-de-Scoring-Automatiser-avec-MLOps-Complet
La plateforme de scoring automatisé avec MLOps vise à fournir un système intelligent, scalable et traçable pour : Évaluer le risque de crédit des prospects, Détecter anomalies et fraudes dans les transactions, Anticiper les besoins logistiques (prévisions d’approvisionnement), Automatiser les rapports de conformité (KYC, AML).
Language: Python - Size: 7.85 MB - Last synced at: about 20 hours ago - Pushed at: about 21 hours ago - Stars: 1 - Forks: 0
Dayan8554/House-Price-Model
🏠 Predict house prices using machine learning with data cleaning, feature engineering, and regression modeling for accurate results.
Language: Jupyter Notebook - Size: 2.18 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0
tdbsilva/aws-data-pipeline
Pipeline de dados completo em AWS (S3, Glue, Athena, Redshift e Spark)
Language: Python - Size: 25.4 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0
YousefMostafa24/daily-revenue-pipeline
Airflow DAG to compute, process, and visualize daily sales revenue from a PostgreSQL database using Python, Pandas, and Matplotlib
Language: Python - Size: 315 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0
devopscorner/terraform-infra
Production Grade Terraform for Provisioning Infrastructure
Language: HCL - Size: 37.1 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 25 - Forks: 4
mwasifanwar/YOLO-Anywhere
Detect ANY object by name (no training) using YOLO-World - state-of-the-art zero-shot detection.
Language: Python - Size: 20.5 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0
rexleimo/agno-Go
Agno-Go: A High-Performance Multi-Agent System Framework Based on Golang. Inheriting the Agno design philosophy, it leverages Golang's concurrency model and per
Language: Go - Size: 1.28 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 31 - Forks: 5
kestra-io/plugin-airflow
Kestra plugin integrating with Apache Airflow
Language: Java - Size: 241 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 4
100472261/Weather-Data-Pipeline
🔎 Enter this repository to discover more.
Language: Python - Size: 154 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0
mrmonkeyman68/-E-Commerce_Sales_Analysis
This project dives deep into the sales, delivery, and customer feedback data of major grocery delivery platforms – Blinkit, Swiggy Instamart, and JioMart. It is designed to showcase my ability to clean, analyze, and visualize data using Microsoft Excel.
Size: 11.7 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 2 - Forks: 0
PoSungKim/development_study
open source based development related contents
Language: Java - Size: 6.41 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 4 - Forks: 0
airflow-laminar/airflow-ha
High Availability (HA) DAG Utility
Language: Python - Size: 2.25 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 9 - Forks: 1
PavelGrigoryevDS/wwi-data-pipeline-dashboard
🌐 End-to-end data pipeline and interactive dashboard for Wide World Importers. Features ETL process, star schema data warehouse, and business performance analytics.
Language: Jupyter Notebook - Size: 29.8 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1 - Forks: 0
astronomer/dag-factory
Construct Apache Airflow DAGs Declaratively via YAML configuration files
Language: Python - Size: 11.4 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,362 - Forks: 213
haider123768/dbt-core
🔄 Transform your data easily with dbt, allowing analysts to create tables and views in data warehouses using simple select statements.
Language: Python - Size: 22.3 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0
nguyennam05/AI.TUTOR
AI Tutor is a chatbot-based web app that answers syllabus-specific queries using Google Gemini API. It integrates Google Drive for eBook storage, MongoDB for chat history, and Clerk for user authentication, ensuring accurate, secure, and curriculum-aligned responses to students.
Language: JavaScript - Size: 7.06 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2 - Forks: 1
Hippaho/Sparkify
A music streaming company, Sparkify, has decided that it is time to introduce more automation and monitoring to their data warehouse ETL pipelines and come to the conclusion that the best tool to achieve this is Apache Airflow.
Language: Python - Size: 17.6 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0
Xxmadkillerx10/data-engineering-zoomcamp
The Data Engineering Zoomcamp covers essential skills in containerization, workflow orchestration, data warehousing, analytics engineering, batch, and streaming processing. It includes tools like Docker, Terraform, BigQuery, dbt, Spark, Kafka, Kestra, Postgres, Google Data Studio, and Metabase.
Size: 1.95 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0
shuldeshoff/doc-mlops-pipeline
MLOps platform for intelligent document processing and validation. Includes OCR, data pipelines, model training, MLflow tracking, Airflow orchestration, and model serving via Seldon Core. Designed for scalable document recognition and classification in enterprise environments
Language: Python - Size: 117 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0
beraldinho10/Dolphin
📄 Parse document images effectively with Dolphin, using heterogeneous anchor prompting to enhance accuracy and streamline processing.
Language: HTML - Size: 11.1 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0
MobileTeleSystems/data-rentgen
NextGen DataMotion Lineage
Language: Python - Size: 16.1 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 11 - Forks: 0
minyeamer/linkmerce
E-commerce API integration management
Language: Python - Size: 559 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3 - Forks: 0
rdeak/neo4j-schema-enforcment-in-etl
About Basic example how Graphql API can enforce schema in neo4j in ETL process executed with Airflow
Language: Python - Size: 60.5 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0
elyra-ai/elyra
Elyra extends JupyterLab with an AI centric approach.
Language: Python - Size: 115 MB - Last synced at: 2 days ago - Pushed at: 17 days ago - Stars: 1,967 - Forks: 360
Lowkey144/ai-data-engineering-ecosystem-guide
📊 Explore the AI, Machine Learning, Data Science, and Data Engineering landscape with categorized tools, libraries, and workflows for effective implementation.
Size: 1.29 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0
coursementor/ifood-data-governance-pipeline
🐙 iFood Data Governance Pipeline oferece governança de dados corporativa para o domínio de delivery, com rastreabilidade, qualidade automatizada e conformidade LGPD.
Language: Python - Size: 111 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0
ahmedd38/dataengineer-portfolio
📊 End-to-end ETL pipelines, Airflow DAGs, notebook-driven analytics & data warehousing
Size: 7.81 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0
argoproj/argo-workflows
Workflow Engine for Kubernetes
Language: Go - Size: 154 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 16,138 - Forks: 3,397
kaueX3/HW_10_prod
🛡️ Build a production-ready ML system for fraud detection with auto-scaling, monitoring, and orchestration using Kubernetes on Yandex Cloud.
Language: Python - Size: 1.49 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0
Silkyalisa/ai-cli
🚀 Enhance your coding workflow with AI-CLI, a Zellij layout for efficient AI programming using Claude and advanced monitoring tools.
Language: Shell - Size: 1.94 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0
apache/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Language: Python - Size: 468 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 42,968 - Forks: 15,853
aBandicootCalledSmashes/airflow-logs-cleanup
Clean up old Airflow log files with a script or Airflow DAG. Frees disk space by deleting rotated logs, removing old files, and cleaning up empty directories.
Language: Python - Size: 4.88 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0
mame-1213/Car-Rental-System-Django
This is a Car Rental Management System built using Django. The system allows customers to book cars online, manage bookings, and view car availability. Admins can manage car listings, booking statuses, and customer information.
Language: Python - Size: 2.04 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0
apache/dolphinscheduler
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
Language: Java - Size: 211 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 13,904 - Forks: 4,923
airflow-laminar/airflow-pydantic
Pydantic models for Apache Airflow
Language: Python - Size: 3.36 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 8 - Forks: 1
apache/airflow-site
Apache Airflow Website
Language: HTML - Size: 792 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 155 - Forks: 398
mozilla/telemetry-airflow
Airflow configuration for Telemetry
Language: Python - Size: 3.35 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 196 - Forks: 97
devopscorner/iac-terraform-emr
AWS Summit 2022 ASEAN --- COM203 Using IaC with Terraform to provision Big Data Platform on Amazon EMR
Language: HCL - Size: 25.3 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 8 - Forks: 1
status-im/airflow-dags
Status BI python DAGs for Airflow
Language: Python - Size: 310 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 2
astronomer/airflow-provider-fivetran-async
A new Airflow Provider for Fivetran, maintained by Astronomer and Fivetran
Language: Python - Size: 235 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 22 - Forks: 10
MatthewLawrencel/Ecommerce-Lakehouse
E-commerce Data Lakehouse Pipeline Automated ETL pipeline for e-commerce data using Airflow. Data flows from Bronze → Silver → Gold layers, transforming raw CSVs into clean, partitioned, and aggregated datasets for analytics.
Language: Python - Size: 196 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0
campolloram/my-notes
My personal notes on different topics I'm studying
Size: 43 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 6 - Forks: 0
airflow-laminar/airflow-priority
Priority Tags for Airflow Dags
Language: Python - Size: 2.26 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 7 - Forks: 1
astronomer/templates
Language: Python - Size: 194 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 15 - Forks: 5
MTES-MCT/sparte
Mon Diagnostic Artificialisation aide les collectivités à analyser et maitriser la consommation d'espaces et l'artificialisation des sols de leur territoire
Language: Python - Size: 98.7 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 8 - Forks: 2
Ombraloose/Airbnb-Data-Pipeline
Language: Jupyter Notebook - Size: 1.03 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0
GoogleCloudPlatform/public-datasets-pipelines
Cloud-native, data onboarding architecture for Google Cloud Datasets
Language: Python - Size: 6.68 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 165 - Forks: 69
MobileTeleSystems/evacuator
Catch exception and exit with specific exit code
Language: Python - Size: 209 KB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0
astronomer/airflow-chart
A Helm chart to install Apache Airflow on Kubernetes
Language: Python - Size: 4.9 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 290 - Forks: 94
fireure/Build-AI-Now
🤖 Build your AI skills through hands-on projects and a structured roadmap designed for beginners to experienced developers.
Size: 1.29 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0
choval/devairflow
Simple local development airflow image
Size: 241 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0
alvgonfri/consumption-prediction-spark
Automated system for predicting energy consumption in households, using a pipeline orchestrated with Apache Airflow and Apache Spark.
Language: Jupyter Notebook - Size: 1.94 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0
Phantom9888/RetailETL-Store-Data-Pipeline
📊 Streamline retail store data processing and enhance reporting with this efficient ETL pipeline.
Size: 1.29 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0
rifqyirfanto21/module3-capstone_project
Purwadhika Data Engineer Path: Module 3 Capstone Project
Language: Python - Size: 28.3 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0
Junwu0615/Airflow-End-To-End-Dev
Airflow End-To-End 開發流程
Size: 89.8 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0
MrXujiang/react-flow
react-flow中文文档.
Language: TypeScript - Size: 524 KB - Last synced at: 5 days ago - Pushed at: 11 months ago - Stars: 61 - Forks: 8
EbEmad/Chat-with-Database-pipeline
Language: Jupyter Notebook - Size: 1.49 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0
michaelconan/personal-reporting-pipelines
Application infrastructure, configuration, and workflow definitions for personal dlt pipelines and dbt models
Language: Python - Size: 6.99 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0
GustavoNav/exchange_extractor
Aplicação para criar um ambiente e executar pipelines de dados.
Language: Python - Size: 1.41 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0
qdeli187/apache-airflow-providers-apache-datafusion-ballista
🍃 Run Apache Datafusion Ballista workflows within Airflow
Language: Python - Size: 108 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0
Ochecodes/Lamba-AI-Powered-Misinformation-detector-Browser-Extention
Realtime sentiment analysis tool that lives in your browser and helps you make better choices on the contents you choose to consume on the web. Help newsrooms and media practitioners score Misinformation and mental health implication of their publication on the enduser.
Language: Python - Size: 24.4 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0
andreax79/airflow-code-editor
A plugin for Apache Airflow that allows you to edit DAGs in browser
Language: Vue - Size: 15.5 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 450 - Forks: 55
salimt/Transfermarkt-ETL-and-LIVE-Scores
asyncIO, Github Actions, GCP, dbt, Terraform, Docker
Language: Python - Size: 121 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 6 - Forks: 0
Dpbm/qcop
An AI model to predict the output of a quantum cirucit
Language: Python - Size: 830 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0
shivam1423/Real-Time-Grid-Monitoring-System
Real-Time Grid Monitoring System is an end-to-end data pipeline and analytics platform that enables live monitoring, analysis, and visualization of electrical grid performance.
Language: Python - Size: 1.66 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0
fideldalmasso/data_engineering_stack_demo
A modular and extensible demo stack for Data Engineering workflows, using open-source tools.
Language: Jupyter Notebook - Size: 1.7 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 1 - Forks: 0
meta-pytorch/torchx
TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.
Language: Python - Size: 36.6 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 395 - Forks: 146
jass228/streamlytics
ETL pipeline project designed to analyze and explore the Netflix catalogue using data from the TMDB API.
Language: Python - Size: 32.8 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0
vinitg96/elt-data-lakehouse
Data Lakehouse moderno com MinIO, DuckDB, dbt, metabase e airflow
Language: Python - Size: 12.4 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1 - Forks: 0
apache/airflow-client-python
Apache Airflow - OpenApi Client for Python
Language: Python - Size: 1.86 MB - Last synced at: 4 days ago - Pushed at: 9 days ago - Stars: 434 - Forks: 62
holland-reece/holland-tunnel
Multilayer data warehouse for NPI physician registry data (BigQuery, Airflow, dbt, BI)
Language: Python - Size: 25.4 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0
santosr2/airflow-community-chart Fork of airflow-helm/charts
The User-Community Airflow Helm Chart is the standard way to deploy Apache Airflow on Kubernetes with Helm. Originally created in 2017, it has since helped thousands of companies create production-ready deployments of Airflow on Kubernetes.
Language: Shell - Size: 1000 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0
mohidmakhdoomi/DataPipe
Data pipeline --> transactional DB, CDC, streaming, real time analytics ||| cloud infrastructure, data lake, distributed processing, transformations, data warehouse ||| orchestration, containerization
Language: Shell - Size: 5.11 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1 - Forks: 0
buun-ch/buun-stack
A remotely accessible Kubernetes home lab with OIDC authentication. Build a modern development environment with integrated data analytics and AI capabilities. Includes an open data stack for data ingestion, transformation, serving, and orchestration.
Language: Just - Size: 682 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1 - Forks: 0
astronomer/astronomer-cosmos
Run your dbt Core or dbt Fusion projects as Apache Airflow DAGs and Task Groups with a few lines of code
Language: Python - Size: 19.3 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1,065 - Forks: 241
gestaogovbr/FastETL
Airflow plugins for implementing data pipelines. | Plugins do Airflow para implementação de pipelines de dados.
Language: Python - Size: 5 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 48 - Forks: 9
goto/optimus Fork of raystack/optimus
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
Language: Go - Size: 19 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 8 - Forks: 4
tomasfarias/airflow-dbt-python
A collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow.
Language: Python - Size: 9.72 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 208 - Forks: 40
itinycheng/flink-platform-backend
Define and schedule workflow, support Flink Jar/SQL, ClickHouse/Hive/Mysql SQL, Shell, etc.
Language: Java - Size: 11.7 MB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 17 - Forks: 6
Muhammad-Hamza-Khan-03/NexusDrive
Real-Time Delivery ETA Prediction and Delay Risk Analytics
Language: Jupyter Notebook - Size: 9.43 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0
AtlasOfLivingAustralia/pipelines-airflow
About Airflow DAGs and supporting files for running pipelines on Apache Airflow with Elastic Map Reduce.
Language: Python - Size: 349 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 1
stellar/stellar-etl-airflow
Airflow DAGs for the Stellar ETL project
Language: Python - Size: 3.68 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 38 - Forks: 19
ZhaoJackson/Survey_Panel_Analytics
Interactive survey panel analytics dashboard with automated quality control pipeline. Built with R Shiny for demographic analysis, response density modeling, and real-time quality monitoring across 100+ survey projects.
Language: HTML - Size: 15.9 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0
GregoryKogan/crypto-trading-data-pipeline
Real-time crypto trading data pipeline using Apache Spark, Kafka, and Airflow. Containerized microservices architecture for streaming analytics.
Language: Python - Size: 21.5 KB - Last synced at: 5 days ago - Pushed at: 12 days ago - Stars: 2 - Forks: 0
bigfoot-5/JobPostings_Analysis
This repository contains the code to implement a Data Pipeline which scrapes data from LinkedIn and uses Llama3 to extract the key skills from the job description of each job
Language: Python - Size: 3.25 MB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0
tier940/docker-repo
Language: Dockerfile - Size: 410 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 1 - Forks: 1
alexandre-cameron-borges/datastream
Ce projet dans le cadre du Sorbonne Data Analytics 2025-2026 démontre la mise en place d'un pipeline de bout en bout : ingestion des données avec Kafka, orchestration des tâches avec Airflow, stockage et interrogation sur Google Cloud (GCS, BigQuery), et application d'un modèle de Machine Learning (KMeans) pour la segmentation clients.
Language: Python - Size: 88.9 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0
mikeroyal/Apache-Airflow-Guide
Apache Airflow Guide
Language: Python - Size: 279 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 30 - Forks: 14
calbergs/spotify-api
Pipeline that extracts data from the Spotify API to build a more detailed version of Spotify Wrapped
Language: Python - Size: 3.11 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 46 - Forks: 1
airflow-laminar/airflow-balancer
Utilities for tracking hosts and ports and load balancing DAGs
Language: Python - Size: 1.68 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 4 - Forks: 0
jghoman/awesome-apache-airflow
Curated list of resources about Apache Airflow
Language: Shell - Size: 550 KB - Last synced at: 13 days ago - Pushed at: about 1 year ago - Stars: 3,847 - Forks: 498
subhamay-bhattacharyya/apache-airflow-template
📄🎯 GitHub Repository Template for Apache Airflow
Size: 2.93 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0
duxstein/agent-framework
Build-Your-Own AI Agent Framework — An enterprise-grade, modular orchestration engine for defining, executing, and monitoring intelligent agentic workflows. Designed to power complex multi-agent systems with memory, guardrails, and observability — using Apache Kafka, Airflow, and Intel® OpenVINO™ for optimized performance.
Language: Python - Size: 239 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0
grilled-swampert/crypto-data-pipeline
Automated ETL pipeline built for extracting, transforming, and loading cryptocurrency market data.
Language: Python - Size: 6.1 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0
masabai/sql_real_estate_etl_dbt_dag
Real Estate Data SQL ETL Pipeline using Dbt core on Airflow Postgres Database
Language: Python - Size: 6.45 MB - Last synced at: 13 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0
MarkPhamm/consumer_complaint_pipeline
An automated data pipeline for extracting, loading, and analyzing consumer complaint data from the Consumer Financial Protection Bureau (CFPB) database. This production-ready solution enables financial institutions and analysts to monitor complaint trends, identify issues, and drive data-driven decisions.
Language: Python - Size: 24.2 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0