Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub topics: airflow-dags
adrianmarino/thesis-paper
Collaborative and hybrid recommendation systems
Language: Jupyter Notebook - Size: 353 MB - Last synced: 2 days ago - Pushed: 2 days ago - Stars: 0 - Forks: 1
tulibraries/funcake_dags
Airflow DAGs for PA Digital aggregation processes
Language: Python - Size: 1.75 MB - Last synced: 2 days ago - Pushed: 3 days ago - Stars: 4 - Forks: 2
tulibraries/manifold_airflow_dags
Airflow DAGs for the Manifold (TUL Website) application
Language: Python - Size: 1.54 MB - Last synced: 2 days ago - Pushed: 3 days ago - Stars: 2 - Forks: 0
xennen/DataEngineerYP
Data Engineer projects
Language: Python - Size: 513 KB - Last synced: 5 days ago - Pushed: 6 days ago - Stars: 0 - Forks: 0
tulibraries/cob_datapipeline
Airflow Data Processing Pipeline for TUL Catalog on Blacklight Data
Language: Python - Size: 2.33 MB - Last synced: 7 days ago - Pushed: 7 days ago - Stars: 6 - Forks: 0
matsudan/airflow-dag-examples
Apache Airflow DAG examples
Language: Python - Size: 96.7 KB - Last synced: 8 days ago - Pushed: 8 days ago - Stars: 1 - Forks: 1
rohithlanka/weatherdatapipeline
A project leveraging Python scripts to fetch weather data from an API, transformed it using DBT in Snowflake, and orchestrated the workflow with Apache Airflow for seamless data integration into reporting tool, ensuring streamlined data-driven insights.(reporting tool- work in progress)
Language: Python - Size: 11.7 KB - Last synced: 13 days ago - Pushed: 13 days ago - Stars: 0 - Forks: 0
Azazel0203/ocr_captcha
This project creates a basic web service for solving image-based CAPTCHAs. Using the Flask framework, it allows users to upload CAPTCHA images and employs an Optical Character Recognition (OCR) pipeline to extract the embedded text.
Language: Python - Size: 41.5 MB - Last synced: 12 days ago - Pushed: about 2 months ago - Stars: 6 - Forks: 0
AnthonyByansi/Airflow-Data-Pipeline-Automation
Automate your data pipelines using Apache Airflow with this ready-to-use DAG for data integration, ETL and workflow automation.
Language: Python - Size: 15.6 KB - Last synced: 13 days ago - Pushed: 13 days ago - Stars: 7 - Forks: 0
data-burst/airflow-git-sync
Sync DAG changes from Git to Airflow
Size: 118 KB - Last synced: 12 days ago - Pushed: 13 days ago - Stars: 33 - Forks: 6
Chinaskidev/ETL-Clima-ElSalvador
MLOps, haciendo un ETL sencillo usando Docker y Airflow y Google Cloud
Language: Python - Size: 50.8 KB - Last synced: 14 days ago - Pushed: 14 days ago - Stars: 0 - Forks: 0
anilkulkarni87/airflow-docker
This is my Apache Airflow Local development setup on Windows 10 WSL2/Mac using docker-compose. It will also include some sample DAGs and workflows.
Language: Python - Size: 106 KB - Last synced: 13 days ago - Pushed: 3 months ago - Stars: 22 - Forks: 10
essraahmed/Data-Pipeline-with-Airflow
Data Pipeline with Apache Airflow
Language: Python - Size: 444 KB - Last synced: 15 days ago - Pushed: over 1 year ago - Stars: 1 - Forks: 1
DunnBC22/Data_Engineering_Projects
This repository includes data engineering projects using Apache Airflow. I hope to add more projects using different technologies soon!
Language: Python - Size: 15.1 MB - Last synced: 16 days ago - Pushed: 17 days ago - Stars: 4 - Forks: 0
arjunan-k/Twitter_Pipeline
Tweets extractor using Twitter API by Tweepy library. Created the workflow in Apache Airflow with the help of AWS EC2 & S3 bucket.
Language: Python - Size: 865 KB - Last synced: 19 days ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
nick-roberson/comp-blocks
Computation as Composable Blocks, can be extended to run on remote pipeline frameworks if necessary.
Language: Python - Size: 2.96 MB - Last synced: 19 days ago - Pushed: 19 days ago - Stars: 1 - Forks: 0
akhilkn8/speech-emotion-detection-GCP
This repo contains the codes to perform Speech Emotion Recognition (SER) from 4 popular datasets. This projects aims to do a complete end-to-end MLOps orchestration using Gogle Cloud Platform's Cloud composer
Language: Python - Size: 3.62 MB - Last synced: 22 days ago - Pushed: 22 days ago - Stars: 0 - Forks: 0
gestaogovbr/Ro-dou
Gerador de DAGs no Airflow para fazer clipping do Diário Oficial da União.
Language: Python - Size: 484 KB - Last synced: 24 days ago - Pushed: 25 days ago - Stars: 62 - Forks: 12
polarbeargo/Data-Pipelines-with-Airflow
Language: Python - Size: 1020 KB - Last synced: 25 days ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
casassg/corrent
Corrent: Experimental Airflow functional DAG API
Language: Python - Size: 187 KB - Last synced: 28 days ago - Pushed: over 4 years ago - Stars: 4 - Forks: 0
tulibraries/tulflow
TU Libraries Python Library for functions used in indexing ETL, particularly for Airflow
Language: Python - Size: 2.26 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 3 - Forks: 1
ShamilNur/data-mining-kfu-2021
Homework assignments from courses in Data Mining, KFU, 4th semester, 2021.
Language: Jupyter Notebook - Size: 1.24 MB - Last synced: about 1 month ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
BaliDataMan/airflow-Tut
Airflow Tutorial to learn practical implementation with basics Concepts, Architecture, Working & Configuration practises.
Language: Shell - Size: 425 KB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
status-im/airflow-dags
Status BI python DAGs for Airflow
Language: Python - Size: 99.6 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 2
nbdevs/RDWA
Real-time Data Warehousing with Airflow: An events based microservices pipeline.
Size: 13.7 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0
tangphucnhan/Deploy-Airflow-server-with-docker
Deploy Apache Airflow server in docker. This docker will run three main services: Postgresql, Airflow weberver and Airflow scheduler.
Language: Python - Size: 5.86 KB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0
jesslossa/airflow-docker-localexecutor
Implements Airflow through docker on localhost (LocalExecutor), using docker-desktop.
Language: Python - Size: 171 KB - Last synced: about 1 month ago - Pushed: 10 months ago - Stars: 0 - Forks: 0
coderjolly/spotify-api-data-analysis
The project leverages Apache Airflow for automating Spotify API data analysis, focusing on user activity. Extracting, transforming, and loading data efficiently, it provides insights via PowerBI dashboards.
Language: Python - Size: 4.33 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0
Julianadev/DAG_Operators
Automating pipelines using airflow operators
Language: Python - Size: 5.86 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0
MatheusSC017/Message-Manager
This project aims to create a messaging system to manage messages sent via email and WhatsApp, working on them in an asynchronous and scalable way. It also has the functionality to save logs and send personalized messages for each type of subject.
Language: Python - Size: 70.3 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 2 - Forks: 0
yosh0555/airflow_with_mysql_and_snowflake
This is a small project where I used Airflow DAG program to create table in MySQL database and insert values into it. I also created a pipeline using spark to fetch data from MySQL database and process it and store the processed data into Snowflake data warehouse, I used Apache Airflow to automate this process
Language: Python - Size: 1000 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0
Shreyjain203/Spotify-recommendation-system
Built a Spotify recommendation system: fetched data from Spotify API, stored it on Google Cloud Storage, orchestrated with Airflow (Composer) to load into MongoDB Atlas, and developed a recommendation engine based on the data.
Language: Python - Size: 3.51 MB - Last synced: 30 days ago - Pushed: 2 months ago - Stars: 0 - Forks: 0
khaz-dev/TollData_ETL_AirflowBash
Build an Toll data simulation ETL Pipeline using Airflow (Python and Bash)
Language: Python - Size: 899 KB - Last synced: about 2 months ago - Pushed: 4 months ago - Stars: 1 - Forks: 0
mikeroyal/Apache-Airflow-Guide
Apache Airflow Guide
Language: Python - Size: 279 KB - Last synced: 1 day ago - Pushed: 13 days ago - Stars: 17 - Forks: 8
stevehoober254/ETL_Data_Pipeline_For_Retail_Store
ETL (Extract, Transform, Load) pipeline to integrate sales data from various sources into a central data warehouse
Language: Python - Size: 5.86 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 1 - Forks: 0
WALIDAADI/ETL_using_Airflow
This project presents a robust data pipeline using Apache Airflow for orchestration, Apache Kafka for real-time data streaming, and MongoDB for data storage. It automates the process of web scraping to collect large companies' data, transforms and processes this data, and then stores it efficiently.
Size: 69.3 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 0 - Forks: 0
chandulal/airflow-testing
Airflow Unit Tests and Integration Tests
Language: Python - Size: 465 KB - Last synced: about 2 months ago - Pushed: over 1 year ago - Stars: 249 - Forks: 45
JBris/time-series-airflow-kafka-spark
A simple demonstration of an Airflow-Kafka-Spark (AKS) stack for online time series forecasting.
Language: Python - Size: 699 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 1 - Forks: 0
AhmetFurkanDEMIR/airflow-spark-kafka-example
Airflow, Spark and Kafka example
Language: Dockerfile - Size: 532 KB - Last synced: 13 days ago - Pushed: 6 months ago - Stars: 4 - Forks: 0
visheshgupta-BA/MLOps---Airflow-Docker
Language: Python - Size: 1.87 MB - Last synced: about 1 month ago - Pushed: 2 months ago - Stars: 0 - Forks: 0
shiv-rna/Airflow-Basics
Welcome to my Apache Airflow learning journey repository! 🚀 This repository serves as a comprehensive documentation of my exploration and understanding of Apache Airflow, an open-source platform for orchestrating complex workflows.
Language: Python - Size: 65.8 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
Melcade/zelda-api
Utilize Airflow and JupyterLab to ETL data from the Zelda API, transform it, and load it into SQLite for SQL analysis in JupyterLab.
Language: Jupyter Notebook - Size: 521 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
astronomer/astro-provider-databricks
Orchestrate your Databricks notebooks in Airflow and execute them as Databricks Workflows
Language: Python - Size: 11.1 MB - Last synced: 17 days ago - Pushed: about 1 month ago - Stars: 20 - Forks: 10
AlexanderM-T/AirFlow-Training
En este repositorio se encuentran algunos dags realizados siguiendo el entrenamiento de data engineer con el fin de aprender y practicar airflow
Language: Python - Size: 18.6 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
ohadmata/simple-dag-editor
Zero configuration Airflow plugin that let you manage your DAG files.
Language: Python - Size: 394 KB - Last synced: 3 months ago - Pushed: over 2 years ago - Stars: 35 - Forks: 3
skalskibukowa/Airflow-python-practice
Project to practice airflow and python - ETL
Language: Python - Size: 15.7 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0
arunp77/SQL
SQL, Databases, warehouses, Data lake, cloud storage, MYSQL, Data Pipeline
Size: 18.7 MB - Last synced: 18 days ago - Pushed: 7 months ago - Stars: 2 - Forks: 1
dronectl/windfarm
Micro-service deployments for aggregation and analysis of propulsion tests.
Language: Jsonnet - Size: 220 KB - Last synced: 4 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0
akhich551995/Data-streaming-project-airflow-Kafka-spark-t-cassandra-docker
building a real-time data streaming pipeline, covering each phase from data ingestion to processing and finally storage. We'll utilize a powerful stack of tools and technologies, including Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra—all neatly containerized using Docker.
Language: Python - Size: 1.95 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
igorlangoni/final_project_data_eng_makers Fork of jdench1989/data_eng_final_project
Final project for the Makers Academy Data Engineering Bootcamp! In this amazing, complex group project we had to analyse a massive dataset and extract insightful data that could be used to improve education world-wide!
Language: Python - Size: 125 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 1
Susanhuynh/etl-API-data-to-AWS-S3-using-Airflow
This project focuses on utilizing Apache Airflow to orchestrate an ETL (Extract, Transform, Load) process using data from the Stack Overflow API. The primary objective is to determine the most prominent tags on Stack Overflow for the current month.
Language: Python - Size: 1.61 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
benkeyben/10alytics_air_realtor
10alytics_air_realtor ,a dynamic repository, hosts an AWS-driven data pipeline. Utilizing Apache Airflow, AWS S3, and EC2, it performs efficient ETL operations, extracting comprehensive real estate data from the Realty Mole Property API via RapidAPI. This tool empowers real estate professionals with timely insights for strategic decision-making.
Language: Jupyter Notebook - Size: 656 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0
borgettas/apache-airflow
This project is destinate a study for create a environment Apache Airflow
Language: Makefile - Size: 8.79 KB - Last synced: 4 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0
igorlangoni/online_retail_data_pipeline
An end-to-end pipeline that ingests raw data from CSV files through Airflow DAGS into BigQuery. From there, it uses dbt to normalize and clean the data and afterwards to make the transformations and come up with relevan reports.
Language: Python - Size: 15.4 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
sergio11/lyric_wave_architecture
🎵 LyricWave - Your AI Music Composer 🎶 Compose Unique MP4 Songs Effortlessly! LyricWave uses AI to create personalized music by harmonizing lyrics with captivating melodies and synthetic vocals. Unleash your musical creativity today! 🚀🎶
Language: Python - Size: 29 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 8 - Forks: 2
raphaelmansuy/mwaa_cli
A simple AirFlow mwaa cli command utility. It can be used to pause all the DAGS for a MWAA environment
Language: Shell - Size: 123 KB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 4 - Forks: 1
aimlnerd/data-pipelines-with-airflow
Airflow DAG tutorial with docker compose local setup
Language: Python - Size: 16.6 KB - Last synced: 4 months ago - Pushed: over 1 year ago - Stars: 3 - Forks: 0
philipobiorah/airflow_project
Weather Data ETL Project : This project involves a Python script that performs an Extract, Transform, Load (ETL) process for weather data. The script retrieves current weather information for a specified location and stores this data in an AWS S3 bucket.
Language: Python - Size: 8.79 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
sangwanamit621/learning-and-experiments
Welcome to the Learning and Experiments Hub—a dynamic repository capturing my journey of exploration and experimentation in the vast world of technology. This space serves as a digital canvas where I document my learning process, experiments, and discoveries.
Language: Jupyter Notebook - Size: 50.8 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0
cristian-rincon/action-composer-sync
This is a simple action that helps you to fetch your Apache Airflow DAGs to Google Cloud Composer
Language: Shell - Size: 3.91 KB - Last synced: 2 months ago - Pushed: about 2 years ago - Stars: 6 - Forks: 0
jashshah-dev/AWS-Big-Data-Pipeline-orchestrated-with-Airflow
A robust data pipeline leveraging Amazon EMR and PySpark, orchestrated seamlessly with Apache Airflow for efficient batch processing
Language: Python - Size: 16.6 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
pulkit6559/FlaskSparkAirflow-MFAnalytics
Flask-powered application for orchestrating Spark jobs through a dashboard in a Dockerized Airflow and Spark environment, for Mutual Fund Data retrieval and analysis (https://www.mfapi.in/)
Language: Python - Size: 1.14 MB - Last synced: about 1 month ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
makarov-m/Yandex.Practicum.DE
Here I added 6 projects which have been made by me during my apprenticeship in Yandex.Practicum as data engineer.
Language: Python - Size: 3.01 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 3 - Forks: 3
Penosh22/Market-news-and-forex-exchange-ETL-pipeline
ETL pipeline to extract webscraped forex data and google news library data on daily basis and storing it in a postgres database for market analysis and insights
Language: Python - Size: 1.95 KB - Last synced: 5 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0
MinaBasem/San-Francisco-Fire-Incidents-Streaming
An Ariflow, Kafka, AWS and Python data streaming project for Fire incidents in San Francisco
Language: Python - Size: 14.6 KB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0
yassinessadi/Fraud-Detection-in-Financial-Transactions
Detect and prevent fraudulent transactions in real-time. Our project utilizes advanced data analysis, focusing on transactional, customer, and external data to enhance security, maintain customer trust, and minimize financial losses.
Language: Python - Size: 143 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0
owenhiggins/ML-Network-Intrusion-Detection-Model
Created by: Owen Higgins ([email protected]) & Casey Gary ([email protected])
Language: Python - Size: 273 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 1 - Forks: 1
Ashutosh27ind/airflow-cert-dag-authoring
This repository contains examples and template files for DAG Authoring
Language: Python - Size: 23.4 KB - Last synced: 18 days ago - Pushed: 10 months ago - Stars: 0 - Forks: 0
karlapereira/airflow-example
Language: Python - Size: 29.3 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0
lgthevinh/ev-stock-etl-pipeline
An ETL pipeline that extracts, transforms, and loads data from various sources related to electric vehicle (EV) stocks.
Language: Python - Size: 14.6 KB - Last synced: about 1 month ago - Pushed: 6 months ago - Stars: 0 - Forks: 0
andressagomes26/airflow_challenge
Repositório destinado à resolução do Desafio Módulo 5 LH - Orquestração com Airflow
Language: Python - Size: 454 KB - Last synced: 6 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0
pgrondein/etl_data_pipeline_airflow
Pipeline that analyzes the web server log file, extracts the required lines and fields, transforms, and load (append to an existing file.)
Language: Python - Size: 7.81 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 1 - Forks: 0
michaelosthege/apache-airflow-flowitems
This package helps to reduce the amount of boilerplate code when creating Airflow DAGs from Python callables.
Language: Python - Size: 33.2 KB - Last synced: 8 days ago - Pushed: about 1 year ago - Stars: 4 - Forks: 0
teenbress/Weather_Data_Pipeline_with_Airflow_and_Slack_Alert
End-to-end data pipeline --OpenWeather Alert, monitored with Slack alert
Language: Python - Size: 1.64 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 1 - Forks: 0
sagardua297/udacity-data-engineering-nd
Data Pipeline Analytics Platform is an end-to-end generic Big Data pipeline. Involves following tech stack: AWS S3, AWS Redshift, AWS EMR Cluster, Apache Spark, Apache Airflow.
Language: Python - Size: 1.81 MB - Last synced: 7 months ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0
Manny-Brar/DataEngineeringNanodegree-P5-DataPipelines-Airflow-Spark-AWS
Utilizing Airflow's built-in functionalities creating a reusable ETL pipeline. Source data resides in a S3 bucket, and the pipeline should include data quality checks and data should be processed within AWS Redshift.
Language: Python - Size: 216 KB - Last synced: 7 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0
mpavanetti/airflow
This set of code and instructions has the porpouse to instanciate a compiled environment with set of docker images like airflow webserver, airflow scheduler, postgresql, pyspark, Data Pipeline consuming data from weather api , processing with pyspark and storing in postgresql
Language: PHP - Size: 1.62 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 8 - Forks: 3
sreekesh93/airflow_beginners
to start airflow in local with basic setup
Language: Python - Size: 2.93 KB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 4 - Forks: 0
Mouhamed-Jinja/postgres-s3-data-migration-with-airflow
Airflow Data Migration Project: A comprehensive Airflow project demonstrating data migration from PostgreSQL to AWS S3. Leverage the power of Airflow's operators, connections, and hooks to build robust and scalable data pipelines. Ideal for data engineering enthusiasts looking to learn and implement Airflow in real-world scenarios
Language: Python - Size: 7.08 MB - Last synced: 13 days ago - Pushed: 7 months ago - Stars: 0 - Forks: 0
prasadanilmore/ETL-JustEat-TestCase
This documentation provides an overview of the tasks completed in this project. The project comprises three key tasks: ETL (Extract, Transform, Load), API development, and Data Orchestration using Airflow. Each task is detailed below along with explanations of design choices and considerations.
Language: Python - Size: 79.1 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0
zarexalvindaria/data-engineering
This repo contains the Data Engineering exercises I took in Datacamp.
Language: Python - Size: 76.7 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 3 - Forks: 0
njoppi2/data-pipeline
🔄 A data pipeline that automates daily data extraction and loading between two Postgres databases.
Language: Jupyter Notebook - Size: 81.1 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 1 - Forks: 0
alopezgo/ETL-Python-Airflow-Bigquery
Extract Transform and Load Data scripts with Python, Bigquery/SQL API and Airflow orchestration
Language: Jupyter Notebook - Size: 388 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0
scriptstar/airflow-weather-station
Get started with Apache Airflow. Check the README for instructions on how to run your first DAGs today. 🚀
Language: Python - Size: 12.5 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0
at-akshat-2107/Codepro-EdTech-Lead-Scoring-Classification-MLOps-Assignment
In this project, you will create an end-to-end Airflow pipeline, integrated with MLflow, for CodePro, an EdTech startup, to perform lead scoring and maximize profitability while minimizing the Customer Acquisition Cost (CAC). The assignment involves data collection, preprocessing, and periodic model retraining within Airflow, alongside development
Language: Jupyter Notebook - Size: 13.9 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0
lediau/bigdata-data-engineering-ai-masters
Language: Python - Size: 122 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0
scalactic/airflow-example
airflow-example in docker
Language: Python - Size: 7.81 KB - Last synced: 9 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0
seblum/mlops-airflow-DAGs
Airflow DAGs repository synced for the MLOps plattform of the repository mlops-eks-mlplatform
Language: Python - Size: 165 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 1 - Forks: 0
immu0001/Udacity-Data-Engineer-nanodegree
Classwork projects and home works done through Udacity data engineering nano degree
Language: Jupyter Notebook - Size: 101 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 64 - Forks: 71
k0rsakov/dag_factory
Фабрика DAG
Language: Python - Size: 16.6 KB - Last synced: 9 months ago - Pushed: about 1 year ago - Stars: 0 - Forks: 1
shlin168/airflow-local-playground
docker compose files and some example DAGs for playing and learning airflow
Language: Python - Size: 148 KB - Last synced: 9 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
luelhagos/Data-warehouse
Data Engineering: Data warehouse tech stack with MySQL, DBT, Airflow, and Spark
Language: Python - Size: 2.93 KB - Last synced: 9 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
avikbesu/AirflowPractice
Language: Python - Size: 43.9 KB - Last synced: 9 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0
prasanthsagarkottakota/Twitter-Pipeline
Tools : Python, Twitter API, Apache airflow, Amazon S3, EC2
Language: Python - Size: 7.81 KB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 1 - Forks: 0
belladzhu/airflow-projects
Создание дагов для автоматизации отчетности
Language: Python - Size: 5.86 KB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 0 - Forks: 0
rikiafifuddin/airflow-docker
Airflow Docker with pip instalation using requirement.txt and using DAG from cloned repository
Language: Dockerfile - Size: 16.6 KB - Last synced: 9 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0
mehroosali/s3-redshift-batch-etl-pipeline
Built functional python ETL script with functions that initialized spark clusters using pyspark library to extract songs stored in S3 bucket. Partitioned songs data by year and artist_id and compressed in parquet output files to increase load performance. Used the overwrite mode in spark to ensure every new run of ELT script is overwritten in the data lake to avoid duplicates. Orchestrated ELT data pipeline that extracts from S3, loads in redshift for transformation and loads output back to S3. Used hooks in airflow to make connection credentials configurable in order to separate access rights from code base for security. Used operators to execute loading and transformation scripts for redshift with airflow DAG.
Language: Python - Size: 944 KB - Last synced: 9 months ago - Pushed: over 2 years ago - Stars: 4 - Forks: 3
mehroosali/ABCStoresPipeline
Batch ETL data pipeline built on HDP 3.0 to process daily sales and business data to procedure power Bi reports. Automated the pipelines using Airflow.
Language: Scala - Size: 464 KB - Last synced: 9 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0
avocadojj/marketing
Making pipeline for bank marketing campaign
Language: Jupyter Notebook - Size: 212 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 2
andersonesanto/igti-edc-desafio-final
IGTI MBA Engenharida de dados - Bootcamp Engenheiro de Dados Cloud - Desafio final
Language: Python - Size: 58.6 KB - Last synced: 8 months ago - Pushed: about 2 years ago - Stars: 2 - Forks: 0