An open API service providing repository metadata for many open source software ecosystems.

Topic: "etl-pipelines"

yobix-ai/extractous

Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.

Language: Rust - Size: 2.88 MB - Last synced at: 28 days ago - Pushed at: 5 months ago - Stars: 1,051 - Forks: 43

patterns-app/patterns-devkit

Data pipelines from re-usable components

Language: Python - Size: 1.75 MB - Last synced at: 28 days ago - Pushed at: about 2 years ago - Stars: 108 - Forks: 5

level-vc/useful

The open-source Useful SDK. One Python decorator in the Useful library provides full observability of Python functions within an ETL pipeline.

Language: Python - Size: 49.1 MB - Last synced at: 8 months ago - Pushed at: over 1 year ago - Stars: 20 - Forks: 1

abrahamkoloboe27/Airflow-Pipeline-Dashboard-Compagnie-Aerienne

Application link

Language: Python - Size: 555 KB - Last synced at: 27 days ago - Pushed at: 5 months ago - Stars: 5 - Forks: 0

ChristianRCanlas/ChristianRCanlas.github.io

e-Portfolio showcasing my personal projects.

Language: Python - Size: 279 MB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

prneidhardt/Apache-Data-Pipeline

Sparkify project

Language: Jupyter Notebook - Size: 264 KB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

EmmanuelEzenwere/DataSift

DataSift automatically applies a data pre-processing pipeline to data science projects.

Language: Python - Size: 117 KB - Last synced at: 2 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

Chek0rrdn/DataEngineer_ETL

A project structure for doing and sharing data engineering work.

Language: Python - Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

Xuconnika/baboon

For scribes of Thoth in the shell — your codebrain’s sacred scroll.

Language: Dockerfile - Size: 3.91 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

IMAbril/RENIS

Language: Jupyter Notebook - Size: 4.18 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

pranaypkadu/networksecurity

End-to-End MLOps Project with ETL Pipelines: Building a Network Security System

Language: Python - Size: 18.6 MB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

speedbits/LimitlessETL

A Python and Spark based ETL framework. It operates within speed limits (its framework and standards) but offers boundless possibilities.

Language: Python - Size: 15.4 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

extralo/loom

Weaving together different threads (services like image/audio converse, ETL services, etc.) to enable the World Wide Flow

Language: JavaScript - Size: 11.7 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Suraj520/data-engineering-projects

Data engineering projects.

Language: Jupyter Notebook - Size: 271 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

omar-elmaria/airflow_local

This repo contains the DAGs that run on my local Airflow environment. I use the local environment to test my DAGs before deploying them to virtual machines via Kubernetes.

Language: Python - Size: 81.1 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

juniors90/PymaciesArg

An extension that registers all pharmacies in Argentina.

Language: Python - Size: 26.9 MB - Last synced at: 6 days ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

angelxd84130/Airflow-ETL

Build ETL pipelines on Airflow to load data from BigQuery and store it in MySQL

Language: Python - Size: 705 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 1

siddarthaThentu/Disaster-Response-Pipeline

A deployed machine learning model that automatically classifies incoming disaster messages into 36 related categories. Project developed as part of Udacity's Data Science Nanodegree program.

Language: Python - Size: 9.57 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

Guilherme-B/baboon

JSON-driven ETL pipeline framework prototype

Language: Python - Size: 22.5 KB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0