GitHub topics: elt
dataform-co/dataform
Dataform is a framework for managing SQL based data operations in BigQuery
Language: TypeScript - Size: 16.5 MB - Last synced at: about 14 hours ago - Pushed at: about 15 hours ago - Stars: 906 - Forks: 181

cloudquery/cloudquery
The developer first cloud governance platform
Language: Go - Size: 173 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 6,108 - Forks: 527

datazip-inc/olake
Fastest open-source tool for replicating Databases to Data Lake in Open Table Formats like Apache Iceberg. ⚡ Efficient, quick and scalable data ingestion for real-time analytics. Supporting Postgres, MongoDB and MySQL
Language: Go - Size: 31.9 MB - Last synced at: about 19 hours ago - Pushed at: about 19 hours ago - Stars: 883 - Forks: 80

goto/optimus-any2any
optimus-any2any is a versatile tool designed to transfer data from any source to any sink with configurable options. It supports various data sources and sinks, providing a flexible and powerful way to handle data transfers.
Language: Go - Size: 855 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

artie-labs/transfer
Database replication platform that leverages change data capture. Stream production data from databases to your data warehouse (Snowflake, BigQuery, Redshift, Databricks) in real-time.
Language: Go - Size: 3.98 MB - Last synced at: about 4 hours ago - Pushed at: about 5 hours ago - Stars: 655 - Forks: 33

apache/flink-cdc
Flink CDC is a streaming data integration tool
Language: Java - Size: 41.1 MB - Last synced at: 2 days ago - Pushed at: 8 days ago - Stars: 6,081 - Forks: 2,012

mage-ai/mage-ai
🧙 Build, run, and manage data pipelines for integrating and transforming data.
Language: Python - Size: 234 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 8,358 - Forks: 847

quarylabs/quary
Open-source BI for engineers
Language: Rust - Size: 105 MB - Last synced at: 1 day ago - Pushed at: 4 months ago - Stars: 2,308 - Forks: 57

rudderlabs/rudder-server
Privacy and Security focused Segment-alternative, in Golang and React
Language: Go - Size: 309 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 4,200 - Forks: 337

DataRecce/recce
The data-validation toolkit for enhanced dbt (data build tool) PR review
Language: TypeScript - Size: 20.6 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 372 - Forks: 14

mahmoudparsian/data-warehousing
This repository is a place for the Data Warehousing course at the Information Systems & Analytics department, Santa Clara University.
Language: Jupyter Notebook - Size: 538 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 9 - Forks: 2

dbt-labs/dbt-core
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Language: Python - Size: 46.9 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 10,924 - Forks: 1,738

dlt-hub/dlt
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
Language: Python - Size: 91.1 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3,687 - Forks: 283

TobikoData/sqlmesh
Scalable and efficient data transformation framework - backwards compatible with dbt.
Language: Python - Size: 77.4 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2,362 - Forks: 218

Teradata/dbt-teradata
dbt adapter for Teradata
Language: Python - Size: 1.09 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 24 - Forks: 17

apache/doris
Apache Doris is an easy-to-use, high performance and unified analytics database.
Language: Java - Size: 1 GB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 13,742 - Forks: 3,441

apache/seatunnel
SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.
Language: Java - Size: 43.1 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 8,548 - Forks: 1,991

brankowss/Cloud-Book-Price-Tracker-ELT-Pipeline
Cloud Data Engineering ELT pipeline: Scrapes Serbian bookstore data (Scrapy), stores raw on S3, loads to RDS (Postgres), transforms/tests with dbt, orchestrated by Airflow on EC2 (Docker). Includes Telegram alerts.
Language: Python - Size: 631 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

airbytehq/airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Language: Python - Size: 677 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 18,320 - Forks: 4,554

edgarrmondragon/meltano-dogfood
Personal dogfood Meltano project
Language: Pkl - Size: 1.94 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 8 - Forks: 1

apache/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Language: Python - Size: 382 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 40,370 - Forks: 15,104

airbytehq/PyAirbyte
PyAirbyte brings the power of Airbyte to every Python developer.
Language: Python - Size: 2.46 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 267 - Forks: 53

MeltanoLabs/target-snowflake
Singer Target for the Snowflake cloud Data Warehouse
Language: Python - Size: 1.05 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 11 - Forks: 28

meltano/meltano
Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.
Language: Python - Size: 140 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 2,085 - Forks: 178

reservoir-data/tap-canny
Singer tap for Canny. Built with the Meltano Singer SDK.
Language: Python - Size: 593 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

reservoir-data/tap-polarsh
Singer Tap for polar.sh
Language: Python - Size: 1.34 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

edgarrmondragon/tap-clinicaltrials
Singer tap for ClinicalTrials.gov study records data.
Language: Python - Size: 325 KB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

MeltanoLabs/tap-stackexchange
Singer tap for the StackExchange API
Language: Python - Size: 1.16 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 3 - Forks: 1

MeltanoLabs/tap-intacct
Singer tap for the Sage Intacct API
Language: Python - Size: 337 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 2 - Forks: 1

MeltanoLabs/target-csv
A CSV target for Singer, made with the Meltano SDK for Taps and Targets
Language: Python - Size: 522 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 5 - Forks: 7

edgarrmondragon/tap-google-play
Singer tap for Google Play Reviews
Language: Python - Size: 589 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1 - Forks: 3

reservoir-data/tap-honeybadger
Singer tap for Honeybadger.io
Language: Python - Size: 377 KB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0

Netflix/maestro
Maestro: Netflix’s Workflow Orchestrator
Language: Java - Size: 1.55 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 3,480 - Forks: 217

scribe-org/Scribe-Server
Backend service for Scribe data downloads
Language: Go - Size: 343 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 5 - Forks: 8

franloza/running-races-insights
Web application created with Evidence and DuckDB to share stats about the running races in Cuenca.
Language: Jupyter Notebook - Size: 3.67 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 3 - Forks: 0

datacoves/docs
Datacoves public documentation
Language: Python - Size: 47.5 MB - Last synced at: 1 day ago - Pushed at: 6 days ago - Stars: 3 - Forks: 3

dashmug/glue-utils
Python library designed to enhance the developer experience when working with AWS Glue ETL and Python Shell jobs by reducing boilerplate code, increasing type safety, and improving IDE auto-completion.
Language: Python - Size: 762 KB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 8 - Forks: 2

faros-ai/airbyte-connectors
Airbyte connectors (sources & destinations) + Airbyte CDK for JavaScript/TypeScript
Language: TypeScript - Size: 24.9 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 120 - Forks: 65

cloudquery/plugin-sdk
CloudQuery Go SDK for source and destination plugins
Language: Go - Size: 18.2 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 24 - Forks: 25

transferia/transferia
Open Source Cloud Native Ingestion engine
Language: Go - Size: 21.4 MB - Last synced at: 5 days ago - Pushed at: 7 days ago - Stars: 121 - Forks: 14

slingdata-io/sling-cli
Sling is a CLI tool that extracts data from a source storage/database and loads it in a target storage/database.
Language: Go - Size: 77.2 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 583 - Forks: 48

airbytehq/abctl
Airbyte's CLI for managing local Airbyte installations
Language: Go - Size: 690 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 51 - Forks: 7

flow-php/etl-adapter-doctrine
PHP ETL Adapter: Doctrine
Language: PHP - Size: 380 KB - Last synced at: about 1 hour ago - Pushed at: 10 days ago - Stars: 3 - Forks: 2

gouline/dbt-metabase
dbt + Metabase integration
Language: Python - Size: 2.15 MB - Last synced at: 10 days ago - Pushed at: 22 days ago - Stars: 522 - Forks: 75

dom-andolino/portfolio-google-api
Exploring Google Knowledge Graph API
Language: Jupyter Notebook - Size: 827 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

ericmulovhedzi/AI
Data Science and Deep Machine Learning (ML) algorithm source code, written primarily in Python
Language: Python - Size: 2.71 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

umitkaanusta/reddit-detective
Play detective on Reddit: Discover political disinformation campaigns, secret influencers and more
Language: Python - Size: 269 KB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 217 - Forks: 15

guidok91/spark-movies-etl
Spark data pipeline that processes movie ratings data.
Language: Python - Size: 3.79 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 28 - Forks: 12

reservoir-data/tap-betterstack
Singer tap for Better Stack. Built with the Meltano Singer SDK.
Language: Python - Size: 748 KB - Last synced at: 6 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

reservoir-data/tap-readme
Singer tap for ReadMe.com
Language: Python - Size: 527 KB - Last synced at: 11 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

reservoir-data/tap-toggl
Singer Tap for the Toggl API
Language: Python - Size: 346 KB - Last synced at: 6 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

reservoir-data/tap-pomelo
Singer tap for Pomelo, built with the Singer SDK
Language: Python - Size: 528 KB - Last synced at: 6 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

datayoga-io/datayoga
streaming data pipeline platform
Language: Python - Size: 2.56 MB - Last synced at: about 24 hours ago - Pushed at: 2 months ago - Stars: 29 - Forks: 4

datacoves/dbt-coves
CLI tool for dbt users to simplify creation of staging models (yml and sql) files
Language: Python - Size: 2.95 MB - Last synced at: 1 day ago - Pushed at: 15 days ago - Stars: 263 - Forks: 16

astronomer/astro-sdk 📦
Astro SDK allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
Language: Python - Size: 7.54 MB - Last synced at: 9 days ago - Pushed at: 18 days ago - Stars: 369 - Forks: 48

vbluuiza/github-repos-language-etl
📊 ETL pipeline that extracts and analyzes programming languages used in GitHub repositories from tech companies like Amazon, Netflix, and Spotify. Built with Python, Pandas, and WSL2.
Language: Jupyter Notebook - Size: 88.9 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

edgarrmondragon/tap-planetscaleapi
Singer Tap for the PlanetScale API
Language: Python - Size: 542 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

unytics/airbyte_serverless
Airbyte made simple (no UI, no database, no cluster)
Language: Python - Size: 3.43 MB - Last synced at: 5 days ago - Pushed at: about 2 months ago - Stars: 173 - Forks: 14

tigisthailay/data2bots-elt-pipeline-airflow
A project to build an ELT pipeline (batch or streaming) that loads the business data into the data2bots data warehouse and performs the transformation.
Language: Jupyter Notebook - Size: 30 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 0

apache/airflow-publish
Publishing PyPI packages for Apache Airflow
Language: Python - Size: 54.7 KB - Last synced at: 6 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 1

ucbepic/docetl
A system for agentic LLM-powered data processing and ETL
Language: Python - Size: 54.1 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 1,949 - Forks: 186

vmware/versatile-data-kit
One framework to develop, deploy and operate data workflows with Python and SQL.
Language: Python - Size: 110 MB - Last synced at: 15 days ago - Pushed at: 17 days ago - Stars: 449 - Forks: 59

MeltanoLabs/tap-dbt
Singer Tap for dbt API v2 built with the Meltano SDK
Language: Python - Size: 1010 KB - Last synced at: 6 days ago - Pushed at: 25 days ago - Stars: 12 - Forks: 7

cre-dev/xml2db
A Python package to load complex XML files into a relational database
Language: Python - Size: 858 KB - Last synced at: 29 days ago - Pushed at: 30 days ago - Stars: 12 - Forks: 5

cuebook/cuelake
Use SQL to build ELT pipelines on a data lakehouse.
Language: JavaScript - Size: 28 MB - Last synced at: 18 days ago - Pushed at: about 3 years ago - Stars: 288 - Forks: 28

Francois-lenne/elt-mp4-quiberon
The goal of this project is to retrieve the video of the municipality of Quiberon and detect whether or not a person is in it.
Language: Python - Size: 38.1 KB - Last synced at: 4 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

173TECH/sayn
Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).
Language: Python - Size: 4.28 MB - Last synced at: 22 days ago - Pushed at: about 2 months ago - Stars: 121 - Forks: 15

GuiFernandess7/reddit-images-extractor-and-ml-classifier
ELT application that captures images from Reddit and stores them in an SQLite database for further ML classification.
Language: Python - Size: 602 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

dataforgelabs/dataforge-core
DataForge helps data teams write functional transformation pipelines by leveraging software engineering principles
Language: PLpgSQL - Size: 750 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 48 - Forks: 1

raystack/optimus
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
Language: Go - Size: 12.2 MB - Last synced at: 15 days ago - Pushed at: 12 months ago - Stars: 748 - Forks: 154

Datavault-UK/automate-dv
A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)
Size: 8.32 MB - Last synced at: 23 days ago - Pushed at: 3 months ago - Stars: 538 - Forks: 136

blotoutio/elt-source-mapping
Repository contains the predefined mappings for known 3rd party sources for data ingestion & transformation.
Size: 178 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

airbytehq/airbyte-interop-catalog
The Airbyte Interop Catalog. Tools, guides, and reusable dbt packages which help users join together Airbyte datasets and 3rd party schemas.
Language: Python - Size: 3.71 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 1

aws-samples/aws-etl-orchestrator
A serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.
Language: Python - Size: 651 KB - Last synced at: 1 day ago - Pushed at: about 1 year ago - Stars: 340 - Forks: 140

95xin/Data-Engineering-Project---Automatic-Batch-Data-Processing
Data Engineering Project - Automated Batch Data Processing
Language: Python - Size: 996 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

chayansraj/Python-ETL-pipeline-using-Airflow-on-AWS
This project demonstrates how to build and automate an ETL pipeline written in Python and schedule it using the open source Apache Airflow orchestration tool on an AWS EC2 instance.
Language: Python - Size: 356 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 14 - Forks: 4

rosaihzaa/simple-airflow-pipeline
ELT pipeline using Apache Airflow to scrape property data from www.properstar.com, load it into BigQuery, and transform the data with SQL.
Language: Python - Size: 86.9 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

montara-io/dbt-command-center
Never sift through endless dbt™ logs again. dbt Command Center is a free, open-source, local web application that provides a user-friendly interface to monitor and manage dbt runs.
Language: TypeScript - Size: 3.55 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 28 - Forks: 0

Kushalkhadka7/dagster_clickhouse_dbt
DBT and clickhouse test project with dagster
Language: Python - Size: 4.03 MB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 7 - Forks: 0

andrewtavis/wikirepo
Python based Wikidata framework for easy dataframe extraction
Language: Python - Size: 1.67 MB - Last synced at: 22 days ago - Pushed at: over 1 year ago - Stars: 44 - Forks: 6

wmakeev/simplex
SimplEx - simple expression language
Language: JavaScript - Size: 375 KB - Last synced at: 17 days ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

goto/optimus (fork of raystack/optimus)
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
Language: Go - Size: 33.5 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 7 - Forks: 4

BigDatalex/rewe_products
Data Engineering Pipeline extracting and transforming data from rewe shop API.
Language: Python - Size: 214 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

apache/doris-streamloader
Stream Loader for Apache Doris
Language: Go - Size: 39.1 KB - Last synced at: 6 days ago - Pushed at: 2 months ago - Stars: 24 - Forks: 18

reservoir-data/tap-streakcrm
Singer Tap for streak.com
Language: Python - Size: 353 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

dataform-co/dataform-example-project
Example project on Dataform
Language: JavaScript - Size: 55.7 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 10 - Forks: 4

MeltanoLabs/Singer-Working-Group
Working group for ongoing development and iteration of the Singer Spec, the de-facto protocol for open source data connectors. Please use "Issues" to create discussion items - or use "Discussions" for general questions.
Size: 28.3 KB - Last synced at: 16 days ago - Pushed at: 3 months ago - Stars: 14 - Forks: 4

kuwala-io/kuwala
Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We have set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations, together in one intuitive interface built with React Flow. In addition, we provide third-party data for data science models and products, with a focus on geospatial data. Currently, the following data connectors are available worldwide: a) High-resolution demographics data b) Points of Interest from Open Street Map c) Google Popular Times
Language: JavaScript - Size: 7.79 MB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 792 - Forks: 54

ascrus/getl
A tool for developing and testing ETL and ELT processes for automating the capture, delivery and processing of information in data warehouses on the MicroFocus Vertica platform.
Language: Groovy - Size: 232 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 57 - Forks: 10

feluelle/finance-data-builder
Finance 🏦 Data Builder 🛠️ @ postgres 🐘
Language: Python - Size: 1.61 MB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 21 - Forks: 1

taquynhnga2001/proptech-dagster
Build an ELT pipeline with dagster and dbt to schedule loading HDB resale transactions in Singapore into Google BigQuery data warehouse, then create Power BI dashboard to enhance insight exploration.
Language: Python - Size: 3.42 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

codeforkjeff/dbt-sqlite
A SQLite adapter plugin for dbt (data build tool)
Language: Python - Size: 173 KB - Last synced at: 4 days ago - Pushed at: 3 months ago - Stars: 78 - Forks: 15

apache/doris-sdk
SDK for Apache Doris
Language: Thrift - Size: 35.2 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 7

MattTriano/analytics_data_where_house
An analytics engineering sandbox focusing on real estate prices in Cook County, IL
Language: Python - Size: 17.3 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 9 - Forks: 0

transferia/iceberg
Transferia iceberg provider
Language: Go - Size: 134 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

yokawasa/databricks-notebooks
Collection of Sample Databricks Spark Notebooks ( mostly for Azure Databricks )
Language: Jupyter Notebook - Size: 15.6 KB - Last synced at: 2 months ago - Pushed at: almost 7 years ago - Stars: 86 - Forks: 75

ideavision/llm-development
"🚀 A job-ready, hands-on repository for practical LLM development! Master prompt engineering, fine-tuning, retrieval-augmented generation (RAG), and more with real-world examples and best practices. Perfect for AI engineers looking to build and deploy powerful language models."
Language: Jupyter Notebook - Size: 27.4 MB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

Renatoelho/fluxo-elt
This is an ELT (Extract, Load, Transform) process that integrates a legacy system backed by a relational database (MySQL in the example) with a NoSQL database (ElasticSearch), without significant changes to the transferred data.
Language: Python - Size: 4.99 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Renatoelho/apache-nifi-enriquecimento-cep
In this project, I dive into the world of Apache Nifi, exploring how to consume data from an API and save it directly to a database.
Size: 2.37 MB - Last synced at: 25 days ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 2

mattiasthalen/arcane-insight
Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from Blizzard’s Hearthstone API. Focused on card statistics and attributes, this project reveals detailed insights into card mechanics, strengths, and trends to support BI and strategic analysis.
Language: Python - Size: 741 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 33 - Forks: 1
