An open API service providing repository metadata for many open source software ecosystems.

GitHub / realdatadriven / etlx

This project is an ETL (Extract, Transform, Load) framework powered by DuckDB, designed to integrate and process data from diverse sources. It uses Markdown as its configuration medium: YAML blocks define the metadata for each data source, and embedded SQL blocks specify the extraction, transformation, and loading logic.

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/realdatadriven%2Fetlx
PURL: pkg:github/realdatadriven/etlx

Stars: 11
Forks: 1
Open issues: 0

License: MIT
Language: Go
Size: 2.94 MB
Dependencies parsed at: Pending

Created at: 8 months ago
Updated at: 13 days ago
Pushed at: 13 days ago
Last synced at: 12 days ago

Topics: data-engineering, data-quality, data-quality-checks, data-quality-monitoring, data-science, duckdb, etl, object-storage, relational-databases, report, report-automation, s3, s3-storage
