An open API service providing repository metadata for many open source software ecosystems.

GitHub / PrathameshLakawade / Pipeline-Genie

Pipeline-Genie is an intelligent data pipeline that processes CSV datasets, identifies their schema, and leverages LLaMA 2.0 to extract business insights. Users can select relevant business needs, triggering automated ETL transformations using Apache Spark. The final transformed dataset is stored in AWS S3 and made available for download.

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/PrathameshLakawade%2FPipeline-Genie
PURL: pkg:github/PrathameshLakawade/Pipeline-Genie

Stars: 4
Forks: 0
Open issues: 0

License: mit
Language: Python
Size: 850 KB
Dependencies parsed at: Pending

Created at: 6 months ago
Updated at: 4 months ago
Pushed at: 5 months ago
Last synced at: 3 months ago

Topics: apache-spark, artificial-intelligence, aws-s3, business-insights, csv-processing, data-pipeline, data-transformation, etl-pipeline, fastapi, generative-ai, llama2, machine-learning, mongodb-atlas, python, react

    Loading...