GitHub / PrathameshLakawade / Pipeline-Genie
Pipeline-Genie is an intelligent data pipeline that processes CSV datasets, identifies their schema, and leverages LLaMA 2.0 to extract business insights. Users can select relevant business needs, triggering automated ETL transformations using Apache Spark. The final transformed dataset is stored in AWS S3 and made available for download.
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/PrathameshLakawade%2FPipeline-Genie
PURL: pkg:github/PrathameshLakawade/Pipeline-Genie
Stars: 4
Forks: 0
Open issues: 0
License: mit
Language: Python
Size: 850 KB
Dependencies parsed at: Pending
Created at: 6 months ago
Updated at: 4 months ago
Pushed at: 5 months ago
Last synced at: 3 months ago
Topics: apache-spark, artificial-intelligence, aws-s3, business-insights, csv-processing, data-pipeline, data-transformation, etl-pipeline, fastapi, generative-ai, llama2, machine-learning, mongodb-atlas, python, react