An open API service providing repository metadata for many open source software ecosystems.

GitHub / shravan-kuchkula / udacity-data-eng-proj2

A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract data from S3, apply a series of transformations and load into S3 and Redshift.

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/shravan-kuchkula%2Fudacity-data-eng-proj2
PURL: pkg:github/shravan-kuchkula/udacity-data-eng-proj2

Stars: 25
Forks: 17
Open issues: 0

License: None
Language: Jupyter Notebook
Size: 102 MB
Dependencies parsed at: Pending

Created at: over 5 years ago
Updated at: about 3 years ago
Pushed at: almost 4 years ago
Last synced at: over 2 years ago

Topics: airflow, docker, etl-pipeline, python, redshift, s3fs

    Loading...