Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub / polakowo / yelp-3nf

3NF-normalize Yelp data on S3 with Spark and load it into Redshift - automate the whole thing with Apache Airflow

JSON API: https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/polakowo%2Fyelp-3nf

Stars: 12
Forks: 3
Open Issues: 0

License: None
Language: Jupyter Notebook
Repo Size: 1.82 MB
Dependencies: 0

Created: almost 5 years ago
Updated: 3 months ago
Last pushed: almost 5 years ago
Last synced: 26 days ago

Topics: 3nf, airflow, amazon-emr, amazon-redshift, cloud, data-marts, data-pipeline, data-warehouse, dimensional-tables, etl-process, normalization, nosql, s3-bucket, spark, sql, yelp-dataset

Funding links: https://github.com/sponsors/polakowo

Files
    Loading...
    Readme
    Loading...

    No dependencies found