An open API service providing repository metadata for many open source software ecosystems.

Topic: "redshift-cluster"

josephmachado/beginner_de_project

Beginner data engineering project - batch edition

Language: HTML - Size: 31.1 MB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 521 - Forks: 165

terraform-aws-modules/terraform-aws-redshift

Terraform module to create AWS Redshift resources πŸ‡ΊπŸ‡¦

Language: HCL - Size: 186 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 88 - Forks: 159

servian/amazon-redshift-checklist

This checklist aims to be an exhaustive list of all elements you should consider when using Amazon Redshift.

Size: 81.1 KB - Last synced at: 5 months ago - Pushed at: almost 5 years ago - Stars: 15 - Forks: 6

jaredfiacco2/AWS-E-Scooter-Tracker

Use AWS Lambda to Pull E-Scooter and E-Bike Location Data, store in S3 & Redshift using Data Vault Data Model, Server to Google Data Studio Dashboard

Language: PLpgSQL - Size: 29.2 MB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 12 - Forks: 1

essraahmed/Data-Warehouse-With-Redshift

Data Warehouse with AWS Redshift and Visualizing data using Power BI

Language: Jupyter Notebook - Size: 618 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 1

miztiik/redshift-demo

Simple getting started 1-node redshift cluster stack

Language: Python - Size: 1.95 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 7

idelfonsog2/data_warehouse_with_aws_redshift πŸ“¦

The idea is how can we prepare data to be used by Business Intelligence applications like Tableu or even Jupyternotebook! πŸ‘ In order to help the business see an overview of the data in a diagram of what important features of the product their customers might be using. Mainly, how can we improve the performance of these OLAP and OLTP transactions? For that, we use the combination of star schema tables, we build a strategy for a distributed data system, and do grouping for all the features thanks to REDSHIFT.

Language: Jupyter Notebook - Size: 40 KB - Last synced at: 7 days ago - Pushed at: about 5 years ago - Stars: 3 - Forks: 1

Dina-Hosny/Sparkify---Data-Pipelines-with-Airflow

Sparkify - Data Pipelines with Airflow - Udacity Data Engineering Expert Track.

Language: Python - Size: 22.5 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

Ahmedmagdy31/Data-Engineering-Project-over-AWS

Building and executing end-to-end ELT pipeline and driving analytics using Amazon Redshift as the data warehouse solution.

Size: 11.7 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0

polarbeargo/Udacity-nd027-Data-Warehouse

Language: Jupyter Notebook - Size: 392 KB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 3

rishavmehra/health-synth

Health Synth simplifies healthcare with unified data, automated notifications, and patient feedback and monitoring the reports for better care and efficiency.

Language: Go - Size: 489 KB - Last synced at: 6 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

MaxineXiong/Cloud-Data-Warehousing-with-AWS-Redshift

This project builds a cloud-based ETL pipeline for Sparkify to move data to a cloud data warehouse. It extracts song and user activity data from AWS S3, stages it in Redshift, and transforms it into a star-schema data model with fact and dimension tables, enabling efficient querying to answer business questions.

Language: Jupyter Notebook - Size: 20.5 KB - Last synced at: 6 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

abrook7/ETL_Project

Airflow orchestrated ETL (running in docker containers) that pulls batch data from an API to a local Postgres database, loads to AWS S3/Redshift provisioned by Terraform, and visualized in Quicksight.

Language: Python - Size: 983 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

essraahmed/Data-Pipeline-with-Airflow

Data Pipeline with Apache Airflow

Language: Python - Size: 444 KB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 1

ShikhaYadav123/AWS-Glue-IMDB-Data-Quality-ETL-Pipeline

IMDB Movie Data ETL Pipeline using S3, Glue, Redshift, EventBridge, SNS

Language: Python - Size: 5.69 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

anjijava16/Multi_Cloud_DWH_Utils

Compare the Multi Cloud Data warehouse systems

Size: 14.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

sudips413/DataModelingCollectionWarehousingRedshift

The data is collected from IMDB and then transformed before loading to warehouse

Language: Jupyter Notebook - Size: 1.07 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Marcoc51/Sparkify-Data-Warehouse

This project is a data warehousing solution for Sparkify, a music streaming service to extract data from JSON logs and stores it in a star schema data model in Amazon Redshift.

Language: Python - Size: 13.7 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

KopiCloud/terraform-aws-redshift-cluster

Deploy an Amazon Redshift Cluster in AWS using Terraform

Language: HCL - Size: 8.79 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

julianazhu/sparkify-redshift-etl

ETL Pipeline from AWS S3 to Redshift

Language: Jupyter Notebook - Size: 252 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 1

rbuerki/Data-Engineering-Nanodegree

Reference code and projects for Udacity's Data Engineering Nanodegree. Graduated Jun 2020.

Language: Jupyter Notebook - Size: 1.95 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 1

Bahaa29/Sparkify_Airflow

data pipeline ETL using Apache Airflow form data movement and amazon s3-storge with redshift cluster to storing the data in fact and DEM teables

Language: Python - Size: 20.5 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 1

jkenney0501/AWS_Data_Engineering

AWS Pipeline examples from Udacity Date Engineering Nanodegree.

Language: Jupyter Notebook - Size: 1.72 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

jkenney0501/AWS-Data-Engineering-Project-

This project builds a pipeline from AWS S3 storage to a Redshift Cloud hosted Cluster where an ETL process extracts the staged data and creates a STAR Schema to allow business users to query the data easier for business intelligence.

Language: Jupyter Notebook - Size: 128 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 1

mhaywardhill/Redshift-DWH-ETL

Udacity Data Engineering Nanodegree Project 3

Language: Jupyter Notebook - Size: 15.6 KB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

alexandrabaturina/redshift-data-warehouse

Python ETL pipeline to load data from Amazon S3 to Redshift analytics tables

Language: Python - Size: 26.4 KB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

Dipesh-Pokhrel/redshift-cluster

Warehousing with redshift

Language: Jupyter Notebook - Size: 9.91 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

joyceannie/Data-Warehouse-AWS

A music streaming startup, Sparkify, has grown their user base and song database and want to move their processes and data onto the cloud. The data resides in S3, in a directory of JSON logs on user activity on the app, as well as a directory with JSON metadata on the songs in their app. The objective of the project is to create an ETL pieline to build a datawarehouse . We extract data from S3, stage them in Redshift, and transform data into a set of dimensional tables for the analytics team to continue finding insights into what songs their users are listening to.

Language: Jupyter Notebook - Size: 17.6 KB - Last synced at: 6 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

sunnykan/sparkifydb_rs

A data warehouse on Amazon Redshift using a star schema to facilitate the analysis of user behaviour on a music streaming app.

Language: Python - Size: 53.7 KB - Last synced at: 8 months ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

topgyalgurung/AWS_Redshift-

Size: 1000 Bytes - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

adharangaonkar/ETL-Pipelines

A repository concentrating on using High end parallel pipelines to perform ETL across various data sources

Language: Jupyter Notebook - Size: 672 KB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

Kanishkparganiha/data-warehouse-on-AWS-Redshift-for-Music-Application

Language: Python - Size: 95.7 KB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

mlincon/terraform-module-aws-redshift

Simplistic Terraform module for creating a AWS Redshift cluster to allow direct access programmatically or via a tool like DBeaver, pgAdmin, etc.

Language: HCL - Size: 24.4 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1

MrBenA/Data_Warehouse-Amazon_Redshift

Udacity Data Engineering project: Data Warehouse

Language: Python - Size: 12.7 KB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

jpsalado92/Udacity-DEND_DataWarehouse-AWSRedshift

Full code for UDACITY's Data Engineer Nano Degree project. Build a Data Warehouse in AWS with Amazon Redshift.

Language: Python - Size: 3.93 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 1

topunix/AWS-Redshift

:cloud: Creating a Redshift Cluster using the AWS Python SDK

Language: Python - Size: 8.79 KB - Last synced at: 6 months ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0