An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: dimensional-modeling

samuel-wahinya/VelociMart-Business-Intelligence-Report

This project showcases how I helped VelociMart shift from Excel-based reporting to a robust data infrastructure using SQL Server and Tableau. Built around Sales, Customers, and Products Data Marts, it follows the Medallion Architecture (Bronze → Silver → Gold) to deliver clean, analysis-ready views for effective business insights.

Language: TSQL - Size: 2.22 MB - Last synced at: about 2 hours ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

mahmoudparsian/data-warehousing

This repository is a place for the Data Warehousing course at the Information Systems & Analytics department, Santa Clara University.

Language: Jupyter Notebook - Size: 478 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 8 - Forks: 2

deepakramani/dbt-bike-insights

A small-scale ETL project that ingests data into the data warehouse using dbt (data build tool) to facilitate transformation and analytics.

Language: Makefile - Size: 4.48 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

breadboard-bi/breadboard

Language: TXL - Size: 5.04 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 8 - Forks: 2

mattiasthalen/adventure-works

Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principles on Adventure Works. Features programmatic model generation, event-enhanced Puppini bridges, and temporal resolution across DAS/DAB/DAR layers.

Language: Python - Size: 18.4 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 95 - Forks: 8

datalopes1/machine_stop

End-to-end project creating a data warehouse for a fictional mining company called Astarte Mining Co.

Language: Jupyter Notebook - Size: 1.13 MB - Last synced at: 18 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

GADES-DATAENG/webinar

Code, scripts, and resources for the Data Engineering Fundamentals Course Webinar, covering Python, data pipelines, Apache Airflow, and more.

Language: Python - Size: 26.9 MB - Last synced at: 23 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

lopezj1/noaa_eda

This project demonstrates an ETL pipeline that processes NOAA's fishing survey data, then makes it available for analysis through an interactive web app.

Language: Python - Size: 1.75 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

MaxineXiong/Cloud-Data-Warehousing-with-AWS-Redshift

This project builds a cloud-based ETL pipeline for Sparkify to move data to a cloud data warehouse. It extracts song and user activity data from AWS S3, stages it in Redshift, and transforms it into a star-schema data model with fact and dimension tables, enabling efficient querying to answer business questions.

Language: Jupyter Notebook - Size: 20.5 KB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

lupusruber/music_analytics

This project processes real-time music event data using Kafka and Apache Spark on Google Cloud Dataproc, and stores the transformed data in BigQuery for analytics, all orchestrated by Airflow and managed with Terraform.

Language: Jupyter Notebook - Size: 22.1 MB - Last synced at: 26 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

mchenryspagg/creating-a-dimensional-data-model

This project involves creating a dimensional data model using MySQL Workbench for a car repair shop’s operations in western Canada by examining a sample invoice,

Size: 960 KB - Last synced at: 28 days ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

mchenryspagg/hng-hire-data-model

The project involves creating a data model for HNG Hire, implementing it in MySQL, and building a Power BI dashboard to display hiring statistics.

Size: 529 KB - Last synced at: 28 days ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

AmmarSahyoun/Dimensional-modeling

OLAP in TSQL and Python

Language: TSQL - Size: 527 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

VeenaBhyrava/ordersDimModelling

Dimensional modelling of order transactions

Language: Python - Size: 17.6 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

hase3b/End-to-End-DWH-Pipeline

This repository contains the end-to-end pipeline for building a data warehouse for a real estate management company. The pipeline includes data generation, ETL process, creation of star schema dimensions and fact table, visualization using Power BI, and automation with Pabbly Connect.

Language: Jupyter Notebook - Size: 2.55 MB - Last synced at: 19 days ago - Pushed at: 11 months ago - Stars: 0 - Forks: 1

BrksGuiOrnelas/data_modelling_dw

Creation of a data warehouse (DW) using dimensional modeling in a star schema.

Size: 334 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

Rconybea/xo-unit

Compile-time dimension conversion and checking, with support for fractional dimensions.

Language: C++ - Size: 480 KB - Last synced at: 1 day ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

Rconybea/xo-pyunit

Python bindings for xo-unit

Language: C++ - Size: 5.86 KB - Last synced at: 1 day ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

ali-bin-kashif/etl-pipeline-project-cola-next

Developed a robust ETL pipeline for Next Cola Pvt. Ltd. that extracts data from many different OLTP sources, converts it into dimensions and facts, and loads it into a data warehouse for analytical workloads.

Language: Python - Size: 344 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

doshiharmish/Cross-Company-Insights

This project merges customer reviews from Fudgemart and FudgeFlix to create a unified data warehouse using Kimball's approach. Using Power BI, it aims to extract actionable insights for Fudge Inc., guiding strategic decisions, product enhancements, and market expansion based on comprehensive business intelligence.

Language: TSQL - Size: 4.13 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

dennis-barrett/dimdates-dot-com

Source code for the Kimball-style date dimension generator dimdates.com.

Language: JavaScript - Size: 839 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

sarutlaa/Instacart-Market-Basket-Analysis

Building a dimensional model and SQL analysis in SnowflakeDB

Size: 938 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

cadmiumkitty/data-quality-analytics

Data quality analytics prototype linking faceted classification with dimensional data visualisation.

Language: Python - Size: 895 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

rtimbro185/syr_mads_ist722_data_warehouse

Syracuse University, Masters of Applied Data Science - IST 722 Data Warehouse

Language: TSQL - Size: 50.6 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 3 - Forks: 4

andyvroberts/banana

Dimensional modelling using SQL Server and SSAS cube (tabular) creation

Language: SQL - Size: 75.2 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

eddahbarasa/AdvancedDataModeling

Starts with a conceptual model and ends with an interactive Tableau dashboard. In between: building ER diagrams, forward engineering to build normalized databases, dimensional data modelling, and visualizations in Tableau.

Language: Io - Size: 7.63 MB - Last synced at: 4 months ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

mereshd/COVID-database

A comprehensive dimensional model for COVID data, enabling insights for future vaccination campaigns through robust visual analytics.

Size: 23.7 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

lauranonato/Data-Warehousing

Creating a data warehouse for a football game management company, plus some SQL queries to analyse the data.

Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

SaadAhmedWaqar/Data-Warehousing-Redshift

A data warehousing project for retail sales using dimensional modelling best practices with SCD type 2 on AWS Redshift. It uses AWS Lambda, Glue Workflows, and Python Shell jobs to create and automate an ELT pipeline in which batch data arriving in S3 is loaded into Redshift and the necessary transformations are performed to meet requirements.

Language: Python - Size: 411 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ovokpus/analytics-engineering-prototype

Analytics engineering with dbt on BigQuery. This project applies analytics engineering best practices to build a dimensional data model using dbt (data build tool) and BigQuery.

Size: 1.22 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

vioxcd/coursera-dwh-for-bi-capstone

Coursera DWH for BI Capstone (Implementation)

Language: Jupyter Notebook - Size: 20.4 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

MEDHAT-ALHADDAD/Pizza_Runner

Case Study SQL Reporting

Language: Jupyter Notebook - Size: 1.54 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

eddahbarasa/FinalProjectAdvancedDataModelling

Creating an ERD using MySQL Workbench, forward engineering, dimensional data model, Tableau visualisation

Size: 105 KB - Last synced at: 4 months ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

desaikun1996/New-York-City-Arrests-Data-Modelling-Analysis-and-Visualization

Analysis of the New York State Police Department Arrests dataset. Created a dimensional model for the provided dataset. Using Alteryx and Talend, built ETL pipelines to process and clean the data and to create dimensions and facts in the destination database. Further, visualized the necessary details of the database using Tableau and Power BI.

Size: 12.7 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 7

abdulrahmankhayal/AdventureWorksDM

Dimensional modeling of AdventureWorks2017 for sales, creating a data mart. It includes an ETL pipeline that loads the data from AdventureWorks2017 to AdventureWorksDM using SQL Server Integration Services (SSIS) and implements Slowly Changing Dimension (SCD) handling using the SCD wizard and the MERGE statement.

Language: TSQL - Size: 414 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

sskender/business-intelligence

Business Intelligence FER labs

Language: TSQL - Size: 33.3 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

armanawn/ssc-workshop-databases-2022

2022 SCC Data Science & Analytics Workshop on Databases

Language: Jupyter Notebook - Size: 8.89 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 1

desaikun1996/Dallas-Food-Inspection-Data-Analysis

Analysis of a dataset containing information collected from food inspections conducted at various restaurants in Dallas.

Size: 25 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

desaikun1996/Boston-Food-Inspection-Data-Analysis

An examination of a dataset collecting data from food inspections conducted at several Boston restaurants.

Size: 25.1 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

sanketsanap5/University-Student-Enrollment-DW

Design and develop a dimensional data model for University Student Degree Program Enrollment and Performance using the data modeling tool Dataedo.

Size: 1.88 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

maxhardt/aws-redshift-elt

ELT and Dimensional Modeling on AWS Redshift

Language: Jupyter Notebook - Size: 123 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

maxhardt/data-modeling-with-postgres

ETL and Dimensional Modeling with PostgreSQL

Language: Jupyter Notebook - Size: 154 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0