GitHub topics: dimensional-modeling
samuel-wahinya/VelociMart-Business-Intelligence-Report
This project showcases how i helped VelociMart shift from Excel-based reporting to a robust data infrastructure using SQL Server and Tableau. Built around Sales, Customers, and Products Data Marts, it follows the Medallion Architecture (Bronze → Silver → Gold) to deliver clean, analysis-ready views for effective business insights.
Language: TSQL - Size: 2.22 MB - Last synced at: about 2 hours ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

mahmoudparsian/data-warehousing
This repository is a place for the Data Warehousing course at the Information Systems & Analytics department, Santa Clara University.
Language: Jupyter Notebook - Size: 478 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 8 - Forks: 2

deepakramani/dbt-bike-insights
A small scale ETL project that ingests data into the data warehouse using dbt(data build tool) to facilitate transformation and analytics.
Language: Makefile - Size: 4.48 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

breadboard-bi/breadboard
Language: TXL - Size: 5.04 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 8 - Forks: 2

mattiasthalen/adventure-works
Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principles on Adventure Works. Features programmatic model generation, event-enhanced Puppini bridges, and temporal resolution across DAS/DAB/DAR layers.
Language: Python - Size: 18.4 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 95 - Forks: 8

datalopes1/machine_stop
Projeto end-to-end da criação de um Data Warehouse para uma companhia fictícia de mineração chamada Astarte Mining Co.
Language: Jupyter Notebook - Size: 1.13 MB - Last synced at: 18 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

GADES-DATAENG/webinar
Code, scripts, and resources for the Data Engineering Fundamentals Course Webinar, covering Python, data pipelines, Apache Airflow, and more.
Language: Python - Size: 26.9 MB - Last synced at: 23 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

lopezj1/noaa_eda
This project demonstrates an ETL pipeline that processes NOAA's fishing survey data, then makes it available for analysis through an interactive web app.
Language: Python - Size: 1.75 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

MaxineXiong/Cloud-Data-Warehousing-with-AWS-Redshift
This project builds a cloud-based ETL pipeline for Sparkify to move data to a cloud data warehouse. It extracts song and user activity data from AWS S3, stages it in Redshift, and transforms it into a star-schema data model with fact and dimension tables, enabling efficient querying to answer business questions.
Language: Jupyter Notebook - Size: 20.5 KB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

lupusruber/music_analytics
This project processes real-time music event data using Kafka, Apache Spark on Google Cloud Dataproc, and stores the transformed data in BigQuery for analytics, all orchestrated by Airflow and managed with Terraform.
Language: Jupyter Notebook - Size: 22.1 MB - Last synced at: 26 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

mchenryspagg/creating-a-dimensional-data-model
This project involves creating a dimensional data model using MySQL Workbench for a car repair shop’s operations in western Canada by examining a sample invoice,
Size: 960 KB - Last synced at: 28 days ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

mchenryspagg/hng-hire-data-model
The project involves creating a data model for HNG Hire, implementing it in MySQL, and building a Power BI dashboard to display hiring statistics.
Size: 529 KB - Last synced at: 28 days ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

AmmarSahyoun/Dimensional-modeling
OLAP in TSQL and Python
Language: TSQL - Size: 527 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

VeenaBhyrava/ordersDimModelling
Dimensional modelling of order transactions
Language: Python - Size: 17.6 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

hase3b/End-to-End-DWH-Pipeline
This repository contains the end-to-end pipeline for building a data warehouse for a real estate management company. The pipeline includes data generation, ETL process, creation of star schema dimensions and fact table, visualization using Power BI, and automation with Pabbly Connect.
Language: Jupyter Notebook - Size: 2.55 MB - Last synced at: 19 days ago - Pushed at: 11 months ago - Stars: 0 - Forks: 1

BrksGuiOrnelas/data_modelling_dw
Criação de um Data Warehouse (DW) utilizando modelagem dimensional em um esquema estrela.
Size: 334 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

Rconybea/xo-unit
compile-time dimension conversion and checking + support fractional dimension
Language: C++ - Size: 480 KB - Last synced at: 1 day ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

Rconybea/xo-pyunit
python bindings for xo-unit
Language: C++ - Size: 5.86 KB - Last synced at: 1 day ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

ali-bin-kashif/etl-pipeline-project-cola-next
Developed a robust ETL pipeline for Next Cola Pvt. Ltd data which extracts data from many different OLTP sources, converts them into dimensions and facts and load into datawarehouse for analytical workload.
Language: Python - Size: 344 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

doshiharmish/Cross-Company-Insights
Project involves merging customer reviews from Fudgemart and FudgeFlix to create a unified data warehouse using Kimball's approach. Utilizing Power BI, it aims to extract actionable insights for Fudge Inc., guiding strategic decisions, product enhancements, and market expansion based on comprehensive business intelligence.
Language: TSQL - Size: 4.13 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

dennis-barrett/dimdates-dot-com
Source code for the Kimball-style date dimension generator dimdates.com.
Language: JavaScript - Size: 839 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

sarutlaa/Instacart-Market-Basket-Analysis
Building Dimensional Model and SQL Analysis in SnowflakeDB
Size: 938 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

cadmiumkitty/data-quality-analytics
Data quality analytics prototype linking faceted classification with dimensional data visualisation.
Language: Python - Size: 895 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

rtimbro185/syr_mads_ist722_data_warehouse
Syracuse University, Masters of Applied Data Science - IST 722 Data Warehouse
Language: TSQL - Size: 50.6 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 3 - Forks: 4

andyvroberts/banana
Dimensional modelling using SQL Server and SSAS cube (tabular) creation
Language: SQL - Size: 75.2 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

eddahbarasa/AdvancedDataModeling
Starts with a conceptual model ends with a Tableau interactive dash board. In between there is building ER diagrams, forward engineering to build normalized databases, dimensional data modelling and visualizations in tableau.
Language: Io - Size: 7.63 MB - Last synced at: 4 months ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

mereshd/COVID-database
A comprehensive dimensional model for COVID data, enabling insights for future vaccination campaigns through robust visual analytics.
Size: 23.7 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

lauranonato/Data-Warehousing
creating a data warehouse for a football game management company and some SQL queries to analyse data.
Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

SaadAhmedWaqar/Data-Warehousing-Redshift
A Data Warehousing project for retail sales using dimension modelling best practices with SCD type 2 on AWS Redshift. Utilizing AWS Lambda, Glue Workflows and Python Shell jobs to create and automate an ELT pipeline where batch data coming into S3 is loaded onto Redshift and necessary transformations are performed to meet requirements.
Language: Python - Size: 411 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ovokpus/analytics-engineering-prototype
Analytics Engineering with dbt on Bigquery. This project implements the use of Analytics Engineering Best practices to build a dimensional data model, using dbt (data build tool) and BigQuery.
Size: 1.22 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

vioxcd/coursera-dwh-for-bi-capstone
Coursera DWH for BI Capstone (Implementation)
Language: Jupyter Notebook - Size: 20.4 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

MEDHAT-ALHADDAD/Pizza_Runner
Case Study SQL Reporting
Language: Jupyter Notebook - Size: 1.54 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

eddahbarasa/FinalProjectAdvancedDataModelling
Creating erd using MySQL Workbench, Forward engineering, Dimensional data model, Tableau visualisation
Size: 105 KB - Last synced at: 4 months ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

desaikun1996/New-York-City-Arrests-Data-Modelling-Analysis-and-Visualization
Analysis of New York State Police Department Arrests dataset. Created Dimensional Model for the provided dataset. Using Alteryx and Talend, built ETL pipelines to process, clean the data and create dimensions and facts in the destination database. Further, visualized the necessary details of the database using Tableau and PowerBI.
Size: 12.7 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 7

abdulrahmankhayal/AdventureWorksDM
dimensional modeling of AdventureWorks2017 for sales, creating a DataMart. It includes an ETL pipeline that loads the data from AdventureWorks2017 to AdventureWorksDM using SQL Server Integration Services (SSIS) and implements Slowly Changing Dimension (SCD) handling using the SCD wizard and Merge statement.
Language: TSQL - Size: 414 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

sskender/business-intelligence
Business Intelligence FER labs
Language: TSQL - Size: 33.3 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

armanawn/ssc-workshop-databases-2022
2022 SCC Data Science & Analytics Workshop on Databases
Language: Jupyter Notebook - Size: 8.89 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 1

desaikun1996/Dallas-Food-Inspection-Data-Analysis
Analysis of a dataset containing information collected on food inspection done at various restaurants in Dallas.
Size: 25 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

desaikun1996/Boston-Food-Inspection-Data-Analysis
An examination of a dataset collecting data from food inspections conducted at several Boston restaurants.
Size: 25.1 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

sanketsanap5/University-Student-Enrollment-DW
Design and develop a dimensional data model for University Student Degree Program Enrollment and Performance using data modeling tool Dataedo
Size: 1.88 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

maxhardt/aws-redshift-elt
ELT and Dimensional Modeling on AWS Redshift
Language: Jupyter Notebook - Size: 123 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

maxhardt/data-modeling-with-postgres
ETL and Dimensional Modeling with PostgreSQL
Language: Jupyter Notebook - Size: 154 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0
