GitHub topics: datawarehousing
Akshay1010567/tp_final_pulseras_inteligentes
Final project for the "Databases" course of the Bachelor's degree in Data Science (UNSAM), first term of 2025.
Language: Python - Size: 43.9 KB - Last synced at: about 2 hours ago - Pushed at: about 4 hours ago - Stars: 0 - Forks: 0

Hippaho/Sparkify
A music streaming company, Sparkify, has decided that it is time to introduce more automation and monitoring to its data warehouse ETL pipelines and has concluded that the best tool for this is Apache Airflow.
Language: Python - Size: 17.6 KB - Last synced at: about 9 hours ago - Pushed at: about 11 hours ago - Stars: 0 - Forks: 0

Sri-Harsha-K/Sql-Datawarehousing-Project
End-to-end SQL data warehousing project using Bronze-Silver-Gold architecture with ETL pipelines, dimensional modeling, and real-world CRM & ERP datasets in Microsoft SQL Server.
Language: TSQL - Size: 1.13 MB - Last synced at: about 22 hours ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

Daniele1388/DWH---Global-Tourism-Project
This project is a complete SQL-based Data Warehouse built from official UN Tourism statistics (UNWTO), covering global tourism trends from 1995 to 2022.
Language: TSQL - Size: 113 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

Datavault-UK/automate-dv
A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)
Size: 8.34 MB - Last synced at: about 5 hours ago - Pushed at: 14 days ago - Stars: 555 - Forks: 141

DataWithBaraa/sql-data-warehouse-project
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
Language: TSQL - Size: 20.5 MB - Last synced at: 7 days ago - Pushed at: 5 months ago - Stars: 293 - Forks: 241

Alex-Nettekoven/Snowflake-ETL-Pipeline
An end-to-end ETL pipeline using Python to generate sales data and load it into a Snowflake data warehouse.
Language: Python - Size: 38.7 MB - Last synced at: 10 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

mihir-robotics/bigquery-views-performance-benchmark
Compare the performance of Logical Views, Materialized Views, and Table Functions in Google BigQuery.
Size: 164 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

qubasehq/qudata
A comprehensive LLM data processing system designed to transform raw multi-format data into high-quality training datasets optimized for Large Language Models.
Language: Python - Size: 3.75 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

cynkra/dm
Working with relational data models in R
Language: R - Size: 57.1 MB - Last synced at: 5 days ago - Pushed at: 22 days ago - Stars: 518 - Forks: 48

dannydave/sql-data-warehouse-project
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
Language: TSQL - Size: 1.79 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 1 - Forks: 0

melinteflxrin/SoftServe-BigData-Project
End-to-end data warehousing project integrating APIs, ETL workflows, and PostgreSQL for analytics and reporting.
Language: Python - Size: 1.62 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

sam6362/Data-Warehousing-and-Bussiness-Intelligence
Explore the **Data Warehousing and Business Intelligence** repository for hands-on assignments and labs focused on key concepts in data management. Dive into SQL practices and ETL transformations to enhance your skills in data-driven decision-making! 🗃️📊
Size: 1000 Bytes - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

ThomasShikalepo/sql-data-warehouse-project
Building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics
Language: TSQL - Size: 2.04 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

MoShora99/sql-data-warehouse-project
Build a modern data warehouse with MySQL, including ETL processes, data modeling, and analytics.
Language: TSQL - Size: 19.5 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

AnnieFiB/OtherProjects
Powering Data Dreams: From Orchestration to Analytics with Cloud Precision
Language: Scala - Size: 523 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

data-engine-thinking/samples
Practical examples supporting Data Engine Thinking.
Language: TSQL - Size: 192 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

mara/mara-schema
Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables
Language: Python - Size: 3.87 MB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 75 - Forks: 4

MadhukarSaiBabu/Aviation-Trend-Analysis-using-MapReduce-and-R
Developed a data-driven solution leveraging Hadoop MapReduce, Hive, and R to analyze air travel data. Identified trends in passenger volume, route utilization, and peak travel periods, providing actionable insights for optimizing airline operations and improving the passenger experience.
Size: 1.34 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

techsparksguru/data_ai_for_all
Data Analysis, Analytics, Science, AI & ML, LLM etc.
Language: Jupyter Notebook - Size: 23.3 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 14 - Forks: 3

zainea-bogdan/Data_Engineer_Project_WoWCinema
WoWCinema is a project based on a fictional scenario where I stepped into the role of a Data Engineer, designing and building an end-to-end data infrastructure. An ETL pipeline ingests data from multiple sources, transforms it, and loads it into a centralized PostgreSQL data warehouse to power analytics, KPI tracking, and reporting.
Language: Python - Size: 2.14 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Gerardo1909/tp_final_pulseras_inteligentes
Final project for the "Databases" course of the Bachelor's degree in Data Science (UNSAM), first term of 2025.
Language: Python - Size: 503 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

aessing/demo-mdwh
Modern Data Warehouse demos with Azure Databricks, Azure Data Factory & Azure Dedicated SQL pool (formerly SQL DW)
Size: 48.3 MB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 2

Ahmed-Aquarius/Data-Warehousing-Assignment
Solution to a DW assignment using SSIS. Question 2: type-2 slowly changing dimension with incremental loading. Question 3: versioning.
Size: 513 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

anuragmudgal96/data-warehouse-project
Building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
Language: TSQL - Size: 1.3 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

mugabi91/DWH
A Data Warehouse (DWH) project built with Microsoft SQL Server using a three-layer architecture (Bronze, Silver, Gold). Raw CSV data from ERP and CRM systems is ingested, cleaned, and structured using T-SQL for optimized storage and analytics.
Language: TSQL - Size: 1.09 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

yrehim7/data_warehouse_project
A complete, easy-to-follow guide on building a modern data warehouse with SQL Server. Learn how to design ETL processes, create effective data models, and leverage analytics for better insights.
Language: TSQL - Size: 1.54 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

FirasKahlaoui/retail-data-warehouse
This project demonstrates the creation of a Data Warehouse using SQL Server 2022. It includes the design of dimension and fact tables, ETL processes for data integration, Python scripts for synthetic data generation, and SQL queries for KPI analysis to support business decision-making.
Language: Python - Size: 865 KB - Last synced at: 2 months ago - Pushed at: 9 months ago - Stars: 6 - Forks: 0

abu14/data_warehousing_bi_data_pipelines
An OLAP system that contains integrated data and enables faster analytics.
Size: 5.26 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

jazzido/mondrian-rest
A REST interface for Mondrian ROLAP server
Language: Ruby - Size: 3.72 MB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 33 - Forks: 8

Abena61/USA-Employment-Trends--One-million-Data-set-Normalization-Segmentation-Analysis
This project involves the segmentation, normalization, and management of a large employee position dataset. Using Power Query, I broke down over 1 million records into multiple CSV files, each representing key entities such as job titles, employers, locations, industries, and states. The data was then loaded into MySQL Workbench.
Size: 6.33 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

SiyaMathe/Modern-Data-Architecture-Concepts
This project aims to provide a comprehensive overview of modern data architecture concepts, including data lakes, data meshes, cloud-based solutions, and real-time processing, and their application in addressing contemporary data challenges.
Size: 8.79 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Rudra-G-23/SQL-Data-Warehouse-Project
This repo provides a step-by-step approach to building a modern data warehouse using PostgreSQL. It covers the ETL (Extract, Transform, Load) process, data modeling, exploratory data analysis (EDA), and advanced data analysis techniques.
Language: PLpgSQL - Size: 9.32 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

fadzy/Mtech-mini-projects
Here you will find a mix of projects I worked on during my studies.
Language: Jupyter Notebook - Size: 3.35 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Mariann95/SQL_Data_Warehouse_And_Analytics_Project
Building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics. This repository also contains a collection of SQL scripts demonstrating various analytical techniques, such as change-over-time, cumulative, performance, data segmentation, and part-to-whole analysis.
Language: TSQL - Size: 2.45 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

RanaGaballah/DataWareHouse_SSIS
SSIS (SQL Server Integration Services) project for building a data warehouse solution
Language: TSQL - Size: 191 KB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

kstrassheim/datawarehouse-crawler
A content and schema crawler for receiving, updating, and importing various kinds of data into an on-premises or cloud-based SQLServer or Azure-Synapse-Analysis (Azure Datawarehouse SQLServer). Supported sources are SQLServer tables, OData endpoints, CSV files, and Excel files. With multiple sources it can run in parallel mode, creating one thread per connection. Its distinguishing feature is that it creates the target tables itself, using the additional information in source.json. For Azure-Synapse-Analysis it estimates the distribution type and keys. Syncing works entirely without SQL transactions by applying a consistency correction algorithm for very frequently updated fact tables. Five syncing algorithms (see Manual/Insert) and one update algorithm can be selected.
Language: C# - Size: 4.17 MB - Last synced at: 5 months ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 0

Salma-Mamdoh/Datawarehouse_Project
Our project for Datawarehouse Course taken during fall 2024 semester
Language: TSQL - Size: 4.45 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

MohamedHmini/tweetsOLAPing
implementing an end-to-end tweets ETL/Analysis pipeline.
Language: Python - Size: 5.99 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 57 - Forks: 7

praveendecode/YouTube-Data-Harvesting-Warehousing
Efficient YouTube data harvesting and warehousing with Python, SQL, MongoDB, and Streamlit, enabling seamless analysis and visualization for insightful decision-making in content management and audience engagement strategies
Language: Python - Size: 1.34 MB - Last synced at: 21 days ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 4

domenicoboris89/analytics-engineering
Use Case of a Dimensional Data Warehouse built with dbt and BigQuery.
Size: 376 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

TabithaSW/Database_Create
Creating a database management system that takes in SQL statements and generates custom databases, tables, views, etc.
Language: Python - Size: 1.87 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

AmrMohamed16/DEPI-Project
Transforming Passenger Feedback into Actionable Insights: An In-Depth Data Engineering Project to Uncover Key Drivers of Airline Customer Satisfaction and Improve Service Quality
Language: Jupyter Notebook - Size: 10.5 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

CICIFLY/Data_Engineering_Project_Portfolio
Data Engineering, Data Warehouse, Data Mart, Cloud Data, AWS, SAS, Redshift, S3
Language: Jupyter Notebook - Size: 33 MB - Last synced at: 10 months ago - Pushed at: over 4 years ago - Stars: 25 - Forks: 4

BayoAdejare/dw-optimization-insurance
Insurance Data Warehouse Optimization Project
Language: Python - Size: 39.1 KB - Last synced at: about 7 hours ago - Pushed at: 12 months ago - Stars: 0 - Forks: 1

kupokev/DWH-Scripts
This project is meant to provide scripts to automate different functionality in a data warehouse.
Language: Jupyter Notebook - Size: 37.1 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

parthnchoudhury/Enterprise_Data_Architecture
The pragmatic technology journey for an Enterprise Data Model serving reporting, analytical, advanced data science and other digital use cases with integrated data from a variety of sources.
Size: 666 KB - Last synced at: 7 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Shekhar-rv/datawarehouse-project-repo
Northwind Traders database (Warehousing - Creation and ETL)
Language: PLpgSQL - Size: 546 KB - Last synced at: 6 days ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 1

Abhi0323/Full-Cycle-ETL-Analytics-with-Google-Analytics-and-Snowflake
Explore the transformative power of data analytics in my portfolio, where Google Analytics and Snowflake converge to provide comprehensive insights. This project leverages advanced ETL techniques and real-time data integration to enhance user engagement and optimize content delivery effectively.
Language: Jupyter Notebook - Size: 1.48 MB - Last synced at: 10 months ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 4

Salma-Mamdoh/EJADA-Internship-Project
My project from my summer internship with the Data Management team at EJADA
Size: 4.01 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

citysiva180/data_engineering_literacy
This repo is dedicated to learning data engineering concepts, including managing data warehouses, data lakes, marts, cubes, and other data engineering elements.
Size: 8.79 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Phelipe-Sempreboni/informations
Repository for tutorials, information and notes on technology in general.
Size: 63.9 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Kmohamedalie/Data-Warehouse-IBM
Data Warehouse IBM 🔡🔢🏭
Language: Python - Size: 2.96 MB - Last synced at: 7 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

jnbdz/data-warehouse-quickstarts
Data warehouse (DW) quickstarts! 💽
Size: 350 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

NouraAlgohary/Gravity-Books-ETL-and-Data-Warehouse
Language: TSQL - Size: 868 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 1

cqllum/schema2dwh
⚡ Automatically produce a data model on your database using its information schema using GenAI.
Language: Python - Size: 7.81 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

DivineSamOfficial/Banking-Data-Warehouse-Pipeline
Banking Data Warehouse Pipeline
Language: Python - Size: 52.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

jaehyeon-kim/iceberg-etl-demo
Data Warehousing ETL Demo with Apache Iceberg on EMR Local Environment
Language: Python - Size: 1.26 MB - Last synced at: 20 days ago - Pushed at: about 3 years ago - Stars: 4 - Forks: 1

danlsn/causality
A Personal Data Platform and the culmination of years of curiosity and learning in the Data Engineering space.
Size: 8.99 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

Gokulakkrizhna/youtube_data_harvesting
This project enables users to fetch data from YouTube by utilizing the YouTube Data API key. The retrieved data is then stored in a MySQL database. Subsequently, the stored data is analyzed and presented in a Streamlit web application using Pandas DataFrame.
Language: Python - Size: 84 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

pacicap/Data-Warehousing
Extraction of data from different database sources, transformation (unification and cleaning) of the extracted data, and loading into the data warehouse
Size: 23.4 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

essraahmed/Data-Warehouse-With-Redshift
Data Warehouse with AWS Redshift and Visualizing data using Power BI
Language: Jupyter Notebook - Size: 618 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 1

ankurkum93/BIKE-MS
Size: 3.52 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

Mouhamed-Jinja/Python-Airflow-Postgres-Docker-DWH
This repository contains Apache Airflow Directed Acyclic Graphs (DAGs) and associated scripts for orchestrating an Extract, Transform, Load (ETL) workflow. The workflow is designed to extract data from a source, perform transformations, and load it into a data warehouse.
Language: Jupyter Notebook - Size: 11.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

abhigupta2909/DataWarehousing-Chicago-Food-inspection
DataWarehousing on Chicago food database.
Language: Python - Size: 4.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Dina-Hosny/Analyze-and-Model-Airline-System
Analyzing Airline System and Building Data Warehouse Model to Store the Data and Answer Some Business Questions
Language: PLSQL - Size: 5.31 MB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

adeelnasir0405/Data-Warehousing-Advance-Tableau-
Employed statistical analysis, forecasting, clustering, and control chart techniques to extract insights and monitor data variation effectively, showcasing Tableau's advanced capabilities for informed decision-making.
Size: 446 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mehassanhmood/DataEngineering-Project
A data engineering project.
Language: Shell - Size: 3.23 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

YasmineBoukhalfa/BI-project
Project BI / DATA Warehouse / My Master project
Language: CSS - Size: 34.3 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

Muthukumar0908/Final_retail_sales_forecast
This project uses advanced time series forecasting models to predict department-wide sales for each store for the upcoming year and visualizes the data in a Streamlit GUI.
Language: Jupyter Notebook - Size: 607 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

yingzima/Business-Intelligence-Project
BI project for an insurance technology company
Size: 894 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 3

RichardOgujawa/academic-writeups
A collection of my academic write-ups including topics pertaining to Machine Learning, Statistics, Data Analytics, etc.
Size: 0 Bytes - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

caupolicanre/datawarehouse-ElProfesional
Language: Python - Size: 1.03 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Mouhamed-Jinja/PostgresBlend-Data-Pipeline
"PostgresBlend Data Pipeline" is a comprehensive data integration solution designed to seamlessly merge diverse data sources into a unified PostgreSQL Data Warehouse. This project streamlines the process of integrating data from CSVs, JSON, Parquet, and MySQL databases, utilizing Apache Spark for efficient transformation and organization.
Language: Python - Size: 51.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mario-galindo/dbtlearn
Data Warehouse dbt project
Size: 71.3 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Dineshkumar56/youtubedataharvesting
YouTube Data Harvesting and Warehousing using SQL, MongoDB and Streamlit
Language: Python - Size: 10.7 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

mario-galindo/dbt-core-demo
Proof of concept to manage data warehouse data transformations
Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

praveendecode/iss-data-warehouse-mongodb-sql-project
Space Exploration Data Fusion : Unleashing the International Space Station Insights with MongoDB and SQL Integration
Language: Python - Size: 40 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

jennaallen/football_schools
Using a dimensional model, data warehouse, and Tableau, I explored data from the College Scorecard and NCAA Division I FBS football games 🏈
Language: R - Size: 3.55 MB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 0

pgrondein/data_platform_for_data_analytics
The goal of this project is to design a data platform for retail data analytics.
Language: Python - Size: 43 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

pausanchezv/Bases-de-dades-II-Big-Data
Advanced Databases course of the Computer Engineering degree (Universitat de Barcelona)
Language: Java - Size: 106 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

lwdovico/LDS-Project
Repository of a Data Science Project
Language: Jupyter Notebook - Size: 14.7 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

CEDStandards/CEDS-Collaborative-Exchange
The CEDS Collaborative Exchange is a repository of code developed by the community that interacts with the CEDS Integration Data Store and the CEDS Elements repositories. All resources provided in this community are considered free and open source.
Language: TSQL - Size: 19.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 14 - Forks: 3

MiladNooraei/Quera-Superstore Fork of FarzanehSoltanzadeh/Quera-Superstore
Conducted data pre-processing, optimized data warehousing, applied statistical analysis and machine learning techniques, and created visually compelling Power BI visualizations to derive valuable insights for informed decision-making.
Language: Jupyter Notebook - Size: 22.8 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

mehroosali/bigquery-sparksql-batch-etl
Batch ETL pipeline project on GCP to load and transform daily flight data using Spark to update tables in BigQuery. The pipeline is automated using Airflow.
Language: Python - Size: 11.8 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

jmcorreia/Druid_SSB_Benchmark
Ingestion Tasks and Scripts used to benchmark Druid's performance, using SSB Benchmark.
Language: Shell - Size: 39.1 KB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 3

ChetanMJ/Data-Warehousing
This is a Datawarehouse ETL application for Spain Airbnb data
Language: TSQL - Size: 1.43 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

dharm18/stock-datawarehouse
A data warehouse and business intelligence project on Stock market dataset to answer non-trivial BI queries.
Language: R - Size: 79.6 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 6 - Forks: 3

ovokpus/analytics-engineering-prototype
Analytics engineering with dbt on BigQuery. This project applies analytics engineering best practices to build a dimensional data model using dbt (data build tool) and BigQuery.
Size: 1.22 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

juned-56/current-covid-info
Check daily covid information
Language: Python - Size: 1000 Bytes - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

smath17/DeclarativeETL
Generate DDL and Python (pygrametl) code from a shared specification
Language: Python - Size: 9.77 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

kromozome2003/Snowflake-Json-DataPipeline
Building a JSON data pipeline within Snowflake using Streams and Tasks
Language: TSQL - Size: 45.8 MB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 22 - Forks: 7

Dina-Hosny/Retail-Store-Data-Modeling-and-Analysis-using-DataStage
The project implements a star-schema data warehousing flow, then uses IBM InfoSphere DataStage to develop efficient ETL pipelines that create data marts and perform some analysis on them.
Size: 132 KB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

epilif1017a/bigdatabenchmarks
Code and Documents related to the SSB+ Benchmark
Language: C - Size: 171 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 0

hams71/Dbt_Demo
Using dbt to load (seed) data, apply some transformations, and finally load the result into a cloud warehouse
Language: Python - Size: 80.5 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

escobarana/SSIS_DWH
Datawarehouse & ETL using Visual Studio 2019 SSIS
Size: 92.8 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

rehamessa/Airline-System-DWH-Modeling Fork of ManarAymanF/Airline-System-DWH-Modeling
A leading airline company engaged our services to support the executive management in their analysis of current business processes and identification of new opportunities for company growth.
Size: 324 KB - Last synced at: 8 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

ibromley/sparkify-s3-datalake
Data Warehousing with Spark & Amazon S3
Language: Python - Size: 393 KB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 3 - Forks: 0

victorskl/genomic-bigdata-spark
Genomic BigData Warehousing with Apache Spark and LakeHouse Architecture
Language: Jupyter Notebook - Size: 172 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 0

pranjals26/Data-Management-Project-Flight-delays
Data Cleaning and Analysis on Flight Delay & Cancellation
Language: Jupyter Notebook - Size: 9.6 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0
