An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: datawarehousing

Akshay1010567/tp_final_pulseras_inteligentes

Trabajo práctico final de la materia "Base de Datos" de la Licenciatura en Ciencia de Datos (UNSAM). 1C-2025

Language: Python - Size: 43.9 KB - Last synced at: about 2 hours ago - Pushed at: about 4 hours ago - Stars: 0 - Forks: 0

Hippaho/Sparkify

A music streaming company, Sparkify, has decided that it is time to introduce more automation and monitoring to their data warehouse ETL pipelines and come to the conclusion that the best tool to achieve this is Apache Airflow.

Language: Python - Size: 17.6 KB - Last synced at: about 9 hours ago - Pushed at: about 11 hours ago - Stars: 0 - Forks: 0

Sri-Harsha-K/Sql-Datawarehousing-Project

End-to-end SQL data warehousing project using Bronze-Silver-Gold architecture with ETL pipelines, dimensional modeling, and real-world CRM & ERP datasets in Microsoft SQL Server.

Language: TSQL - Size: 1.13 MB - Last synced at: about 22 hours ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

Daniele1388/DWH---Global-Tourism-Project

This project is a complete SQL-based Data Warehouse built from official UN Tourism statistics (UNWTO), covering global tourism trends from 1995 to 2022.

Language: TSQL - Size: 113 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

Datavault-UK/automate-dv

A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)

Size: 8.34 MB - Last synced at: about 5 hours ago - Pushed at: 14 days ago - Stars: 555 - Forks: 141

DataWithBaraa/sql-data-warehouse-project

A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.

Language: TSQL - Size: 20.5 MB - Last synced at: 7 days ago - Pushed at: 5 months ago - Stars: 293 - Forks: 241

Alex-Nettekoven/Snowflake-ETL-Pipeline

An end-to-end ETL pipeline using Python to generate sales data and load it into a Snowflake data warehouse.

Language: Python - Size: 38.7 MB - Last synced at: 10 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

mihir-robotics/bigquery-views-performance-benchmark

Compare the performance of Logical Views, Materialized Views, and Table Functions in Google BigQuery.

Size: 164 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

qubasehq/qudata

A comprehensive LLM data processing system designed to transform raw multi-format data into high-quality training datasets optimized for Large Language Models.

Language: Python - Size: 3.75 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

cynkra/dm

Working with relational data models in R

Language: R - Size: 57.1 MB - Last synced at: 5 days ago - Pushed at: 22 days ago - Stars: 518 - Forks: 48

dannydave/sql-data-warehouse-project

A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.

Language: TSQL - Size: 1.79 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 1 - Forks: 0

melinteflxrin/SoftServe-BigData-Project

End-to-end data warehousing project integrating APIs, ETL workflows, and PostgreSQL for analytics and reporting.

Language: Python - Size: 1.62 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

sam6362/Data-Warehousing-and-Bussiness-Intelligence

Explore the **Data Warehousing and Business Intelligence** repository for hands-on assignments and labs focused on key concepts in data management. Dive into SQL practices and ETL transformations to enhance your skills in data-driven decision-making! 🗃️📊

Size: 1000 Bytes - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

ThomasShikalepo/sql-data-warehouse-project

Building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics

Language: TSQL - Size: 2.04 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

MoShora99/sql-data-warehouse-project

Build modern data warehouse with mysql, Including ETL processes, data modeling and analytics

Language: TSQL - Size: 19.5 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

AnnieFiB/OtherProjects

Powering Data Dreams: From Orchestration to Analytics with Cloud Precision

Language: Scala - Size: 523 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

data-engine-thinking/samples

Practical examples supporting Data Engine Thinking.

Language: TSQL - Size: 192 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

mara/mara-schema

Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables

Language: Python - Size: 3.87 MB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 75 - Forks: 4

MadhukarSaiBabu/Aviation-Trend-Analysis-using-MapReduce-and-R

Developed a data-driven solution leveraging Hadoop MapReduce, Hive, and R to analyze air travel data. Identified trends in passenger volume, route utilization, and peak travel periods, providing actionable insights for optimizing airline operations and improving the passenger experience.

Size: 1.34 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

techsparksguru/data_ai_for_all

Data Analysis, Analytics, Science, AI & ML, LLM etc.

Language: Jupyter Notebook - Size: 23.3 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 14 - Forks: 3

zainea-bogdan/Data_Engineer_Project_WoWCinema

WoWCinema is a project based on a fictional scenario where I stepped into the role of a Data Engineer, designing and building an end-to-end Data Infrastructure. A ETL pipeline ingests data from multiple sources, transforms it, and loads it into a centralized PostgreSQL data warehouse to power analytics, KPI tracking, and reporting

Language: Python - Size: 2.14 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Gerardo1909/tp_final_pulseras_inteligentes

Trabajo práctico final de la materia "Base de Datos" de la Licenciatura en Ciencia de Datos (UNSAM). 1C-2025

Language: Python - Size: 503 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

aessing/demo-mdwh

Modern Dataware House Demos with Azure Databricks, Azure Data Factory & Azure Dedicated SQL pool (formerly SQL DW)

Size: 48.3 MB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 2

Ahmed-Aquarius/Data-Warehousing-Assignment

Solution of DW assignment with SSIS. Question2: type-2 slowly changing dimension with incremental loading. Quesiton3: Versioning

Size: 513 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

anuragmudgal96/data-warehouse-project

Building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.

Language: TSQL - Size: 1.3 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

mugabi91/DWH

A Data Warehouse (DWH) project built with Microsoft SQL Server using a three-layer architecture (Bronze, Silver, Gold). Raw CSV data from ERP and CRM systems is ingested, cleaned, and structured using T-SQL for optimized storage and analytics. Let me know if you want it tweaked further! 🚀

Language: TSQL - Size: 1.09 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

yrehim7/data_warehouse_project

A complete, easy-to-follow guide on building a modern data warehouse with SQL Server. Learn how to design ETL processes, create effective data models, and leverage analytics for better insights.

Language: TSQL - Size: 1.54 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

FirasKahlaoui/retail-data-warehouse

This project demonstrates the creation of a Data Warehouse using SQL Server 2022. It includes the design of dimension and fact tables, ETL processes for data integration, Python scripts for synthetic data generation, and SQL queries for KPI analysis to support business decision-making.

Language: Python - Size: 865 KB - Last synced at: 2 months ago - Pushed at: 9 months ago - Stars: 6 - Forks: 0

abu14/data_warehousing_bi_data_pipelines

An OLAP system that contains integrated data and enable faster analytics.

Size: 5.26 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

jazzido/mondrian-rest

A REST interface for Mondrian ROLAP server

Language: Ruby - Size: 3.72 MB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 33 - Forks: 8

Abena61/USA-Employment-Trends--One-million-Data-set-Normalization-Segmentation-Analysis

This project involves the segmentation, normalization, and management of a large employee position dataset. Using Power Query, I broke down over 1 million records into multiple CSV files, each representing key entities such as job titles, employers, locations, industries, and states. The data was then loaded into MySQL Workbench.

Size: 6.33 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

SiyaMathe/Modern-Data-Architecture-Concepts

This project aims to provide a comprehensive overview of modern data architecture concepts, including data lakes, data meshes, cloud-based solutions, and real-time processing, and their application in addressing contemporary data challenges.

Size: 8.79 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Rudra-G-23/SQL-Data-Warehouse-Project

This repo provides a step-by-step approach to building a modern data warehouse using PostgreSQL. It covers the ETL (Extract, Transform, Load) process, data modeling, exploratory data analysis (EDA), and advanced data analysis techniques.

Language: PLpgSQL - Size: 9.32 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

fadzy/Mtech-mini-projects

here you find a mix of projects l worked on during my studies

Language: Jupyter Notebook - Size: 3.35 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Mariann95/SQL_Data_Warehouse_And_Analytics_Project

Building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics. This repository also contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis.

Language: TSQL - Size: 2.45 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

RanaGaballah/DataWareHouse_SSIS

SSIS (SQL Server Integration Services) project for building a data warehouse solution

Language: TSQL - Size: 191 KB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

kstrassheim/datawarehouse-crawler

This is a content and schema crawler tool to receive, update and import various kinds of data into a Onprem or Cloud based SQLServer or Azure-Synapse-Analysis (Azure Datawarehouse SQLServer). As source it supports SQLServer Tables, ODATA Endpoints, CSV Files or Excel Files. For multiple sources it can run in parallel mode where it would make a thread for each connection. The speciality of this crawler is that it creates the target tables by himself using the additional info from source.json. In case of Azure-Synapse-Analysis it would estimate the distribution type and keys. The syncing works completely without SQL Transactions by using a consistency correction algorithm for very frequent fact tables. There are 5 Syncing Algorithms (see Manual/Insert) which can be selected as well as one Update Algorithm.

Language: C# - Size: 4.17 MB - Last synced at: 5 months ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 0

Salma-Mamdoh/Datawarehouse_Project

Our project for Datawarehouse Course taken during fall 2024 semester

Language: TSQL - Size: 4.45 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

MohamedHmini/tweetsOLAPing

implementing an end-to-end tweets ETL/Analysis pipeline.

Language: Python - Size: 5.99 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 57 - Forks: 7

praveendecode/YouTube-Data-Harvesting-Warehousing

Efficient YouTube data harvesting and warehousing with Python, SQL, MongoDB, and Streamlit, enabling seamless analysis and visualization for insightful decision-making in content management and audience engagement strategies

Language: Python - Size: 1.34 MB - Last synced at: 21 days ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 4

domenicoboris89/analytics-engineering

Use Case of a Dimensional Data Warehouse built with dbt and BigQuery.

Size: 376 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

TabithaSW/Database_Create

Creating a database management system that takes in SQL statements and generates custom databases, tables, views, e.t.c

Language: Python - Size: 1.87 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

AmrMohamed16/DEPI-Project

Transforming Passenger Feedback into Actionable Insights: An In-Depth Data Engineering Project to Uncover Key Drivers of Airline Customer Satisfaction and Improve Service Quality

Language: Jupyter Notebook - Size: 10.5 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

CICIFLY/Data_Engineering_Project_Portfolio

Data Engineering, Data Warehouse, Data Mart, Cloud Data, AWS, SAS, Redshift, S3

Language: Jupyter Notebook - Size: 33 MB - Last synced at: 10 months ago - Pushed at: over 4 years ago - Stars: 25 - Forks: 4

BayoAdejare/dw-optimization-insurance

Insurance Data Warehouse Optimization Project

Language: Python - Size: 39.1 KB - Last synced at: about 7 hours ago - Pushed at: 12 months ago - Stars: 0 - Forks: 1

kupokev/DWH-Scripts

This project is meant to provide scripts to automate different functionality in a data warehouse.

Language: Jupyter Notebook - Size: 37.1 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

parthnchoudhury/Enterprise_Data_Architecture

The pragmatic technology journey for an Enterprise Data Model serving reporting, analytical, advanced data science and other digital use cases with integrated data from a variety of sources.

Size: 666 KB - Last synced at: 7 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Shekhar-rv/datawarehouse-project-repo

Northwind Traders database (Warehousing - Creation and ETL)

Language: PLpgSQL - Size: 546 KB - Last synced at: 6 days ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 1

Abhi0323/Full-Cycle-ETL-Analytics-with-Google-Analytics-and-Snowflake

Explore the transformative power of data analytics in my portfolio, where Google Analytics and Snowflake converge to provide comprehensive insights. This project leverages advanced ETL techniques and real-time data integration to enhance user engagement and optimize content delivery effectively.

Language: Jupyter Notebook - Size: 1.48 MB - Last synced at: 10 months ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 4

Salma-Mamdoh/EJADA-Internship-Project

My Project at my Summer Internship At Data Management Team At EJADA

Size: 4.01 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

citysiva180/data_engineering_literacy

This repo is completely dedicated for Learning Data Engineering Concepts which includes Managing Data Ware House, Data Lakes, Marts, Cubes and Other Data Engineering Elements

Size: 8.79 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Phelipe-Sempreboni/informations

Repository for tutorials, information and notes on technology in general.

Size: 63.9 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Kmohamedalie/Data-Warehouse-IBM

Data Warehouse IBM 🔡🔢🏭

Language: Python - Size: 2.96 MB - Last synced at: 7 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

jnbdz/data-warehouse-quickstarts

Data warehouse (DW) quickstarts! :minidisc:

Size: 350 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

NouraAlgohary/Gravity-Books-ETL-and-Data-Warehouse

Language: TSQL - Size: 868 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 1

cqllum/schema2dwh

⚡ Automatically produce a data model on your database using its information schema using GenAI.

Language: Python - Size: 7.81 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

DivineSamOfficial/Banking-Data-Warehouse-Pipeline

Banking Data Warehouse Pipeline

Language: Python - Size: 52.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

jaehyeon-kim/iceberg-etl-demo

Data Warehousing ETL Demo with Apache Iceberg on EMR Local Environment

Language: Python - Size: 1.26 MB - Last synced at: 20 days ago - Pushed at: about 3 years ago - Stars: 4 - Forks: 1

danlsn/causality

A Personal Data Platform and the culmination of years of curiosity and learning in the Data Engineering space.

Size: 8.99 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

Gokulakkrizhna/youtube_data_harvesting

This project enables users to fetch data from YouTube by utilizing the YouTube Data API key. The retrieved data is then stored in a MySQL database. Subsequently, the stored data is analyzed and presented in a Streamlit web application using Pandas DataFrame.

Language: Python - Size: 84 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

pacicap/Data-Warehousing

Extraction of data from different Database sources, Transformation (unification and cleaning) of extracted data and laoding into the data warehouse

Size: 23.4 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

essraahmed/Data-Warehouse-With-Redshift

Data Warehouse with AWS Redshift and Visualizing data using Power BI

Language: Jupyter Notebook - Size: 618 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 1

ankurkum93/BIKE-MS

Size: 3.52 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

Mouhamed-Jinja/Python-Airflow-Postgres-Docker-DWH

This repository contains Apache Airflow Directed Acyclic Graphs (DAGs) and associated scripts for orchestrating an Extract, Transform, Load (ETL) workflow. The workflow is designed to extract data from a source, perform transformations, and load it into a data warehouse.

Language: Jupyter Notebook - Size: 11.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

abhigupta2909/DataWarehousing-Chicago-Food-inspection

DataWarehousing on Chicago food database.

Language: Python - Size: 4.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Dina-Hosny/Analyze-and-Model-Airline-System

Analyzing Airline System and Building Data Warehouse Model to Store the Data and Answer Some Business Questions

Language: PLSQL - Size: 5.31 MB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

adeelnasir0405/Data-Warehousing-Advance-Tableau-

Employed statistical analysis, forecasting, clustering, and control chart techniques to extract insights and monitor data variation effectively, showcasing Tableau's advanced capabilities for informed decision-making.

Size: 446 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mehassanhmood/DataEngineering-Project

A data engineering project.

Language: Shell - Size: 3.23 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

YasmineBoukhalfa/BI-project

Project BI / DATA Warehouse / My Master project

Language: CSS - Size: 34.3 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

Muthukumar0908/Final_retail_sales_forecast

In this project Utilizing advanced time series forecasting models, successfully predicted department-wide sales for each store for the upcoming year and Visualizing the data in streamlit GUI.

Language: Jupyter Notebook - Size: 607 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

yingzima/Business-Intelligence-Project

BI project for an insurance technology company

Size: 894 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 3

RichardOgujawa/academic-writeups

A collection of my academic write-ups including topics pertaining to Machine Learning, Statistics, Data Analytics, etc.

Size: 0 Bytes - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

caupolicanre/datawarehouse-ElProfesional

Language: Python - Size: 1.03 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Mouhamed-Jinja/PostgresBlend-Data-Pipeline

"PostgresBlend Data Pipeline" is a comprehensive data integration solution designed to seamlessly merge diverse data sources into a unified PostgreSQL Data Warehouse. This project streamlines the process of integrating data from CSVs, JSON, Parquet, and MySQL databases, utilizing Apache Spark for efficient transformation and organization.

Language: Python - Size: 51.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mario-galindo/dbtlearn

Data Warehouse dbt project

Size: 71.3 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Dineshkumar56/youtubedataharvesting

YouTube Data Harvesting and Warehousing using SQL, MongoDB and Streamlit

Language: Python - Size: 10.7 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

mario-galindo/dbt-core-demo

Proof of concept to manage data warehouse data transformations

Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

praveendecode/iss-data-warehouse-mongodb-sql-project

Space Exploration Data Fusion : Unleashing the International Space Station Insights with MongoDB and SQL Integration

Language: Python - Size: 40 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

jennaallen/football_schools

Using a dimensional model, data warehouse, and Tableau I explored data from the College Scorecard and NCAA Division I FBS football games :football:

Language: R - Size: 3.55 MB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 0

pgrondein/data_platform_for_data_analytics

This project goal is to design a Data Platform for retail Data Analytics.

Language: Python - Size: 43 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

pausanchezv/Bases-de-dades-II-Big-Data

Assignatura Bases de Dades Avançades d'Enginyeria Informàtica (Universitatd de Barcelona)

Language: Java - Size: 106 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

lwdovico/LDS-Project

Repository of a Data Science Project

Language: Jupyter Notebook - Size: 14.7 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

CEDStandards/CEDS-Collaborative-Exchange

The CEDS Collaborative Exchange is a repository of code developed by the community that interacts with the CEDS Integration Data Store and the CEDS Elements repositories. All resources provided in this community are considered free and open source.

Language: TSQL - Size: 19.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 14 - Forks: 3

MiladNooraei/Quera-Superstore Fork of FarzanehSoltanzadeh/Quera-Superstore

Conducted data pre-processing, optimized data warehousing, applied statistical analysis and machine learning techniques, and created visually compelling Power BI visualizations to derive valuable insights for informed decision-making.

Language: Jupyter Notebook - Size: 22.8 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

mehroosali/bigquery-sparksql-batch-etl

Batch ETL pipeline project on GCP to load and transform daily flight data using Spark to update tables in BigQuery. The pipeline is automated using Airflow.

Language: Python - Size: 11.8 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

jmcorreia/Druid_SSB_Benchmark

Ingestion Tasks and Scripts used to benchmark Druid's performance, using SSB Benchmark.

Language: Shell - Size: 39.1 KB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 1 - Forks: 3

ChetanMJ/Data-Warehousing

This is a Datawarehouse ETL application for Spain Airbnb data

Language: TSQL - Size: 1.43 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

dharm18/stock-datawarehouse

A data warehouse and business intelligence project on Stock market dataset to answer non-trivial BI queries.

Language: R - Size: 79.6 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 6 - Forks: 3

ovokpus/analytics-engineering-prototype

Analytics Engineering with dbt on Bigquery. This project implements the use of Analytics Engineering Best practices to build a dimensional data model, using dbt (data build tool) and BigQuery.

Size: 1.22 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

juned-56/current-covid-info

Check daily covid information

Language: Python - Size: 1000 Bytes - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

smath17/DeclarativeETL

Generate DDL and Python (PygramETL) code from shared specification

Language: Python - Size: 9.77 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

kromozome2003/Snowflake-Json-DataPipeline

Building Json data pipeline within Snowflake using Streams and Tasks

Language: TSQL - Size: 45.8 MB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 22 - Forks: 7

Dina-Hosny/Retail-Store-Data-Modeling-and-Analysis-using-DataStage

The project implements a star-schema data warehousing flow, then utilize IBM InfoSphere DataStage to develop efficient ETL pipelines to create data marts and perform some analysis on them.

Size: 132 KB - Last synced at: 6 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

epilif1017a/bigdatabenchmarks

Code and Documents related to the SSB+ Benchmark

Language: C - Size: 171 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 0

hams71/Dbt_Demo

Using dbt to load(seed) and do some transformations and then finally load that data to some Cloud Warehouse

Language: Python - Size: 80.5 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

escobarana/SSIS_DWH

Datawarehouse & ETL using Visual Studio 2019 SSIS

Size: 92.8 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

rehamessa/Airline-System-DWH-Modeling Fork of ManarAymanF/Airline-System-DWH-Modeling

A leading airline company engaged our services to support the executive management in their analysis of current business processes and identification of new opportunities for company growth.

Size: 324 KB - Last synced at: 8 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

ibromley/sparkify-s3-datalake

Data Warehousing with Spark & Amazon S3

Language: Python - Size: 393 KB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 3 - Forks: 0

victorskl/genomic-bigdata-spark

Genomic BigData Warehousing with Apache Spark and LakeHouse Architecture

Language: Jupyter Notebook - Size: 172 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 0

pranjals26/Data-Management-Project-Flight-delays

Data Cleaning and Analysis on Flight Delay & Cancellation

Language: Jupyter Notebook - Size: 9.6 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0