An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: azure-data-lake-gen2

airscholar/FootballDataEngineering

An end-to-end data engineering pipeline that fetches data from Wikipedia, cleans and transforms it with Apache Airflow and saves it on Azure Data Lake. Other processing takes place on Azure Data Factory, Azure Synapse and Tableau.

Language: Python - Size: 469 KB - Last synced at: 14 days ago - Pushed at: over 1 year ago - Stars: 23 - Forks: 19

zBalachandar/Sales-Data-Analytics-Azure-Data-Engineering-End-to-End-Project-13

This project builds an End-to-End Azure Data Engineering Pipeline, performing ETL and Analytics Reporting on the AdventureWorks2017LT Database.

Language: Jupyter Notebook - Size: 23.3 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 9 - Forks: 4

MohssineSERRAJI/azure-data-lake

A lightweight toolkit for Azure Data Lake Storage Gen2 operations, featuring AzCopy commands and Databricks integration examples. Includes sample data and notebooks for quick experimentation with data lake architectures.

Language: Jupyter Notebook - Size: 449 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

SayamAlt/Azure-Synapse-Analytics-Course

This repository contains all scripts and notebooks created in the Azure Synapse Analytics course.course.

Language: TSQL - Size: 7.08 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

brsavii/data-engineering-project

Development of a Data Pipeline using Azure Synapse

Language: Python - Size: 4.68 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

Yusreen/BingDataAnalyticsPlatform

An end-to-end data engineering pipeline that fetches data from the BingAPI, cleans and transforms it with Azure Databricks.Sentiment Analysis is performed in AzureML and the data is visualized using Tableau.

Language: Jupyter Notebook - Size: 360 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

zBalachandar/Tokyo-Olympic-Data-Analytics-Azure-End-To-End-Data-Engineering-Project-12

Tokyo-olympic-azure-data-engineering-end-to-end-project

Language: HTML - Size: 44.5 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

UdbhavSrivastava/Tokyo-Olympics-2020-DataPipeline

A cutting-edge data project leverages Azure's suite of services to seamlessly transform raw data from GitHub into actionable insights. Using Azure Data Factory for data ingestion, Databricks for PySpark transformations, Synapse Analytics for advanced analysis, and Power BI for intuitive visualization, this project navigates complex data workflows..

Language: Jupyter Notebook - Size: 173 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

rohitkulkarni08/Azure-ETL-AmazonSalesAnalysis

A comprehensive ETL pipeline and sales analysis project leveraging Microsoft Azure and PySpark, designed to optimize e-commerce sales by providing actionable insights through detailed data analysis.

Language: Jupyter Notebook - Size: 8.04 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

rohitkulkarni08/Azure-ETL-Pipeline-MovieAnalytics

This project demonstrates an ETL pipeline using Microsoft Azure for IMDb Movie Rating Dataset analysis. It covers data extraction from Azure Blob Storage, transformation with Azure Databricks, and loading into Azure SQL using Azure Data Factory. The pipeline automates insights generation and is a practical example of cloud-based data engineering.

Language: Jupyter Notebook - Size: 15.9 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Shashi42/Azure-End-to-End-Sales-Data-Analytics-Pipeline

This project builds an End-to-End Azure Data Engineering Pipeline, performing ETL and Analytics Reporting on the AdventureWorks2022LT Database.

Language: Jupyter Notebook - Size: 501 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

manaswipatil/Tokyo-Olympics-Data-Analytics-in-Azure

Azure pipeline for data analytics on Tokyo Olympics data

Size: 507 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

waqarg2001/Formula1-Insights-DE

Formula 1 race data engineering project which utilises azure services and databricks to ingest and analyse the data.

Language: Python - Size: 2.92 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0