Topic: "azure-databricks"
Azure/azure-cosmosdb-spark 📦
Apache Spark Connector for Azure Cosmos DB
Size: 192 MB - Last synced at: 1 day ago - Pushed at: 4 months ago - Stars: 203 - Forks: 120

AzureCosmosDB/scenario-based-labs
Cosmos DB oriented labs for IoT and Retail scenarios
Language: JavaScript - Size: 316 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 112 - Forks: 88

microsoft/Azure-Databricks-NYC-Taxi-Workshop
An Azure Databricks workshop leveraging the New York Taxi and Limousine Commission Trip Records dataset
Language: Scala - Size: 42.3 MB - Last synced at: 6 days ago - Pushed at: about 2 years ago - Stars: 109 - Forks: 109

microsoft/Purview-ADB-Lineage-Solution-Accelerator
A connector to ingest Azure Databricks lineage into Microsoft Purview
Language: C# - Size: 12.8 MB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 92 - Forks: 58

microsoft/A-TALE-OF-THREE-CITIES
Analyzing the safety (311) dataset published by Azure Open Datasets for Chicago, Boston and New York City using SparkR, SparkSQL and Azure Databricks, with visualization using ggplot2 and leaflet. The focus is on descriptive analytics, visualization, clustering, time series forecasting and anomaly detection.
Language: R - Size: 21.8 MB - Last synced at: 6 days ago - Pushed at: about 4 years ago - Stars: 86 - Forks: 34

AdamPaternostro/Azure-Databricks-Dev-Ops
Complete end-to-end sample of doing DevOps with Azure Databricks
Language: Shell - Size: 887 KB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 69 - Forks: 102

tomaztk/Azure-Databricks
Azure Databricks - Advent of 2020 Blogposts
Language: Jupyter Notebook - Size: 44.9 MB - Last synced at: 3 months ago - Pushed at: almost 3 years ago - Stars: 60 - Forks: 49

Jayvardhan-Reddy/Azure-Certification-DP-200
Road to Azure Data Engineer Part-I: DP-200 - Implementing an Azure Data Solution
Size: 4.42 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 57 - Forks: 46

bhavink/databricks
Databricks Platform - Architecture, Security, Automation and much more!!
Language: Jupyter Notebook - Size: 14.4 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 50 - Forks: 27

rafaelpierre/pyjaws
PyJaws: A Pythonic Way to Define Databricks Jobs and Workflows
Language: Python - Size: 3.46 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 43 - Forks: 4

airscholar/FootballDataEngineering
An end-to-end data engineering pipeline that fetches data from Wikipedia, cleans and transforms it with Apache Airflow and saves it on Azure Data Lake. Other processing takes place on Azure Data Factory, Azure Synapse and Tableau.
Language: Python - Size: 469 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 23 - Forks: 19

syedhassaanahmed/databricks-notebooks
Collection of Databricks and Jupyter Notebooks
Language: Jupyter Notebook - Size: 742 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 21 - Forks: 15

Annielytix/Ready2019_AA_AI319
What the Hack Challenge format of the Advanced Databricks Workshop
Language: Jupyter Notebook - Size: 39.3 MB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 17 - Forks: 16

tfayyaz/awesome-azure-databricks
Awesome content all about Azure Databricks
Size: 87.9 KB - Last synced at: 5 days ago - Pushed at: over 3 years ago - Stars: 16 - Forks: 7

BlueGranite/DatabricksTraining
Repository for Microsoft Databricks Training Events - Hosted by BlueGranite
Language: Python - Size: 14 MB - Last synced at: about 1 month ago - Pushed at: almost 6 years ago - Stars: 15 - Forks: 8

Annielytix/Advanced-Databricks-for-ML-Build-2019
The //build 2019 repository on using Azure Databricks (Spark) for ML, with homework examples, code and notebooks
Language: Jupyter Notebook - Size: 14.4 MB - Last synced at: about 1 year ago - Pushed at: almost 6 years ago - Stars: 14 - Forks: 18

aminekaabachi/azure-databricks-sdk-python 📦
[archived] A Python SDK for the Azure Databricks REST API 2.0
Language: Python - Size: 94.7 KB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 13 - Forks: 6

Annielytix/DevOpsforDatabricks
Are you, like me, a Senior Data Scientist wanting to learn more about how to approach DevOps, specifically when using Databricks (workspaces, notebooks, libraries, etc.)? Set up using @Azure @Databricks
Language: PowerShell - Size: 4.1 MB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 12 - Forks: 14

AnthonyByansi/Azure-Data-Fundamentals-Guide
A comprehensive guide to understanding and implementing data management and analytics solutions in the Azure ecosystem using Azure Data Fundamentals.
Language: Mermaid - Size: 74.2 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 10 - Forks: 3

zBalachandar/Sales-Data-Analytics-Azure-Data-Engineering-End-to-End-Project-13
This project builds an End-to-End Azure Data Engineering Pipeline, performing ETL and Analytics Reporting on the AdventureWorks2017LT Database.
Language: Jupyter Notebook - Size: 23.3 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 9 - Forks: 4

bennyaustin/pyspark-utils
Reusable Python classes that extend open source PySpark capabilities. Implementation examples are available in the notebooks of the repo https://github.com/bennyaustin/synapse-dataplatform
Language: Python - Size: 36.1 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 9 - Forks: 4

Annielytix/Ready2019_AA_AI_200
A Beginner's Guide to Azure Databricks
Size: 24.7 MB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 9 - Forks: 14

cheukhin1024/Financial-Data-Project-in-Azure
Free High-Quality Financial Data in Azure
Language: Python - Size: 848 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 5

retkowsky/Azure-Databricks-Workshop
Azure Databricks workshop
Language: Jupyter Notebook - Size: 2.03 MB - Last synced at: 9 days ago - Pushed at: almost 5 years ago - Stars: 8 - Forks: 6

chayansraj/Microsoft-Azure-Medallion-Data-pipeline
This project creates an end-to-end data platform covering data ingestion, data transformation, data loading and reporting.
Language: Jupyter Notebook - Size: 11.5 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 7 - Forks: 6

bennyaustin/synapse-dataplatform
A modern data platform implemented on Azure Synapse Analytics using ELT Framework - https://github.com/bennyaustin/elt-framework. Data platform infrastructure provisioned using https://github.com/bennyaustin/iac-synapse-dataplatform
Language: TSQL - Size: 1.79 MB - Last synced at: 7 months ago - Pushed at: 10 months ago - Stars: 7 - Forks: 6

ezwiefel/azure-databricks-api
A wrapper for the Azure Databricks REST API
Language: Python - Size: 62.5 KB - Last synced at: 12 days ago - Pushed at: over 4 years ago - Stars: 7 - Forks: 9

PujitH-V/ETL_with_Pyspark_-_SparkSQL
A sample project designed to demonstrate an ETL process using the PySpark and Spark SQL APIs in Apache Spark.
Language: HTML - Size: 305 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 4

sfrechette/azure-databricks-citibike-nyc-analysis
Analyzing NYC bike data with Azure Databricks
Language: Jupyter Notebook - Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 5 - Forks: 0

Philippos01/mlops-energy-forecast-thesis
Automated pipeline for energy consumption forecasting across Europe using Azure cloud and Databricks.
Language: Python - Size: 1.37 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 1

anderl80/aml-vs-adb
End-to-end ML pipelines in Azure Machine Learning and Azure Databricks.
Language: Jupyter Notebook - Size: 188 KB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 1

aessing/demo-mdwh
Modern Data Warehouse demos with Azure Databricks, Azure Data Factory & Azure Dedicated SQL pool (formerly SQL DW)
Size: 48.3 MB - Last synced at: about 2 months ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 2

Annielytix/008-DatabricksIntroML
Ready2019_WTH_DatabricksIntroML
Language: Scala - Size: 43.6 MB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 3

retkowsky/AMLlabs
Azure AI hands-on labs
Language: Jupyter Notebook - Size: 117 MB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 4

retkowsky/Titanic
AutoML example with the Azure ML service SDK
Language: Jupyter Notebook - Size: 1.32 MB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 2

AdamPaternostro/Azure-Databricks-With-Spline
Using Spline with Azure Databricks
Size: 247 KB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 3 - Forks: 1

syedhassaanahmed/spark-with-engineering-fundamentals
E2E Spark data pipelines with engineering fundamentals
Language: HCL - Size: 1.22 MB - Last synced at: 22 days ago - Pushed at: 8 months ago - Stars: 2 - Forks: 2

aymane-maghouti/HR-Data-Pipeline-Azure
This project is a comprehensive data engineering solution that extracts HR data from a GitHub repository, performs data transformations using Azure services, and creates an interactive HR dashboard using Power BI. The goal is to enable HR professionals and decision-makers to gain insights from the HR data for better workforce management.
Language: Jupyter Notebook - Size: 3 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

tomarv2/terraform-databricks-azure-workspace
Terraform module to create Databricks Azure workspace
Language: HCL - Size: 387 KB - Last synced at: 3 months ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 1

BlueGranite/azure-synapse-vcf-analysis
Sample code for analyzing VCF files (converted to Parquet) in Azure Databricks and Synapse.
Size: 14.8 MB - Last synced at: 3 months ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

Bertelsmann-AI/databricks_learnings (fork of gabriben/databricks-learnings)
Special insights regarding Databricks.
Size: 71.3 KB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 0

AdamPaternostro/Azure-Databricks-External-Hive-and-ADLS
Shows how to use an External Hive (SQL Server) along with ADLS Gen 1 as part of a Databricks initialization script that runs when the cluster is created.
Language: PowerShell - Size: 13.7 KB - Last synced at: 3 months ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 1

AdamPaternostro/Azure-Databricks-CI-CD-Initial-Token
How to do CI/CD with Azure Databricks and get the initial Databricks token.
Language: C# - Size: 501 KB - Last synced at: 3 months ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 4

adithyavk9/DataEngineeringProjectAdventureWorks
An end to end data engineering project built on Azure
Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

enricogoerlitz/explore-azure-databricks
End-to-end backend and data hub architecture on Azure, integrating Databricks and a suite of Azure services for seamless data processing, analytics, and deployment.
Language: Jupyter Notebook - Size: 16.8 MB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

SayamAlt/Amazon-Products-API-ETL-and-ML-pipeline
In this project, I've created an end-to-end ETL pipeline and subsequently developed a machine learning model to predict the price of Amazon products based on several product-related features.
Language: Python - Size: 2.95 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

lamiaaali/DEPI-Graduation-Project
SkinCare Sentiment Analysis Reviews
Language: Jupyter Notebook - Size: 7.72 MB - Last synced at: 20 days ago - Pushed at: 8 months ago - Stars: 1 - Forks: 2

alexanderbean/E2E-Data-Engineering-in-Azure
End-to-end ETL pipeline in the Microsoft Azure cloud - (Jun '24 - Jul '24)
Language: Python - Size: 1.95 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

zBalachandar/Tokyo-Olympic-Data-Analytics-Azure-End-To-End-Data-Engineering-Project-12
Tokyo-olympic-azure-data-engineering-end-to-end-project
Language: HTML - Size: 44.5 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

Amrit-Hub/DP-203-Data-Engineer-Associate-Questions
This repo contains "Azure Data Engineer Associate" Questions and related docs.
Size: 29.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

giufalcao/Formula-1
A data pipeline project built on Databricks and Azure to demonstrate the lifecycle of a cloud data project.
Language: Python - Size: 5.21 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

ciaran28/dstoolkit-mlops-databricks (fork of microsoft/dstoolkit-mlops-databricks)
ML Ops Accelerator: Databricks & Azure Machine Learning Unification
Language: Python - Size: 82.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

manaswipatil/Tokyo-Olympics-Data-Analytics-in-Azure
Azure pipeline for data analytics on Tokyo Olympics data
Size: 507 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

venkatakamaiah46/Azure
POC projects working on Cloud Platforms
Language: HTML - Size: 208 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

iammustafatz/Mlflow-Diabetes-Prediction-Pipeline
This repository showcases how to build a machine learning pipeline for predicting diabetes in patients using PySpark and MLflow, and how to deploy it using Azure Databricks.
Language: Jupyter Notebook - Size: 1.89 MB - Last synced at: 6 months ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

kagarlickij/azure-databricks-arm
Azure RM template for Databricks and an Azure DevOps pipeline that deploys the Databricks workspace and cluster
Language: PowerShell - Size: 174 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 8

randyroac/azure-databricks-etl-project
ETL motor racing data project using Azure Databricks, PySpark and Azure Data Lake
Language: Python - Size: 1.52 MB - Last synced at: 11 months ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 2

Andy-Pham-72/Top-Rentals-Cineplex
Applying data engineering techniques to create a data pipeline with Azure Cloud Computing
Language: Python - Size: 17.4 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 1

easonlai/eda_for_prudential_life_insurance_sample_data
Notebook sample of Exploratory Data Analysis (EDA) for Prudential Life Insurance Sample Data
Language: Jupyter Notebook - Size: 4.18 MB - Last synced at: 4 months ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

syedhassaanahmed/tf-adb-hdi-hbase
Terraform template which shows how to connect E2E from Azure Databricks to an HDInsight HBase cluster
Language: HCL - Size: 7.81 KB - Last synced at: 3 days ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

colbyford/malaria_DREAM2019
Article Repository for: Ensemble Machine Learning Modeling for the Prediction of Artemisinin Resistance in Malaria
Language: Python - Size: 271 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 1

syedhassaanahmed/azure-data-manager
Cloud services for defining, ingesting, transforming, analyzing and showcasing big data
Language: C# - Size: 759 KB - Last synced at: 4 months ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 2

AdamPaternostro/Azure-Databricks-auto-sklearn
Shows how to install auto-sklearn on an Azure Databricks cluster
Language: Shell - Size: 303 KB - Last synced at: 3 months ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

mpfishe2/eventhubs-databricks-quickstart
Get up and running quickly with Spark Structured Streaming on Azure Databricks using Azure Event Hubs
Language: Scala - Size: 3.91 KB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

s-yazhini/Hexa-DE-Main-Project
Data engineering main project 1
Language: Jupyter Notebook - Size: 15.5 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

mysticrenji/az-databricks-pipelines
Repository containing TF files pertaining to Azure Pipelines and Azure Databricks
Language: HCL - Size: 133 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 2

Sasuke565/rmodel
rModel is a framework for building LLM applications with agentic workflows. Keywords: agent, agentic, ai, flow, framework, graph, llm, multi-agent, workflow
Size: 1000 Bytes - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

srimantapal205/Subject-Wise-Question---Answer
This branch focuses on building a set of Data Engineering interview questions and answers
Size: 482 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

benijake/databricks-read-write-database-sp
How can I use a service principal to read and write to a database from Databricks?
Language: Jupyter Notebook - Size: 4.01 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

xom9s/sports
Repository for backend infrastructure of a project
Language: Python - Size: 73.2 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

benijake/databricks-read-write-database-entra
How to read and write to an Azure SQL or Postgres DB from a Databricks notebook
Language: Jupyter Notebook - Size: 1.55 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

Jcnok/Microsoft_AI_for_Tech-Azure_Databricks
Solutions to the project challenges completed during the Microsoft Azure Databricks Bootcamp - 2025
Language: Jupyter Notebook - Size: 389 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

productiveAnalytics/mlops_with_databricks
CI/CD pipeline and MLOps with Databricks (Azure Databricks & Azure DevOps)
Language: Python - Size: 10.7 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Mohitsai/future-of-hiring
Automated ETL pipeline in Azure for job market analysis using Terraform, Azure Functions, Azure Databricks, Azure Data Lake and Power BI
Language: HCL - Size: 20.5 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

MohssineSERRAJI/azure-data-lake
A lightweight toolkit for Azure Data Lake Storage Gen2 operations, featuring AzCopy commands and Databricks integration examples. Includes sample data and notebooks for quick experimentation with data lake architectures.
Language: Jupyter Notebook - Size: 449 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

tahir007malik/ecommerceDataStreamingAnalytics
This repository features a production-grade data pipeline leveraging Confluent Kafka for real-time collection of e-commerce clickstream and user activity data.
Language: Jupyter Notebook - Size: 558 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

shudhanshurp/News_Recommendation_System
This repository presents a News Recommendation System using Azure Data Factory, Azure Databricks, and Azure Data Lake to create a data pipeline for ML models. It uses BERT for content-based filtering, Neural Collaborative Filtering for user behaviors, and a hybrid model that combines both to enhance news recommendations.
Language: Jupyter Notebook - Size: 55.9 MB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

SayamAlt/TMDB-Movies-End-to-End-ETL-and-ML-Pipeline
This project encompasses end-to-end ETL and ML pipeline development. Data ingestion from TMDB API covered top-rated, current, upcoming, and popular movies with genres. Performed EDA to derive several valuable insights and observations. Developed a regression model with 97% r2 score to predict average movie ratings accurately.
Language: Python - Size: 15.6 KB - Last synced at: 4 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Sivaprasad-V/Tokyo-Olympics-Azure-Data-Engineering-Project
Azure End To End Data Engineering Project
Language: Jupyter Notebook - Size: 358 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Sivaprasad-V/NYC-TAXI-Azure-Data-Engineering-Project
Azure End To End Data Engineering Project
Language: Jupyter Notebook - Size: 17.4 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Sivaprasad-V/Adventure-Works-Azure-Data-Engineering-Project
Azure End To End Data Engineering Project
Language: Jupyter Notebook - Size: 2.92 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Redgerd/Reddit-Post-Analysis-Workflow
This Reddit Post Analysis Workflow collects and processes Reddit data using Apache Spark and Delta Lake. It transforms raw data, applies sentiment analysis, and extracts TF-IDF features. The pipeline ensures reliable, high-quality data storage and supports continuous analytics.
Language: HTML - Size: 193 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 0 - Forks: 1

najmaelboutaheri/Patents_analysis
This repository contains code and resources for analyzing patents using Apache Spark, Python, and AWS services. The objective of this project is to extract insights and trends from patent data to inform business decisions and intellectual property strategies.
Language: Jupyter Notebook - Size: 7.79 MB - Last synced at: 15 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

gudashashank/tokyo-olympics-analysis
An Azure cloud-based data analytics solution that processes and visualizes the 2021 Tokyo Olympics dataset. This end-to-end pipeline leverages Azure Data Factory for data ingestion, Data Lake Storage Gen2 for secure storage, Databricks for data transformation, Synapse Analytics for SQL querying, and Power BI for interactive visualization
Size: 1.18 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

s-yazhini/PySpark-and-SparkSQL
In Azure Databricks
Language: Jupyter Notebook - Size: 13.7 KB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

uminskib/Toronto_traffic_collisions_and_weather_Azure_Data_Engineering
Comprehensive data engineering solution using Azure platform tools such as Data Factory and Databricks, completed with analysis and dashboard in Power BI.
Language: Jupyter Notebook - Size: 20.5 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

arnabsaha7/TechRetail-Sales-Analysis
TechRetail Azure Data Pipeline Analysis provides a robust analysis of retail data via an Azure-based data pipeline.
Language: Jupyter Notebook - Size: 5.21 MB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

clouddrove/terraform-azure-databricks
This terraform module is designed to create Azure Databricks resources. Azure Databricks is a fully managed first-party service that enables an open data lakehouse in Azure.
Language: HCL - Size: 56.6 KB - Last synced at: 7 days ago - Pushed at: 9 months ago - Stars: 0 - Forks: 3

laismeuchi/dados-databricks-base-cnpj
Project using the CNPJ database from the Receita Federal
Language: Python - Size: 84 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

ssanthosh010303/collection-data-training
A collection of challenges completed during a data training program.
Size: 1000 Bytes - Last synced at: 25 days ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

sandeep-khr/Tokyo-Olympics-Data-Insights-using-Azure
Size: 343 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

Intellipaat-software-solution-official/Azure-Data-Engineering-Capstone-Project
This Capstone Project includes an end-to-end data engineering pipeline, from ingesting the data from an HTTPS server, to cleaning and transforming the data in Azure Databricks, and finally reporting the data in Power BI Desktop
Size: 6.87 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

Dimitrov-S-Dev/resume
Dimitrov-S-Dev Resume/ Portfolio
Language: CSS - Size: 23.1 MB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

ahmedlrashed/E2E-Azure-Pipeline
Databricks ETL Pipeline for retrieving and processing NI TestStand test results, featuring a well-documented notebook for ETL operations, Data Lake for storage, Spark SQL+Python for transformations, and Power BI as the final visualization of factory metrics.
Language: Jupyter Notebook - Size: 1.22 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

mihirkudale/Olympic-data-analysis-azure-data-engineering-project
Language: Jupyter Notebook - Size: 143 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

matgonz/azure_databricks_mlops_e2e
[👩🏫] In this repository I'll show you how to use Azure Databricks to develop and train machine learning models, and how to build an MLOps pipeline that serves them through a CI/CD process.
Language: Jupyter Notebook - Size: 2.9 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

sakethmukkanti/Machinery-Moniter-Iot-Streaming-With-Azure
An application developed to give real-time insights into machine health using IoT sensors by tracking and monitoring parameters such as temperature, pressure, current and humidity.
Language: Jupyter Notebook - Size: 210 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

RJ-Raj/IoT-Data-Pipeline
This repository contains code for an end-to-end IoT data pipeline using Azure services. It ingests, processes, and stores IoT device data from AWS S3 to Azure Data Lake Storage and Azure SQL Database, leveraging Azure Data Factory and Azure Functions for seamless integration and automation.
Language: Python - Size: 14.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

sakethmukkanti/Movielens-Dataset-Analysis-Azure-Data-Engineering-Project
Created a movie recommendation system on Azure utilizing Spark SQL for analyzing the MovieLens dataset.
Language: Jupyter Notebook - Size: 1.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

rohitkulkarni08/Azure-ETL-AmazonSalesAnalysis
A comprehensive ETL pipeline and sales analysis project leveraging Microsoft Azure and PySpark, designed to optimize e-commerce sales by providing actionable insights through detailed data analysis.
Language: Jupyter Notebook - Size: 8.04 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
