GitHub topics: dataengineer
RaviSoni804426/Data-Code
📊 This repo Contains my Data Science & Machine Learning Journey with hands on practice.
Language: Jupyter Notebook - Size: 798 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

DarrenDavy12/Earthquake-Events-and-Risks-Project---Azure-Data-Pipeline---API-Connection-
Earthquake Events and Risks Project - Azure Data Pipeline - API Connection
Size: 3.94 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

anandjha90/ANALYTICSWITHANAND
This repository contains all the codes,ppts,project & interview questions which I have used in my LIVE CLASS on YouTube and any other relevant documents and assignments related to the course.
Size: 321 MB - Last synced at: about 15 hours ago - Pushed at: about 16 hours ago - Stars: 69 - Forks: 29

Anas399/SPARK_CLUSTER_DOCKER
Set-up local spark cluster, hadoop (hdfs), airflow, postgresql on docker with ease, without any local installations
Size: 1000 Bytes - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

holynomad/build-up-SQL-Python-expert
SQL & Python expert 🌊🏄🏻♂️
Language: Python - Size: 109 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

imsheth/imsheth.github.io
Ishan Sheth (imsheth) is a seasoned polyglot and can be found in the terminal, the browser, eating, cooking or amongst nature. As a software engineer, he has experienced the rollercoaster journeys of building products from the ground up, onboarding the first customers for products which were eventually acquired and has also offered specialized consultations to corporations which operate at a multi million dollar scale. Owing to this he has got communication, bringing order to chaos and closure under his belt, which are now his forte. Trance music gets him high and keeps him going on with life.
Language: HTML - Size: 52.4 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

ianCristianAriel/2024-argentina-analisis-sistema-educativo
Este proyecto académico se centra en el análisis del sistema educativo de Argentina, utilizando técnicas de ciencia de datos, aprendizaje automático y visualización de datos. El objetivo es obtener insights valiosos que ayuden a comprender mejor la situación educativa en el país.
Size: 3.91 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

vitornimschofsky/Open-Brewery-DB_Azure-ELT
Example of the entire process of ingestion, storage, transformation and visualization of a brewery database
Language: Jupyter Notebook - Size: 1.56 MB - Last synced at: 8 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Wittline/apache-spark-docker
Dockerizing an Apache Spark Standalone Cluster
Language: VBA - Size: 63.7 MB - Last synced at: 10 days ago - Pushed at: almost 3 years ago - Stars: 43 - Forks: 27

pltommasino/pltommasino
Size: 9.77 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

yuriidd/DataLearnDE
repository for Datalearn-Data-Engineer course and Surfalytics projects
Language: Dockerfile - Size: 51.1 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 4 - Forks: 2

xiangivyli/data-science-portfolio
Data Engineering, SQL, Exploratory Data Analysis (EDA), Machine Learning (Python), Business Intelligence (BI)
Language: Jupyter Notebook - Size: 331 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 11 - Forks: 1

officialarete/Data-Professionals
A closer look on how data professionals are thriving,paying close attention to the challenges that hinder there growth and development.
Size: 434 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

huudatscience/huudatscience.github.io Fork of daviddarnes/alembic
⚗️ Máy tính lượng tử & AI | Huu Dat's Blog
Language: SCSS - Size: 7.99 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 2 - Forks: 1

Iqrar99/FARMAKAMI-TK-BASDAT-56 📦
Group Project of Database (CSGE602070) course at Fasilkom UI.
Language: Python - Size: 583 KB - Last synced at: 12 months ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

sibyabin/blogs
Technology blogging website from Siby Abin. Talks about dataengineering, aws, spark, python, airflow and more
Language: SCSS - Size: 6.33 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

niyotham/dbt-datacamp-course-note
dbt course form datacamp for data analytic engineer
Size: 2.99 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ericksonlopes/Dicas_Pandas_Linkedin
Language: Jupyter Notebook - Size: 354 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 0

gutomelo/3GTeam
Projeto do grupo 3GTeam apresentado no Hackathon de Engenharia de Dados da A3Data no mês de Junho de 2021.
Language: Python - Size: 1.57 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 10 - Forks: 4

nareshk1290/dataquest-DE
Data Quest - Data Engineer Learning and Projects
Language: Jupyter Notebook - Size: 125 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 22 - Forks: 17

BrunoGianetti/BrunoGianetti
Who am I? What I do? How to contact me?
Size: 263 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

bosimanurung/StringChallenge
Have the function DifferentCases(str) take the str parameter being passed and return it in upper camel case format where the first letter of each word is capitalized. The string will only contain letters and some combination of delimeter punctuation character separating each word.
Language: Python - Size: 417 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

bosimanurung/RegEx-That-Works-And-Not
RegEx or Regular Expression is good for your health too
Language: Python - Size: 1.03 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

egorfolley/DataCamp
Courses and projects on Data Camp
Language: Python - Size: 10.4 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 10 - Forks: 3

victorandradebr/data_engineer
Olá pessoal, esse é um repositório tratando de fim a fim, uma pipeline de dados relacionados a produtos e suas subcategorias, onde simulo que isso seja um pedido do time de negócios, com granularidade diária a ser entregue.
Language: Python - Size: 39.1 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 0

victorandradebr/ETL_DATABRICKS_BITCOIN
Este projeto se trata de um simples etl com um dataset com as variações dos preços diários do bitcoin no período de 2020-2022. Os códigos do notebook foram desenvolvidos tanto em pyspark quanto em sql, numa simulação de solucão referentes a perguntas de négocio.
Size: 125 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

dscunair/py-scraper
A scraper service project to collect datasets from several websites or third-party APIs.
Language: Python - Size: 23.7 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 1

manug25/leetcode-solutions-sql
SQL questions and solutions from leetcode in SQL, Spark, PySpark
Size: 19.5 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 0

deepakksahu/Data_engineering_takehome
Crosslend DE Assignment
Language: Python - Size: 16.6 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

fabinhojorge/simple_etl_pipeline
This projects presents a simple ETL pipeline.
Language: Jupyter Notebook - Size: 1.95 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

IanNarsa/etl-flask-bonobo
ETL with flask and bonobo
Language: Python - Size: 10.7 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0
