An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: dataengineer

RaviSoni804426/Data-Code

📊 This repo Contains my Data Science & Machine Learning Journey with hands on practice.

Language: Jupyter Notebook - Size: 798 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

DarrenDavy12/Earthquake-Events-and-Risks-Project---Azure-Data-Pipeline---API-Connection-

Earthquake Events and Risks Project - Azure Data Pipeline - API Connection

Size: 3.94 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

anandjha90/ANALYTICSWITHANAND

This repository contains all the codes,ppts,project & interview questions which I have used in my LIVE CLASS on YouTube and any other relevant documents and assignments related to the course.

Size: 321 MB - Last synced at: about 15 hours ago - Pushed at: about 16 hours ago - Stars: 69 - Forks: 29

Anas399/SPARK_CLUSTER_DOCKER

Set-up local spark cluster, hadoop (hdfs), airflow, postgresql on docker with ease, without any local installations

Size: 1000 Bytes - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

holynomad/build-up-SQL-Python-expert

SQL & Python expert 🌊🏄🏻‍♂️

Language: Python - Size: 109 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

imsheth/imsheth.github.io

Ishan Sheth (imsheth) is a seasoned polyglot and can be found in the terminal, the browser, eating, cooking or amongst nature. As a software engineer, he has experienced the rollercoaster journeys of building products from the ground up, onboarding the first customers for products which were eventually acquired and has also offered specialized consultations to corporations which operate at a multi million dollar scale. Owing to this he has got communication, bringing order to chaos and closure under his belt, which are now his forte. Trance music gets him high and keeps him going on with life.

Language: HTML - Size: 52.4 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

ianCristianAriel/2024-argentina-analisis-sistema-educativo

Este proyecto académico se centra en el análisis del sistema educativo de Argentina, utilizando técnicas de ciencia de datos, aprendizaje automático y visualización de datos. El objetivo es obtener insights valiosos que ayuden a comprender mejor la situación educativa en el país.

Size: 3.91 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

vitornimschofsky/Open-Brewery-DB_Azure-ELT

Example of the entire process of ingestion, storage, transformation and visualization of a brewery database

Language: Jupyter Notebook - Size: 1.56 MB - Last synced at: 8 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Wittline/apache-spark-docker

Dockerizing an Apache Spark Standalone Cluster

Language: VBA - Size: 63.7 MB - Last synced at: 10 days ago - Pushed at: almost 3 years ago - Stars: 43 - Forks: 27

pltommasino/pltommasino

Size: 9.77 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

yuriidd/DataLearnDE

repository for Datalearn-Data-Engineer course and Surfalytics projects

Language: Dockerfile - Size: 51.1 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 4 - Forks: 2

xiangivyli/data-science-portfolio

Data Engineering, SQL, Exploratory Data Analysis (EDA), Machine Learning (Python), Business Intelligence (BI)

Language: Jupyter Notebook - Size: 331 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 11 - Forks: 1

officialarete/Data-Professionals

A closer look on how data professionals are thriving,paying close attention to the challenges that hinder there growth and development.

Size: 434 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

huudatscience/huudatscience.github.io Fork of daviddarnes/alembic

⚗️ Máy tính lượng tử & AI | Huu Dat's Blog

Language: SCSS - Size: 7.99 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 2 - Forks: 1

Iqrar99/FARMAKAMI-TK-BASDAT-56 📦

Group Project of Database (CSGE602070) course at Fasilkom UI.

Language: Python - Size: 583 KB - Last synced at: 12 months ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

sibyabin/blogs

Technology blogging website from Siby Abin. Talks about dataengineering, aws, spark, python, airflow and more

Language: SCSS - Size: 6.33 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

niyotham/dbt-datacamp-course-note

dbt course form datacamp for data analytic engineer

Size: 2.99 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ericksonlopes/Dicas_Pandas_Linkedin

Language: Jupyter Notebook - Size: 354 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 0

gutomelo/3GTeam

Projeto do grupo 3GTeam apresentado no Hackathon de Engenharia de Dados da A3Data no mês de Junho de 2021.

Language: Python - Size: 1.57 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 10 - Forks: 4

nareshk1290/dataquest-DE

Data Quest - Data Engineer Learning and Projects

Language: Jupyter Notebook - Size: 125 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 22 - Forks: 17

BrunoGianetti/BrunoGianetti

Who am I? What I do? How to contact me?

Size: 263 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

bosimanurung/StringChallenge

Have the function DifferentCases(str) take the str parameter being passed and return it in upper camel case format where the first letter of each word is capitalized. The string will only contain letters and some combination of delimeter punctuation character separating each word.

Language: Python - Size: 417 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

bosimanurung/RegEx-That-Works-And-Not

RegEx or Regular Expression is good for your health too

Language: Python - Size: 1.03 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

egorfolley/DataCamp

Courses and projects on Data Camp

Language: Python - Size: 10.4 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 10 - Forks: 3

victorandradebr/data_engineer

Olá pessoal, esse é um repositório tratando de fim a fim, uma pipeline de dados relacionados a produtos e suas subcategorias, onde simulo que isso seja um pedido do time de negócios, com granularidade diária a ser entregue.

Language: Python - Size: 39.1 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 0

victorandradebr/ETL_DATABRICKS_BITCOIN

Este projeto se trata de um simples etl com um dataset com as variações dos preços diários do bitcoin no período de 2020-2022. Os códigos do notebook foram desenvolvidos tanto em pyspark quanto em sql, numa simulação de solucão referentes a perguntas de négocio.

Size: 125 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

dscunair/py-scraper

A scraper service project to collect datasets from several websites or third-party APIs.

Language: Python - Size: 23.7 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 1

manug25/leetcode-solutions-sql

SQL questions and solutions from leetcode in SQL, Spark, PySpark

Size: 19.5 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 0

deepakksahu/Data_engineering_takehome

Crosslend DE Assignment

Language: Python - Size: 16.6 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

fabinhojorge/simple_etl_pipeline

This projects presents a simple ETL pipeline.

Language: Jupyter Notebook - Size: 1.95 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

IanNarsa/etl-flask-bonobo

ETL with flask and bonobo

Language: Python - Size: 10.7 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0