An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: data-engineer

mikecerton/UserInsight-Streaming-Data-Pipeline

UserInsight-Streaming-Data-Pipeline is a real-time pipeline that ingests API data into Kafka, processes it with Spark, stores it in S3, and uses AWS Lambda to load it into Redshift. The data is then used to create a dashboard in Looker. [Data Engineer]

Language: Python - Size: 325 KB - Last synced at: about 18 hours ago - Pushed at: about 19 hours ago - Stars: 1 - Forks: 0

Hippaho/Sparkify

A music streaming company, Sparkify, has decided that it is time to introduce more automation and monitoring to their data warehouse ETL pipelines and come to the conclusion that the best tool to achieve this is Apache Airflow.

Language: Python - Size: 17.6 KB - Last synced at: about 23 hours ago - Pushed at: about 24 hours ago - Stars: 0 - Forks: 0

Shakespear567/Data_Engineering_GCP

Data Engineering Using Google Could Platform and Mage

Size: 1000 Bytes - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

drahulsingh/drahulsingh

I'm an IT graduate with a passion for data, software, and cloud computing. With a knack for problem-solving and a commitment to staying updated with cutting-edge technologies, I aim to contribute to innovative projects and help organizations achieve their goals.

Size: 128 KB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 7 - Forks: 0

zulfaan/e-commerce-analyst-portfolio

Developed and maintained a Luigi‑driven ETL pipeline to scrape, clean, and load Tokopedia Exsport Bag product, stock, and order data into PostgreSQL, enabling in‑depth category‑level sales and revenue analysis that powered data‑driven decision‑making

Language: Jupyter Notebook - Size: 2.73 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

adamjpatterson/adamjpatterson

Data Architect and Full Stack Engineer with an Interest in Data Science

Size: 103 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

swapnaxdata/Nyc-Taxi-Data-Engineering-Project

Language: Jupyter Notebook - Size: 2.94 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

pierrehanne/pierrehanne

My GitHub Profile Page

Size: 66.4 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

kelvins/awesome-dataops

:sunglasses: A curated list of awesome DataOps tools

Language: Python - Size: 112 KB - Last synced at: 3 days ago - Pushed at: 6 months ago - Stars: 184 - Forks: 29

dhkdn9192/data_engineer_career

DE직무에 필요한 모든 것

Language: Jupyter Notebook - Size: 14.6 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 196 - Forks: 28

AhmetFurkanDEMIR/Data-Engineering-Project-with-HDFS-and-Kafka

Data Engineering Project with Hadoop HDFS and Kafka

Language: Python - Size: 3.46 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 102 - Forks: 25

Andessonreis/andessonreis

Size: 2.74 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 2 - Forks: 0

chrimaho/chrimaho

My Personal Repository

Size: 278 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 2 - Forks: 0

data-engineering-community/data-engineering-wiki

The best place to learn data engineering. Built and maintained by the data engineering community.

Language: CSS - Size: 7.83 MB - Last synced at: 11 days ago - Pushed at: 16 days ago - Stars: 1,632 - Forks: 183

BayoAdejare/lightning-containers

Docker powered starter for geospatial analysis of lightning atmospheric data.

Language: Python - Size: 149 MB - Last synced at: 7 days ago - Pushed at: 12 days ago - Stars: 6 - Forks: 2

rashakil-ds/Roadmap-Docs

Best Data Science, Data Analytics, AI, and SDE roadmaps. This repository is continually updated based on the top job postings on LinkedIn and Indeed in the data science and AI domain.

Size: 1.16 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 385 - Forks: 149

CharlesEmil/sql-data-warehouse-project

Building a modern data warehouse with SQL Server, Including ETL processes, data modeling and analytics.

Language: TSQL - Size: 1.82 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

rezarajan/whoami

Resume source files

Language: TeX - Size: 800 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

apancoast/Healthcare-Deserts-and-Public-Transit

This dbt-based project aims to analyze the intersection of healthcare accessibility and public transit coverage in Mecklenburg County, NC.

Language: Python - Size: 104 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

wwwlike/vlife

企业级低代码快速开发平台,包含页面可视化配置、自定义表单、自定义报表、权限管理脚手架应用、前后端代码自动生成;主要特点是低代码开发,可实现复杂CRUD功能仅编写数据模型就能完成前后端开发

Language: TypeScript - Size: 9.05 MB - Last synced at: 13 days ago - Pushed at: 6 months ago - Stars: 709 - Forks: 85

ortizfram/datacamp-Data-Engineer-with-Python-course

datacamp Data Engineer with Python course. 73 hours/ 19 Courses /2 Skill Assessments

Language: Python - Size: 22 MB - Last synced at: 18 days ago - Pushed at: over 2 years ago - Stars: 104 - Forks: 34

higorcazuza81/higorcazuza81

A little about me

Size: 3.56 MB - Last synced at: 16 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

janainacazuza/janainacazuza

A little about me

Size: 517 KB - Last synced at: 14 days ago - Pushed at: 21 days ago - Stars: 1 - Forks: 0

dhamodharanrk/online-cv

Mr.Dhamodharan's Curriculum vitae

Language: CSS - Size: 3.7 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 1

mitgar14/etl-workshop-2

Workshop #2 (ETL process using Airflow) for the ETL course using Apache Airflow to build a data pipeline.

Language: Jupyter Notebook - Size: 7.58 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 1 - Forks: 0

imsathiya17/imsathiya17.github.io

Language: CSS - Size: 21.9 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 2 - Forks: 0

Devinterview-io/data-engineer-interview-questions

🟣 Data Engineer interview questions and answers to help you prepare for your next machine learning and data science interview in 2024.

Size: 14.6 KB - Last synced at: 23 days ago - Pushed at: about 1 year ago - Stars: 22 - Forks: 10

mikecerton/The-Retail-ELT-Pipeline-End-To-End

This project designs and implements an ETL pipeline using Apache Airflow (Docker Compose) to ingest, process, and store retail data. AWS S3 acts as the data lake, AWS Redshift as the data warehouse, and Looker Studio for visualization. [Data Engineer]

Language: Python - Size: 1.07 MB - Last synced at: 21 days ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

FadhiilDzaki/etl_superstore

This project automates ETL for Superstore data, extracting from PostgreSQL, transforming in Python, and reloading into PostgreSQL weekly. I conducted data analysis in Jupyter Notebook and built a Metabase dashboard for insights.

Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

lucasleonetti/ETL_Aviacion_Civil

Proyecto ETL en Hadoop Ecosystem Para responder a consultas de negocio para la ANAC

Size: 0 Bytes - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

mikecerton/Apache_Kafka_Basic

This repository provides a fundamental understanding of Apache Kafka, including its core components, basic Python scripts to demonstrate how to create topics, produce messages, and consume messages, as well as a docker-compose.yml file for easy setup. [Data Engineer]

Language: Python - Size: 6.84 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

mikecerton/ETLpython_Oracle

ETL from couchDB to Oracle using python [Data Engineer]

Language: Python - Size: 11.7 KB - Last synced at: 21 days ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

sanketrs/sql-interview-preparation-questions-with-answers

Designed as a comprehensive resource for aspiring data analysts, data engineers, and database administrators.

Size: 16.6 KB - Last synced at: 20 days ago - Pushed at: 4 months ago - Stars: 4 - Forks: 0

DAN3002/Tiktok-Crawler

This is a simple Tiktok crawler that can be used to download videos from Tiktok. It uses the Tiktok API to get the video URL and then downloads the video using the requests library. It can download video from multiple hashtags or download by sound.

Language: Python - Size: 5.86 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

rublaman/data-engineering-portfolio

This portfolio presents a compilation of data engineering projects that highlight the knowledge and use of tools in an optimal way for data flow. The repository demonstrates a commitment to developing robust solutions using diverse technologies, addressing practical challenges in the field.

Language: Jupyter Notebook - Size: 1.67 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

mitgar14/etl-workshop-1

Workshop #1 (Data Engineer) for the ETL course using Pandas, Matplotlib, SQLAlchemy and Power BI for the creation of the dashboard.

Language: Jupyter Notebook - Size: 4.93 MB - Last synced at: 30 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Thomas-George-T/Thomas-George-T

Readme for my :octocat: Profile

Size: 203 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 76 - Forks: 87

epsi10nvn/RealTime-Sales-Inventory

A retail data warehouse that processes real-time purchases from Kafka, stores them in PostgreSQL with a star schema, and enables sales analysis, inventory tracking, and interactive reporting.

Language: Python - Size: 298 KB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

aaaastark/Top-Big-Data-Scientist-Questions-For-Interview

Top Big Tech Data Science Questions

Size: 1.55 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 10 - Forks: 8

Justmalhar/ThinkLikeAnEngineer

💡 Think Like An Engineer is a roadmap for engineering leadership, a toolkit for growth hacking through engineering, and a manifesto for productivity enhancement

Size: 61.5 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

mauricioarcez/mauricioarcez.github.io

Te invito a conocerme en mi portafolio. Como puedo ayudarte?

Language: CSS - Size: 5.42 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

nivesayee/nivesayee

Size: 1.07 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

mohidex/data-pipeline-on-gcp

The Real-time Ecommerce Data Collection and Processing project empowers businesses with real-time insights by efficiently extracting, processing, and storing ecommerce data from multiple sources. Combining Golang and Python, this cutting-edge solution streamlines data handling from diverse ecommerce websites.

Language: Python - Size: 1010 KB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 6 - Forks: 0

tuanx18/data-engineer-portfolio

This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science topics.

Size: 276 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 40 - Forks: 10

digitalghost-dev/premier-league 📦

A Data Engineering project. Repository for backend infrastructure and Streamlit app files for a Premier League Dashboard.

Language: Python - Size: 487 KB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 224 - Forks: 17

lixx21/spotify-scrapping

Scraping data from Spotify Playlist URL using Python and Selenium

Language: Python - Size: 4.88 KB - Last synced at: 19 days ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

Candratama/ETLProject

An ETL (Extract, Transform, Load) pipeline that processes WhatsApp messages data from MySQL, transforms it, and loads the results into PostgreSQL and CSV files

Language: Python - Size: 77.1 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

taufiksatrian/DigitalentKominfo

Dokumentasi Praktikum dan Project Hands On Pelatihan Talent Scouting Academy Digitalent Kominfo

Language: Jupyter Notebook - Size: 10.6 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

lixx21/dbt-shopping-data-transform

This project leverages DBT (Data Build Tool) to transform raw shopping data into a well-structured, analytics-ready format

Language: Dockerfile - Size: 610 KB - Last synced at: 19 days ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

lixx21/airflow-dbt-gcp

A comprehensive data pipeline leveraging Airflow, DBT, Google Cloud Platform (GCP), and Docker to extract, transform, and load data seamlessly from a staging layer to a data warehouse and data mart.

Language: Python - Size: 257 KB - Last synced at: 13 days ago - Pushed at: 5 months ago - Stars: 6 - Forks: 5

mjs1995/Book_review

book review for data-engineer

Size: 1.49 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 5 - Forks: 0

nottherealtar/Data_Engineering_Assesments

Language: Python - Size: 21.5 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 1 - Forks: 1

mazzasaverio/data-software-engineering-journal

I decided to start tracking my learning, tips, code, building projects, ideas, and curiosities that I discover on my product engineer development journey. I hope that others might find interesting insights, discover their own paths, and enjoy the journey as well.

Language: TypeScript - Size: 51.7 MB - Last synced at: 11 days ago - Pushed at: 6 months ago - Stars: 4 - Forks: 0

mensenvau/data_engineering_solution_no1

Data Engineer Lead Analyst Case Study

Size: 2.95 MB - Last synced at: 29 days ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

mensenvau/leetcode_sql_problems

😊️️️️️️ Leetcode database part solutions

Language: TSQL - Size: 92.8 KB - Last synced at: 29 days ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

mensenvau/internship_sql_analytics

🚀 Internship SQL (East, Advanced)

Language: TSQL - Size: 10.4 MB - Last synced at: 29 days ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

patcha-ranat/Ecommerce-Invoice-End-to-end

End-to-end Data Project (DA/DS/DE/MLOps) - retail/e-commerce - dynamic interpretable customer segmentation

Language: Jupyter Notebook - Size: 49.9 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 7 - Forks: 0

BayoAdejare/pipeline-ecommerce

E-commerce Data Pipeline

Language: Python - Size: 22.5 MB - Last synced at: 7 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

andkret/Cookbook

The Data Engineering Cookbook

Size: 35.5 MB - Last synced at: 6 months ago - Pushed at: 9 months ago - Stars: 13,683 - Forks: 2,498

SimonOsipov/SimonOsipov

Description for Github mainpage

Size: 51.8 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

ivanshamaev/python-algorithms-data-engineer

Python Algorithms for Data Engineers | Алгоритмы Python для инженеров данных. Этот репозиторий содержит коллекцию Jupyter notebooks, с решениями различных алгоритмических задач. Notebooks собраны специально для инженеров данных и помогут улучшить навыки problem-solving и algorithmic thinking с помощью упражнений.

Language: Jupyter Notebook - Size: 362 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

longNguyen010203/100Day-Self-Learning-DE

📚💻⌨ Self-study process for more than 3 months with 3-4h/day to prepare for the journey of applying for an intern or fresher position as a Data Engineer in 2024 ️🥇️🏆

Language: Jupyter Notebook - Size: 56.6 KB - Last synced at: 15 days ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

cainzuca/scrapy_celphone

Projeto com o objetivo de extrair dados da web.

Language: Python - Size: 63.5 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

mensenvau/mensenvau

I am a Mid Software/Data Engineer.

Language: HTML - Size: 3.15 MB - Last synced at: 29 days ago - Pushed at: 6 months ago - Stars: 2 - Forks: 1

unnati-xyz/scalable-data-science-platform

Content for architecting a data science platform for products using Luigi, Spark & Flask.

Language: Jupyter Notebook - Size: 22.7 MB - Last synced at: 5 months ago - Pushed at: about 5 years ago - Stars: 163 - Forks: 28

BayoAdejare/pipeline-edtech

Edtech ADF Pipeline Project

Size: 12.7 KB - Last synced at: 7 days ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

BayoAdejare/pipeline-sleep

Sleep Data Pipeline with Azure Data Factory

Size: 21.5 KB - Last synced at: 7 days ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

pwenker/data-engineering

My notes for Udacity's Data Engineering Nanodegree.

Size: 1.3 MB - Last synced at: 5 days ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 0

mimnets/data-analyst-roadmap-2024

Becoming a Data Analyst: Your Path to a Job-Ready Bootcamp

Size: 1.47 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

zekeriyyaa/Building-A-Data-Pipeline-For-ROS-Compliant-Robotic-System-Via-Amazon-Web-Services

Language: Python - Size: 959 KB - Last synced at: 12 days ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 2

camposvinicius/aws-etl

This is an ETL application on AWS with general open sales and customer data that you can find here: https://github.com/camposvinicius/data/blob/main/AdventureWorks.zip, it's a zipped file with some .csvs inside that we will apply transformations.

Language: Smarty - Size: 168 KB - Last synced at: 5 months ago - Pushed at: about 3 years ago - Stars: 17 - Forks: 3

RenanBjj/Databricks-PowerBI-OpticalSales

Databricks optical sales

Language: Jupyter Notebook - Size: 2.08 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

Hanan-Nawaz/100_Days_of_Data_Engineering

Journey through 100 days of Data Engineering, featuring daily learning, practice, and projects. This repository includes notes, exercises, and code snippets covering essential topics such as GitHub, Python, ETL, data pipelines, and more

Language: Jupyter Notebook - Size: 10.5 MB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

jasontanx/prefect-learning

Prefect - Data orchestration tool practice & learning

Language: Python - Size: 314 KB - Last synced at: 27 days ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

adilkhash/apache-airflow-intro

Language: Python - Size: 9.77 KB - Last synced at: 17 days ago - Pushed at: almost 2 years ago - Stars: 17 - Forks: 4

anhdanggit/atom-assignments

All assignments of DATAcracy ATOM Open class, which is free and aims to democratize Data Skills for Everyone. The skills includes end-to-end of a simple and small scale data solutions.

Language: Jupyter Notebook - Size: 1.66 MB - Last synced at: 9 months ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 47

agusputra4/DQLab

This is a repository to share the result of learning from materials and projects in DQLab

Language: Jupyter Notebook - Size: 15.7 MB - Last synced at: 10 months ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

deordie/deordie-digest

Data Engineering Digest

Language: SCSS - Size: 1.41 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 26 - Forks: 2

n4en/python-for-data-engineers

Python for data engineers

Size: 1.5 MB - Last synced at: 10 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

tn00627974/Data-Engineering-Project

使用ETL data pipeline 將UBER 資料清洗、排程、最後放置在GCP上運行與後續分析 的專案

Language: Jupyter Notebook - Size: 18 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

AyushRaiKhare/Ayush_Khare_Data_Engineering_Portfolio

Ayush @ Data Engineering Portfolio

Size: 18.6 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

WaliUllahbaig/Computer-Vision-and-Deep-Learning

Unlock the world of Computer Vision Projects! Dive into hands-on learning with diverse projects, code, and datasets. Elevate your AI skills and explore the realm of visual data interpretation.

Language: Jupyter Notebook - Size: 46.4 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

dataength/dataength.github.io

Data Eng Thailand

Size: 2.93 KB - Last synced at: 11 months ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 1

minhaj-313/Data-Science-Books

Welcome to the Data Science Books repository! Dive into a curated collection of resources covering various aspects of data science. Whether you're a beginner or an expert, contribute and explore to enrich our library. Let's empower each other on our data science journey

Size: 120 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

korawica/armored

Armored Models for Data Pipeline & Data Observability

Language: Python - Size: 85 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

thapadipenra/db-capstone-project

meta data engineering capstone project

Language: Jupyter Notebook - Size: 1.49 MB - Last synced at: 8 months ago - Pushed at: about 2 years ago - Stars: 5 - Forks: 1

Menziess/Databook

Data Engineering knowledge as a readable tutorial (collaboratively).

Size: 2.44 MB - Last synced at: 12 months ago - Pushed at: about 6 years ago - Stars: 4 - Forks: 1

alison-carlos/data-collect

Repository for Replication of Professor Teo Calvo's Projects

Language: Python - Size: 5.86 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

damodhar918/sdgp

This project Synthetic data generator plus (SDGP) is a python script that generates mock data based on given configurations. It can also edit and scale existing data to create high volume data. It is useful for testing, learning data domine and prototyping purposes.

Language: Python - Size: 1.11 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

vmware/versatile-data-kit

One framework to develop, deploy and operate data workflows with Python and SQL.

Language: Python - Size: 109 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 409 - Forks: 54

natayadev/cookiecutter4etls

Una plantilla de CookieCutter para ETLs en Python 🍪

Size: 13.7 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Keep-Current/web-miner

Crawls sites, to find new content and scrap it

Language: Python - Size: 215 KB - Last synced at: 12 months ago - Pushed at: about 4 years ago - Stars: 33 - Forks: 29

Julianadev/Project-Performing-a-Code-Review

Reviewing code from a CSV for cleanliness and better performance and following DRY principles

Language: Jupyter Notebook - Size: 60.5 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

antonellaobispo/antonellaobispo

My profile description ✨

Size: 2.93 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Julianadev/DAG_Operators

Automating pipelines using airflow operators

Language: Python - Size: 5.86 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Julianadev/CleanCode_DataFrame

Data cleansing to set up a PostgreSQL database, which will store a campaign's data. Project is part of DataCamp's data engineering studies

Language: Jupyter Notebook - Size: 1020 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

CD-AC/DataEnginner-Streaming_ECommerce

This project is an engineering data pipeline designed to collect real-time data from an e-commerce platform and process it for visualization using various technologies.

Language: Jupyter Notebook - Size: 1.68 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

dataengineercafe/dataengineercafe.github.io

Language: HTML - Size: 1.95 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

victorantoniassi/jr_analytics_engineer_practical_test

Minha resolução para um teste prático de uma vaga de Analytics Engineer Júnior

Language: Python - Size: 34 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

lixx21/Airflow-MySQL-To-BigQuery

ETL to move data from MySQL into BigQuery using Airflow

Language: Python - Size: 6.84 KB - Last synced at: 19 days ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1