GitHub topics: data-engineer
mikecerton/UserInsight-Streaming-Data-Pipeline
UserInsight-Streaming-Data-Pipeline is a real-time pipeline that ingests API data into Kafka, processes it with Spark, stores it in S3, and uses AWS Lambda to load it into Redshift. The data is then used to create a dashboard in Looker. [Data Engineer]
Language: Python - Size: 325 KB - Last synced at: about 18 hours ago - Pushed at: about 19 hours ago - Stars: 1 - Forks: 0

Hippaho/Sparkify
A music streaming company, Sparkify, has decided that it is time to introduce more automation and monitoring to their data warehouse ETL pipelines and come to the conclusion that the best tool to achieve this is Apache Airflow.
Language: Python - Size: 17.6 KB - Last synced at: about 23 hours ago - Pushed at: about 24 hours ago - Stars: 0 - Forks: 0

Shakespear567/Data_Engineering_GCP
Data Engineering Using Google Could Platform and Mage
Size: 1000 Bytes - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

drahulsingh/drahulsingh
I'm an IT graduate with a passion for data, software, and cloud computing. With a knack for problem-solving and a commitment to staying updated with cutting-edge technologies, I aim to contribute to innovative projects and help organizations achieve their goals.
Size: 128 KB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 7 - Forks: 0

zulfaan/e-commerce-analyst-portfolio
Developed and maintained a Luigi‑driven ETL pipeline to scrape, clean, and load Tokopedia Exsport Bag product, stock, and order data into PostgreSQL, enabling in‑depth category‑level sales and revenue analysis that powered data‑driven decision‑making
Language: Jupyter Notebook - Size: 2.73 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

adamjpatterson/adamjpatterson
Data Architect and Full Stack Engineer with an Interest in Data Science
Size: 103 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

swapnaxdata/Nyc-Taxi-Data-Engineering-Project
Language: Jupyter Notebook - Size: 2.94 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

pierrehanne/pierrehanne
My GitHub Profile Page
Size: 66.4 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

kelvins/awesome-dataops
:sunglasses: A curated list of awesome DataOps tools
Language: Python - Size: 112 KB - Last synced at: 3 days ago - Pushed at: 6 months ago - Stars: 184 - Forks: 29

dhkdn9192/data_engineer_career
DE직무에 필요한 모든 것
Language: Jupyter Notebook - Size: 14.6 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 196 - Forks: 28

AhmetFurkanDEMIR/Data-Engineering-Project-with-HDFS-and-Kafka
Data Engineering Project with Hadoop HDFS and Kafka
Language: Python - Size: 3.46 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 102 - Forks: 25

Andessonreis/andessonreis
Size: 2.74 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 2 - Forks: 0

chrimaho/chrimaho
My Personal Repository
Size: 278 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 2 - Forks: 0

data-engineering-community/data-engineering-wiki
The best place to learn data engineering. Built and maintained by the data engineering community.
Language: CSS - Size: 7.83 MB - Last synced at: 11 days ago - Pushed at: 16 days ago - Stars: 1,632 - Forks: 183

BayoAdejare/lightning-containers
Docker powered starter for geospatial analysis of lightning atmospheric data.
Language: Python - Size: 149 MB - Last synced at: 7 days ago - Pushed at: 12 days ago - Stars: 6 - Forks: 2

rashakil-ds/Roadmap-Docs
Best Data Science, Data Analytics, AI, and SDE roadmaps. This repository is continually updated based on the top job postings on LinkedIn and Indeed in the data science and AI domain.
Size: 1.16 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 385 - Forks: 149

CharlesEmil/sql-data-warehouse-project
Building a modern data warehouse with SQL Server, Including ETL processes, data modeling and analytics.
Language: TSQL - Size: 1.82 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

rezarajan/whoami
Resume source files
Language: TeX - Size: 800 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

apancoast/Healthcare-Deserts-and-Public-Transit
This dbt-based project aims to analyze the intersection of healthcare accessibility and public transit coverage in Mecklenburg County, NC.
Language: Python - Size: 104 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

wwwlike/vlife
企业级低代码快速开发平台,包含页面可视化配置、自定义表单、自定义报表、权限管理脚手架应用、前后端代码自动生成;主要特点是低代码开发,可实现复杂CRUD功能仅编写数据模型就能完成前后端开发
Language: TypeScript - Size: 9.05 MB - Last synced at: 13 days ago - Pushed at: 6 months ago - Stars: 709 - Forks: 85

ortizfram/datacamp-Data-Engineer-with-Python-course
datacamp Data Engineer with Python course. 73 hours/ 19 Courses /2 Skill Assessments
Language: Python - Size: 22 MB - Last synced at: 18 days ago - Pushed at: over 2 years ago - Stars: 104 - Forks: 34

higorcazuza81/higorcazuza81
A little about me
Size: 3.56 MB - Last synced at: 16 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

janainacazuza/janainacazuza
A little about me
Size: 517 KB - Last synced at: 14 days ago - Pushed at: 21 days ago - Stars: 1 - Forks: 0

dhamodharanrk/online-cv
Mr.Dhamodharan's Curriculum vitae
Language: CSS - Size: 3.7 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 1

mitgar14/etl-workshop-2
Workshop #2 (ETL process using Airflow) for the ETL course using Apache Airflow to build a data pipeline.
Language: Jupyter Notebook - Size: 7.58 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 1 - Forks: 0

imsathiya17/imsathiya17.github.io
Language: CSS - Size: 21.9 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 2 - Forks: 0

Devinterview-io/data-engineer-interview-questions
🟣 Data Engineer interview questions and answers to help you prepare for your next machine learning and data science interview in 2024.
Size: 14.6 KB - Last synced at: 23 days ago - Pushed at: about 1 year ago - Stars: 22 - Forks: 10

mikecerton/The-Retail-ELT-Pipeline-End-To-End
This project designs and implements an ETL pipeline using Apache Airflow (Docker Compose) to ingest, process, and store retail data. AWS S3 acts as the data lake, AWS Redshift as the data warehouse, and Looker Studio for visualization. [Data Engineer]
Language: Python - Size: 1.07 MB - Last synced at: 21 days ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

FadhiilDzaki/etl_superstore
This project automates ETL for Superstore data, extracting from PostgreSQL, transforming in Python, and reloading into PostgreSQL weekly. I conducted data analysis in Jupyter Notebook and built a Metabase dashboard for insights.
Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

lucasleonetti/ETL_Aviacion_Civil
Proyecto ETL en Hadoop Ecosystem Para responder a consultas de negocio para la ANAC
Size: 0 Bytes - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

mikecerton/Apache_Kafka_Basic
This repository provides a fundamental understanding of Apache Kafka, including its core components, basic Python scripts to demonstrate how to create topics, produce messages, and consume messages, as well as a docker-compose.yml file for easy setup. [Data Engineer]
Language: Python - Size: 6.84 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

mikecerton/ETLpython_Oracle
ETL from couchDB to Oracle using python [Data Engineer]
Language: Python - Size: 11.7 KB - Last synced at: 21 days ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

sanketrs/sql-interview-preparation-questions-with-answers
Designed as a comprehensive resource for aspiring data analysts, data engineers, and database administrators.
Size: 16.6 KB - Last synced at: 20 days ago - Pushed at: 4 months ago - Stars: 4 - Forks: 0

DAN3002/Tiktok-Crawler
This is a simple Tiktok crawler that can be used to download videos from Tiktok. It uses the Tiktok API to get the video URL and then downloads the video using the requests library. It can download video from multiple hashtags or download by sound.
Language: Python - Size: 5.86 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

rublaman/data-engineering-portfolio
This portfolio presents a compilation of data engineering projects that highlight the knowledge and use of tools in an optimal way for data flow. The repository demonstrates a commitment to developing robust solutions using diverse technologies, addressing practical challenges in the field.
Language: Jupyter Notebook - Size: 1.67 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

mitgar14/etl-workshop-1
Workshop #1 (Data Engineer) for the ETL course using Pandas, Matplotlib, SQLAlchemy and Power BI for the creation of the dashboard.
Language: Jupyter Notebook - Size: 4.93 MB - Last synced at: 30 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Thomas-George-T/Thomas-George-T
Readme for my :octocat: Profile
Size: 203 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 76 - Forks: 87

epsi10nvn/RealTime-Sales-Inventory
A retail data warehouse that processes real-time purchases from Kafka, stores them in PostgreSQL with a star schema, and enables sales analysis, inventory tracking, and interactive reporting.
Language: Python - Size: 298 KB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

aaaastark/Top-Big-Data-Scientist-Questions-For-Interview
Top Big Tech Data Science Questions
Size: 1.55 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 10 - Forks: 8

Justmalhar/ThinkLikeAnEngineer
💡 Think Like An Engineer is a roadmap for engineering leadership, a toolkit for growth hacking through engineering, and a manifesto for productivity enhancement
Size: 61.5 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

mauricioarcez/mauricioarcez.github.io
Te invito a conocerme en mi portafolio. Como puedo ayudarte?
Language: CSS - Size: 5.42 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

nivesayee/nivesayee
Size: 1.07 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

mohidex/data-pipeline-on-gcp
The Real-time Ecommerce Data Collection and Processing project empowers businesses with real-time insights by efficiently extracting, processing, and storing ecommerce data from multiple sources. Combining Golang and Python, this cutting-edge solution streamlines data handling from diverse ecommerce websites.
Language: Python - Size: 1010 KB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 6 - Forks: 0

tuanx18/data-engineer-portfolio
This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science topics.
Size: 276 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 40 - Forks: 10

digitalghost-dev/premier-league 📦
A Data Engineering project. Repository for backend infrastructure and Streamlit app files for a Premier League Dashboard.
Language: Python - Size: 487 KB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 224 - Forks: 17

lixx21/spotify-scrapping
Scraping data from Spotify Playlist URL using Python and Selenium
Language: Python - Size: 4.88 KB - Last synced at: 19 days ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

Candratama/ETLProject
An ETL (Extract, Transform, Load) pipeline that processes WhatsApp messages data from MySQL, transforms it, and loads the results into PostgreSQL and CSV files
Language: Python - Size: 77.1 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

taufiksatrian/DigitalentKominfo
Dokumentasi Praktikum dan Project Hands On Pelatihan Talent Scouting Academy Digitalent Kominfo
Language: Jupyter Notebook - Size: 10.6 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

lixx21/dbt-shopping-data-transform
This project leverages DBT (Data Build Tool) to transform raw shopping data into a well-structured, analytics-ready format
Language: Dockerfile - Size: 610 KB - Last synced at: 19 days ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

lixx21/airflow-dbt-gcp
A comprehensive data pipeline leveraging Airflow, DBT, Google Cloud Platform (GCP), and Docker to extract, transform, and load data seamlessly from a staging layer to a data warehouse and data mart.
Language: Python - Size: 257 KB - Last synced at: 13 days ago - Pushed at: 5 months ago - Stars: 6 - Forks: 5

mjs1995/Book_review
book review for data-engineer
Size: 1.49 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 5 - Forks: 0

nottherealtar/Data_Engineering_Assesments
Language: Python - Size: 21.5 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 1 - Forks: 1

mazzasaverio/data-software-engineering-journal
I decided to start tracking my learning, tips, code, building projects, ideas, and curiosities that I discover on my product engineer development journey. I hope that others might find interesting insights, discover their own paths, and enjoy the journey as well.
Language: TypeScript - Size: 51.7 MB - Last synced at: 11 days ago - Pushed at: 6 months ago - Stars: 4 - Forks: 0

mensenvau/data_engineering_solution_no1
Data Engineer Lead Analyst Case Study
Size: 2.95 MB - Last synced at: 29 days ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

mensenvau/leetcode_sql_problems
😊️️️️️️ Leetcode database part solutions
Language: TSQL - Size: 92.8 KB - Last synced at: 29 days ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

mensenvau/internship_sql_analytics
🚀 Internship SQL (East, Advanced)
Language: TSQL - Size: 10.4 MB - Last synced at: 29 days ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

patcha-ranat/Ecommerce-Invoice-End-to-end
End-to-end Data Project (DA/DS/DE/MLOps) - retail/e-commerce - dynamic interpretable customer segmentation
Language: Jupyter Notebook - Size: 49.9 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 7 - Forks: 0

BayoAdejare/pipeline-ecommerce
E-commerce Data Pipeline
Language: Python - Size: 22.5 MB - Last synced at: 7 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

andkret/Cookbook
The Data Engineering Cookbook
Size: 35.5 MB - Last synced at: 6 months ago - Pushed at: 9 months ago - Stars: 13,683 - Forks: 2,498

SimonOsipov/SimonOsipov
Description for Github mainpage
Size: 51.8 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

ivanshamaev/python-algorithms-data-engineer
Python Algorithms for Data Engineers | Алгоритмы Python для инженеров данных. Этот репозиторий содержит коллекцию Jupyter notebooks, с решениями различных алгоритмических задач. Notebooks собраны специально для инженеров данных и помогут улучшить навыки problem-solving и algorithmic thinking с помощью упражнений.
Language: Jupyter Notebook - Size: 362 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

longNguyen010203/100Day-Self-Learning-DE
📚💻⌨ Self-study process for more than 3 months with 3-4h/day to prepare for the journey of applying for an intern or fresher position as a Data Engineer in 2024 ️🥇️🏆
Language: Jupyter Notebook - Size: 56.6 KB - Last synced at: 15 days ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

cainzuca/scrapy_celphone
Projeto com o objetivo de extrair dados da web.
Language: Python - Size: 63.5 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

mensenvau/mensenvau
I am a Mid Software/Data Engineer.
Language: HTML - Size: 3.15 MB - Last synced at: 29 days ago - Pushed at: 6 months ago - Stars: 2 - Forks: 1

unnati-xyz/scalable-data-science-platform
Content for architecting a data science platform for products using Luigi, Spark & Flask.
Language: Jupyter Notebook - Size: 22.7 MB - Last synced at: 5 months ago - Pushed at: about 5 years ago - Stars: 163 - Forks: 28

BayoAdejare/pipeline-edtech
Edtech ADF Pipeline Project
Size: 12.7 KB - Last synced at: 7 days ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

BayoAdejare/pipeline-sleep
Sleep Data Pipeline with Azure Data Factory
Size: 21.5 KB - Last synced at: 7 days ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

pwenker/data-engineering
My notes for Udacity's Data Engineering Nanodegree.
Size: 1.3 MB - Last synced at: 5 days ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 0

mimnets/data-analyst-roadmap-2024
Becoming a Data Analyst: Your Path to a Job-Ready Bootcamp
Size: 1.47 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

zekeriyyaa/Building-A-Data-Pipeline-For-ROS-Compliant-Robotic-System-Via-Amazon-Web-Services
Language: Python - Size: 959 KB - Last synced at: 12 days ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 2

camposvinicius/aws-etl
This is an ETL application on AWS with general open sales and customer data that you can find here: https://github.com/camposvinicius/data/blob/main/AdventureWorks.zip, it's a zipped file with some .csvs inside that we will apply transformations.
Language: Smarty - Size: 168 KB - Last synced at: 5 months ago - Pushed at: about 3 years ago - Stars: 17 - Forks: 3

RenanBjj/Databricks-PowerBI-OpticalSales
Databricks optical sales
Language: Jupyter Notebook - Size: 2.08 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

Hanan-Nawaz/100_Days_of_Data_Engineering
Journey through 100 days of Data Engineering, featuring daily learning, practice, and projects. This repository includes notes, exercises, and code snippets covering essential topics such as GitHub, Python, ETL, data pipelines, and more
Language: Jupyter Notebook - Size: 10.5 MB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

jasontanx/prefect-learning
Prefect - Data orchestration tool practice & learning
Language: Python - Size: 314 KB - Last synced at: 27 days ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

adilkhash/apache-airflow-intro
Language: Python - Size: 9.77 KB - Last synced at: 17 days ago - Pushed at: almost 2 years ago - Stars: 17 - Forks: 4

anhdanggit/atom-assignments
All assignments of DATAcracy ATOM Open class, which is free and aims to democratize Data Skills for Everyone. The skills includes end-to-end of a simple and small scale data solutions.
Language: Jupyter Notebook - Size: 1.66 MB - Last synced at: 9 months ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 47

agusputra4/DQLab
This is a repository to share the result of learning from materials and projects in DQLab
Language: Jupyter Notebook - Size: 15.7 MB - Last synced at: 10 months ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

deordie/deordie-digest
Data Engineering Digest
Language: SCSS - Size: 1.41 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 26 - Forks: 2

n4en/python-for-data-engineers
Python for data engineers
Size: 1.5 MB - Last synced at: 10 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

tn00627974/Data-Engineering-Project
使用ETL data pipeline 將UBER 資料清洗、排程、最後放置在GCP上運行與後續分析 的專案
Language: Jupyter Notebook - Size: 18 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

AyushRaiKhare/Ayush_Khare_Data_Engineering_Portfolio
Ayush @ Data Engineering Portfolio
Size: 18.6 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

WaliUllahbaig/Computer-Vision-and-Deep-Learning
Unlock the world of Computer Vision Projects! Dive into hands-on learning with diverse projects, code, and datasets. Elevate your AI skills and explore the realm of visual data interpretation.
Language: Jupyter Notebook - Size: 46.4 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

dataength/dataength.github.io
Data Eng Thailand
Size: 2.93 KB - Last synced at: 11 months ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 1

minhaj-313/Data-Science-Books
Welcome to the Data Science Books repository! Dive into a curated collection of resources covering various aspects of data science. Whether you're a beginner or an expert, contribute and explore to enrich our library. Let's empower each other on our data science journey
Size: 120 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

korawica/armored
Armored Models for Data Pipeline & Data Observability
Language: Python - Size: 85 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

thapadipenra/db-capstone-project
meta data engineering capstone project
Language: Jupyter Notebook - Size: 1.49 MB - Last synced at: 8 months ago - Pushed at: about 2 years ago - Stars: 5 - Forks: 1

Menziess/Databook
Data Engineering knowledge as a readable tutorial (collaboratively).
Size: 2.44 MB - Last synced at: 12 months ago - Pushed at: about 6 years ago - Stars: 4 - Forks: 1

alison-carlos/data-collect
Repository for Replication of Professor Teo Calvo's Projects
Language: Python - Size: 5.86 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

damodhar918/sdgp
This project Synthetic data generator plus (SDGP) is a python script that generates mock data based on given configurations. It can also edit and scale existing data to create high volume data. It is useful for testing, learning data domine and prototyping purposes.
Language: Python - Size: 1.11 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

vmware/versatile-data-kit
One framework to develop, deploy and operate data workflows with Python and SQL.
Language: Python - Size: 109 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 409 - Forks: 54

natayadev/cookiecutter4etls
Una plantilla de CookieCutter para ETLs en Python 🍪
Size: 13.7 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Keep-Current/web-miner
Crawls sites, to find new content and scrap it
Language: Python - Size: 215 KB - Last synced at: 12 months ago - Pushed at: about 4 years ago - Stars: 33 - Forks: 29

Julianadev/Project-Performing-a-Code-Review
Reviewing code from a CSV for cleanliness and better performance and following DRY principles
Language: Jupyter Notebook - Size: 60.5 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

antonellaobispo/antonellaobispo
My profile description ✨
Size: 2.93 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Julianadev/DAG_Operators
Automating pipelines using airflow operators
Language: Python - Size: 5.86 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Julianadev/CleanCode_DataFrame
Data cleansing to set up a PostgreSQL database, which will store a campaign's data. Project is part of DataCamp's data engineering studies
Language: Jupyter Notebook - Size: 1020 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

CD-AC/DataEnginner-Streaming_ECommerce
This project is an engineering data pipeline designed to collect real-time data from an e-commerce platform and process it for visualization using various technologies.
Language: Jupyter Notebook - Size: 1.68 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

dataengineercafe/dataengineercafe.github.io
Language: HTML - Size: 1.95 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

victorantoniassi/jr_analytics_engineer_practical_test
Minha resolução para um teste prático de uma vaga de Analytics Engineer Júnior
Language: Python - Size: 34 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

lixx21/Airflow-MySQL-To-BigQuery
ETL to move data from MySQL into BigQuery using Airflow
Language: Python - Size: 6.84 KB - Last synced at: 19 days ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1
