GitHub topics: data-engineer
dhkdn9192/data_engineer_career
DE직무에 필요한 모든 것
Language: Jupyter Notebook - Size: 14.6 MB - Last synced at: about 7 hours ago - Pushed at: 19 days ago - Stars: 201 - Forks: 28

Andessonreis/andessonreis
Size: 2.75 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 2 - Forks: 0

rezarajan/whoami
Resume source files
Language: SCSS - Size: 1.05 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

Thinh127/Thinh127.github.io
My first blog
Language: SCSS - Size: 459 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

itsSwapnil/Python-to-ELK-data-pipeline-project
A Python-based ETL pipeline that extracts data from an Oracle database using SQL, transforms it into a structured format, and indexes it into Elasticsearch for analytics and reporting.
Language: Python - Size: 44.9 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

itsSwapnil/Data-Interpolation-with-Radial-Basis-Function
A PySpark-based solution for cleaning and interpolating battery sensor data using forward/backward fill and Radial Basis Function (RBF) spatial interpolation. Outputs a clean, fully interpolated dataset in CSV format for advanced analysis.
Language: Python - Size: 13.7 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

itsSwapnil/Milvus-vector-database-project
This project asynchronously scrapes web content, generates semantic text chunks using sentence embeddings, and stores them in a Milvus vector database for efficient similarity search. Built with Python, Langchain, SentenceTransformers, and Milvus for scalable vector-based retrieval.
Language: Python - Size: 17.6 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

Hippaho/Sparkify
A music streaming company, Sparkify, has decided that it is time to introduce more automation and monitoring to their data warehouse ETL pipelines and come to the conclusion that the best tool to achieve this is Apache Airflow.
Language: Python - Size: 17.6 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

Shakespear567/Data_Engineering_GCP
Data Engineering Using Google Could Platform and Mage
Size: 1000 Bytes - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

pierrehanne/pierrehanne
My GitHub Profile Page
Size: 76.2 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

itsSwapnil/pyspark-incremental-airflow
This repository contains an Airflow DAG that orchestrates an incremental data pipeline using PySpark scripts. The pipeline automates daily processing data, syncs results to S3, performs housekeeping, and loops until a target date threshold is reached.
Language: Python - Size: 13.7 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

data-engineering-community/data-engineering-wiki
The best place to learn data engineering. Built and maintained by the data engineering community.
Language: CSS - Size: 7.84 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1,689 - Forks: 198

chrimaho/chrimaho
My Personal Repository
Size: 299 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 2 - Forks: 0

rashakil-ds/Roadmap-Docs
Best Data Science, Data Analytics, AI, and SDE roadmaps. This repository is continually updated based on the top job postings on LinkedIn and Indeed in the data science and AI domain.
Size: 1.71 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 534 - Forks: 237

mensenvau/mensenvau
I am a Senior Data Engineer.
Language: HTML - Size: 3.15 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 2 - Forks: 1

adamjpatterson/adamjpatterson
Data Architect and Full Stack Engineer with an Interest in Data Science
Size: 153 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

mazzasaverio/data-engineering-save
Data Engineering Notes, Resources & Insights
Size: 51.7 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 4 - Forks: 0

Thomas-George-T/Thomas-George-T
Readme for my :octocat: Profile
Size: 205 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 77 - Forks: 86

Devinterview-io/data-engineer-interview-questions
🟣 Data Engineer interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
Size: 28.3 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 27 - Forks: 12

AldythNahak/AldythNahak
💼 Professional / General Developer Style 👋 Hi there! This is my GitHub profile repository. 🛠️ Here you'll find my pinned projects, code experiments, and learning progress. 🔍 I'm passionate about clean code, automation, and solving problems with technology.
Size: 56.6 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1 - Forks: 0

higorcazuza81/higorcazuza81
A little about me
Size: 3.56 MB - Last synced at: 6 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

BayoAdejare/lightning-containers
Docker powered starter for geospatial analysis of lightning atmospheric data.
Language: Python - Size: 159 MB - Last synced at: about 10 hours ago - Pushed at: 25 days ago - Stars: 6 - Forks: 2

vmware/versatile-data-kit
One framework to develop, deploy and operate data workflows with Python and SQL.
Language: Python - Size: 110 MB - Last synced at: 19 days ago - Pushed at: 21 days ago - Stars: 449 - Forks: 59

sergialonsaco/sergialonsaco
My own readme file
Size: 48.8 KB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 0 - Forks: 0

mikecerton/Apache_Kafka_Basic
This repository provides a fundamental understanding of Apache Kafka, including its core components, basic Python scripts to demonstrate how to create topics, produce messages, and consume messages, as well as a docker-compose.yml file for easy setup. [Data Engineer]
Language: Python - Size: 14.6 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

aaaastark/Top-Big-Data-Scientist-Questions-For-Interview
Top Big Tech Data Science Questions
Size: 1.55 MB - Last synced at: 8 days ago - Pushed at: over 2 years ago - Stars: 12 - Forks: 8

husseinelsaadii/husseinelsaadii
Here's my profile, a quick introduction on me :)
Size: 13.7 KB - Last synced at: about 19 hours ago - Pushed at: about 20 hours ago - Stars: 0 - Forks: 0

kelvins/awesome-dataops
:sunglasses: A curated list of awesome DataOps tools
Language: Python - Size: 112 KB - Last synced at: 27 days ago - Pushed at: 8 months ago - Stars: 188 - Forks: 29

mikecerton/The-Retail-ELT-Pipeline-End-To-End
This project designs and implements an ETL pipeline using Apache Airflow (Docker Compose) to ingest, process, and store retail data. AWS S3 acts as the data lake, AWS Redshift as the data warehouse, and Looker Studio for visualization. [Data Engineer]
Language: Python - Size: 1.07 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

mikecerton/UserInsight-Streaming-Data-Pipeline
UserInsight-Streaming-Data-Pipeline is a real-time pipeline that ingests API data into Kafka, processes it with Spark, stores it in S3, and uses AWS Lambda to load it into Redshift. The data is then used to create a dashboard in Looker. [Data Engineer]
Language: Python - Size: 325 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

drahulsingh/drahulsingh
I'm an IT graduate with a passion for data, software, and cloud computing. With a knack for problem-solving and a commitment to staying updated with cutting-edge technologies, I aim to contribute to innovative projects and help organizations achieve their goals.
Size: 128 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 7 - Forks: 0

zulfaan/e-commerce-analyst-portfolio
Developed and maintained a Luigi‑driven ETL pipeline to scrape, clean, and load Tokopedia Exsport Bag product, stock, and order data into PostgreSQL, enabling in‑depth category‑level sales and revenue analysis that powered data‑driven decision‑making
Language: Jupyter Notebook - Size: 2.73 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

swapnadb/Nyc-Taxi-Data-Engineering-Project
Language: Jupyter Notebook - Size: 2.94 MB - Last synced at: 29 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

AhmetFurkanDEMIR/Data-Engineering-Project-with-HDFS-and-Kafka
Data Engineering Project with Hadoop HDFS and Kafka
Language: Python - Size: 3.46 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 102 - Forks: 25

CharlesEmil/sql-data-warehouse-project
Building a modern data warehouse with SQL Server, Including ETL processes, data modeling and analytics.
Language: TSQL - Size: 1.82 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

apancoast/Healthcare-Deserts-and-Public-Transit
This dbt-based project aims to analyze the intersection of healthcare accessibility and public transit coverage in Mecklenburg County, NC.
Language: Python - Size: 104 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

wwwlike/vlife
企业级低代码快速开发平台,包含页面可视化配置、自定义表单、自定义报表、权限管理脚手架应用、前后端代码自动生成;主要特点是低代码开发,可实现复杂CRUD功能仅编写数据模型就能完成前后端开发
Language: TypeScript - Size: 9.05 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 709 - Forks: 85

ortizfram/datacamp-Data-Engineer-with-Python-course
datacamp Data Engineer with Python course. 73 hours/ 19 Courses /2 Skill Assessments
Language: Python - Size: 22 MB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 104 - Forks: 34

janainacazuza/janainacazuza
A little about me
Size: 517 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

dhamodharanrk/online-cv
Mr.Dhamodharan's Curriculum vitae
Language: CSS - Size: 3.7 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 1

mitgar14/etl-workshop-2
Workshop #2 (ETL process using Airflow) for the ETL course using Apache Airflow to build a data pipeline.
Language: Jupyter Notebook - Size: 7.58 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

imsathiya17/imsathiya17.github.io
Language: CSS - Size: 21.9 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

FadhiilDzaki/etl_superstore
This project automates ETL for Superstore data, extracting from PostgreSQL, transforming in Python, and reloading into PostgreSQL weekly. I conducted data analysis in Jupyter Notebook and built a Metabase dashboard for insights.
Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

lucasleonetti/ETL_Aviacion_Civil
Proyecto ETL en Hadoop Ecosystem Para responder a consultas de negocio para la ANAC
Size: 0 Bytes - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

mikecerton/ETLpython_Oracle
ETL from couchDB to Oracle using python [Data Engineer]
Language: Python - Size: 11.7 KB - Last synced at: 2 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

sanketrs/sql-interview-preparation-questions-with-answers
Designed as a comprehensive resource for aspiring data analysts, data engineers, and database administrators.
Size: 16.6 KB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 4 - Forks: 0

DAN3002/Tiktok-Crawler
This is a simple Tiktok crawler that can be used to download videos from Tiktok. It uses the Tiktok API to get the video URL and then downloads the video using the requests library. It can download video from multiple hashtags or download by sound.
Language: Python - Size: 5.86 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

rublaman/data-engineering-portfolio
This portfolio presents a compilation of data engineering projects that highlight the knowledge and use of tools in an optimal way for data flow. The repository demonstrates a commitment to developing robust solutions using diverse technologies, addressing practical challenges in the field.
Language: Jupyter Notebook - Size: 1.67 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

mitgar14/etl-workshop-1
Workshop #1 (Data Engineer) for the ETL course using Pandas, Matplotlib, SQLAlchemy and Power BI for the creation of the dashboard.
Language: Jupyter Notebook - Size: 4.93 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

epsi10nvn/RealTime-Sales-Inventory
A retail data warehouse that processes real-time purchases from Kafka, stores them in PostgreSQL with a star schema, and enables sales analysis, inventory tracking, and interactive reporting.
Language: Python - Size: 298 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Justmalhar/ThinkLikeAnEngineer
💡 Think Like An Engineer is a roadmap for engineering leadership, a toolkit for growth hacking through engineering, and a manifesto for productivity enhancement
Size: 61.5 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

mauricioarcez/mauricioarcez.github.io
Te invito a conocerme en mi portafolio. Como puedo ayudarte?
Language: CSS - Size: 5.42 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

nivesayee/nivesayee
Size: 1.07 MB - Last synced at: 27 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

mohidex/data-pipeline-on-gcp
The Real-time Ecommerce Data Collection and Processing project empowers businesses with real-time insights by efficiently extracting, processing, and storing ecommerce data from multiple sources. Combining Golang and Python, this cutting-edge solution streamlines data handling from diverse ecommerce websites.
Language: Python - Size: 1010 KB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 6 - Forks: 0

tuanx18/data-engineer-portfolio
This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science topics.
Size: 276 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 40 - Forks: 10

digitalghost-dev/premier-league 📦
A Data Engineering project. Repository for backend infrastructure and Streamlit app files for a Premier League Dashboard.
Language: Python - Size: 487 KB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 224 - Forks: 17

lixx21/spotify-scrapping
Scraping data from Spotify Playlist URL using Python and Selenium
Language: Python - Size: 4.88 KB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 1

Candratama/ETLProject
An ETL (Extract, Transform, Load) pipeline that processes WhatsApp messages data from MySQL, transforms it, and loads the results into PostgreSQL and CSV files
Language: Python - Size: 77.1 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

taufiksatrian/DigitalentKominfo
Dokumentasi Praktikum dan Project Hands On Pelatihan Talent Scouting Academy Digitalent Kominfo
Language: Jupyter Notebook - Size: 10.6 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

lixx21/dbt-shopping-data-transform
This project leverages DBT (Data Build Tool) to transform raw shopping data into a well-structured, analytics-ready format
Language: Dockerfile - Size: 610 KB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

lixx21/airflow-dbt-gcp
A comprehensive data pipeline leveraging Airflow, DBT, Google Cloud Platform (GCP), and Docker to extract, transform, and load data seamlessly from a staging layer to a data warehouse and data mart.
Language: Python - Size: 257 KB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 6 - Forks: 5

mjs1995/Book_review
book review for data-engineer
Size: 1.49 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 5 - Forks: 0

mensenvau/data_engineering_solution_no1
Data Engineer Lead Analyst Case Study
Size: 2.95 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

mensenvau/leetcode_sql_problems
😊️️️️️️ Leetcode database part solutions
Language: TSQL - Size: 92.8 KB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

mensenvau/internship_sql_analytics
🚀 Internship SQL (East, Advanced)
Language: TSQL - Size: 10.4 MB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

patcha-ranat/Ecommerce-Invoice-End-to-end
End-to-end Data Project (DA/DS/DE/MLOps) - retail/e-commerce - dynamic interpretable customer segmentation
Language: Jupyter Notebook - Size: 49.9 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 7 - Forks: 0

BayoAdejare/pipeline-ecommerce
E-commerce Data Pipeline
Language: Python - Size: 22.5 MB - Last synced at: about 10 hours ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

andkret/Cookbook
The Data Engineering Cookbook
Size: 35.5 MB - Last synced at: 8 months ago - Pushed at: 10 months ago - Stars: 13,683 - Forks: 2,498

SimonOsipov/SimonOsipov
Description for Github mainpage
Size: 51.8 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

ivanshamaev/python-algorithms-data-engineer
Python Algorithms for Data Engineers | Алгоритмы Python для инженеров данных. Этот репозиторий содержит коллекцию Jupyter notebooks, с решениями различных алгоритмических задач. Notebooks собраны специально для инженеров данных и помогут улучшить навыки problem-solving и algorithmic thinking с помощью упражнений.
Language: Jupyter Notebook - Size: 362 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

longNguyen010203/100Day-Self-Learning-DE
📚💻⌨ Self-study process for more than 3 months with 3-4h/day to prepare for the journey of applying for an intern or fresher position as a Data Engineer in 2024 ️🥇️🏆
Language: Jupyter Notebook - Size: 56.6 KB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

cainzuca/scrapy_celphone
Projeto com o objetivo de extrair dados da web.
Language: Python - Size: 63.5 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

unnati-xyz/scalable-data-science-platform
Content for architecting a data science platform for products using Luigi, Spark & Flask.
Language: Jupyter Notebook - Size: 22.7 MB - Last synced at: 7 months ago - Pushed at: over 5 years ago - Stars: 163 - Forks: 28

BayoAdejare/pipeline-edtech
Edtech ADF Pipeline Project
Size: 12.7 KB - Last synced at: about 10 hours ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

BayoAdejare/pipeline-sleep
Sleep Data Pipeline with Azure Data Factory
Size: 21.5 KB - Last synced at: about 10 hours ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

pwenker/data-engineering
My notes for Udacity's Data Engineering Nanodegree.
Size: 1.3 MB - Last synced at: 3 days ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 0

mimnets/data-analyst-roadmap-2024
Becoming a Data Analyst: Your Path to a Job-Ready Bootcamp
Size: 1.47 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

zekeriyyaa/Building-A-Data-Pipeline-For-ROS-Compliant-Robotic-System-Via-Amazon-Web-Services
Language: Python - Size: 959 KB - Last synced at: 2 months ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 2

camposvinicius/aws-etl
This is an ETL application on AWS with general open sales and customer data that you can find here: https://github.com/camposvinicius/data/blob/main/AdventureWorks.zip, it's a zipped file with some .csvs inside that we will apply transformations.
Language: Smarty - Size: 168 KB - Last synced at: 6 months ago - Pushed at: over 3 years ago - Stars: 17 - Forks: 3

RenanBjj/Databricks-PowerBI-OpticalSales
Databricks optical sales
Language: Jupyter Notebook - Size: 2.08 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

Hanan-Nawaz/100_Days_of_Data_Engineering
Journey through 100 days of Data Engineering, featuring daily learning, practice, and projects. This repository includes notes, exercises, and code snippets covering essential topics such as GitHub, Python, ETL, data pipelines, and more
Language: Jupyter Notebook - Size: 10.5 MB - Last synced at: 4 months ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

jasontanx/prefect-learning
Prefect - Data orchestration tool practice & learning
Language: Python - Size: 314 KB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

adilkhash/apache-airflow-intro
Language: Python - Size: 9.77 KB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 17 - Forks: 4

anhdanggit/atom-assignments
All assignments of DATAcracy ATOM Open class, which is free and aims to democratize Data Skills for Everyone. The skills includes end-to-end of a simple and small scale data solutions.
Language: Jupyter Notebook - Size: 1.66 MB - Last synced at: 11 months ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 47

agusputra4/DQLab
This is a repository to share the result of learning from materials and projects in DQLab
Language: Jupyter Notebook - Size: 15.7 MB - Last synced at: 12 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

deordie/deordie-digest
Data Engineering Digest
Language: SCSS - Size: 1.41 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 26 - Forks: 2

tn00627974/Data-Engineering-Project
使用ETL data pipeline 將UBER 資料清洗、排程、最後放置在GCP上運行與後續分析 的專案
Language: Jupyter Notebook - Size: 18 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

AyushRaiKhare/Ayush_Khare_Data_Engineering_Portfolio
Ayush @ Data Engineering Portfolio
Size: 18.6 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

WaliUllahbaig/Computer-Vision-and-Deep-Learning
Unlock the world of Computer Vision Projects! Dive into hands-on learning with diverse projects, code, and datasets. Elevate your AI skills and explore the realm of visual data interpretation.
Language: Jupyter Notebook - Size: 46.4 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

dataength/dataength.github.io
Data Eng Thailand
Size: 2.93 KB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 1

minhaj-313/Data-Science-Books
Welcome to the Data Science Books repository! Dive into a curated collection of resources covering various aspects of data science. Whether you're a beginner or an expert, contribute and explore to enrich our library. Let's empower each other on our data science journey
Size: 120 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

korawica/armored
Armored Models for Data Pipeline & Data Observability
Language: Python - Size: 85 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

thapadipenra/db-capstone-project
meta data engineering capstone project
Language: Jupyter Notebook - Size: 1.49 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

Menziess/Databook
Data Engineering knowledge as a readable tutorial (collaboratively).
Size: 2.44 MB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 1

arojas3552/citybike-dataEng
Data Engineering Project
Language: Python - Size: 143 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

alison-carlos/data-collect
Repository for Replication of Professor Teo Calvo's Projects
Language: Python - Size: 5.86 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

damodhar918/sdgp
This project Synthetic data generator plus (SDGP) is a python script that generates mock data based on given configurations. It can also edit and scale existing data to create high volume data. It is useful for testing, learning data domine and prototyping purposes.
Language: Python - Size: 1.11 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

natayadev/cookiecutter4etls
Una plantilla de CookieCutter para ETLs en Python 🍪
Size: 13.7 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Keep-Current/web-miner
Crawls sites, to find new content and scrap it
Language: Python - Size: 215 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 33 - Forks: 29

Julianadev/Project-Performing-a-Code-Review
Reviewing code from a CSV for cleanliness and better performance and following DRY principles
Language: Jupyter Notebook - Size: 60.5 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0
