An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: mageai

nathadriele/S3-folder-cleanup

This script automates cleaning up a specific folder in an S3 bucket, deleting all objects within it. It uses secure AWS credentials and is built on the Mage.ai platform. Additionally, the roadmap includes approach handling and logging for greater robustness and monitoring.

Language: Python - Size: 9.77 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 6 - Forks: 0

nathadriele/DMS-CDC-task-status-validator

Automated script developed in Mage.ai to monitor and validate the status of AWS DMS CDC tasks, ensuring data integrity and synchronization. Sends notifications for any detected validation issues.

Language: Python - Size: 31.3 KB - Last synced at: 11 days ago - Pushed at: 20 days ago - Stars: 6 - Forks: 0

nathadriele/DMS-missing-or-duplicate-data-validation-script

Contains a Python script designed to validate data replication tasks in AWS Database Migration Service (DMS). The script checks for potential issues such as missing or duplicate data in the tables being replicated.

Language: Python - Size: 24.4 KB - Last synced at: 11 days ago - Pushed at: 20 days ago - Stars: 9 - Forks: 0

nathadriele/data-engineering-zoomcamp

The Data Engineering Zoomcamp covers essential skills in containerization, workflow orchestration, data warehousing, analytics engineering, batch, and streaming processing. It includes tools like Docker, Terraform, BigQuery, dbt, Spark, Kafka, Kestra, Postgres, Google Data Studio, and Metabase.

Language: Python - Size: 12.5 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 8 - Forks: 1

komminarlabs/terraform-provider-mageai

Terraform provider to manage Mage AI

Language: Go - Size: 170 KB - Last synced at: 22 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

nathadriele/llm-zoomcamp

The Zoomcamp LLM Course focuses on tools for working with LLMs and RAG, including OpenAI API, HuggingFace, Elasticsearch, and Streamlit. It covers vector search, embedding creation, data ingestion with Mage, and monitoring using Grafana, emphasizing practical applications and best practices.

Language: Jupyter Notebook - Size: 1.82 MB - Last synced at: 13 days ago - Pushed at: 2 months ago - Stars: 7 - Forks: 1

nathadriele/datamart-tables-data-type-validation

This project is a Data Engineering solution implemented to validate the data types of columns in PostgreSQL tables in a DataMart. It aims to validate whether the data stored in tables conforms to the expected data types, improving data integrity and reliability.

Language: Python - Size: 16.6 KB - Last synced at: 28 days ago - Pushed at: 9 months ago - Stars: 5 - Forks: 0

nathadriele/AWS-DMS-task-restart-and-status-checker

The AWS DMS Task Restart and Status Checker is a Python script designed to restart various AWS Database Migration Service (DMS) tasks and check their status. This script leverages AWS SDK (Boto3) and Mage.ai for DMS task integration and management, ensuring efficient and reliable task handling.

Language: Python - Size: 2.48 MB - Last synced at: 28 days ago - Pushed at: 3 months ago - Stars: 8 - Forks: 0

lunaSnowflake/CryptoCurrency

Engineered a real-time Cryptocurrency dashboard on AWS using Python, MageAI, SQL, and NLP for sentiment analysis, with ML forecasting and PowerBI integration for dynamic reporting and decision-making.

Language: Jupyter Notebook - Size: 43.5 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 1

kingabzpro/5-Airflow-Alternatives-for-Data-Orchestration-Tutorial

Code examples of Luigi, Prefect, Kedro, Dagster, and MageAI

Language: Python - Size: 40 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

JBris/mage_ai_docker

Docker image for Mage AI deployment using Docker

Language: Dockerfile - Size: 8.79 KB - Last synced at: 12 days ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

ShubhamMohanty680/Uber_Data_Analysis

Language: Python - Size: 3.99 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

Hamagistral/de-zoomcamp-ui

🎨 UI for the Free Data Engineering Zoomcamp Course provided by DataTalksClub

Language: Python - Size: 103 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 224 - Forks: 100

manuelandersen/football-pipeline

DE Zoomcamp 2024 Final Project 🧙

Language: Python - Size: 975 KB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

ChukwuemekaAham/uber-gcp-etl-project

Data Engineering Zoomcamp Final Project

Language: Jupyter Notebook - Size: 33.7 MB - Last synced at: about 2 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

ChukwuemekaAham/data-engineering-zoomcamp

Datatalks Club Free Data Engineering Zoomcamp Project

Language: Jupyter Notebook - Size: 4.63 MB - Last synced at: about 2 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

benitomartin/de-ch-weather

Data Engineering Swiss Air Quality

Language: Python - Size: 8.03 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Siddha911/Citibike-data-project

Data engineering project for the DEZoomcamp 2024

Language: Python - Size: 2.6 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

AymanSulaiman/pokeFastAPI

pokéFastApi: A backend API with Docker Compose integration for retrieving and searching Pokémon data using FastAPI and DuckDB.

Language: Python - Size: 49.8 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

meysamraz/laptop-price-prediction-end-to-end-project-using-ecommerce-website-data

Using machine learning, feature engineering, and web scraping, I created an end-to-end laptop price prediction website by scraping data from a popular Iranian source. Empowering users with accurate pricing estimates and model comparisons

Language: Jupyter Notebook - Size: 2.22 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

AviatorIfeanyi/etl_with_mage_ai

An ETL data pipeline that extracts data from source and loads it to destination, automated using mage.ai

Language: Python - Size: 246 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 2

thedatanerdz/DEP-8

Analyzing Uber data using various tools and technologies, including GCP Storage, Python, Compute Instance, Mage Data Pipeline Tool, BigQuery, and Looker Studio.

Language: Jupyter Notebook - Size: 120 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Hamagistral/NYCTaxi-Analytics-ETL

🚕 Performing Data Analytics on NYC Taxi data using GCP and MageAI

Language: Jupyter Notebook - Size: 4.2 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0