GitHub topics: aws-glue-data-catalog

Repositories

ev2900/Iceberg_Glue_register_table

Example using the Iceberg register_table command with AWS Glue and Glue Data Catalog

Language: Python - Size: 544 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 1

harika-majji/aws-stock-market-analysis

Language: Jupyter Notebook - Size: 2.38 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

jibbs1703/Tickit-Data-Pipeline

This repository demonstrates how to build a data pipeline using an orchestrator together with on-premises and cloud resources. It collects data from on-premises SQL and NoSQL databases and loads it into a SQL database in the cloud.

Language: Python - Size: 5.86 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

deept-agl/Youtube-data-ETL-Analysis-using-AWS

This project creates a scalable data pipeline to analyze YouTube data from Kaggle using AWS services: S3, Glue, Lambda, Athena, and QuickSight. It processes raw JSON and CSV files into cleansed, partitioned datasets, integrates them with ETL workflows, and catalogs data for querying. Final insights are visualized in QuickSight dashboards.

Language: Python - Size: 177 MB - Last synced at: 23 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

ShubhamMohanty680/Spotify_end_to_end_data_engineering

A project built around an ETL (Extract, Transform, Load) pipeline that uses the Spotify API on AWS.

Language: Jupyter Notebook - Size: 1.44 MB - Last synced at: 11 days ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

j3-signalroom/supercharge_streamlit-apache_flink

Engaging, interactive visualizations crafted with Streamlit, seamlessly powered by Apache Flink in batch mode to reveal deep insights from data.

Language: Python - Size: 650 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

SadafAsad/LinkedIn-Jobs-Analysis

Unveiling job market trends with Scrapy and AWS

Language: Python - Size: 562 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

shiv-rna/Youtube-Data-Engineering-Pipeline

This project repo 📺 provides a pipeline for managing, processing, and analyzing YouTube video data with AWS services, covering both structured statistics and trending key metrics.

Language: Python - Size: 179 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

omkarfadtare/Practical_data_science

Handwritten notes on Coursera's Practical Data Science specialization.

Size: 82 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

rahulrajan15/Stock_Market_Kafka

Real-Time Stock Market Data Science Project using Apache Kafka: Analyzing and predicting stock market trends in real-time for informed decision-making. Scalable and low-latency data processing.

Language: Jupyter Notebook - Size: 2.48 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Related Keywords

aws-glue-data-catalog (10) - aws-s3 (8) - aws (5) - aws-glue (5) - aws-glue-crawler (5) - aws-athena (5) - aws-ec2 (3) - python (3) - aws-lambda (3) - aws-quicksight (2) - data-engineering-pipeline (2) - data-engineering (2) - apache-iceberg (2) - iceberg (2) - kafka (2) - pyflink (1) - streamlit (1) - streamlit-dashboard (1) - scrapy (1) - aws-cli (1) - aws-iam (1) - youtube (1) - aws-data-wrangler (1) - aws-glue-workflow (1) - data-analysis-python (1) - data-science (1) - kafka-consumer (1) - kafka-producer (1) - kafka-streams (1) - kafka-topic (1) - sql (1) - stock-analysis (1) - boto3 (1) - data-lake (1) - database (1) - etl-pipeline (1) - medallion-architecture (1) - mongodb (1) - precommit-hooks (1) - athena (1) - aws-redshift (1) - quicksight (1) - aws-trigger (1) - awscloudwatch (1) - glue (1) - spotify-api (1) - spotipy-library (1) - apache-flink (1) - flink (1) - flink-sql (1)
