GitHub topics: bigdatainfrastructure
atlas555/atlas555.github.io
a personal blog
Language: HTML - Size: 28.6 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

JuanParias29/BigDataProcessing
Repositorio con proyectos y laboratorios de procesamiento de datos utilizando Databricks, Apache Spark y Python. Incluye conceptos clave de Big Data, almacenamiento, procesamiento, análisis y aprendizaje automático.
Language: Jupyter Notebook - Size: 3.59 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

kevinndungu-source/Amazon_EMR_Project_Resources
Explore and replicate Amazon EMR (Elastic MapReduce) setup and utilization for big data processing and analytics tasks, featuring comprehensive demonstrations from VPC creation to Spark job execution.
Language: Jupyter Notebook - Size: 561 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

divithraju/awesome-spark Fork of awesome-spark/awesome-spark
A curated list of awesome Apache Spark packages and resources.
Language: Python - Size: 212 KB - Last synced at: 3 days ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

kevinndungu-source/Amazon_EMR_Serverless_Demonstration
Explore the capabilities of Amazon EMR Serverless by processing semi-structured review data with Apache Spark, showcasing efficient big data analysis without managing clusters.
Language: Python - Size: 556 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0
