An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: bigdatainfrastructure

atlas555/atlas555.github.io

a personal blog

Language: HTML - Size: 28.6 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

JuanParias29/BigDataProcessing

Repositorio con proyectos y laboratorios de procesamiento de datos utilizando Databricks, Apache Spark y Python. Incluye conceptos clave de Big Data, almacenamiento, procesamiento, análisis y aprendizaje automático.

Language: Jupyter Notebook - Size: 3.59 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

kevinndungu-source/Amazon_EMR_Project_Resources

Explore and replicate Amazon EMR (Elastic MapReduce) setup and utilization for big data processing and analytics tasks, featuring comprehensive demonstrations from VPC creation to Spark job execution.

Language: Jupyter Notebook - Size: 561 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

divithraju/awesome-spark Fork of awesome-spark/awesome-spark

A curated list of awesome Apache Spark packages and resources.

Language: Python - Size: 212 KB - Last synced at: 3 days ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

kevinndungu-source/Amazon_EMR_Serverless_Demonstration

Explore the capabilities of Amazon EMR Serverless by processing semi-structured review data with Apache Spark, showcasing efficient big data analysis without managing clusters.

Language: Python - Size: 556 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0