An open API service providing repository metadata for many open source software ecosystems.

GitHub / coderjolly / pyspark-yelp-data-analysis

A comparative study to understand the computing efficiencies of Pyspark architectures vs python based distributed programming methodologies such as MPI, multi-threading or multi-processing on the Yelp kaggle dataset.

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/coderjolly%2Fpyspark-yelp-data-analysis
PURL: pkg:github/coderjolly/pyspark-yelp-data-analysis

Stars: 0
Forks: 0
Open issues: 0

License: gpl-3.0
Language: Jupyter Notebook
Size: 16.8 MB
Dependencies parsed at: Pending

Created at: about 2 years ago
Updated at: about 2 years ago
Pushed at: about 2 years ago
Last synced at: 4 months ago

Commit Stats

Commits: 8
Authors: 1
Mean commits per author: 8.0
Development Distribution Score: 0.0
More commit stats: https://commits.ecosyste.ms/hosts/GitHub/repositories/coderjolly/pyspark-yelp-data-analysis

Topics: distributed-system-design, distributed-systems-challenges, mpi, multiprocessing, multithreading, pyspark, pyspark-python

    Loading...