An open API service providing repository metadata for many open source software ecosystems.

GitHub / arjunsawhney1 / scalable-ML

In this repo, I build a LogisticRegression prediction model with Dask and PySpark and initialize an AWS EMR cluster to run the entire pipeline.

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/arjunsawhney1%2Fscalable-ML
PURL: pkg:github/arjunsawhney1/scalable-ML

Fork of rajeevdixit19/Scaleable-Ml
Stars: 0
Forks: 0
Open issues: 0

License: None
Language:
Size: 131 KB
Dependencies parsed at: Pending

Created at: over 3 years ago
Updated at: almost 2 years ago
Pushed at: about 4 years ago
Last synced at: almost 2 years ago

Topics: aws, aws-ec2, aws-emr-clusters, aws-s3, dask, dask-distributed, dask-ml, logistic-regression, pyspark, scalability, spark

    Loading...