GitHub / shaharpit809 / Latent-Dirichlet-allocation-LDA-on-YELP-dataset-using-Apache-Spark
This repository compares the two LDA algorithms (EM and Online) in Apache Spark's 'mllib' library and searches for the best hyperparameters on the Yelp dataset.
Stars: 3
Forks: 1
Open issues: 3
License: None
Language: Java
Size: 6.43 MB
Created at: almost 6 years ago
Updated at: over 2 years ago
Pushed at: about 2 years ago
Topics: data-partitions, lda-algorithms, mllib, perplexity, pos-tagging, spark, yelp-dataset