This repository consists of comparison between two LDA algorithms (EM and Online) in Apache Spark 'mllib' library and also finding the best hyper parameters on YELP dataset.
-
Updated
Apr 14, 2023 - Java
This repository consists of comparison between two LDA algorithms (EM and Online) in Apache Spark 'mllib' library and also finding the best hyper parameters on YELP dataset.
Add a description, image, and links to the data-partitions topic page so that developers can more easily learn about it.
To associate your repository with the data-partitions topic, visit your repo's landing page and select "manage topics."