Skip to content
#

mllib

Here are 38 public repositories matching this topic...

apache-spark-jobs is the main repository of the Sprouts Project data mining module. It's intended to generate Apache Spark jobs on business intelligence. These jobs are uploaded to Spark Job Server, and the result of some of them are persisted at MongoDB. The results of the jobs are queried from Sprouts framework.

  • Updated Jun 12, 2017
  • Scala

😅 A topic model of reddit.com/r/EmojiPasta trained with Spark and an LDA model (NSFW) - Trigger Warning: The r/emojipasta subreddit posts controversial content and anything I have crawled is to provide visibility of a topic modeling some of this controversial content. Unfortunately there is also discriminatory speech which must be called out!

  • Updated Mar 19, 2018
  • Scala

Improve this page

Add a description, image, and links to the mllib topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the mllib topic, visit your repo's landing page and select "manage topics."

Learn more