An example of using liblinear-java in Scala with the 20 newsgroups data set.
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Failed to load latest commit information.

Liblinear-java 20 Newsgroups Example

An example showing how to use liblinear-java for text classification in Scala with the 20 newsgroups data set. Additionally, this example shows how Apache Lucene can be leveraged for tokenizing, shingling, and more.

##Downloading the Data Set

The 20 newsgroups data set is available here. This example requires the "bydate" version.

Unzip and place this directory in the root of this project.

Building & Running the Example

Requires java and sbt.

Building and running:

$ sbt run