Text Classification Engine

Look at the following tutorial for a Quick Start guide and implementation details.

Release Information

Version 6.0

Use Apache Lucene as tokenizer
Add stopwords filter
Rename Scala package name
Update SBT version

Version 5.0 First Apache Version

Major changes to namespace to reflect donation to the Apache Software Foundation.
Build changes to support modified Apache build mechanism

Version 4.0

Re-structure and design preparator and algo. less memory usage and run time is faster. Move BIDMach, VW & SPPMI algo changes to bidmach branch temporarily.

Version 3.1

Fix DataSource to read "content", "e-mail", and use label "spam" for tutorial data. Fix engine.json for default algorithm setting.

Version 2.2

Modified PreparedData to use MLLib hashing and tf-idf implementations.

Version 2.1

Fixed dot product implementation in the predict methods to work with batch predict method for evaluation.

Version 2.0

Included three different data sets: e-mail spam, 20 newsgroups, and the rotten tomatoes semantic analysis set. Includes Multinomial Logistic Regression algorithm for text classification.

Version 1.2

Fixed import script bug occuring with Python 2.

Version 1.1 Changes

Changed data import Python script to pull straight from the 20 newsgroups page.

Name		Name	Last commit message	Last commit date
Latest commit History 150 Commits
data		data
project		project
src/main/scala		src/main/scala
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
README.md		README.md
build.sbt		build.sbt
engine.json		engine.json
template.json		template.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Text Classification Engine

Release Information

Version 6.0

Version 5.0 First Apache Version

Version 4.0

Version 3.1

Version 2.2

Version 2.1

Version 2.0

Version 1.2

Version 1.1 Changes

About

Uh oh!

Releases 7

Packages

Uh oh!

Contributors 10

Uh oh!

Languages

License

apache/predictionio-template-text-classifier

Folders and files

Latest commit

History

Repository files navigation

Text Classification Engine

Release Information

Version 6.0

Version 5.0 First Apache Version

Version 4.0

Version 3.1

Version 2.2

Version 2.1

Version 2.0

Version 1.2

Version 1.1 Changes

About

Topics

Resources

License

Code of conduct

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 7

Packages 0

Uh oh!

Contributors 10

Uh oh!

Languages

Packages