An efficient maximum entropy classifier, with some convenience functions

jlricon/MaxClassify

MaxClassify

This package is a wrapper around the maxent R package. It adds a few convenience functions so that the classifier can be used directly on text.

It has a few improvements over the base package:

  • Models are S3 objects, to make it easier to work with them (add them to lists, for example)
  • Multicore support. The base training algorithm is unchanged, but prediction and a tuning function are both multithreaded. Both preserve the underlying sparsity of the data, so the model uses a reasonable amount of RAM even on large datasets.
  • A function to get a table of the weights, to see which words are relevant for each class
  • Uses quanteda to create training matrices, making it easier to prepare text for classification. Note that this package does no text cleaning; it expects clean input.
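
As a rough illustration of the quanteda step mentioned in the list above, the sketch below builds a sparse document-feature matrix from already-clean text. It uses only quanteda's own `tokens()` and `dfm()` functions; the toy texts and labels are made up for the example, and none of MaxClassify's own functions are shown, since their names are not given here.

```r
library(quanteda)

# Toy, already-clean input (hypothetical data, for illustration only)
texts  <- c("good great film", "bad awful film", "great acting", "awful plot")
labels <- c("pos", "neg", "pos", "neg")

# Tokenise the text, then build a sparse document-feature matrix
toks      <- tokens(texts)
train_dfm <- dfm(toks)

# Inspect the result: one row per document, one column per distinct token
ndoc(train_dfm)
nfeat(train_dfm)
featnames(train_dfm)
```

The resulting `dfm` is stored as a sparse matrix, which is what keeps memory use manageable on larger corpora. A matrix like `train_dfm`, together with `labels`, is the kind of input a maximum entropy classifier would be trained on.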
