Skip to content


Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP
tree: fd5e2c6858

Fetching latest commit…

Cannot retrieve the latest commit at this time

Failed to load latest commit information.


The "phrasinator" uses a simple Bayesian nonparametric model to segment
text into chunks. The inferred model is then saved so that it can rapidly
predict segments in new (but related) texts.

  Input will be a corpus of sentences, e.g.:

    economists have argued that real interest rates have fallen .

  The output will be a model that, when run with cdec, will produce
  a segmentation into phrasal units, e.g.:

    economists have argued that real_interest_rates have fallen .

To train a model, run ./ and follow instructions.

Something went wrong with that request. Please try again.