Skip to content
Permalink
Branch: master
Commits on Jul 11, 2017
  1. Added unit test for tokenizing celebrity names

    natlungfy committed Jul 11, 2017
  2. Added names of top public figures to user dict

    natlungfy committed Jul 11, 2017
  3. Expand Chinese stop words again

    natlungfy committed Jul 11, 2017
    Fix #172
  4. expand Chinese stopwords list

    natlungfy committed Jul 11, 2017
Commits on Jul 7, 2017
  1. Add logic to detect words instead of filtering out punctuation

    natlungfy committed Jul 7, 2017
  2. Revert "Add logic to detect words instead of filtering out punctuation"

    natlungfy committed Jul 7, 2017
    This reverts commit 3e2adcf.
  3. Add logic to detect words instead of filtering out punctuation

    natlungfy committed Jul 7, 2017
  4. Fix failing Perl unit tests

    natlungfy committed Jul 7, 2017
  5. Added logic to remove Unicode punctuation in tokenization

    natlungfy committed Jul 7, 2017
  6. Expand user dictionary

    natlungfy committed Jul 7, 2017
  7. Remove unncessary comments and dict

    natlungfy committed Jul 7, 2017
  8. Attribute sources for dict & stopwords

    natlungfy committed Jul 7, 2017
Commits on Jun 28, 2017
  1. remove print statements and typos

    natlungfy committed Jun 28, 2017
  2. debug zh.py and remove test_zh.py typos, unit tests passed

    natlungfy committed Jun 28, 2017
Commits on Jun 27, 2017
  1. first commit of zh.py, test_zh.py, zh.pm, added chinese dictionaries,…

    natlungfy committed Jun 27, 2017
    … enabled zh support in Language.pm
You can’t perform that action at this time.