bact commented Apr 20, 2019

Bug fixes in FrequencySummarizer, word_tokenize, tcc, etc., and small code improvements

bact and others added 30 commits April 9, 2019 02:56
Update from original repo
…ame as other word tokenizers)

- small improvements in thai_strftime() (some formatting extensions are not implemented yet)
Small code refactor in pythainlp.tokenize.etcc, pythainlp.tokenize.tcc, pythainlp.util.date.thai_strftime(), and pythainlp.corpus.ttc
Update README-pypi.md
deepcut & dict_word_tokenize
deepcut + dict_word_tokenize
…gine. Handles Trie, Iterable[str], and str (path to dictionary).
Makes custom dictionary argument more consistent across different engines
- ICU transliterate engine "pyicu" renamed to just "icu"
Fix transliteration bug: ain (ไหน) -> nai (ไหน)
Improvement in romanization
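
For readers skimming the commit list: one commit above mentions a custom dictionary argument that handles a Trie, an Iterable[str], or a str (path to dictionary). The sketch below is only an illustration of that kind of normalization, not the code in this PR. The `Trie` class and the `to_trie` helper are hypothetical stand-ins written for this example, and the one-word-per-line file format is an assumption.

```python
from typing import Iterable, Set, Union


class Trie:
    """Minimal stand-in for a dictionary trie (hypothetical, not the library's class)."""

    def __init__(self, words: Iterable[str]):
        self.words: Set[str] = set()
        self.root: dict = {}
        for word in words:
            self.add(word)

    def add(self, word: str) -> None:
        word = word.strip()
        if not word:
            return  # skip blank entries
        self.words.add(word)
        node = self.root
        for ch in word:
            node = node.setdefault(ch, {})
        node[""] = True  # end-of-word marker

    def __contains__(self, word: str) -> bool:
        return word in self.words


def to_trie(custom_dict: Union[Trie, Iterable[str], str]) -> Trie:
    """Normalize the three accepted forms into a Trie (hypothetical helper)."""
    if isinstance(custom_dict, Trie):
        return custom_dict  # already a trie, reuse as-is
    if isinstance(custom_dict, str):
        # Assumption: a plain str is a path to a one-word-per-line UTF-8 file.
        with open(custom_dict, encoding="utf-8") as f:
            return Trie(f.read().splitlines())
    return Trie(custom_dict)  # any other iterable of words
```

The `str` check comes before the generic iterable branch because a string is itself iterable; without that ordering, a path would be split into single characters.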
bact changed the title from "Fix FrequencySummarizer issue with word_tokenize()" to "Merge 2.0.3 and 2.0.4 from dev to 2.0 branch" on Apr 20, 2019
bact merged commit b926334 into 2.0 on Apr 20, 2019
@coveralls
Coverage Status

Coverage decreased (-0.4%) to 81.382% when pulling 5fb581f on dev into 3c2de29 on 2.0.

4 participants