Skip to content

Latest commit

 

History

History
24 lines (15 loc) · 524 Bytes

TODO.md

File metadata and controls

24 lines (15 loc) · 524 Bytes

To Do

Bugs

(no known bugs)

Features

  • Add a text_untoken() function to turn token sequence into text: insert word-joiner (U+2060) to keep multi-word phrases together; put specified space character (ZWSP or SP) between tokens

  • wrap.pad, width arguments to utf8_print

  • token_kind and token_map functions (?)

  • Add demonstration of dictionary scaling with text_match:

    m <- text_match(x, dict$term)
    score <- tapply(dict$score[m$term], m$text, mean, default = 0)