Case mapping with a break iterator #98

Closed
gagolews opened this Issue Sep 15, 2014 · 2 comments

Owner

gagolews commented Sep 15, 2014

Use ucasemap_setBreakIterator, which sets the break iterator used for titlecasing. This would allow, for example, uppercasing only the first letter of each sentence. Also allow providing a set of stop words ("a", "the", "an", etc.).
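
To make the stop-word part of this request concrete, here is a minimal base R sketch. It is not the stringi implementation and does not use ICU: the function name `totitle_skip` and its `stop_words` parameter are hypothetical, and word boundaries are approximated with a regular expression rather than a break iterator.

```r
# Hypothetical helper (not part of stringi): title-case each word,
# leaving the listed words in lower case unless they start the string.
# Words are runs of letters found by a regex, not by an ICU break iterator.
totitle_skip <- function(x, stop_words = c("a", "an", "the")) {
  x <- tolower(x)
  m <- gregexpr("[[:alpha:]]+", x)
  regmatches(x, m) <- lapply(regmatches(x, m), function(w) {
    if (length(w) == 0) return(w)          # no words in this string
    keep <- w %in% stop_words              # words to leave in lower case
    keep[1] <- FALSE                       # always capitalize the first word
    cap <- paste0(toupper(substring(w, 1, 1)), substring(w, 2))
    ifelse(keep, w, cap)
  })
  x
}

totitle_skip("the cat and a dog")
```

A break-iterator-based version would replace the regular expression with locale-aware boundary analysis; the skipping logic would stay the same.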

@gagolews gagolews changed the title from case mapping with a break iterator to Case mapping with a break iterator Oct 22, 2014

Owner

gagolews commented Oct 24, 2014

Added a boundary argument:

> stri_trans_totitle("GOOD-OLD cOOkiE mOnSTeR IS watCHinG You. Here HE comes!", boundary="word")
[1] "Good-Old Cookie Monster Is Watching You. Here He Comes!"
> stri_trans_totitle("GOOD-OLD cOOkiE mOnSTeR IS watCHinG You. Here HE comes!", boundary="sentence")
[1] "Good-old cookie monster is watching you. Here he comes!"

@gagolews gagolews self-assigned this Oct 24, 2014

@gagolews gagolews added this to the stringi-0.3 milestone Oct 24, 2014

@gagolews gagolews closed this Oct 24, 2014
