Skip to content

refinr 0.3.0

Compare
Choose a tag to compare
@ChrisMuir ChrisMuir released this 05 May 13:48
· 38 commits to master since this release

NEW FEATURES

  • Rewrote some of the cpp functions to incorporate std::unordered_map(), resulting in a substantial speed improvement when passing large character vectors (length 100,000+) to either of the exported functions (#8).

PKG API CHANGES

  • In function n_gram_merge(), renamed arg edit_dist_weights to weight. The only purpose of this arg is to be passed along to function stringdistmatrix from the stringdist package (which uses the name weight, so this change is simply to match that).

BUG FIXES

  • Fixed issue in which input strings that contained accent marks were not being properly handled/clustered (#9). The fix involved adding stringi to Imports and using stringi::stri_trans_general().

  • Fixed issue in n_gram_merge() in which incorrect values were being return when input arg ignore_strings was not NULL, and arg bus_suffix = FALSE (#7).

  • Fixed issue in which input strings that contained punctuation that was NOT surrounded by spaces was returning incorrect values (#6).

  • Fixed issue in which the edit value assigned to a cluster was sometimes not the most frequent string in that cluster (#5).