add info on speed improvements
ChrisMuir committed Mar 26, 2018
1 parent fa6e04b commit 7bfd13c
Showing 2 changed files with 6 additions and 2 deletions.
6 changes: 5 additions & 1 deletion NEWS.md
@@ -11,7 +11,11 @@ refinr 0.2.0.9000

* Fixed issue in which input strings that contained punctuation that was NOT surrounded by spaces was returning incorrect values ([#6](https://github.com/ChrisMuir/refinr/issues/6)).

-* Fixed issue in which the edit value assigned to a cluster was sometimes not the most frequent string in that cluster ([#5](https://github.com/ChrisMuir/refinr/issues/5)).
+* Fixed issue in which the edit value assigned to a cluster was sometimes not the most frequent string in that cluster ([#5](https://github.com/ChrisMuir/refinr/issues/5)).
+
+## NEW FEATURES
+
+* Rewrote some of the cpp functions to incorporate `tr1::unordered_map`, resulting in a substantial speed improvement when passing large character vectors (length 100,000+) to either of the exported functions ([#8](https://github.com/ChrisMuir/refinr/issues/8)).

refinr 0.2.0 (2018-01-10)
=========================
2 changes: 1 addition & 1 deletion README.md
@@ -119,4 +119,4 @@ Notes
- The advantages this package has over OpenRefine:
* Operations are fully automated.
* Facilitates a more reproducible workflow.
-* Seems to handle larger datasets better (1000000 - 5000000 observations).
+* Faster when working with large input data (character vectors of length 500000+).
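
Note on the NEWS.md entry: it attributes the speedup to `tr1::unordered_map`, a hash map with average constant-time lookups. The sketch below illustrates how such a map can count strings in one pass and pick the most frequent one in a cluster. It is not code from refinr: the function name `most_frequent` is hypothetical, and C++11 `std::unordered_map` is used as a stand-in for the TR1 container.

```cpp
#include <string>
#include <unordered_map>
#include <vector>

// Hypothetical helper (not part of refinr): return the most frequent string
// in `cluster`. Ties go to the string that appears first in the input.
// Returns an empty string for an empty cluster.
std::string most_frequent(const std::vector<std::string>& cluster) {
    // Assumption: std::unordered_map stands in for tr1::unordered_map.
    std::unordered_map<std::string, int> counts;
    counts.reserve(cluster.size());

    // One pass to count occurrences; each update is O(1) on average.
    for (const std::string& s : cluster) {
        ++counts[s];
    }

    // Second pass over the input (not the map) so tie-breaking is
    // deterministic regardless of hash ordering.
    std::string best;
    int best_count = 0;
    for (const std::string& s : cluster) {
        const int c = counts[s];
        if (c > best_count) {
            best = s;
            best_count = c;
        }
    }
    return best;
}
```

For a cluster holding "Acme Inc", "ACME INC", "Acme Inc" and "acme inc.", this returns "Acme Inc". Counting with a hash map keeps the work roughly linear in the number of strings, which is the kind of scaling that matters for the 100,000+ element vectors mentioned in the entry.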
