Add support for list of bigrams by khxu · Pull Request #23 · words/dice-coefficient

khxu · 2021-11-06T18:11:32Z

This PR implements #22: it skips "bigram-ifying" an input that is already a list of bigrams, which it detects by checking whether the input is an array.

Used nested ternaries for the logic -- would understand if you'd prefer not having those, though.
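For readers skimming the thread, here is a minimal sketch of the idea described above, not the code from this PR. The names `toBigrams` and `diceCoefficientSketch` are hypothetical, the bigram helper is a stand-in for `bigram` from `n-gram`, and returning `0` for two empty inputs is an assumption of the sketch.

```javascript
// Hypothetical stand-in for `bigram` from `n-gram`:
// split a value into overlapping two-character slices.
function toBigrams(value) {
  const text = String(value).toLowerCase()
  const pairs = []
  for (let index = 0; index < text.length - 1; index++) {
    pairs.push(text.slice(index, index + 2))
  }
  return pairs
}

// Sketch: arrays are treated as ready-made bigram lists and used as-is,
// anything else is cast to a string and bigram-ified first.
function diceCoefficientSketch(value, alternative) {
  const left = Array.isArray(value) ? value : toBigrams(value)
  const right = Array.isArray(alternative) ? alternative : toBigrams(alternative)
  const remaining = right.slice() // copy, so the caller's array is not mutated
  let intersections = 0

  for (const pair of left) {
    const position = remaining.indexOf(pair)
    if (position !== -1) {
      intersections++
      remaining.splice(position, 1) // each bigram on the right matches once
    }
  }

  const total = left.length + right.length
  // Assumption of this sketch: two empty inputs score 0 instead of NaN.
  return total === 0 ? 0 : (2 * intersections) / total
}
```

Called as `diceCoefficientSketch('abc', ['ab', 'bc'])`, the string is bigram-ified and the array is taken as given, so the two sides can be mixed freely.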

wooorm · 2021-11-06T18:53:50Z

  var value_ = String(value).toLowerCase()
  var alt = String(alternative).toLowerCase()


I’m quite sure this is making your later code never run, though?
It does two things: cast to string, and lowercase.
The casting could be done only when the value isn’t a string.
The lowercasing could be done for each bigram, maybe?

Thanks for the quick review! Ah yeah, there is a bug, but for a different reason: the later code does run, because left and right are assigned from the original input params (value and alternative) rather than from value_ and alt. However, that means bigram inputs aren't handled case-insensitively. Made another commit that fixes the case sensitivity, but it's probably still not optimal from a readability perspective. Will try to come up with a refactoring that is more pleasant to read.

Okay I think I've got a more readable solution committed.
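A rough sketch of the normalization discussed in this thread, assuming a hypothetical helper named `normalizeInput` (this is not the committed code): strings are cast and lowercased before being split, while arrays are left as arrays and lowercased per bigram.

```javascript
// Hypothetical helper, for illustration only.
function normalizeInput(value) {
  if (Array.isArray(value)) {
    // Already a list of bigrams: lowercase each entry, do not re-split.
    return value.map((pair) => String(pair).toLowerCase())
  }

  // Anything else: cast to string, lowercase, then split into bigrams.
  const text = String(value).toLowerCase()
  const pairs = []
  for (let index = 0; index < text.length - 1; index++) {
    pairs.push(text.slice(index, index + 2))
  }
  return pairs
}
```

Branching before the cast matters because `String(['ab', 'bc'])` yields the comma-joined string `'ab,bc'`, which would then be split into bigrams that include commas.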

wooorm · 2021-11-06T18:54:54Z

+
+// bigrams may also be passed as input arguments for improved efficiency
+// when analyzing the same strings repeatedly, for example, when
+// comparing the text of each file in a directory with the text of
+// each file in another directory.
+
+import {bigram} from 'n-gram'
+
+const bigramifiedString1 = bigram('abc') // ['ab', 'bc']
+const bigramifiedString2 = bigram('xyz') // ['xy', 'yz']
+
+diceCoefficient(bigramifiedString1, bigramifiedString2) // => 0


I don’t think this needs to be in the Use section, but it should probably be mentioned in the API section that arrays of strings are allowed, with a note that they should be bigrams?

Moved the explanation to another code section, if that works.

wooorm

Some prose suggestions. Rest looks good!

Co-authored-by: Titus <tituswormer@gmail.com>

wooorm · 2021-11-10T09:19:21Z

released, thanks!

khxu added 3 commits November 6, 2021 11:07

check if input arg is an array

6a5ee33

add tests

8ff8568

mention bigram input in readme

8515d37


wooorm reviewed Nov 6, 2021


khxu added 3 commits November 6, 2021 14:56

make bigram input case insensitive

0411407

separate bigram input explanation in API section

0920fa6

refactor for readability

41a3626

wooorm reviewed Nov 9, 2021


Comment thread readme.md Outdated

Comment thread readme.md Outdated

Comment thread readme.md Outdated

khxu and others added 3 commits November 9, 2021 08:01

Update readme.md

08d0bc0

Co-authored-by: Titus <tituswormer@gmail.com>

Update readme.md

7d167f4

Co-authored-by: Titus <tituswormer@gmail.com>

Update readme.md

c09b4a2

Co-authored-by: Titus <tituswormer@gmail.com>

wooorm approved these changes Nov 10, 2021


wooorm changed the title ~~Allow bigram as input~~ Add support for list of bigrams Nov 10, 2021

wooorm merged commit da9b223 into words:main Nov 10, 2021

wooorm mentioned this pull request Nov 10, 2021

Avoid "bigram-ifying" if an input is an array? #22

Closed

wooorm merged 9 commits into words:main from khxu:allow-bigram-as-input


3 participants
