You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Test case:
ac = new AhoCorasick(['.com.au','.com']);
ac.search("www.yahoo.com")
console> [ ]
If you change the order then it works fine:
ac = new AhoCorasick(['.com','.com.au'])
ac.search("www.yahoo.com")
console> ... 12, ... ".com" ...
Diagnostically, I see that the "output" part of the AhoCorasick object wrongly lists ".com" at the same final node as ".com.au" when the shorter term is put after the longer one:
The text was updated successfully, but these errors were encountered:
HappMacDonald
changed the title
Appears to sometimes handle stemming badly when long matches precede short ones.
Appears to sometimes handle stemming badly when long matches precede short ones in keyword list.
Nov 8, 2018
Test case:
ac = new AhoCorasick(['.com.au','.com']);
ac.search("www.yahoo.com")
console> [ ]
If you change the order then it works fine:
ac = new AhoCorasick(['.com','.com.au'])
ac.search("www.yahoo.com")
console> ... 12, ... ".com" ...
Diagnostically, I see that the "output" part of the AhoCorasick object wrongly lists ".com" at the same final node as ".com.au" when the shorter term is put after the longer one:
console> ...output:
1: []
2: []
3: []
4: []
5: []
6: []
7: (2) [".com.au", ".com"]
The text was updated successfully, but these errors were encountered: