You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
第1条 この法人は、一般社団法人国際銀行協会(以下「本協会」という。)と称し、英文では、 International Bankers Association of Japanと記載する。
and the results are different when using the java version of kuromojin (with Ipadic dictionary) and the tokenizer provided by kuromoji.js. In particular, the following sequence 協会 is splitted in kuromoji.js.
I saw a closed issue (#16) stating this could due to the Viterbi version of the tokenizer. Is there a way to disable it ?
Many thanks in advance,
Best
The text was updated successfully, but these errors were encountered:
Hi,
I was trying to tokenize the following sentence :
第1条 この法人は、一般社団法人国際銀行協会(以下「本協会」という。)と称し、英文では、 International Bankers Association of Japanと記載する。
and the results are different when using the java version of kuromojin (with Ipadic dictionary) and the tokenizer provided by kuromoji.js. In particular, the following sequence 協会 is splitted in kuromoji.js.
I saw a closed issue (#16) stating this could due to the Viterbi version of the tokenizer. Is there a way to disable it ?
Many thanks in advance,
Best
The text was updated successfully, but these errors were encountered: