Skip to content

Commit

Permalink
将<>视作分隔符 fix https://bbs.hankcs.com/t/topic/4527
Browse files Browse the repository at this point in the history
  • Loading branch information
hankcs committed Feb 27, 2022
1 parent 8e750ee commit 4b43124
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion src/main/java/com/hankcs/hanlp/utility/TextUtility.java
Expand Up @@ -50,7 +50,7 @@ public static int charType(String str)
if (ub1 < 128)
{
if (ub1 <= 32) return CT_OTHER; // NON PRINTABLE CHARACTERS
if ("*\"!,.?()[]{}+=/\\;:|".indexOf((char) b1) != -1)
if ("*\"!,.?()<>[]{}+=/\\;:|".indexOf((char) b1) != -1)
return CT_DELIMITER;
if ("0123456789".indexOf((char)b1) != -1)
return CT_NUM;
Expand Down

1 comment on commit 4b43124

@hanlpbot
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This commit has been mentioned on Butterfly Effect. There might be relevant details there:

https://bbs.hankcs.com/t/topic/4527/2

Please sign in to comment.