Opencc4j is an opensource project for conversion between Traditional Chinese and Simplified Chinese, supporting character-level conversion, phrase-level conversion for java.
-
Strict distinction between "a simple and complex" and "a simple and diverse".
-
Fully compatible with different characters, you can achieve dynamic replacement.
-
Strict review of a simple and more complicated entries, the principle of "can be divided but not consistent."
-
Thesaurus and function library completely separated, you can freely modify, import, expand.
-
Compatible with Windows, Linux, Mac platform.
- OpenCC
OpenCC is an awesome project, but not have direct support jar for java.
- jopencc
jopencc has no word segmentation provided.
<dependency>
<groupId>com.github.houbb</groupId>
<artifactId>opencc4j</artifactId>
<version>1.0.2</version>
</dependency>
String original = "生命不息,奮鬥不止";
String result = ZhConverterUtil.convertToSimple(original);
the result is:
生命不息,奋斗不止
String original = "生命不息,奋斗不止";
String result = ZhConverterUtil.convertToTraditional(original);
the result is:
生命不息,奮鬥不止
OpenCC support the original data.
jieba-analysis support the Chinese word segmentation
Issues & Bugs, welcome to provide valuable suggestions.