neologdn is a Japanese text normalizer for mecab-neologd.
Normalization follows the rules recommended by mecab-ipadic-neologd: https://github.com/neologd/mecab-ipadic-neologd/wiki/Regexp.ja
Contributions are welcome!
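import io.github.ikegamiyukino.neologdn.NeologdNormalizer;

NeologdNormalizer normalizer = new NeologdNormalizer();
String text = " PRML 副 読 本 ";
// normalize() returns the normalized string; the input is left unchanged.
String normalized = normalizer.normalize(text);
// normalized => "PRML副読本"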
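To give a feel for what this kind of normalization involves, here is a minimal, standalone sketch of two of the rules: width folding and whitespace cleanup around Japanese characters. It is not the library's implementation. The class name `NormalizeSketch` is hypothetical, NFKC is used as a rough stand-in for the width-conversion rules (the real rules are more selective about symbols), and "Japanese character" is approximated as hiragana, katakana, Han ideographs, and the prolonged sound mark.

```java
import java.text.Normalizer;
import java.util.regex.Pattern;

// Hypothetical illustration only; not part of neologdn.
public class NormalizeSketch {
    // Approximation of "Japanese character": hiragana, katakana, Han, and U+30FC (prolonged sound mark).
    private static final String JA = "[\\p{IsHiragana}\\p{IsKatakana}\\p{IsHan}\\u30FC]";

    // A single space that directly follows or directly precedes a Japanese character.
    private static final Pattern SPACE_NEXT_TO_JA =
            Pattern.compile("(?<=" + JA + ") | (?=" + JA + ")");

    private static final Pattern SPACE_RUN = Pattern.compile(" +");

    public static String normalize(String text) {
        // 1. Fold full-width alphanumerics to half-width and half-width katakana
        //    to full-width. NFKC also maps the ideographic space to U+0020.
        String s = Normalizer.normalize(text, Normalizer.Form.NFKC);
        // 2. Collapse runs of spaces into one space.
        s = SPACE_RUN.matcher(s).replaceAll(" ");
        // 3. Drop spaces adjacent to Japanese characters; spaces between
        //    Latin words are kept.
        s = SPACE_NEXT_TO_JA.matcher(s).replaceAll("");
        // 4. Trim leading and trailing spaces.
        return s.trim();
    }
}
```

Under these assumptions, `NormalizeSketch.normalize(" PRML 副 読 本 ")` yields `"PRML副読本"`, matching the usage example, while `"Hello World"` keeps its space. For real use, prefer `NeologdNormalizer`, which covers the full rule set.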
Licensed under the Apache Software License.