We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
想问一下pinyin pro的分词是怎么实现的,分词进行标注,避免错误的多音字。 刚好一个项目用到了分词的功能
The text was updated successfully, but these errors were encountered:
首先分词肯定要有一套词库,然后我基于词库初始化实现了一个 AC自动机,从性能上讲 AC自动机应该是相对简单且高效的多词匹配算法,如果觉得实现有难度的话,就从头对词库的词遍历匹配也行,只不过效率和准确率相对低一点
Sorry, something went wrong.
No branches or pull requests
想问一下pinyin pro的分词是怎么实现的,分词进行标注,避免错误的多音字。
刚好一个项目用到了分词的功能
The text was updated successfully, but these errors were encountered: