-
Notifications
You must be signed in to change notification settings - Fork 23
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bad case caused by no-consonant char #11
Comments
@marina-danilevsky would you please take a look at it and possibly answer it? I translated only the trained model, the algorithm part is a black box to me. |
I'm sorry, I'm not at all sure how to answer this (I do not know Chinese, and my co-author who does, and did some of this implementation originally, left the company some time ago). Could you possibly be more specific? I can't quite tell if this is a bug in the mapping generally, or something you're observing with specific input. |
|
修改dimsim中utils/pinyin.py pinyinRewrite方法中58和64行,即注释掉# self.consonant = ""即可 |
In your trained consonantMap_TwoDCode, no-consontant mappings to (99999.0,99999.0), which causes some chars like "我", "一" are not similar with any char with consonant(e.g. "过", "鸡"). Does this make sense?
The text was updated successfully, but these errors were encountered: