Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

"鸟事“错误转换为”niao sh" #159

Closed
ledao opened this Issue Mar 28, 2019 · 7 comments

Comments

Projects
None yet
2 participants
@ledao
Copy link

commented Mar 28, 2019

运行环境

  • 操作系统(Linux/macOS/Windows):win10
  • Python 版本:3.6.5
  • pypinyin 版本:0.33.2

问题描述

"鸟事“错误转换为”niao sh"

问题复现步骤

from pypinyin import lazy_pinyin
lazy_pinyin("鸟事")
==>['niao', 'sh']

@mozillazg

This comment has been minimized.

Copy link
Owner

commented Mar 29, 2019

@ledao 感谢反馈。待我有空的时候修复一下。

@ledao

This comment has been minimized.

Copy link
Author

commented Mar 29, 2019

先别关闭issue,我再测试一下,把有错误的都贴上来

@ledao

This comment has been minimized.

Copy link
Author

commented Mar 29, 2019

发现新的错误,如下所示:

lazy_pinyin("虮虱相吊")
['ji', 'shi', 'xieng', 'diao']

lazy_pinyin("别鹤离鸾")
['bie', 'he', 'li', 'laun']

lazy_pinyin("年华垂暮")
['nian', 'hua', 'thui', 'mu']

lazy_pinyin("本枝百世")
['ben', 'zhi', 'boi', 'shi']

lazy_pinyin("操戈同室")
['cao', 'ge', 'toon', 'shi']

lazy_pinyin("丢魂丧胆")
['diu1', 'hun', 'sang', 'dan']

@mozillazg

This comment has been minimized.

Copy link
Owner

commented Mar 30, 2019

@ledao 感谢测试。冒昧问一下,是否方便透露你测试时使用比对数据源,我看看能否通过这个数据源找到更多的异常拼音。

mozillazg added a commit to mozillazg/phrase-pinyin-data that referenced this issue Mar 30, 2019

纠正一批词语的拼音
* `鸟事`
* `虮虱相吊`
* `别鹤离鸾`
* `年华垂暮`
* `本枝百世`
* `操戈同室`
* 部分词语中 `丢` 的拼音

mozillazg/python-pinyin#159

mozillazg added a commit that referenced this issue Mar 31, 2019

使用 phrase-pinyin-data v0.9.1 的词语拼音数据 ref #159
使用 pinyin-data v0.7.0 的拼音数据
@ledao

This comment has been minimized.

Copy link
Author

commented Apr 3, 2019

抱歉,这个语料无法共享~我们数据源已经跑完了,所有错误都已列出。

@mozillazg

This comment has been minimized.

Copy link
Owner

commented Apr 3, 2019

@ledao 抱歉,是我冒昧了才是。感谢分享你们发现的错误。我周末会发布一个版本修复一下这个 issue 中提到的问题。

@mozillazg

This comment has been minimized.

Copy link
Owner

commented Apr 6, 2019

@ledao 最新版已修复这个 issue 提到的所有有问题的拼音。

@mozillazg mozillazg closed this Apr 6, 2019

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
You can’t perform that action at this time.