Skip to content

wklken/pinyin.py

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

pinyin.py

汉字转拼音,With Python

Example:

from pinyin import PinYin

test = PinYin()
test.load_word()

def run_test(string):
    print
    print "in: %s" % string
    print 'convert just Hanzi:'
    print "out: %s" % ''.join(test.hanzi2pinyin(string=string))
    print "out: %s" % ''.join(test.hanzi2pinyin(string=string, split="-"))
    print "out: %s" % test.hanzi2pyacronym(string=string)

    print 'convert all:'
    print "out: %s" % ''.join(test.all2pinyin(string=string))
    print "out: %s" % ''.join(test.all2pinyin(string=string, split="-"))
    print "out: %s" % test.all2pyacronym(string=string)
    print '-' * 20


#简单字符串
string = u'Chrome浏览器版本29'
run_test(string)

#补充例子,复杂字符串
string = u'植物大战僵尸2:奇妙时空之旅(Plants vs. Zombies 2: Its About Time)'
run_test(string)

Out:

in: Chrome浏览器版本29
convert just Hanzi:
out: liulanqibanben
out: ------liu-lan-qi-ban-ben--
out: llqbb
convert all:
out: Chromeliulanqibanben29
out: C-h-r-o-m-e-liu-lan-qi-ban-ben-2-9
out: Chromellqbb29
--------------------

in: 植物大战僵尸2:奇妙时空之旅(Plants vs. Zombies 2: Its About Time)
convert just Hanzi:
out: zhiwudazhanjiangshiqimiaoshikongzhilv
out: zhi-wu-da-zhan-jiang-shi---qi-miao-shi-kong-zhi-lv-------- ---- -------- --- ---- ------ -----
out: zwdzjsqmskzl
convert all:
out: zhiwudazhanjiangshi2:qimiaoshikongzhilv(Plants vs. Zombies 2: Its About Time)
out: zhi-wu-da-zhan-jiang-shi-2-:-qi-miao-shi-kong-zhi-lv-(-P-l-a-n-t-s- -v-s-.- -Z-o-m-b-i-e-s- -2-:- -I-t-s- -A-b-o-u-t- -T-i-m-e-)
out: zwdzjs2:qmskzl(Plants vs. Zombies 2: Its About Time)
--------------------

Update:

2013-09-26  wklken
            1.[del] split合并到同一个方法中,作为split参数判断处理
            2.[add] all2pinyin方法,使得模块可以处理返回含有英文,数字,标点符号的字符串
            3.[add] acronym方法,返回拼音首字母字符串
            4.[fix] 修复原有方法传入含空格字符串异常的问题

About

汉字转拼音,With Python

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%