Skip to content

自然语言控制使用格式 #1335

@vedbobo

Description

@vedbobo

官方是格式是:
for i, j in enumerate(cosyvoice.inference_instruct('在面对挑战时,他展现了非凡的<strong>勇气</strong>与<strong>智慧</strong>。', '中文男', 'Theo \'Crimson\', is a fiery, passionate rebel leader. Fights with fervor for justice, but struggles with impulsiveness.', stream=False)): torchaudio.save('instruct_{}.wav'.format(i), j['tts_speech'], cosyvoice.sample_rate)
但我有数据格式是:
<|zh|><|HAPPY|><|Speech|><|withitn|>在他讲述那个荒诞故事的过程中,他突然停下来 ,因为他自己也被逗笑了。
不能直接丢进去合成,会把这些标签也读出来了。要严格按照demo的来吗?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions