-
Notifications
You must be signed in to change notification settings - Fork 2k
Open
Description
官方是格式是:
for i, j in enumerate(cosyvoice.inference_instruct('在面对挑战时,他展现了非凡的<strong>勇气</strong>与<strong>智慧</strong>。', '中文男', 'Theo \'Crimson\', is a fiery, passionate rebel leader. Fights with fervor for justice, but struggles with impulsiveness.', stream=False)): torchaudio.save('instruct_{}.wav'.format(i), j['tts_speech'], cosyvoice.sample_rate)
但我有数据格式是:
<|zh|><|HAPPY|><|Speech|><|withitn|>在他讲述那个荒诞故事的过程中,他突然停下来 ,因为他自己也被逗笑了。
不能直接丢进去合成,会把这些标签也读出来了。要严格按照demo的来吗?
Metadata
Metadata
Assignees
Labels
No labels