How can I teach whisper new words? #963
-
Hi! I am working on a project that will use Whisper to transcribe recordings of my Dungeons and Dragons games. Since we use a lot of made-up words and weird names, I'd love to include a solution for training them. For example, we often reference NPC names like Zerthimon, Vlaakith, and Mordenkainen as well as place names like Agni'hotri, Tu'narath, and Niam'd'regal. I understand that I would need to collect labeled samples of these words. But once I have that done, how can I feed that data into Whisper? (Preferably in a way that allows me to store the customization in a separate file and doesn't require retraining every time the app is loaded.) Any help you guys can provide would be greatly appreciated. I'm not expecting you to write a full tutorial, but if you could point me in the right direction I'm sure I could figure it out.
Replies: 1 comment 4 replies
-
You could make it work to some degree by prompting. If you specify
--initial_prompt "The following is a conversation during a Dungeons and Dragons game, which includes NPC names like Zerthimon, Vlaakith, and Mordenkainen as well as place names like Agni'hotri, Tu'narath, and Niam'd'regal."
then the model will have a slightly better chance of spelling those words correctly. If that doesn't work very reliably, the next bet is to fine-tune a Whisper model with your own examples. #364 has some pointers on how to do this.
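If you are calling Whisper from Python rather than the command line, the same prompting idea might look roughly like the sketch below. It keeps the custom words in a separate JSON file so nothing has to be retrained when the app loads. The file layout (`names` and `places` keys) and the helper names `build_initial_prompt` and `transcribe_with_vocabulary` are made up for illustration; only `whisper.load_model` and the `initial_prompt` argument of `transcribe` come from the `openai-whisper` package. Whisper keeps only a bounded number of prompt tokens, so a very long word list may get cut off.

```python
import json
from pathlib import Path


def build_initial_prompt(names, places):
    """Build one prompt string that mentions every custom word once.

    Hypothetical helper: the wording of the prompt is an assumption,
    modeled on the --initial_prompt example above.
    """
    parts = ["The following is a conversation during a Dungeons and Dragons game."]
    if names:
        parts.append("NPC names: " + ", ".join(names) + ".")
    if places:
        parts.append("Place names: " + ", ".join(places) + ".")
    return " ".join(parts)


def transcribe_with_vocabulary(audio_path, vocab_path, model_name="base"):
    """Transcribe audio, biasing spelling toward words listed in a JSON file.

    Assumes vocab_path points to a file such as:
        {"names": ["Zerthimon", "Vlaakith"], "places": ["Tu'narath"]}
    """
    # Imported lazily so the prompt helper works without Whisper installed.
    import whisper  # pip install openai-whisper

    vocab = json.loads(Path(vocab_path).read_text(encoding="utf-8"))
    prompt = build_initial_prompt(vocab.get("names", []), vocab.get("places", []))

    model = whisper.load_model(model_name)
    result = model.transcribe(str(audio_path), initial_prompt=prompt)
    return result["text"]
```

Calling `transcribe_with_vocabulary("session01.mp3", "vocab.json")` (both file names are placeholders) would return the transcript text. As with the command-line flag, this only nudges the spelling; it does not teach the model new words.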