Add support for openai tts api #354

tarasglek · 2024-01-21T00:02:13Z

As a warm-up for #310, we could add a TTS feature to ChatCraft.
OpenAI can do TTS using really high-quality voices: https://platform.openai.com/docs/guides/text-to-speech

We should add a "Speak" menu entry to each message so it can be spoken by these voices. Once the message has been spoken, it should also get a "Download speech" menu entry so we can download the generated audio.

TTS could also allow ChatCraft to respond by voice when we ask it questions using the voice feature.

This would be handy for recording voiceovers for demos on YouTube, presentations, etc.

tarasglek · 2024-01-21T00:03:38Z

We should also support browser-local voice-to-text (VTT) and TTS, à la https://developer.mozilla.org/en-US/docs/Web/API/Web_Speech_API

Unfortunately these are a bit limited in that they don't support working with raw sound :(

Amnish04 · 2024-01-21T04:40:33Z

Sounds interesting!

I tried to play with it a bit, but for some reason I can't access the method from the documentation.

Am I using the wrong object?

Amnish04 · 2024-01-21T05:20:59Z

Just found that the version of the `openai` package we are using does not support the `speech` property.

Upgrading to the latest version fixed it, and I can now listen to the generated audio.
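
A rough sketch of the HTTP request that sits behind the speech call, written without the `openai` package so it does not depend on a particular SDK version. The endpoint path, the `tts-1` model name, and the voice names come from the OpenAI text-to-speech guide linked above and may change; `buildSpeechRequest`, `SpeechRequest`, and `Voice` are hypothetical names for illustration, not ChatCraft code.

```typescript
// Voices listed in the OpenAI text-to-speech guide (assumption: the list may change).
type Voice = "alloy" | "echo" | "fable" | "onyx" | "nova" | "shimmer";

// Hypothetical shape: everything fetch() needs to call the speech endpoint.
interface SpeechRequest {
  url: string;
  init: {
    method: "POST";
    headers: Record<string, string>;
    body: string;
  };
}

// Build (but do not send) a text-to-speech request for one message.
function buildSpeechRequest(apiKey: string, text: string, voice: Voice = "alloy"): SpeechRequest {
  return {
    url: "https://api.openai.com/v1/audio/speech",
    init: {
      method: "POST",
      headers: {
        Authorization: `Bearer ${apiKey}`,
        "Content-Type": "application/json",
      },
      body: JSON.stringify({ model: "tts-1", input: text, voice }),
    },
  };
}

// Usage sketch (needs a real key and network access, so it is left commented out):
// const { url, init } = buildSpeechRequest(apiKey, "Hello from ChatCraft");
// const audio = await (await fetch(url, init)).blob(); // MP3 by default
```

Keeping the request builder separate from the network call makes it easy to check the payload without spending API credits, and the resulting blob could back both the "speak" and "download" actions.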

humphd · 2024-01-21T14:05:36Z

@Amnish04 nice, want to turn your investigation into a PR?

One thing we'd have to do here is make this aware of different providers/models (cc @kliu57). For example, if I'm using OpenRouter.ai vs. OpenAI for my provider and API Key, this won't work.

To start, maybe we only do this if you're using OpenAI as your provider?
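
A minimal sketch of that provider check, assuming the app stores the provider's API base URL as a string. `isOpenAiProvider` is a hypothetical helper, and comparing the hostname against `api.openai.com` is just one simple way to decide; it is not how ChatCraft necessarily models providers.

```typescript
// Hypothetical helper: true only when the configured API base URL points at OpenAI.
function isOpenAiProvider(apiUrl: string): boolean {
  try {
    return new URL(apiUrl).hostname === "api.openai.com";
  } catch {
    // Not a parseable URL, so treat it as an unknown provider.
    return false;
  }
}

// The "speak" menu entry could then be rendered only when this returns true, e.g.
// isOpenAiProvider("https://api.openai.com/v1") vs. isOpenAiProvider("https://openrouter.ai/api/v1")
```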

Another thing that would be cool is to allow users to set the voice to use as a setting: https://platform.openai.com/docs/guides/text-to-speech/voice-options

Also, it looks like we can stream it vs. waiting to download: https://platform.openai.com/docs/guides/text-to-speech

tarasglek · 2024-01-21T18:30:38Z

I think the model should show up in the list endpoint. We can use that to enable or disable this feature.
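
One way this could look, as a sketch: the models list endpoint returns entries with an `id` field, so the feature can be switched on only when a TTS model is among them. `supportsTts` and `ModelEntry` are hypothetical names, and matching on a `tts-` prefix is an assumption based on OpenAI's `tts-1` and `tts-1-hd` model ids.

```typescript
// Minimal shape of one entry from the models list endpoint (only `id` is used here).
interface ModelEntry {
  id: string;
}

// Hypothetical helper: enable the TTS UI only if the provider lists a TTS model.
// Assumption: TTS model ids start with "tts-", as with OpenAI's tts-1 and tts-1-hd.
function supportsTts(models: ModelEntry[]): boolean {
  return models.some((model) => model.id.startsWith("tts-"));
}
```

Compared with checking the provider name, this approach would keep working for any other provider that happens to expose a compatible TTS model.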

Love how quickly you got an experimental result!

humphd · 2024-01-22T01:01:13Z

> I think the model should show up in list endpoint.

Do you mean in the list of models we show the user, so you can "Ask" or "Retry" with the TTS models?

tarasglek · 2024-01-22T05:05:25Z

I did not mean that, but that's a good idea. Let's do that.

humphd assigned Amnish04 Jan 21, 2024

Amnish04 mentioned this issue Jan 22, 2024

Add Text to Speech Support #357

Merged

Amnish04 linked a pull request Jan 24, 2024 that will close this issue

Add Text to Speech Support #357

Merged

Amnish04 closed this as completed in #357 Jan 31, 2024
