feat: [whisper] Partial support for verbose_json format in transcribe endpoint #721

ldotlopez · 2023-07-04T08:28:23Z

Extend the current output to be compatible with the verbose_json format referenced in the OpenAI docs.

This PR adds an extra field to the response, 'segments', which contains the transcribed text divided into time segments.
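For illustration only, here is a minimal sketch of how a client might read such a response. The sample payload is hand-written: the keys `text`, `segments`, `id`, `start`, `end` follow the OpenAI description of verbose_json, while the values are invented and the exact fields and units emitted by this PR are an assumption, not something confirmed here. `extract_segments` is a hypothetical helper name.

```python
import json

# Hypothetical payload: key names follow OpenAI's verbose_json description,
# values are made up for illustration.
SAMPLE_RESPONSE = """
{
  "text": "Hello world. This is a test.",
  "segments": [
    {"id": 0, "start": 0.0, "end": 1.5, "text": "Hello world."},
    {"id": 1, "start": 1.5, "end": 3.0, "text": "This is a test."}
  ]
}
"""


def extract_segments(raw: str) -> list[tuple[float, float, str]]:
    """Return (start, end, text) tuples; an empty list if 'segments' is absent."""
    payload = json.loads(raw)
    return [
        (float(seg["start"]), float(seg["end"]), seg["text"].strip())
        for seg in payload.get("segments", [])
    ]


segments = extract_segments(SAMPLE_RESPONSE)
for start, end, text in segments:
    print(f"[{start:.2f} - {end:.2f}] {text}")
```

Using `payload.get("segments", [])` keeps the client working against servers that return only the plain `text` field.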

These changes are useful for tools that expect segmented text, such as automatic subtitle generators or Buzz.
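As a sketch of the subtitle use case, the snippet below turns a list of segments into SRT text. It assumes each segment is a dict with `start` and `end` expressed in seconds plus a `text` key, as in the OpenAI format; if the server reports another unit, convert before calling. `format_timestamp` and `segments_to_srt` are hypothetical helper names, not part of this PR or of Buzz.

```python
def format_timestamp(seconds: float) -> str:
    """Format a time offset in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def segments_to_srt(segments: list[dict]) -> str:
    """Build SRT text from segments; assumes 'start'/'end' are in seconds."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = format_timestamp(seg["start"])
        end = format_timestamp(seg["end"])
        blocks.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}")
    # SRT cues are separated by a blank line; return "" when there are none.
    return "\n\n".join(blocks) + "\n" if blocks else ""


print(segments_to_srt([
    {"start": 0.0, "end": 1.5, "text": " Hello world."},
    {"start": 1.5, "end": 3.0, "text": " This is a test."},
]))
```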

Although support is not complete, these changes are enough to get Buzz working. More work is needed to fully support the verbose_json format.

See: https://platform.openai.com/docs/api-reference/audio/create#audio/create-response_format

mudler

Looks good, thanks. It also passes the tests that exercise whisper end to end, so no regressions either.

Partial support for verbose_json format in transcribe endpoint

127e162

See: https://platform.openai.com/docs/api-reference/audio/create#audio/create-response_format

ldotlopez mentioned this pull request Jul 4, 2023

Customizable api_base for OpenAI chidiwilliams/buzz#521

Open

mudler changed the title ~~[whisper] Partial support for verbose_json format in transcribe endpoint~~ feat: [whisper] Partial support for verbose_json format in transcribe endpoint Jul 4, 2023

mudler approved these changes Jul 4, 2023

mudler merged commit a6839fd into mudler:master Jul 4, 2023
8 checks passed

mudler added the enhancement New feature or request label Jul 16, 2023

renovate bot mentioned this pull request Nov 13, 2023

feat(container): update image quay.io/go-skynet/local-ai to v1.40.0 - autoclosed lenaxia/home-ops-dev#169

Closed

1 task
