Whisper transcription, ElevenLabs speech, error handling by preritdas · Pull Request #33 · preritdas/jeeves

preritdas · 2023-04-20T19:54:08Z

Changes:

Added a new folder voice_tools with two new files: transcribe.py and speak.py.
- transcribe.py contains the transcribe_twilio_recording function to transcribe Twilio recordings using OpenAI's Whisper API.
- speak.py contains the speak_jeeves function to generate audio files using the ElevenLabs API.
Modified .gitignore to ignore the voice_cache folder.
Updated api/__init__.py to include the tags as a list instead of a string.
Made several changes to api/voice_inbound.py:
- Replaced the usage of the <Gather> Twilio verb with <Record> to collect user's speech input as a recording for transcription.
- Added new functions: extract_base_url, speak, serve_audio_file, _process_speech_update_call, and updated process_speech_update_call, incoming_call, and process_speech.
- Adjusted routing and error handling.
Updated requirements.txt with new dependencies for ElevenLabs and re-ordered some existing dependencies.

github-actions · 2023-04-20T19:57:05Z

A preview of this pull request has been deployed (or updated) at https://pr-preview-33---personal-api-62o2bh23rq-uc.a.run.app.

preritdas added 14 commits April 20, 2023 18:23

Transcribe user speech with Whisper.

098a596

Put transcribe in a voice_tools module.

c678f99

Require ElevenLabs and speak.

bcca510

Speak with Jeeves and write to file.

96225cd

Expose speak module.

6427914

Serve audio files and use base url.

3e7788d

Fix tagging.

70ff257

Hide voice_cache.

c0eee89

Error and empty query handling.

b444e08

No more 1600 character limit as we're playing audio.

9445f0c

Voice ID in keys.

546ce2f

Remove print debug.

64e0b73

Retry Whisper once if it failed.

444b7b9

Update coverage badge post unit tests.

a781214

preritdas merged commit 8806f5b into master Apr 20, 2023

preritdas deleted the voice-quality branch April 20, 2023 19:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Whisper transcription, ElevenLabs speech, error handling#33

Whisper transcription, ElevenLabs speech, error handling#33
preritdas merged 14 commits intomasterfrom
voice-quality

preritdas commented Apr 20, 2023 •

edited

Loading

Uh oh!

github-actions bot commented Apr 20, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

preritdas commented Apr 20, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes:

Uh oh!

github-actions bot commented Apr 20, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

preritdas commented Apr 20, 2023 •

edited

Loading