Skip to content

Whisper transcription, ElevenLabs speech, error handling#33

Merged
preritdas merged 14 commits intomasterfrom
voice-quality
Apr 20, 2023
Merged

Whisper transcription, ElevenLabs speech, error handling#33
preritdas merged 14 commits intomasterfrom
voice-quality

Conversation

@preritdas
Copy link
Owner

@preritdas preritdas commented Apr 20, 2023

Changes:

  • Added a new folder voice_tools with two new files: transcribe.py and speak.py.
    • transcribe.py contains the transcribe_twilio_recording function to transcribe Twilio recordings using OpenAI's Whisper API.
    • speak.py contains the speak_jeeves function to generate audio files using the ElevenLabs API.
  • Modified .gitignore to ignore the voice_cache folder.
  • Updated api/__init__.py to include the tags as a list instead of a string.
  • Made several changes to api/voice_inbound.py:
    • Replaced the usage of the <Gather> Twilio verb with <Record> to collect user's speech input as a recording for transcription.
    • Added new functions: extract_base_url, speak, serve_audio_file, _process_speech_update_call, and updated process_speech_update_call, incoming_call, and process_speech.
    • Adjusted routing and error handling.
  • Updated requirements.txt with new dependencies for ElevenLabs and re-ordered some existing dependencies.

@preritdas preritdas merged commit 8806f5b into master Apr 20, 2023
@preritdas preritdas deleted the voice-quality branch April 20, 2023 19:56
@github-actions
Copy link

A preview of this pull request has been deployed (or updated) at https://pr-preview-33---personal-api-62o2bh23rq-uc.a.run.app.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant