Skip to content

Implement Speech-to-Text Functionality in Chat Input

Choose a tag to compare

@damianvtran damianvtran released this 02 Jun 22:30
· 247 commits to main since this release
72a3fb0

What's Changed

This release introduces speech-to-text capabilities into the chat message input, allowing users to record audio and have it transcribed into text. This feature enhances accessibility and user convenience.

  • What does this change address? This is a new feature that provides an alternative input method for users, improving the overall user experience.
  • What are the key improvements or modifications?
    • Audio Recording: Added a microphone button to the chat input to start and stop audio recording.
    • Visual Indicators: Implemented AudioRecordingIndicator with a pulsing dot and real-time waveform visualization during recording.
    • Transcription Processing: Introduced WaveformAnimation for visual feedback while audio is being transcribed.
    • Transcription API Integration: Integrated a new TranscriptionApi to send recorded audio for transcription and receive the text output.
    • UI State Management: Updated MessageInput to manage recording and transcribing states, disabling other inputs when active.

Impact

  • Does this change introduce any breaking changes? No breaking changes.
  • Are there any dependency updates? No new dependencies.
  • Are there any performance or security implications? Microphone access requires user permission; audio processing is handled efficiently.

PRs

  • feat: Implement Speech-to-Text Functionality in Chat Input by @damianvtran in #64

Full Changelog: v0.10.6...v0.10.7