Rejig dictation by DOsinga · Pull Request #6844 · block/goose

DOsinga · 2026-01-30T17:26:40Z

Summary

I was going to look at making whisper run locally, but noticed that the current flows are, eh, a mess, so cleaning that up first.

Copilot

Pull request overview

This PR refactors the voice dictation feature to simplify the codebase before implementing local whisper support. The changes consolidate multiple scattered files and components into a unified architecture, replacing localStorage-based settings with backend configuration storage.

Changes:

- Consolidated audio transcription endpoints from /audio/* to /dictation/* with unified provider handling
- Replaced localStorage settings persistence with backend config API (voice_dictation_provider)
- Simplified UI components by merging 5 separate dictation components into a single DictationSettings component
- Removed the VOICE_DICTATION_ELEVENLABS_ENABLED feature flag
- Replaced complex waveform visualizer with simple "Recording..." indicator
- Removed 378 lines from useWhisper.ts and replaced with 249-line simplified useAudioRecorder.ts

Reviewed changes

Copilot reviewed 21 out of 21 changed files in this pull request and generated 3 comments.

File	Description
ui/desktop/src/updates.ts	Removed ElevenLabs feature flag
ui/desktop/src/hooks/useWhisper.ts	Deleted, replaced by useAudioRecorder.ts
ui/desktop/src/hooks/useDictationSettings.ts	Deleted, settings now in backend config
ui/desktop/src/hooks/dictationConstants.ts	Deleted, constants moved to backend
ui/desktop/src/hooks/useAudioRecorder.ts	New simplified audio recording hook using backend API
ui/desktop/src/components/settings/dictation/*	Five component files deleted, replaced with single DictationSettings.tsx
ui/desktop/src/components/settings/dictation/DictationSettings.tsx	New unified settings component with provider selection and API key management
ui/desktop/src/components/settings/chat/ChatSettingsSection.tsx	Updated imports and reorganized settings layout
ui/desktop/src/components/ChatInput.tsx	Updated to use new hook, removed WaveformVisualizer, simplified recording UI
ui/desktop/src/api/*	Generated types and SDK methods for new dictation endpoints
ui/desktop/openapi.json	Added new dictation endpoint schemas
crates/goose-server/src/routes/mod.rs	Replaced audio module with dictation module
crates/goose-server/src/routes/dictation.rs	New unified backend handling both OpenAI and ElevenLabs with provider-agnostic API
crates/goose-server/src/routes/audio.rs	Deleted old implementation
crates/goose-server/src/openapi.rs	Updated OpenAPI schema definitions

Copilot · 2026-01-30T17:30:00Z

ui/desktop/src/hooks/useAudioRecorder.ts

+        audio: {
+          echoCancellation: true,
+          noiseSuppression: true,
+          autoGainControl: true,


The sampleRate: 44100 constraint was removed from the audio configuration. While the browser will use its default sample rate, this could result in inconsistent audio quality across different browsers and devices. Consider keeping an explicit sample rate to ensure consistent transcription quality.

Suggested change

-          autoGainControl: true,
+          autoGainControl: true,
+          sampleRate: 44100,
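
For illustration, the sample rate can also be spelled as an explicit `ideal` preference, which is standard Media Capture constraint syntax; the browser may still pick a different rate. This is a minimal sketch: the helper `buildAudioConstraints` and the local `AudioConstraintsSketch` type are hypothetical and not taken from useAudioRecorder.ts.

```typescript
// Hypothetical helper, not part of the PR.
// Minimal local type so the sketch compiles without DOM typings.
interface AudioConstraintsSketch {
  echoCancellation: boolean;
  noiseSuppression: boolean;
  autoGainControl: boolean;
  sampleRate?: { ideal: number };
}

function buildAudioConstraints(idealSampleRate?: number): AudioConstraintsSketch {
  const constraints: AudioConstraintsSketch = {
    echoCancellation: true,
    noiseSuppression: true,
    autoGainControl: true,
  };
  // Only add the preference when the caller asks for one; otherwise the
  // browser default applies, as in the PR's version of the hook.
  if (idealSampleRate !== undefined) {
    constraints.sampleRate = { ideal: idealSampleRate };
  }
  return constraints;
}

// In a browser this could be passed as:
//   navigator.mediaDevices.getUserMedia({ audio: buildAudioConstraints(44100) })
```

Keeping the rate optional lets both behaviours (explicit preference or browser default) share one code path.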

Copilot · 2026-01-30T17:30:00Z

crates/goose-server/src/routes/dictation.rs

+pub fn routes(state: Arc<AppState>) -> Router {
+    Router::new()
+        .route("/dictation/transcribe", post(transcribe_dictation))
+        .route("/dictation/config", get(get_dictation_config))
+        .with_state(state)
+}


The old audio.rs tests were removed but no replacement tests were added for the new dictation.rs module. Consider adding tests to verify the transcription endpoints and provider configuration logic work correctly.

Copilot · 2026-01-30T17:30:00Z

ui/desktop/src/components/ChatInput.tsx

                          <b>Chat</b> {'>'} <b>Voice Dictation.</b>
                        </p>
-                      ) : dictationSettings.provider === null ? (
+                      ) : dictationProvider === null ? (


Variable 'dictationProvider' cannot be of type null, but it is compared to an expression of type null.

Suggested change

-                      ) : dictationProvider === null ? (
+                      ) : dictationProvider === undefined ? (
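
The distinction matters because a strict `=== null` check never matches a value whose type is `string | undefined`. A minimal sketch, assuming `dictationProvider` has that type (its declaration is not shown in this excerpt); `describeProvider` and `isUnset` are hypothetical helpers for illustration only.

```typescript
// Assumed type; the actual declaration is not shown in the PR excerpt.
type DictationProvider = string | undefined;

function describeProvider(dictationProvider: DictationProvider): string {
  // Strict check against undefined, as in the suggested change.
  if (dictationProvider === undefined) {
    return "no provider selected";
  }
  return `provider: ${dictationProvider}`;
}

// A loose `== null` check matches both null and undefined, which can be
// useful when a value may arrive as either (for example, from JSON).
function isUnset(value: string | null | undefined): boolean {
  return value == null;
}
```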

Copilot

Pull request overview

Copilot reviewed 22 out of 22 changed files in this pull request and generated no new comments.

DOsinga · 2026-01-30T21:00:25Z

we talked about cleaning this up @Abhijay007 & @lifeizhou-ap - here's my take. my next plan is to build support for local whisper.

lifeizhou-ap

The cleanup is great! It also uses dictation/config to check whether the API key is configured, which is much faster than rendering the config secret key!
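
A rough sketch of what such a client-side check could look like. The response shape (a `configured` flag per provider), the `DictationConfigSketch` type, and the `isProviderConfigured` helper are all assumptions made for illustration; the generated SDK under ui/desktop/src/api is the authoritative interface. The fetcher is injected so the snippet does not depend on a running server.

```typescript
// Hypothetical response shape for GET /dictation/config (an assumption).
type DictationConfigSketch = Record<string, { configured: boolean }>;

// Injected fetcher: resolves to the parsed JSON body for a given path.
type FetchJson = (path: string) => Promise<unknown>;

async function isProviderConfigured(
  fetchJson: FetchJson,
  provider: string,
): Promise<boolean> {
  const body = (await fetchJson("/dictation/config")) as DictationConfigSketch;
  // A missing provider entry counts as not configured.
  return body?.[provider]?.configured === true;
}
```

A single status request like this avoids loading the secret itself just to learn whether it exists.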

Co-authored-by: Douwe Osinga <douwe@squareup.com> Signed-off-by: Harrison <hcstebbins@gmail.com>

Rejig dictation

cd8e9d4

Copilot AI review requested due to automatic review settings January 30, 2026 17:26

Copilot started reviewing on behalf of DOsinga January 30, 2026 17:27

Copilot AI reviewed Jan 30, 2026

Douwe Osinga added 2 commits January 30, 2026 12:33

Manual clean up

1a3afdd

Simplify further

4865f8b

Copilot AI review requested due to automatic review settings January 30, 2026 18:54

Copilot started reviewing on behalf of DOsinga January 30, 2026 18:55

Copilot AI reviewed Jan 30, 2026

Format

66d0724

DOsinga requested review from Abhijay007 and lifeizhou-ap January 30, 2026 20:59

lifeizhou-ap approved these changes Feb 2, 2026

DOsinga added this pull request to the merge queue Feb 2, 2026

Merged via the queue into main with commit 2661454 Feb 2, 2026
18 checks passed

DOsinga deleted the clean-up-dictation branch February 2, 2026 13:30

Abhijay007 mentioned this pull request Feb 2, 2026

refactor: updated elevenLabs API module and remove button UX #6781

Open

2 tasks

github-actions bot mentioned this pull request Feb 3, 2026

chore(release): release version 1.23.0 (minor) #6906

Merged

stebbins pushed a commit to stebbins/goose that referenced this pull request Feb 4, 2026

Rejig dictation (block#6844)

99116f5

Co-authored-by: Douwe Osinga <douwe@squareup.com> Signed-off-by: Harrison <hcstebbins@gmail.com>

DOsinga merged 4 commits into main from clean-up-dictation

2 participants
