Android: Voice typing: Add setting to allow specifying a glossary #12370

personalizedrefrigerator · 2025-06-02T17:02:21Z

Summary

This pull request adds a setting to allow users to customize the voice typing prompt. Among other things, this allows users to provide spelling and style suggestions for transcriptions.

Testing plan

I've tested this pull request by changing the prompt then starting voice typing and checking the output.

For example, on Android 13, I:

Set the prompt to this text is all lowercase. longer prompts. seem to allow more unusual transcription styles. lowercase. in settings > note.
Read 3-4 paragraphs of text.
Checked that many of the sentences start with lowercase letters.
- With this prompt, sentences sometimes still start with uppercase letters.

personalizedrefrigerator · 2025-06-02T17:03:19Z

packages/lib/models/settings/builtInMetadata.ts

+		'voiceTyping.prompt': {
+			value: '',


Suggested change

'voiceTyping.prompt': {

value: '',

'voiceTyping.prompt': {

advanced: true,

value: '',

It may make sense to move this to the advanced settings section by default.

laurent22 · 2025-06-06T09:17:33Z

packages/lib/models/settings/builtInMetadata.ts

+			label: () => _('Voice typing prompt'),
+			description: () => _('A short example of transcribed text. A prompt can help correct voice typing spelling or change the style of transcription. Leave empty to use the default prompt.'),


I feel this field may be difficult to use or understand as it requires knowing how Whisper works internally.

Would it make sense instead to have a "glossary" property? We ask users to input the words they want the model to understand, separated by commas. Then we automatically prefix this with "glossary:" and set that as a prompt?

Later if we find that access to the actual prompt is needed, we could have a second property for this, but even then I don't think that will be needed. For example if users say they'd like the text to be all lowercase, then we add a property "Set output to lowercase" and we provide a custom prompt ourselves.

Basically we should try to focus on the features the users need, and then convert this to a prompt. Because we know Whisper better than the user we can create better prompts based on their preferences.

Thank you for the feedback!

Would it make sense instead to have a "glossary" property? We ask users to input the words they want the model to understand, separated by commas. Then we automatically prefix this with "glossary:" and set that as a prompt?

Originally, my concern with a "Glossary" property was translating glossary: to all languages supported by Whisper. However, perhaps it would be fine to omit glossary: if we don't have a translation for it?

This should be resolved by 6d3e6cc. It currently works by:

Generating a "Glossary:" prompt based on the voiceTyping.glossary setting (if set).

If the current locale doesn't have a translation for Glossary:, the "Glossary:" prefix is omitted and the voiceTyping.glossary setting is used directly as a prompt.

Concatenating the "Glossary:" prompt with any existing prompt included in the model config.

…ing-prompting

personalizedrefrigerator · 2025-06-06T15:59:49Z

I'm converting this to a draft until the changes from 6d3e6cc have been manually tested.

personalizedrefrigerator · 2025-06-06T19:03:59Z

While testing this with longer audio segments on a low-end device, I've observed several app crashes (perhaps due to high memory usage?). I suspect that the crashes are related to #12352 and not this pull request.

personalizedrefrigerator · 2025-06-06T20:07:18Z

Marking as ready for review — the issue doesn't seem related to this PR.

laurent22 · 2025-06-07T10:27:14Z

readme/dev/spec/voice_typing.md

@@ -10,6 +10,12 @@ By default, Joplin uses Whisper.cpp for voice typing.

 Whisper.cpp provides a number of pre-trained models for transcribing speech in different languages. Both [English-only and multilingual models](https://github.com/openai/whisper?tab=readme-ov-file#available-models-and-languages) are available. The multilingual models support a variety of different languages. Joplin uses the smallest of the multilingual models by default.

+### Preventing spelling mistakes
+
+Joplin allows specifying a glossary for voice typing using the "Voice typing: Glossary" setting (in the "Note" section of settings). Including uncommon words in the glossary makes voice typing more likely to spell them correctly. For example, providing `Scott Joplin, ragtime.` as the prompt helps voice typing correctly spell "Scott Joplin" and "ragtime".


Suggested change

Joplin allows specifying a glossary for voice typing using the "Voice typing: Glossary" setting (in the "Note" section of settings). Including uncommon words in the glossary makes voice typing more likely to spell them correctly. For example, providing `Scott Joplin, ragtime.` as the prompt helps voice typing correctly spell "Scott Joplin" and "ragtime".

Joplin allows specifying a glossary for voice typing using the "Voice typing: Glossary" setting (in the "Note" section of settings). Including uncommon words in the glossary makes voice typing more likely to spell them correctly. For example, providing `Scott Joplin, ragtime.` as the glossary helps voice typing correctly spell "Scott Joplin" and "ragtime".

Suggestion applied in 6fcae46.

laurent22 · 2025-06-07T10:29:26Z

packages/lib/models/settings/builtInMetadata.ts

+			public: true,
+			appTypes: [AppType.Mobile],
+			label: () => _('Voice typing: Glossary'),
+			description: () => _('A comma-separated list of words'),


Suggested change

description: () => _('A comma-separated list of words'),

description: () => _('A comma-separated list of words. May be used for uncommon words, to ensures that voice-typing spells them correctly.'),

Replaced ensures with help, since this setting does not guarantee that voice typing will spell the glossary words correctly. Edit: With this replacement, the suggestion has been applied in f47306f.

Co-authored-by: Laurent Cozic <laurent22@users.noreply.github.com>

personalizedrefrigerator added 2 commits June 2, 2025 09:11

Android: Voice typing: Add setting to allow customizing the prompt

dd48498

Update setting description, documentation

c7b8ad2

personalizedrefrigerator commented Jun 2, 2025

View reviewed changes

personalizedrefrigerator added android Voice typing labels Jun 2, 2025

laurent22 reviewed Jun 6, 2025

View reviewed changes

personalizedrefrigerator added 2 commits June 6, 2025 08:21

Merge remote-tracking branch 'upstream/dev' into pr/android/voice-typ…

0582018

…ing-prompting

Simplify the UI for setting a prompt

6d3e6cc

personalizedrefrigerator marked this pull request as draft June 6, 2025 15:59

personalizedrefrigerator marked this pull request as ready for review June 6, 2025 20:07

laurent22 reviewed Jun 7, 2025

View reviewed changes

personalizedrefrigerator and others added 3 commits June 9, 2025 06:22

Update readme/dev/spec/voice_typing.md

6fcae46

Co-authored-by: Laurent Cozic <laurent22@users.noreply.github.com>

Update setting description

f47306f

Merge branch 'dev' into pr/android/voice-typing-prompting

023eca3

personalizedrefrigerator changed the title ~~Android: Voice typing: Add setting to allow customizing the prompt~~ Android: Voice typing: Add setting to allow specifying a glossary Jun 12, 2025

laurent22 merged commit 6a5c85d into laurent22:dev Jun 28, 2025
12 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Android: Voice typing: Add setting to allow specifying a glossary #12370

Android: Voice typing: Add setting to allow specifying a glossary #12370

Uh oh!

personalizedrefrigerator commented Jun 2, 2025

Uh oh!

personalizedrefrigerator Jun 2, 2025

Uh oh!

laurent22 Jun 6, 2025

Uh oh!

personalizedrefrigerator Jun 6, 2025 •

edited

Loading

Uh oh!

personalizedrefrigerator Jun 6, 2025

Uh oh!

personalizedrefrigerator commented Jun 6, 2025

Uh oh!

personalizedrefrigerator commented Jun 6, 2025

Uh oh!

personalizedrefrigerator commented Jun 6, 2025

Uh oh!

laurent22 Jun 7, 2025

Uh oh!

personalizedrefrigerator Jun 11, 2025

Uh oh!

laurent22 Jun 7, 2025

Uh oh!

personalizedrefrigerator Jun 9, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

		label: () => _('Voice typing prompt'),
		description: () => _('A short example of transcribed text. A prompt can help correct voice typing spelling or change the style of transcription. Leave empty to use the default prompt.'),

	Joplin allows specifying a glossary for voice typing using the "Voice typing: Glossary" setting (in the "Note" section of settings). Including uncommon words in the glossary makes voice typing more likely to spell them correctly. For example, providing `Scott Joplin, ragtime.` as the prompt helps voice typing correctly spell "Scott Joplin" and "ragtime".
	Joplin allows specifying a glossary for voice typing using the "Voice typing: Glossary" setting (in the "Note" section of settings). Including uncommon words in the glossary makes voice typing more likely to spell them correctly. For example, providing `Scott Joplin, ragtime.` as the glossary helps voice typing correctly spell "Scott Joplin" and "ragtime".

	description: () => _('A comma-separated list of words'),
	description: () => _('A comma-separated list of words. May be used for uncommon words, to ensures that voice-typing spells them correctly.'),

Uh oh!

Android: Voice typing: Add setting to allow specifying a glossary #12370

Android: Voice typing: Add setting to allow specifying a glossary #12370

Uh oh!

Conversation

personalizedrefrigerator commented Jun 2, 2025

Summary

Testing plan

Uh oh!

personalizedrefrigerator Jun 2, 2025

Choose a reason for hiding this comment

Uh oh!

laurent22 Jun 6, 2025

Choose a reason for hiding this comment

Uh oh!

personalizedrefrigerator Jun 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

personalizedrefrigerator Jun 6, 2025

Choose a reason for hiding this comment

Uh oh!

personalizedrefrigerator commented Jun 6, 2025

Uh oh!

personalizedrefrigerator commented Jun 6, 2025

Uh oh!

personalizedrefrigerator commented Jun 6, 2025

Uh oh!

laurent22 Jun 7, 2025

Choose a reason for hiding this comment

Uh oh!

personalizedrefrigerator Jun 11, 2025

Choose a reason for hiding this comment

Uh oh!

laurent22 Jun 7, 2025

Choose a reason for hiding this comment

Uh oh!

personalizedrefrigerator Jun 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

personalizedrefrigerator Jun 6, 2025 •

edited

Loading

personalizedrefrigerator Jun 9, 2025 •

edited

Loading