Speech To Text in VS code is awkward on MacOS #213149
Comments
If you could just hook the
Here's an example of the duplicate-text bug. I'm speaking It seems that if I don't allow enough silence for it to 'settle down' after I've said 'full stop', the utterance text gets double-injected into the window. I can't replicate this particular failure in TextEdit, although dictation isn't 100% right there either: it inserts unwanted newline characters.
Screen.Recording.2024-05-30.at.09.34.53.mov
Here's a demo of the wordwrap + superposition issue:
Screen.Recording.2024-05-30.at.09.49.52.mov
Here's an example of the capitalization-of-start-of-new-phrase problem:
There are other situations where I get a capitalization failure, e.g. placing the cursor inside a sentence and speaking.
Screen.Recording.2024-05-30.at.09.57.12.mov
This one is probably a really tricky fix, as the macOS dictation assistant is clearly scraping the text of the active window and operating on that. I think a VS Code native speech tool would be a much-appreciated feature!
Here's a nice repeatable minimal testcase for duplication. All I do here is double-tap And then I just wait. Firstly it DOESN'T create a new paragraph, just a couple of spaces.
Screen.Recording.2024-06-02.at.12.15.05.mov
Type: Bug
Just try using the built-in macOS Dictation tool in VS Code.
(This tool can be activated under System Settings -> Keyboard -> Dictation.)
Many problems:
I think the fundamental problem here is with this macOS tool. Its design seems overly complex and intricate, and it often falls over.
Given that most VS Code users spend most of their day entering text into VS Code, it would be really nice to have a solution that takes care of speech-to-text. Maybe a fix to interoperate with this Dictation tool, maybe an extension, maybe core VS Code functionality.
I'm not bothered about speech-to-code; I'm quite happy to type my code. But if I am editing text files (.txt, .md, .nt, etc.) or modifying text content within code (e.g. AI prompts, docstrings, strings, comments, etc.), I would like something simple and reliable.
VS Code version: Code 1.89.1 (dc96b83, 2024-05-07T05:14:32.757Z)
OS version: Darwin arm64 23.4.0
Modes:
System Info
canvas_oop_rasterization: enabled_on
direct_rendering_display_compositor: disabled_off_ok
gpu_compositing: enabled
multiple_raster_threads: enabled_on
opengl: enabled_on
rasterization: enabled
raw_draw: disabled_off_ok
skia_graphite: disabled_off
video_decode: enabled
video_encode: enabled
webgl: enabled
webgl2: enabled
webgpu: enabled
Extensions (21)
(1 theme extensions excluded)
A/B Experiments