Description
The new text editor popup opens the door to something I've been wishing for in a mobile terminal app: natural dictation. Apple's dictation model doesn't cut it, but I've been experimenting with chat models over REST and with third-party apps, and the results are excellent, except for the serious inconvenience that none of it is native to the app I want to write in. Switching apps to record, copy, and then paste is really bad for productivity, especially on the Apple Vision. I see two tiers of implementation:
First, simply use the API you've already built into the app: add a dictation button to the text editor popup that records audio and sends it in a request like the one below. This is what I've been playing with, to great success. It would be nice to be able to set a custom prompt, so that users could add references to their custom tooling, for example.
{
  "model": "google/gemini-2.5-flash",
  "messages": [
    {
      "role": "system",
      "content": "You translate spoken natural language into a single valid POSIX shell command. Output ONLY the command. No explanations. No backticks."
    },
    {
      "role": "user",
      "content": [
        {
          "type": "input_audio",
          "input_audio": {
            "data": "base64_encoded_audio_here",
            "format": "m4a"
          }
        }
      ]
    }
  ]
}
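As a rough illustration of the first tier, here is a minimal Python sketch of how the request body above could be assembled from a recording plus an optional custom prompt. The helper name `build_dictation_request` and its parameters are hypothetical, and nothing here reflects how the app actually builds its requests; the payload shape simply mirrors the JSON example in this issue. Sending the request is left out, since the endpoint and authentication depend on the provider configured in the app.

```python
import base64
import json

# Default system prompt, copied from the request example in this issue.
DEFAULT_PROMPT = (
    "You translate spoken natural language into a single valid POSIX shell "
    "command. Output ONLY the command. No explanations. No backticks."
)


def build_dictation_request(audio_bytes, audio_format="m4a",
                            model="google/gemini-2.5-flash",
                            custom_prompt=""):
    """Build the chat payload for one dictation recording.

    Hypothetical helper: the name and parameters are illustrative only.
    """
    system_prompt = DEFAULT_PROMPT
    if custom_prompt.strip():
        # Append the user's own hints, e.g. references to custom tooling.
        system_prompt += "\n\n" + custom_prompt.strip()

    return {
        "model": model,
        "messages": [
            {"role": "system", "content": system_prompt},
            {
                "role": "user",
                "content": [
                    {
                        "type": "input_audio",
                        "input_audio": {
                            # Audio is sent inline as base64 text.
                            "data": base64.b64encode(audio_bytes).decode("ascii"),
                            "format": audio_format,
                        },
                    }
                ],
            },
        ],
    }


if __name__ == "__main__":
    # Placeholder bytes stand in for a real recording.
    payload = build_dictation_request(
        b"placeholder-audio",
        custom_prompt="Prefer rg over grep.",
    )
    print(json.dumps(payload, indent=2))
```

Keeping the custom prompt as an addendum to the default one, rather than a full replacement, is just one possible design: it preserves the "output only the command" constraint while still letting users describe their own tooling.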
The second tier would be to add offline recording. Lately I've been using the Parakeet model with Superwhisper. It's lightning fast, only 450 MB, and local. But this doesn't seem like a worthwhile implementation, to be honest, because the translation to a POSIX command would need to be done online anyway. I briefly tested a couple of tiny LLMs and they don't seem smart enough to be reliable for this purpose. With further testing and the right model I'm sure there would be a good option, but frankly I don't think it's worth the effort.
I do think this relatively minimal work (considering you already have AI APIs built into the app) would be a major usability improvement and would take this app from a top-3 terminal app on the App Store to a clear #1 for productivity.