-
Notifications
You must be signed in to change notification settings - Fork 2
feat: transcription & AI - Whisper, summaries, task extraction #39
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
abc7825 to
5d4dfb5
Compare
aa12b0c to
715b152
Compare
5d4dfb5 to
9cfb638
Compare
715b152 to
36dd058
Compare
This stack of pull requests is managed by Graphite. Learn more about stacking. |
36dd058 to
1093438
Compare
9cfb638 to
5510bfd
Compare
7800586 to
cb49c82
Compare
41e9a77 to
8115bff
Compare
joshsny
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM - I haven't run it locally, but code looks good, looking forward to merging this in and trying it out!
|
|
||
| return false; | ||
| // Check OpenAI key | ||
| if (state.encryptedOpenaiKey) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
we can leave this for now if you want, but it'd probably be fairly simple to get this working through the gateway
| return resolvedPath.startsWith(recordingsDirResolved + path.sep); | ||
| } | ||
|
|
||
| async function generateTranscriptSummary( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This should go through our llm gateway already, since that supports gpt-4o-mini, I think the AI SDK works fine with a proxy
- OpenAI Whisper transcription (gpt-4o-mini-transcribe) - Auto-generate 3-7 word summaries (GPT-4o-mini) - Extract actionable tasks from transcripts - TranscriptionSection UI component - SettingsPanel for OpenAI key management - Secure API key storage in authStore - 25MB file size limit handling
8115bff to
4da4f22
Compare

Part 4/4 of audio recording feature stack.
Changes
Stack
Review notes
Completes the feature. Focus on AI prompts, error handling, and API key security.