Skip to content

Add streaming support#2

Merged
sumerc merged 11 commits intomainfrom
streaming-text
Feb 8, 2026
Merged

Add streaming support#2
sumerc merged 11 commits intomainfrom
streaming-text

Conversation

@sumerc
Copy link
Copy Markdown
Owner

@sumerc sumerc commented Feb 7, 2026

Add live transcription mode(with auto-paste) using Deepgram streaming API and major refactoring around transcriber api

added unit/integration tests and refactored transcribe backend to use multiple backends easily.

- Add live transcription mode using Deepgram streaming API
- Implement streaming session management with audio chunking and commit logic
- Update TUI to display live transcription text and metrics
- Add support for auto-pasting live transcription text
- Handle errors and disconnections during live streaming sessions
- Update documentation and logs to reflect live streaming support
@sumerc sumerc self-assigned this Feb 7, 2026
@sumerc sumerc marked this pull request as draft February 7, 2026 20:41
sumerc added 10 commits February 7, 2026 23:43
- implement deepgram stream session
- add methods for sending and receiving audio data
- handle connection closure and errors
- parse deepgram stream responses into transcript updates
- change time format to include date
- improve log readability with new format
…ing support

- Remove unused code and refactor main function for clarity
- Add support for streaming transcription with Deepgram API
- Update log functions to use fmt style strings
- Simplify device selection and handle device switch
- Improve error handling and add more informative error messages
- Refactor transcription session code to use a common interface
- Update transcriber code to use a session based approach
- Remove adaptive encoder and threshold based encoding
- Improve code formatting and consistency throughout the project
- Refactored TranscriptionMetrics function to include TLS protocol information
- Improved warm connection handling in TracedClient
- Removed redundant code in batchSession and transcriber files
- Added voiceThreshold constant for no voice warning
- Updated audio level smoothing and no voice warning logic in tui file
- Improved code organization and formatting throughout the project
- Added test flags to enable integration testing
- Created fake audio context and capture device for testing
- Implemented test mode for running zee with pre-recorded audio
- Added tests for batch and stream transcription modes
- Included tests for clipboard functionality and error handling
- Updated transcriber session to track stream metrics and audio duration
- Improved logging and diagnostics for testing and debugging purposes
- Add macOS integration job that runs after unit tests
- Inject GROQ and DEEPGRAM API keys from secrets
- Run make test-integration for end-to-end validation
- Expand make test to run all packages instead of only encoder
- Add FLAC encoder tests covering full encode, empty input, and partial blocks
- Add hybrid hotkey tests for long press, short tap toggle, and multi-cycle
- Add log package tests for path resolution, init, transcription text, and idempotent close
- Add transcriber tests for NetworkMetrics, format parsing, encoder creation, and batch session
@sumerc sumerc marked this pull request as ready for review February 8, 2026 22:44
@sumerc sumerc merged commit a65e64a into main Feb 8, 2026
2 checks passed
@sumerc sumerc mentioned this pull request Feb 8, 2026
sumerc added a commit that referenced this pull request Mar 2, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant