Conversation
- Add live transcription mode using Deepgram streaming API
- Implement streaming session management with audio chunking and commit logic
- Update TUI to display live transcription text and metrics
- Add support for auto-pasting live transcription text
- Handle errors and disconnections during live streaming sessions
- Update documentation and logs to reflect live streaming support

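The audio chunking mentioned in this commit could look roughly like the sketch below. It is an illustration, not the PR's code: `chunkPCM`, the 16 kHz 16-bit mono format, and the 100 ms chunk size are all assumptions made for the example.

```go
package main

import "fmt"

// chunkPCM (hypothetical helper) splits raw PCM bytes into slices of at
// most size bytes. The final chunk may be shorter. The returned slices
// alias pcm rather than copying it.
func chunkPCM(pcm []byte, size int) [][]byte {
	if size <= 0 {
		return nil
	}
	chunks := make([][]byte, 0, (len(pcm)+size-1)/size)
	for start := 0; start < len(pcm); start += size {
		end := start + size
		if end > len(pcm) {
			end = len(pcm)
		}
		chunks = append(chunks, pcm[start:end])
	}
	return chunks
}

func main() {
	// Assumed format: 16 kHz, 16-bit mono, so 100 ms of audio is 3200 bytes.
	const chunkBytes = 16000 * 2 / 10
	pcm := make([]byte, 8000) // 250 ms of silence as stand-in audio
	for i, c := range chunkPCM(pcm, chunkBytes) {
		fmt.Printf("chunk %d: %d bytes\n", i, len(c))
	}
}
```

Small fixed-size chunks keep latency low on a streaming connection; the right size depends on the backend and is not stated in this PR.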
- implement deepgram stream session
- add methods for sending and receiving audio data
- handle connection closure and errors
- parse deepgram stream responses into transcript updates

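As a rough illustration of the last item, the sketch below decodes one streaming message into a transcript update. It assumes the message carries `type`, `is_final`, and `channel.alternatives[].transcript` fields as in Deepgram's live transcription results. `streamMessage`, `TranscriptUpdate`, and `parseStreamMessage` are hypothetical names, not the ones used in this PR.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// streamMessage mirrors only the fields this sketch needs from a
// streaming "Results" message (assumed shape).
type streamMessage struct {
	Type    string `json:"type"`
	IsFinal bool   `json:"is_final"`
	Channel struct {
		Alternatives []struct {
			Transcript string `json:"transcript"`
		} `json:"alternatives"`
	} `json:"channel"`
}

// TranscriptUpdate is a hypothetical type handed to the UI layer.
type TranscriptUpdate struct {
	Text  string
	Final bool // true once the backend will no longer revise this text
}

// parseStreamMessage returns ok=false for messages that carry no
// transcript (for example metadata messages), without treating them
// as errors.
func parseStreamMessage(raw []byte) (update TranscriptUpdate, ok bool, err error) {
	var msg streamMessage
	if err := json.Unmarshal(raw, &msg); err != nil {
		return TranscriptUpdate{}, false, fmt.Errorf("decode stream message: %w", err)
	}
	if msg.Type != "Results" || len(msg.Channel.Alternatives) == 0 {
		return TranscriptUpdate{}, false, nil
	}
	return TranscriptUpdate{
		Text:  msg.Channel.Alternatives[0].Transcript,
		Final: msg.IsFinal,
	}, true, nil
}

func main() {
	raw := []byte(`{"type":"Results","is_final":true,"channel":{"alternatives":[{"transcript":"hello world"}]}}`)
	u, ok, err := parseStreamMessage(raw)
	fmt.Println(u.Text, u.Final, ok, err)
}
```

Separating interim from final updates lets a caller redraw provisional text in place and only commit (or paste) text once it is final.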
- change time format to include date
- improve log readability with new format

…ing support
- Remove unused code and refactor main function for clarity
- Add support for streaming transcription with Deepgram API
- Update log functions to use fmt style strings
- Simplify device selection and handle device switch
- Improve error handling and add more informative error messages
- Refactor transcription session code to use a common interface
- Update transcriber code to use a session based approach
- Remove adaptive encoder and threshold based encoding
- Improve code formatting and consistency throughout the project

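One way to picture the "common interface" and "session based approach" items is the sketch below. The `Session` interface, its `Feed` and `Close` methods, and both implementations are assumptions made for illustration; the PR's actual types may differ.

```go
package main

import (
	"bytes"
	"fmt"
	"strings"
)

// Session is a hypothetical interface shared by batch and streaming
// transcription: audio is fed in chunks, Close returns the final text.
type Session interface {
	Feed(chunk []byte) error
	Close() (string, error)
}

// batchSession buffers all audio and transcribes once on Close.
type batchSession struct {
	buf        bytes.Buffer
	transcribe func(audio []byte) (string, error) // stand-in for an upload call
}

func (s *batchSession) Feed(chunk []byte) error {
	_, err := s.buf.Write(chunk)
	return err
}

func (s *batchSession) Close() (string, error) {
	return s.transcribe(s.buf.Bytes())
}

// streamSession forwards each chunk immediately and collects the text
// that the backend reports along the way.
type streamSession struct {
	send  func(chunk []byte) (string, error) // stand-in for a socket write/read
	parts []string
}

func (s *streamSession) Feed(chunk []byte) error {
	text, err := s.send(chunk)
	if err != nil {
		return err
	}
	if text != "" {
		s.parts = append(s.parts, text)
	}
	return nil
}

func (s *streamSession) Close() (string, error) {
	return strings.Join(s.parts, " "), nil
}

// run drives any Session the same way, regardless of backend.
func run(s Session, chunks [][]byte) (string, error) {
	for _, c := range chunks {
		if err := s.Feed(c); err != nil {
			return "", err
		}
	}
	return s.Close()
}

func main() {
	chunks := [][]byte{[]byte("ab"), []byte("c")}
	echo := func(b []byte) (string, error) { return string(b), nil }
	batch, _ := run(&batchSession{transcribe: echo}, chunks)
	stream, _ := run(&streamSession{send: echo}, chunks)
	fmt.Println(batch, "|", stream)
}
```

With this shape, the recording loop does not need to know which mode is active; only the code that constructs the session does.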
- Refactored TranscriptionMetrics function to include TLS protocol information
- Improved warm connection handling in TracedClient
- Removed redundant code in batchSession and transcriber files
- Added voiceThreshold constant for no voice warning
- Updated audio level smoothing and no voice warning logic in tui file
- Improved code organization and formatting throughout the project

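The level smoothing and no-voice warning could be sketched as follows. This uses an exponential moving average, which is an assumption about what "smoothing" means here. The `voiceThreshold` name comes from the commit message, but its value, the `smoothing` factor, and the `levelMeter` type are made up for the example.

```go
package main

import "fmt"

const (
	voiceThreshold = 0.02 // assumed value; the PR does not state it
	smoothing      = 0.3  // assumed weight given to the newest reading
)

// levelMeter (hypothetical) tracks a smoothed input level and the
// highest smoothed level seen during a recording.
type levelMeter struct {
	level float64
	peak  float64
}

// update blends a new level reading (0..1) into the smoothed level
// using an exponential moving average.
func (m *levelMeter) update(reading float64) {
	m.level = smoothing*reading + (1-smoothing)*m.level
	if m.level > m.peak {
		m.peak = m.level
	}
}

// noVoice reports whether the recording never rose above the threshold,
// which is when a "no voice detected" warning would be shown.
func (m *levelMeter) noVoice() bool {
	return m.peak < voiceThreshold
}

func main() {
	quiet, loud := &levelMeter{}, &levelMeter{}
	for i := 0; i < 10; i++ {
		quiet.update(0.001)
		loud.update(0.5)
	}
	fmt.Println(quiet.noVoice(), loud.noVoice())
}
```

Smoothing keeps the meter in the TUI from flickering on single loud or silent frames, while tracking the peak avoids warning about a recording that had speech early on.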
- Added test flags to enable integration testing
- Created fake audio context and capture device for testing
- Implemented test mode for running zee with pre-recorded audio
- Added tests for batch and stream transcription modes
- Included tests for clipboard functionality and error handling
- Updated transcriber session to track stream metrics and audio duration
- Improved logging and diagnostics for testing and debugging purposes

- Add macOS integration job that runs after unit tests
- Inject GROQ and DEEPGRAM API keys from secrets
- Run make test-integration for end-to-end validation

- Expand make test to run all packages instead of only encoder
- Add FLAC encoder tests covering full encode, empty input, and partial blocks
- Add hybrid hotkey tests for long press, short tap toggle, and multi-cycle
- Add log package tests for path resolution, init, transcription text, and idempotent close
- Add transcriber tests for NetworkMetrics, format parsing, encoder creation, and batch session

Add live transcription mode (with auto-paste) using the Deepgram streaming API, plus a major refactoring of the transcriber API.
Added unit and integration tests, and refactored the transcription backend so that multiple backends can be plugged in easily.