Status: Work in Progress
A CLI utility and workspace template for importing, organizing, and transcribing audio files from Digital Voice Recorders (DVRs).
Digital voice recorders produce flat directories of MP3 files with minimal metadata. Managing, organizing, and transcribing these recordings is tedious and time-consuming.
A streamlined workflow that:
- Imports recordings from a mounted DVR
- Organizes files into date-based folders (DDMM format)
- Classifies recordings using AI (via short audio samples)
- Transcribes selected recordings with speaker identification
- Exports transcripts and summaries to cloud storage
- CLI for syncing from DVR mount point
- Move (default) or copy modes
- Auto-cleanup of accidental recordings (<10 seconds)
- Date-based folder organization
- Extract 30-second samples from recordings
- Send to Gemini for title and summary generation
- Avoid processing full multi-hour recordings just for metadata
- Selective full transcription via Gemini API
- Voice sample-based speaker diarization
- Accurate speaker labels without manual intervention
For recordings that may serve as legal evidence:
- No file modifications or renaming
- SHA256 checksum calculation
- Cloud backup (Google Drive, S3, WORM storage)
- Meeting notes and discussions
- Personal voice memos
- Research interviews
- Property viewing notes
- Any scenario where voice capture beats typing
- Linux
- Python 3.10+
- Gemini API key
- Mounted DVR (USB mass storage)
Coming soon
Coming soon
TBD