Releases: Parth2684/documind-native
Release list
Documind v1.0.0 — First Stable Release
Documind v1.0.0 First Stable Release
I'm excited to finally release the first stable version of Documind.
Documind is a privacy focused desktop application that helps you extract text from documents using AI and convert text into natural sounding speech directly on your device. The goal is to provide powerful document tools while keeping users in control of their data.
What's Included
📄 AI Powered OCR
Extract text from PDFs and images using Google's Gemini models.
OCR modes are available for:
- Documents
- Notes
- Presentations
- Code
- General text
🎙️ Fully Local Text to Speech
Text to speech is powered by the Kokoro ONNX model and runs locally on your machine.
Features include:
- 27 voices
- Adjustable playback speed
- No cloud TTS services
- No text sent to external providers for speech generation
🔒 Privacy First Security
Security was a core focus from the start.
- PIN protected vaults powered by IOTA Stronghold
- Encrypted Gemini API key storage
- No plaintext API keys
- Local first architecture
- No telemetry
- No tracking
📜 Activity History
Keep track of previous OCR and TTS operations.
- View extracted text
- Open generated audio files
- Delete history records when needed
🖥️ Cross Platform
Available for:
- Windows
- macOS
- Linux
Built With
- Rust
- Tauri
- React
- TypeScript
- SQLite
- IOTA Stronghold
- Gemini API
- Kokoro ONNX Runtime
Notes
Documind requires:
- eSpeak NG for text to speech
- Poppler for PDF processing
Installation instructions for each platform are available in the README.
Feedback
If you encounter any bugs, issues, or have suggestions for improvements, please open an issue on GitHub. Future updates and fixes will be driven by community feedback and reported issues.
Thank you for checking out Documind. I hope you find it useful.
documind-v0.1.0
Features
- OCR for images and PDFs
- Gemini-powered document processing
- Kokoro-based text-to-speech
- Secure API key storage using Stronghold
- Local SQLite history storage
Downloads
Linux
- documind-linux-x86_64.tar.gz
Dependencies
- espeak-ng