Skip to content

Releases: Parth2684/documind-native

Documind v1.0.0 — First Stable Release

Choose a tag to compare

@Parth2684 Parth2684 released this 05 Jun 09:33

Documind v1.0.0 First Stable Release

I'm excited to finally release the first stable version of Documind.

Documind is a privacy focused desktop application that helps you extract text from documents using AI and convert text into natural sounding speech directly on your device. The goal is to provide powerful document tools while keeping users in control of their data.

What's Included

📄 AI Powered OCR

Extract text from PDFs and images using Google's Gemini models.

OCR modes are available for:

  • Documents
  • Notes
  • Presentations
  • Code
  • General text

🎙️ Fully Local Text to Speech

Text to speech is powered by the Kokoro ONNX model and runs locally on your machine.

Features include:

  • 27 voices
  • Adjustable playback speed
  • No cloud TTS services
  • No text sent to external providers for speech generation

🔒 Privacy First Security

Security was a core focus from the start.

  • PIN protected vaults powered by IOTA Stronghold
  • Encrypted Gemini API key storage
  • No plaintext API keys
  • Local first architecture
  • No telemetry
  • No tracking

📜 Activity History

Keep track of previous OCR and TTS operations.

  • View extracted text
  • Open generated audio files
  • Delete history records when needed

🖥️ Cross Platform

Available for:

  • Windows
  • macOS
  • Linux

Built With

  • Rust
  • Tauri
  • React
  • TypeScript
  • SQLite
  • IOTA Stronghold
  • Gemini API
  • Kokoro ONNX Runtime

Notes

Documind requires:

  • eSpeak NG for text to speech
  • Poppler for PDF processing

Installation instructions for each platform are available in the README.

Feedback

If you encounter any bugs, issues, or have suggestions for improvements, please open an issue on GitHub. Future updates and fixes will be driven by community feedback and reported issues.

Thank you for checking out Documind. I hope you find it useful.

documind-v0.1.0

Choose a tag to compare

@Parth2684 Parth2684 released this 01 Jun 19:31

Features

  • OCR for images and PDFs
  • Gemini-powered document processing
  • Kokoro-based text-to-speech
  • Secure API key storage using Stronghold
  • Local SQLite history storage

Downloads

Linux

  • documind-linux-x86_64.tar.gz

Dependencies

  • espeak-ng