Scan printed text with your camera and have it read aloud, with Swedish voice support.
- Camera OCR: Capture text from printed documents using your webcam
- Text-to-Speech: Read scanned text aloud with Swedish or English voices
- Accessible UI: Large buttons, high contrast, screen reader friendly (ARASAAC-compatible)
- Multi-language: Swedish as primary language, English fallback
- Editable text: Review and edit scanned text before reading
- Multiple TTS backends: Piper, espeak-ng, macOS say, or pyttsx3
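As a rough illustration of how one of several TTS backends might be chosen at runtime, the sketch below probes for the command-line tools and falls back to pyttsx3. The function name `pick_tts_backend`, the preference order, and the assumption that Piper is available as a `piper` executable are all hypothetical; the project's own `tts.py` may work differently.

```python
import shutil
import sys

# Preference order is an assumption for illustration, not taken from the project.
CLI_BACKENDS = [
    ("piper", "piper"),          # assumes piper-tts exposes a `piper` executable
    ("espeak-ng", "espeak-ng"),
    ("say", "say"),              # macOS built-in speech command
]


def pick_tts_backend(which=shutil.which, platform=sys.platform):
    """Return the name of the first usable backend (hypothetical helper).

    `which` and `platform` are injectable so the choice can be tested
    without the tools being installed.
    """
    for name, executable in CLI_BACKENDS:
        # `say` only makes sense on macOS, so skip it elsewhere.
        if name == "say" and platform != "darwin":
            continue
        if which(executable):
            return name
    # Nothing found on PATH: fall back to the pure-Python option.
    return "pyttsx3"


print(pick_tts_backend())
```

Passing `which` and `platform` as parameters keeps the selection logic separate from the machine it runs on, which makes the fallback order easy to check.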
Ubuntu/Debian:
sudo apt install python3-gi python3-gi-cairo gir1.2-gtk-4.0 gir1.2-adw-1
sudo apt install tesseract-ocr tesseract-ocr-swe
sudo apt install espeak-ng
Fedora:
sudo dnf install python3-gobject gtk4 libadwaita
sudo dnf install tesseract tesseract-langpack-swe
sudo dnf install espeak-ng
macOS (Homebrew):
brew install pygobject3 gtk4 libadwaita
brew install tesseract tesseract-lang
Piper TTS (optional, high-quality Swedish voices):
pip install piper-tts
# Download Swedish voice model:
# https://github.com/rhasspy/piper/releases
Install the Python dependencies:
pip install -r requirements.txt
Install the package:
pip install .
Compile the Swedish translation:
msgfmt locale/sv/LC_MESSAGES/readaloud.po -o locale/sv/LC_MESSAGES/readaloud.mo
Run the application:
readaloud
Or run directly:
python -m readaloud.main
- Click "Skanna text" (Scan text) to capture from your camera
- Review the extracted text in the text area
- Click "Läs upp" (Read aloud) to hear the text read aloud
- Use Pause/Stop to control playback
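The scan, review, and read-aloud steps above can be approximated outside the GUI with the command-line tools from the installation section. This is a minimal sketch, not the application's implementation: it assumes an image has already been saved to disk (the app itself captures from the camera via OpenCV), and the helper names `ocr_command`, `speak_command`, and `scan_and_read` are hypothetical.

```python
import subprocess


def ocr_command(image_path, langs=("swe", "eng")):
    """Build a tesseract call that prints recognized text to stdout.

    `-l swe+eng` asks Tesseract to use the Swedish and English language data together.
    """
    return ["tesseract", image_path, "stdout", "-l", "+".join(langs)]


def speak_command(voice="sv"):
    """Build an espeak-ng call that reads the text to speak from stdin."""
    return ["espeak-ng", "-v", voice, "--stdin"]


def scan_and_read(image_path, voice="sv"):
    """Run OCR on a saved image, then speak the result (hypothetical helper)."""
    result = subprocess.run(
        ocr_command(image_path), capture_output=True, text=True, check=True
    )
    text = result.stdout.strip()
    if text:
        # Feeding the text on stdin avoids problems with text that starts with "-".
        subprocess.run(speak_command(voice), input=text, text=True, check=True)
    return text
```

Calling `scan_and_read("page.png")` with a hypothetical image file would return the recognized text after speaking it; the returned string is the part a user would review and edit in the application's text area.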
ReadAloud/
├── readaloud/
│   ├── __init__.py      # Package metadata
│   ├── main.py          # Entry point
│   ├── app.py           # Gtk.Application
│   ├── window.py        # Main window UI
│   ├── ocr.py           # OpenCV + Tesseract OCR
│   ├── tts.py           # Text-to-Speech engine
│   └── i18n.py          # Internationalization
├── locale/
│   ├── readaloud.pot    # Translation template
│   └── sv/LC_MESSAGES/
│       └── readaloud.po # Swedish translation
├── data/
│   └── se.readaloud.App.desktop
├── setup.py
├── requirements.txt
└── README.md
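The `locale/sv/LC_MESSAGES/` layout matches what Python's standard `gettext` module expects once the `.po` file has been compiled to `readaloud.mo`. The sketch below shows one way such a catalog could be loaded. It is an assumption about how `i18n.py` might work, and both the helper name `load_translator` and the message `"Scan text"` are hypothetical.

```python
import gettext


def load_translator(lang="sv", localedir="locale", domain="readaloud"):
    """Return a gettext function for `lang` (hypothetical helper).

    gettext looks for <localedir>/<lang>/LC_MESSAGES/<domain>.mo.
    With fallback=True, a missing catalog yields the untranslated
    English strings instead of raising an error.
    """
    catalog = gettext.translation(
        domain, localedir=localedir, languages=[lang], fallback=True
    )
    return catalog.gettext


_ = load_translator()
# "Scan text" is an illustrative message id, not confirmed from the project.
print(_("Scan text"))
```

Using `fallback=True` lines up with the English-fallback behavior listed in the features: if the Swedish catalog has not been compiled yet, the interface strings simply stay in English.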
ReadAloud is designed for users with visual impairments and dyslexia:
- Large touch targets (minimum 52px height)
- High-contrast text (16px+ font size)
- Full keyboard navigation (Ctrl+Q to quit)
- Screen reader compatible labels and tooltips
- ARASAAC-compatible design principles
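The size targets listed above could be expressed as GTK CSS. The following sketch only builds the stylesheet text; the helper name `accessibility_css` and the choice of selectors are assumptions, and loading the result into the application (for example through a `Gtk.CssProvider`) is left out.

```python
def accessibility_css(min_height_px=52, font_size_px=16):
    """Build a GTK CSS snippet for the size targets (hypothetical helper).

    Defaults mirror the documented minimums: 52px touch targets
    and a 16px font size.
    """
    return (
        f"button {{ min-height: {min_height_px}px; }}\n"
        f"button, label, textview {{ font-size: {font_size_px}px; }}\n"
    )


print(accessibility_css())
```

Keeping the minimums as parameters makes it straightforward to offer a larger preset for users who need one.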
GPL-3.0