AnkiOCR

Anki 2.1 addon to generate OCR text from images inside of Anki notes/cards. Note that this is only designed for computer-generated text, not handwriting.

The aim of this addon is to generate searchable text for image-heavy notes. It is not intended to produce high-quality, perfectly ordered text!

Note that because this addon changes the note template, you will see a warning about changing the database and uploading to AnkiWeb. This is normal.

Usage

Open the card browser and select the note(s) you want to process. Use the search bar at the top, select tags, decks, etc.
On the toolbar at the top, select 'Cards', then 'AnkiOCR', then 'Run AnkiOCR on selected notes'.

After processing, each image in the note will have its OCR data embedded in the HTML title attribute of the image tag, viewable as a tooltip.

If you want to remove the OCR data from any notes, select them and then use the 'Remove OCR data from selected notes' option in the same 'AnkiOCR' menu.
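To make the tooltip mechanism concrete, here is a minimal sketch of how OCR text could be written into, and later stripped from, the title attribute of image tags in a note field. The helper names (embed_title, strip_titles) and the regex-based approach are assumptions for illustration and are not taken from the AnkiOCR source. For simplicity the sketch applies the same text to every image in the field, whereas a real run would use the text recognised for each image.

```python
import html
import re

# Hypothetical helpers for illustration only; not the addon's actual code.
IMG_TAG = re.compile(r"<img\b[^>]*>", re.IGNORECASE)
TITLE_ATTR = re.compile(r'\s+title="[^"]*"', re.IGNORECASE)


def strip_titles(field_html: str) -> str:
    """Remove any title="..." attribute from every <img> tag."""
    return IMG_TAG.sub(lambda m: TITLE_ATTR.sub("", m.group(0)), field_html)


def embed_title(field_html: str, ocr_text: str) -> str:
    """Set title="<ocr_text>" on every <img> tag, replacing any old title."""
    # Escape quotes and angle brackets so the text cannot break the tag.
    escaped = html.escape(ocr_text, quote=True)

    def add_title(match: re.Match) -> str:
        tag = TITLE_ATTR.sub("", match.group(0))
        # Insert the attribute directly after the leading "<img".
        return tag[:4] + f' title="{escaped}"' + tag[4:]

    return IMG_TAG.sub(add_title, field_html)


if __name__ == "__main__":
    field = 'Front <img src="cell.png"> text'
    tagged = embed_title(field, "Mitochondria: the powerhouse")
    print(tagged)
    print(strip_titles(tagged))
```

Because the text lives in an attribute rather than in the visible field content, the card still looks the same, while the text remains part of the field's HTML.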

If you would rather have the OCR data written to a separate 'OCR' field on the note, set the text_output_location config option to new_field. Be aware that this modifies the note types in your deck.
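Anki addon configuration is edited as JSON, so the change above amounts to setting one key. The sketch below only uses the option name and value mentioned above. The helper name is hypothetical, and any other keys in the real config are assumed to exist and are left untouched.

```python
import json


def with_new_field_output(config: dict) -> dict:
    """Return a copy of an addon config with OCR output sent to a new field."""
    updated = dict(config)  # copy so the caller's dict is not mutated
    updated["text_output_location"] = "new_field"
    return updated


if __name__ == "__main__":
    # Start from an empty dict here; in practice you would edit the
    # existing config shown in Anki's addon config dialog.
    print(json.dumps(with_new_field_output({}), indent=2))
```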

If you want to add new languages, you need to download the appropriate language data from here.

Installation

AnkiOCR depends on the Tesseract OCR library.

If you're on Windows or Mac, Tesseract is bundled with the addon.

If you're on Linux, carefully follow the instructions here.
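On Linux, a quick way to confirm that Tesseract is installed and visible on your PATH is a check like the one below. This is a sketch and not part of AnkiOCR: the function names are hypothetical, and it assumes the executable is called tesseract, which is the usual name in distribution packages.

```python
import shutil
import subprocess
from typing import Optional


def find_tesseract(binary: str = "tesseract") -> Optional[str]:
    """Return the full path of the Tesseract executable, or None if absent."""
    return shutil.which(binary)


def tesseract_version(path: str) -> Optional[str]:
    """Return the first line printed by `tesseract --version`, if any."""
    result = subprocess.run(
        [path, "--version"], capture_output=True, text=True, check=False
    )
    # Some Tesseract releases print the version to stderr instead of stdout.
    output = result.stdout or result.stderr
    return output.splitlines()[0] if output.strip() else None


if __name__ == "__main__":
    path = find_tesseract()
    if path is None:
        print("tesseract not found on PATH; install it before running AnkiOCR")
    else:
        print(f"found {path}: {tesseract_version(path)}")
```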

AnkiOCR was built on Python 3.9.

It is highly recommended to use AnkiOCR inside the Anki application, by installing the addon from AnkiWeb. If you want to run it outside of Anki, see below:

Ensure you have pyenv and poetry installed.
Then clone the git repo: git clone https://github.com/cfculhane/AnkiOCR
Set up the environment and install dependencies: make install

Testing

make test
