-
Notifications
You must be signed in to change notification settings - Fork 190
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
gImageREADER does not find non-english dicts #13
Comments
Hi,
So in short, while you installed the spellchecking dictionaries, you are missing the actual language support for tesseract. For German, you'll want to download this [2] and place the [1] https://code.google.com/p/tesseract-ocr/downloads/list |
Hi Sandro,
I indeed followed an article of the German c't magazine 4/2015 where that is
|
So you should have something like this in
If this does not work (though that would really be a first), try only with the |
Great! Now I've got it. My mistake was that I copied the deu.traineddata into
|
Ok cool! |
Maybe quite a noobish question, but I'm trying to add the Dutch tesseract data to gImageReader. A Google search led me to this page. Since the tesseract code has been transferred to GitHub, I started looking there. I'm wondering which files exactly I should copy. All of them, or just the wordlist? |
Found it, languages can now be dowloaded at: |
Hi Sandro,
|
To which tessdata folder did you download the traineddata files? gImageReader bundles tesseract, so you need to make sure you place the traineddata files in the ...\gImageReader\share\tessdata folder. |
The following files are in the ...\gImageReader\share\tessdata folder: deu.traineddata Am 7/24/2016 um 10:12 PM schrieb Sandro Mani:
Dr. Walter T. Penzhorn |
Did you make sure you downloaded the actual binary blob and not the html page on github for the traineddata file? Can you try with the integrated tessdata manager (you'll need to start the program as administrator)? |
The following files and their sizes are in the deu.traineddata 13 054 KB
I have run the program as administrator, using the gImageReadr - without Am 7/25/2016 um 12:37 PM schrieb Sandro Mani:
Dr. Walter T. Penzhorn |
The integrated tessdata manger can be launched from the language selection menu -> "manage languages..." |
On Ubuntu the solution is
Other languages have their own myspell file, examples:
|
This is for gImageReader 3.0.1 under Windows 7.
I followed the dictionary installation instructions and downloaded the german de_DE.zip and copied the de_DE.aff and de_DE.dic into /share/myspell/dicts. They are there along with the en_US files.
But when I try to select "German" with "Recognize selection", even after "Redetect Languages" I can't select "German" (or "Deutsch"). There is just "English" -> "English (United States)" or "Multilingual" -> "English".
The text was updated successfully, but these errors were encountered: