Unicharset_extractor meet ICU ERROR with 64 bit Windows installer #1625
Comments
|
Please try with w32 version for beta.1. Does that get the same error? |
|
Thank you! I have tried win32 version for beta1. It works well. |
|
@stweil has mentioned on the page that w64 version is experimental. |
|
@yzy1996, can you provide |
|
@stweil Thank you! I am glad to help find and fix the problem. This is a simple file named This file is enough, execute |
|
This line fails. I currently have no idea why. |
|
@egorpugin Can you check whether this works with the Visual Studio 64bit version? |
|
With 64 bit build for me it is: |
|
@yzy1996 Are you using cppan builds or prebuilt installers from Mannheim? |
|
@egorpugin Thanks for testing. Are the artifacts from the AppVeyor builds directly usable? https://ci.appveyor.com/project/zdenop/tesseract/build/job/xh1atjcupa6qxlox/artifacts |
|
Sure, already for a long time. |
|
Do these include the training tools? I would like to add the info to https://github.com/tesseract-ocr/tesseract/wiki#windows What would be the best way to describe this so that people know that VS2017 and VS2015 builds are available for download? |
|
The download is a little bit tricky because there is no fixed URL. You have to select an AppVeyor build which was successful and get the artifacts, a ZIP file which contains the executables. By the way, it looks like the current AppVeyor build needs maintenance. It reports "Error uploading cache entry to the cache storage: Remote server returned 500: There is not enough space on the disk." The build time increased from about 15 minutes to about 80 minutes. |
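For anyone who wants to script that download, here is a minimal sketch. It assumes the public AppVeyor REST API layout (`/api/projects/{account}/{slug}` for the last build and `/api/buildjobs/{jobId}/artifacts` for the artifact list) and the JSON field names `build`, `jobs`, `jobId`, `status` and `fileName`. None of this comes from this thread, so check it against the AppVeyor documentation before relying on it. The helper names are hypothetical.

```python
import json
import urllib.parse
import urllib.request

# Assumed base URL of the AppVeyor REST API (not taken from this thread).
API = "https://ci.appveyor.com/api"


def fetch_json(url):
    """Download a URL and parse the response body as JSON."""
    with urllib.request.urlopen(url) as resp:
        return json.load(resp)


def successful_job_ids(build_info):
    """Return the IDs of all jobs marked as successful in a build description."""
    build = build_info.get("build", {})
    return [job["jobId"] for job in build.get("jobs", [])
            if job.get("status") == "success"]


def zip_artifact_urls(job_id, artifacts):
    """Build download URLs for every ZIP file in a job's artifact listing."""
    base = f"{API}/buildjobs/{job_id}/artifacts/"
    return [base + urllib.parse.quote(item["fileName"])
            for item in artifacts
            if item["fileName"].lower().endswith(".zip")]


# Possible usage (needs network access; account and slug as in the URL above):
#   info = fetch_json(f"{API}/projects/zdenop/tesseract")
#   for job_id in successful_job_ids(info):
#       listing = fetch_json(f"{API}/buildjobs/{job_id}/artifacts")
#       print(zip_artifact_urls(job_id, listing))
```

The two helper functions only work on data that has already been parsed, so they can be tried without touching the network.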
That is because of the additional VS2015 builds. The cppan cache on AppVeyor now exceeds 100 MB for 4 builds, so it cannot be saved. At the moment I don't have the funds to create a cppan binary cache on the internet (to cache and download binaries instead of running long source builds, which would be cheaper in terms of server resources). |
Can they not be posted in a GitHub repo? |
|
If you are talking about every cppan package, I'm afraid github will ban me for abuse. :) |
|
You could try it only for those projects where the cppan cache on AppVeyor exceeds 100 MB, starting with the tesseract deps :-) |
|
@egorpugin I am using prebuilt installers from Mannheim. |
|
It looks like the problem is caused by a buggy Cygwin package. Extract from Cygwin package files: While the 32 bit package has a large |
|
cppan is using a newer version of icudata: "Unpacking : pvt.cppan.demo.unicode.icu.i18n-61.1.0..." |
|
A local build of the code used by Cygwin works fine for me, so I have working 64 bit DLL files. Here is a short test program: |
It means you have no icu data (or partially). |
|
I reported the problem on the Cygwin mailing list. The next update of the Mannheim installer will include fixed icu DLL files. |
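To compare the ICU files shipped by two installations (for example the 32 bit and the 64 bit one), a small script like the following could list the ICU DLLs with their sizes. This is only a sketch: it assumes the relevant files are DLLs with "icu" somewhere in the file name, and the directory paths in the usage comment are hypothetical.

```python
from pathlib import Path


def icu_dll_sizes(directory):
    """Return {file name: size in bytes} for DLLs whose name contains 'icu'.

    Assumption: the ICU libraries are *.dll files with "icu" in their name.
    """
    sizes = {}
    for path in sorted(Path(directory).iterdir()):
        name = path.name.lower()
        if path.is_file() and name.endswith(".dll") and "icu" in name:
            sizes[path.name] = path.stat().st_size
    return sizes


# Possible usage (hypothetical installation directories):
#   for folder in (r"C:\Tesseract-OCR-w32", r"C:\Tesseract-OCR-w64"):
#       for name, size in icu_dll_sizes(folder).items():
#           print(folder, name, size)
```

A data DLL that is much smaller in one installation than in the other would be a hint that the ICU data is missing or incomplete there.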
|
The new installer is now available and includes two ICU DLL files which fix this issue. @yzy1996, maybe you can give it a try. I also suggest adding some information to the title of this issue: "Unicharset_extractor meet ICU ERROR with 64 bit Windows installer". |
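A quick way to re-test after installing could look like the sketch below. It runs `unicharset_extractor` on a box file and looks for the "ICU ERROR" line quoted in the issue description. It assumes that `unicharset_extractor` is on the PATH and that the message appears on standard output or standard error. The function names are hypothetical.

```python
import subprocess


def has_icu_error(output):
    """Return True if any line of the tool output reports an ICU error."""
    return any("ICU ERROR" in line for line in output.splitlines())


def extractor_reports_icu_error(box_file, exe="unicharset_extractor"):
    """Run the extractor on a box file and check its output for an ICU error.

    Assumption: the executable is on the PATH and prints the error message
    to stdout or stderr.
    """
    result = subprocess.run([exe, box_file], capture_output=True,
                            text=True, errors="replace")
    return has_icu_error(result.stdout + "\n" + result.stderr)


# Possible usage, with the box file from the issue description:
#   print(extractor_reports_icu_error("chi_my.font.exp0.box"))
```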
|
Thank you for helping to solve the problem. I still have a lot to learn, and I hope to become one of you one day. |
|
Please close the issue as solved. |
Environment
Current Behavior:
C:\Users\Jerry\Desktop\新建文件夹>unicharset_extractor chi_my.font.exp0.box
Extracting unicharset from box file chi_my.font.exp0.box
ICU ERROR: U_FILE_ACCESS_ERROR
However, I found that the error does not occur with [tesseract-ocr-setup-4.00.00dev.exe]. It occurs with [tesseract-ocr-w64-setup-v4.0.0-beta.1.20180414.exe].