Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

JXL (JPEG XL) support #4048

Closed
schrmh opened this issue Apr 7, 2023 · 1 comment
Closed

JXL (JPEG XL) support #4048

schrmh opened this issue Apr 7, 2023 · 1 comment

Comments

@schrmh
Copy link

schrmh commented Apr 7, 2023

Your Feature Request

I want to use tesseract to search text in screenshots stored in JXL but I get this when trying tesseract image.jxl text:

Error in fopenReadStream: file not found
Error in pixRead: image file not found: �
Image file � cannot be read!
Error during processing.

I have hundreds of thousands in screenshots in JXL by now since those files take on average, after lossless convertion only about 47% of the space compared to when they were PNG (cjxl -q 100 -m 1 -e 9 --brotli_effort 11 -E 3 -I 100 -g 3 -j 1 in.png out.jxl). No other common file format can compete there.

@zdenop
Copy link
Contributor

zdenop commented Apr 8, 2023

tesseract uses leptonica for image-opening images, so we can do nothing here.
The future of JPEG XL is questionable (see e.g. https://www.phoronix.com/news/Chrome-Dropping-JPEG-XL-Reasons), so I doubt if anyone would be willing to invest time and resources to support it in leptonica.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants