Store Cloud Vision OCR json files compressed in gzip format #6273
Comments
Trying nginx_static. This does not work:
Maybe because of https://trac.nginx.org/nginx/ticket/1570

This issue is stale because it has been open 90 days with no activity.

Is this issue still relevant? It seems the change has been made in production; I can only find gzipped OCR JSON files.

Still relevant: see https://github.com/openfoodfacts/openfoodfacts-server/blob/main/lib/ProductOpener/Images.pm#L1833. We do not compress yet…

As a summary (because the previous exchange on this thread does not reflect the current status):

This has been fixed by #8320

What
We currently run Cloud Vision on all source images of products, and store the result as .json files, uncompressed:
html/images/products/330/201/010/0105$ ls -lrt
total 1128
-rw-r--r-- 1 off off 152033 Mar 28 2021 1.jpg.orig
-rw-r--r-- 1 off off 182454 Mar 28 2021 1.jpg
-rw-r--r-- 1 off off 56126 Mar 28 2021 2.jpg.orig
-rw-r--r-- 1 off off 68245 Mar 28 2021 2.jpg
-rw-r--r-- 1 off off 170344 Mar 28 2021 1.json
-rw-r--r-- 1 off off 221838 Mar 28 2021 2.json
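These JSON files take a lot of space: in the listing above, the two .json files are each larger than the .jpg they were generated from. We could instead store them directly in gzip format.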
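
To make the idea concrete, here is a minimal sketch of storing a Cloud Vision result as `N.json.gz` instead of `N.json`. It is written in Python for illustration only; the server code itself is Perl, and the function names `gzip_json_file` and `load_gzipped_json` are hypothetical, not part of Product Opener.

```python
import gzip
import json
import shutil
from pathlib import Path


def gzip_json_file(json_path: Path, keep_original: bool = False) -> Path:
    """Compress json_path to '<name>.gz' next to it and return the new path.

    Hypothetical helper: the '.json.gz' naming is an assumption chosen so that
    a static-file server can find the compressed variant beside the original name.
    """
    gz_path = json_path.with_name(json_path.name + ".gz")
    # Stream the bytes so large OCR results are not loaded fully into memory.
    with open(json_path, "rb") as src, gzip.open(gz_path, "wb") as dst:
        shutil.copyfileobj(src, dst)
    if not keep_original:
        json_path.unlink()
    return gz_path


def load_gzipped_json(gz_path: Path):
    """Read a gzip-compressed JSON file back into Python objects."""
    with gzip.open(gz_path, "rt", encoding="utf-8") as f:
        return json.load(f)
```

OCR output is repetitive JSON text, so it would be expected to compress well, but the actual savings on real Cloud Vision files are not measured here.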
Proposed solution
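nginx on Debian is compiled with --with-http_gunzip_module --with-http_gzip_static_module, so it can serve pre-compressed .gz files directly and decompress them for clients that do not accept gzip.
See http://nginx.org/en/docs/http/ngx_http_gzip_static_module.html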
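
As a rough sketch of how those two modules could be combined, the fragment below uses the documented `gzip_static` and `gunzip` directives. The `location` pattern is an assumption for illustration and does not come from the project's actual nginx configuration. This has not been tested against the production setup, and one comment in this thread reports that an attempt along these lines did not work.

```nginx
# Hypothetical location block; adjust the pattern to the real image paths.
location ~ ^/images/products/.*\.json$ {
    # Always serve the pre-compressed "<file>.gz" variant, even when no
    # uncompressed file exists on disk.
    gzip_static always;
    # Decompress on the fly for clients that do not accept gzip encoding.
    gunzip on;
}
```

With `gzip_static always`, nginx does not check whether the client supports gzip, which is why `gunzip on` is paired with it here.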
Part of