Make export pipeline logs more readable #111

zigaLuksic · 2022-10-18T13:20:17Z

Silences output of gdal calls in favor of tqdm, making logs much more readable.

In the logs there was a constant warning:

Warning 1: General options of gdal_translate make the COPY_SRC_OVERVIEWS creation option ineffective as they hide the overviews

I have removed this option in this PR, but we should investigate whether that is really the way to go. Link to cogification docs
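For readers who want a picture of the silencing described at the top of this description, here is a minimal sketch. It is not the code from this PR: the `run_quietly` helper and its arguments are hypothetical, and it assumes each GDAL call is launched as a subprocess. Standard output is discarded so that only the tqdm bar is shown, and stderr is captured so it remains available if a call fails.

```python
import subprocess
import sys
from typing import Sequence

try:
    from tqdm import tqdm
except ImportError:  # fallback so the sketch still runs without tqdm installed
    def tqdm(iterable, **kwargs):
        return iterable


def run_quietly(commands: Sequence[Sequence[str]], desc: str = "Exporting") -> int:
    """Run each command with its output hidden, showing one progress bar instead.

    Hypothetical helper: returns the number of commands that completed.
    """
    finished = 0
    for command in tqdm(commands, desc=desc):
        # stdout is dropped entirely; stderr is captured so that a failure
        # (raised as CalledProcessError because of check=True) carries it.
        subprocess.run(
            command,
            check=True,
            stdout=subprocess.DEVNULL,
            stderr=subprocess.PIPE,
            text=True,
        )
        finished += 1
    return finished


if __name__ == "__main__":
    # Stand-in commands; in the pipeline these would be the gdal calls.
    run_quietly([[sys.executable, "-c", "print('noisy output')"]] * 3)
```

Letting the subprocess print directly to the terminal would interleave its text with the progress bar, which is the breakage mentioned later in this thread.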

mlubej

Not much to review, the changes make sense to me. One question, should this be parametrizable via the pipeline? I guess not, but just wanted to point it out anyway.

zigaLuksic · 2022-10-18T14:02:27Z

> Not much to review, the changes make sense to me. One question, should this be parametrizable via the pipeline? I guess not, but just wanted to point it out anyway.

I was thinking about it, but I didn't see much point in keeping the old logs. Also, not silencing them breaks the tqdm progress bar.

zigaLuksic · 2022-10-19T08:32:42Z

I talked to the GDAL wizards and they informed me that we are idiots (which we knew) and are using a cogification method meant for old GDAL and/or planar TIFFs, which is not suitable for us (which we didn't know).

So I switched to the recommended way of cogification for newer GDAL versions.
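As an illustration only, and not the code from this PR: in GDAL 3.1 and newer, a cloud-optimized GeoTIFF can be written directly with the dedicated COG output driver (`-of COG`). The sketch below assembles such a `gdal_translate` call; the function name, paths, and default compression are assumptions made for the example.

```python
from typing import List


def cog_translate_command(input_path: str, output_path: str, compress: str = "DEFLATE") -> List[str]:
    """Build a gdal_translate call that writes a COG via the COG driver.

    Hypothetical helper; requires GDAL 3.1 or newer for `-of COG`.
    """
    return [
        "gdal_translate",
        "-of", "COG",                  # dedicated COG driver
        "-co", f"COMPRESS={compress}",  # creation option passed to the driver
        input_path,
        output_path,
    ]


if __name__ == "__main__":
    print(" ".join(cog_translate_command("input.tiff", "output_cog.tiff")))
```

The resulting list can be passed to `subprocess.run` as is.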

mlubej · 2022-10-19T08:34:24Z

> So I switched to the recommended way of cogification for newer GDAL versions.

Did you check if the GDAL version used is the appropriate one? Should be GDAL 3.1 or newer.

mlubej · 2022-10-19T08:38:40Z

I remembered one more potential issue: if you're using deflate compression with Float32 values, you should set the predictor to 3.

From the docs:

> NOTE: for many types of data adding a predictor can further reduce the file size. It is best you test this on your own data. To enable the predictor, add to the above command -co PREDICTOR=2 for integers, and -co PREDICTOR=3 for floating points.

zigaLuksic · 2022-10-19T08:52:11Z

> > So I switched to the recommended way of cogification for newer GDAL versions.
>
> Did you check if the GDAL version used is the appropriate one? Should be GDAL 3.1 or newer.

GDAL 3.1 was released in 2020. Do you think we need to check the version and raise an exception if the GDAL version is older than that?

> I remembered one more potential issue: if you're using deflate compression with Float32 values, you should set the predictor to 3.

Hmmm, at this point perhaps the utility functions should have an is_discrete flag which then sets these parameters accordingly 🤔

mlubej · 2022-10-19T08:58:14Z

> GDAL 3.1 was released in 2020. Do you think we need to check the version and raise an exception if the GDAL version is older than that?

It might make sense, because older versions of GDAL had some mismatch issues where the resulting tiff could contain an offset. Not sure how relevant it is, given that 3.1 was released in 2020, but I imagine it could happen.

Some more info I remember from https://git.sinergise.com/sentinel-core/java/-/issues/1400
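A minimal sketch of the version check discussed above, under stated assumptions: the helper names are hypothetical, the version is read from the output of `gdalinfo --version` (which looks like `GDAL 3.4.1, released 2021/12/27`), and a warning is issued rather than an exception. It is not the implementation from this PR.

```python
import re
import subprocess
import warnings
from typing import Tuple

MIN_GDAL_VERSION = (3, 1, 0)  # the COG driver is available from GDAL 3.1


def parse_gdal_version(version_output: str) -> Tuple[int, ...]:
    """Extract (major, minor, patch) from the text printed by `gdalinfo --version`."""
    match = re.search(r"GDAL (\d+)\.(\d+)\.(\d+)", version_output)
    if match is None:
        raise ValueError(f"Could not find a GDAL version in: {version_output!r}")
    return tuple(int(part) for part in match.groups())


def warn_if_gdal_too_old(version_output: str) -> bool:
    """Warn and return True when the reported version is older than MIN_GDAL_VERSION."""
    version = parse_gdal_version(version_output)
    if version < MIN_GDAL_VERSION:
        warnings.warn(
            f"GDAL {'.'.join(map(str, version))} is older than 3.1; cogification may misbehave."
        )
        return True
    return False


def installed_gdal_version_output() -> str:
    """Ask the installed GDAL for its version string (requires gdalinfo on PATH)."""
    result = subprocess.run(["gdalinfo", "--version"], capture_output=True, text=True, check=True)
    return result.stdout
```

A caller would typically run `warn_if_gdal_too_old(installed_gdal_version_output())` once, before starting the export.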

> Hmmm, at this point perhaps the utility functions should have an is_discrete flag which then sets these parameters accordingly 🤔

What if we use the dtype parameter for this, since it's limited to ["int8", "int16", "uint8", "uint16", "float32"]? This way, if the user provides float32 (or if float32 is recognized automatically), then predictor 3 would be used. I would assume that if the inputs are 1.0, 2.0, 3.0, ... the user would provide dtype int8 to use integers.
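The suggestion above could look roughly like the following sketch. The function names are hypothetical and this is not the code that was merged; it only maps the allowed dtype values to the predictor numbers quoted from the docs (2 for integers, 3 for floating point) and formats them as `-co` creation options.

```python
from typing import List

# The dtype values listed in the comment above.
SUPPORTED_DTYPES = ("int8", "int16", "uint8", "uint16", "float32")


def predictor_for_dtype(dtype: str) -> int:
    """Return 3 for floating-point dtypes and 2 for integer dtypes."""
    if dtype not in SUPPORTED_DTYPES:
        raise ValueError(f"Unsupported dtype {dtype!r}, expected one of {SUPPORTED_DTYPES}")
    return 3 if dtype.startswith("float") else 2


def compression_options(dtype: str) -> List[str]:
    """Creation options for deflate compression with a dtype-appropriate predictor."""
    return ["-co", "COMPRESS=DEFLATE", "-co", f"PREDICTOR={predictor_for_dtype(dtype)}"]


if __name__ == "__main__":
    for name in SUPPORTED_DTYPES:
        print(name, compression_options(name))
```

Deriving the predictor from dtype avoids a separate is_discrete flag, since the integer/float distinction is already carried by the dtype.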

zigaLuksic · 2022-10-19T09:04:14Z

> What if we use the dtype parameter for this, since it's limited to ["int8", "int16", "uint8", "uint16", "float32"]? This way, if the user provides float32 (or if float32 is recognized automatically), then predictor 3 would be used. I would assume that if the inputs are 1.0, 2.0, 3.0, ... the user would provide dtype int8 to use integers.

How in the absolute hell did I miss that I have dtype at my disposal...

shorten logs by using tqdm instead of gdals progress

012af6b

mlubej approved these changes Oct 18, 2022

switch to suggested method of creating COGs

c966f72

adjust resampling options and fix tests to remove warnings

821e617

zigaLuksic added 2 commits October 19, 2022 11:08

Use dtype param for configuring cogification

8beab75

add warning for old GDAL versions

b68eda3

zigaLuksic merged commit 2210bff into develop Oct 19, 2022

zigaLuksic deleted the export-pipeline-logs branch October 19, 2022 09:58
