Return None for unreadable images and try to infer num channels #1307

hungcs · 2021-09-20T03:46:24Z

No description provided.

tgaddair

Nice fix! Only question is about need to new parameter.

ludwig/features/image_feature.py

tgaddair · 2021-09-20T20:24:48Z

ludwig/utils/image_utils.py

@@ -30,16 +31,15 @@
 logger = logging.getLogger(__name__)


+@functools.lru_cache(maxsize=32)


Is the purpose this cache to avoid downloading the image twice when we infer the image dimensions (1) and then process it (2)?

Yeah, or to avoid downloading images with the same url in each step

tgaddair · 2021-09-20T20:28:06Z

ludwig/features/image_feature.py

+
+                height, width = round(height_avg), round(width_avg)
+                logger.debug("Inferring height: {0} and width: {1}".format(height, width))
+            elif first_image is not None:


It looks like we can generalize this first_image code path as a special case of the infer_image_dimensions case. Specifically, what if we removed the infer_image_dimensions param and just used infer_image_sample_size. Then if the user sets this value to 1, we get the same effect as setting infew_image_dimensions=False, right?

sounds good!

actually I just realized there is a slight difference -infer_image_dimensions also sets should_resize = True, which will prevent Ludwig from throwing an error on the first mismatch - only having infer_image_sample_size=1 will mean that a size mismatch error will never be thrown because everything will be silently resized

I see. In this case, I would be in favor of making should_resize=True the default (because the default is now to infer image dimensions). Then if the user wants the old behavior, they would need to set should_resize=False manually. An additional thing we can do in this case, when should_resize=False and infer_image_sample_size > 1 is raise the error if not all the elements of the sample match. Would that solve the issue?

By user setting the older behavior, do you mean exposing should_resize and user_specified_num_channels through preprocessing parameters? Or meaning they would just edit should_resize = False in the code.

If we expose should_resize and user_specified_num_channels through preprocessing parameters, then those would just be replacing infer_image_dimensions, so I don't know if that would decrease complexity. There also could be conflicts (i.e. user specifies explicit width and height but then has should_resize set to False.

One possible solution is to make infer_image_sample_size = 0 the old case, to not infer anything and fail on any mismatch, and then anything > 0 would implicitly mean should_resize = True, so we wouldn't need any other parameters. The only question is if = 0 is intuitive enough to encode this information

Hmm, yeah I see the issue here. I think given how much we would need to overhaul the config params to make it work, it's probably fine to leave it as-is for now.

hungcs added 5 commits September 19, 2021 16:47

Fail gracefully for 404 image urls

10c27ce

logging

4c680d7

cache

f4d5cbf

handle chanels

47860cd

typo

1f06e22

hungcs requested a review from tgaddair September 20, 2021 03:46

hungcs added 4 commits September 19, 2021 21:43

move caching

831805b

iloc

909acfb

head

4f9f3d9

tests

a4a5f3d

tgaddair reviewed Sep 20, 2021

View reviewed changes

ludwig/features/image_feature.py Show resolved Hide resolved

hungcs added 2 commits September 20, 2021 10:06

remove infer_image_num_channels

15a09eb

cleanup

50a3b43

hungcs requested a review from tgaddair September 20, 2021 17:23

tgaddair reviewed Sep 20, 2021

View reviewed changes

tgaddair approved these changes Sep 21, 2021

View reviewed changes

tgaddair merged commit f3fcae7 into master Sep 21, 2021

tgaddair deleted the image-fail branch September 21, 2021 15:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Return None for unreadable images and try to infer num channels #1307

Return None for unreadable images and try to infer num channels #1307

hungcs commented Sep 20, 2021

tgaddair left a comment

tgaddair Sep 20, 2021

hungcs Sep 20, 2021

tgaddair Sep 20, 2021

hungcs Sep 20, 2021

hungcs Sep 20, 2021 •

edited

Loading

tgaddair Sep 20, 2021

hungcs Sep 20, 2021 •

edited

Loading

tgaddair Sep 21, 2021

		@@ -30,16 +31,15 @@
		logger = logging.getLogger(__name__)


		@functools.lru_cache(maxsize=32)

Return None for unreadable images and try to infer num channels #1307

Return None for unreadable images and try to infer num channels #1307

Conversation

hungcs commented Sep 20, 2021

tgaddair left a comment

Choose a reason for hiding this comment

tgaddair Sep 20, 2021

Choose a reason for hiding this comment

hungcs Sep 20, 2021

Choose a reason for hiding this comment

tgaddair Sep 20, 2021

Choose a reason for hiding this comment

hungcs Sep 20, 2021

Choose a reason for hiding this comment

hungcs Sep 20, 2021 • edited Loading

Choose a reason for hiding this comment

tgaddair Sep 20, 2021

Choose a reason for hiding this comment

hungcs Sep 20, 2021 • edited Loading

Choose a reason for hiding this comment

tgaddair Sep 21, 2021

Choose a reason for hiding this comment

hungcs Sep 20, 2021 •

edited

Loading

hungcs Sep 20, 2021 •

edited

Loading