Vectorize image similarity interfaces #110

schencej · 2022-04-08T18:41:38Z

Similarity interfaces vectorized per out discussion.

One point of contention:

Currently the query images input for the high-level interface is an Iterable. If we switched to Sequence we could check that the number of output heatmaps matches the number of input images

codecov · 2022-04-08T18:42:59Z

Codecov Report

Merging #110 (8f89df8) into master (d51dcc3) will increase coverage by 0.00%.
The diff coverage is 100.00%.

❗ Current head 8f89df8 differs from pull request most recent head 2aed5af. Consider uploading reports for the commit 2aed5af to get more accurate results

@@           Coverage Diff           @@
##           master     #110   +/-   ##
=======================================
  Coverage   99.87%   99.87%           
=======================================
  Files          57       57           
  Lines        2327     2329    +2     
=======================================
+ Hits         2324     2326    +2     
  Misses          3        3

Flag	Coverage Δ
unittests	`99.87% <100.00%> (+<0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
xaitk_saliency/exceptions.py	`60.00% <ø> (-6.67%)`	⬇️
...cy/impls/gen_image_similarity_blackbox_sal/sbsm.py	`100.00% <ø> (ø)`
...aitk_saliency/interfaces/gen_descriptor_sim_sal.py	`100.00% <ø> (ø)`
.../gen_descriptor_sim_sal/test_similarity_scoring.py	`100.00% <100.00%> (ø)`
...pls/gen_image_similarity_blackbox_sal/test_sbsm.py	`100.00% <100.00%> (ø)`
...imilarity_blackbox_sal/test_sim_occlusion_based.py	`100.00% <100.00%> (ø)`
tests/interfaces/test_gen_descriptor_sim_sal.py	`100.00% <100.00%> (ø)`
...terfaces/test_gen_image_similarity_blackbox_sal.py	`100.00% <100.00%> (ø)`
...impls/gen_descriptor_sim_sal/similarity_scoring.py	`100.00% <100.00%> (ø)`
...n_image_similarity_blackbox_sal/occlusion_based.py	`100.00% <100.00%> (ø)`
... and 1 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update d51dcc3...2aed5af. Read the comment docs.

xaitk_saliency/impls/gen_descriptor_sim_sal/similarity_scoring.py

xaitk_saliency/interfaces/gen_descriptor_sim_sal.py

tests/interfaces/test_gen_image_similarity_blackbox_sal.py

brianhhu

Overall, LGTM- thanks for the update! Added a few minor suggestions for typo and grammar fixes.

xaitk_saliency/impls/gen_descriptor_sim_sal/similarity_scoring.py

xaitk_saliency/interfaces/gen_descriptor_sim_sal.py

xaitk_saliency/interfaces/gen_image_similarity_blackbox_sal.py

schencej · 2022-04-12T13:58:45Z

@brianhhu @Purg Did you see my initial comment above? I had an implementation question about the type of our query images input.

Purg · 2022-04-12T18:21:04Z

Did you see my initial comment above?

I did! I thought I wrote a comment related to that, but I see that I must not have (or missed pressing the "Comment" button).

Changing the requirement to be a Sequence would be the simplest way to allow that check at the interface level.

Alternatively, we can still acquire such a count without requiring an explicit sequence by creating a wrapper iterator around it:

def generate(...):
    ...
    num_query = 0
    def wrap():
        nonlocal num_query
        for q in query_images:
            yield q
            num_query += 1

    # the same "get actual output" line as before, but changing the iterable input to the wrapper iterator
    output = self._generate(ref_image, wrap(), blackbox)

    # num_query should now be the number of query images
    assert output.shape[0] == num_query, f"Output heatmaps didn't match input images: {output.shape[0]} != {num_query}"

    ...

I suppose technically the iterator count be copied into separate processes (e.g. torch dataloader with iterable dataset), and there is a solution to that as well using multiprocessing.Manager(), but I an open to feedback from @brianhhu + others on if that complexity is OK for this point in time. I'm sure the following could be improved because the space of corner cases is a bit larger.

def generate(...):
    ...
    with multiprocessing.Manager() as manager:
        num_query = manager.Value(int, 0)
        idx_obs = manager.dict()
        lock = manager.Lock()
        def wrap():
            nonlocal lock, count, idx_obs
            for i, q in enumerate(query_images):
                yield q
                with lock:
                    if i not in idx_obs:
                        num_query.value += 1
                        idx_obs[i] = 1

        output = self._generate(ref_image, wrap(), blackbox)

        # num_query should now be the number of query images
        assert output.shape[0] == num_query.value, \
            f"Output heatmaps didn't match input images: {output.shape[0]} != {num_query.value}"

    ...

brianhhu · 2022-04-13T12:33:05Z

I have no strong opinion on this matter, but what are the drawbacks of just making it a Sequence? At first glance, it looks like a lot of added complexity for that check if we decide to still use iterators. In the end, I'm open to whatever we think is best- thanks!

Purg · 2022-04-13T19:49:37Z

but what are the drawbacks of just making it a Sequence?

The only obvious drawback is that we would not be able to operate on an unsized container, e.g. a stream, but considering that use-case in this context is likely over-design. I think your comment on complexity is valid. We can change to the simpler Sequence type for input for now until we identify use-cases that would actually require Iterable.

brianhhu · 2022-04-15T12:26:15Z

xaitk_saliency/impls/gen_descriptor_sim_sal/similarity_scoring.py

@@ -98,7 +94,7 @@ def generate(
        sal = np.clip(sal, -1, 1)

        # return just HxW components


This comment is extraneous given we reverted back to returning NxHxW saliency maps.

brianhhu

LGTM, thanks!

xaitk_saliency/impls/gen_descriptor_sim_sal/similarity_scoring.py

sonarcloud · 2022-04-19T21:44:12Z

Kudos, SonarCloud Quality Gate passed!

0 Bugs
0 Vulnerabilities
0 Security Hotspots
0 Code Smells

No Coverage information
0.0% Duplication

schencej · 2022-04-19T21:44:15Z

Addressed the last comment and autosquashed. Should be good to go.

schencej force-pushed the vectorize-similarity branch from 8fb1603 to cba7ef8 Compare April 8, 2022 19:19

Purg reviewed Apr 11, 2022

View reviewed changes

brianhhu reviewed Apr 11, 2022

View reviewed changes

brianhhu reviewed Apr 15, 2022

View reviewed changes

brianhhu approved these changes Apr 15, 2022

View reviewed changes

Purg reviewed Apr 19, 2022

View reviewed changes

xaitk_saliency/impls/gen_descriptor_sim_sal/similarity_scoring.py Outdated Show resolved Hide resolved

Vectorize image similarity interfaces

2aed5af

schencej force-pushed the vectorize-similarity branch from 8f89df8 to 2aed5af Compare April 19, 2022 21:43

Purg approved these changes Apr 20, 2022

View reviewed changes

Purg merged commit 2508e77 into XAITK:master Apr 20, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vectorize image similarity interfaces #110

Vectorize image similarity interfaces #110

schencej commented Apr 8, 2022

codecov bot commented Apr 8, 2022 •

edited

brianhhu left a comment

schencej commented Apr 12, 2022

Purg commented Apr 12, 2022 •

edited

brianhhu commented Apr 13, 2022

Purg commented Apr 13, 2022 •

edited

brianhhu Apr 15, 2022

brianhhu left a comment

sonarcloud bot commented Apr 19, 2022

schencej commented Apr 19, 2022

		@@ -98,7 +94,7 @@ def generate(
		sal = np.clip(sal, -1, 1)

		# return just HxW components

Vectorize image similarity interfaces #110

Vectorize image similarity interfaces #110

Conversation

schencej commented Apr 8, 2022

codecov bot commented Apr 8, 2022 • edited

Codecov Report

brianhhu left a comment

Choose a reason for hiding this comment

schencej commented Apr 12, 2022

Purg commented Apr 12, 2022 • edited

brianhhu commented Apr 13, 2022

Purg commented Apr 13, 2022 • edited

brianhhu Apr 15, 2022

Choose a reason for hiding this comment

brianhhu left a comment

Choose a reason for hiding this comment

sonarcloud bot commented Apr 19, 2022

schencej commented Apr 19, 2022

codecov bot commented Apr 8, 2022 •

edited

Purg commented Apr 12, 2022 •

edited

Purg commented Apr 13, 2022 •

edited