
To tile coords #939

Merged: 7 commits merged into master, Sep 25, 2023
Conversation

XinyueLi1012 (Collaborator)
Avoid the for loop over tiles by using sort() instead; the loop is slow when the image is big (e.g. 4000×4000).
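As a rough illustration of the vectorized idea (a NumPy sketch under assumed names; the real code operates on torch tensors in bliss/catalog.py): instead of looping over every tile and testing which sources fall inside it, compute each source's flat tile index once, then group sources by sorting on that index.

```python
import numpy as np

tile_slen = 4
n_tiles_h = n_tiles_w = 1000                   # a 4000 x 4000 image
rng = np.random.default_rng(0)
plocs = rng.uniform(0, 4000, size=(5000, 2))   # (row, col) pixel coords

tile_rc = np.floor(plocs / tile_slen).astype(int)     # per-source tile (row, col)
flat_idx = tile_rc[:, 0] * n_tiles_w + tile_rc[:, 1]  # one flat index per source
order = np.argsort(flat_idx, kind="stable")           # groups sources by tile
sorted_plocs = plocs[order]                           # tile-contiguous ordering
counts = np.bincount(flat_idx, minlength=n_tiles_h * n_tiles_w)  # sources per tile
```

One sort over n_sources replaces a pass over a million tiles, which is where the speedup comes from.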

codecov bot commented Sep 16, 2023

Codecov Report

Merging #939 (d4e91c9) into master (0a8b6db) will increase coverage by 0.05%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master     #939      +/-   ##
==========================================
+ Coverage   95.82%   95.88%   +0.05%     
==========================================
  Files          26       26              
  Lines        3211     3233      +22     
==========================================
+ Hits         3077     3100      +23     
+ Misses        134      133       -1     
Flag Coverage Δ
unittests 95.88% <100.00%> (+0.05%) ⬆️

Flags with carried forward coverage won't be shown.

Files Changed Coverage Δ
bliss/catalog.py 99.66% <100.00%> (+0.39%) ⬆️
bliss/data_augmentation.py 98.43% <100.00%> (ø)


jeff-regier (Contributor) commented Sep 17, 2023

Great work @XinyueLi1012! I timed both the old and new versions of to_tile_coords: your new one is more than 10 times faster (0.11 seconds vs 1.6 seconds).

The output from both implementations usually matches, so I think the new implementation is at least nearly right. (Possibly it is perfect and the old implementation is slightly off.) But I'm reluctant to merge until they match exactly or until we're really confident that the new version is correct. Please see my trial below. The difference may have to do with how these two implementations handle negative locations. Should we be filtering sources with out-of-bounds locations (either too high or too low) as part of our data augmentation procedure?

> new_tp = aug_full.to_tile_params(4, 4)
> old_tp = aug_full.old_to_tile_params(4, 4)
> aug_full.n_sources.sum()
tensor(4681, device='cuda:0')
> new_tp.n_sources.sum()
tensor(4681, device='cuda:0')
> old_tp.n_sources.sum()
tensor(4681, device='cuda:0')
> (new_tp.locs < 0).sum()
tensor(0, device='cuda:0')
> (old_tp.locs < 0).sum()
tensor(190, device='cuda:0')
> new_tp.locs.isclose(old_tp.locs).sum()
tensor(204610, device='cuda:0')
> (~new_tp.locs.isclose(old_tp.locs)).sum()
tensor(190, device='cuda:0')

jeff-regier (Contributor)

Also, would you add a test to verify that if we rotate a catalog by 90 degrees and then by 270 degrees, we get back the original catalog? (Or do we already have a test like that?)
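A sketch of such a round-trip test, using plain coordinate math rather than the real aug_rotate90/aug_rotate270 helpers (the functions below are hypothetical stand-ins): rotating continuous (row, col) coordinates 90 degrees counterclockwise in an H×W image maps (r, c) to (W − c, r) in the new W×H image, and 270 degrees is three such rotations, so the composition should be the identity.

```python
import numpy as np

def rot90(plocs, height, width):
    # Hypothetical helper: 90-degree CCW rotation of (row, col) coords.
    r, c = plocs[:, 0], plocs[:, 1]
    return np.stack([width - c, r], axis=1), width, height

def rot270(plocs, height, width):
    # 270 degrees CCW = three 90-degree rotations.
    for _ in range(3):
        plocs, height, width = rot90(plocs, height, width)
    return plocs, height, width

rng = np.random.default_rng(0)
h, w = 80, 120
plocs = rng.uniform([0, 0], [h, w], size=(50, 2))

p90, h90, w90 = rot90(plocs, h, w)          # rotated catalog, W x H image
back, hb, wb = rot270(p90, h90, w90)        # should recover the original
```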

XinyueLi1012 (Collaborator, Author)

Thanks for the testing results. I didn't do much testing for the new implementation, so I'm also a little worried about some special cases. I'll add some tests for the new implementation and data augmentation.

XinyueLi1012 (Collaborator, Author)

@jeff-regier I added some filtering for the negative locs in to_tile_params (easier than filtering in data augmentation).

jeff-regier (Contributor)

Thanks @XinyueLi1012. Before we merge, can you also verify that our performance on DC2 stays the same with this PR?

_, aug_full270 = aug_rotate270(origin_full, aug_input_images)

_, aug_full90180 = aug_rotate180(aug_full90, aug_image90)
_, aug_full90270 = aug_rotate270(aug_full90, aug_image90)
Contributor:

This is a good test to have, but I realize it doesn't quite test what I thought it would, because there's no conversion between tile and full catalogs happening.

Would you add an additional test that converts from a tile catalog to a full catalog and then back again to a tile catalog? (And that the first and last tile catalogs are equal?)
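A minimal sketch of the tile-to-full-to-tile round trip suggested above, with to_tile_coords and to_full_coords as hypothetical stand-ins for the real TileCatalog/FullCatalog conversions in bliss/catalog.py:

```python
import numpy as np

tile_slen = 4

def to_tile_coords(plocs):
    # Hypothetical stand-in: tile index plus fractional in-tile position.
    tiles = np.floor(plocs / tile_slen).astype(int)
    locs = plocs / tile_slen - tiles
    return tiles, locs

def to_full_coords(tiles, locs):
    # Inverse: recover absolute pixel coordinates from tile coordinates.
    return (tiles + locs) * tile_slen

rng = np.random.default_rng(1)
plocs = rng.uniform(0, 40, size=(100, 2))
tiles, locs = to_tile_coords(plocs)
tiles2, locs2 = to_tile_coords(to_full_coords(tiles, locs))  # round trip
```

The test would then assert that (tiles, locs) and (tiles2, locs2) are equal.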

Collaborator (Author):

Yes, that makes sense.

bliss/catalog.py Outdated
x0_mask = (plocs_ii[:, 0] > 0) & (plocs_ii[:, 0] < self.height)
x1_mask = (plocs_ii[:, 1] > 0) & (plocs_ii[:, 1] < self.width)
x_mask = x0_mask * x1_mask
n_filter_sources = x_mask.sum()
jeff-regier (Contributor) commented Sep 20, 2023

I think it'd be better to do the filtering only on demand: better to give users an error if they try to convert catalog formats without first removing out-of-bounds locations. Would you add a new method called something like filter_out_of_bounds? And then call that method following data augmentation?

Collaborator (Author):

That would be tricky. Let me have a try first.

Contributor:

Here's an alternative that may be less tricky: add a keyword argument to to_tile_coords called filter_oob which is false by default. Only if it's true will we filter light sources that are out of bounds (either too negative or too large).
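A sketch of that keyword under an assumed signature (NumPy stand-in; the real method is torch-based and lives on the catalog classes in bliss/catalog.py):

```python
import numpy as np

def to_tile_coords(plocs, height, width, tile_slen, filter_oob=False):
    """Hypothetical stand-in, not the bliss API."""
    oob = (
        (plocs[:, 0] <= 0) | (plocs[:, 0] >= height)
        | (plocs[:, 1] <= 0) | (plocs[:, 1] >= width)
    )
    if filter_oob:
        plocs = plocs[~oob]  # drop out-of-bounds sources on request
    elif oob.any():
        # default: refuse to silently mangle out-of-bounds sources
        raise ValueError("out-of-bounds sources; pass filter_oob=True")
    tiles = np.floor(plocs / tile_slen).astype(int)
    locs = plocs / tile_slen - tiles  # fractional position within the tile
    return tiles, locs

plocs = np.array([[-0.5, 2.0], [3.0, 5.0]])
tiles, locs = to_tile_coords(plocs, 8, 8, 4, filter_oob=True)  # keeps 1 source
```

With the default filter_oob=False, the same call raises instead of quietly producing negative locs.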

Collaborator (Author):

that's clever

bliss/catalog.py Outdated
@@ -504,7 +510,9 @@ def to_tile_params(
mask_sources = mask_sources.scatter_(1, top_indices, 1) & source_mask

# get n_sources for each tile
- tile_n_sources[ii] = mask_sources.reshape(n_tiles_h, n_tiles_w, n_sources).sum(-1)
+ tile_n_sources[ii] = mask_sources.reshape(n_tiles_h, n_tiles_w, n_filter_sources).sum(
+     -1
+ )
Contributor:

Please shorten a variable name here or somehow rewrite this assignment so there isn't a stray -1 alone on a line.
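One possible rewrite along those lines (dummy shapes and a hypothetical short alias, just to make the snippet runnable):

```python
import numpy as np

# Dummy shapes; in the real code these come from the tile grid and the
# filtered source count.
n_tiles_h, n_tiles_w, n_filt = 3, 4, 5
mask_sources = np.ones((n_tiles_h * n_tiles_w, n_filt), dtype=bool)

# Binding the long count to a short name keeps the reshape/sum on one
# line, so there is no stray -1 wrapped onto its own line:
tile_n_sources = mask_sources.reshape(n_tiles_h, n_tiles_w, n_filt).sum(-1)
```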

XinyueLi1012 (Collaborator, Author)

> Thanks @XinyueLi1012. Before we merge, can you also verify that our performance on DC2 stays the same with this PR?

Sometimes when I run the new to_tile_coords on the full DC2 catalog (image size 4000×4000) in a Jupyter notebook, the kernel dies; I'm digging into it to find whether there is a bug. The performance using the new to_tile_coords should be similar to the results we got previously.

jeff-regier merged commit bd59bd1 into master on Sep 25, 2023 (3 checks passed)
jeff-regier deleted the to-tile-coords branch September 25, 2023 16:17
XinyueLi1012 mentioned this pull request Feb 8, 2024
Successfully merging this pull request may close these issues:

to_tile_params is too slow to be used for data augmentation