Add segmentation.RandomMaskPixel operator #2445

jantonguirao · 2020-11-09T16:18:14Z

Signed-off-by: Joaquin Anton janton@nvidia.com

Why we need this PR?

Pick one, remove the rest

It adds new feature needed to be able to perform a random crop operation where foreground is always represented up to a minimum ratio

What happened in this PR?

Fill relevant points, put NA otherwise. Replace anything inside []

What solution was applied:
Added a new operator segmentation.RandomMaskPixel that selects a pixel either randomly or to point to any of the available foreground pixels, specified by an input mask and threhold/value arguments
Affected modules and functionalities:
No existing functionality is affected
Key points relevant for the review:
All
Validation and testing:
Python tests
Documentation (including examples):
Operator documentation in schema

JIRA TASK: [DALI-1723]

Signed-off-by: Joaquin Anton <janton@nvidia.com>

JanuszL · 2020-11-09T16:30:37Z

dali/operators/segmentation/biased_crop_center.cc

+
+DALI_SCHEMA(segmentation__BiasedCropCenter)
+    .DocStr(R"(Selects a cropping window center which can be selected randomly from either
+any position in the input or any position of a foreground pixel in the input, based on an


Suggested change

any position in the input or any position of a foreground pixel in the input, based on an

at any position in the input or any position of a foreground pixel in the input, based on an

JanuszL · 2020-11-09T16:33:14Z

dali/operators/segmentation/biased_crop_center.cc

+
+DALI_SCHEMA(segmentation__BiasedCropCenter)
+    .DocStr(R"(Selects a cropping window center which can be selected randomly from either
+any position in the input or any position of a foreground pixel in the input, based on an


I don't know if we should use foreground or just non zero value, because the meaning of the foreground if not defined.

JanuszL · 2020-11-09T16:38:33Z

dali/operators/segmentation/biased_crop_center.cc

+
+When foreground != 0, the cropping center is first picked to match any of the foreground
+pixels in the input. If the selected crop center results in an out of bounds cropping window,
+the center is shifted as necessary so that the window remains within bounds.


I wonder if it makes any sense not to shift?

I think both things can make sense. If cropping out of bounds is OK, the user can let shape=None and any pixel would be considered a valid center. If the shape is specified, we'll assure that the cropping window is within bounds.

We've already had github requests for out-of-bounds cropping. Perhaps we should make this configurable.

We already have out-of-bounds cropping, and this is already configured (providing a shape or not).

JanuszL · 2020-11-09T16:48:37Z

dali/operators/segmentation/biased_crop_center.cc

+
+  has_crop_shape_ = spec_.ArgumentDefined("shape");
+  if (has_crop_shape_) {
+    GetShapeArgument(crop_shape_, spec_, "shape", ws, ndim, nsamples);


Shouldn't you do anything with the return value as well?

not really, I don't need to infer the batch size or the number of dimensions.

dali/operators/segmentation/biased_crop_center.cc

JanuszL · 2020-11-09T17:07:55Z

dali/operators/segmentation/biased_crop_center.cc

+      [&, sample_idx](int thread_id) {
+        auto mask = masks_view[sample_idx];
+        auto center = center_view[sample_idx];
+        SearchableRLEMask rle_mask(mask);


Can you tell how fast/slow it is for the typical inputs?
Do we assume that masks are uniform regarding the size across the batch or they can vary?
If can vary maybe we can differently split the work across the threads. Ask each thread to create a mask for only one plan in each volume.
Does it make any sense to have it on the GPU?

And how that works with 3D masks?

I can benchmark it. I wouldn't assume uniform shape.
SearchableRLEMask works on flat indices. We are not doing anything with the dimensions.
Regarding the GPU, encoding the mask is a rather serial task. Having a mask per plane would complicate things, so I wouldn't go there unless we see there is a performance bottleneck here. On top of that, Slice operator doesn't take GPU anchor/shape.

Ok, let us benchmark first and see if there is anything to fight for.

mzient · 2020-11-10T10:07:53Z

dali/operators/segmentation/biased_crop_center.cc

+          // Adjust center if necessary
+          if (has_crop_shape_) {
+            for (int d = 0; d < ndim; d++) {
+              int64_t w = crop_sh[d] >> 1;


I don't know what this w was supposed to mean, but it read as width (which apprently it isn't).

Suggested change

int64_t w = crop_sh[d] >> 1;

int64_t half = crop_sh[d] >> 1;

mzient · 2020-11-10T10:12:02Z

dali/operators/segmentation/biased_crop_center.cc

+            for (int d = 0; d < ndim; d++) {
+              int64_t w = crop_sh[d] >> 1;
+              center.data[d] =
+                  boundary::idx_clamp(center.data[d], w, mask_sh[d] - (crop_sh[d] - w));


Suggested change

boundary::idx_clamp(center.data[d], w, mask_sh[d] - (crop_sh[d] - w));

clamp(center.data[d], w, mask_sh[d] - (crop_sh[d] - w));

If you're just clamping, you don't need any fancy boundary handling (ordinary clamp will do).
Also, boundary clamp is an exclusive clamp (to size-1) and we want to have inclusive clamp.
Example:
mask_sh[d] = 100
crop_sh[d] = 10
w = 5
maximum valid value is 95 (the window will start at 90 and end at 99)

Signed-off-by: Joaquin Anton <janton@nvidia.com>

mzient · 2020-11-12T17:58:37Z