COCO Reader pixelwise_masks Emtpy Output #5404

5had3z · 2024-04-02T01:31:18Z

Version

1.35.0

Describe the bug.

When enabling pixelwise_masks (with intention to use for panoptic sementation), the output image is only zeros (with sometimes a stray non-zero pixel, which is also concerning).

I've tried this with both the normal coco dataset, and with examples in DALI_extra, both have the same behaviour.

I'm also not sure what the format of pixelwise_masks is meant to be. The index of the segment?

Minimum reproducible example

from nvidia.dali.pipeline import Pipeline
import nvidia.dali.fn as fn
from nvidia.dali.types import DALIImageType
from matplotlib import pyplot as plt

file_root = REPLACEME
annotations_file = REPLACEME

pipe = Pipeline(batch_size=1, num_threads=4, device_id=0)
with pipe:
    jpegs, bboxes, labels, masks = fn.readers.coco(
        file_root=file_root,
        annotations_file=annotations_file,
        pixelwise_masks=True,
        ratio=True,
    )
    images = fn.decoders.image(jpegs, device="mixed", output_type=DALIImageType.RGB)
    pipe.set_outputs(images, bboxes, labels, masks)
pipe.build()

pipe_out = pipe.run()

images_cpu = pipe_out[0].as_cpu()
bboxes_cpu = pipe_out[1]
labels_cpu = pipe_out[2]
masks = pipe_out[3]

print(labels_cpu)
print(bboxes_cpu)
plt.subplot(121)
plt.imshow(images_cpu.as_array()[0, ...])
plt.subplot(122)
plt.imshow(masks.as_array()[0, ..., 0])
plt.show()

Relevant log output

No response

Other/Misc.

COCO Reader tutorial also needs to be updated to
from nvidia.dali.types import DALIImageType instead of import nvidia.dali.types as types with its usage in fn.decoders.image.

Check for duplicates

I have searched the open bugs/issues and have found no duplicates for this bug report

The text was updated successfully, but these errors were encountered:

5had3z · 2024-04-02T02:52:46Z

polygon_masks=True returns valid poly and vert data btw. In the meantime, I will probably use this to render image with a plugin that runs over these with cv::fillConvexPoly. Could probably be interesting see how well that works compared to doing it manually in COCOReader::PixelwiseMasks, and potentially replace it...

5had3z · 2024-04-02T03:35:13Z

If ratio=True, the poly points are internally transformed so it tries to rasterize the mask with normalized points and fails. I don't think its unreasonable to have ratio=True for normalized bboxes and have a segmentation image.

In regards to the segmentation image itself, it is initialized with zeros, but the poly masks may not cover all pixels, and there might be a zero class, so there will be incorrect loss applied to areas that should be ignore class, specified by the user.But I think one better would be to not create a semantic segmentation, rather just a mask segmentation that corresponds with the labels and boxes. Instances are hence still separated so panoptic segmentation can be performed with this, rather than only semantic (class recovered via label tensor index). For this mask segmentation, -1 can be void area (since we're using int32).

I might still try testing throughput of the current RLE implementation and cv::fillConvexPoly to simplify the codebase and remove pycocotools may be useful. RLE is not the prettiest thing to look at, I'll have to have a deeper understanding to figure out how to change from class to mask id.

5had3z · 2024-04-02T05:22:02Z

Current plan is operator rasterize_vert_poly that rasterizes from vert/poly list, will basically follow the meta algorithm that works in normal python below.

mask = np.zeros(image.shape[:-1], dtype=np.int32)
vert_cpu *= mask.shape
vert_cpu = vert_cpu.astype(np.int32)
for i, poly_ in enumerate(poly_cpu, 1):
    points = vert_cpu[poly_[1] : poly_[2]]
    mask = cv2.fillConvexPoly(mask, points, i)

5had3z · 2024-04-02T06:56:59Z

Without actually benchmarking any performance difference, I have a plugin that works as expected.

JanuszL · 2024-04-02T13:14:00Z

Hi @5had3z,

Thank you for reporting the bug. Yes, indeed it is about the mishandling of the ratio=True case.
#5407 should fix that.
If you feel that you can improve the COCO reader behavior we would be more than happy to review and merge a PR (the functionality itself is an external contribution as well #2248).

5had3z · 2024-04-02T22:13:05Z

I had my blinders on and didn't stop to think I need panoptic, not instance segmentation, so I need to make a COCO panoptic reader plugin anyway. I think there is some overlap with my existing cityscapes plugins, if there is interest reading coco panoptic format I can upstream this, but there will probably have to be some lenghty design discussions.

FYI the post-hoc rasterize Verticies + Poly is relatively simple to implement, however I haven't benchmarked check if its eggregiously slower than the existing impl.

using ConstTensor = dali::ConstSampleView<dali::CPUBackend>;
using Tensor = dali::SampleView<dali::CPUBackend>;

void rasterizeVertPoly(ConstTensor polyTensor, ConstTensor vertTensor, Tensor maskTensor, bool normCoords)
{
    struct Poly
    {
        int maskIdx;
        int startIdx;
        int endIdx;
    };

    const auto outShape = maskTensor.shape();
    cv::Mat maskMat(outShape[0], outShape[1], CV_32S, maskTensor.raw_mutable_data());
    maskMat.setTo(-1); // FIXME user defined void area

    std::span polyData(static_cast<const Poly*>(polyTensor.raw_data()), polyTensor.shape()[0]);
    std::span vertData(static_cast<const cv::Point2f*>(vertTensor.raw_data()), vertTensor.shape()[0]);

    std::vector<cv::Point> vertices(vertData.size()); // Transform  poly points from float to int

    if (normCoords)
    {
        std::transform(std::execution::unseq, vertData.begin(), vertData.end(), vertices.begin(),
            [h = outShape[0], w = outShape[1]](cv::Point2f p) { return cv::Point(p.x * w, p.y * h); });
    }
    else
    {
        std::transform(std::execution::unseq, vertData.begin(), vertData.end(), vertices.begin(),
            [](cv::Point2f p) { return static_cast<cv::Point>(p); });
    }

    for (const auto& polyPoint : polyData)
    {
        cv::Point* start = vertices.data() + polyPoint.startIdx;
        const int nPoints = polyPoint.endIdx - polyPoint.startIdx;
        cv::fillConvexPoly(maskMat, start, nPoints, polyPoint.maskIdx);
    }
}

template <>
void RasterizeVertPoly<dali::CPUBackend>::RunImpl(dali::Workspace& ws)
{
    const auto& polyIn = ws.Input<dali::CPUBackend>(0);
    const auto& vertIn = ws.Input<dali::CPUBackend>(1);
    auto& maskOut = ws.Output<dali::CPUBackend>(0);

    auto& tPool = ws.GetThreadPool();

    for (int sampleId = 0; sampleId < ws.GetRequestedBatchSize(0); ++sampleId)
    {
        tPool.AddWork([&, sampleId](int)
            { rasterizeVertPoly(polyIn[sampleId], vertIn[sampleId], maskOut[sampleId], mNormalizedCoords); });
    }

    tPool.RunAll();
}

JanuszL · 2024-04-03T09:29:42Z

Hi @5had3z,

Thank you for sharing your implementation. The current one is an external contribution and was not benchmarked thoroughly as well. If you have some spare time feel free to compare both your implementations to see which one yields better perf results.

if there is interest reading coco panoptic format I can upstream this
We haven't hard such a request yet but feel free to contribute and extend the coco reader.

5had3z added the bug Something isn't working label Apr 2, 2024

dali-automaton assigned jantonguirao Apr 2, 2024

JanuszL mentioned this issue Apr 2, 2024

Fix pixelwise_masks support when the ratio is on in the coco reader #5407

Merged

18 tasks

JanuszL added this to the Release_1.36.0 milestone Apr 2, 2024

5had3z closed this as completed Apr 2, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

COCO Reader pixelwise_masks Emtpy Output #5404

COCO Reader pixelwise_masks Emtpy Output #5404

5had3z commented Apr 2, 2024

5had3z commented Apr 2, 2024

5had3z commented Apr 2, 2024

5had3z commented Apr 2, 2024

5had3z commented Apr 2, 2024

JanuszL commented Apr 2, 2024

5had3z commented Apr 2, 2024 •

edited

Loading

JanuszL commented Apr 3, 2024

COCO Reader pixelwise_masks Emtpy Output #5404

COCO Reader pixelwise_masks Emtpy Output #5404

Comments

5had3z commented Apr 2, 2024

Version

Describe the bug.

Minimum reproducible example

Relevant log output

Other/Misc.

Check for duplicates

5had3z commented Apr 2, 2024

5had3z commented Apr 2, 2024

5had3z commented Apr 2, 2024

5had3z commented Apr 2, 2024

JanuszL commented Apr 2, 2024

5had3z commented Apr 2, 2024 • edited Loading

JanuszL commented Apr 3, 2024

5had3z commented Apr 2, 2024 •

edited

Loading