semantic segmentation #4

ajtao · 2021-06-25T16:34:57Z

Hello, thanks very much for sharing the code for your tremendous research!

For semantic segmentation, did you just run evaluation with multiple square tiles to handle the non-square resolution of Cityscapes? Can you share any details, like decoder head architecture?

houqb · 2021-06-26T05:01:51Z

We process each image in a sliding-window fashion. As mentioned in the paper, we use the UperNet head as our decoder head.
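
For readers unfamiliar with sliding-window evaluation, here is a minimal sketch of how square tiles could be laid over a non-square Cityscapes frame (1024 x 2048). The crop size and stride are not stated in this thread, so the values below are illustrative assumptions, and the helper names `sliding_window_boxes` and `coverage` are hypothetical rather than taken from the VOLO code.

```python
def sliding_window_boxes(height, width, crop, stride):
    """Return (top, left, bottom, right) boxes of square tiles covering the image.

    Tiles that would run past the border are shifted back inside it,
    so every tile has the full crop size when the image is large enough.
    """
    rows = max(height - crop + stride - 1, 0) // stride + 1
    cols = max(width - crop + stride - 1, 0) // stride + 1
    boxes = []
    for r in range(rows):
        for c in range(cols):
            bottom = min(r * stride + crop, height)
            right = min(c * stride + crop, width)
            top = max(bottom - crop, 0)
            left = max(right - crop, 0)
            boxes.append((top, left, bottom, right))
    return boxes


def coverage(height, width, boxes):
    """Count how many tiles cover each pixel (used to average overlapping logits)."""
    counts = [[0] * width for _ in range(height)]
    for top, left, bottom, right in boxes:
        for y in range(top, bottom):
            for x in range(left, right):
                counts[y][x] += 1
    return counts


# Assumed settings: 512 x 512 crops with a stride of 256 on a 1024 x 2048 image.
boxes = sliding_window_boxes(1024, 2048, crop=512, stride=256)
print(len(boxes))  # 21 tiles (3 rows x 7 columns)
```

In a typical setup, the logits predicted for each tile are accumulated into a full-resolution buffer and divided by the per-pixel coverage count before taking the argmax, so overlapping regions are averaged rather than overwritten.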

ajtao · 2021-06-26T15:49:05Z

Right, thanks.

ajtao · 2021-06-26T16:31:28Z

@Andrew-Qibin I'm seeing rather poor Cityscapes segmentation results right out of the box with a volo_d2 trunk initialized from the ImageNet-pretrained weights: training starts at 20 IoU in the first epoch and only reaches 53 IoU. I've probably got some tensor ordering wrong, but is there any trick to adapting the code to higher resolution? I of course had to override the positional encodings from the checkpoint with new, higher-resolution positional encodings (1024), and I created a new forward() that supplies the features in N, C, H, W form. I'm not sure what I've got wrong right now.
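
The two adaptation steps described above can be sketched in PyTorch as follows. This is not confirmed by the authors in this thread. It assumes the checkpoint stores the positional embedding as a `(1, H, W, C)` grid and that backbone tokens arrive as `(N, H*W, C)` in row-major order; both layouts, the helper names `resize_pos_embed` and `tokens_to_nchw`, and the toy sizes are assumptions for illustration. A common alternative to creating a fresh positional encoding is to interpolate the pretrained one to the new grid, which is what the first helper does.

```python
import torch
import torch.nn.functional as F


def resize_pos_embed(pos_embed, new_hw):
    """Interpolate a (1, H, W, C) positional embedding to a new (H', W') grid."""
    pe = pos_embed.permute(0, 3, 1, 2)  # -> (1, C, H, W), as F.interpolate expects
    pe = F.interpolate(pe, size=new_hw, mode="bicubic", align_corners=False)
    return pe.permute(0, 2, 3, 1)  # -> (1, H', W', C)


def tokens_to_nchw(tokens, hw):
    """Convert (N, H*W, C) row-major tokens to an (N, C, H, W) feature map."""
    n, length, c = tokens.shape
    h, w = hw
    assert length == h * w, "token count must match the spatial grid"
    # Move channels ahead of the token axis first, then split tokens into H x W.
    # Calling tokens.reshape(n, c, h, w) directly would scramble the layout.
    return tokens.transpose(1, 2).reshape(n, c, h, w)


# Toy sizes (assumed): a 14 x 14 pretrained grid resized to 64 x 64, 8 channels.
pos_embed = torch.randn(1, 14, 14, 8)
resized = resize_pos_embed(pos_embed, (64, 64))

tokens = torch.arange(2 * 6 * 4, dtype=torch.float32).reshape(2, 6, 4)
feat = tokens_to_nchw(tokens, (2, 3))
```

A quick sanity check for tensor ordering is to confirm that `feat[n, c, i, j]` equals `tokens[n, i * W + j, c]` for a few indices before training.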

ajtao closed this as completed Jun 26, 2021
