Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

semantic segmentation #4

Closed
ajtao opened this issue Jun 25, 2021 · 3 comments
Closed

semantic segmentation #4

ajtao opened this issue Jun 25, 2021 · 3 comments

Comments

@ajtao
Copy link

ajtao commented Jun 25, 2021

Hello, thanks very much for sharing the code for your tremendous research!

For semantic segmentation, did you just run evaluation with multiple square tiles to handle the non-square resolution of Cityscapes? Can you share any details, like decoder head architecture?

@houqb
Copy link
Collaborator

houqb commented Jun 26, 2021

We process each image in a sliding window way. As mentioned in the paper, we use the UperNet head as our decoder head.

@ajtao
Copy link
Author

ajtao commented Jun 26, 2021

right thanks.

@ajtao ajtao closed this as completed Jun 26, 2021
@ajtao
Copy link
Author

ajtao commented Jun 26, 2021

@Andrew-Qibin I'm seeing rather poor Cityscapes segmentation results (training starts at 20 IOU first epoch and only gets to 53 IOU) right out of the box using a volo_d2 trunk (using imagenet pretrained weight). Probably i've got some tensor ordering wrong or something, but is there any trick to adapting the code to higher resolution? I had to of course override the positional encodings from the checkpoint with a new higher resolution positional encoding (1024). And i created a new forward() that supplies the features in N, C, H, W form. But hmm, not sure what i've got wrong right now.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants