Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Training procedure cropping and resizing for semantic segmentation #41

Closed
ksagoog opened this issue Jul 23, 2021 · 2 comments
Closed

Training procedure cropping and resizing for semantic segmentation #41

ksagoog opened this issue Jul 23, 2021 · 2 comments

Comments

@ksagoog
Copy link

ksagoog commented Jul 23, 2021

Hi,

Thanks for your great paper. For the semantic segmentation model on ADE20K, you state the following:

"""Images are resized to 520 pixels side length.
We use random horizontal flipping and random rescaling in
the range ∈ (0.5, 2.0) for data augmentation. We train on
square random crops of size 480."""

I feel I must not understand the procedure as randomly scaling a 520-pixel length image between the range (.5, 2.0) will result in some images of side-length less than your random crop size of 480. Could you please clarify the order of operations and any missing detail here? Thank you!

@ranftlr
Copy link
Contributor

ranftlr commented Jul 25, 2021

We used pytorch-encoding as our training framework. See here for the transform that is applied to the input during training: https://github.com/zhanghang1989/PyTorch-Encoding/blob/331ecdd5306104614cb414b16fbcd9d1a8d40e1e/encoding/datasets/base.py#L64

We used this class with base_size=520 and crop_size=480.

@ksagoog
Copy link
Author

ksagoog commented Jul 26, 2021

Thanks!

@ksagoog ksagoog closed this as completed Jul 26, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants