New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
checkerboard #59
Comments
Hi, yes that pattern is quite weird, I propose the following things to test (although I am not sure which will work):
Hope this helps |
Hi Yassine! |
the decodes use pixel shuffles instead of deconvolution, but I think maybe replacing it with a deconv decoder might help |
Hi @yassouali |
yes that is correct, you'll need to replace the pixel shuffle with a |
The PixelShuffle layer is wrapped in a for-loop after a conv layer.
What would be the correct parameters for (nn.ConvTranspose2d(in_channels=?, out_channels=?, kernel_size=?)) Thank you for your help! |
the for loop is for upsampling with a factor of x2 each time, and the conv is part of the pixel shuffle method, you can try replacing the conv2d by conv transpose and remove the pixel shuffle, as for an example, you might need to try different variation to find the best one, but maybe |
Hi Yassine,
I am using the CCT model to train on a satellite dataset. The images are size 128x128. For some reason the predictions show a clear checkerboard pattern as shown in this example. Left: prediction, Right: ground truth.
Do you have any idea what causes this and how to avoid it?
The text was updated successfully, but these errors were encountered: