Questions about your ablation studies #1
Comments
We cannot directly train a model with the original MHSA. As described in the introduction of our paper, the computational cost and memory usage of the original MHSA are quadratic in the image size, and the memory usage can even exceed the limit of an NVIDIA A100. The original MHSA might achieve better results, but it is not feasible to run on current hardware.
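To make the quadratic growth concrete, here is a rough back-of-the-envelope sketch. It counts only the attention score matrix for one head and one image, and it assumes a patch size of 4 and float32 storage. Both are illustrative assumptions rather than settings taken from the paper, and the helper names are hypothetical.

```python
def num_tokens(height, width, patch=4):
    """Token count after a non-overlapping patch embedding.

    The patch size of 4 is an assumption made for illustration.
    """
    return (height // patch) * (width // patch)


def attn_bytes(num_queries, num_keys, heads=1, bytes_per_elem=4):
    """Bytes needed to store the attention score matrix (float32 by default)."""
    return heads * num_queries * num_keys * bytes_per_elem


for side in (224, 512, 1024):
    n = num_tokens(side, side)
    full = attn_bytes(n, n)  # original MHSA: every token attends to every token
    print(f"{side}x{side}: {n} tokens, {full / 2**20:.1f} MiB per head per image")
```

Doubling the image side quadruples the token count and multiplies the score matrix by 16. Multiplying by the number of heads, the batch size, and the activations kept for backpropagation gives a sense of why high-resolution training with full attention runs out of memory quickly.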
Thank you for your reply.
Almost right. However, without padding, some pixels may not be taken into account. To address this, we use an adaptive kernel size, stride, and padding to make sure that all pixels are included in the computation.
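The coverage problem can be illustrated in one dimension. The sketch below compares a fixed kernel and stride without padding against the usual adaptive-pooling window rule (start at floor(i * size / out), end at ceil((i + 1) * size / out)). That rule is an assumption chosen for illustration and may differ from what the repository actually does; all function names are hypothetical.

```python
def fixed_windows(size, ratio):
    """Windows for kernel = stride = ratio with no padding (trailing pixels are dropped)."""
    return [(s, s + ratio) for s in range(0, size - ratio + 1, ratio)]


def adaptive_windows(size, out):
    """Windows following the common adaptive-pooling rule (an assumption here)."""
    windows = []
    for i in range(out):
        start = (i * size) // out            # floor(i * size / out)
        end = -(-((i + 1) * size) // out)    # ceil((i + 1) * size / out)
        windows.append((start, end))
    return windows


def uncovered(size, windows):
    """Pixel indices in [0, size) that no window touches."""
    seen = set()
    for start, end in windows:
        seen.update(range(start, end))
    return sorted(set(range(size)) - seen)


size, ratio = 14, 4
out = -(-size // ratio)  # ceil(size / ratio) output positions
print("fixed   :", fixed_windows(size, ratio), "missing", uncovered(size, fixed_windows(size, ratio)))
print("adaptive:", adaptive_windows(size, out), "missing", uncovered(size, adaptive_windows(size, out)))
```

With a side length that is not a multiple of the pooling ratio, the fixed windows skip the last pixels, while the adaptive windows overlap slightly so that every pixel contributes to some output.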
Hello,
I have some questions about your ablation studies of pyramid pooling.
Could you give more details about your baseline version in Table 9?
First, you say that you replace P-MHSA with an MHSA that uses a single pooling operation. What are the details of this single pooling operation, e.g., the pooling ratio?
Second, did you compare your method with the original MHSA?