Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Is the code consistent with the description in the paper? #5

Closed
AlphaPlusTT opened this issue Sep 5, 2021 · 4 comments
Closed

Is the code consistent with the description in the paper? #5

AlphaPlusTT opened this issue Sep 5, 2021 · 4 comments

Comments

@AlphaPlusTT
Copy link

According to formula 5 and formula 6 in the paper, the class-specific residual attention (CSRA) feature f should be sent to
the classifier to obtain the final logits, but in your code, you use the f as the final logits, what's the difference?

@Kevinz-code
Copy link
Owner

Hi, @AlphaPlusTT

Actually there are no difference and the code is consistent with the paper.

The logit for the i-th class can be expressed by Eq. (7) and Eq. (8) in the paper (in which ''x_k \cdot m_i'' can be replaced by s_k^i ), so the form of the second term of Eq. (8) is equal to the line 26 at "pipeline/csra.py''.

Best,
Authors

@AlphaPlusTT
Copy link
Author

Thanks for your reply. I try to use your method for pedestrian attribute recognition task with dataset rapv2, but the CSRA dose not work. Do you have any suggestions?

@abhigoku10
Copy link

@AlphaPlusTT i am also trying the same on PETA dataset and facing error , what error are you facing ? @Kevinz-code pls do share ur thoughts

@Kevinz-code
Copy link
Owner

@abhigoku10 @AlphaPlusTT
Thanks for reading.

  1. Actually we didn't run CSRA model on rapv2 and PETA, which weren't the popular multi-label recognition datasets in relevant literatures.

  2. We are not sure what "error" or "does not work" in your own implementation. Say, does the improvement of CSRA is low in comparison with the baseline method? It is advised that you provide the training details like model name, data aug, hyper-params, mAP, etc.

  3. In our implementation in Wider-Attribute, the improvement is consistent by using VIT models, you can try VIT models and use hyper-parameters similar to our code as well.

Best,

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants