
How do you implement equation (15) in your paper? #5

Closed
Dorispaopao opened this issue Aug 23, 2019 · 2 comments

Comments

@Dorispaopao

And how do you handle the gradient backpropagation in your implementation?

@XiaLiPKU
Owner

For the first question:
I implemented it in 'train.py':

EMANet/train.py, line 134 (commit 9a492d8):

```python
self.net.module.ema.mu *= momentum
```

Implementing it in the EMAU module might look cleaner. But since \mu has to be averaged over the whole batch, implementing it in the module would need a 'reduce' operation as in SyncBN. So I just put this line in 'train.py', where the \mu from all GPUs have already been gathered.
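
For context, here is a minimal sketch of the batch-level moving-average update that this line belongs to, assuming equation (15) is the momentum update \bar{\mu} \leftarrow \alpha \bar{\mu} + (1 - \alpha) \mu. The names `ema_mu` and `mu`, and all shapes, are illustrative, not the repository's exact code:

```python
import torch

# Sketch of the moving-average update described above:
# mu_bar <- momentum * mu_bar + (1 - momentum) * mu.
# `ema_mu` stands in for net.module.ema.mu; shapes are illustrative.
momentum = 0.9
ema_mu = torch.randn(1, 512, 64)  # stored bases \bar{\mu}: (1, c, k)
mu = torch.randn(16, 512, 64).mean(dim=0, keepdim=True)  # \mu averaged over the batch

with torch.no_grad():
    ema_mu *= momentum              # the quoted line from train.py
    ema_mu += mu * (1 - momentum)   # the rest of the moving average
```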

@XiaLiPKU
Owner

> And how do you handle the gradient backpropagation in your implementation?

For the second question:

I simply cut off the gradients for the A_E and A_M iterations by wrapping them in `with torch.no_grad():`.
To be honest, what happens inside the EMA iterations still lacks deep exploration. EMANet is just a first exploration of the EM + Attention mechanism, so I look forward to deeper analysis from interested followers.
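
As a rough illustration, here is a minimal sketch of running the A_E/A_M iterations under `torch.no_grad()`, with a final differentiable re-estimation step. The shapes, the softmax/normalization details, and the placement of the last step are assumptions and may differ from the actual EMAU module:

```python
import torch
import torch.nn.functional as F

# Sketch: E/M attention iterations carry no gradient, as described above.
# b = batch, c = channels, n = pixels, k = number of bases; values illustrative.
b, c, n, k = 2, 64, 32 * 32, 16
x = torch.randn(b, c, n, requires_grad=True)  # flattened feature map
mu = torch.randn(b, c, k)                     # initial bases \mu

with torch.no_grad():                         # cut gradients off the iterations
    for _ in range(3):
        z = F.softmax(torch.bmm(x.transpose(1, 2), mu), dim=2)  # A_E: (b, n, k)
        z_norm = z / (1e-6 + z.sum(dim=1, keepdim=True))
        mu = F.normalize(torch.bmm(x, z_norm), dim=1)           # A_M: (b, c, k)

# A final step outside no_grad so gradients still reach x (an assumption here,
# not necessarily how the official EMAU module arranges it).
z = F.softmax(torch.bmm(x.transpose(1, 2), mu), dim=2)
x_tilde = torch.bmm(mu, z.transpose(1, 2))    # reconstructed features: (b, c, n)
x_tilde.sum().backward()                      # gradients flow only through this last step
```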
