You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Because the target in pre-training process is normalized, so the predict of model is unreal.
To visualize the reconstruction image, we add the predict and the original mean and var of each patch.
So, to avoid it, you need to use the real pixels as the target by setting --normlize_target to False.
In fact, I am not sure that the reconstruction images shown in the paper from what kind of supervision.
And I will add the comment to avoid misunderstanding.
God job.
Hi, I think this operation will leak the information of the original input image(mean and var of one patch).
MAE-pytorch/run_mae_vis.py
Line 124 in 3546179
Anyone who uses the model trained with the normalized loss for visualization should pay attention to this operation.
I also suggest the author add the comment on this line. @pengzhiliang
The text was updated successfully, but these errors were encountered: