Will L2 normalization for image and text leads to better results? #30

FranklinLingfeng · 2024-02-19T13:38:45Z

When aligning image and text, why don't you need to l2 normalize the image and text features? Will this not cause the module length of the image feature to become very large in order to reduce the i2t loss in the second stage of training?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Will L2 normalization for image and text leads to better results? #30

Will L2 normalization for image and text leads to better results? #30

FranklinLingfeng commented Feb 19, 2024

Will L2 normalization for image and text leads to better results? #30

Will L2 normalization for image and text leads to better results? #30

Comments

FranklinLingfeng commented Feb 19, 2024