Question about image transformation: short edge is still 384 for the fine-tuning task? #14

Jxu-Thu · 2021-07-08T11:56:22Z

Thanks for your great codes!
I carefully read your paper.

(in your paper) We resize the shorter edge of input images to 384 and limit the longer edge to under 640 while preserving the aspect ratio. This resizing scheme is also used during object detection in other VLP models, but with a larger size of the shorter edge (800). Patch projection of ViLT-B/32 yields 12 × 20 = 240 patches for an image with a resolution of 384*640.

However, I find that the "image_size=384" for all downstream tasks in this codes?

Would it have an effect on the performance of downstream tasks? At least with a shorter edge 800 can greatly increase the length of the sequence. So It should have a smaller batch size when using "shorter edge 800"

dandelin · 2021-07-09T02:33:25Z

We do use the shorter size of 384 for downstream tasks.
def config() is the default configuration, and the values in the configuration are used as-is unless named configs or command-line modifications do not modify them.

You can check the final configuration of an execution by print_config option.

Jxu-Thu · 2021-07-11T12:07:22Z

Thanks.

jkkishore1999 mentioned this issue Jul 8, 2021

python run.py with data_root="/arrows_flickr30k" num_gpus=1 num_nodes=1 task_finetune_irtr_f30k_randaug per_gpu_batchsize=4 load_path="vilt_200k_mlm_itm.ckpt" #6

Open

Jxu-Thu closed this as completed Jul 11, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question about image transformation: short edge is still 384 for the fine-tuning task? #14

Question about image transformation: short edge is still 384 for the fine-tuning task? #14

Jxu-Thu commented Jul 8, 2021 •

edited

dandelin commented Jul 9, 2021

Jxu-Thu commented Jul 11, 2021

Question about image transformation: short edge is still 384 for the fine-tuning task? #14

Question about image transformation: short edge is still 384 for the fine-tuning task? #14

Comments

Jxu-Thu commented Jul 8, 2021 • edited

dandelin commented Jul 9, 2021

Jxu-Thu commented Jul 11, 2021

Jxu-Thu commented Jul 8, 2021 •

edited