
Issue about training configuration on Binary Adapter with VTAB-1k Dataset #19

Open
leoli646 opened this issue Apr 24, 2024 · 1 comment


@leoli646

Hello,
When I tried to replicate your binary_adapter experiment on the VTAB-1k dataset, I was unable to reproduce the results you reported. I would like to discuss some potential issues with the training configuration that might be causing this discrepancy.

In similar works such as VPT and SSF, different hyperparameters (learning rate, weight decay, drop-path rate, etc.) are used for the various datasets within VTAB-1k. However, the train.sh script in the binary_adapter codebase doesn't seem to account for these variations and applies the same default hyperparameters to every dataset.

Could you advise on whether I should:

  1. Conduct a grid search to find the best hyperparameter set for each dataset?
  2. Or adopt the hyperparameter settings from a published work such as SSF?
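For option 1, a per-dataset grid search just enumerates one training run per (dataset, hyperparameter) combination. The sketch below is purely illustrative: the dataset names, script name, and CLI flags are assumptions, not the actual binary_adapter interface.

```python
from itertools import product

# Hypothetical search space -- dataset names and flag names are
# illustrative stand-ins, not the real binary_adapter train.sh arguments.
datasets = ["cifar100", "caltech101", "dtd"]
lrs = [1e-3, 5e-3, 1e-2]
weight_decays = [1e-4, 1e-5]

def build_commands(datasets, lrs, weight_decays):
    """Enumerate one training command per (dataset, lr, wd) combination."""
    cmds = []
    for ds, lr, wd in product(datasets, lrs, weight_decays):
        cmds.append(
            f"python train.py --dataset {ds} --lr {lr} --weight_decay {wd}"
        )
    return cmds

commands = build_commands(datasets, lrs, weight_decays)
print(len(commands))  # 3 datasets x 3 lrs x 2 wds = 18 runs
```

Each command would then be run (e.g. via a job scheduler), and the setting with the best validation accuracy kept per dataset. Note the cost grows multiplicatively with each hyperparameter added to the grid.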

Your insights would be greatly appreciated as I continue my experiments.

Looking forward to your reply!

@JieShibo
Owner

In our experiments, we only searched over the scale factor. All experiments were conducted on RTX 3090 GPUs and may show slight variations in results when run on different devices. Further exploration of hyperparameters such as learning rate and weight decay could potentially improve performance.
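A search over just the scale factor, as described above, can be sketched as a simple one-dimensional sweep. Here `train_and_eval` is a hypothetical stand-in for the actual training-plus-validation loop, and the candidate scales are illustrative, not the values used in the paper.

```python
def search_scale(train_and_eval, scales=(0.01, 0.1, 1.0, 10.0)):
    """Try each candidate adapter scale and keep the best validation accuracy.

    train_and_eval: callable taking scale=... and returning a validation
    accuracy; it is a placeholder for the real training loop.
    """
    best_scale, best_acc = None, float("-inf")
    for s in scales:
        acc = train_and_eval(scale=s)
        if acc > best_acc:
            best_scale, best_acc = s, acc
    return best_scale, best_acc
```

Because only one hyperparameter is swept, the cost is linear in the number of candidates, which is why this is much cheaper than a full grid over learning rate and weight decay as well.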
