
Training with the prefix_projection method: test acc stays fixed at 62.1 #13

Closed
yh351016 opened this issue Dec 7, 2021 · 5 comments

Comments

yh351016 commented Dec 7, 2021

When training with prefix_projection enabled, the test accuracy never moves from 0.6217 and the best epoch stays at 0:

12/07/2021 19:30:19 - INFO - training.trainer_base - ***** Epoch 12: Best results *****
12/07/2021 19:30:19 - INFO - training.trainer_base - best_epoch = 0
12/07/2021 19:30:19 - INFO - training.trainer_base - best_eval_accuracy = 0.6217125382262997
12/07/2021 19:30:19 - INFO - training.trainer_base - epoch = 12.0
OrderedDict([('best_epoch', 0), ('best_eval_accuracy', 0.6217125382262997), ('epoch', 13.0)])
{'loss': 0.7488, 'learning_rate': 0.006054054054054054, 'epoch': 13.51}

Xiao9905 (Member) commented Dec 7, 2021

Hi, prefix projection needs careful hyper-parameter tuning, and in practice we find it does not work for every task.
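
For context, a minimal sketch of how a prefix_projection-style reparameterization usually works (class and parameter names below are illustrative, not necessarily this repo's exact code): the trainable prefix positions are embedded and pushed through a two-layer MLP whose output is later split into per-layer past key/values, which adds extra trainable parameters that are sensitive to the learning rate.

```python
import torch
import torch.nn as nn


class PrefixEncoder(nn.Module):
    """Illustrative prefix encoder.

    With prefix_projection=True the prefix ids are embedded and passed through
    a two-layer MLP; with False they index a flat embedding table directly.
    The output is later reshaped into per-layer past key/value tensors.
    """

    def __init__(self, pre_seq_len=16, hidden_size=1024, prefix_hidden_size=512,
                 num_hidden_layers=24, prefix_projection=True):
        super().__init__()
        self.prefix_projection = prefix_projection
        out_dim = num_hidden_layers * 2 * hidden_size  # keys + values per layer
        if prefix_projection:
            self.embedding = nn.Embedding(pre_seq_len, hidden_size)
            self.trans = nn.Sequential(
                nn.Linear(hidden_size, prefix_hidden_size),
                nn.Tanh(),
                nn.Linear(prefix_hidden_size, out_dim),
            )
        else:
            self.embedding = nn.Embedding(pre_seq_len, out_dim)

    def forward(self, prefix_ids):
        # prefix_ids: (batch, pre_seq_len) integer positions of the prefix tokens
        if self.prefix_projection:
            return self.trans(self.embedding(prefix_ids))
        return self.embedding(prefix_ids)


# Usage: encoder(torch.arange(16).unsqueeze(0)) -> shape (1, 16, out_dim)
```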

yh351016 (Author) commented Dec 9, 2021

The BoolQ and RTE results in the table on GitHub are not the same as those reported in the paper.

Xiao9905 (Member) commented Dec 9, 2021

As we stated in the README.md, the hyperparameters provided in the repo were reimplemented on 3090 GPUs, rather than the A100s our paper results come from.
Since experiments reported in our paper are all conducted on NVIDIA DGX-A100 servers (which might be difficult to acquire), we reimplement P-tuning v2's results on BERT-large/RoBERTa-large with:

yh351016 (Author) commented Dec 9, 2021

As we stated in the README.md, the hyperparameters provided in the repo were reimplemented on 3090 GPUs, rather than the A100s our paper results come from. Since experiments reported in our paper are all conducted on NVIDIA DGX-A100 servers (which might be difficult to acquire), we reimplement P-tuning v2's results on BERT-large/RoBERTa-large with:

Thanks. I would think that only affects the speed. Does it have anything to do with the results?

Xiao9905 (Member) commented Dec 9, 2021

As stated in the README.md:

We notice that the best hyper-parameters can be sensitive to your server environment and package version.

If you do not have the exact same environment, we highly recommend running a hyper-parameter search in your environment, based on our example hyper-parameter search scripts in search_script and the result collection script search.py.
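
For anyone reproducing this, a hedged sketch of the kind of sweep such a search might run. The run.py flag names below are assumptions, not taken from the repo; copy the exact arguments from the example scripts under search_script.

```python
# Illustrative grid search over learning rate and prefix length.
# NOTE: the run.py arguments below are assumed; use the flags from search_script/.
import itertools
import subprocess

for lr, pre_seq_len in itertools.product(["5e-3", "7e-3", "1e-2"], [8, 16, 32]):
    subprocess.run(
        [
            "python", "run.py",
            "--learning_rate", lr,
            "--pre_seq_len", str(pre_seq_len),
            "--prefix_projection",
            "--output_dir", f"checkpoints/boolq_lr{lr}_psl{pre_seq_len}",
        ],
        check=True,  # stop the sweep if one run fails
    )
# Per-run metrics can then be aggregated with the repo's search.py.
```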
