Implementaion of experiment for applying PCGrad method on robeta-based model & multi NLP tasks. Refer to https://han0ahblog.tistory.com/2 for details.
pcgrad.py is copy of https://github.com/WeiChengTseng/Pytorch-PCGrad/blob/master/pcgrad.py
python 3.10
pytorch==1.13.1
transformers==4.25.1
python train.py # for baseline
python train_pcgrad.py # with pcgrad
Validation Loss
baseline | +pcgrad | |
---|---|---|
PAWS-KR | 0.4793 | 0.4071 |
KLUE-NLI | 0.4432 | 0.4365 |
Validation Accuracy
baseline | +pcgrad | |
---|---|---|
PAWS-KR | 0.8030 | 0.8325 |
KLUE-NLI | 0.8486 | 0.8520 |