Fix run_glue evaluation when model has a label correspondence #10401

sgugger · 2021-02-25T19:02:46Z

What does this PR do?

The run_glue script uses the correspondence id to label stored in a given model but when using

AutoModelForSequenceClassication.from_pretrained(xxx, num_labels=x)

that correspondence is reset. This PR fixes that, along with a few other bugs in the script. To confirm MNLI evaluation does take the correspondence in a model config

python examples/text-classification/run_glue.py --model_name_or_path roberta-large-mnli --task_name mnli --max_seq_length 128 --output_dir ~/tmp/test-mnli --do_eval

gices 90.6%/90.1% accuracy (matched/mismatched) after this PR, vs 4.28%/4.86% accuracy on current master.

LysandreJik

Yes, LGTM!

Fix run_glue evaluation when model has a label correspondence

1f64071

sgugger requested a review from LysandreJik February 25, 2021 19:02

Add fixes in script

1dd08d3

LysandreJik approved these changes Feb 25, 2021

View reviewed changes

sgugger merged commit 17b6e0d into master Feb 25, 2021

sgugger deleted the mnli_script branch February 25, 2021 20:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix run_glue evaluation when model has a label correspondence #10401

Fix run_glue evaluation when model has a label correspondence #10401

sgugger commented Feb 25, 2021

LysandreJik left a comment

Fix run_glue evaluation when model has a label correspondence #10401

Fix run_glue evaluation when model has a label correspondence #10401

Conversation

sgugger commented Feb 25, 2021

What does this PR do?

LysandreJik left a comment

Choose a reason for hiding this comment