The MNLI score in lm-evaluation-harness #61

wang99711123 · 2023-06-09T09:42:02Z

Thanks for the great work!

I'm trying to reproduce the results you report. I downloaded the model weights from link https://huggingface.co/yahma/alpaca-7b-lora and evaluated them under the framework of lm-evaluation-harness. But I only got 41.7% accuracy on MNLI dataset.

When using lm-evaluation-harness, did you perform other data processing tricks to get 51.6% acc?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The MNLI score in lm-evaluation-harness #61

The MNLI score in lm-evaluation-harness #61

wang99711123 commented Jun 9, 2023

The MNLI score in lm-evaluation-harness #61

The MNLI score in lm-evaluation-harness #61

Comments

wang99711123 commented Jun 9, 2023