Evaluation results are empty when using the example command to evaluate LLaMA-7b on the MMLU and C-Eval datasets #590

liujunfei678 · 2023-11-14T08:29:28Z

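Example:
After following the steps above to make sure OpenCompass is installed correctly and the datasets are prepared, you can evaluate the LLaMA-7b model on the MMLU and C-Eval datasets with the following command: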
python run.py --models hf_llama_7b --datasets mmlu_ppl ceval_ppl

Result:

Errors reported during the run:

11/14 15:44:18 - OpenCompass - ERROR - /home/liujunfei/opencompass/opencompass/runners/base.py - summarize - 63 - OpenICLInfer[llama-7b-hf/lukaemon_mmlu_professional_law_0] failed with code 1
11/14 15:44:18 - OpenCompass - ERROR - /home/liujunfei/opencompass/opencompass/runners/base.py - summarize - 63 - OpenICLInfer[llama-7b-hf/lukaemon_mmlu_professional_law_1] failed with code 1

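My question:
I followed the OpenCompass installation guide step by step, but I don't know why **the evaluation results are empty**, and some error messages appeared that I don't understand.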

tonysy · 2023-11-14T08:30:46Z

Maintainer

Please check the logs in the *.out files.
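
For readers who want to scan those logs quickly, here is a minimal sketch in Python. It assumes the `*.out` files sit somewhere under a single directory (the `outputs` path in the usage comment is an assumption, not something stated in this thread), and the function name `find_error_lines` and the keyword list are hypothetical choices for illustration.

```python
from pathlib import Path


def find_error_lines(log_root, keywords=("Error", "Traceback", "SSL")):
    """Collect lines from *.out files under log_root that contain any keyword.

    Returns a list of (file path, line number, line text) tuples.
    The keyword list is an illustrative guess, not an OpenCompass convention.
    """
    hits = []
    for path in sorted(Path(log_root).rglob("*.out")):
        # errors="replace" keeps the scan going if a log has undecodable bytes.
        with path.open(errors="replace") as fh:
            for lineno, line in enumerate(fh, start=1):
                if any(keyword in line for keyword in keywords):
                    hits.append((str(path), lineno, line.rstrip()))
    return hits


# Usage (the directory name is an assumption; point it at your own log folder):
# for path, lineno, text in find_error_lines("outputs"):
#     print(f"{path}:{lineno}: {text}")
```

The first matching line in each file is usually the most useful place to start reading.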

2 replies

liujunfei678 Nov 14, 2023
Author

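I looked at the four .out log files.
It appears that an SSL error occurred while establishing a secure connection to "huggingface.co": "SSL: UNEXPECTED_EOF_WHILE_READING".
But I don't know how to solve this. Is it a problem with my firewall?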

tonysy Nov 14, 2023
Maintainer

It seems the error is caused by the Internet connection to Hugging Face. If you have downloaded the checkpoints before, please use the specific local path of the checkpoint, and set HF_EVALUATE_OFFLINE=1 HF_DATASETS_OFFLINE=1 TRANSFORMERS_OFFLINE=1 to avoid the network error.
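
A minimal sketch of the offline-flag suggestion, written in Python as a small launcher. The three environment variable names come from the reply above; the helper names `offline_env` and `run_offline` are hypothetical. It does not cover the other half of the advice: the model config still has to point at the local checkpoint directory, and that edit is not shown here.

```python
import os
import subprocess

# Variable names taken from the reply above; "1" switches each library to offline mode.
OFFLINE_FLAGS = {
    "HF_EVALUATE_OFFLINE": "1",
    "HF_DATASETS_OFFLINE": "1",
    "TRANSFORMERS_OFFLINE": "1",
}


def offline_env(base_env=None):
    """Return a copy of the environment with the offline flags added."""
    env = dict(os.environ if base_env is None else base_env)
    env.update(OFFLINE_FLAGS)
    return env


def run_offline(command, cwd=None):
    """Run a command with the offline flags set and return its exit code."""
    return subprocess.run(command, env=offline_env(), cwd=cwd).returncode


# Usage, from the OpenCompass checkout, with the command from the original post:
# run_offline(["python", "run.py", "--models", "hf_llama_7b",
#              "--datasets", "mmlu_ppl", "ceval_ppl"])
```

The same effect can be had by prefixing the three assignments to the command in a shell; the launcher only makes the setting explicit and repeatable.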

YangYu-Li · 2024-02-05T08:56:24Z

Hello, did you manage to solve this problem?

0 replies
