Skip to content
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion tests/ci_use/XPU_45T/run_w4a8.py
Original file line number Diff line number Diff line change
Expand Up @@ -36,7 +36,7 @@ def test_w4a8():
)
print(response.choices[0].message.content)
# print(base_response)
assert any(keyword in response.choices[0].message.content for keyword in ["人工智能", "文心一言"])
assert any(keyword in response.choices[0].message.content for keyword in ["人工智能", "文心一言", "小度"])
Copy link

Copilot AI Dec 1, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

此PR的描述信息不完整。虽然标题说明了要更改基准值,但PR描述中的"Motivation"和"Modifications"部分都是空的,没有解释为什么要添加"小度"作为可接受的关键词。

建议在PR描述中说明:

  1. Motivation: 为什么需要添加"小度"作为可接受的响应关键词?是因为模型响应行为发生了变化,还是为了支持新的模型版本?
  2. Modifications: 明确说明在w4a8量化测试中扩展了验证关键词列表,从["人工智能", "文心一言"]增加到["人工智能", "文心一言", "小度"]。
  3. Accuracy Tests: 如果这个改动影响了模型输出验证逻辑,请提供相关的测试结果或示例响应。

这样可以帮助审阅者理解修改的必要性和合理性。

Copilot generated this review using guidance from repository custom instructions.


if __name__ == "__main__":
Expand Down
Loading