[ACL 2024 Main] NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Journalism
benchmark
framework
evaluation
dataset
gpt4
large-language-models
llm
chatgpt
ernie-bot
gpt35turbo
chatglm2-6b
xverse
internlm-20b
baichaun2
aquila2
qwen-14b
chatglm3-6b
acl2024
-
Updated
Jun 25, 2024 - Python