[ACL 2024 Main] NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Journalism
benchmark framework evaluation dataset gpt4 large-language-models llm chatgpt ernie-bot gpt35turbo chatglm2-6b xverse internlm-20b baichaun2 aquila2 qwen-14b chatglm3-6b acl2024
-
Updated
Jun 25, 2024 - Python