CPsyExam: A Chinese Benchmark for Evaluating Psychology using Examinations, by Jiahao Zhao and 10 other authors
Abstract: In this paper, we introduce a novel psychological benchmark, CPsyExam, constructed from questions sourced from Chinese language examinations. CPsyExam is designed to prioritize psychological knowledge and case analysis separately, recognizing the significance of applying psychological knowledge to real-world scenarios. From the pool of 22k questions, we utilize 4k to create the benchmark, which offers balanced coverage of subjects and incorporates a diverse range of case analysis. We evaluate a range of existing large language models (LLMs), spanning from open-source to API-based models. Our experiments and analysis demonstrate that CPsyExam serves as an effective benchmark for enhancing the understanding of psychology within LLMs and enables the comparison of LLMs across various granularities.
Submission history
From: Zhao Jiahao
[v1] Thu, 16 May 2024 16:02:18 UTC (9,573 KB)
[v2] Sat, 18 May 2024 07:55:58 UTC (9,573 KB)
[v3] Tue, 10 Dec 2024 14:44:41 UTC (6,369 KB)