CPsyExam: A Chinese Benchmark for Evaluating Psychology using Examinations


By Jiahao Zhao and 10 other authors


Abstract: In this paper, we introduce a novel psychological benchmark, CPsyExam, constructed from questions sourced from Chinese language examinations. CPsyExam is designed to prioritize psychological knowledge and case analysis separately, recognizing the significance of applying psychological knowledge to real-world scenarios. From a pool of 22k questions, we use 4k to create a benchmark that offers balanced coverage of subjects and incorporates a diverse range of case analyses. We also evaluate a range of existing large language models (LLMs), spanning from open-source to API-based models. Our experiments and analysis demonstrate that CPsyExam serves as an effective benchmark for enhancing the understanding of psychology within LLMs and enables the comparison of LLMs across various granularities.

Submission history

From: Zhao Jiahao
[v1] Thu, 16 May 2024 16:02:18 UTC (9,573 KB)
[v2] Sat, 18 May 2024 07:55:58 UTC (9,573 KB)
[v3] Tue, 10 Dec 2024 14:44:41 UTC (6,369 KB)




By stp2y
