EmPO: Emotion Grounding for Empathetic Response Generation through Preference Optimization

stp2ySeptember 18, 20240 Comments

How to Evaluate an LLM's Ability to Follow Instructions

[Submitted on 27 Jun 2024 (v1), last revised 17 Sep 2024 (this version, v2)]

View a PDF of the paper titled EmPO: Emotion Grounding for Empathetic Response Generation through Preference Optimization, by Ondrej Sotolar and 5 other authors

View PDF
HTML (experimental)

Abstract:Empathetic response generation is a desirable aspect of conversational agents, crucial for facilitating engaging and emotionally intelligent multi-turn conversations between humans and machines. Leveraging large language models for this task has shown promising results, yet challenges persist in ensuring both the empathetic quality of the responses and retention of the generalization performance of the models. We propose a novel approach where we construct theory-driven preference datasets based on emotion grounding and use them to align LLMs with preference optimization algorithms to address these challenges. To evaluate empathetic response generation, we employ the EmpatheticDialogues dataset, assessing empathy with the diff-Epitome and BERTscore metrics and with multi-dimensional human evaluation. Additionally, we measure diversity and emotional valence using feature-based methods. We also evaluate the impact of training on the generalization performance using the MMLU benchmark and tasks from the Open LLM Leaderboard. The results show that LLMs can be aligned for empathetic response generation by preference optimization while retaining their general performance and that emotion grounding can guide preference dataset creation. We make all datasets, source code, and models publicly available. this https URL

Submission history

From: Ondřej Sotolář [view email]
[v1]
Thu, 27 Jun 2024 10:41:22 UTC (7,944 KB)
[v2]
Tue, 17 Sep 2024 14:24:47 UTC (4,271 KB)

Source link
lol

By stp2y