Evaluating the Capabilities of Large Language Models for Multi-label Emotion Understanding

stp2yDecember 25, 20240 Comments

AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning

[Submitted on 17 Dec 2024]

View a PDF of the paper titled Evaluating the Capabilities of Large Language Models for Multi-label Emotion Understanding, by Tadesse Destaw Belay and 7 other authors

View PDF
HTML (experimental)

Abstract:Large Language Models (LLMs) show promising learning and reasoning abilities. Compared to other NLP tasks, multilingual and multi-label emotion evaluation tasks are under-explored in LLMs. In this paper, we present EthioEmo, a multi-label emotion classification dataset for four Ethiopian languages, namely, Amharic (amh), Afan Oromo (orm), Somali (som), and Tigrinya (tir). We perform extensive experiments with an additional English multi-label emotion dataset from SemEval 2018 Task 1. Our evaluation includes encoder-only, encoder-decoder, and decoder-only language models. We compare zero and few-shot approaches of LLMs to fine-tuning smaller language models. The results show that accurate multi-label emotion classification is still insufficient even for high-resource languages such as English, and there is a large gap between the performance of high-resource and low-resource languages. The results also show varying performance levels depending on the language and model type. EthioEmo is available publicly to further improve the understanding of emotions in language models and how people convey emotions through various languages.

Submission history

From: Tadesse Destaw Belay [view email]
[v1]
Tue, 17 Dec 2024 07:42:39 UTC (4,136 KB)

Source link
lol

By stp2y