Dialectal Coverage And Generalization in Arabic Speech Recognition

AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning


[Submitted on 7 Nov 2024]

View a PDF of the paper titled Dialectal Coverage And Generalization in Arabic Speech Recognition, by Amirbek Djanibekov and 4 other authors

View PDF
HTML (experimental)

Abstract:Developing robust automatic speech recognition (ASR) systems for Arabic, a language characterized by its rich dialectal diversity and often considered a low-resource language in speech technology, demands effective strategies to manage its complexity. This study explores three critical factors influencing ASR performance: the role of dialectal coverage in pre-training, the effectiveness of dialect-specific fine-tuning compared to a multi-dialectal approach, and the ability to generalize to unseen dialects. Through extensive experiments across different dialect combinations, our findings offer key insights towards advancing the development of ASR systems for pluricentric languages like Arabic.

Submission history

From: Amirbek Djanibekov [view email]
[v1]
Thu, 7 Nov 2024 22:23:30 UTC (7,092 KB)



Source link
lol

By stp2y

Leave a Reply

Your email address will not be published. Required fields are marked *

No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.