YAD: Leveraging T5 for Improved Automatic Diacritization of Yorùbá Text

AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning



arXiv:2412.20218v1 Announce Type: new
Abstract: In this work, we present Yor`ub’a automatic diacritization (YAD) benchmark dataset for evaluating Yor`ub’a diacritization systems. In addition, we pre-train text-to-text transformer, T5 model for Yor`ub’a and showed that this model outperform several multilingually trained T5 models. Lastly, we showed that more data and larger models are better at diacritization for Yor`ub’a



Source link
lol

By stp2y

Leave a Reply

Your email address will not be published. Required fields are marked *

No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.