A Simple HMM with Self-Supervised Representations for Phone Segmentation

AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning


View a PDF of the paper titled A Simple HMM with Self-Supervised Representations for Phone Segmentation, by Gene-Ping Yang and Hao Tang

View PDF
HTML (experimental)

Abstract:Despite the recent advance in self-supervised representations, unsupervised phonetic segmentation remains challenging. Most approaches focus on improving phonetic representations with self-supervised learning, with the hope that the improvement can transfer to phonetic segmentation. In this paper, contrary to recent approaches, we show that peak detection on Mel spectrograms is a strong baseline, better than many self-supervised approaches. Based on this finding, we propose a simple hidden Markov model that uses self-supervised representations and features at the boundaries for phone segmentation. Our results demonstrate consistent improvements over previous approaches, with a generalized formulation allowing versatile design adaptations.

Submission history

From: Gene-Ping Yang [view email]
[v1]
Sun, 15 Sep 2024 07:44:23 UTC (278 KB)
[v2]
Fri, 20 Sep 2024 06:49:10 UTC (278 KB)



Source link
lol

By stp2y

Leave a Reply

Your email address will not be published. Required fields are marked *

No widgets found. Go to Widget page and add the widget in Offcanvas Sidebar Widget Area.