Speech

Published: 2023-01-04

Updated 2023-01-04

Supervised Acoustic Embeddings And Their Transferability Across Languages

Authors: Sreepratha Ram, Hanan Aldarmaki

In speech recognition, it is essential to model the phonetic content of the input signal while discarding irrelevant factors such as speaker variations and noise, which is challenging in low-resource settings. Self-supervised pre-training has been proposed as a way to improve both supervised and unsupervised speech recognition, including frame-level feature representations and Acoustic Word Embeddings (AWE) for variable-length segments. However, self-supervised models alone cannot learn perfect separation of the linguistic content as they are trained to optimize indirect objectives. In this work, we experiment with different pre-trained self-supervised features as input to AWE models and show that they work best within a supervised framework. Models trained on English can be transferred to other languages with no adaptation and outperform self-supervised models trained solely on the target languages.
Presented at ICNLSP 2022
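
To make the idea of an acoustic word embedding concrete, here is a minimal sketch of how a variable-length segment of frame-level features can be reduced to one fixed-size vector and compared with another segment. It uses plain mean pooling, which is only a simple baseline and not the supervised AWE model the paper trains. The frame vectors are made-up toy values standing in for pre-trained self-supervised features, and the helper names `mean_pool` and `cosine_similarity` are hypothetical.

```python
import math


def mean_pool(frames):
    """Average a variable-length list of frame vectors into one fixed-size vector."""
    if not frames:
        raise ValueError("segment has no frames")
    dim = len(frames[0])
    return [sum(frame[i] for frame in frames) / len(frames) for i in range(dim)]


def cosine_similarity(a, b):
    """Cosine similarity between two vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


# Toy 2-dimensional "frame features" (assumed values, not real model output).
# seg_a and seg_b stand for two utterances of the same word with different
# durations; seg_c stands for a different word.
seg_a = [[1.0, 0.0], [0.8, 0.2], [0.9, 0.1]]
seg_b = [[0.9, 0.1], [1.0, 0.0]]
seg_c = [[0.0, 1.0], [0.1, 0.9], [0.2, 0.8], [0.0, 1.0]]

emb_a = mean_pool(seg_a)
emb_b = mean_pool(seg_b)
emb_c = mean_pool(seg_c)

same_word = cosine_similarity(emb_a, emb_b)
different_word = cosine_similarity(emb_a, emb_c)
print(f"same word: {same_word:.3f}, different word: {different_word:.3f}")
```

Each segment maps to a vector of the same size regardless of how many frames it has, so segments of different durations can be compared directly. A trained AWE model would replace `mean_pool` with a learned encoder.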

木子已

https://ipaper.today/2023/01/04/2023-01-04-speech/

Unless otherwise stated, all articles on this blog are licensed under CC BY 4.0. Please credit 木子已 when reposting.
