Speech

发布日期: 2023-01-30

2023-01-30 更新

BayesSpeech: A Bayesian Transformer Network for Automatic Speech Recognition

Authors:Will Rieger

Recent developments using End-to-End Deep Learning models have been shown to have near or better performance than state of the art Recurrent Neural Networks (RNNs) on Automatic Speech Recognition tasks. These models tend to be lighter weight and require less training time than traditional RNN-based approaches. However, these models take frequentist approach to weight training. In theory, network weights are drawn from a latent, intractable probability distribution. We introduce BayesSpeech for end-to-end Automatic Speech Recognition. BayesSpeech is a Bayesian Transformer Network where these intractable posteriors are learned through variational inference and the local reparameterization trick without recurrence. We show how the introduction of variance in the weights leads to faster training time and near state-of-the-art performance on LibriSpeech-960.
PDF

点此查看论文截图

木子已

https://ipaper.today/2023/01/30/2023-01-30-speech/

本博客所有文章除特別声明外，均采用 CC BY 4.0 许可协议。转载请注明来源木子已 !

Speech

NeRF

2023-01-30 NeRF

NeRF

无监督/半监督/对比学习

2023-01-30 无监督/半监督/对比学习

无监督半监督对比学习

Speech

2023-01-30 更新

BayesSpeech: A Bayesian Transformer Network for Automatic Speech Recognition

打赏用于支持本站流量费