Publication Details

Multilingual BLSTM and Speaker-Specific Vector Adaptation in 2016 BUT BABEL SYSTEM

KARAFIÁT, M.; BASKAR, M.; MATĚJKA, P.; VESELÝ, K.; GRÉZL, F.; ČERNOCKÝ, J. Multilingual BLSTM and Speaker-Specific Vector Adaptation in 2016 BUT BABEL SYSTEM. In Proceedings of SLT 2016. San Diego: IEEE Signal Processing Society, 2016. p. 637-643. ISBN: 978-1-5090-4903-5.
Czech title
Multilingvální BLSTM a adaptace pomocí vektorů specifických pro řečníka ve VUT Babel 2016 systému
Type
conference paper
Language
English
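Authors
Martin Karafiát, Murali Karthick Baskar, Pavel Matějka, Karel Veselý, František Grézl, Jan Černocký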
URL
Keywords

Automatic speech recognition, Multilingual neural networks, Bidirectional Long Short Term Memory, i-vectors, Sequence Summarizing Neural Networks.

Abstract

This paper provides an extensive summary of the BUT 2016 system for the last Babel evaluations. It concentrates on multilingual training of both DNN-based features and acoustic models, and on low-dimensional vector approaches to speaker adaptation.

Annotation

This paper provides an extensive summary of the BUT 2016 system for the last IARPA Babel evaluations. It concentrates on multilingual training of both deep neural network (DNN)-based feature extraction and acoustic models, including multilingual training of bidirectional Long Short-Term Memory (BLSTM) networks. Next, two low-dimensional vector approaches to speaker adaptation are investigated: i-vectors and sequence-summarizing neural networks (SSNN). The results on three Babel Year 4 languages show a clear advantage of both approaches when only a limited amount of training data is available. The time needed to develop a new system is also addressed, as some of the investigated techniques do not require extensive re-training of the whole system.

Published
2016
Pages
637–643
Proceedings
Proceedings of SLT 2016
ISBN
978-1-5090-4903-5
Publisher
IEEE Signal Processing Society
Place
San Diego
DOI
10.1109/SLT.2016.7846330
UT WoS
000399128000093
EID Scopus
BibTeX
@inproceedings{BUT132604,
  author="Martin {Karafiát} and Murali Karthick {Baskar} and Pavel {Matějka} and Karel {Veselý} and František {Grézl} and Jan {Černocký}",
  title="Multilingual BLSTM and Speaker-Specific Vector Adaptation in 2016 BUT BABEL SYSTEM",
  booktitle="Proceedings of SLT 2016",
  year="2016",
  pages="637--643",
  publisher="IEEE Signal Processing Society",
  address="San Diego",
  doi="10.1109/SLT.2016.7846330",
  isbn="978-1-5090-4903-5",
  url="https://www.fit.vut.cz/research/publication/11310/"
}