Publication Details
Multilingual BLSTM and Speaker-Specific Vector Adaptation in 2016 BUT BABEL SYSTEM
Baskar Murali Karthick, Ing., Ph.D.
Matějka Pavel, Ing., Ph.D. (DCGM)
Veselý Karel, Ing., Ph.D. (DCGM)
Grézl František, Ing., Ph.D. (DCGM)
Černocký Jan, prof. Dr. Ing. (DCGM)
Automatic speech recognition, Multilingual neural networks, Bidirectional Long Short Term Memory, i-vectors, Sequence Summarizing Neural Networks.
This paper provides an extensive summary of BUT 2016 system for the last Babel evaluations. It concentrates on multi-lingual training of both DNN-based features and acoustic models and on the lowdimensional to speaker adaptation.
This paper provides an extensive summary of BUT 2016 system for the last IARPA Babel evaluations. It concentrates on multi-lingual training of both deep neural network (DNN)-based feature extraction and acoustic models including multilingual training of bidirectional Long Short Term memory networks. Next, two low-dimensional vector approaches to speaker adaptation are investigated: i-vectors and sequence-summarizing neural networks (SSNN). The results provided on three Babel Year 4 languages show clear advantage of both approaches in case limited amount of training data is available. The time necessary for the development of a new system is addressed too, as some of the investigated techniques do not require extensive re-training of the whole system.
@inproceedings{BUT132604,
author="Martin {Karafiát} and Murali Karthick {Baskar} and Pavel {Matějka} and Karel {Veselý} and František {Grézl} and Jan {Černocký}",
title="Multilingual BLSTM and Speaker-Specific Vector Adaptation in 2016 BUT BABEL SYSTEM",
booktitle="Proceedings of SLT 2016",
year="2016",
pages="637--643",
publisher="IEEE Signal Processing Society",
address="San Diego",
doi="10.1109/SLT.2016.7846330",
isbn="978-1-5090-4903-5",
url="https://www.fit.vut.cz/research/publication/11310/"
}