Publication Details
BUT OpenSAT 2017 speech recognition system
KARAFIÁT, M.; BASKAR, M.; SZŐKE, I.; MALENOVSKÝ, V.; VESELÝ, K.; GRÉZL, F.; BURGET, L.; ČERNOCKÝ, J. BUT OpenSAT 2017 speech recognition system. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018. p. 2638-2642. ISSN: 1990-9772.
Czech title
VUT systém rozpoznávání řeči pro OpenSAT 2017
Type
conference paper
Language
English
Authors
Karafiát Martin, Ing., Ph.D.
(DCGM)
Baskar Murali Karthick, Ing., Ph.D.
Szőke Igor, Ing., Ph.D. (DCGM)
Malenovský Vladimír, Ing., Ph.D. (DCGM)
Veselý Karel, Ing., Ph.D. (DCGM)
Grézl František, Ing., Ph.D. (DCGM)
Burget Lukáš, doc. Ing., Ph.D. (DCGM)
Černocký Jan, prof. Dr. Ing. (DCGM)
Baskar Murali Karthick, Ing., Ph.D.
Szőke Igor, Ing., Ph.D. (DCGM)
Malenovský Vladimír, Ing., Ph.D. (DCGM)
Veselý Karel, Ing., Ph.D. (DCGM)
Grézl František, Ing., Ph.D. (DCGM)
Burget Lukáš, doc. Ing., Ph.D. (DCGM)
Černocký Jan, prof. Dr. Ing. (DCGM)
URL
Keywords
speech recognition, multilingual training, BLSTM, data augmentation, robustness
Abstract
(ASR) systems for two domains in OpenSAT evaluations: Low Resourced Languages and Public Safety Communications. The first was challenging due to lack of training data, therefore multilingual approaches for BLSTM training were employed and recently published Residual Memory Networks requiring less training data were used. Combination of both approaches led to superior performance. The second domain was challenging due to recording in extreme conditions: specific channel, speaker under stress, high levels of noise. A data augmentation process was very important to get reasonably good performance.
Published
2018
Pages
2638–2642
Journal
Proceedings of Interspeech, vol. 2018, no. 9, ISSN 1990-9772
Proceedings
Proceedings of Interspeech 2018
Publisher
International Speech Communication Association
Place
Hyderabad
DOI
UT WoS
000465363900553
EID Scopus
BibTeX
@inproceedings{BUT155099,
author="Martin {Karafiát} and Murali Karthick {Baskar} and Igor {Szőke} and Vladimír {Malenovský} and Karel {Veselý} and František {Grézl} and Lukáš {Burget} and Jan {Černocký}",
title="BUT OpenSAT 2017 speech recognition system",
booktitle="Proceedings of Interspeech 2018",
year="2018",
journal="Proceedings of Interspeech",
volume="2018",
number="9",
pages="2638--2642",
publisher="International Speech Communication Association",
address="Hyderabad",
doi="10.21437/Interspeech.2018-2457",
issn="1990-9772",
url="https://www.isca-speech.org/archive/Interspeech_2018/abstracts/2457.html"
}