Publication Details

BUT system for low resource Indian language ASR

PULUGUNDLA, B.; BASKAR, M.; KESIRAJU, S.; EGOROVA, E.; KARAFIÁT, M.; BURGET, L.; ČERNOCKÝ, J. BUT system for low resource Indian language ASR. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018. p. 3182-3186. ISSN: 1990-9772.
Czech title
VUT systém pro rozpoznávání indických jazyků s omezenými zdroji
Type
conference paper
Language
English
Authors
Pulugundla Bhargav, M.Sc.
Baskar Murali Karthick, Ing., Ph.D.
Kesiraju Santosh, Ph.D. (DCGM)
Egorova Ekaterina, Ing., Ph.D.
Karafiát Martin, Ing., Ph.D. (DCGM)
Burget Lukáš, doc. Ing., Ph.D. (DCGM)
Černocký Jan, prof. Dr. Ing. (DCGM)
URL
https://www.isca-speech.org/archive/Interspeech_2018/abstracts/1302.html
Keywords

Indian languages, low resource ASR, multilingual, LF-MMI

Abstract

This paper describes the BUT Jilebi team's speech recognition systems created for the 2018 low resource speech recognition challenge for Indian languages. We investigate modifications of multilingual time-delay neural network (TDNN) architectures with transfer learning and compare them to bi-directional residual memory networks (BRMN) and bi-directional LSTM. Our best submission, based on system combination, achieved word error rates of 13.92% (Tamil), 14.71% (Telugu) and 14.06% (Gujarati). We present the details of the submitted systems and the post-evaluation analysis of lexicon discovery using unsupervised word segmentation.

Published
2018
Pages
3182–3186
Journal
Proceedings of Interspeech, vol. 2018, no. 9, ISSN 1990-9772
Proceedings
Proceedings of Interspeech 2018
Publisher
International Speech Communication Association
Place
Hyderabad
DOI
10.21437/Interspeech.2018-1302
UT WoS
000465363900663
EID Scopus
BibTeX
@inproceedings{BUT155101,
  author="Bhargav {Pulugundla} and Murali Karthick {Baskar} and Santosh {Kesiraju} and Ekaterina {Egorova} and Martin {Karafiát} and Lukáš {Burget} and Jan {Černocký}",
  title="BUT system for low resource Indian language ASR",
  booktitle="Proceedings of Interspeech 2018",
  year="2018",
  journal="Proceedings of Interspeech",
  volume="2018",
  number="9",
  pages="3182--3186",
  publisher="International Speech Communication Association",
  address="Hyderabad",
  doi="10.21437/Interspeech.2018-1302",
  issn="1990-9772",
  url="https://www.isca-speech.org/archive/Interspeech_2018/abstracts/1302.html"
}