Publication Details

Dealing with Unknowns in Continual Learning for End-to-end Automatic Speech Recognition

ŠŮSTEK, M.; SADHU, S.; HEŘMANSKÝ, H. Dealing with Unknowns in Continual Learning for End-to-end Automatic Speech Recognition. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Incheon: International Speech Communication Association, 2022. p. 1046-1050. ISSN: 1990-9772.

Czech title

Vypořádání se s neznámými testovacími prostředími v kontextu kontinuálního učení a end-to-end automatického rozpoznávače řeči

Type

conference paper

Language

English

Authors

Šůstek Martin, Ing. (DCGM)
SADHU, S.
Heřmanský Hynek, prof. Ing. (DCGM)

URL

https://www.isca-speech.org/archive/pdfs/interspeech_2022/sustek22_interspeech.pdf

Keywords

continual learning, multistream speech recognition, speech recognition

Abstract

Learning continually from data is a task executed effortlessly by humans but remains to be of significant challenge for machines. Moreover, when encountering unknown test scenarios machines fail to generalize. We propose a mathematically motivated dynamically expanding end-to-end model of independent sequence-to-sequence components trained on different data sets that avoid catastrophically forgetting knowledge acquired from previously seen data while seamlessly integrating knowledge from new data. During inference, the likelihoods of the unknown test scenario are computed using internal model activation distributions. The inference made by each independent component is weighted by the normalized likelihood values to obtain the final decision.

Published

2022

Pages

1046–1050

Journal

Proceedings of Interspeech, vol. 2022, no. 9, ISSN 1990-9772

Proceedings

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Publisher

International Speech Communication Association

Place

Incheon

DOI

10.21437/Interspeech.2022-11139

UT WoS

000900724501045

EID Scopus

2-s2.0-85140076092

BibTeX

@inproceedings{BUT182527,
  author="ŠŮSTEK, M. and SADHU, S. and HEŘMANSKÝ, H.",
  title="Dealing with Unknowns in Continual Learning for End-to-end Automatic Speech Recognition",
  booktitle="Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH",
  year="2022",
  journal="Proceedings of Interspeech",
  volume="2022",
  number="9",
  pages="1046--1050",
  publisher="International Speech Communication Association",
  address="Incheon",
  doi="10.21437/Interspeech.2022-11139",
  issn="1990-9772",
  url="https://www.isca-speech.org/archive/pdfs/interspeech_2022/sustek22_interspeech.pdf"
}