Publication Details

Manual and Semi-Automatic Approaches to Building a Multilingual Phoneme Set

EGOROVA, E.; VESELÝ, K.; KARAFIÁT, M.; JANDA, M.; ČERNOCKÝ, J. Manual and Semi-Automatic Approaches to Building a Multilingual Phoneme Set. In Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013. p. 7324-7328. ISBN: 978-1-4799-0355-9.

Czech title

Manuální a poloautomatické přístupy k tvorbě multilingvální fonémové sady

Type

conference paper

Language

English

Authors

Egorova Ekaterina, Ing., Ph.D.
Veselý Karel, Ing., Ph.D. (DCGM)
Karafiát Martin, Ing., Ph.D. (DCGM)
Janda Miloš, Ing.
Černocký Jan, prof. Dr. Ing. (DCGM)

URL

http://www.fit.vutbr.cz/research/groups/speech/publi/2013/egorova_icassp2013_0007324.pdf PDF

Keywords

multilingual speech recognition, phoneme setmapping, phoneme confusion matrix

Abstract

This articles describes a comparison between manual and semi-automatic approachesto building a multilingual phoneme set. The two approacheswere compared in cases of 1) a multilingual system withabundant data for all the languages, 2) multilingual systems excludingtarget language 3) multilingual systems with small amount ofdata for target languages. The work shows that careful choice ofmerging methods can help improve recognition of languages with noor little training data and reasonably reduce multilingual phonemeset without losing a lot of accuracy.

Annotation

The paper addresses manual and semi-automatic approaches to building a multilingual phoneme set for automatic speech recognition. The first approach involves mapping and reduction of the phoneme set based on IPA and expert knowledge, the later one involves phoneme confusion matrix generated by a neural network. The comparison is done for 8 languages selected from GlobalPhone on three scenarios: 1) multilingual system with abundant data for all the languages, 2) multilingual systems excluding target language 3) multilingual systems with small amount of data for target languages. For 3), the multilingual system brought improvement for languages close enough to the others in the set.

Published

2013

Pages

7324–7328

Proceedings

Proceedings of ICASSP 2013

Conference

38th International Conference on Acoustics, Speech, and Signal Processing, Vancouver, CA

ISBN

978-1-4799-0355-9

Publisher

IEEE Signal Processing Society

Place

Vancouver

UT WoS

000329611507098

BibTeX

@inproceedings{BUT103490,
  author="Ekaterina {Egorova} and Karel {Veselý} and Martin {Karafiát} and Miloš {Janda} and Jan {Černocký}",
  title="Manual and Semi-Automatic Approaches to Building a Multilingual Phoneme Set",
  booktitle="Proceedings of ICASSP 2013",
  year="2013",
  pages="7324--7328",
  publisher="IEEE Signal Processing Society",
  address="Vancouver",
  isbn="978-1-4799-0355-9",
  url="https://www.fit.vut.cz/research/publication/10323/"
}

Files

pdf egorova_icassp2013_0007324.pdf 221 kB