Publication Details

Tuning phone decoders for language identification

SANTHOSH KUMAR, C.; LI, H.; TONG, R.; MATĚJKA, P.; BURGET, L.; ČERNOCKÝ, J. Tuning phone decoders for language identification. Proc. International Conference on Acoustics, Speech, and Signal Processing 2010. Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010. p. 5010-5013. ISBN: 978-1-4244-4296-6. ISSN: 1520-6149.

Czech title

Ladění fonémových dekodérů pro identifikaci jazyka

Type

conference paper

Language

English

Authors

Santhosh Kumar Chellappan Pillai
Li Haizhou
Tong Rong
Matějka Pavel, Ing., Ph.D. (DCGM)
Burget Lukáš, doc. Ing., Ph.D. (DCGM)
Černocký Jan, prof. Dr. Ing. (DCGM)

URL

http://www.fit.vutbr.cz/research/groups/speech/publi/2010/kumar_icassp2010_5010.pdf

Keywords

Phonotactic language identification, hidden Markov models, neural networks, mutual information, multilingual

Abstract

This paper is on tuning phone decoders for language identification. In this work, we explore how language identification accuracy of a phone decoder can be enhanced.

Annotation

Phonotactic approach, phone recognition to be followed by language modeling, is one of the most popular approaches to language identification (LID). In this work, we explore how language identification accuracy of a phone decoder can be enhanced by varying acoustic resolution of the phone decoder, and subsequently how multiresolution versions of the same decoder can be integrated to improve the LID accuracy. We use mutual information to select the optimum set of phones for a specific acoustic resolution. Further, we propose strategies for building multilingual systems suitable for LID applications, and subsequently fine tune these systems to enhance the overall accuracy.

Published

2010

Pages

5010–5013

Journal

Proc. International Conference on Acoustics, Speech, and Signal Processing, vol. 2010, no. 3, ISSN 1520-6149

Proceedings

Proc. International Conference on Acoustics, Speech, and Signal Processing 2010

ISBN

978-1-4244-4296-6

Publisher

IEEE Signal Processing Society

Place

Dallas

BibTeX

@inproceedings{BUT34848,
  author="Chellappan Pillai {Santhosh Kumar} and Haizhou {Li} and Rong {Tong} and Pavel {Matějka} and Lukáš {Burget} and Jan {Černocký}",
  title="Tuning phone decoders for language identification",
  booktitle="Proc. International Conference on Acoustics, Speech, and Signal Processing 2010",
  year="2010",
  journal="Proc. International Conference on Acoustics, Speech, and Signal Processing",
  volume="2010",
  number="3",
  pages="5010--5013",
  publisher="IEEE Signal Processing Society",
  address="Dallas",
  isbn="978-1-4244-4296-6",
  issn="1520-6149",
  url="http://www.fit.vutbr.cz/research/groups/speech/publi/2010/kumar_icassp2010_5010.pdf"
}