Publication Details

Towards Lower Error Rates in Phoneme Recognition

SCHWARZ, P., MATĚJKA, P., ČERNOCKÝ, J. Towards Lower Error Rates in Phoneme Recognition. In Proceedings of 7th International Conference Text,Speech and Dialoque 2004. Brno: Springer Verlag, 2004. p. 465 ( p.)ISBN: 3-540-23049-1.
Czech title
Postup k Nižší Chybovosti Fonémového Rozpoznávače
Type
conference paper
Language
English
Authors
URL
Keywords

phoneme recognition, traps, speech recognition, feature extraction

Abstract

We investigate techniques for acoustic modeling in automatic recognition of context-independent phoneme strings from the TIMIT database. The baseline phoneme recognizer is based on TempoRAl Patterns (TRAP). This recognizer is simplified to shorten processing times and reduce computational requirements. More states per phoneme and bi-gram language models are incorporated into the system and evaluated. The question of insufficient amount of training data is discussed and the system is improved. All modifications lead to a faster system with about 23.6% relative improvement over the baseline in phoneme error rate.

Annotation

We investigate techniques for acoustic modeling in automatic recognition of context-independent phoneme strings from the TIMIT database. The baseline phoneme recognizer is based on TempoRAl Patterns (TRAP). This recognizer is simplified to shorten processing times and reduce computational requirements. More states per phoneme and bi-gram language models are incorporated into the system and evaluated. The question of insufficient amount of training data is discussed and the system is improved. All modifications lead to a faster system with about 23.6% relative improvement over the baseline in phoneme error rate.

Published
2004
Pages
8
Proceedings
Proceedings of 7th International Conference Text,Speech and Dialoque 2004
ISBN
3-540-23049-1
Publisher
Springer Verlag
Place
Brno
BibTeX
@inproceedings{BUT17585,
  author="Petr {Schwarz} and Pavel {Matějka} and Jan {Černocký}",
  title="Towards Lower Error Rates in Phoneme Recognition",
  booktitle="Proceedings of 7th International Conference Text,Speech and Dialoque 2004",
  year="2004",
  pages="8",
  publisher="Springer Verlag",
  address="Brno",
  isbn="3-540-23049-1",
  url="http://www.fit.vutbr.cz/~matejkap/publi/2004/tsd2004_phn.pdf"
}
Back to top