Publication Details
Dealing with Numbers in Grapheme-Based Speech Recognition
LVCSR, ASR, grapheme, phoneme, speech recognition.
Grapheme-based speech recognition approach is suitable in situation of low resource languages, where obtaining of pronunciation dictionary is time- and cost-consuming. The paper describes the process of automatic generation of pronunciation dictionaries with emphasis on the expansion of numbers and presents results on GlobalPhone database.
This article presents the results of grapheme-based speech recognition for eight languages. The need for this approach arises in situation of low resource languages, where obtaining a pronunciation dictionary is time- and cost-consuming or impossible. In such scenarios, usage of grapheme dictionaries is the most simplest and straight-forward. The paper describes the process of automatic generation of pronunciation dictionaries with emphasis on the expansion of numbers. Experiments on GlobalPhone database show that grapheme-based systems have results comparable to the phoneme-based ones, especially for phonetic languages.
@inproceedings{BUT97033,
author="Miloš {Janda} and Martin {Karafiát} and Jan {Černocký}",
title="Dealing with Numbers in Grapheme-Based Speech Recognition",
booktitle="Proceedings of 15th International Conference on Text, Speech and Dialogue",
year="2012",
series="Lecture Notes in Computer Science, 2012, Volume 7499",
journal="Lecture Notes in Computer Science",
volume="2012",
number="9",
pages="438--445",
publisher="Springer Verlag",
address="Springer-Verlag Berlin Heidelberg 2012",
doi="10.1007/978-3-642-32790-2\{_}53",
isbn="978-3-642-32789-6",
issn="0302-9743",
url="http://www.springerlink.com/content/yx9807202033v381/"
}