Publication Details
Subword-based spoken term detection in audio course lectures
Speech recognition, spoken term detection
This paper regards the subword-based spoken term detection in audio course lectures. It investigates spoken term dection (STD) from audio recordings.
This paper investigates spoken term detection (STD) from audio recordings of course lectures obtained from an existing media repository. STD is performed from word lattices generated offline using an automatic speech recognition (ASR) system configured from a meetings domain. An efficient STD approach is presented where lattice paths which are likely to contain search terms are identified and an efficient phone based distance is used to detect the occurrence of search terms in phonetic expansions of promising lattice paths. STD and ASR results are reported for both in-vocabulary (IV) and outof- vocabulary (OOV) search terms in this lecture speech domain.
@inproceedings{BUT34923,
author="Richard {Rose} and Atta {Norouzian} and Aarthi {Reddy} and Andre {Coy} and Vishwa {Gupta} and Martin {Karafiát}",
title="Subword-based spoken term detection in audio course lectures",
booktitle="Proc. International Conference on Acoustics, Speech, and Signal Processing",
year="2010",
journal="Proc. International Conference on Acoustics, Speech, and Signal Processing",
volume="2010",
number="3",
pages="5282--5285",
publisher="IEEE Signal Processing Society",
address="Dallas",
isbn="978-1-4244-4296-6",
issn="1520-6149",
url="http://www.fit.vutbr.cz/research/groups/speech/publi/2010/rose_icassp2010_5282.pdf"
}