Publication Details

Subword-based spoken term detection in audio course lectures

ROSE, R.; NOROUZIAN, A.; REDDY, A.; COY, A.; GUPTA, V.; KARAFIÁT, M. Subword-based spoken term detection in audio course lectures. Proc. International Conference on Acoustics, Speech, and Signal Processing. Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010. p. 5282-5285. ISBN: 978-1-4244-4296-6. ISSN: 1520-6149.
Czech title
Pod-slovní jednotky pro detekci klíčových frází v audiozáznamech přednášek
Type
conference paper
Language
English
Authors
Rose Richard
Norouzian Atta
Reddy Aarthi
Coy Andre
Gupta Vishwa
Karafiát Martin, Ing., Ph.D. (DCGM)
URL
Keywords

Speech recognition, spoken term detection

Abstract

This paper regards the subword-based spoken term detection in audio course lectures. It investigates spoken term dection (STD) from audio recordings.

Annotation

This paper investigates spoken term detection (STD) from audio recordings of course lectures obtained from an existing media repository. STD is performed from word lattices generated offline using an automatic speech recognition (ASR) system configured from a meetings domain. An efficient STD approach is presented where lattice paths which are likely to contain search terms are identified and an efficient phone based distance is used to detect the occurrence of search terms in phonetic expansions of promising lattice paths. STD and ASR results are reported for both in-vocabulary (IV) and outof- vocabulary (OOV) search terms in this lecture speech domain.

Published
2010
Pages
5282–5285
Journal
Proc. International Conference on Acoustics, Speech, and Signal Processing, vol. 2010, no. 3, ISSN 1520-6149
Proceedings
Proc. International Conference on Acoustics, Speech, and Signal Processing
ISBN
978-1-4244-4296-6
Publisher
IEEE Signal Processing Society
Place
Dallas
BibTeX
@inproceedings{BUT34923,
  author="Richard {Rose} and Atta {Norouzian} and Aarthi {Reddy} and Andre {Coy} and Vishwa {Gupta} and Martin {Karafiát}",
  title="Subword-based spoken term detection in audio course lectures",
  booktitle="Proc. International Conference on Acoustics, Speech, and Signal Processing",
  year="2010",
  journal="Proc. International Conference on Acoustics, Speech, and Signal Processing",
  volume="2010",
  number="3",
  pages="5282--5285",
  publisher="IEEE Signal Processing Society",
  address="Dallas",
  isbn="978-1-4244-4296-6",
  issn="1520-6149",
  url="http://www.fit.vutbr.cz/research/groups/speech/publi/2010/rose_icassp2010_5282.pdf"
}
Back to top