Publication Details
Phoneme Based Acoustics Keyword Spotting in Informal Continuous Speech
SZŐKE, I.; SCHWARZ, P.; BURGET, L.; KARAFIÁT, M.; MATĚJKA, P.; ČERNOCKÝ, J. Phoneme Based Acoustics Keyword Spotting in Informal Continuous Speech. Lecture Notes in Computer Science, 2005, vol. 2005, no. 3658, p. 302-309. ISSN: 0302-9743.
Czech title
Fonémový detektor klíčových slov založený na akustice pro neformální konverzační řeč
Type
journal article
Language
English
Authors
Szőke Igor, Ing., Ph.D. (DCGM)
Schwarz Petr, Ing., Ph.D. (DCGM)
Burget Lukáš, doc. Ing., Ph.D. (DCGM)
Karafiát Martin, Ing., Ph.D. (DCGM)
Matějka Pavel, Ing., Ph.D. (DCGM)
Černocký Jan, prof. Dr. Ing. (DCGM)
Keywords
acoustic keyword spotting, hidden Markov model, phoneme, recognition network
Abstract
This paper describes several approaches to acoustic keyword spotting (KWS), based on Gaussian mixture model (GMM) hidden Markov models (HMM) and on phoneme posterior probabilities from FeatureNet. Both context-independent and context-dependent phoneme models are used in the GMM/HMM system. The systems were trained and evaluated on informal continuous speech. We used KWS recognition networks of different complexity and different types of phoneme models. We study the impact of these parameters on accuracy and computational complexity, and conclude that phoneme posteriors outperform the conventional GMM/HMM system.
Published
2005
Pages
302–309
Journal
Lecture Notes in Computer Science, vol. 2005, no. 3658, ISSN 0302-9743
BibTeX
@article{BUT42913,
author="Igor {Szőke} and Petr {Schwarz} and Lukáš {Burget} and Martin {Karafiát} and Pavel {Matějka} and Jan {Černocký}",
title="Phoneme Based Acoustics Keyword Spotting in Informal Continuous Speech",
journal="Lecture Notes in Computer Science",
year="2005",
volume="2005",
number="3658",
pages="302--309",
issn="0302-9743",
url="https://www.fit.vut.cz/research/publication/7882/"
}