Project Details
Interactive Keyword Detector
Project Period: 1. 1. 2006 – 31. 12. 2008
Project Type: grant
Code: GP102/06/P383
Agency: Czech Science Foundation
Program: Postdoctoral grants
Keywords: keyword spotting, phoneme recognition, phoneme lattice
In recent years, we have been involved in the European projects M4 and AMI. One of the objectives of these projects was to develop a browser that allows easy navigation in recorded meetings, locating their interesting parts and playing them back. It became clear that extending the existing browser with fast interactive keyword detection would be extremely valuable. Classical keyword spotting methods based only on the evaluation of statistical acoustic models are too slow for this purpose. The aim of this project is to develop a fast and reliable detector that allows interactive keyword search in tens of hours of recorded meetings. The detector will use a hierarchical approach in which the acoustic data are first converted into phoneme lattices by a phone recognizer. Once a keyword is specified, it can be looked up quickly in the lattices. The keyword occurrences found will then be verified using statistical models on the acoustic data to increase the keyword detection
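The lattice lookup step described above can be illustrated with a minimal sketch. It assumes the lattice is a list of arcs, each carrying a phoneme label, a log-score, and time stamps; the `Arc` type, the `find_keyword` function, and the toy scores are hypothetical and are not taken from the project. The final verification against acoustic models is not shown.

```python
from collections import defaultdict
from typing import NamedTuple


class Arc(NamedTuple):
    """One lattice arc (hypothetical layout, for illustration only)."""
    src: int        # node where the arc starts
    dst: int        # node where the arc ends
    phoneme: str    # phoneme label hypothesised by the recognizer
    logp: float     # log-score attached to the arc
    t_start: float  # start time in seconds
    t_end: float    # end time in seconds


def find_keyword(arcs, phonemes):
    """Return (t_start, t_end, log_score) for every path spelling `phonemes`."""
    if not phonemes:
        return []
    by_src = defaultdict(list)
    for arc in arcs:
        by_src[arc.src].append(arc)

    hits = []
    # Each partial match: (current node, phonemes matched, start, end, score).
    stack = [(a.dst, 1, a.t_start, a.t_end, a.logp)
             for a in arcs if a.phoneme == phonemes[0]]
    while stack:
        node, matched, t0, t1, score = stack.pop()
        if matched == len(phonemes):
            hits.append((t0, t1, score))
            continue
        for a in by_src[node]:
            if a.phoneme == phonemes[matched]:
                stack.append((a.dst, matched + 1, t0, a.t_end, score + a.logp))
    # Best-scoring candidates first; these would go on to verification.
    return sorted(hits, key=lambda h: h[2], reverse=True)


# Toy lattice with two competing phonemes per segment (made-up scores).
toy_arcs = [
    Arc(0, 1, "k", -0.2, 0.00, 0.08), Arc(0, 1, "g", -1.6, 0.00, 0.08),
    Arc(1, 2, "ae", -0.3, 0.08, 0.20), Arc(1, 2, "eh", -1.2, 0.08, 0.20),
    Arc(2, 3, "t", -0.1, 0.20, 0.28), Arc(2, 3, "d", -2.0, 0.20, 0.28),
]
candidates = find_keyword(toy_arcs, ["k", "ae", "t"])
```

Because the lattices are computed once in advance, a query only needs this graph search rather than a new pass over the audio, which is what makes interactive response times plausible.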
2009
- GLEMBEK, O.; BURGET, L.; DEHAK, N.; BRÜMMER, N.; KENNY, P. Comparison of Scoring Methods used in Speaker Recognition with Joint Factor Analysis. Proc. ICASSP 2009. Taipei: IEEE Signal Processing Society, 2009. p. 1-4. ISBN: 978-1-4244-2354-5.
2008
- BURGET, L.; SCHWARZ, P.; MATĚJKA, P.; HANNEMANN, M.; RASTROW, A.; WHITE, C.; KHUDANPUR, S.; HEŘMANSKÝ, H.; ČERNOCKÝ, J. Combination of strongly and weakly constrained recognizers for reliable detection of OOVs. Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Las Vegas: IEEE Signal Processing Society, 2008. p. 1-4. ISBN: 1-4244-1484-9.
- GLEMBEK, O.; MATĚJKA, P.; BURGET, L.; MIKOLOV, T. Advances in Phonotactic Language Recognition. Proc. Interspeech 2008. Proceedings of Interspeech. Brisbane: International Speech Communication Association, 2008. p. 1-4. ISSN: 1990-9772.
- HUBEIKA, V.; BURGET, L.; MATĚJKA, P.; SCHWARZ, P. Discriminative Training and Channel Compensation for Acoustic Language Recognition. Proc. Interspeech 2008. Proceedings of Interspeech. Brisbane: International Speech Communication Association, 2008. p. 1-4. ISSN: 1990-9772.
- KARAFIÁT, M.; BURGET, L.; HAIN, T.; ČERNOCKÝ, J. Discriminative training of narrow band - wide band adaptated systems for meeting recognition. Proc. Interspeech 2008. Proceedings of Interspeech. Brisbane: International Speech Communication Association, 2008. p. 1-4. ISSN: 1990-9772.
- KOCKMANN, M.; BURGET, L. Contour modeling of prosodic and acoustic features for speaker recognition. Proc. 2008 IEEE Workshop on Spoken Language Technology. Goa: IEEE Signal Processing Society, 2008. p. 1-4. ISBN: 978-1-4244-3472-5.
- MATĚJKA, P.; BURGET, L.; GLEMBEK, O.; SCHWARZ, P.; HUBEIKA, V.; FAPŠO, M.; MIKOLOV, T.; PLCHOT, O.; ČERNOCKÝ, J. BUT language recognition system for NIST 2007 evaluations. Proc. Interspeech 2008. Proceedings of Interspeech. Brisbane, Australia: International Speech Communication Association, 2008. p. 1-4. ISSN: 1990-9772.
- OPARIN, I.; GLEMBEK, O.; BURGET, L.; ČERNOCKÝ, J. Morphological random forests for language modeling of inflectional languages. Proc. 2008 IEEE Workshop on Spoken Language Technology. Goa: IEEE Signal Processing Society, 2008. p. 1-4. ISBN: 978-1-4244-3472-5.
- PLCHOT, O.; HUBEIKA, V.; BURGET, L.; SCHWARZ, P.; MATĚJKA, P. Acquisition of Telephone Data from Radio Broadcasts with Applications to Language Recognition. Proc. 11th International Conference on Text, Speech and Dialogue. Berlin: Springer Verlag, 2008. p. 477-483. ISBN: 978-3-540-87390-7.
- SZŐKE, I.; FAPŠO, M.; BURGET, L.; ČERNOCKÝ, J. Hybrid word-subword decoding for spoken term detection. Proc. SSCS 2008: Speech search workshop at SIGIR. Singapore: Association for Computing Machinery, 2008. p. 1-4. ISBN: 978-90-365-2697-5.
2007
- BRÜMMER, N.; BURGET, L.; ČERNOCKÝ, J.; GLEMBEK, O.; GRÉZL, F.; KARAFIÁT, M.; VAN LEEUWEN, D.; MATĚJKA, P.; SCHWARZ, P.; STRASHEIM, A. Fusion of heterogeneous speaker recognition systems in the STBU submission for the NIST speaker recognition evaluation 2006. IEEE Transactions on Audio, Speech, and Language Processing, 2007, vol. 15, no. 7, p. 2072-2084. ISSN: 1558-7916.
- BURGET, L.; MATĚJKA, P.; SCHWARZ, P.; GLEMBEK, O.; ČERNOCKÝ, J. Analysis of feature extraction and channel compensation in GMM speaker recognition system. IEEE Transactions on Audio, Speech, and Language Processing, 2007, vol. 15, no. 7, p. 1979-1986. ISSN: 1558-7916.
- HUBEIKA, V.; BURGET, L.; MATĚJKA, P.; ČERNOCKÝ, J. Channel Compensation for Speaker Recognition. Brno: 2007. p. 1 (1 p.).
- HUBEIKA, V.; SZŐKE, I.; BURGET, L.; ČERNOCKÝ, J. Maximum Likelihood and Maximum Mutual Information Training in Gender and Age Recognition System. In Proc. 10th International Conference on Text Speech and Dialogue (TSD 2007). Pilsen: Springer Verlag, 2007. p. 1-6. ISBN: 978-3-540-74627-0.
- MIKOLOV, T.; OPARIN, I.; GLEMBEK, O.; BURGET, L.; KARAFIÁT, M.; ČERNOCKÝ, J. Použití mluvených korpusů ve vývoji systému pro rozpoznávání českých přednášek. Praha: Univerzita Karlova v Praze, 2007. p. 1-5.
- SZŐKE, I.; BURGET, L.; KARAFIÁT, M. Combination of Word and Phoneme Approach for Spoken Term Detection. Brno: 2007. p. 1 (1 p.).
- SZŐKE, I.; FAPŠO, M.; KARAFIÁT, M.; BURGET, L.; GRÉZL, F.; SCHWARZ, P.; GLEMBEK, O.; MATĚJKA, P.; KOPECKÝ, J.; ČERNOCKÝ, J. Spoken Term Detection System Based on a Combination of LVCSR and Phonetic Search. Brno: 2007. p. 1 (1 p.).
2006
- KARAFIÁT, M.; GRÉZL, F.; SCHWARZ, P.; BURGET, L.; ČERNOCKÝ, J. Robust heteroscedastic linear discriminant analysis and LCRC posterior features in meeting data recognition. In Proc. 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006). LNCS 4299. Berlin: Springer Verlag, 2006. p. 275-284. ISBN: 3-540-69267-3.