Publications
-
2023
SKOWRON, M.; BACKFRIED, G.; NAVAS, E.; BERZINŠ, A.; VAN, J.; DE, F.; DEMARCO, A.; POLÁK, P.; KOVÁČ, M.; POLÁK, P.; ROHDIN, J.; ROSNER, M.; SANCHEZ, J.; SARATXAGA, I.; SCHWARZ, P. Deep Dive Speech Technology. In European Language Equality. Cham: Springer Nature Switzerland AG, 2023.
p. 289-312. ISBN: 978-3-031-28819-7. Detail -
2018
BARTOS, A.; CIPR, T.; NELSON, D.; SCHWARZ, P.; BANOWETZ, J.; JERABEK, L. Noise-robust speech triage. Journal of the Acoustical Society of America, 2018, vol. 143, no. 4,
p. 2313-2320. ISSN: 1520-8524. DetailSILNOVA, A.; MATĚJKA, P.; GLEMBEK, O.; PLCHOT, O.; NOVOTNÝ, O.; GRÉZL, F.; SCHWARZ, P.; ČERNOCKÝ, J. BUT/Phonexia Bottleneck Feature Extractor. In Proceedings of Odyssey 2018. Proceedings of Odyssey: The Speaker and Language Recognition Workshop Odyssey 2014, Joensuu, Finland. Les Sables d´Olonne: International Speech Communication Association, 2018.
p. 283-287. ISSN: 2312-2846. Detail -
2015
GLEMBEK, O.; MATĚJKA, P.; BURGET, L.; SCHWARZ, P.; PEŠÁN, J.; PLCHOT, O. Voice-print transformation for migration between automatic speaker identification systems. Abstract book of the 7th European Academy of Forensic Science Conference. Praha: Criminal Police Department Prague, 2015.
p. 345-345. ISBN: 978-80-260-8659-8. DetailGLEMBEK, O.; MATĚJKA, P.; PLCHOT, O.; PEŠÁN, J.; BURGET, L.; SCHWARZ, P. Migrating i-vectors Between Speaker Recognition Systems Using Regression Neural Networks. In Proceedings of Interspeech 2015. Proceedings of Interspeech. Dresden: International Speech Communication Association, 2015.
p. 2327-2331. ISBN: 978-1-5108-1790-6. ISSN: 1990-9772. Detail -
2013
KHOURY, E.; VESNICER, B.; FRANCO-PEDROSO, J.; DIEZ SÁNCHEZ, M.; CIPR, T.; SCHWARZ, P.; VAN LEEUWEN, D.; PETROVSKA-DELACRETAZ, D.; MATĚJKA, P.; RODRIGUEZ-FUENTES, L.; CHOLLET, G.; MARCEL, S. The 2013 Speaker Recognition Evaluation in Mobile Environment. Proceedings of Biometrics (ICB), 2013 International Conference on. Madrid: IEEE Biometric Council, 2013.
p. 1-8. ISBN: 978-1-4799-0310-8. Detail -
2011
POVEY, D.; BURGET, L.; AGARWAL, M.; AKYAZI, P.; GHOSHAL, A.; GLEMBEK, O.; GOEL, N.; KARAFIÁT, M.; RASTROW, A.; ROSE, R.; SCHWARZ, P.; THOMAS, S. The subspace Gaussian mixture model-A structured model for speech recognition. COMPUTER SPEECH AND LANGUAGE, 2011, vol. 25, no. 2,
p. 404-439. ISSN: 0885-2308. DetailPOVEY, D.; GHOSHAL, A.; BOULIANNE, G.; BURGET, L.; GLEMBEK, O.; GOEL, N.; HANNEMANN, M.; MOTLÍČEK, P.; QIAN, Y.; SCHWARZ, P.; SILOVSKÝ, J.; STEMMER, G.; VESELÝ, K. The Kaldi Speech Recognition Toolkit. Proceedings of ASRU 2011. Hilton Waikoloa Village Resort, Hawaii: IEEE Signal Processing Society, 2011.
p. 1-4. ISBN: 978-1-4673-0366-8. DetailPOVEY, D.; KARAFIÁT, M.; GHOSHAL, A.; SCHWARZ, P. A Symmetrization of the Subspace Gaussian Mixture Model. Proceedings of 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing. Praha: IEEE Signal Processing Society, 2011.
p. 4504-4507. ISBN: 978-1-4577-0537-3. Detail -
2010
BURGET, L.; SCHWARZ, P.; AGARWAL, M.; AKYAZI, P.; FENG, K.; GHOSHAL, A.; GLEMBEK, O.; GOEL, N.; KARAFIÁT, M.; POVEY, D.; RASTROW, A.; ROSE, R.; THOMAS, S. Multilingual acoustic modeling for speech recognition based on Subspace Gaussian Mixture Models. Proc. International Conference on Acoustictics, Speech, and Signal Processing. Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010.
p. 4334-4337. ISBN: 978-1-4244-4296-6. ISSN: 1520-6149. DetailGHOSHAL, A.; POVEY, D.; AGARWAL, M.; AKYAZI, P.; BURGET, L.; FENG, K.; GLEMBEK, O.; GOEL, N.; KARAFIÁT, M.; RASTROW, A.; ROSE, R.; SCHWARZ, P.; THOMAS, S. A novel estimation of feature-space MLLR for full_covariance models. Proc. International Conference on Acoustics, Speech, and Signal Processing. Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010.
p. 4310-4313. ISBN: 978-1-4244-4296-6. ISSN: 1520-6149. DetailGOEL, N.; THOMAS, S.; AGARWAL, M.; AKYAZI, P.; BURGET, L.; FENG, K.; GHOSHAL, A.; GLEMBEK, O.; KARAFIÁT, M.; POVEY, D.; RASTROW, A.; ROSE, R.; SCHWARZ, P. Approaches to automatic lexicon learning with limited training examples. Proc. International Conference on Acoustics, Speech, and Signal Processing. Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010.
p. 5094-5097. ISBN: 978-1-4244-4296-6. ISSN: 1520-6149. DetailPOVEY, D.; BURGET, L.; AGARWAL, M.; AKYAZI, P.; FENG, K.; GHOSHAL, A.; GLEMBEK, O.; GOEL, N.; KARAFIÁT, M.; RASTROW, A.; ROSE, R.; SCHWARZ, P.; THOMAS, S. Subspace Gaussian mixture models for speech recognition. Proc. International Conference on Acoustics, Speech, and Signal Processing. Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010.
p. 4330-4333. ISBN: 978-1-4244-4296-6. ISSN: 1520-6149. Detail -
2009
BURGET, L.; FAPŠO, M.; HUBEIKA, V.; GLEMBEK, O.; KARAFIÁT, M.; KOCKMANN, M.; MATĚJKA, P.; SCHWARZ, P.; ČERNOCKÝ, J. BUT system for NIST 2008 speaker recognition evaluation. Proc. Interspeech 2009. Proceedings of Interspeech. Brighton: International Speech Communication Association, 2009.
p. 2335-2338. ISBN: 978-1-61567-692-7. ISSN: 1990-9772. Detail -
2008
BURGET, L.; FAPŠO, M.; HUBEIKA, V.; GLEMBEK, O.; KARAFIÁT, M.; KOCKMANN, M.; MATĚJKA, P.; SCHWARZ, P.; ČERNOCKÝ, J. Brno University Of Technology - NIST 2008 SRE. Montreal: 2008.
p. 1-28. DetailBURGET, L.; SCHWARZ, P.; MATĚJKA, P.; HANNEMANN, M.; RASTROW, A.; WHITE, C.; KHUDANPUR, S.; HEŘMANSKÝ, H.; ČERNOCKÝ, J. Combination of strongly and weakly constrained recognizers for reliable detection of OOVs. Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Las Vegas: IEEE Signal Processing Society, 2008.
p. 1-4. ISBN: 1-4244-1484-9. DetailHUBEIKA, V.; BURGET, L.; MATĚJKA, P.; SCHWARZ, P. Discriminative Training and Channel Compensation for Acoustic Language Recognition. Proc. Interspeech 2008. Proceedings of Interspeech. Brisbane: International Speech Communication Association, 2008.
p. 1-4. ISSN: 1990-9772. DetailMATĚJKA, P.; BURGET, L.; GLEMBEK, O.; SCHWARZ, P.; HUBEIKA, V.; FAPŠO, M.; MIKOLOV, T.; PLCHOT, O.; ČERNOCKÝ, J. BUT language recognition system for NIST 2007 evaluations. Proc. Interspeech 2008. Proceedings of Interspeech. Brisbane, Australia: International Speech Communication Association, 2008.
p. 1-4. ISSN: 1990-9772. DetailPLCHOT, O.; HUBEIKA, V.; BURGET, L.; SCHWARZ, P.; MATĚJKA, P. Acquisition of Telephone Data from Radio Broadcasts with Applications to Language Recognition. Proc. 11th International Conference on Text, Speech and Dialogue. Berlin: Springer Verlag, 2008.
p. 477-483. ISBN: 978-3-540-87390-7. DetailWHITE, C.; ZWEIG, G.; BURGET, L.; SCHWARZ, P.; HEŘMANSKÝ, H. Confidence estimation, OOV detection and language ID using phone-to-word transduction and phone-level alignments. Proc. 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing. Las Vegas: IEEE Signal Processing Society, 2008.
p. 1-4. ISBN: 1-4244-1484-9. Detail -
2007
BRÜMMER, N.; BURGET, L.; ČERNOCKÝ, J.; GLEMBEK, O.; GRÉZL, F.; KARAFIÁT, M.; VAN LEEUWEN, D.; MATĚJKA, P.; SCHWARZ, P.; STRASHEIM, A. Fusion of heterogeneous speaker recognition systems in the STBU submission for the NIST speaker recognition evaluation 2006. IEEE Transactions on Audio, Speech, and Language Processing, 2007, vol. 15, no. 7,
p. 2072-2084. ISSN: 1558-7916. DetailBURGET, L.; MATĚJKA, P.; SCHWARZ, P.; GLEMBEK, O.; ČERNOCKÝ, J. Analysis of feature extraction and channel compensation in GMM speaker recognition system. IEEE Transactions on Audio, Speech, and Language Processing, 2007, vol. 15, no. 7,
p. 1979-1986. ISSN: 1558-7916. DetailMATĚJKA, P.; BURGET, L.; SCHWARZ, P.; GLEMBEK, O.; KARAFIÁT, M.; GRÉZL, F.; ČERNOCKÝ, J.; VAN LEEUWEN, D.; BRÜMMER, N.; STRASHEIM, A. STBU system for the NIST 2006 speaker recognition evaluation. Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007). Honolulu: IEEE Signal Processing Society, 2007.
p. 221-224. ISBN: 1-4244-0728-1. DetailSZŐKE, I.; FAPŠO, M.; KARAFIÁT, M.; BURGET, L.; GRÉZL, F.; SCHWARZ, P.; GLEMBEK, O.; MATĚJKA, P.; KOPECKÝ, J.; ČERNOCKÝ, J. Spoken Term Detection System Based on a Combination of LVCSR and Phonetic Search. Brno: 2007.
p. 1 (1 s.). Detail -
2006
FAPŠO, M.; SMRŽ, P.; SCHWARZ, P.; SZŐKE, I.; SCHWARZ, M.; ČERNOCKÝ, J.; KARAFIÁT, M.; BURGET, L. Information Retrieval from Spoken Documents. Proceedings of the Seventh International Conference on Intelligent Text Processing and Computational Linguistics (CICLING 2006). Mexico City: Springer Verlag, 2006.
p. 410-416. ISBN: 3-540-32205-1. DetailKARAFIÁT, M.; GRÉZL, F.; SCHWARZ, P.; BURGET, L.; ČERNOCKÝ, J. Robust heteroscedastic linear discriminant analysis and LCRC posterior features in meeting data recognition. Proc. 3nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006). Lecture Notes in Computer Science. Berlin: Springer Verlag, 2006.
p. 275-284. ISBN: 3-540-69267-3. DetailMATĚJKA, P.; BURGET, L.; SCHWARZ, P.; ČERNOCKÝ, J. Brno University of Technology System for NIST 2005 Language Recognition Evaluation. Proceedings of Odyssey 2006: The Speaker and Language Recognition Workshop. San Juan: 2006.
p. 57-64. ISBN: 1-4244-0472-X. Detail -
2005
FAPŠO, M.; SCHWARZ, P.; SZŐKE, I.; ČERNOCKÝ, J.; SMRŽ, P.; BURGET, L.; KARAFIÁT, M. Search Engine for Information Retrieval from Multi-modal Records. Edinburgh: 2005.
p. 0-0. DetailFAPŠO, M.; SMRŽ, P.; SCHWARZ, P.; SZŐKE, I.; BURGET, L.; KARAFIÁT, M.; ČERNOCKÝ, J. Systém pre efektívne vyhľadávanie v rečových databázach. Sborník databázové konference DATAKON 2005. Brno: Masaryk University, 2005.
s. 323-333. ISBN: 80-210-3813-6. DetailMATĚJKA, P.; SCHWARZ, P.; ČERNOCKÝ, J.; CHYTIL, P. Phonotactic Language Identification using High Quality Phoneme Recognition. Interspeech'2005 - Eurospeech - 9th European Conference on Speech Communication and Technology. European Conference EUROSPEECH. Lisbon: International Speech Communication Association, 2005.
p. 2237-2240. ISSN: 1018-4074. DetailMATĚJKA, P.; SCHWARZ, P.; ČERNOCKÝ, J.; CHYTIL, P. Phonotactic Language Identification. Proceedings of Radioelektronika 2005. Brno: Faculty of Electrical Engineering and Communication BUT, 2005.
p. 140-143. ISBN: 80-214-2904-6. DetailMATĚJKA, P.; SCHWARZ, P.; ČERNOCKÝ, J.; CHYTIL, P. Tuning Phonotactic Language Identificaion System. Brno: Faculty of Information Technology BUT, 2005.
p. 1-5. DetailSZŐKE, I.; SCHWARZ, P.; BURGET, L.; KARAFIÁT, M.; ČERNOCKÝ, J. Phoneme based acoustics keyword spotting in informal continuous speech. Radioelektronika 2005. Brno: Faculty of Electrical Engineering and Communication BUT, 2005.
p. 195-198. ISBN: 80-214-2904-6. DetailSZŐKE, I.; SCHWARZ, P.; BURGET, L.; KARAFIÁT, M.; MATĚJKA, P.; ČERNOCKÝ, J. Phoneme Based Acoustics Keyword Spotting in Informal Continuous Speech. Lecture Notes in Computer Science, 2005, vol. 2005, no. 3658,
p. 302-309. ISSN: 0302-9743. Detail -
2004
MATĚJKA, P.; SZŐKE, I.; SCHWARZ, P.; ČERNOCKÝ, J. Automatic Language Identification using Phoneme and Automatically Derived Unit Strings. Proceedings of 7th International Conference Text,Speech and Dialoque 2004. Brno: Springer Verlag, 2004.
p. 147-154. ISBN: 3-540-23049-1. DetailMATĚJKA, P.; SZŐKE, I.; SCHWARZ, P.; ČERNOCKÝ, J. Automatic Language Identification using Phoneme and Automatically Derived Unit Strings. Lecture Notes in Computer Science, 2004, vol. 2004, no. 3206,
p. 147-154. ISSN: 0302-9743. DetailSCHWARZ, P.; MATĚJKA, P. Phoneme Recognition from a Long Temporal Context. Martigny: 2004.
p. 0 (1 s.). DetailSCHWARZ, P.; MATĚJKA, P.; ČERNOCKÝ, J. Phoneme Recognition from a Long Temporal Context. poster at JOINT AMI/PASCAL/IM2/M4 Workshop on Multimodal Interaction and Related Machine Learning Algorithms. Martigny: Institute for Perceptual Artificial Intelligence, 2004.
p. 1 (1 s.). DetailSCHWARZ, P.; MATĚJKA, P.; ČERNOCKÝ, J. Towards Lower Error Rates in Phoneme Recognition. Proceedings of 7th International Conference Text,Speech and Dialoque 2004. Brno: Springer Verlag, 2004.
p. 465-472. ISBN: 3-540-23049-1. DetailSCHWARZ, P.; MATĚJKA, P.; ČERNOCKÝ, J. Towards Lower Error Rates in Phoneme Recognition. Lecture Notes in Computer Science, 2004, vol. 2004, no. 3206,
p. 465-472. ISSN: 0302-9743. DetailSCHWARZ, P.; MATĚJKA, P.; ČERNOCKÝ, J. Towards Lower Error Rates In Phoneme Recognition. Lecture Notes in Computer Science, 2004, vol. 2004, no. 3206,
p. 465-472. ISSN: 0302-9743. Detail -
2003
MATĚJKA, P.; SCHWARZ, P.; GRÉZL, F.; ČERNOCKÝ, J. Phoneme Classification using Temporal Patterns. Proc. 13th International scientific conference Radioelektronika 2003. Brno: Faculty of Electrical Engineering and Communication BUT, 2003.
p. 1-4. ISBN: 80-214-2383-8. DetailMATĚJKA, P.; SCHWARZ, P.; HEŘMANSKÝ, H.; ČERNOCKÝ, J. Phoneme Recognition using Temporal Patterns. Proc. 6th International Conference Text, Speech and Dialogue, TSD2003. Ceske Budejovice: Springer Verlag, 2003.
p. 465-472. ISBN: 3-540-20024-X. DetailSCHWARZ, P. Would You Like To Make Your Programs Understand Human Voice?. Proceedings of 9th Conference STUDENT EEICT 2003. Brno: Faculty of Electrical Engineering and Communication BUT, 2003.
p. 231-235. ISBN: 80-214-2379-X. DetailSCHWARZ, P.; HEŘMANSKÝ, H.; MATĚJKA, P. Použití časové dynamiky k rozpoznávání jazyků z mluvené řeči. Proceedings of Language Recognition Workshop 2003. NIST Gaithersburg, MD USA: 2003.
s. 56-62. DetailSCHWARZ, P.; MATĚJKA, P.; ČERNOCKÝ, J. Recognition of Phoneme Strings using TRAP Technique. Proceedings of 8th International Conference Eurospeech. European Conference EUROSPEECH. Geneve: International Speech Communication Association, 2003.
p. 1-4. ISSN: 1018-4074. Detail -
2002
MATĚJKA, P.; SCHWARZ, P.; KARAFIÁT, M.; ČERNOCKÝ, J. Some like it Gaussian... Proc. 5th International Conference Text, Speech and Dialogue, TSD2002. Lecture notes in artificial intelligence 2448. Berlin: Springer Verlag, 2002.
p. 321-324. ISBN: 3-540-44129-8. DetailSCHWARZ, P. Modifications of Viterbi algorithms for keyword detection. Proceedings of 8th Conference STUDENT EEICT 2002. Brno: Faculty of Electrical Engineering and Communication BUT, 2002.
p. 0-0. ISBN: 80-214-2116-9. DetailSCHWARZ, P.; ČERNOCKÝ, J. Keyword detection in Czech fluent speech. Proc. 12th International scientific conference Radioelektronika 2002. Bratislava: Slovak University of Technology in Bratislava, 2002.
p. 1-4. ISBN: 80-227-1700-2. Detail