Publications
-
2022
KOCOUR, M.; UMESH, J.; KARAFIÁT, M.; ŠVEC, J.; LOPEZ, F.; BENEŠ, K.; DIEZ SÁNCHEZ, M.; SZŐKE, I.; LUQUE, J.; VESELÝ, K.; BURGET, L.; ČERNOCKÝ, J. BCN2BRNO: ASR System Fusion for Albayzin 2022 Speech to Text Challenge. Proceedings of IberSpeech 2022. Granada: International Speech Communication Association, 2022.
p. 276-280. Detail -
2021
KARAFIÁT, M.; VESELÝ, K.; ČERNOCKÝ, J.; PROFANT, J.; NYTRA, J.; HLAVÁČEK, M.; PAVLÍČEK, T. Analysis of X-Vectors for Low-Resource Speech Recognition. In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, Ontario: IEEE Signal Processing Society, 2021.
p. 6998-7002. ISBN: 978-1-7281-7605-5. DetailKOCOUR, M.; CÁMBARA, G.; LUQUE, J.; BONET, D.; FARRÚS, M.; KARAFIÁT, M.; VESELÝ, K.; ČERNOCKÝ, J. BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge. Proceedings of IberSPEECH 2021. Vallaloid: International Speech Communication Association, 2021.
p. 113-117. DetailVYDANA, H.; KARAFIÁT, M.; BURGET, L.; ČERNOCKÝ, J. The IWSLT 2021 BUT Speech Translation Systems. In Proceedings of 18th International Conference on Spoken Language Translation (IWSLT). Bangkok, on-line: Association for Computational Linguistics, 2021.
p. 75-83. ISBN: 978-1-7138-3378-9. DetailVYDANA, H.; KARAFIÁT, M.; ŽMOLÍKOVÁ, K.; BURGET, L.; ČERNOCKÝ, J. Jointly Trained Transformers Models for Spoken Language Translation. In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, Ontario: IEEE Signal Processing Society, 2021.
p. 7513-7517. ISBN: 978-1-7281-7605-5. Detail -
2020
ŽMOLÍKOVÁ, K.; KOCOUR, M.; LANDINI, F.; BENEŠ, K.; KARAFIÁT, M.; VYDANA, H.; LOZANO DÍEZ, A.; PLCHOT, O.; BASKAR, M.; ŠVEC, J.; MOŠNER, L.; MALENOVSKÝ, V.; BURGET, L.; YUSUF, B.; NOVOTNÝ, O.; GRÉZL, F.; SZŐKE, I.; ČERNOCKÝ, J. BUT System for CHiME-6 Challenge. Proceedings of CHiME 2020 Virtual Workshop. Barcelona: University of Sheffield, 2020.
p. 1-3. Detail -
2019
BASKAR, M.; BURGET, L.; WATANABE, S.; KARAFIÁT, M.; HORI, T.; ČERNOCKÝ, J. Promising Accurate Prefix Boosting For Sequence-to-sequence ASR. In Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019.
p. 5646-5650. ISBN: 978-1-5386-4658-8. DetailKARAFIÁT, M.; BASKAR, M.; WATANABE, S.; HORI, T.; WIESNER, M.; ČERNOCKÝ, J. Analysis of Multilingual Sequence-to-Sequence Speech Recognition Systems. In Proceedings of Interspeech. Proceedings of Interspeech. Graz: International Speech Communication Association, 2019.
p. 2220-2224. ISSN: 1990-9772. Detail -
2018
CHO, J.; BASKAR, M.; LI, R.; WIESNER, M.; MALLIDI, S.; YALTA, N.; KARAFIÁT, M.; WATANABE, S.; HORI, T. Multilingual Sequence-to-Sequence Speech Recognition: Architecture, Transfer Learning, and Language Modeling. In Proceedings of 2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018). Athens: IEEE Signal Processing Society, 2018.
p. 521-527. ISBN: 978-1-5386-4334-1. DetailKARAFIÁT, M.; BASKAR, M.; SZŐKE, I.; MALENOVSKÝ, V.; VESELÝ, K.; GRÉZL, F.; BURGET, L.; ČERNOCKÝ, J. BUT OpenSAT 2017 speech recognition system. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018.
p. 2638-2642. ISSN: 1990-9772. DetailKARAFIÁT, M.; BASKAR, M.; VESELÝ, K.; GRÉZL, F.; BURGET, L.; ČERNOCKÝ, J. Analysis of Multilingual BLSTM Acoustic Model on Low and High Resource Languages. In Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018.
p. 5789-5793. ISBN: 978-1-5386-4658-8. DetailPULUGUNDLA, B.; BASKAR, M.; KESIRAJU, S.; EGOROVA, E.; KARAFIÁT, M.; BURGET, L.; ČERNOCKÝ, J. BUT system for low resource Indian language ASR. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018.
p. 3182-3186. ISSN: 1990-9772. Detail -
2017
BASKAR, M.; KARAFIÁT, M.; BURGET, L.; VESELÝ, K.; GRÉZL, F.; ČERNOCKÝ, J. Residual Memory Networks: Feed-forward approach to learn long-term temporal dependencies. In Proceedings of ICASSP 2017. New Orleans: IEEE Signal Processing Society, 2017.
p. 4810-4814. ISBN: 978-1-5090-4117-6. DetailKARAFIÁT, M.; BASKAR, M.; MATĚJKA, P.; VESELÝ, K.; GRÉZL, F.; BURGET, L.; ČERNOCKÝ, J. 2016 BUT Babel system: Multilingual BLSTM acoustic model with i-vector based adaptation. In Proceedings of Interspeech 2017. Proceedings of Interspeech. Stockholm: International Speech Communication Association, 2017.
p. 719-723. ISSN: 1990-9772. DetailKARAFIÁT, M.; VESELÝ, K.; ŽMOLÍKOVÁ, K.; DELCROIX, M.; WATANABE, S.; BURGET, L.; ČERNOCKÝ, J.; SZŐKE, I. Training Data Augmentation and Data Selection. In New Era for Robust Speech Recognition: Exploiting Deep Learning. Computer Science, Artificial Intelligence. Heidelberg: Springer International Publishing, 2017.
p. 245-260. ISBN: 978-3-319-64679-4. DetailPAPADOPOULOS, P.; TRAVADI, R.; VAZ, C.; MALANDRAKIS, N.; HERMJAKOB, U.; POURDAMGHANI, N.; PUST, M.; ZHANG, B.; PAN, X.; LU, D.; LIN, Y.; GLEMBEK, O.; BASKAR, M.; KARAFIÁT, M.; BURGET, L.; HASEGAWA-JOHNSON, M.; JI, H.; MAY, J.; KNIGHT, K.; NARAYANAN, S. Team ELISA System for DARPA LORELEI Speech Evaluation 2016. In Proceedings of Interspeech 2017. Proceedings of Interspeech. Stockholm: International Speech Communication Association, 2017.
p. 2053-2057. ISSN: 1990-9772. Detail -
2016
CHALUPNÍČEK, K.; KARAFIÁT, M.; ŽIŽKA, J. Souhrnná zpráva k projektu "Zpracování audiovizuálních dat pro Superlectures.com" za rok 2016. Brno: ReplayWell, s. r. o., 2016.
s. 1 (1 s.). DetailGRÉZL, F.; EGOROVA, E.; KARAFIÁT, M. Study of Large Data Resources for Multilingual Training and System Porting. In Procedia Computer Science. Procedia Computer Science. Yogyakarta: Elsevier Science, 2016.
p. 15-22. ISSN: 1877-0509. DetailGRÉZL, F.; KARAFIÁT, M. Boosting Performance on Low-resource Languages by Standard Corpora: AN ANALYSIS. In Proceeding of SLT 2016. San Diego: IEEE Signal Processing Society, 2016.
p. 629-636. ISBN: 978-1-5090-4903-5. DetailGRÉZL, F.; KARAFIÁT, M. Bottle-Neck Feature Extraction Structures for Multilingual Training and Porting. In Procedia Computer Science. Procedia Computer Science. Yogyakarta: Elsevier Science, 2016.
p. 144-151. ISSN: 1877-0509. DetailKARAFIÁT, M.; BASKAR, M.; MATĚJKA, P.; VESELÝ, K.; GRÉZL, F.; ČERNOCKÝ, J. Multilingual BLSTM and Speaker-Specific Vector Adaptation in 2016 BUT BABEL SYSTEM. In Proceedings of SLT 2016. San Diego: IEEE Signal Processing Society, 2016.
p. 637-643. ISBN: 978-1-5090-4903-5. DetailKARAFIÁT, M.; BURGET, L.; GRÉZL, F.; VESELÝ, K.; ČERNOCKÝ, J. Multilingual Region-Dependent Transforms. In Proceedings of the 41th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016. Shanghai: IEEE Signal Processing Society, 2016.
p. 5430-5434. ISBN: 978-1-4799-9988-0. DetailPLCHOT, O.; MATĚJKA, P.; FÉR, R.; GLEMBEK, O.; NOVOTNÝ, O.; PEŠÁN, J.; VESELÝ, K.; ONDEL YANG, L.; KARAFIÁT, M.; GRÉZL, F.; KESIRAJU, S.; BURGET, L.; BRUMMER, J.; SWART, A.; CUMANI, S.; MALLIDI, S.; LI, R. BAT System Description for NIST LRE 2015. In Proceedings of Odyssey 2016, The Speaker and Language Recognition Workshop. Proceedings of Odyssey: The Speaker and Language Recognition Workshop Odyssey 2014, Joensuu, Finland. Bilbao: International Speech Communication Association, 2016.
p. 166-173. ISSN: 2312-2846. DetailSKÁCEL, M.; KARAFIÁT, M.; ONDEL YANG, L.; UCHYTIL, A.; SZŐKE, I. BUT Zero-Cost Speech Recognition 2016 System Description. In CEUR Workshop Proceedings. CEUR Workshop Proceedings. Hilversum: CEUR-WS.org, 2016.
p. 1-3. ISSN: 1613-0073. DetailVESELÝ, K.; WATANABE, S.; ŽMOLÍKOVÁ, K.; KARAFIÁT, M.; BURGET, L.; ČERNOCKÝ, J. Sequence Summarizing Neural Network for Speaker Adaptation. In Proceedings of the 41th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016. Shanghai: IEEE Signal Processing Society, 2016.
p. 5315-5319. ISBN: 978-1-4799-9988-0. DetailŽMOLÍKOVÁ, K.; KARAFIÁT, M.; VESELÝ, K.; DELCROIX, M.; WATANABE, S.; BURGET, L.; ČERNOCKÝ, J. Data selection by sequence summarizing neural network in mismatch condition training. In Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016.
p. 2354-2358. ISBN: 978-1-5108-3313-5. Detail -
2015
HSIAO, R.; MA, J.; HARTMANN, W.; KARAFIÁT, M.; GRÉZL, F.; BURGET, L.; SZŐKE, I.; ČERNOCKÝ, J.; WATANABE, S.; CHEN, Z.; MALLIDI, S.; HEŘMANSKÝ, H.; TSAKALIDIS, S.; SCHWARTZ, R. Robust Speech Recognition in Unknown Reverberant and Noisy Conditions. In Proceedings of 2015 IEEE Automatic Speech Recognition and Understanding Workshop. Scottsdale, Arizona: IEEE Signal Processing Society, 2015.
p. 533-538. ISBN: 978-1-4799-7291-3. DetailKARAFIÁT, M.; GRÉZL, F.; BURGET, L.; SZŐKE, I.; ČERNOCKÝ, J. Three ways to adapt a CTS recognizer to unseen reverberated speech in BUT system for the ASpIRE challenge. In Proceedings of Interspeech 2015. Proceedings of Interspeech. Dresden: International Speech Communication Association, 2015.
p. 2454-2458. ISBN: 978-1-5108-1790-6. ISSN: 1990-9772. Detail -
2014
CHALUPNÍČEK, K.; KARAFIÁT, M.; ŽIŽKA, J. Souhrnná zpráva k projektu "Zpracování audiovizuálních dat pro Superlectures.com" za rok 2014. Brno: ReplayWell, s. r. o., 2014.
s. 1 (1 s.). DetailGRÉZL, F.; EGOROVA, E.; KARAFIÁT, M. Further Investigation into Multilingual Training and Adaptation of Stacked Bottle-neck Neural Network Structure. In Proceedings of 2014 Spoken Language Technology Workshop. South Lake Tahoe, Nevada: IEEE Signal Processing Society, 2014.
p. 48-53. ISBN: 978-1-4799-7129-9. DetailGRÉZL, F.; KARAFIÁT, M. Adapting Multilingual Neural Network Hierarchy to a New Language. Proceedings of the 4th International Workshop on Spoken Language Technologies for Under- resourced Languages SLTU-2014. St. Petersburg, Russia, 2014. St. Petersburg: International Speech Communication Association, 2014.
p. 39-45. ISBN: 978-5-8088-0908-6. DetailGRÉZL, F.; KARAFIÁT, M. Combination of Multilingual and Semi-Supervised Training for Under-Resourced Languages. In Proceedings of Interspeech 2014. Singapore: International Speech Communication Association, 2014.
p. 820-824. ISBN: 978-1-63439-435-2. DetailGRÉZL, F.; KARAFIÁT, M.; VESELÝ, K. Adaptation of Multilingual Stacked Bottle-neck Neural Network Structure for New Language. In Proceedings of ICASSP 2014. Florencie: IEEE Signal Processing Society, 2014.
p. 7704-7708. ISBN: 978-1-4799-2892-7. DetailKARAFIÁT, M.; GRÉZL, F.; HANNEMANN, M.; ČERNOCKÝ, J. BUT Neural Network Features for Spontaneous Vietnamese in BABEL. In Proceedings of ICASSP 2014. Florencie: IEEE Signal Processing Society, 2014.
p. 5659-5663. ISBN: 978-1-4799-2892-7. DetailKARAFIÁT, M.; GRÉZL, F.; VESELÝ, K.; HANNEMANN, M.; SZŐKE, I.; ČERNOCKÝ, J. BUT 2014 Babel System: Analysis of adaptation in NN based systems. In Proceedings of Interspeech 2014. Singapore: International Speech Communication Association, 2014.
p. 3002-3006. ISBN: 978-1-63439-435-2. DetailKARAFIÁT, M.; VESELÝ, K.; SZŐKE, I.; BURGET, L.; GRÉZL, F.; HANNEMANN, M.; ČERNOCKÝ, J. BUT ASR System for BABEL Surprise Evaluation 2014. In Proceedings of 2014 Spoken Language Technology Workshop. South Lake Tahoe, Nevada: IEEE Signal Processing Society, 2014.
p. 501-506. ISBN: 978-1-4799-7129-9. DetailNG, T.; HSIAO, R.; ZHANG, L.; KARAKOS, D.; MALLIDI, S.; KARAFIÁT, M.; VESELÝ, K.; SZŐKE, I.; ZHANG, B.; NGUYEN, L.; SCHWARTZ, R. Progress in the BBN Keyword Search System for the DARPA RATS Program. In Proceedings of Interspeech 2014. Singapore: International Speech Communication Association, 2014.
p. 959-963. ISBN: 978-1-63439-435-2. Detail -
2013
EGOROVA, E.; VESELÝ, K.; KARAFIÁT, M.; JANDA, M.; ČERNOCKÝ, J. Manual and Semi-Automatic Approaches to Building a Multilingual Phoneme Set. In Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013.
p. 7324-7328. ISBN: 978-1-4799-0355-9. DetailGRÉZL, F.; KARAFIÁT, M. Semi-Supervised Bootstrapping Approach For Neural Network Feature Extractor Training. Proceedings of ASRU 2013. Olomouc: IEEE Signal Processing Society, 2013.
p. 470-475. ISBN: 978-1-4799-2755-5. DetailGRÉZL, F.; KARAFIÁT, M.; VESELÝ, K.; ŽIŽKA, J. Souhrnná zpráva k projektu "Zpracování audiovizuálních dat pro Superlectures.com" za rok 2013. Brno: ReplayWell, s. r. o., 2013.
s. 0-0. DetailKARAFIÁT, M.; GRÉZL, F.; HANNEMANN, M.; VESELÝ, K.; ČERNOCKÝ, J. BUT BABEL System for Spontaneous Cantonese. Proceedings of Interspeech 2013. Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013). Lyon: International Speech Communication Association, 2013.
p. 2589-2593. ISBN: 978-1-62993-443-3. ISSN: 2308-457X. DetailKARAKOS, D.; SCHWARTZ, R.; TSAKALIDIS, S.; ZHANG, L.; RANJAN, S.; NG, T.; HSIAO, R.; NGUYEN, L.; GRÉZL, F.; HANNEMANN, M.; KARAFIÁT, M.; SZŐKE, I.; VESELÝ, K. Score Normalization and System Combination for Improved Keyword Spotting. In Proceedings of ASRU 2013. Olomouc: IEEE Signal Processing Society, 2013.
p. 210-215. ISBN: 978-1-4799-2755-5. DetailMOTLÍČEK, P.; POVEY, D.; KARAFIÁT, M. Feature And Score Level Combination Of Subspace Gaussians In LVCSR Task. Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013.
p. 7604-7608. ISBN: 978-1-4799-0355-9. DetailRATH, S.; BURGET, L.; KARAFIÁT, M.; GLEMBEK, O.; ČERNOCKÝ, J. A Region-specific Feature-space Transformation for Speaker Adaptation and Singularity Analysis of Jacobian Matrix. Proceedings of Interspeeech 2013. Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013). Lyon: International Speech Communication Association, 2013.
p. 1228-1232. ISBN: 978-1-62993-443-3. ISSN: 2308-457X. Detail -
2012
BRUMMER, J.; CUMANI, S.; GLEMBEK, O.; KARAFIÁT, M.; MATĚJKA, P.; PEŠÁN, J.; PLCHOT, O.; SOUFIFAR, M.; DE VILLIERS, E.; ČERNOCKÝ, J. Description and analysis of the Brno276 system for LRE2011. In Proceedings of Odyssey 2012: The Speaker and Language Recognition Workshop. Singapur: International Speech Communication Association, 2012.
p. 216-223. ISBN: 978-981-07-3093-2. DetailCUMANI, S.; PLCHOT, O.; KARAFIÁT, M. Independent Component Analysis and MLLR Transforms for Speaker Identification. Proc. International Conference on Acoustics, Speech, and Signal P. Kyoto: IEEE Signal Processing Society, 2012.
p. 4365-4368. ISBN: 978-1-4673-0044-5. DetailHAIN, T.; BURGET, L.; DINES, J.; GARNER, P.; GRÉZL, F.; EL HANNANI, A.; HUIJBREGTS, M.; KARAFIÁT, M.; LINCOLN, M.; WAN, V. Transcribing Meetings with the AMIDA System. IEEE Transactions on Audio, Speech, and Language Processing, 2012, vol. 20, no. 2,
p. 486-498. ISSN: 1558-7916. DetailJANDA, M.; KARAFIÁT, M.; ČERNOCKÝ, J. Dealing with Numbers in Grapheme-Based Speech Recognition. Proceedings of 15th International Conference on Text, Speech and Dialogue. Lecture Notes in Computer Science. Lecture Notes in Computer Science, 2012, Volume 7499. Springer-Verlag Berlin Heidelberg 2012: Springer Verlag, 2012.
p. 438-445. ISBN: 978-3-642-32789-6. ISSN: 0302-9743. DetailKARAFIÁT, M.; JANDA, M.; ČERNOCKÝ, J.; BURGET, L. Region Dependent Linear Transforms in Multilingual Speech Recognition. In Proc. International Conference on Acoustics, Speech, and Signal Processing 2012. Kyoto: IEEE Signal Processing Society, 2012.
p. 4885-4888. ISBN: 978-1-4673-0044-5. DetailKOMBRINK, S.; MIKOLOV, T.; KARAFIÁT, M.; BURGET, L. Improving Language Models for ASR Using Translated In-domain Data. Proceedings of 2012 IEEE International Conference on Acoustics, Speech and Signal Processing. Kyoto: IEEE Signal Processing Society, 2012.
p. 4405-4408. ISBN: 978-1-4673-0044-5. DetailPLCHOT, O.; KARAFIÁT, M.; BRUMMER, J.; GLEMBEK, O.; MATĚJKA, P.; DE VILLIERS, E.; ČERNOCKÝ, J. Speaker vectors from Subspace Gaussian Mixture Model as complementary features for Language Identification. In Proceedings of Odyssey 2012, The Speaker and Language Recognition Workshop. Singapur: International Speech Communication Association, 2012.
p. 330-333. ISBN: 978-981-07-3093-2. DetailPOVEY, D.; HANNEMANN, M.; BOULIANNE, G.; BURGET, L.; GHOSHAL, A.; JANDA, M.; KARAFIÁT, M.; KOMBRINK, S.; MOTLÍČEK, P.; QIAN, Y.; RIEDHAMMER, K.; VESELÝ, K.; VU, N. Generating Exact Lattices in The WFST Framework. Proceedings of 2012 IEEE International Conference on Acoustics, Speech and Signal Processing. Kyoto: IEEE Signal Processing Society, 2012.
p. 4213-4216. ISBN: 978-1-4673-0044-5. DetailRATH, S.; KARAFIÁT, M.; GLEMBEK, O.; ČERNOCKÝ, J. A factorized representation of FMLLR transform based on QR-decomposition. Proceedings of Interspeech 2012. Proceedings of Interspeech. Portland, Oregon: International Speech Communication Association, 2012.
p. 1-4. ISBN: 978-1-62276-759-5. ISSN: 1990-9772. DetailVESELÝ, K.; KARAFIÁT, M.; GRÉZL, F.; JANDA, M.; EGOROVA, E. The Language-Independent Bottleneck Features. Proceedings of IEEE 2012 Workshop on Spoken Language Technology. Miami: IEEE Signal Processing Society, 2012.
p. 336-341. ISBN: 978-1-4673-5124-9. Detail -
2011
DEORAS, A.; MIKOLOV, T.; KOMBRINK, S.; KARAFIÁT, M.; KHUDANPUR, S. Variational Approximation of Long-span Language Models for LVCSR. Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011.
p. 5532-5535. ISBN: 978-1-4577-0537-3. DetailGLEMBEK, O.; BURGET, L.; KENNY, P.; KARAFIÁT, M.; MATĚJKA, P. Simplification and optimization of I-Vector Extraction. Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011.
p. 4516-4519. ISBN: 978-1-4577-0537-3. DetailGRÉZL, F.; KARAFIÁT, M. Integrating recent MLP feature extraction techniques into TRAP architecture. Proceedings of Interspeech 2011. Proceedings of Interspeech. Florence: International Speech Communication Association, 2011.
p. 1229-1232. ISBN: 978-1-61839-270-1. ISSN: 1990-9772. DetailGRÉZL, F.; KARAFIÁT, M.; JANDA, M. Study of Probabilistic and Bottle-Neck Features in Multilingual Environment. Proceedings of ASRU 2011. Hilton Waikoloa Village, Big Island, Hawaii: IEEE Signal Processing Society, 2011.
p. 359-364. ISBN: 978-1-4673-0366-8. DetailKARAFIÁT, M.; BURGET, L.; MATĚJKA, P.; GLEMBEK, O.; ČERNOCKÝ, J. iVector-Based Discriminative Adaptation for Automatic Speech Recognition. Proceedings of ASRU 2011. Hilton Waikoloa Village, Big Island, Hawaii: IEEE Signal Processing Society, 2011.
p. 152-157. ISBN: 978-1-4673-0366-8. DetailKOMBRINK, S.; MIKOLOV, T.; KARAFIÁT, M.; BURGET, L. Recurrent Neural Network based Language Modeling in Meeting Recognition. Proceedings of Interspeech 2011. Proceedings of Interspeech. Florence: International Speech Communication Association, 2011.
p. 2877-2880. ISBN: 978-1-61839-270-1. ISSN: 1990-9772. DetailPOVEY, D.; BURGET, L.; AGARWAL, M.; AKYAZI, P.; GHOSHAL, A.; GLEMBEK, O.; GOEL, N.; KARAFIÁT, M.; RASTROW, A.; ROSE, R.; SCHWARZ, P.; THOMAS, S. The subspace Gaussian mixture model-A structured model for speech recognition. COMPUTER SPEECH AND LANGUAGE, 2011, vol. 25, no. 2,
p. 404-439. ISSN: 0885-2308. DetailPOVEY, D.; KARAFIÁT, M.; GHOSHAL, A.; SCHWARZ, P. A Symmetrization of the Subspace Gaussian Mixture Model. Proceedings of 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing. Praha: IEEE Signal Processing Society, 2011.
p. 4504-4507. ISBN: 978-1-4577-0537-3. DetailVESELÝ, K.; KARAFIÁT, M.; GRÉZL, F. Convolutive Bottleneck Network Features for LVCSR. Proceedings of ASRU 2011. Big Island, Hawaii: IEEE Signal Processing Society, 2011.
p. 42-47. ISBN: 978-1-4673-0366-8. Detail -
2010
BRUMMER, J.; BURGET, L.; KENNY, P.; MATĚJKA, P.; DE VILLIERS, E.; KARAFIÁT, M.; KOCKMANN, M.; GLEMBEK, O.; PLCHOT, O.; BAUM, D.; SENOUSSAUOI, M. ABC System description for NIST SRE 2010. Proc. NIST 2010 Speaker Recognition Evaluation. Brno: National Institute of Standards and Technology, 2010.
p. 1-20. DetailBURGET, L.; SCHWARZ, P.; AGARWAL, M.; AKYAZI, P.; FENG, K.; GHOSHAL, A.; GLEMBEK, O.; GOEL, N.; KARAFIÁT, M.; POVEY, D.; RASTROW, A.; ROSE, R.; THOMAS, S. Multilingual acoustic modeling for speech recognition based on Subspace Gaussian Mixture Models. Proc. International Conference on Acoustictics, Speech, and Signal Processing. Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010.
p. 4334-4337. ISBN: 978-1-4244-4296-6. ISSN: 1520-6149. DetailGHOSHAL, A.; POVEY, D.; AGARWAL, M.; AKYAZI, P.; BURGET, L.; FENG, K.; GLEMBEK, O.; GOEL, N.; KARAFIÁT, M.; RASTROW, A.; ROSE, R.; SCHWARZ, P.; THOMAS, S. A novel estimation of feature-space MLLR for full_covariance models. Proc. International Conference on Acoustics, Speech, and Signal Processing. Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010.
p. 4310-4313. ISBN: 978-1-4244-4296-6. ISSN: 1520-6149. DetailGOEL, N.; THOMAS, S.; AGARWAL, M.; AKYAZI, P.; BURGET, L.; FENG, K.; GHOSHAL, A.; GLEMBEK, O.; KARAFIÁT, M.; POVEY, D.; RASTROW, A.; ROSE, R.; SCHWARZ, P. Approaches to automatic lexicon learning with limited training examples. Proc. International Conference on Acoustics, Speech, and Signal Processing. Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010.
p. 5094-5097. ISBN: 978-1-4244-4296-6. ISSN: 1520-6149. DetailGRÉZL, F.; KARAFIÁT, M. Hierarchical Neural Net Architectures for Feature Extraction in ASR. Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Proceedings of Interspeech. Makuhari, Chiba: International Speech Communication Association, 2010.
p. 1201-1204. ISBN: 978-1-61782-123-3. ISSN: 1990-9772. DetailHAIN, T.; BURGET, L.; DINES, J.; GARNER, P.; EL HANNANI, A.; HUIJBREGTS, M.; KARAFIÁT, M.; LINCOLN, M.; WAN, V. The AMIDA 2009 Meeting Transcription System. Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Proceedings of Interspeech. Makuhari, Chiba: International Speech Communication Association, 2010.
p. 358-361. ISBN: 978-1-61782-123-3. ISSN: 1990-9772. DetailHANNEMANN, M.; KOMBRINK, S.; KARAFIÁT, M.; BURGET, L. Similarity Scoring for Recognizing Repeated Out-of-VocabularyWords. Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Proceedings of Interspeech. Makuhari, Chiba: International Speech Communication Association, 2010.
p. 897-900. ISBN: 978-1-61782-123-3. ISSN: 1990-9772. DetailJANČÍK, Z.; PLCHOT, O.; BRUMMER, J.; BURGET, L.; GLEMBEK, O.; HUBEIKA, V.; KARAFIÁT, M.; MATĚJKA, P.; MIKOLOV, T.; STRASHEIM, A.; ČERNOCKÝ, J. Data selection and calibration issues in automatic language recognition - investigation with BUT-AGNITIO NIST LRE 2009 system. In Proc. Odyssey 2010 - The Speaker and Language Recognition Workshop. Brno: International Speech Communication Association, 2010.
p. 215-221. ISBN: 978-80-214-4114-9. DetailKARAFIÁT, M.; SZŐKE, I.; ČERNOCKÝ, J. Using Gradient Descent Optimization for Acoustics Training from Heterogeneous Data. Proc. Text, Speech and Dialog 2010. Lecture Notes in Computer Science. LNAI 6231. Brno: Springer Verlag, 2010.
p. 322-329. ISBN: 978-3-642-15759-2. ISSN: 0302-9743. DetailMIKOLOV, T.; KARAFIÁT, M.; BURGET, L.; ČERNOCKÝ, J.; KHUDANPUR, S. Recurrent neural network based language model. Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Proceedings of Interspeech. Makuhari, Chiba: International Speech Communication Association, 2010.
p. 1045-1048. ISBN: 978-1-61782-123-3. ISSN: 1990-9772. DetailPOVEY, D.; BURGET, L.; AGARWAL, M.; AKYAZI, P.; FENG, K.; GHOSHAL, A.; GLEMBEK, O.; GOEL, N.; KARAFIÁT, M.; RASTROW, A.; ROSE, R.; SCHWARZ, P.; THOMAS, S. Subspace Gaussian mixture models for speech recognition. Proc. International Conference on Acoustics, Speech, and Signal Processing. Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010.
p. 4330-4333. ISBN: 978-1-4244-4296-6. ISSN: 1520-6149. DetailROSE, R.; NOROUZIAN, A.; REDDY, A.; COY, A.; GUPTA, V.; KARAFIÁT, M. Subword-based spoken term detection in audio course lectures. Proc. International Conference on Acoustics, Speech, and Signal Processing. Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010.
p. 5282-5285. ISBN: 978-1-4244-4296-6. ISSN: 1520-6149. Detail -
2009
BRÜMMER, N.; BURGET, L.; GLEMBEK, O.; HUBEIKA, V.; JANČÍK, Z.; KARAFIÁT, M.; MATĚJKA, P.; MIKOLOV, T.; PLCHOT, O.; STRASHEIM, A. BUT-AGNITIO System Description for NIST Language Recognition Evaluation 2009. Proceedings NIST 2009 Language Recognition Evaluation Workshop. Baltimore, Maryland, USA: National Institute of Standards and Technology, 2009.
p. 1-7. DetailBURGET, L.; FAPŠO, M.; HUBEIKA, V.; GLEMBEK, O.; KARAFIÁT, M.; KOCKMANN, M.; MATĚJKA, P.; SCHWARZ, P.; ČERNOCKÝ, J. BUT system for NIST 2008 speaker recognition evaluation. Proc. Interspeech 2009. Proceedings of Interspeech. Brighton: International Speech Communication Association, 2009.
p. 2335-2338. ISBN: 978-1-61567-692-7. ISSN: 1990-9772. DetailGARNER, P.; DINES, J.; HAIN, T.; EL HANNANI, A.; KARAFIÁT, M.; KORCHAGIN, D.; LINCOLN, M.; WAN, V.; ZHANG, L. Real-Time ASR from Meetings. Proc. Interspeech 2009. Proceedings of Interspeech. Brighton: International Speech Communication Association, 2009.
p. 2119-2122. ISSN: 1990-9772. DetailGRÉZL, F.; KARAFIÁT, M.; BURGET, L. Investigation into bottle-neck features for meeting speech recognition. Proc. Interspeech 2009. Proceedings of Interspeech. Brighton: International Speech Communication Association, 2009.
p. 2947-2950. ISBN: 978-1-61567-692-7. ISSN: 1990-9772. DetailKOMBRINK, S.; BURGET, L.; MATĚJKA, P.; KARAFIÁT, M.; HEŘMANSKÝ, H. Posterior-based Out of Vocabulary Word Detection in Telephone Speech. Proc. Interspeech 2009. Proceedings of Interspeech. Brighton: International Speech Communication Association, 2009.
p. 80-83. ISSN: 1990-9772. Detail -
2008
BURGET, L.; FAPŠO, M.; HUBEIKA, V.; GLEMBEK, O.; KARAFIÁT, M.; KOCKMANN, M.; MATĚJKA, P.; SCHWARZ, P.; ČERNOCKÝ, J. Brno University Of Technology - NIST 2008 SRE. Montreal: 2008.
p. 1-28. DetailKARAFIÁT, M.; BURGET, L.; HAIN, T.; ČERNOCKÝ, J. Discrimininative training of narrow band - wide band adaptated systems for meeting recognition. Proc. Interspeech 2008. Proceedings of Interspeech. Brisbane: International Speech Communication Association, 2008.
p. 1-4. ISSN: 1990-9772. DetailKOPECKÝ, J.; GLEMBEK, O.; KARAFIÁT, M. Advances in Acoustic Modeling for the Recognition of Czech. Proc. 11th International Conference on Text, Speech and Dialogue. Lecture Notes in Computer Science. Berlin: Springer Verlag, 2008.
p. 357-363. ISBN: 978-3-540-87390-7. Detail -
2007
BRÜMMER, N.; BURGET, L.; ČERNOCKÝ, J.; GLEMBEK, O.; GRÉZL, F.; KARAFIÁT, M.; VAN LEEUWEN, D.; MATĚJKA, P.; SCHWARZ, P.; STRASHEIM, A. Fusion of heterogeneous speaker recognition systems in the STBU submission for the NIST speaker recognition evaluation 2006. IEEE Transactions on Audio, Speech, and Language Processing, 2007, vol. 15, no. 7,
p. 2072-2084. ISSN: 1558-7916. DetailGRÉZL, F.; KARAFIÁT, M.; ČERNOCKÝ, J. Neural network topologies and bottle neck features in speech recognition. Brno: 2007.
p. 78-82. DetailMATĚJKA, P.; BURGET, L.; SCHWARZ, P.; GLEMBEK, O.; KARAFIÁT, M.; GRÉZL, F.; ČERNOCKÝ, J.; VAN LEEUWEN, D.; BRÜMMER, N.; STRASHEIM, A. STBU system for the NIST 2006 speaker recognition evaluation. Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007). Honolulu: IEEE Signal Processing Society, 2007.
p. 221-224. ISBN: 1-4244-0728-1. DetailMIKOLOV, T.; OPARIN, I.; GLEMBEK, O.; BURGET, L.; KARAFIÁT, M.; ČERNOCKÝ, J. Použití mluvených korpusů ve vývoji systému pro rozpoznávání českých přednášek. Praha: Univerzita Karlova, 2007.
s. 1-5. DetailSZŐKE, I.; BURGET, L.; KARAFIÁT, M. Combination of Word and Phoneme Approach for Spoken Term Detection. Brno: 2007.
p. 1 (1 s.). DetailSZŐKE, I.; FAPŠO, M.; KARAFIÁT, M.; BURGET, L.; GRÉZL, F.; SCHWARZ, P.; GLEMBEK, O.; MATĚJKA, P.; KOPECKÝ, J.; ČERNOCKÝ, J. Spoken Term Detection System Based on a Combination of LVCSR and Phonetic Search. Brno: 2007.
p. 1 (1 s.). Detail -
2006
FAPŠO, M.; SMRŽ, P.; SCHWARZ, P.; SZŐKE, I.; SCHWARZ, M.; ČERNOCKÝ, J.; KARAFIÁT, M.; BURGET, L. Information Retrieval from Spoken Documents. Proceedings of the Seventh International Conference on Intelligent Text Processing and Computational Linguistics (CICLING 2006). Mexico City: Springer Verlag, 2006.
p. 410-416. ISBN: 3-540-32205-1. DetailKARAFIÁT, M.; GRÉZL, F.; SCHWARZ, P.; BURGET, L.; ČERNOCKÝ, J. Robust heteroscedastic linear discriminant analysis and LCRC posterior features in meeting data recognition. Proc. 3nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006). Lecture Notes in Computer Science. Berlin: Springer Verlag, 2006.
p. 275-284. ISBN: 3-540-69267-3. Detail -
2005
FAPŠO, M.; SCHWARZ, P.; SZŐKE, I.; ČERNOCKÝ, J.; SMRŽ, P.; BURGET, L.; KARAFIÁT, M. Search Engine for Information Retrieval from Multi-modal Records. Edinburgh: 2005.
p. 0-0. DetailFAPŠO, M.; SMRŽ, P.; SCHWARZ, P.; SZŐKE, I.; BURGET, L.; KARAFIÁT, M.; ČERNOCKÝ, J. Systém pre efektívne vyhľadávanie v rečových databázach. Sborník databázové konference DATAKON 2005. Brno: Masaryk University, 2005.
s. 323-333. ISBN: 80-210-3813-6. DetailSZŐKE, I.; SCHWARZ, P.; BURGET, L.; KARAFIÁT, M.; ČERNOCKÝ, J. Phoneme based acoustics keyword spotting in informal continuous speech. Radioelektronika 2005. Brno: Faculty of Electrical Engineering and Communication BUT, 2005.
p. 195-198. ISBN: 80-214-2904-6. DetailSZŐKE, I.; SCHWARZ, P.; BURGET, L.; KARAFIÁT, M.; MATĚJKA, P.; ČERNOCKÝ, J. Phoneme Based Acoustics Keyword Spotting in Informal Continuous Speech. Lecture Notes in Computer Science, 2005, vol. 2005, no. 3658,
p. 302-309. ISSN: 0302-9743. Detail -
2004
KARAFIÁT, M.; GRÉZL, F.; BURGET, L. Combination of MFCC and TRAP features for LVCSR of meeting data. Martigny: 2004.
p. 0-0. DetailKARAFIÁT, M.; GRÉZL, F.; ČERNOCKÝ, J. TRAP based features for LVCSR of meeting data. Proc. 8th International Conference on Spoken Language Processing. 8th International Conference on Spoken Language Processing. Jeju Island: Sunjin Printing Co, 2004.
p. 437-440. ISSN: 1225-4111. Detail -
2003
KARAFIÁT, M.; GRÉZL, F. Using MATLAB for Analysis of TRAP system. Radioengineering, 2003, vol. 2003, no. 4,
p. 38-41. ISSN: 1210-2512. Detail -
2002
KARAFIÁT, M.; ČERNOCKÝ, J. Context dependent Hidden Markov models in recognition of Czech. Proc. 12th International scientific conference Radioelektronika 2002. Bratislava: Slovak University of Technology in Bratislava, 2002.
p. 0-0. ISBN: 80-227-1700-2. DetailKARAFIÁT, M.; ČERNOCKÝ, J. Differences between context dependent and context independent Hidden Markov Models for recognition of Czech. Proc. of 8th student conference STUDENT EEICT 2002. Brno: Faculty of Electrical Engineering TUB, 2002.
p. 328-332. ISBN: 80-214-2116-9. DetailMATĚJKA, P.; SCHWARZ, P.; KARAFIÁT, M.; ČERNOCKÝ, J. Some like it Gaussian... Proc. 5th International Conference Text, Speech and Dialogue, TSD2002. Lecture notes in artificial intelligence 2448. Berlin: Springer Verlag, 2002.
p. 321-324. ISBN: 3-540-44129-8. Detail