Ing. Martin Karafiát, Ph.D.

Researcher

+420 54114 1279
karafiat@fit.vut.cz
Office: L226
BUT personal ID: 12938

Publications

  • 2022

    KOCOUR, M.; UMESH, J.; KARAFIÁT, M.; ŠVEC, J.; LOPEZ, F.; BENEŠ, K.; DIEZ SÁNCHEZ, M.; SZŐKE, I.; LUQUE, J.; VESELÝ, K.; BURGET, L.; ČERNOCKÝ, J. BCN2BRNO: ASR System Fusion for Albayzin 2022 Speech to Text Challenge. Proceedings of IberSpeech 2022. Granada: International Speech Communication Association, 2022. p. 276-280. Detail

  • 2021

    KARAFIÁT, M.; VESELÝ, K.; ČERNOCKÝ, J.; PROFANT, J.; NYTRA, J.; HLAVÁČEK, M.; PAVLÍČEK, T. Analysis of X-Vectors for Low-Resource Speech Recognition. In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, Ontario: IEEE Signal Processing Society, 2021. p. 6998-7002. ISBN: 978-1-7281-7605-5. Detail

    KOCOUR, M.; CÁMBARA, G.; LUQUE, J.; BONET, D.; FARRÚS, M.; KARAFIÁT, M.; VESELÝ, K.; ČERNOCKÝ, J. BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge. Proceedings of IberSPEECH 2021. Valladolid: International Speech Communication Association, 2021. p. 113-117. Detail

    VYDANA, H.; KARAFIÁT, M.; BURGET, L.; ČERNOCKÝ, J. The IWSLT 2021 BUT Speech Translation Systems. In Proceedings of 18th International Conference on Spoken Language Translation (IWSLT). Bangkok, on-line: Association for Computational Linguistics, 2021. p. 75-83. ISBN: 978-1-7138-3378-9. Detail

    VYDANA, H.; KARAFIÁT, M.; ŽMOLÍKOVÁ, K.; BURGET, L.; ČERNOCKÝ, J. Jointly Trained Transformers Models for Spoken Language Translation. In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, Ontario: IEEE Signal Processing Society, 2021. p. 7513-7517. ISBN: 978-1-7281-7605-5. Detail

  • 2020

    ŽMOLÍKOVÁ, K.; KOCOUR, M.; LANDINI, F.; BENEŠ, K.; KARAFIÁT, M.; VYDANA, H.; LOZANO DÍEZ, A.; PLCHOT, O.; BASKAR, M.; ŠVEC, J.; MOŠNER, L.; MALENOVSKÝ, V.; BURGET, L.; YUSUF, B.; NOVOTNÝ, O.; GRÉZL, F.; SZŐKE, I.; ČERNOCKÝ, J. BUT System for CHiME-6 Challenge. Proceedings of CHiME 2020 Virtual Workshop. Barcelona: University of Sheffield, 2020. p. 1-3. Detail

  • 2019

    BASKAR, M.; BURGET, L.; WATANABE, S.; KARAFIÁT, M.; HORI, T.; ČERNOCKÝ, J. Promising Accurate Prefix Boosting For Sequence-to-sequence ASR. In Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019. p. 5646-5650. ISBN: 978-1-5386-4658-8. Detail

    KARAFIÁT, M.; BASKAR, M.; WATANABE, S.; HORI, T.; WIESNER, M.; ČERNOCKÝ, J. Analysis of Multilingual Sequence-to-Sequence Speech Recognition Systems. In Proceedings of Interspeech. Proceedings of Interspeech. Graz: International Speech Communication Association, 2019. p. 2220-2224. ISSN: 1990-9772. Detail

  • 2018

    CHO, J.; BASKAR, M.; LI, R.; WIESNER, M.; MALLIDI, S.; YALTA, N.; KARAFIÁT, M.; WATANABE, S.; HORI, T. Multilingual Sequence-to-Sequence Speech Recognition: Architecture, Transfer Learning, and Language Modeling. In Proceedings of 2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018). Athens: IEEE Signal Processing Society, 2018. p. 521-527. ISBN: 978-1-5386-4334-1. Detail

    KARAFIÁT, M.; BASKAR, M.; SZŐKE, I.; MALENOVSKÝ, V.; VESELÝ, K.; GRÉZL, F.; BURGET, L.; ČERNOCKÝ, J. BUT OpenSAT 2017 speech recognition system. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018. p. 2638-2642. ISSN: 1990-9772. Detail

    KARAFIÁT, M.; BASKAR, M.; VESELÝ, K.; GRÉZL, F.; BURGET, L.; ČERNOCKÝ, J. Analysis of Multilingual BLSTM Acoustic Model on Low and High Resource Languages. In Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018. p. 5789-5793. ISBN: 978-1-5386-4658-8. Detail

    PULUGUNDLA, B.; BASKAR, M.; KESIRAJU, S.; EGOROVA, E.; KARAFIÁT, M.; BURGET, L.; ČERNOCKÝ, J. BUT system for low resource Indian language ASR. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018. p. 3182-3186. ISSN: 1990-9772. Detail

  • 2017

    BASKAR, M.; KARAFIÁT, M.; BURGET, L.; VESELÝ, K.; GRÉZL, F.; ČERNOCKÝ, J. Residual Memory Networks: Feed-forward approach to learn long-term temporal dependencies. In Proceedings of ICASSP 2017. New Orleans: IEEE Signal Processing Society, 2017. p. 4810-4814. ISBN: 978-1-5090-4117-6. Detail

    KARAFIÁT, M.; BASKAR, M.; MATĚJKA, P.; VESELÝ, K.; GRÉZL, F.; BURGET, L.; ČERNOCKÝ, J. 2016 BUT Babel system: Multilingual BLSTM acoustic model with i-vector based adaptation. In Proceedings of Interspeech 2017. Proceedings of Interspeech. Stockholm: International Speech Communication Association, 2017. p. 719-723. ISSN: 1990-9772. Detail

    KARAFIÁT, M.; VESELÝ, K.; ŽMOLÍKOVÁ, K.; DELCROIX, M.; WATANABE, S.; BURGET, L.; ČERNOCKÝ, J.; SZŐKE, I. Training Data Augmentation and Data Selection. In New Era for Robust Speech Recognition: Exploiting Deep Learning. Computer Science, Artificial Intelligence. Heidelberg: Springer International Publishing, 2017. p. 245-260. ISBN: 978-3-319-64679-4. Detail

    PAPADOPOULOS, P.; TRAVADI, R.; VAZ, C.; MALANDRAKIS, N.; HERMJAKOB, U.; POURDAMGHANI, N.; PUST, M.; ZHANG, B.; PAN, X.; LU, D.; LIN, Y.; GLEMBEK, O.; BASKAR, M.; KARAFIÁT, M.; BURGET, L.; HASEGAWA-JOHNSON, M.; JI, H.; MAY, J.; KNIGHT, K.; NARAYANAN, S. Team ELISA System for DARPA LORELEI Speech Evaluation 2016. In Proceedings of Interspeech 2017. Proceedings of Interspeech. Stockholm: International Speech Communication Association, 2017. p. 2053-2057. ISSN: 1990-9772. Detail

  • 2016

    CHALUPNÍČEK, K.; KARAFIÁT, M.; ŽIŽKA, J. Souhrnná zpráva k projektu "Zpracování audiovizuálních dat pro Superlectures.com" za rok 2016. Brno: ReplayWell, s. r. o., 2016. s. 1 (1 s.). Detail

    GRÉZL, F.; EGOROVA, E.; KARAFIÁT, M. Study of Large Data Resources for Multilingual Training and System Porting. In Procedia Computer Science. Procedia Computer Science. Yogyakarta: Elsevier Science, 2016. p. 15-22. ISSN: 1877-0509. Detail

    GRÉZL, F.; KARAFIÁT, M. Boosting Performance on Low-resource Languages by Standard Corpora: AN ANALYSIS. In Proceedings of SLT 2016. San Diego: IEEE Signal Processing Society, 2016. p. 629-636. ISBN: 978-1-5090-4903-5. Detail

    GRÉZL, F.; KARAFIÁT, M. Bottle-Neck Feature Extraction Structures for Multilingual Training and Porting. In Procedia Computer Science. Procedia Computer Science. Yogyakarta: Elsevier Science, 2016. p. 144-151. ISSN: 1877-0509. Detail

    KARAFIÁT, M.; BASKAR, M.; MATĚJKA, P.; VESELÝ, K.; GRÉZL, F.; ČERNOCKÝ, J. Multilingual BLSTM and Speaker-Specific Vector Adaptation in 2016 BUT BABEL SYSTEM. In Proceedings of SLT 2016. San Diego: IEEE Signal Processing Society, 2016. p. 637-643. ISBN: 978-1-5090-4903-5. Detail

    KARAFIÁT, M.; BURGET, L.; GRÉZL, F.; VESELÝ, K.; ČERNOCKÝ, J. Multilingual Region-Dependent Transforms. In Proceedings of the 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016. Shanghai: IEEE Signal Processing Society, 2016. p. 5430-5434. ISBN: 978-1-4799-9988-0. Detail

    PLCHOT, O.; MATĚJKA, P.; FÉR, R.; GLEMBEK, O.; NOVOTNÝ, O.; PEŠÁN, J.; VESELÝ, K.; ONDEL YANG, L.; KARAFIÁT, M.; GRÉZL, F.; KESIRAJU, S.; BURGET, L.; BRUMMER, J.; SWART, A.; CUMANI, S.; MALLIDI, S.; LI, R. BAT System Description for NIST LRE 2015. In Proceedings of Odyssey 2016, The Speaker and Language Recognition Workshop. Proceedings of Odyssey: The Speaker and Language Recognition Workshop Odyssey 2014, Joensuu, Finland. Bilbao: International Speech Communication Association, 2016. p. 166-173. ISSN: 2312-2846. Detail

    SKÁCEL, M.; KARAFIÁT, M.; ONDEL YANG, L.; UCHYTIL, A.; SZŐKE, I. BUT Zero-Cost Speech Recognition 2016 System Description. In CEUR Workshop Proceedings. CEUR Workshop Proceedings. Hilversum: CEUR-WS.org, 2016. p. 1-3. ISSN: 1613-0073. Detail

    VESELÝ, K.; WATANABE, S.; ŽMOLÍKOVÁ, K.; KARAFIÁT, M.; BURGET, L.; ČERNOCKÝ, J. Sequence Summarizing Neural Network for Speaker Adaptation. In Proceedings of the 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016. Shanghai: IEEE Signal Processing Society, 2016. p. 5315-5319. ISBN: 978-1-4799-9988-0. Detail

    ŽMOLÍKOVÁ, K.; KARAFIÁT, M.; VESELÝ, K.; DELCROIX, M.; WATANABE, S.; BURGET, L.; ČERNOCKÝ, J. Data selection by sequence summarizing neural network in mismatch condition training. In Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016. p. 2354-2358. ISBN: 978-1-5108-3313-5. Detail

  • 2015

    HSIAO, R.; MA, J.; HARTMANN, W.; KARAFIÁT, M.; GRÉZL, F.; BURGET, L.; SZŐKE, I.; ČERNOCKÝ, J.; WATANABE, S.; CHEN, Z.; MALLIDI, S.; HEŘMANSKÝ, H.; TSAKALIDIS, S.; SCHWARTZ, R. Robust Speech Recognition in Unknown Reverberant and Noisy Conditions. In Proceedings of 2015 IEEE Automatic Speech Recognition and Understanding Workshop. Scottsdale, Arizona: IEEE Signal Processing Society, 2015. p. 533-538. ISBN: 978-1-4799-7291-3. Detail

    KARAFIÁT, M.; GRÉZL, F.; BURGET, L.; SZŐKE, I.; ČERNOCKÝ, J. Three ways to adapt a CTS recognizer to unseen reverberated speech in BUT system for the ASpIRE challenge. In Proceedings of Interspeech 2015. Proceedings of Interspeech. Dresden: International Speech Communication Association, 2015. p. 2454-2458. ISBN: 978-1-5108-1790-6. ISSN: 1990-9772. Detail

  • 2014

    CHALUPNÍČEK, K.; KARAFIÁT, M.; ŽIŽKA, J. Souhrnná zpráva k projektu "Zpracování audiovizuálních dat pro Superlectures.com" za rok 2014. Brno: ReplayWell, s. r. o., 2014. s. 1 (1 s.). Detail

    GRÉZL, F.; EGOROVA, E.; KARAFIÁT, M. Further Investigation into Multilingual Training and Adaptation of Stacked Bottle-neck Neural Network Structure. In Proceedings of 2014 Spoken Language Technology Workshop. South Lake Tahoe, Nevada: IEEE Signal Processing Society, 2014. p. 48-53. ISBN: 978-1-4799-7129-9. Detail

    GRÉZL, F.; KARAFIÁT, M. Adapting Multilingual Neural Network Hierarchy to a New Language. Proceedings of the 4th International Workshop on Spoken Language Technologies for Under-resourced Languages SLTU-2014. St. Petersburg, Russia, 2014. St. Petersburg: International Speech Communication Association, 2014. p. 39-45. ISBN: 978-5-8088-0908-6. Detail

    GRÉZL, F.; KARAFIÁT, M. Combination of Multilingual and Semi-Supervised Training for Under-Resourced Languages. In Proceedings of Interspeech 2014. Singapore: International Speech Communication Association, 2014. p. 820-824. ISBN: 978-1-63439-435-2. Detail

    GRÉZL, F.; KARAFIÁT, M.; VESELÝ, K. Adaptation of Multilingual Stacked Bottle-neck Neural Network Structure for New Language. In Proceedings of ICASSP 2014. Florence: IEEE Signal Processing Society, 2014. p. 7704-7708. ISBN: 978-1-4799-2892-7. Detail

    KARAFIÁT, M.; GRÉZL, F.; HANNEMANN, M.; ČERNOCKÝ, J. BUT Neural Network Features for Spontaneous Vietnamese in BABEL. In Proceedings of ICASSP 2014. Florence: IEEE Signal Processing Society, 2014. p. 5659-5663. ISBN: 978-1-4799-2892-7. Detail

    KARAFIÁT, M.; GRÉZL, F.; VESELÝ, K.; HANNEMANN, M.; SZŐKE, I.; ČERNOCKÝ, J. BUT 2014 Babel System: Analysis of adaptation in NN based systems. In Proceedings of Interspeech 2014. Singapore: International Speech Communication Association, 2014. p. 3002-3006. ISBN: 978-1-63439-435-2. Detail

    KARAFIÁT, M.; VESELÝ, K.; SZŐKE, I.; BURGET, L.; GRÉZL, F.; HANNEMANN, M.; ČERNOCKÝ, J. BUT ASR System for BABEL Surprise Evaluation 2014. In Proceedings of 2014 Spoken Language Technology Workshop. South Lake Tahoe, Nevada: IEEE Signal Processing Society, 2014. p. 501-506. ISBN: 978-1-4799-7129-9. Detail

    NG, T.; HSIAO, R.; ZHANG, L.; KARAKOS, D.; MALLIDI, S.; KARAFIÁT, M.; VESELÝ, K.; SZŐKE, I.; ZHANG, B.; NGUYEN, L.; SCHWARTZ, R. Progress in the BBN Keyword Search System for the DARPA RATS Program. In Proceedings of Interspeech 2014. Singapore: International Speech Communication Association, 2014. p. 959-963. ISBN: 978-1-63439-435-2. Detail

  • 2013

    EGOROVA, E.; VESELÝ, K.; KARAFIÁT, M.; JANDA, M.; ČERNOCKÝ, J. Manual and Semi-Automatic Approaches to Building a Multilingual Phoneme Set. In Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013. p. 7324-7328. ISBN: 978-1-4799-0355-9. Detail

    GRÉZL, F.; KARAFIÁT, M. Semi-Supervised Bootstrapping Approach For Neural Network Feature Extractor Training. Proceedings of ASRU 2013. Olomouc: IEEE Signal Processing Society, 2013. p. 470-475. ISBN: 978-1-4799-2755-5. Detail

    GRÉZL, F.; KARAFIÁT, M.; VESELÝ, K.; ŽIŽKA, J. Souhrnná zpráva k projektu "Zpracování audiovizuálních dat pro Superlectures.com" za rok 2013. Brno: ReplayWell, s. r. o., 2013. s. 0-0. Detail

    KARAFIÁT, M.; GRÉZL, F.; HANNEMANN, M.; VESELÝ, K.; ČERNOCKÝ, J. BUT BABEL System for Spontaneous Cantonese. Proceedings of Interspeech 2013. Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013). Lyon: International Speech Communication Association, 2013. p. 2589-2593. ISBN: 978-1-62993-443-3. ISSN: 2308-457X. Detail

    KARAKOS, D.; SCHWARTZ, R.; TSAKALIDIS, S.; ZHANG, L.; RANJAN, S.; NG, T.; HSIAO, R.; NGUYEN, L.; GRÉZL, F.; HANNEMANN, M.; KARAFIÁT, M.; SZŐKE, I.; VESELÝ, K. Score Normalization and System Combination for Improved Keyword Spotting. In Proceedings of ASRU 2013. Olomouc: IEEE Signal Processing Society, 2013. p. 210-215. ISBN: 978-1-4799-2755-5. Detail

    MOTLÍČEK, P.; POVEY, D.; KARAFIÁT, M. Feature And Score Level Combination Of Subspace Gaussians In LVCSR Task. Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013. p. 7604-7608. ISBN: 978-1-4799-0355-9. Detail

    RATH, S.; BURGET, L.; KARAFIÁT, M.; GLEMBEK, O.; ČERNOCKÝ, J. A Region-specific Feature-space Transformation for Speaker Adaptation and Singularity Analysis of Jacobian Matrix. Proceedings of Interspeech 2013. Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013). Lyon: International Speech Communication Association, 2013. p. 1228-1232. ISBN: 978-1-62993-443-3. ISSN: 2308-457X. Detail

  • 2012

    BRUMMER, J.; CUMANI, S.; GLEMBEK, O.; KARAFIÁT, M.; MATĚJKA, P.; PEŠÁN, J.; PLCHOT, O.; SOUFIFAR, M.; DE VILLIERS, E.; ČERNOCKÝ, J. Description and analysis of the Brno276 system for LRE2011. In Proceedings of Odyssey 2012: The Speaker and Language Recognition Workshop. Singapore: International Speech Communication Association, 2012. p. 216-223. ISBN: 978-981-07-3093-2. Detail

    CUMANI, S.; PLCHOT, O.; KARAFIÁT, M. Independent Component Analysis and MLLR Transforms for Speaker Identification. Proc. International Conference on Acoustics, Speech, and Signal P. Kyoto: IEEE Signal Processing Society, 2012. p. 4365-4368. ISBN: 978-1-4673-0044-5. Detail

    HAIN, T.; BURGET, L.; DINES, J.; GARNER, P.; GRÉZL, F.; EL HANNANI, A.; HUIJBREGTS, M.; KARAFIÁT, M.; LINCOLN, M.; WAN, V. Transcribing Meetings with the AMIDA System. IEEE Transactions on Audio, Speech, and Language Processing, 2012, vol. 20, no. 2, p. 486-498. ISSN: 1558-7916. Detail

    JANDA, M.; KARAFIÁT, M.; ČERNOCKÝ, J. Dealing with Numbers in Grapheme-Based Speech Recognition. Proceedings of 15th International Conference on Text, Speech and Dialogue. Lecture Notes in Computer Science. Lecture Notes in Computer Science, 2012, Volume 7499. Springer-Verlag Berlin Heidelberg 2012: Springer Verlag, 2012. p. 438-445. ISBN: 978-3-642-32789-6. ISSN: 0302-9743. Detail

    KARAFIÁT, M.; JANDA, M.; ČERNOCKÝ, J.; BURGET, L. Region Dependent Linear Transforms in Multilingual Speech Recognition. In Proc. International Conference on Acoustics, Speech, and Signal Processing 2012. Kyoto: IEEE Signal Processing Society, 2012. p. 4885-4888. ISBN: 978-1-4673-0044-5. Detail

    KOMBRINK, S.; MIKOLOV, T.; KARAFIÁT, M.; BURGET, L. Improving Language Models for ASR Using Translated In-domain Data. Proceedings of 2012 IEEE International Conference on Acoustics, Speech and Signal Processing. Kyoto: IEEE Signal Processing Society, 2012. p. 4405-4408. ISBN: 978-1-4673-0044-5. Detail

    PLCHOT, O.; KARAFIÁT, M.; BRUMMER, J.; GLEMBEK, O.; MATĚJKA, P.; DE VILLIERS, E.; ČERNOCKÝ, J. Speaker vectors from Subspace Gaussian Mixture Model as complementary features for Language Identification. In Proceedings of Odyssey 2012, The Speaker and Language Recognition Workshop. Singapore: International Speech Communication Association, 2012. p. 330-333. ISBN: 978-981-07-3093-2. Detail

    POVEY, D.; HANNEMANN, M.; BOULIANNE, G.; BURGET, L.; GHOSHAL, A.; JANDA, M.; KARAFIÁT, M.; KOMBRINK, S.; MOTLÍČEK, P.; QIAN, Y.; RIEDHAMMER, K.; VESELÝ, K.; VU, N. Generating Exact Lattices in The WFST Framework. Proceedings of 2012 IEEE International Conference on Acoustics, Speech and Signal Processing. Kyoto: IEEE Signal Processing Society, 2012. p. 4213-4216. ISBN: 978-1-4673-0044-5. Detail

    RATH, S.; KARAFIÁT, M.; GLEMBEK, O.; ČERNOCKÝ, J. A factorized representation of FMLLR transform based on QR-decomposition. Proceedings of Interspeech 2012. Proceedings of Interspeech. Portland, Oregon: International Speech Communication Association, 2012. p. 1-4. ISBN: 978-1-62276-759-5. ISSN: 1990-9772. Detail

    VESELÝ, K.; KARAFIÁT, M.; GRÉZL, F.; JANDA, M.; EGOROVA, E. The Language-Independent Bottleneck Features. Proceedings of IEEE 2012 Workshop on Spoken Language Technology. Miami: IEEE Signal Processing Society, 2012. p. 336-341. ISBN: 978-1-4673-5124-9. Detail

  • 2011

    DEORAS, A.; MIKOLOV, T.; KOMBRINK, S.; KARAFIÁT, M.; KHUDANPUR, S. Variational Approximation of Long-span Language Models for LVCSR. Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Prague: IEEE Signal Processing Society, 2011. p. 5532-5535. ISBN: 978-1-4577-0537-3. Detail

    GLEMBEK, O.; BURGET, L.; KENNY, P.; KARAFIÁT, M.; MATĚJKA, P. Simplification and optimization of I-Vector Extraction. Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Prague: IEEE Signal Processing Society, 2011. p. 4516-4519. ISBN: 978-1-4577-0537-3. Detail

    GRÉZL, F.; KARAFIÁT, M. Integrating recent MLP feature extraction techniques into TRAP architecture. Proceedings of Interspeech 2011. Proceedings of Interspeech. Florence: International Speech Communication Association, 2011. p. 1229-1232. ISBN: 978-1-61839-270-1. ISSN: 1990-9772. Detail

    GRÉZL, F.; KARAFIÁT, M.; JANDA, M. Study of Probabilistic and Bottle-Neck Features in Multilingual Environment. Proceedings of ASRU 2011. Hilton Waikoloa Village, Big Island, Hawaii: IEEE Signal Processing Society, 2011. p. 359-364. ISBN: 978-1-4673-0366-8. Detail

    KARAFIÁT, M.; BURGET, L.; MATĚJKA, P.; GLEMBEK, O.; ČERNOCKÝ, J. iVector-Based Discriminative Adaptation for Automatic Speech Recognition. Proceedings of ASRU 2011. Hilton Waikoloa Village, Big Island, Hawaii: IEEE Signal Processing Society, 2011. p. 152-157. ISBN: 978-1-4673-0366-8. Detail

    KOMBRINK, S.; MIKOLOV, T.; KARAFIÁT, M.; BURGET, L. Recurrent Neural Network based Language Modeling in Meeting Recognition. Proceedings of Interspeech 2011. Proceedings of Interspeech. Florence: International Speech Communication Association, 2011. p. 2877-2880. ISBN: 978-1-61839-270-1. ISSN: 1990-9772. Detail

    POVEY, D.; BURGET, L.; AGARWAL, M.; AKYAZI, P.; GHOSHAL, A.; GLEMBEK, O.; GOEL, N.; KARAFIÁT, M.; RASTROW, A.; ROSE, R.; SCHWARZ, P.; THOMAS, S. The subspace Gaussian mixture model-A structured model for speech recognition. COMPUTER SPEECH AND LANGUAGE, 2011, vol. 25, no. 2, p. 404-439. ISSN: 0885-2308. Detail

    POVEY, D.; KARAFIÁT, M.; GHOSHAL, A.; SCHWARZ, P. A Symmetrization of the Subspace Gaussian Mixture Model. Proceedings of 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing. Prague: IEEE Signal Processing Society, 2011. p. 4504-4507. ISBN: 978-1-4577-0537-3. Detail

    VESELÝ, K.; KARAFIÁT, M.; GRÉZL, F. Convolutive Bottleneck Network Features for LVCSR. Proceedings of ASRU 2011. Big Island, Hawaii: IEEE Signal Processing Society, 2011. p. 42-47. ISBN: 978-1-4673-0366-8. Detail

  • 2010

    BRUMMER, J.; BURGET, L.; KENNY, P.; MATĚJKA, P.; DE VILLIERS, E.; KARAFIÁT, M.; KOCKMANN, M.; GLEMBEK, O.; PLCHOT, O.; BAUM, D.; SENOUSSAOUI, M. ABC System description for NIST SRE 2010. Proc. NIST 2010 Speaker Recognition Evaluation. Brno: National Institute of Standards and Technology, 2010. p. 1-20. Detail

    BURGET, L.; SCHWARZ, P.; AGARWAL, M.; AKYAZI, P.; FENG, K.; GHOSHAL, A.; GLEMBEK, O.; GOEL, N.; KARAFIÁT, M.; POVEY, D.; RASTROW, A.; ROSE, R.; THOMAS, S. Multilingual acoustic modeling for speech recognition based on Subspace Gaussian Mixture Models. Proc. International Conference on Acoustics, Speech, and Signal Processing. Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010. p. 4334-4337. ISBN: 978-1-4244-4296-6. ISSN: 1520-6149. Detail

    GHOSHAL, A.; POVEY, D.; AGARWAL, M.; AKYAZI, P.; BURGET, L.; FENG, K.; GLEMBEK, O.; GOEL, N.; KARAFIÁT, M.; RASTROW, A.; ROSE, R.; SCHWARZ, P.; THOMAS, S. A novel estimation of feature-space MLLR for full-covariance models. Proc. International Conference on Acoustics, Speech, and Signal Processing. Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010. p. 4310-4313. ISBN: 978-1-4244-4296-6. ISSN: 1520-6149. Detail

    GOEL, N.; THOMAS, S.; AGARWAL, M.; AKYAZI, P.; BURGET, L.; FENG, K.; GHOSHAL, A.; GLEMBEK, O.; KARAFIÁT, M.; POVEY, D.; RASTROW, A.; ROSE, R.; SCHWARZ, P. Approaches to automatic lexicon learning with limited training examples. Proc. International Conference on Acoustics, Speech, and Signal Processing. Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010. p. 5094-5097. ISBN: 978-1-4244-4296-6. ISSN: 1520-6149. Detail

    GRÉZL, F.; KARAFIÁT, M. Hierarchical Neural Net Architectures for Feature Extraction in ASR. Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Proceedings of Interspeech. Makuhari, Chiba: International Speech Communication Association, 2010. p. 1201-1204. ISBN: 978-1-61782-123-3. ISSN: 1990-9772. Detail

    HAIN, T.; BURGET, L.; DINES, J.; GARNER, P.; EL HANNANI, A.; HUIJBREGTS, M.; KARAFIÁT, M.; LINCOLN, M.; WAN, V. The AMIDA 2009 Meeting Transcription System. Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Proceedings of Interspeech. Makuhari, Chiba: International Speech Communication Association, 2010. p. 358-361. ISBN: 978-1-61782-123-3. ISSN: 1990-9772. Detail

    HANNEMANN, M.; KOMBRINK, S.; KARAFIÁT, M.; BURGET, L. Similarity Scoring for Recognizing Repeated Out-of-Vocabulary Words. Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Proceedings of Interspeech. Makuhari, Chiba: International Speech Communication Association, 2010. p. 897-900. ISBN: 978-1-61782-123-3. ISSN: 1990-9772. Detail

    JANČÍK, Z.; PLCHOT, O.; BRUMMER, J.; BURGET, L.; GLEMBEK, O.; HUBEIKA, V.; KARAFIÁT, M.; MATĚJKA, P.; MIKOLOV, T.; STRASHEIM, A.; ČERNOCKÝ, J. Data selection and calibration issues in automatic language recognition - investigation with BUT-AGNITIO NIST LRE 2009 system. In Proc. Odyssey 2010 - The Speaker and Language Recognition Workshop. Brno: International Speech Communication Association, 2010. p. 215-221. ISBN: 978-80-214-4114-9. Detail

    KARAFIÁT, M.; SZŐKE, I.; ČERNOCKÝ, J. Using Gradient Descent Optimization for Acoustics Training from Heterogeneous Data. Proc. Text, Speech and Dialogue 2010. Lecture Notes in Computer Science. LNAI 6231. Brno: Springer Verlag, 2010. p. 322-329. ISBN: 978-3-642-15759-2. ISSN: 0302-9743. Detail

    MIKOLOV, T.; KARAFIÁT, M.; BURGET, L.; ČERNOCKÝ, J.; KHUDANPUR, S. Recurrent neural network based language model. Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Proceedings of Interspeech. Makuhari, Chiba: International Speech Communication Association, 2010. p. 1045-1048. ISBN: 978-1-61782-123-3. ISSN: 1990-9772. Detail

    POVEY, D.; BURGET, L.; AGARWAL, M.; AKYAZI, P.; FENG, K.; GHOSHAL, A.; GLEMBEK, O.; GOEL, N.; KARAFIÁT, M.; RASTROW, A.; ROSE, R.; SCHWARZ, P.; THOMAS, S. Subspace Gaussian mixture models for speech recognition. Proc. International Conference on Acoustics, Speech, and Signal Processing. Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010. p. 4330-4333. ISBN: 978-1-4244-4296-6. ISSN: 1520-6149. Detail

    ROSE, R.; NOROUZIAN, A.; REDDY, A.; COY, A.; GUPTA, V.; KARAFIÁT, M. Subword-based spoken term detection in audio course lectures. Proc. International Conference on Acoustics, Speech, and Signal Processing. Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010. p. 5282-5285. ISBN: 978-1-4244-4296-6. ISSN: 1520-6149. Detail

  • 2009

    BRÜMMER, N.; BURGET, L.; GLEMBEK, O.; HUBEIKA, V.; JANČÍK, Z.; KARAFIÁT, M.; MATĚJKA, P.; MIKOLOV, T.; PLCHOT, O.; STRASHEIM, A. BUT-AGNITIO System Description for NIST Language Recognition Evaluation 2009. Proceedings NIST 2009 Language Recognition Evaluation Workshop. Baltimore, Maryland, USA: National Institute of Standards and Technology, 2009. p. 1-7. Detail

    BURGET, L.; FAPŠO, M.; HUBEIKA, V.; GLEMBEK, O.; KARAFIÁT, M.; KOCKMANN, M.; MATĚJKA, P.; SCHWARZ, P.; ČERNOCKÝ, J. BUT system for NIST 2008 speaker recognition evaluation. Proc. Interspeech 2009. Proceedings of Interspeech. Brighton: International Speech Communication Association, 2009. p. 2335-2338. ISBN: 978-1-61567-692-7. ISSN: 1990-9772. Detail

    GARNER, P.; DINES, J.; HAIN, T.; EL HANNANI, A.; KARAFIÁT, M.; KORCHAGIN, D.; LINCOLN, M.; WAN, V.; ZHANG, L. Real-Time ASR from Meetings. Proc. Interspeech 2009. Proceedings of Interspeech. Brighton: International Speech Communication Association, 2009. p. 2119-2122. ISSN: 1990-9772. Detail

    GRÉZL, F.; KARAFIÁT, M.; BURGET, L. Investigation into bottle-neck features for meeting speech recognition. Proc. Interspeech 2009. Proceedings of Interspeech. Brighton: International Speech Communication Association, 2009. p. 2947-2950. ISBN: 978-1-61567-692-7. ISSN: 1990-9772. Detail

    KOMBRINK, S.; BURGET, L.; MATĚJKA, P.; KARAFIÁT, M.; HEŘMANSKÝ, H. Posterior-based Out of Vocabulary Word Detection in Telephone Speech. Proc. Interspeech 2009. Proceedings of Interspeech. Brighton: International Speech Communication Association, 2009. p. 80-83. ISSN: 1990-9772. Detail

  • 2008

    BURGET, L.; FAPŠO, M.; HUBEIKA, V.; GLEMBEK, O.; KARAFIÁT, M.; KOCKMANN, M.; MATĚJKA, P.; SCHWARZ, P.; ČERNOCKÝ, J. Brno University Of Technology - NIST 2008 SRE. Montreal: 2008. p. 1-28. Detail

    KARAFIÁT, M.; BURGET, L.; HAIN, T.; ČERNOCKÝ, J. Discriminative training of narrow band - wide band adapted systems for meeting recognition. Proc. Interspeech 2008. Proceedings of Interspeech. Brisbane: International Speech Communication Association, 2008. p. 1-4. ISSN: 1990-9772. Detail

    KOPECKÝ, J.; GLEMBEK, O.; KARAFIÁT, M. Advances in Acoustic Modeling for the Recognition of Czech. Proc. 11th International Conference on Text, Speech and Dialogue. Lecture Notes in Computer Science. Berlin: Springer Verlag, 2008. p. 357-363. ISBN: 978-3-540-87390-7. Detail

  • 2007

    BRÜMMER, N.; BURGET, L.; ČERNOCKÝ, J.; GLEMBEK, O.; GRÉZL, F.; KARAFIÁT, M.; VAN LEEUWEN, D.; MATĚJKA, P.; SCHWARZ, P.; STRASHEIM, A. Fusion of heterogeneous speaker recognition systems in the STBU submission for the NIST speaker recognition evaluation 2006. IEEE Transactions on Audio, Speech, and Language Processing, 2007, vol. 15, no. 7, p. 2072-2084. ISSN: 1558-7916. Detail

    GRÉZL, F.; KARAFIÁT, M.; ČERNOCKÝ, J. Neural network topologies and bottle neck features in speech recognition. Brno: 2007. p. 78-82. Detail

    MATĚJKA, P.; BURGET, L.; SCHWARZ, P.; GLEMBEK, O.; KARAFIÁT, M.; GRÉZL, F.; ČERNOCKÝ, J.; VAN LEEUWEN, D.; BRÜMMER, N.; STRASHEIM, A. STBU system for the NIST 2006 speaker recognition evaluation. Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007). Honolulu: IEEE Signal Processing Society, 2007. p. 221-224. ISBN: 1-4244-0728-1. Detail

    MIKOLOV, T.; OPARIN, I.; GLEMBEK, O.; BURGET, L.; KARAFIÁT, M.; ČERNOCKÝ, J. Použití mluvených korpusů ve vývoji systému pro rozpoznávání českých přednášek. Praha: Univerzita Karlova v Praze, 2007. s. 1-5. Detail

    SZŐKE, I.; BURGET, L.; KARAFIÁT, M. Combination of Word and Phoneme Approach for Spoken Term Detection. Brno: 2007. p. 1 (1 s.). Detail

    SZŐKE, I.; FAPŠO, M.; KARAFIÁT, M.; BURGET, L.; GRÉZL, F.; SCHWARZ, P.; GLEMBEK, O.; MATĚJKA, P.; KOPECKÝ, J.; ČERNOCKÝ, J. Spoken Term Detection System Based on a Combination of LVCSR and Phonetic Search. Brno: 2007. p. 1 (1 s.). Detail

  • 2006

    FAPŠO, M.; SMRŽ, P.; SCHWARZ, P.; SZŐKE, I.; SCHWARZ, M.; ČERNOCKÝ, J.; KARAFIÁT, M.; BURGET, L. Information Retrieval from Spoken Documents. In Proceedings of the Seventh International Conference on Intelligent Text Processing and Computational Linguistics (CICLING 2006). Mexico City: Springer Verlag, 2006. p. 410-416. ISBN: 3-540-32205-1. Detail

    KARAFIÁT, M.; GRÉZL, F.; SCHWARZ, P.; BURGET, L.; ČERNOCKÝ, J. Robust heteroscedastic linear discriminant analysis and LCRC posterior features in meeting data recognition. In Proc. 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006). LNCS 4299. Berlin: Springer Verlag, 2006. p. 275-284. ISBN: 3-540-69267-3. Detail

  • 2005

    FAPŠO, M.; SCHWARZ, P.; SZŐKE, I.; ČERNOCKÝ, J.; SMRŽ, P.; BURGET, L.; KARAFIÁT, M. Search Engine for Information Retrieval from Multi-modal Records. Edinburgh: 2005. Detail

    FAPŠO, M.; SMRŽ, P.; SCHWARZ, P.; SZŐKE, I.; BURGET, L.; KARAFIÁT, M.; ČERNOCKÝ, J. Systém pre efektívne vyhľadávanie v rečových databázach. In Sborník databázové konference DATAKON 2005. Brno: Masaryk University, 2005. s. 323-333. ISBN: 80-210-3813-6. Detail

    SZŐKE, I.; SCHWARZ, P.; BURGET, L.; KARAFIÁT, M.; ČERNOCKÝ, J. Phoneme based acoustics keyword spotting in informal continuous speech. In Radioelektronika 2005. Brno: Faculty of Electrical Engineering and Communication BUT, 2005. p. 195-198. ISBN: 80-214-2904-6. Detail

    SZŐKE, I.; SCHWARZ, P.; BURGET, L.; KARAFIÁT, M.; MATĚJKA, P.; ČERNOCKÝ, J. Phoneme Based Acoustics Keyword Spotting in Informal Continuous Speech. Lecture Notes in Computer Science, 2005, vol. 2005, no. 3658, p. 302. ISSN: 0302-9743. Detail

  • 2004

    KARAFIÁT, M.; GRÉZL, F.; ČERNOCKÝ, J. TRAP based features for LVCSR of meeting data. In Proc. 8th International Conference on Spoken Language Processing. 8th International Conference on Spoken Language Processing. Jeju Island: Sunjin Printing Co, 2004. p. 437-440. ISSN: 1225-4111. Detail

  • 2003

    KARAFIÁT, M.; GRÉZL, F. Using MATLAB for Analysis of TRAP system. Radioengineering, 2003, vol. 2003, no. 4, p. 38-41. ISSN: 1210-2512. Detail

  • 2002

    KARAFIÁT, M.; ČERNOCKÝ, J. Context dependent Hidden Markov models in recognition of Czech. Proc. 12th International scientific conference Radioelektronika 2002. Bratislava: Slovak University of Technology in Bratislava, 2002. p. 0-0. ISBN: 80-227-1700-2. Detail

    KARAFIÁT, M.; ČERNOCKÝ, J. Differences between context dependent and context independent Hidden Markov Models for recognition of Czech. Proc. of 8th student conference STUDENT EEICT 2002. Brno: Faculty of Electrical Engineering TUB, 2002. p. 328-332. ISBN: 80-214-2116-9. Detail

    MATĚJKA, P.; SCHWARZ, P.; KARAFIÁT, M.; ČERNOCKÝ, J. Some like it Gaussian... Proc. 5th International Conference Text, Speech and Dialogue, TSD2002. Lecture notes in artificial intelligence 2448. Berlin: Springer Verlag, 2002. p. 321-324. ISBN: 3-540-44129-8. Detail
