Ing.

Igor Szőke

Ph.D.

assistant professor

+420 54114 1287
szoke@fit.vut.cz
L226 Office
17355/BUT personal number

Publications

  • 2023

    ZULUAGA-GOMEZ, J.; NIGMATULINA, I.; PRASAD, A.; MOTLÍČEK, P.; KHALIL, D.; MADIKERI, S.; TART, A.; SZŐKE, I.; LENDERS, V.; RIGAULT, M.; CHOUKRI, K. Lessons Learned in Transcribing 5000 h of Air Traffic Control Communications for Robust Automatic Speech Understanding. Aerospace, 2023, vol. 2023, no. 10, p. 1-33. ISSN: 2226-4310. Detail

  • 2022

    BLATT, A.; KOCOUR, M.; VESELÝ, K.; SZŐKE, I.; KLAKOW, D. Call-Sign Recognition and Understanding for Noisy Air-Traffic Transcripts Using Surveillance Information. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022. p. 8357-8361. ISBN: 978-1-6654-0540-9. Detail

    KOCOUR, M.; UMESH, J.; KARAFIÁT, M.; ŠVEC, J.; LOPEZ, F.; BENEŠ, K.; DIEZ SÁNCHEZ, M.; SZŐKE, I.; LUQUE, J.; VESELÝ, K.; BURGET, L.; ČERNOCKÝ, J. BCN2BRNO: ASR System Fusion for Albayzin 2022 Speech to Text Challenge. Proceedings of IberSpeech 2022. Granada: International Speech Communication Association, 2022. p. 276-280. Detail

  • 2021

    KOCOUR, M.; VESELÝ, K.; BLATT, A.; ZULUAGA-GOMEZ, J.; SZŐKE, I.; ČERNOCKÝ, J.; KLAKOW, D.; MOTLÍČEK, P. Boosting of Contextual Information in ASR for Air-Traffic Call-Sign Recognition. In Proceedings Interspeech 2021. Proceedings of Interspeech. Brno: International Speech Communication Association, 2021. p. 3301-3305. ISSN: 1990-9772. Detail

    KOCOUR, M.; VESELÝ, K.; SZŐKE, I.; KESIRAJU, S.; ZULUAGA-GOMEZ, J.; BLATT, A.; PRASAD, A.; NIGMATULINA, I.; MOTLÍČEK, P.; KLAKOW, D.; TART, A.; KOLČÁREK, P.; ČERNOCKÝ, J.; CEVENINI, C.; CHOUKRI, K.; RIGAULT, M.; LANDIS, F.; SARFJOO, S. Automatic Processing Pipeline for Collecting and Annotating Air-Traffic Voice Communication Data. In Proceedings of 9th OpenSky Symposium 2021, OpenSky Network, Brussels, Belgium. Proceedings. Brussels: MDPI, 2021. p. 1-10. ISSN: 2504-3900. Detail

    SZŐKE, I.; KESIRAJU, S.; NOVOTNÝ, O.; KOCOUR, M.; VESELÝ, K.; ČERNOCKÝ, J. Detecting English Speech in the Air Traffic Control Voice Communication. In Proceedings Interspeech 2021. Proceedings of Interspeech. Brno: International Speech Communication Association, 2021. p. 3286-3290. ISSN: 1990-9772. Detail

    ZULUAGA-GOMEZ, J.; NIGMATULINA, I.; PRASAD, A.; MOTLÍČEK, P.; VESELÝ, K.; KOCOUR, M.; SZŐKE, I. Contextual Semi-Supervised Learning: An Approach to Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems. In Proceedings Interspeech 2021. Proceedings of Interspeech. Brno: International Speech Communication Association, 2021. p. 3296-3300. ISSN: 1990-9772. Detail

  • 2020

    ZULUAGA-GOMEZ, J.; VESELÝ, K.; BLATT, A.; MOTLÍČEK, P.; KLAKOW, D.; TART, A.; SZŐKE, I.; PRASAD, A.; SARFJOO, S.; KOLČÁREK, P.; KOCOUR, M.; ČERNOCKÝ, J.; CEVENINI, C.; CHOUKRI, K.; RIGAULT, M.; LANDIS, F. Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications. Proceedings of the 8th OpenSky Symposium 2020. Proceedings. Brussels: MDPI, 2020. p. 1-10. ISSN: 2504-3900. Detail

    ŽMOLÍKOVÁ, K.; KOCOUR, M.; LANDINI, F.; BENEŠ, K.; KARAFIÁT, M.; VYDANA, H.; LOZANO DÍEZ, A.; PLCHOT, O.; BASKAR, M.; ŠVEC, J.; MOŠNER, L.; MALENOVSKÝ, V.; BURGET, L.; YUSUF, B.; NOVOTNÝ, O.; GRÉZL, F.; SZŐKE, I.; ČERNOCKÝ, J. BUT System for CHiME-6 Challenge. Proceedings of CHiME 2020 Virtual Workshop. Barcelona: University of Sheffield, 2020. p. 1-3. Detail

  • 2019

    SZŐKE, I.; SKÁCEL, M.; MOŠNER, L.; PALIESEK, J.; ČERNOCKÝ, J. Building and Evaluation of a Real Room Impulse Response Dataset. IEEE J-STSP, 2019, vol. 13, no. 4, p. 863-876. ISSN: 1932-4553. Detail

  • 2018

    KARAFIÁT, M.; BASKAR, M.; SZŐKE, I.; MALENOVSKÝ, V.; VESELÝ, K.; GRÉZL, F.; BURGET, L.; ČERNOCKÝ, J. BUT OpenSAT 2017 speech recognition system. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018. p. 2638-2642. ISSN: 1990-9772. Detail

    VESELÝ, K.; PERALES, C.; SZŐKE, I.; LUQUE, J.; ČERNOCKÝ, J. Lightly supervised vs. semi-supervised training of acoustic model on Luxembourgish for low-resource automatic speech recognition. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018. p. 2883-2887. ISSN: 1990-9772. Detail

  • 2017

    KARAFIÁT, M.; VESELÝ, K.; ŽMOLÍKOVÁ, K.; DELCROIX, M.; WATANABE, S.; BURGET, L.; ČERNOCKÝ, J.; SZŐKE, I. Training Data Augmentation and Data Selection. In New Era for Robust Speech Recognition: Exploiting Deep Learning. Computer Science, Artificial Intelligence. Heidelberg: Springer International Publishing, 2017. p. 245-260. ISBN: 978-3-319-64679-4. Detail

  • 2016

    KESIRAJU, S.; BURGET, L.; SZŐKE, I.; ČERNOCKÝ, J. Learning document representations using subspace multinomial model. In Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016. p. 700-704. ISBN: 978-1-5108-3313-5. Detail

    SKÁCEL, M.; KARAFIÁT, M.; ONDEL YANG, L.; UCHYTIL, A.; SZŐKE, I. BUT Zero-Cost Speech Recognition 2016 System Description. In CEUR Workshop Proceedings. CEUR Workshop Proceedings. Hilversum: CEUR-WS.org, 2016. p. 1-3. ISSN: 1613-0073. Detail

    SZŐKE, I.; ANGUERA, X. Zero-Cost Speech Recognition Task at Mediaeval 2016. In CEUR Workshop Proceedings. CEUR Workshop Proceedings. Hilversum: CEUR-WS.org, 2016. p. 1-3. ISSN: 1613-0073. Detail

  • 2015

    ANGUERA, X.; RODRIGUEZ-FUENTES, L.; BUZO, A.; METZE, F.; SZŐKE, I.; PENAGARIKANO, M. QUESST 2014: Evaluating Query-By-Example Speech Search in a Zero-Resource. In Proceedings of 2015 IEEE International Conference on Acoustics, Speech and Signal Processing. South Brisbane, Queensland: IEEE Signal Processing Society, 2015. p. 5833-5837. ISBN: 978-1-4673-6997-8. Detail

    HSIAO, R.; MA, J.; HARTMANN, W.; KARAFIÁT, M.; GRÉZL, F.; BURGET, L.; SZŐKE, I.; ČERNOCKÝ, J.; WATANABE, S.; CHEN, Z.; MALLIDI, S.; HEŘMANSKÝ, H.; TSAKALIDIS, S.; SCHWARTZ, R. Robust Speech Recognition in Unknown Reverberant and Noisy Conditions. In Proceedings of 2015 IEEE Automatic Speech Recognition and Understanding Workshop. Scottsdale, Arizona: IEEE Signal Processing Society, 2015. p. 533-538. ISBN: 978-1-4799-7291-3. Detail

    KARAFIÁT, M.; GRÉZL, F.; BURGET, L.; SZŐKE, I.; ČERNOCKÝ, J. Three ways to adapt a CTS recognizer to unseen reverberated speech in BUT system for the ASpIRE challenge. In Proceedings of Interspeech 2015. Proceedings of Interspeech. Dresden: International Speech Communication Association, 2015. p. 2454-2458. ISBN: 978-1-5108-1790-6. ISSN: 1990-9772. Detail

    SKÁCEL, M.; SZŐKE, I. BUT QUESST 2015 System Description. In CEUR Workshop Proceedings. CEUR Workshop Proceedings. Wurzen: CEUR-WS.org, 2015. p. 1-3. ISSN: 1613-0073. Detail

    SZŐKE, I.; METZE, F.; RODRIGUEZ-FUENTES, L.; PROENCA, J.; BUZO, A.; LOJKA, M.; ANGUERA, X.; XIONG, X. Query by Example Search on Speech at Mediaeval 2015. In CEUR Workshop Proceedings. CEUR Workshop Proceedings. Wurzen: CEUR-WS.org, 2015. p. 1-3. ISSN: 1613-0073. Detail

    SZŐKE, I.; SKÁCEL, M.; ČERNOCKÝ, J.; BURGET, L. Coping with Channel Mismatch in Query-By-Example - BUT QUESST 2014. In Proceedings of 2015 IEEE International Conference on Acoustics, Speech and Signal Processing. South Brisbane, Queensland: IEEE Signal Processing Society, 2015. p. 5838-5842. ISBN: 978-1-4673-6997-8. Detail

  • 2014

    ANGUERA, X.; RODRIGUEZ-FUENTES, L.; SZŐKE, I.; BUZO, A.; METZE, F. Query-by-example Spoken Term Detection Evaluation on Low-resource Languages. Proceedings of the 4th International Workshop on Spoken Language Technologies for Under-resourced Languages SLTU-2014. St. Petersburg, Russia. St. Petersburg: International Speech Communication Association, 2014. p. 24-31. ISBN: 978-5-8088-0908-6. Detail

    ANGUERA, X.; RODRIGUEZ-FUENTES, L.; SZŐKE, I.; BUZO, A.; METZE, F. Query by Example Search on Speech at Mediaeval 2014. In CEUR Workshop Proceedings. CEUR Workshop Proceedings. Barcelona: CEUR-WS.org, 2014. p. 1-2. ISSN: 1613-0073. Detail

    KARAFIÁT, M.; GRÉZL, F.; VESELÝ, K.; HANNEMANN, M.; SZŐKE, I.; ČERNOCKÝ, J. BUT 2014 Babel System: Analysis of adaptation in NN based systems. In Proceedings of Interspeech 2014. Singapore: International Speech Communication Association, 2014. p. 3002-3006. ISBN: 978-1-63439-435-2. Detail

    KARAFIÁT, M.; VESELÝ, K.; SZŐKE, I.; BURGET, L.; GRÉZL, F.; HANNEMANN, M.; ČERNOCKÝ, J. BUT ASR System for BABEL Surprise Evaluation 2014. In Proceedings of 2014 Spoken Language Technology Workshop. South Lake Tahoe, Nevada: IEEE Signal Processing Society, 2014. p. 501-506. ISBN: 978-1-4799-7129-9. Detail

    NG, T.; HSIAO, R.; ZHANG, L.; KARAKOS, D.; MALLIDI, S.; KARAFIÁT, M.; VESELÝ, K.; SZŐKE, I.; ZHANG, B.; NGUYEN, L.; SCHWARTZ, R. Progress in the BBN Keyword Search System for the DARPA RATS Program. In Proceedings of Interspeech 2014. Singapore: International Speech Communication Association, 2014. p. 959-963. ISBN: 978-1-63439-435-2. Detail

    SZŐKE, I.; BURGET, L.; GRÉZL, F.; ČERNOCKÝ, J.; ONDEL YANG, L. Calibration and Fusion of Query-by-example Systems - BUT SWS 2013. In Proceedings of ICASSP 2014. Florence: IEEE Signal Processing Society, 2014. p. 7899-7903. ISBN: 978-1-4799-2892-7. Detail

    SZŐKE, I.; SKÁCEL, M.; BURGET, L. BUT QUESST 2014 System Description. In CEUR Workshop Proceedings. CEUR Workshop Proceedings. Barcelona: CEUR-WS.org, 2014. p. 1-2. ISSN: 1613-0073. Detail

  • 2013

    ANGUERA, X.; METZE, F.; BUZO, A.; SZŐKE, I.; RODRIGUEZ-FUENTES, L. The Spoken Web Search Task. CEUR Workshop Proceedings. CEUR Workshop Proceedings. Barcelona: CEUR-WS.org, 2013. p. 1-2. ISSN: 1613-0073. Detail

    KARAKOS, D.; SCHWARTZ, R.; TSAKALIDIS, S.; ZHANG, L.; RANJAN, S.; NG, T.; HSIAO, R.; NGUYEN, L.; GRÉZL, F.; HANNEMANN, M.; KARAFIÁT, M.; SZŐKE, I.; VESELÝ, K. Score Normalization and System Combination for Improved Keyword Spotting. In Proceedings of ASRU 2013. Olomouc: IEEE Signal Processing Society, 2013. p. 210-215. ISBN: 978-1-4799-2755-5. Detail

    SZŐKE, I.; BURGET, L.; GRÉZL, F.; ONDEL YANG, L. BUT SWS 2013 - Massive Parallel Approach. In Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop. CEUR Workshop Proceedings. Barcelona: CEUR-WS.org, 2013. p. 1-2. ISSN: 1613-0073. Detail

  • 2012

    METZE, F.; RAJPUT, N.; ANGUERA, X.; DAVEL, M.; GRAVIER, G.; HEERDEN, C.; MANTENA, G.; MUSCARIELLO, A.; PRAHALLAD, K.; SZŐKE, I.; TEJEDOR, J. The Spoken WEB Search Task At Mediaeval 2011. Proc. International Conference on Acoustics, Speech, and Signal P. Kyoto: IEEE Signal Processing Society, 2012. p. 5165-5168. ISBN: 978-1-4673-0044-5. Detail

    MOTLÍČEK, P.; VALENTE, F.; SZŐKE, I. Improving Acoustic Based Keyword Spotting Using LVCSR Lattices. Proc. International Conference on Acoustics, Speech, and Signal Processing 2012. Kyoto: IEEE Signal Processing Society, 2012. p. 4413-4416. ISBN: 978-1-4673-0044-5. Detail

    SZŐKE, I.; FAPŠO, M.; VESELÝ, K. BUT2012 přístup pro Spoken Web Search úkol na MediaEval2012 [BUT2012 Approach for the Spoken Web Search Task at MediaEval2012]. Working Notes Proceedings of the MediaEval 2012 Workshop. CEUR Workshop Proceedings. Pisa: CEUR-WS.org, 2012. p. 1-2. ISSN: 1613-0073. Detail

    SZŐKE, I.; FAPŠO, M.; ŽIŽKA, J.; BERAN, V.; ČERNOCKÝ, J. Efektivní přístup ke znalostem v audio-vizuálních záznamech [Effective Access to Knowledge in Audio-Visual Recordings]. Proceedings of the Annual Database Conference. Prague: Technická univerzita v Košiciach, 2012. p. 57-74. ISBN: 978-80-553-1049-7. Detail

    TEJEDOR, J.; FAPŠO, M.; SZŐKE, I.; ČERNOCKÝ, J.; GRÉZL, F. Comparison of methods for language-dependent and language-independent query-by-example spoken term detection. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2012, vol. 2012, no. 30, p. 1-34. ISSN: 1046-8188. Detail

  • 2010

    ČERNOCKÝ, J.; SZŐKE, I.; HANNEMANN, M.; KOMBRINK, S. Word-subword based keyword spotting with implications in OOV detection. Pacific Grove: Institute of Electrical and Electronics Engineers, 2010. p. 0-0. Detail

    KARAFIÁT, M.; SZŐKE, I.; ČERNOCKÝ, J. Using Gradient Descent Optimization for Acoustics Training from Heterogeneous Data. Proc. Text, Speech and Dialog 2010. Lecture Notes in Computer Science. LNAI 6231. Brno: Springer Verlag, 2010. p. 322-329. ISBN: 978-3-642-15759-2. ISSN: 0302-9743. Detail

    SZŐKE, I.; ČERNOCKÝ, J.; FAPŠO, M.; ŽIŽKA, J. SPEECH@FIT LECTURE BROWSER. Proceedings of the 2010 IEEE Spoken Language Technology Workshop. IEEE Catalog Number: CFP 10SLT-USB. Berkeley, California: IEEE Signal Processing Society, 2010. p. 157-158. ISBN: 978-1-4244-7902-3. Detail

    SZŐKE, I.; GRÉZL, F.; ČERNOCKÝ, J.; FAPŠO, M. Acoustic keyword spotter - optimization from end-user perspective. Proceedings of the 2010 IEEE Spoken Language Technology Workshop. IEEE Catalog Number: CFP 10SLT-USB. Berkeley, California: IEEE Signal Processing Society, 2010. p. 177-181. ISBN: 978-1-4244-7902-3. Detail

    TEJEDOR, J.; SZŐKE, I.; FAPŠO, M. Novel Methods for Query Selection and Query Combination in Query-By-Example Spoken Term Detection. Proceedings of the ACM Multimedia 2010 International Conference. Copyright 2010 ACM 978-1-4503-0162-6/10/10. Florence: Association for Computing Machinery, 2010. p. 15-20. ISBN: 978-1-60558-933-6. Detail

    ŽIŽKA, J.; ČERNOCKÝ, J.; FAPŠO, M.; SZŐKE, I. Web-Based Lecture Browser with Speech Search. Znalosti 2010. Proceedings of the 9th Annual Conference. Jindřichův Hradec: Faculty of management and information, 2010. p. 287-290. ISBN: 978-80-245-1636-3. Detail

  • 2008

    PINTO, J.; SZŐKE, I.; PRASANNA, S.; HEŘMANSKÝ, H. Fast Approximate Spoken Term Detection from Sequence of Phonemes. The 31st Annual International ACM SIGIR Conference 20-24 July 2008, Singapore. Singapore: Association for Computing Machinery, 2008. p. 28-33. ISBN: 978-90-365-2697-5. Detail

    SZŐKE, I.; BURGET, L.; ČERNOCKÝ, J.; FAPŠO, M. Sub-word modeling of out of vocabulary words in spoken term detection. Proc. 2008 IEEE Workshop on Spoken Language Technology. Goa: IEEE Signal Processing Society, 2008. p. 1-4. ISBN: 978-1-4244-3472-5. Detail

    SZŐKE, I.; FAPŠO, M.; BURGET, L.; ČERNOCKÝ, J. Hybrid word-subword decoding for spoken term detection. Proc. SSCS 2008: Speech search workshop at SIGIR. Singapore: Association for Computing Machinery, 2008. p. 1-4. ISBN: 978-90-365-2697-5. Detail

  • 2007

    HUBEIKA, V.; SZŐKE, I.; BURGET, L.; ČERNOCKÝ, J. Maximum Likelihood and Maximum Mutual Information Training in Gender and Age Recognition System. In Proc. 10th International Conference on Text Speech and Dialogue (TSD 2007). Pilsen: Springer Verlag, 2007. p. 1-6. ISBN: 978-3-540-74627-0. Detail

    SZŐKE, I.; BURGET, L.; KARAFIÁT, M. Combination of Word and Phoneme Approach for Spoken Term Detection. Brno: 2007. p. 1 (1 p.). Detail

    SZŐKE, I.; FAPŠO, M.; KARAFIÁT, M.; BURGET, L.; GRÉZL, F.; SCHWARZ, P.; GLEMBEK, O.; MATĚJKA, P.; KOPECKÝ, J.; ČERNOCKÝ, J. Spoken Term Detection System Based on a Combination of LVCSR and Phonetic Search. Brno: 2007. p. 1 (1 p.). Detail

  • 2006

    FAPŠO, M.; SMRŽ, P.; SCHWARZ, P.; SZŐKE, I.; SCHWARZ, M.; ČERNOCKÝ, J.; KARAFIÁT, M.; BURGET, L. Information Retrieval from Spoken Documents. In Proceedings of the Seventh International Conference on Intelligent Text Processing and Computational Linguistics (CICLING 2006). Mexico City: Springer Verlag, 2006. p. 410-416. ISBN: 3-540-32205-1. Detail

    SZŐKE, I. Keyword Spotting in Meeting Data. In Proceedings of the 12th Conference Student EEICT 2006 Volume 4. Brno: Faculty of Electrical Engineering and Communication BUT, 2006. p. 440-444. ISBN: 80-214-3163-6. Detail

  • 2005

    FAPŠO, M.; SCHWARZ, P.; SZŐKE, I.; ČERNOCKÝ, J.; SMRŽ, P.; BURGET, L.; KARAFIÁT, M. Search Engine for Information Retrieval from Multi-modal Records. Edinburgh: 2005. Detail

    FAPŠO, M.; SMRŽ, P.; SCHWARZ, P.; SZŐKE, I.; BURGET, L.; KARAFIÁT, M.; ČERNOCKÝ, J. Systém pre efektívne vyhľadávanie v rečových databázach [A System for Efficient Search in Speech Databases]. In Sborník databázové konference DATAKON 2005. Brno: Masaryk University, 2005. p. 323-333. ISBN: 80-210-3813-6. Detail

    SZŐKE, I. Smooth Pitch Tracker Based on Harmonic and Noise Model. In STUDENT EEICT 2005. Brno: Faculty of Information Technology BUT, 2005. p. 673-677. ISBN: 80-214-2890-2. Detail

    SZŐKE, I.; SCHWARZ, P.; BURGET, L.; KARAFIÁT, M.; ČERNOCKÝ, J. Phoneme based acoustics keyword spotting in informal continuous speech. In Radioelektronika 2005. Brno: Faculty of Electrical Engineering and Communication BUT, 2005. p. 195-198. ISBN: 80-214-2904-6. Detail

    SZŐKE, I.; SCHWARZ, P.; BURGET, L.; KARAFIÁT, M.; MATĚJKA, P.; ČERNOCKÝ, J. Phoneme Based Acoustics Keyword Spotting in Informal Continuous Speech. Lecture Notes in Computer Science, 2005, vol. 2005, no. 3658, p. 302. ISSN: 0302-9743. Detail

  • 2004

    MATĚJKA, P.; SZŐKE, I.; SCHWARZ, P.; ČERNOCKÝ, J. Automatic Language Identification using Phoneme and Automatically Derived Unit Strings. Lecture Notes in Computer Science, 2004, vol. 2004, no. 3206, p. 147. ISSN: 0302-9743. Detail

    MATĚJKA, P.; SZŐKE, I.; SCHWARZ, P.; ČERNOCKÝ, J. Automatic Language Identification using Phoneme and Automatically Derived Unit Strings. Proceedings of 7th International Conference Text, Speech and Dialogue 2004. Brno: Springer Verlag, 2004. p. 147-154. ISBN: 3-540-23049-1. Detail

    SZŐKE, I. Speech units automatically generated by ergodic hidden Markov model. Proceedings of 10th Conference and Competition STUDENT EEICT 2004. Brno: Faculty of Electrical Engineering and Communication BUT, 2004. p. 1-5. Detail

  • 2003

    SZŐKE, I. Prosodic Modification of Synthetic Speech. Proceedings of The International Conference and Competition STUDENT EEICT 2003. Brno: Faculty of Electrical Engineering and Communication BUT, 2003. p. 349-352. ISBN: 80-214-2401-X. Detail
