Publications
2023
ZULUAGA-GOMEZ, J.; NIGMATULINA, I.; PRASAD, A.; MOTLÍČEK, P.; KHALIL, D.; MADIKERI, S.; TART, A.; SZŐKE, I.; LENDERS, V.; RIGAULT, M.; CHOUKRI, K. Lessons Learned in Transcribing 5000 h of Air Traffic Control Communications for Robust Automatic Speech Understanding. Aerospace, 2023, vol. 2023, no. 10, p. 1-33. ISSN: 2226-4310.
2022
BLATT, A.; KOCOUR, M.; VESELÝ, K.; SZŐKE, I.; KLAKOW, D. Call-Sign Recognition and Understanding for Noisy Air-Traffic Transcripts Using Surveillance Information. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022. p. 8357-8361. ISBN: 978-1-6654-0540-9.
KOCOUR, M.; UMESH, J.; KARAFIÁT, M.; ŠVEC, J.; LOPEZ, F.; BENEŠ, K.; DIEZ SÁNCHEZ, M.; SZŐKE, I.; LUQUE, J.; VESELÝ, K.; BURGET, L.; ČERNOCKÝ, J. BCN2BRNO: ASR System Fusion for Albayzin 2022 Speech to Text Challenge. Proceedings of IberSpeech 2022. Granada: International Speech Communication Association, 2022. p. 276-280.
2021
KOCOUR, M.; VESELÝ, K.; BLATT, A.; ZULUAGA-GOMEZ, J.; SZŐKE, I.; ČERNOCKÝ, J.; KLAKOW, D.; MOTLÍČEK, P. Boosting of Contextual Information in ASR for Air-Traffic Call-Sign Recognition. In Proceedings Interspeech 2021. Proceedings of Interspeech. Brno: International Speech Communication Association, 2021. p. 3301-3305. ISSN: 1990-9772.
KOCOUR, M.; VESELÝ, K.; SZŐKE, I.; KESIRAJU, S.; ZULUAGA-GOMEZ, J.; BLATT, A.; PRASAD, A.; NIGMATULINA, I.; MOTLÍČEK, P.; KLAKOW, D.; TART, A.; KOLČÁREK, P.; ČERNOCKÝ, J.; CEVENINI, C.; CHOUKRI, K.; RIGAULT, M.; LANDIS, F.; SARFJOO, S. Automatic Processing Pipeline for Collecting and Annotating Air-Traffic Voice Communication Data. In Proceedings of 9th OpenSky Symposium 2021, OpenSky Network, Brussels, Belgium. Proceedings. Brussels: MDPI, 2021. p. 1-10. ISSN: 2504-3900.
SZŐKE, I.; KESIRAJU, S.; NOVOTNÝ, O.; KOCOUR, M.; VESELÝ, K.; ČERNOCKÝ, J. Detecting English Speech in the Air Traffic Control Voice Communication. In Proceedings Interspeech 2021. Proceedings of Interspeech. Brno: International Speech Communication Association, 2021. p. 3286-3290. ISSN: 1990-9772.
ZULUAGA-GOMEZ, J.; NIGMATULINA, I.; PRASAD, A.; MOTLÍČEK, P.; VESELÝ, K.; KOCOUR, M.; SZŐKE, I. Contextual Semi-Supervised Learning: An Approach to Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems. In Proceedings Interspeech 2021. Proceedings of Interspeech. Brno: International Speech Communication Association, 2021. p. 3296-3300. ISSN: 1990-9772.
2020
ŽMOLÍKOVÁ, K.; KOCOUR, M.; LANDINI, F.; BENEŠ, K.; KARAFIÁT, M.; VYDANA, H.; LOZANO DÍEZ, A.; PLCHOT, O.; BASKAR, M.; ŠVEC, J.; MOŠNER, L.; MALENOVSKÝ, V.; BURGET, L.; YUSUF, B.; NOVOTNÝ, O.; GRÉZL, F.; SZŐKE, I.; ČERNOCKÝ, J. BUT System for CHiME-6 Challenge. Proceedings of CHiME 2020 Virtual Workshop. Barcelona: University of Sheffield, 2020. p. 1-3.
ZULUAGA-GOMEZ, J.; VESELÝ, K.; BLATT, A.; MOTLÍČEK, P.; KLAKOW, D.; TART, A.; SZŐKE, I.; PRASAD, A.; SARFJOO, S.; KOLČÁREK, P.; KOCOUR, M.; ČERNOCKÝ, J.; CEVENINI, C.; CHOUKRI, K.; RIGAULT, M.; LANDIS, F. Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications. Proceedings of the 8th OpenSky Symposium 2020. Proceedings. Brussels: MDPI, 2020. p. 1-10. ISSN: 2504-3900.
2019
SZŐKE, I.; SKÁCEL, M.; MOŠNER, L.; PALIESEK, J.; ČERNOCKÝ, J. Building and Evaluation of a Real Room Impulse Response Dataset. IEEE J-STSP, 2019, vol. 13, no. 4, p. 863-876. ISSN: 1932-4553.
2018
KARAFIÁT, M.; BASKAR, M.; SZŐKE, I.; MALENOVSKÝ, V.; VESELÝ, K.; GRÉZL, F.; BURGET, L.; ČERNOCKÝ, J. BUT OpenSAT 2017 speech recognition system. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018. p. 2638-2642. ISSN: 1990-9772.
VESELÝ, K.; PERALES, C.; SZŐKE, I.; LUQUE, J.; ČERNOCKÝ, J. Lightly supervised vs. semi-supervised training of acoustic model on Luxembourgish for low-resource automatic speech recognition. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018. p. 2883-2887. ISSN: 1990-9772.
2017
KARAFIÁT, M.; VESELÝ, K.; ŽMOLÍKOVÁ, K.; DELCROIX, M.; WATANABE, S.; BURGET, L.; ČERNOCKÝ, J.; SZŐKE, I. Training Data Augmentation and Data Selection. In New Era for Robust Speech Recognition: Exploiting Deep Learning. Computer Science, Artificial Intelligence. Heidelberg: Springer International Publishing, 2017. p. 245-260. ISBN: 978-3-319-64679-4.
2016
KESIRAJU, S.; BURGET, L.; SZŐKE, I.; ČERNOCKÝ, J. Learning document representations using subspace multinomial model. In Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016. p. 700-704. ISBN: 978-1-5108-3313-5.
SKÁCEL, M.; KARAFIÁT, M.; ONDEL YANG, L.; UCHYTIL, A.; SZŐKE, I. BUT Zero-Cost Speech Recognition 2016 System Description. In CEUR Workshop Proceedings. CEUR Workshop Proceedings. Hilversum: CEUR-WS.org, 2016. p. 1-3. ISSN: 1613-0073.
SZŐKE, I.; ANGUERA, X. Zero-Cost Speech Recognition Task at Mediaeval 2016. In CEUR Workshop Proceedings. CEUR Workshop Proceedings. Hilversum: CEUR-WS.org, 2016. p. 1-3. ISSN: 1613-0073.
2015
ANGUERA, X.; RODRIGUEZ-FUENTES, L.; BUZO, A.; METZE, F.; SZŐKE, I.; PENAGARIKANO, M. QUESST 2014: Evaluating Query-By-Example Speech Search in a Zero-Resource. In Proceedings of 2015 IEEE International Conference on Acoustics, Speech and Signal Processing. South Brisbane, Queensland: IEEE Signal Processing Society, 2015. p. 5833-5837. ISBN: 978-1-4673-6997-8.
HSIAO, R.; MA, J.; HARTMANN, W.; KARAFIÁT, M.; GRÉZL, F.; BURGET, L.; SZŐKE, I.; ČERNOCKÝ, J.; WATANABE, S.; CHEN, Z.; MALLIDI, S.; HEŘMANSKÝ, H.; TSAKALIDIS, S.; SCHWARTZ, R. Robust Speech Recognition in Unknown Reverberant and Noisy Conditions. In Proceedings of 2015 IEEE Automatic Speech Recognition and Understanding Workshop. Scottsdale, Arizona: IEEE Signal Processing Society, 2015. p. 533-538. ISBN: 978-1-4799-7291-3.
KARAFIÁT, M.; GRÉZL, F.; BURGET, L.; SZŐKE, I.; ČERNOCKÝ, J. Three ways to adapt a CTS recognizer to unseen reverberated speech in BUT system for the ASpIRE challenge. In Proceedings of Interspeech 2015. Proceedings of Interspeech. Dresden: International Speech Communication Association, 2015. p. 2454-2458. ISBN: 978-1-5108-1790-6. ISSN: 1990-9772.
SKÁCEL, M.; SZŐKE, I. BUT QUESST 2015 System Description. In CEUR Workshop Proceedings. CEUR Workshop Proceedings. Wurzen: CEUR-WS.org, 2015. p. 1-3. ISSN: 1613-0073.
SZŐKE, I.; METZE, F.; RODRIGUEZ-FUENTES, L.; PROENCA, J.; BUZO, A.; LOJKA, M.; ANGUERA, X.; XIONG, X. Query by Example Search on Speech at Mediaeval 2015. In CEUR Workshop Proceedings. CEUR Workshop Proceedings. Wurzen: CEUR-WS.org, 2015. p. 1-3. ISSN: 1613-0073.
SZŐKE, I.; SKÁCEL, M.; ČERNOCKÝ, J.; BURGET, L. Coping with Channel Mismatch in Query-By-Example - BUT QUESST 2014. In Proceedings of 2015 IEEE International Conference on Acoustics, Speech and Signal Processing. South Brisbane, Queensland: IEEE Signal Processing Society, 2015. p. 5838-5842. ISBN: 978-1-4673-6997-8.
2014
ANGUERA, X.; RODRIGUEZ-FUENTES, L.; SZŐKE, I.; BUZO, A.; METZE, F. Query-by-example Spoken Term Detection Evaluation on Low-resource Languages. Proceedings of the 4th International Workshop on Spoken Language Technologies for Under-resourced Languages SLTU-2014. St. Petersburg, Russia. St. Petersburg: International Speech Communication Association, 2014. p. 24-31. ISBN: 978-5-8088-0908-6.
ANGUERA, X.; RODRIGUEZ-FUENTES, L.; SZŐKE, I.; BUZO, A.; METZE, F. Query by Example Search on Speech at Mediaeval 2014. In CEUR Workshop Proceedings. CEUR Workshop Proceedings. Barcelona: CEUR-WS.org, 2014. p. 1-2. ISSN: 1613-0073.
KARAFIÁT, M.; GRÉZL, F.; VESELÝ, K.; HANNEMANN, M.; SZŐKE, I.; ČERNOCKÝ, J. BUT 2014 Babel System: Analysis of adaptation in NN based systems. In Proceedings of Interspeech 2014. Singapore: International Speech Communication Association, 2014. p. 3002-3006. ISBN: 978-1-63439-435-2.
KARAFIÁT, M.; VESELÝ, K.; SZŐKE, I.; BURGET, L.; GRÉZL, F.; HANNEMANN, M.; ČERNOCKÝ, J. BUT ASR System for BABEL Surprise Evaluation 2014. In Proceedings of 2014 Spoken Language Technology Workshop. South Lake Tahoe, Nevada: IEEE Signal Processing Society, 2014. p. 501-506. ISBN: 978-1-4799-7129-9.
NG, T.; HSIAO, R.; ZHANG, L.; KARAKOS, D.; MALLIDI, S.; KARAFIÁT, M.; VESELÝ, K.; SZŐKE, I.; ZHANG, B.; NGUYEN, L.; SCHWARTZ, R. Progress in the BBN Keyword Search System for the DARPA RATS Program. In Proceedings of Interspeech 2014. Singapore: International Speech Communication Association, 2014. p. 959-963. ISBN: 978-1-63439-435-2.
SZŐKE, I.; BURGET, L.; GRÉZL, F.; ČERNOCKÝ, J.; ONDEL YANG, L. Calibration and Fusion of Query-by-example Systems - BUT SWS 2013. In Proceedings of ICASSP 2014. Florence: IEEE Signal Processing Society, 2014. p. 7899-7903. ISBN: 978-1-4799-2892-7.
SZŐKE, I.; SKÁCEL, M.; BURGET, L. BUT QUESST 2014 System Description. In CEUR Workshop Proceedings. CEUR Workshop Proceedings. Barcelona: CEUR-WS.org, 2014. p. 1-2. ISSN: 1613-0073.
2013
ANGUERA, X.; METZE, F.; BUZO, A.; SZŐKE, I.; RODRIGUEZ-FUENTES, L. The Spoken Web Search Task. CEUR Workshop Proceedings. CEUR Workshop Proceedings. Barcelona: CEUR-WS.org, 2013. p. 1-2. ISSN: 1613-0073.
KARAKOS, D.; SCHWARTZ, R.; TSAKALIDIS, S.; ZHANG, L.; RANJAN, S.; NG, T.; HSIAO, R.; NGUYEN, L.; GRÉZL, F.; HANNEMANN, M.; KARAFIÁT, M.; SZŐKE, I.; VESELÝ, K. Score Normalization and System Combination for Improved Keyword Spotting. In Proceedings of ASRU 2013. Olomouc: IEEE Signal Processing Society, 2013. p. 210-215. ISBN: 978-1-4799-2755-5.
SZŐKE, I.; BURGET, L.; GRÉZL, F.; ONDEL YANG, L. BUT SWS 2013 - Massive Parallel Approach. In Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop. CEUR Workshop Proceedings. Barcelona: CEUR-WS.org, 2013. p. 1-2. ISSN: 1613-0073.
2012
METZE, F.; RAJPUT, N.; ANGUERA, X.; DAVEL, M.; GRAVIER, G.; HEERDEN, C.; MANTENA, G.; MUSCARIELLO, A.; PRAHALLAD, K.; SZŐKE, I.; TEJEDOR, J. The Spoken WEB Search Task At Mediaeval 2011. Proc. International Conference on Acoustics, Speech, and Signal P. Kyoto: IEEE Signal Processing Society, 2012. p. 5165-5168. ISBN: 978-1-4673-0044-5.
MOTLÍČEK, P.; VALENTE, F.; SZŐKE, I. Improving Acoustic Based Keyword Spotting Using LVCSR Lattices. Proc. International Conference on Acoustics, Speech, and Signal Processing 2012. Kyoto: IEEE Signal Processing Society, 2012. p. 4413-4416. ISBN: 978-1-4673-0044-5.
SZŐKE, I.; FAPŠO, M.; VESELÝ, K. BUT2012 přístup pro Spoken Web Search úkol na MediaEval2012. Working Notes Proceedings of the MediaEval 2012 Workshop. CEUR Workshop Proceedings. Pisa: CEUR-WS.org, 2012. p. 1-2. ISSN: 1613-0073.
SZŐKE, I.; FAPŠO, M.; ŽIŽKA, J.; BERAN, V.; ČERNOCKÝ, J. Efektivní přístup ke znalostem v audio-vizuálních záznamech. Proceedings of the Annual Database Conference. Prague: Technická univerzita v Košiciach, 2012. p. 57-74. ISBN: 978-80-553-1049-7.
TEJEDOR, J.; FAPŠO, M.; SZŐKE, I.; ČERNOCKÝ, J.; GRÉZL, F. Comparison of methods for language-dependent and language-independent query-by-example spoken term detection. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2012, vol. 2012, no. 30, p. 1-34. ISSN: 1046-8188.
2010
ČERNOCKÝ, J.; SZŐKE, I.; HANNEMANN, M.; KOMBRINK, S. Word-subword based keyword spotting with implications in OOV detection. Pacific Grove: Institute of Electrical and Electronics Engineers, 2010. p. 0-0.
KARAFIÁT, M.; SZŐKE, I.; ČERNOCKÝ, J. Using Gradient Descent Optimization for Acoustics Training from Heterogeneous Data. Proc. Text, Speech and Dialog 2010. Lecture Notes in Computer Science. LNAI 6231. Brno: Springer Verlag, 2010. p. 322-329. ISBN: 978-3-642-15759-2. ISSN: 0302-9743.
SZŐKE, I.; ČERNOCKÝ, J.; FAPŠO, M.; ŽIŽKA, J. SPEECH@FIT LECTURE BROWSER. Proceedings of the 2010 IEEE Spoken Language Technology Workshop. IEEE Catalog Number: CFP 10SLT-USB. Berkeley, California: IEEE Signal Processing Society, 2010. p. 157-158. ISBN: 978-1-4244-7902-3.
SZŐKE, I.; GRÉZL, F.; ČERNOCKÝ, J.; FAPŠO, M. Acoustic keyword spotter - optimization from end-user perspective. Proceedings of the 2010 IEEE Spoken Language Technology Workshop. IEEE Catalog Number: CFP 10SLT-USB. Berkeley, California: IEEE Signal Processing Society, 2010. p. 177-181. ISBN: 978-1-4244-7902-3.
TEJEDOR, J.; SZŐKE, I.; FAPŠO, M. Novel Methods for Query Selection and Query Combination in Query-By-Example Spoken Term Detection. Proceedings of the ACM Multimedia 2010 International Conference. Copyright 2010 ACM 978-1-4503-0162-6/10/10. Florence: Association for Computing Machinery, 2010. p. 15-20. ISBN: 978-1-60558-933-6.
ŽIŽKA, J.; ČERNOCKÝ, J.; FAPŠO, M.; SZŐKE, I. Web-Based Lecture Browser with Speech Search. Znalosti 2010. Sborník příspěvků 9. ročníku konference. Jindřichův Hradec: Faculty of management and information, 2010. p. 287-290. ISBN: 978-80-245-1636-3.
2008
PINTO, J.; SZŐKE, I.; PRASANNA, S.; HEŘMANSKÝ, H. Fast Approximate Spoken Term Detection from Sequence of Phonemes. The 31st Annual International ACM SIGIR Conference 20-24 July 2008, Singapore. Singapore: Association for Computing Machinery, 2008. p. 28-33. ISBN: 978-90-365-2697-5.
SZŐKE, I.; BURGET, L.; ČERNOCKÝ, J.; FAPŠO, M. Sub-word modeling of out of vocabulary words in spoken term detection. Proc. 2008 IEEE Workshop on Spoken Language Technology. Goa: IEEE Signal Processing Society, 2008. p. 1-4. ISBN: 978-1-4244-3472-5.
SZŐKE, I.; FAPŠO, M.; BURGET, L.; ČERNOCKÝ, J. Hybrid word-subword decoding for spoken term detection. Proc. SSCS 2008: Speech search workshop at SIGIR. Singapore: Association for Computing Machinery, 2008. p. 1-4. ISBN: 978-90-365-2697-5.
2007
HUBEIKA, V.; SZŐKE, I.; BURGET, L.; ČERNOCKÝ, J. Maximum Likelihood and Maximum Mutual Information Training in Gender and Age Recognition System. Proc. 10th International Conference on Text Speech and Dialogue (TSD 2007). Pilsen: Springer Verlag, 2007. p. 1-6. ISBN: 978-3-540-74627-0.
SZŐKE, I.; BURGET, L.; KARAFIÁT, M. Combination of Word and Phoneme Approach for Spoken Term Detection. Brno: 2007. p. 1 (1 p.).
SZŐKE, I.; FAPŠO, M.; KARAFIÁT, M.; BURGET, L.; GRÉZL, F.; SCHWARZ, P.; GLEMBEK, O.; MATĚJKA, P.; KOPECKÝ, J.; ČERNOCKÝ, J. Spoken Term Detection System Based on a Combination of LVCSR and Phonetic Search. Brno: 2007. p. 1 (1 p.).
2006
FAPŠO, M.; SMRŽ, P.; SCHWARZ, P.; SZŐKE, I.; SCHWARZ, M.; ČERNOCKÝ, J.; KARAFIÁT, M.; BURGET, L. Information Retrieval from Spoken Documents. Proceedings of the Seventh International Conference on Intelligent Text Processing and Computational Linguistics (CICLING 2006). Mexico City: Springer Verlag, 2006. p. 410-416. ISBN: 3-540-32205-1.
SZŐKE, I. Keyword Spotting in Meeting Data. Proceedings of the 12th Conference Student EEICT 2006 Volume 4. Brno: Faculty of Electrical Engineering and Communication BUT, 2006. p. 440-444. ISBN: 80-214-3163-6.
2005
FAPŠO, M.; SCHWARZ, P.; SZŐKE, I.; ČERNOCKÝ, J.; SMRŽ, P.; BURGET, L.; KARAFIÁT, M. Search Engine for Information Retrieval from Multi-modal Records. Edinburgh: 2005. p. 0-0.
FAPŠO, M.; SMRŽ, P.; SCHWARZ, P.; SZŐKE, I.; BURGET, L.; KARAFIÁT, M.; ČERNOCKÝ, J. Systém pre efektívne vyhľadávanie v rečových databázach. Sborník databázové konference DATAKON 2005. Brno: Masaryk University, 2005. p. 323-333. ISBN: 80-210-3813-6.
SZŐKE, I. Smooth Pitch Tracker Based on Harmonic and Noise Model. STUDENT EEICT 2005. Brno: Faculty of Information Technology BUT, 2005. p. 673-677. ISBN: 80-214-2890-2.
SZŐKE, I.; SCHWARZ, P.; BURGET, L.; KARAFIÁT, M.; ČERNOCKÝ, J. Phoneme based acoustics keyword spotting in informal continuous speech. Radioelektronika 2005. Brno: Faculty of Electrical Engineering and Communication BUT, 2005. p. 195-198. ISBN: 80-214-2904-6.
SZŐKE, I.; SCHWARZ, P.; BURGET, L.; KARAFIÁT, M.; MATĚJKA, P.; ČERNOCKÝ, J. Phoneme Based Acoustics Keyword Spotting in Informal Continuous Speech. Lecture Notes in Computer Science, 2005, vol. 2005, no. 3658, p. 302-309. ISSN: 0302-9743.
2004
MATĚJKA, P.; SZŐKE, I.; SCHWARZ, P.; ČERNOCKÝ, J. Automatic Language Identification using Phoneme and Automatically Derived Unit Strings. Proceedings of 7th International Conference Text, Speech and Dialogue 2004. Brno: Springer Verlag, 2004. p. 147-154. ISBN: 3-540-23049-1.
MATĚJKA, P.; SZŐKE, I.; SCHWARZ, P.; ČERNOCKÝ, J. Automatic Language Identification using Phoneme and Automatically Derived Unit Strings. Lecture Notes in Computer Science, 2004, vol. 2004, no. 3206, p. 147-154. ISSN: 0302-9743.
SZŐKE, I. Speech units automatically generated by ergodic hidden Markov model. Proceedings of 10th Conference and Competition STUDENT EEICT 2004. Brno: Faculty of Electrical Engineering and Communication BUT, 2004. p. 1-5.
SZŐKE, I.; MOTLÍČEK, P. Kódování řeči na velmi nízkých bitových rychlostech. GACR 102/02/0124 "Hlasové technologie v podpoře informační společnosti", souhrnný přehled aktivit řešitelských klektivů. Prague: Fakulta elektrotechniky ČVUT, 2004. p. 0-0. ISBN: 80-01-02957-3.
2003
SZŐKE, I. Prosodic Modification of Synthetic Speech. Proceedings of The International Conference and Competition STUDENT EEICT 2003. Brno: Faculty of Electrical Engineering and Communication BUT, 2003. p. 349-352. ISBN: 80-214-2401-X.
SZŐKE, I. Změny prozodie syntetické řeči. Proceedings of 9th Conference and Competition STUDENT EEICT 2003. Brno: Faculty of Electrical Engineering and Communication BUT, 2003. p. 252-254. ISBN: 80-214-2377-3.