Publikace
-
2023
BHATTACHARJEE, M.; MOTLÍČEK, P.; NIGMATULINA, I.; HELMKE, H.; OHNEISER, O.; KLEINERT, M.; EHR, H. Customization of Automatic Speech Recognition Engines for Rare Word Detection Without Costly Model Re-Training. Proceedings of the 13th SESAR Innovation Days. Seville: SESAR Joint Undertaking, 2023.
p. 1-8. DetailBURDISSO, S.; VILLATORO-TELLO, E.; MADIKERI, S.; MOTLÍČEK, P. Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews. In Proceedings of the Annual Conference of International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Dublin: International Speech Communication Association, 2023.
p. 3617-3621. ISSN: 1990-9772. DetailFAJČÍK, M.; MOTLÍČEK, P.; SMRŽ, P. Claim-Dissector: An Interpretable Fact-Checking System with Joint Re-ranking and Veracity Prediction. In Findings of the Association for Computational Linguistics: ACL 2023. ACL. Toronto: Association for Computational Linguistics, 2023.
p. 10184-10205. ISBN: 978-1-959429-62-3. DetailHELMKE, H.; KLEINERT, M.; AHRENHOLD, N.; EHR, H.; MÜHLHAUSEN, T.; PINSKA, E.; OHNEISER, O.; KLAMERT, L.; MOTLÍČEK, P.; PRASAD, A.; ZULUAGA-GOMEZ, J.; DOKIC, J. Automatic Speech Recognition and Understanding for Radar Label Maintenance Support Increases Safety and Reduces Air Traffic Controllers' Workload. Proceedings of ATM Seminar. Savannah, Georgia: EUROPEAN ORGANISATION FOR THE SAFETY OF AIR NAVIGATION, 2023.
p. 1-11. DetailKHALIL, D.; PRASAD, A.; MOTLÍČEK, P.; ZULUAGA-GOMEZ, J.; NIGMATULINA, I.; MADIKERI, S.; SCHUEPBACH, C. An Automatic Speaker Clustering Pipeline for the Air Traffic Communication Domain. Aerospace, 2023, vol. 10, no. 10,
p. 1-14. ISSN: 2226-4310. DetailMAI, F.; ZULUAGA-GOMEZ, J.; PARCOLLET, T.; MOTLÍČEK, P. HyperConformer: Multi-head HyperMixer for Efficient Speech Recognition. In Proceedings of the Annual Conference of International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Dublin: International Speech Communication Association, 2023.
p. 2213-2217. ISSN: 1990-9772. DetailMOTLÍČEK, P.; PRASAD, A.; NIGMATULINA, I.; HELMKE, H.; OHNEISER, O.; KLEINERT, M. Automatic Speech Analysis Framework for ATC Communication in HAAWAII. Proceedings of the 13th SESAR Innovation Days. Seville: SESAR Joint Undertaking, 2023.
p. 1-9. DetailNIGMATULINA, I.; MADIKERI, S.; VILLATORO-TELLO, E.; MOTLÍČEK, P.; ZULUAGA-GOMEZ, J.; PANDIA, K.; GANAPATHIRAJU, A. Implementing contextual biasing in GPU decoder for online ASR. In Proceedings of the Annual Conference of International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Dublin: International Speech Communication Association, 2023.
p. 4494-4498. ISSN: 1990-9772. DetailVANDERREYDT, G.; PRASAD, A.; KHALIL, D.; MADIKERI, S.; DEMUYNCK, K.; MOTLÍČEK, P. Parameter-Efficient Tuning With Adaptive Bottlenecks For Automatic Speech Recognition. Proceedings of IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). Taipei: IEEE Signal Processing Society, 2023.
p. 1-7. ISBN: 979-8-3503-0689-7. DetailVILLATORO-TELLO, E.; MADIKERI, S.; ZULUAGA-GOMEZ, J.; SHARMA, B.; SARFJOO, S.; NIGMATULINA, I.; MOTLÍČEK, P.; IVANOV, V.; GANAPATHIRAJU, A. Effectiveness of Text, Acoustic, and Lattice-Based Representations in Spoken Language Understanding Tasks. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Rhodes Island: IEEE Signal Processing Society, 2023.
p. 1-5. ISBN: 978-1-7281-6327-7. DetailZULUAGA-GOMEZ, J.; NIGMATULINA, I.; PRASAD, A.; MOTLÍČEK, P.; KHALIL, D.; MADIKERI, S.; TART, A.; SZŐKE, I.; LENDERS, V.; RIGAULT, M.; CHOUKRI, K. Lessons Learned in Transcribing 5000 h of Air Traffic Control Communications for Robust Automatic Speech Understanding. Aerospace, 2023, vol. 2023, no. 10,
p. 1-33. ISSN: 2226-4310. DetailZULUAGA-GOMEZ, J.; PRASAD, A.; NIGMATULINA, I.; MOTLÍČEK, P.; KLEINERT, M.;. A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers. Aerospace, 2023, vol. 10, no. 5,
p. 1-25. ISSN: 2226-4310. DetailZULUAGA-GOMEZ, J.; PRASAD, A.; NIGMATULINA, I.; SARFJOO, S.; MOTLÍČEK, P.; KLEINERT, M.; HELMKE, H.; OHNEISER, O.; ZHAN, Q. How Does Pre-Trained Wav2Vec 2.0 Perform on Domain-Shifted ASR? an Extensive Benchmark on Air Traffic Control Communications. In IEEE Spoken Language Technology Workshop, SLT 2022 - Proceedings. Doha: IEEE Signal Processing Society, 2023.
p. 205-212. ISBN: 978-1-6654-7189-3. DetailZULUAGA-GOMEZ, J.; SARFJOO, S.; PRASAD, A.; NIGMATULINA, I.; MOTLÍČEK, P.; ONDŘEJ, K.; OHNEISER, O.; HELMKE, H. BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications. In IEEE Spoken Language Technology Workshop, SLT 2022 - Proceedings. Doha: IEEE Signal Processing Society, 2023.
p. 633-640. ISBN: 978-1-6654-7189-3. Detail -
2022
BURDISSO, S.; FAJČÍK, M.; SMRŽ, P.; MOTLÍČEK, P. IDIAPers @ Causal News Corpus 2022: Efficient Causal Relation Identification Through a Prompt-based Few-shot Approach. In Proceedings of the 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2022). Abu Dhabi: Association for Computational Linguistics, 2022.
p. 61-69. ISBN: 978-1-959429-05-0. DetailFAJČÍK, M.; SMRŽ, P.; MOTLÍČEK, P.; BURDISSO, S. IDIAPers @ Causal News Corpus 2022: Extracting Cause-Effect-Signal Triplets via Pre-trained Autoregressive Language Model. In Proceedings of the 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2022). Abu Dhabi: Association for Computational Linguistics, 2022.
p. 70-78. ISBN: 978-1-959429-05-0. DetailNIGMATULINA, I.; ZULUAGA-GOMEZ, J.; PRASAD, A.; SARFJOO, S.; MOTLÍČEK, P. A Two-Step Approach to Leverage Contextual Data: Speech Recognition in Air-Traffic Communications. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022.
p. 6282-6286. ISBN: 978-1-6654-0540-9. DetailPRASAD, A.; ZULUAGA-GOMEZ, J.; MOTLÍČEK, P.; SARFJOO, S.; NIGMATULINA, I.; OHNEISER, O.; HELMKE, H. Grammar Based Speaker Role Identification for Air Traffic Control Speech Recognition. Proceedings of the 12th SESAR Innovation Days. Budapest: 2022.
p. 1-9. DetailPRASAD, A.; ZULUAGA-GOMEZ, J.; MOTLÍČEK, P.; SARFJOO, S.; NIGMATULINA, I.; VESELÝ, K. Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator. Proceedings of the 12th SESAR Innovation Days. Budapest: 2022.
p. 1-9. Detail -
2021
HELMKE, H.; KLEINERT, M.; SHETTY, S.; OHNEISER, O.; EHR, H.; PRASAD, A.; MOTLÍČEK, P.; VESELÝ, K.; ONDŘEJ, K.; SMRŽ, P.; HARFMANN, J.; WINDISCH, C. Readback Error Detection by Automatic Speech Recognition to Increase ATM Safety. In Proceedings of ATM Seminar. on-line: EUROPEAN ORGANISATION FOR THE SAFETY OF AIR NAVIGATION, 2021.
p. 1-10. DetailHELMKE, H.; SHETTY, S.; KLEINERT, M.; OHNEISER, O.; EHR, H.; MOTLÍČEK, P.; PRASAD, A.; WINDISCH, C. Measuring Speech Recognition And Understanding Performance in Air Traffic Control Domain Beyond Word Error Rates. Proceedings of 11th SESAR Innovation Days 2021. Belgie: 2021.
p. 1-8. DetailKLEINERT, M.; HELMKE, H.; SHETTY, S.; OHNEISER, O.; EHR, H.; PRASAD, A.; MOTLÍČEK, P.; HARFMANN, J. Automated Interpretation of Air Traffic Control Communication: The Journey from Spoken Words to a Deeper Understanding of the Meaning. In Proceedings of DASC 2021. San Antonio, Texas: Institute of Electrical and Electronics Engineers, 2021.
p. 1-9. ISBN: 978-1-6654-3420-1. DetailKOCOUR, M.; VESELÝ, K.; BLATT, A.; ZULUAGA-GOMEZ, J.; SZŐKE, I.; ČERNOCKÝ, J.; KLAKOW, D.; MOTLÍČEK, P. Boosting of Contextual Information in ASR for Air-Traffic Call-Sign Recognition. In Proceedings Interspeech 2021. Proceedings of Interspeech. Brno: International Speech Communication Association, 2021.
p. 3301-3305. ISSN: 1990-9772. DetailKOCOUR, M.; VESELÝ, K.; SZŐKE, I.; KESIRAJU, S.; ZULUAGA-GOMEZ, J.; BLATT, A.; PRASAD, A.; NIGMATULINA, I.; MOTLÍČEK, P.; KLAKOW, D.; TART, A.; KOLČÁREK, P.; ČERNOCKÝ, J.; CEVENINI, C.; CHOUKRI, K.; RIGAULT, M.; LANDIS, F.; SARFJOO, S. Automatic Processing Pipeline for Collecting and Annotating Air-Traffic Voice Communication Data. In Proceedings of 9th OpenSky Symposium 2021, OpenSky Network, Brussels, Belgium. Proceedings. Brussels: MDPI, 2021.
p. 1-10. ISSN: 2504-3900. DetailZULUAGA-GOMEZ, J.; NIGMATULINA, I.; PRASAD, A.; MOTLÍČEK, P.; VESELÝ, K.; KOCOUR, M.; SZŐKE, I. Contextual Semi-Supervised Learning: An Approach to Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems. In Proceedings Interspeech 2021. Proceedings of Interspeech. Brno: International Speech Communication Association, 2021.
p. 3296-3300. ISSN: 1990-9772. Detail -
2020
ZULUAGA-GOMEZ, J.; MOTLÍČEK, P.; ZHAN, Q.; VESELÝ, K.; BRAUN, R. Automatic Speech Recognition Benchmark for Air-Traffic Communications. In Proceedings of Interspeech 2020. Proceedings of Interspeech. Shanghai: International Speech Communication Association, 2020.
p. 2297-2301. ISSN: 1990-9772. DetailZULUAGA-GOMEZ, J.; VESELÝ, K.; BLATT, A.; MOTLÍČEK, P.; KLAKOW, D.; TART, A.; SZŐKE, I.; PRASAD, A.; SARFJOO, S.; KOLČÁREK, P.; KOCOUR, M.; ČERNOCKÝ, J.; CEVENINI, C.; CHOUKRI, K.; RIGAULT, M.; LANDIS, F. Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications. Proceedings of the 8th OpenSky Symposium 2020. Proceedings. Brusel: MDPI, 2020.
p. 1-10. ISSN: 2504-3900. Detail -
2015
MOTLÍČEK, P.; DEY, S.; MADIKERI, S.; BURGET, L. Employment of Subspace Gaussian Mixture Models in Speaker Recognition. In Proceedings of 2015 IEEE International Conference on Acoustics, Speech and Signal Processing. South Brisbane, Queensland: IEEE Signal Processing Society, 2015.
p. 4445-4449. ISBN: 978-1-4673-6997-8. Detail -
2013
MOTLÍČEK, P.; POVEY, D.; KARAFIÁT, M. Feature And Score Level Combination Of Subspace Gaussians In LVCSR Task. Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013.
p. 7604-7608. ISBN: 978-1-4799-0355-9. Detail -
2012
MOTLÍČEK, P.; VALENTE, F.; SZŐKE, I. Improving Acoustic Based Keyword Spotting Using LVCSR Lattices. Proc. International Conference on Acoustics, Speech, and Signal Processing 2012. Kyoto: IEEE Signal Processing Society, 2012.
p. 4413-4416. ISBN: 978-1-4673-0044-5. DetailPOVEY, D.; HANNEMANN, M.; BOULIANNE, G.; BURGET, L.; GHOSHAL, A.; JANDA, M.; KARAFIÁT, M.; KOMBRINK, S.; MOTLÍČEK, P.; QIAN, Y.; RIEDHAMMER, K.; VESELÝ, K.; VU, N. Generating Exact Lattices in The WFST Framework. Proceedings of 2012 IEEE International Conference on Acoustics, Speech and Signal Processing. Kyoto: IEEE Signal Processing Society, 2012.
p. 4213-4216. ISBN: 978-1-4673-0044-5. Detail -
2011
POVEY, D.; GHOSHAL, A.; BOULIANNE, G.; BURGET, L.; GLEMBEK, O.; GOEL, N.; HANNEMANN, M.; MOTLÍČEK, P.; QIAN, Y.; SCHWARZ, P.; SILOVSKÝ, J.; STEMMER, G.; VESELÝ, K. The Kaldi Speech Recognition Toolkit. Proceedings of ASRU 2011. Hilton Waikoloa Village Resort, Hawaii: IEEE Signal Processing Society, 2011.
p. 1-4. ISBN: 978-1-4673-0366-8. Detail -
2005
MOTLÍČEK, P., BURGET, L., ČERNOCKÝ, J. VISUAL FEATURES FOR MULTIMODAL SPEECH RECOGNITION. In Radioelektronika 2005. Brno: Faculty of Electrical Engineering and Communication BUT, 2005.
p. 187-190. ISBN: 80-214-2904-6. Detail -
2004
MOTLÍČEK, P. Visual Feature Extreaction for Phoneme Recognition of Meetings. Brno: Department of Computer Graphics and Multimedia FIT BUT, 2004. Detail
MOTLÍČEK, P., ČERNOCKÝ, J. Multimodal Phoneme Recognition of Meeting Data. Lecture Notes in Computer Science, 2004, vol. 2004, no. 3206,
p. 379 ( p.) ISSN: 0302-9743. DetailMOTLÍČEK, P., ČERNOCKÝ, J. Multimodal Phoneme Recognition of Meeting Data. In 7th International Conference, TSD 2004 Brno, Czech Republic, September 2004 Proceedings. Lecture Notes in Computer Science. Brno: Springer Verlag, 2004.
p. 379-384. ISBN: 3-540-23049-1. ISSN: 0302-9743. Detail -
2003
MOTLÍČEK, P. Derivation of TRAPs in Auditory Domain. Proceedings of 9th Conference and Competition STUDENT EEICT 2003. Brno: Dean Office of FEEC BUT, 2003.
p. 598-602. ISBN: 80-214-2379-X. DetailMOTLÍČEK, P. Modeling of Spectra and Temporal Trajectories in Speech Processing. Sborník příspěvků a prezentací akce Odborné semináře 2003. REL02V. Brno: Department of Radioelectronics FEEC BUT, 2003.
s. 0-0. DetailMOTLÍČEK, P.; ČERNOCKÝ, J. All-Pole Modeling for Definition of Speech Features in Aurora3 DSR Task. 6th International Conference, TSD 2003 České Budějovice, Czech Republic, September 2003 Proceedings. Lecture Notes in Computer Science. České Budějovice: University of West Bohemia in Pilsen, 2003.
p. 295-300. ISBN: 3-540-20024-X. ISSN: 0302-9743. DetailMOTLÍČEK, P.; ČERNOCKÝ, J. Autoregressive Modeling based Feature Extraction for Aurora3 DSR Task. Proc. EUROSPEECH 2003. European Conference EUROSPEECH. Geneva: Institute for Perceptual Artificial Intelligence, 2003.
p. 1801-1804. ISSN: 1018-4074. DetailMOTLÍČEK, P.; ČERNOCKÝ, J. Time-domain based Temporal Processing with Application of. Proc. EUROSPEECH 2003. European Conference EUROSPEECH. Geneva: Institute for Perceptual Artificial Intelligence, 2003.
p. 821-824. ISSN: 1018-4074. Detail -
2002
BURGET, L.; MOTLÍČEK, P.; GRÉZL, F.; JAIN, P. Distributed speech recognition. Radioengineering, 2002, vol. 2002, no. 4,
p. 12-16. ISSN: 1210-2512. DetailGARUDADRI, H.; HEŘMANSKÝ, H.; MORGAN, N.; BENITEZ, C.; BURGET, L.; KAJAREKAR, S.; GRÉZL, F.; JAIN, P.; MOTLÍČEK, P. Distributed Voice Recognition System Utilizing Multistream Network Feature Processing. San Diego: Qualcomm, 2002.
p. 0-0. DetailMOTLÍČEK, P. Application of Mel-scale Filter bank for Noise Estimation in Speech Processing. 12th International Czech-Slovak Scientific conference Radioelektronika 2002. Bratislava: Slovak University of Technology in Bratislava, 2002.
p. 1-4. ISBN: 80-227-1700-2. DetailMOTLÍČEK, P.; BURGET, L. Efficient Noise Estimation and its Application for Robust Speech Recognition. 5th International Conference, TSD 2002 Brno, Czech Republic, September 2002 Proceedings. Berlin: Springer Verlag, 2002.
p. 229-236. ISBN: 3-540-44129-8. DetailMOTLÍČEK, P.; BURGET, L. Noise estimation for efficient speech enhancement and robust speech recognition. Proc. 7th International Conference on Spoken Language Processing. Denver: International Speech Communication Association, 2002.
p. 1033-1036. ISBN: 1-876346-42-6. Detail -
2001
MOTLÍČEK, P. Application of Re-segmentation in Very Low Bit Rate Speech Coding. Proceedings of 7th Conference STUDENT EEICT 2001. Brno: Faculty of Electrical Engineering and Communication BUT, 2001.
p. 269-274. ISBN: 80-214-1860-5. DetailMOTLÍČEK, P.; BAUDOIN, G.; ČERNOCKÝ, J.; CHOLLET, G. Minimization of transition noise and HNM synthesis in very low bit rate speech coding. 4th International Conference, TSD 2001 Železná Ruda, Czech Republic, September 2001 Proceedings. Berlin: Springer Verlag, 2001.
p. 305-312. ISBN: 3-540-42557-8. DetailMOTLÍČEK, P.; ČERNOCKÝ, J. Minimization of transition noise in very low bit rate speech coding. Proc. Radioelektronika 2001. Brno: Faculty of Electrical Engineering and Computer Science BUT, 2001.
p. 396-399. ISBN: 80-214-1861-3. DetailMOTLÍČEK, P.; ČERNOCKÝ, J.; BAUDOIN, G. Diphone-like units for very low bit rate speech coder. Proceeding of Internal Conference on Acoustics, Speech, and Signal Processing. Student session. Salt Lake City: 2001.
p. 48-48. ISBN: 0-7803-7041-4. DetailMOTLÍČEK, P.; ČERNOCKÝ, J.; BAUDOIN, G. Diphone-like units without phonemes-option for very low bit rate speech coding. Proc. Eurocon2001. Bratislava: Faculty of Electrical Engineering and Information Technology, Slovak University of Technology in Bratislava, 2001.
p. 463-467. ISBN: 0-7803-6490-2. Detail