Ústav počítačové grafiky a multimédií
2024
- BENEŠ, K.; KOCOUR, M.; BURGET, L. Hystoc: Obtaining Word Confidences for Fusion of End-To-End ASR Systems. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024.
p. 11276-11280. ISBN: 979-8-3503-4485-1. Detail - ČIEF, M.; KOMPAN, M. Pessimistic Off-Policy Optimization for Learning to Rank. 27TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE. Frontiers in Artificial Intelligence and Applications. Santiago de Compostela: 2024.
p. 1896-1903. ISBN: 978-1-64368-548-9. Detail - HAN, J.; LANDINI, F.; ROHDIN, J.; DIEZ SÁNCHEZ, M.; BURGET, L.; CAO, Y.; LU, H.; ČERNOCKÝ, J. Diacorrect: Error Correction Back-End for Speaker Diarization. In ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul: IEEE Signal Processing Society, 2024.
p. 11181-11185. ISBN: 979-8-3503-4485-1. Detail - HANÁK, J.; NOVÁK, J.; CHUDÝ, P. Cognitive Modeling Approach for Generating Authentic Tactical Agent Behavior. AIAA/IEEE Digital Avionics Systems Conference - Proceedings. San Diego: Institute of Electrical and Electronics Engineers, 2024.
p. 0-0. Detail - HANÁK, J.; NOVÁK, J.; CHUDÝ, P. Tactical Scenario Adaptation for Pilot Training. AIAA/IEEE Digital Avionics Systems Conference - Proceedings. San Diego: Institute of Electrical and Electronics Engineers, 2024.
p. 0-0. Detail - CHLUBNA, T.; MILET, T.; ZEMČÍK, P. Automatic 3D-Display-Friendly Scene Extraction from Video Sequences and Optimal Focusing Distance Identification. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, vol. 83, no. 7,
p. 1-29. ISSN: 1573-7721. Detail - CHLUBNA, T.; MILET, T.; ZEMČÍK, P. How Capturing Camera Trajectory Distortion Affects User Experience on Looking Glass 3D Display. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, vol. 2024, no. 83,
p. 20265-20287. ISSN: 1573-7721. Detail - CHLUBNA, T.; MILET, T.; ZEMČÍK, P. Lightweight All-Focused Light Field Rendering. COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, vol. 244, no. 7,
p. 7-8. ISSN: 1077-3142. Detail - CHLUBNA, T.; ZEMČÍK, P.; MILET, T. Efficient Random-Access GPU Video Decoding for Light-Field Rendering. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, vol. 2024, no. 102,
p. 1-14. ISSN: 1047-3203. Detail - KAPINUS, M.; BERAN, V.; MATERNA, Z.; BAMBUŠEK, D. Augmented Reality Spatial Programming Paradigm Applied to End-User Robot Programming. ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2024, vol. 89, no. 89,
p. 1-13. ISSN: 0736-5845. Detail - KAŠPÁREK, T.; CHUDÝ, P. Pulsar Signal Adaptive Surrogate Modeling. Aerospace, 2024, vol. 11, no. 10,
p. 1-22. ISSN: 2226-4310. Detail - LANDINI, F.; DIEZ SÁNCHEZ, M.; STAFYLAKIS, T.; BURGET, L. DiaPer: End-to-End Neural Diarization With Perceiver-Based Attractors. IEEE Transactions on Audio, Speech, and Language Processing, 2024, vol. 32, no. 7,
p. 3450-3465. ISSN: 1558-7916. Detail - NOVÁK, J.; CHUDÝ, P. Dynamic Soaring in Uncertain Wind Conditions: Polynomial Chaos Expansion Approach. In Machine Learning, Optimization, and Data Science. Lecture Notes in Computer Science. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Grasmere: Springer Nature Switzerland AG, 2024.
p. 104-115. ISBN: 978-3-031-53968-8. ISSN: 0302-9743. Detail - NOVÁK, J.; CHUDÝ, P.; HANÁK, J. Weight-varying Model Predictive Control for Coupled Cyber-Physical Systems: Aerial Grasping Study. Machine Learning, Optimization, and Data Science. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Castiglione della Pescaia: 2024.
p. 1-15. Detail
2023
- APAROVICH, M.; KESIRAJU, S.; DUFKOVÁ, A.; SMRŽ, P. FIT BUT at SemEval-2023 Task 12: Sentiment Without Borders - Multilingual Domain Adaptation for Low-Resource Sentiment Classification. In Proceedings of the The 17th International Workshop on Semantic Evaluation (SemEval-2023). Toronto (online): Association for Computational Linguistics, 2023.
p. 1518-1524. ISBN: 978-1-959429-99-9. Detail - BAMBUŠEK, D.; MATERNA, Z.; KAPINUS, M.; BERAN, V.; SMRŽ, P. How Do I Get There? Overcoming Reachability Limitations of Constrained Industrial Environments in Augmented Reality Applications. In 2023 IEEE Conference on Virtual Reality and 3D User Interfaces (VR). Shanghai: Institute of Electrical and Electronics Engineers, 2023.
p. 115-122. ISBN: 979-8-3503-4815-6. Detail - BAŘINA, D. Experimental lossless data compressor. Microprocessors and Microsystems, 2023, vol. 98, no. 4,
p. 104803-104803. ISSN: 0141-9331. Detail - BHATTACHARJEE, M.; MOTLÍČEK, P.; NIGMATULINA, I.; HELMKE, H.; OHNEISER, O.; KLEINERT, M.; EHR, H. Customization of Automatic Speech Recognition Engines for Rare Word Detection Without Costly Model Re-Training. Proceedings of the 13th SESAR Innovation Days. Seville: SESAR Joint Undertaking, 2023.
p. 1-8. Detail - BOBÁK, P.; ČMOLÍK, L.; ČADÍK, M. Reinforced Labels: Multi-Agent Deep Reinforcement Learning for Point-Feature Label Placement. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2023,
p. 1-14. ISSN: 1077-2626. Detail - BURDISSO, S.; VILLATORO-TELLO, E.; MADIKERI, S.; MOTLÍČEK, P. Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews. In Proceedings of the Annual Conference of International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Dublin: International Speech Communication Association, 2023.
p. 3617-3621. ISSN: 1990-9772. Detail - DE LEON MARTINEZ, S.; MORO, R.; BIELIKOVÁ, M. Eye Tracking as a Source of Implicit Feedback in Recommender Systems: A Preliminary Analysis. In ETRA '23: Proceedings of the 2023 Symposium on Eye Tracking Research and Applications. New York, NY: Association for Computing Machinery, 2023.
p. 1-3. ISBN: 979-8-4007-0150-4. Detail - DELCROIX, M.; TAWARA, N.; DIEZ SÁNCHEZ, M.; LANDINI, F.; SILNOVA, A.; OGAWA, A.; NAKATANI, T.; BURGET, L.; ARAKI, S. Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Dublin: International Speech Communication Association, 2023.
p. 3477-3481. ISSN: 1990-9772. Detail - FAJČÍK, M.; MOTLÍČEK, P.; SMRŽ, P. Claim-Dissector: An Interpretable Fact-Checking System with Joint Re-ranking and Veracity Prediction. In Findings of the Association for Computational Linguistics: ACL 2023. ACL. Toronto: Association for Computational Linguistics, 2023.
p. 10184-10205. ISBN: 978-1-959429-62-3. Detail - GAVRIELIDES, A.; SOPHOCLEOUS, M.; AGAPIOU, G.; LESSI, C.; ŠPAŇHEL, J.; LENDINEZ, A.; QIU, R.; LI, D. Implementing Network Applications for 5G-Enabled Robots Through the 5G-ERA Platform. In IFIP Advances in Information and Communication Technology. IFIP Advances in Information and Communication Technology. Artificial Intelligence Applications and Innovations. Cham: Springer Nature Switzerland AG, 2023.
p. 55-65. ISBN: 978-3-031-34170-0. ISSN: 1868-422X. Detail - HANÁK, J.; CHUDÝ, P.; VLK, J. Collaborative Agents for Synthetic Tactical Training. In AIAA/IEEE Digital Avionics Systems Conference - Proceedings. Barcelona: Institute of Electrical and Electronics Engineers, 2023.
p. 1-9. ISBN: 979-8-3503-3357-2. ISSN: 2155-7195. Detail - HELMKE, H.; KLEINERT, M.; AHRENHOLD, N.; EHR, H.; MÜHLHAUSEN, T.; PINSKA, E.; OHNEISER, O.; KLAMERT, L.; MOTLÍČEK, P.; PRASAD, A.; ZULUAGA-GOMEZ, J.; DOKIC, J. Automatic Speech Recognition and Understanding for Radar Label Maintenance Support Increases Safety and Reduces Air Traffic Controllers' Workload. Proceedings of ATM Seminar. Savannah, Georgia: EUROPEAN ORGANISATION FOR THE SAFETY OF AIR NAVIGATION, 2023.
p. 1-11. Detail - HROMÁDKA, T.; SMOLEŇ, T.; REMIŠ, T.; PECHER, B.; SRBA, I. KInITVeraAI at SemEval-2023 Task 3: Simple yet Powerful Multilingual Fine-Tuning for Persuasion Techniques Detection. In 17th International Workshop on Semantic Evaluation, SemEval 2023 - Proceedings of the Workshop. Toronto: Association for Computational Linguistics, 2023.
p. 629-637. ISBN: 978-1-959429-99-9. Detail - CHLUBNA, T.; MILET, T.; ZEMČÍK, P.; KULA, M. Real-Time Light Field Video Focusing and GPU Accelerated Streaming. Journal of Signal Processing Systems for Signal Image and Video Technology, 2023, vol. 95, no. 6,
p. 703-719. ISSN: 1939-8115. Detail - JURÁNEK, R.; KLEPÁRNÍK, P.; KAPINUS, M.; DOBEŠ, P.; SMRŽ, P. A Study of Real-time Computer Vision Tasks in 5G-enhanced Environment. EuCNC & 6G Summit Proceedings. Gothenburg: 2023.
p. 1-5. Detail - JURÁNEK, R.; MUSIL, P.; MUSIL, M.; NOSKO, S.; ZEMČÍK, P. Smart camera for traffic applications. Journal of Signal Processing Systems for Signal Image and Video Technology, 2023, vol. 95, no. 9,
p. 1067-1077. ISSN: 1939-8115. Detail - KAKOUROS, S.; STAFYLAKIS, T.; MOŠNER, L.; BURGET, L. Speech-Based Emotion Recognition with Self-Supervised Models Using Attentive Channel-Wise Correlations and Label Smoothing. In Proceedings of ICASSP 2023. Rhodes Island: IEEE Signal Processing Society, 2023.
p. 1-5. ISBN: 978-1-7281-6327-7. Detail - KAŠPÁREK, T.; JAVORKA, M.; CHUDÝ, P.; PITOŇÁK, R. On-board data processing for meteor detection on SLAVIA mission. In Proceedings of the 2023 European Data Handling and Data Processing Conference for Space, EDHPC 2023. Juan Les Pins: Institute of Electrical and Electronics Engineers, 2023.
p. 1-5. ISBN: 978-90-90-37924-1. Detail - KESIRAJU, S.; BENEŠ, K.; TIKHONOV, M.; ČERNOCKÝ, J. BUT Systems for IWSLT 2023 Marathi - Hindi Low Resource Speech Translation Task. In 20th International Conference on Spoken Language Translation, IWSLT 2023 - Proceedings of the Conference. Toronto (in-person and online): Association for Computational Linguistics, 2023.
p. 227-234. ISBN: 978-1-959429-84-5. Detail - KESIRAJU, S.; SARVAŠ, M.; PAVLÍČEK, T.; MACAIRE, C.; CIUBA, A. Strategies for Improving Low Resource Speech to Text Translation Relying on Pre-trained ASR Models. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Dublin: International Speech Communication Association, 2023.
p. 2148-2152. ISSN: 1990-9772. Detail - KHALIL, D.; PRASAD, A.; MOTLÍČEK, P.; ZULUAGA-GOMEZ, J.; NIGMATULINA, I.; MADIKERI, S.; SCHUEPBACH, C. An Automatic Speaker Clustering Pipeline for the Air Traffic Communication Domain. Aerospace, 2023, vol. 10, no. 10,
p. 1-14. ISSN: 2226-4310. Detail - KIEFER, B.; BARTL, V.; ŠPAŇHEL, J.; HEROUT, A.; YANG, M., et al. 1st Workshop on Maritime Computer Vision (MaCVi) 2023: Challenge Results. In 2023 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW). Waikoloa, Hawaii: Institute of Electrical and Electronics Engineers, 2023.
p. 265-302. ISBN: 979-8-3503-2056-5. Detail - KIŠŠ, M.; HRADIŠ, M.; BENEŠ, K.; BUCHAL, P.; KULA, M. SoftCTC-semi-supervised learning for text recognition using soft pseudo-labels. International Journal on Document Analysis and Recognition, 2023, vol. 2024, no. 27,
p. 177-193. ISSN: 1433-2825. Detail - KOCUR, V.; HEGROVÁ, V.; PATOČKA, M.; NEUMAN, J.; HEROUT, A. Correction of AFM data artifacts using a convolutional neural network trained with synthetically generated data. Ultramicroscopy, 2023, vol. 246, no. 1,
p. 113666-113666. ISSN: 0304-3991. Detail - KOHÚT, J.; HRADIŠ, M. Finetuning Is a Surprisingly Effective Domain Adaptation Baseline in Handwriting Recognition. In Document Analysis and Recognition - ICDAR 2023. Lecture Notes in Computer Science. Lecture Notes in Computer Science. San José: Springer Nature Switzerland AG, 2023.
p. 269-286. ISBN: 978-3-031-41684-2. ISSN: 0302-9743. Detail - KOHÚT, J.; HRADIŠ, M.; KIŠŠ, M. Towards Writing Style Adaptation in Handwriting Recognition. In Document Analysis and Recognition - ICDAR 2023. Lecture Notes in Computer Science. Lecture Notes in Computer Science. San José: Springer Nature Switzerland AG, 2023.
p. 377-394. ISBN: 978-3-031-41684-2. ISSN: 0302-9743. Detail - LANDINI, F.; DIEZ SÁNCHEZ, M.; LOZANO DÍEZ, A.; BURGET, L. Multi-Speaker and Wide-Band Simulated Conversations as Training Data for End-to-End Neural Diarization. In Proceedings of ICASSP 2023. Rhodes Island: IEEE Signal Processing Society, 2023.
p. 1-5. ISBN: 978-1-7281-6327-7. Detail - LI, J.; ELLIS, D.; KODYM, O.; HEROUT, A.; ŠPANĚL, M.; EGGER, J. Towards clinical applicability and computational efficiency in automatic cranial implant design: An overview of the AutoImplant 2021 cranial implant design challenge. MEDICAL IMAGE ANALYSIS, 2023, vol. 88, no. 102865,
p. 1-15. ISSN: 1361-8423. Detail - MAI, F.; ZULUAGA-GOMEZ, J.; PARCOLLET, T.; MOTLÍČEK, P. HyperConformer: Multi-head HyperMixer for Efficient Speech Recognition. In Proceedings of the Annual Conference of International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Dublin: International Speech Communication Association, 2023.
p. 2213-2217. ISSN: 1990-9772. Detail - MATĚJKA, P.; SILNOVA, A.; SLAVÍČEK, J.; MOŠNER, L.; PLCHOT, O.; KLČO, M.; PENG, J.; STAFYLAKIS, T.; BURGET, L. Description and Analysis of ABC Submission to NIST LRE 2022. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Dublin: International Speech Communication Association, 2023.
p. 511-515. ISSN: 1990-9772. Detail - MATERNA, Z.; KAPINUS, M.; DOBEŠ, P.; JURÁNEK, R.; SMRŽ, P. Communication Framework for 5G-Enabled Network Applications. EuCNC & 6G Summit Proceedings. Gothenburg: 2023.
p. 1-5. Detail - MOŠNER, L.; PLCHOT, O.; PENG, J.; BURGET, L.; ČERNOCKÝ, J. Multi-Channel Speech Separation with Cross-Attention and Beamforming. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Dublin: International Speech Communication Association, 2023.
p. 1693-1697. ISSN: 1990-9772. Detail - MOTLÍČEK, P.; PRASAD, A.; NIGMATULINA, I.; HELMKE, H.; OHNEISER, O.; KLEINERT, M. Automatic Speech Analysis Framework for ATC Communication in HAAWAII. Proceedings of the 13th SESAR Innovation Days. Seville: SESAR Joint Undertaking, 2023.
p. 1-9. Detail - NIGMATULINA, I.; MADIKERI, S.; VILLATORO-TELLO, E.; MOTLÍČEK, P.; ZULUAGA-GOMEZ, J.; PANDIA, K.; GANAPATHIRAJU, A. Implementing contextual biasing in GPU decoder for online ASR. In Proceedings of the Annual Conference of International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Dublin: International Speech Communication Association, 2023.
p. 4494-4498. ISSN: 1990-9772. Detail - NOVÁK, J.; CHUDÝ, P. Surrogate Modeling of Optimal Control Based Collision Avoidance System for Multirotor Unmanned Aerial Vehicles. In AIAA/IEEE Digital Avionics Systems Conference - Proceedings. Barcelona: Institute of Electrical and Electronics Engineers, 2023.
p. 1-7. ISBN: 979-8-3503-3357-2. ISSN: 2155-7195. Detail - OMACHTOVÁ, A.; HEROUT, A.; BAMBUŠEK, D.; JUŘÍK, V. How to shoot yourself right with a smartphone?. VIRTUAL REALITY, 2023, vol. 2023, no. 1,
p. 1-13. ISSN: 1434-9957. Detail - PAPŠO, R. Complementary Product Recommendation for Long-tail Products. The 17th ACM Recommender Systems Conference. New York: Association for Computing Machinery, 2023.
p. 1305-1311. ISBN: 979-8-4007-0241-9. Detail - PENG, J.; PLCHOT, O.; STAFYLAKIS, T.; MOŠNER, L.; BURGET, L.; ČERNOCKÝ, J. An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification. In 2022 IEEE Spoken Language Technology Workshop, SLT 2022 - Proceedings. Doha: IEEE Signal Processing Society, 2023.
p. 555-562. ISBN: 978-1-6654-7189-3. Detail - PENG, J.; PLCHOT, O.; STAFYLAKIS, T.; MOŠNER, L.; BURGET, L.; ČERNOCKÝ, J. Improving Speaker Verification with Self-Pretrained Transformer Models. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Dublin: International Speech Communication Association, 2023.
p. 5361-5365. ISSN: 1990-9772. Detail - PENG, J.; STAFYLAKIS, T.; GU, R.; PLCHOT, O.; MOŠNER, L.; BURGET, L.; ČERNOCKÝ, J. Parameter-Efficient Transfer Learning of Pre-Trained Transformer Models for Speaker Verification Using Adapters. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Rhodes Island: IEEE Signal Processing Society, 2023.
p. 1-5. ISBN: 978-1-7281-6327-7. Detail - POLÁŠEK, T.; ČADÍK, M. Predicting Photovoltaic Power Production using High-Uncertainty Weather Forecasts. APPLIED ENERGY, 2023, vol. 2023, no. 339,
p. 120989-121004. ISSN: 0306-2619. Detail - POLÁŠEK, T.; ČADÍK, M.; KELLER, Y.; BENEŠ, B. Vision UFormer: Long-Range Monocular Absolute Depth Estimation. COMPUTERS & GRAPHICS-UK, 2023, vol. 111, no. 4,
p. 180-189. ISSN: 0097-8493. Detail - ŘIHÁČEK, T.; NEHYBA, J.; ČEVELÍČEK, M.; POLOK, A.; MATĚJKA, P.; DOLEŽAL, P. DeePsy: Představení online nástroje pro zpětnou vazbu v psychoterapii. Psychoterapie. Masarykova univerzita AN FL, 2023, roč. 17, č. 1,
s. 1-11. ISSN: 1802-3983. Detail - SILNOVA, A.; BRUMMER, J.; SWART, A.; BURGET, L. Toroidal Probabilistic Spherical Discriminant Analysis. In Proceedings of ICASSP 2023. Rhodes Island: IEEE Signal Processing Society, 2023.
p. 1-5. ISBN: 978-1-7281-6327-7. Detail - SILNOVA, A.; SLAVÍČEK, J.; MOŠNER, L.; KLČO, M.; PLCHOT, O.; MATĚJKA, P.; PENG, J.; STAFYLAKIS, T.; BURGET, L. ABC System Description for NIST LRE 2022. Proceedings of NIST LRE 2022 Workshop. Washington DC: National Institute of Standards and Technology, 2023.
p. 1-5. Detail - SKOWRON, M.; BACKFRIED, G.; NAVAS, E.; BERZINŠ, A.; VAN, J.; DE, F.; DEMARCO, A.; POLÁK, P.; KOVÁČ, M.; POLÁK, P.; ROHDIN, J.; ROSNER, M.; SANCHEZ, J.; SARATXAGA, I.; SCHWARZ, P. Deep Dive Speech Technology. In European Language Equality. Cham: Springer Nature Switzerland AG, 2023.
p. 289-312. ISBN: 978-3-031-28819-7. Detail - SRBA, I.; MÓRO, R.; TOMLEIN, M.; PECHER, B.; ŠIMKO, J.; ŠTEFANCOVÁ, E.; KOMPAN, M.; HRČKOVÁ, A.; PODROUŽEK, J.; GAVORNÍK, A.; BIELIKOVÁ, M. Auditing YouTube's Recommendation Algorithm for Misinformation Filter Bubbles. ACM transactions on recommender systems, 2023, vol. 1, no. 1,
p. 1-33. ISSN: 2770-6699. Detail - STAFYLAKIS, T.; MOŠNER, L.; KAKOUROS, S.; PLCHOT, O.; BURGET, L.; ČERNOCKÝ, J. Extracting speaker and emotion information from self-supervised speech models via channel-wise correlations. In 2022 IEEE Spoken Language Technology Workshop, SLT 2022 - Proceedings. Doha: IEEE Signal Processing Society, 2023.
p. 1136-1143. ISBN: 978-1-6654-7189-3. Detail - SUKEI, E.; DE LEON MARTINEZ, S.; OLMOS, M.; ARTES, A. Automatic Patient Functionality Assessment from Multimodal Data using Deep Learning Techniques - Development and Feasibility Evaluation. Internet Interventions, 2023, vol. 33, no. 100657,
p. 1-9. ISSN: 2214-7829. Detail - SUKEI, E.; ROMERO-MEDRANO, L.; DE LEON MARTINEZ, S.; HERRERA, J.; CAMPANA-MONTES, J.; OLMOS, M.; BACA-GARCIA, E.; ARTES, A. Continuous Assessment of Function and Disability via Mobile Sensing: Real-World Data-Driven Feasibility Study. JMIR Formative Research, 2023, vol. 7, no. 2023,
p. 1-10. ISSN: 2561-326X. Detail - VANDERREYDT, G.; PRASAD, A.; KHALIL, D.; MADIKERI, S.; DEMUYNCK, K.; MOTLÍČEK, P. Parameter-Efficient Tuning With Adaptive Bottlenecks For Automatic Speech Recognition. Proceedings of IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). Taipei: IEEE Signal Processing Society, 2023.
p. 1-7. ISBN: 979-8-3503-0689-7. Detail - VILLATORO-TELLO, E.; MADIKERI, S.; ZULUAGA-GOMEZ, J.; SHARMA, B.; SARFJOO, S.; NIGMATULINA, I.; MOTLÍČEK, P.; IVANOV, V.; GANAPATHIRAJU, A. Effectiveness of Text, Acoustic, and Lattice-Based Representations in Spoken Language Understanding Tasks. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Rhodes Island: IEEE Signal Processing Society, 2023.
p. 1-5. ISBN: 978-1-7281-6327-7. Detail - YU, D.; GONG, Y.; PICHENY, A.; RAMABHADRAN, B.; HAKKANI-TÜR, D.; PRASAD, R.; ZEN, H.; SKOGLUND, J.; ČERNOCKÝ, J.; BURGET, L.; MOHAMED, A. Twenty-Five Years of Evolution in Speech and Language Processing. IEEE SIGNAL PROCESSING MAGAZINE, 2023, vol. 40, no. 5,
p. 27-39. ISSN: 1558-0792. Detail - YUSUF, B.; ČERNOCKÝ, J.; SARAÇLAR, M. End-to-End Open Vocabulary Keyword Search With Multilingual Neural Representations. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, 2023, vol. 31, no. 08,
p. 3070-3080. ISSN: 2329-9290. Detail - YUSUF, B.; GOURAV, A.; GANDHE, A.; BULYKO, I. On-the-Fly Text Retrieval for end-to-end ASR Adaptation. In Proceedings of ICASSP 2023. Rhodes Island: IEEE Signal Processing Society, 2023.
p. 1-5. ISBN: 978-1-7281-6327-7. Detail - ZULUAGA-GOMEZ, J.; NIGMATULINA, I.; PRASAD, A.; MOTLÍČEK, P.; KHALIL, D.; MADIKERI, S.; TART, A.; SZŐKE, I.; LENDERS, V.; RIGAULT, M.; CHOUKRI, K. Lessons Learned in Transcribing 5000 h of Air Traffic Control Communications for Robust Automatic Speech Understanding. Aerospace, 2023, vol. 2023, no. 10,
p. 1-33. ISSN: 2226-4310. Detail - ZULUAGA-GOMEZ, J.; PRASAD, A.; NIGMATULINA, I.; MOTLÍČEK, P.; KLEINERT, M.;. A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers. Aerospace, 2023, vol. 10, no. 5,
p. 1-25. ISSN: 2226-4310. Detail - ZULUAGA-GOMEZ, J.; PRASAD, A.; NIGMATULINA, I.; SARFJOO, S.; MOTLÍČEK, P.; KLEINERT, M.; HELMKE, H.; OHNEISER, O.; ZHAN, Q. How Does Pre-Trained Wav2Vec 2.0 Perform on Domain-Shifted ASR? an Extensive Benchmark on Air Traffic Control Communications. In IEEE Spoken Language Technology Workshop, SLT 2022 - Proceedings. Doha: IEEE Signal Processing Society, 2023.
p. 205-212. ISBN: 978-1-6654-7189-3. Detail - ZULUAGA-GOMEZ, J.; SARFJOO, S.; PRASAD, A.; NIGMATULINA, I.; MOTLÍČEK, P.; ONDŘEJ, K.; OHNEISER, O.; HELMKE, H. BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications. In IEEE Spoken Language Technology Workshop, SLT 2022 - Proceedings. Doha: IEEE Signal Processing Society, 2023.
p. 633-640. ISBN: 978-1-6654-7189-3. Detail - ŽMOLÍKOVÁ, K.; DELCROIX, M.; OCHIAI, T.; ČERNOCKÝ, J.; KINOSHITA, K.; YU, D. Neural Target Speech Extraction: An overview. IEEE SIGNAL PROCESSING MAGAZINE, 2023, vol. 40, no. 3,
p. 8-29. ISSN: 1558-0792. Detail