Department of Computer Graphics and Multimedia


2025

PENG Junyi, MOŠNER Ladislav, ZHANG Lin, PLCHOT Oldřich, STAFYLAKIS Themos, BURGET Lukáš and ČERNOCKÝ Jan. CA-MHFA: A Context-Aware Multi-Head Factorized Attentive Pooling for SSL-Based Speaker Verification. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Hyderabad: IEEE Signal Processing Society, 2025, pp. 1-5. ISBN 979-8-3503-6874-1. Detail
CHLUBNA Tomáš and ZEMČÍK Pavel. Comparative Survey of Image Compression Methods Across Different Pixel Formats and Bit Depths. Signal, Image and Video Processing, vol. 19, no. 12, 2025, p. 13. ISSN 1863-1703. Detail
HANÁK Jiří, NOVÁK Jiří, CHUDÝ Peter and BEN-ASHER Joseph Z. Cross-Entropy Method for Laser Defense Applications. Journal of Aerospace Information Systems, vol. 22, no. 1, 2025, pp. 53-58. ISSN 2327-3097. Detail
ČIEF Matej and KOMPAN Michal. Cross-Validated Off-Policy Evaluation. In: Proceedings of the AAAI Conference on Artificial Intelligence. Pennsylvania, 2025, pp. 16073-16081. ISBN 978-1-57735-897-8. Detail
HORI Takaaki, KOCOUR Martin, HAIDER Adnan, MCDERMOTT Erik and ZHUANG Xiaodan. Delayed Fusion: Integrating Large Language Models into First-Pass Decoding in End-to-end Speech Recognition. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Hyderabad: IEEE Signal Processing Society, 2025, pp. 1-5. ISBN 979-8-3503-6874-1. Detail
ŠILLING Petr and ŠPANĚL Michal. DEMIS: Electron Microscopy Image Stitching using Deep Learning Features and Global Optimisation. In: Proceedings of the 18th International Joint Conference on Biomedical Engineering Systems and Technologies - BIOIMAGING. Porto: Institute for Systems and Technologies of Information, Control and Communication, 2025, pp. 255-256. ISBN 978-989-758-731-3. Detail
DE Leon Martinez Santiago Jose, FOUCHER Valentin and MÓRO Róbert. Eye Movements as Indicators of Deception: A Machine Learning Approach. In: ETRA '25: Proceedings of the 2025 Symposium on Eye Tracking Research and Applications. New York, 2025, pp. 1-7. Detail
CHLUBNA Tomáš, VLNAS Michal, BAŘINA David, MILET Tomáš and ZEMČÍK Pavel. Focus-aware compression and image quality metric for 3D displays. Signal Processing, vol. 2026, no. 238, 2025, pp. 1-14. ISSN 0165-1684. Detail
CHLUBNA Tomáš, MILET Tomáš and ZEMČÍK Pavel. How Color Profile Affects the Visual Quality in Light Field Rendering and Novel View Synthesis. Multimedia Tools and Applications, vol. 84, no. 14, 2025, pp. 11079-11095. ISSN 1573-7721. Detail
BAŘINA David. Improved verification limit for the convergence of the Collatz conjecture. The Journal of Supercomputing, vol. 81, no. 1, 2025, pp. 1-14. ISSN 1573-0484. Detail
PÁLKA Petr, LANDINI Federico Nicolás, KLEMENT Dominik, DIEZ Sánchez Mireia, SILNOVA Anna, DELCROIX Marc and BURGET Lukáš. Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization. In: Proceedings of Eusipco 2025. Palermo: IEEE Signal Processing Society, 2025, pp. 1-5. Detail
HAN Jiangyu, LANDINI Federico Nicolás, ROHDIN Johan A., SILNOVA Anna, DIEZ Sánchez Mireia and BURGET Lukáš. Leveraging Self-Supervised Learning for Speaker Diarization. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Hyderabad: IEEE Signal Processing Society, 2025, pp. 1-5. ISBN 979-8-3503-6874-1. Detail
CHLUBNA Tomáš, MILET Tomáš and ZEMČÍK Pavel. Light Field Video Streaming on GPU. Signal Processing: Image Communication, vol. 2025, no. 138, p. 12. ISSN 0923-5965. Detail
ČEGIŇ Ján and ŠIMKO Jakub. LLMs vs Established Text Augmentation Techniques for Classification: When do the Benefits Outweight the Costs?. In: Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers). Albuquerque, New Mexico: Association for Computational Linguistics, 2025, pp. 10476-10496. ISBN 979-8-8917-6189-6. Detail
SKOG Kasper, KOHOUT Tomáš, KAŠPÁREK Tomáš and WOLFMAYR Monika et al. Lossless Hyperspectral Image Compression in Comet Interceptor and Hera Missions with Restricted Bandwith. Remote Sensing, vol. 17, no. 899, 2025, pp. 1-18. ISSN 2072-4292. Detail
VLNAS Michal, MILET Tomáš and ZEMČÍK Pavel. Low-error Reconstruction of Directional Functions with Spherical Harmonics. IEEE Transactions on Visualization and Computer Graphics, vol. 31, no. 10, 2025, pp. 8413-8424. ISSN 1077-2626. Detail
LOJDA Jakub, STRNADEL Josef, SMRŽ Pavel and ŠIMEK Václav. Multi-Partner Project: LoLiPoP-IoT - Design and Simulation of Energy-Efficient Devices for the Internet of Things. In: 2025 Design, Automation & Test in Europe Conference (DATE) Proceedings. Lyon: Institute of Electrical and Electronics Engineers, 2025, pp. 1-7. ISBN 978-3-9826741-0-0. Detail
LOJDA Jakub, JOYCE Daire, SMRŽ Pavel, KATHURIA Shruti, STRNADEL Josef, QUINN Caitlin, ŠIMEK Václav and STAROŇ Patrik. Portable Simulation Models for Energy Aspects of IoT Devices in the LoLiPoP-IoT Project. In: 2025 28th Euromicro Conference on Digital System Design (DSD). Salerno: IEEE Computer Society, 2025, pp. 368-375. ISBN 979-8-3315-8499-3. Detail
DE Leon Martinez Santiago Jose, KANG Jingwei, MORO Robert, DE Rijke Maarten, KVETON Branislav, OOSTERHUIS Harrie and BIELIKOVÁ Mária. RecGaze: The First Eye Tracking and User Interaction Dataset for Carousel Interfaces. In: SIGIR '25: Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: Association for Computing Machinery, 2025, pp. 3702-3711. ISBN 979-8-4007-1592-1. Detail
KANG Jingwei, DE Rijke Maarten, DE Leon Martinez Santiago Jose and OOSTERHUIS Harrie. Beyond the Single List: How Should One Design Click Models for Carousel Interfaces?. In: ICTIR '25: Proceedings of the 2025 International ACM SIGIR Conference on Innovative Concepts and Theories in Information Retrieval (ICTIR). New York City: Association for Computing Machinery, 2025, pp. 44-55. ISBN 979-8-4007-1861-8. Detail
VYKOPAL Ivan, OSTERMANN Simon and ŠIMKO Marián. Soft Language Prompts for Language Transfer. In: Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers). Albuquerque: Association for Computational Linguistics, 2025, pp. 10294-10313. ISBN 979-8-8917-6189-6. Detail
ŠILLING Petr, PUKANEC Dávid, KUBÍK Tibor and ŠPANĚL Michal. Souhrnná výzkumná zpráva k projektu TESCAN 3DIM - Nové trendy v analýze obrazových a 3D dat. Brno: TESCAN 3DIM, s.r.o., 2025. Detail
POLOK Alexander, KLEMENT Dominik, WIESNER Matthew, KHUDANPUR Sanjeev, ČERNOCKÝ Jan and BURGET Lukáš. Target Speaker ASR with Whisper. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Hyderabad: IEEE Signal Processing Society, 2025, pp. 1-5. ISBN 979-8-3503-6874-1. Detail
NÄSILÄ Antti, KOHOUT Tomáš and KAŠPÁREK Tomáš et al. The Asteroid Spectral Imager (ASPECT) on the Milani CubeSat. Space Science Reviews, vol. 2025, no. 221, pp. 1-27. ISSN 1572-9672. Detail
KUBÍK Tibor, GUIBAULT François, ŠPANĚL Michal and LOMBAERT Hervé. ToothForge: Automatic Dental Shape Generation using Synchronized Spectral Embeddings. In: Proceedings of Information Processing in Medical Imaging 2025. Kos, 2025, pp. 1-14. Detail
PENG Junyi, ASHIHARA Takanori, DELCROIX Marc, OCHIAI Tsubasa, PLCHOT Oldřich, ARAKI Shoko and ČERNOCKÝ Jan. TS-SUPERB: A Target Speech Processing Benchmark for Speech Self-Supervised Learning Models. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Hyderabad: IEEE Signal Processing Society, 2025, pp. 1-5. ISBN 979-8-3503-6874-1. Detail
NOVÁK Jiří, CHUDÝ Peter and HANÁK Jiří. Weight-varying Model Predictive Control for Coupled Cyber-Physical Systems: Aerial Grasping Study. In: Machine Learning, Optimization, and Data Science. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Castiglione della Pescaia: Springer Nature Switzerland AG, 2025, pp. 1-15. ISSN 0302-9743. Detail

2024

PECHER Branislav, SRBA Ivan and BIELIKOVÁ Mária. A Survey on Stability of Learning with Limited Labelled Data and its Sensitivity to the Effects of Randomness. ACM Computing Surveys, vol. 57, no. 1, 2024, pp. 1-40. ISSN 0360-0300. Detail
ALAM Jahangir, BARAHONA Quirós Sara, BOBOŠ Dominik, BURGET Lukáš, CUMANI Sandro, DAHMANE Mohamed, HAN Jiangyu, HLAVÁČEK Miroslav, KODOVSKÝ Martin, LANDINI Federico Nicolás, MOŠNER Ladislav, PÁLKA Petr, PAVLÍČEK Tomáš, PENG Junyi, PLCHOT Oldřich, RAJASEKHAR Gnana Praveen, ROHDIN Johan A., SILNOVA Anna, STAFYLAKIS Themos and ZHANG Lin. ABC SYSTEM DESCRIPTION FOR NIST SRE 2024. In: Proceedings of NIST SRE 2024. San Juan: National Institute of Standards and Technology, 2024, pp. 1-9. Detail
WANG Shuai, CHEN Zhengyang, HAN Bing, WANG Hongji, XIANG Xu, ROHDIN Johan A., SILNOVA Anna, QIAN Yanmin and LI Haizhou et al. Advancing speaker embedding learning: Wespeaker toolkit for research and production. Speech Communication, vol. 162, no. 103104, 2024, pp. 1-12. ISSN 0167-6393. Detail
KAPINUS Michal, BERAN Vítězslav, MATERNA Zdeněk and BAMBUŠEK Daniel. Augmented Reality Spatial Programming Paradigm Applied to End-User Robot Programming. Robotics and Computer-Integrated Manufacturing, vol. 89, no. 89, 2024, pp. 1-13. ISSN 0736-5845. Detail
CHLUBNA Tomáš, MILET Tomáš and ZEMČÍK Pavel. Automatic 3D-Display-Friendly Scene Extraction from Video Sequences and Optimal Focusing Distance Identification. Multimedia Tools and Applications, vol. 83, no. 7, 2024, pp. 1-29. ISSN 1573-7721. Detail
PEŠÁN Jan, JUŘÍK Vojtěch, KARAFIÁT Martin and ČERNOCKÝ Jan. BESST Dataset: A Multimodal Resource for Speech-based Stress Detection and Analysis. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, pp. 1355-1359. ISSN 1990-9772. Detail
ROHDIN Johan A., ZHANG Lin, PLCHOT Oldřich, STANĚK Vojtěch, MIHOLA David, PENG Junyi, STAFYLAKIS Themos, BEVERAKI Dmitriy, SILNOVA Anna, BRUKNER Jan and BURGET Lukáš. BUT systems and analyses for the ASVspoof 5 Challenge. In: Proceedings of ASVspoof 2024 Workshop. Kos Island: International Speech Communication Association, 2024, pp. 24-31. Detail
POLOK Alexander, KLEMENT Dominik, HAN Jiangyu, SEDLÁČEK Šimon, YUSUF Bolaji, MACIEJEWSKI Matthew, WIESNER Matthew and BURGET Lukáš. BUT/JHU System Description for CHiME-8 NOTSOFAR-1 Challenge. In: Proceedings of CHiME 2024 Workshop. Kos Island: International Speech Communication Association, 2024, pp. 18-22. Detail
HANÁK Jiří, NOVÁK Jiří and CHUDÝ Peter. Cognitive Modeling Approach for Generating Authentic Tactical Agent Behavior. In: AIAA/IEEE Digital Avionics Systems Conference - Proceedings. San Diego: Institute of Electrical and Electronics Engineers, 2024, pp. 1-15. ISBN 979-8-3503-4961-0. ISSN 2155-7195. Detail
KUNEŠOVÁ Marie, ZAJÍC Zbyněk, ŠMÍDL Luboš and KARAFIÁT Martin. Comparison of wav2vec 2.0 models on three speech processing tasks. International Journal of Speech Technology, vol. 27, no. 4, 2024, pp. 847-859. ISSN 1572-8110. Detail
BHATTACHARJEE Mrinmoy, NIGMATULINA Iuliia, PRASAD Amrutha, RANGAPPA Pradeep, MADIKERI Srikanth, MOTLÍČEK Petr, HELMKE Hartmut and KLEINERT Matthias. Contextual Biasing Methods for Improving Rare Word Detection in Automatic Speech Recognition. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 12652-12656. ISBN 979-8-3503-4485-1. Detail
BURDISSO Sergio, RAMIREZ Reyes Ernesto Antonio, VILLATORO-TELLO Esaú, SÁNCHEZ-VEGA Fernando, LÓPEZ-MONROY A. Pastor and MOTLÍČEK Petr. DAIC-WOZ: On the Validity of Using the Therapist's prompts in Automatic Depression Detection from Clinical Interviews. In: Proceedings of the 6th Clinical Natural Language Processing Workshop. Mexico City: Association for Computational Linguistics, 2024, pp. 82-90. Detail
RANGAPPA Pradeep, MUSCAT Amanda, SANCHEZ-LARA Alejandra, MOTLÍČEK Petr, ANTONOPOULOU Michaela, FOURFOURIS Ioannis, SKARLATOS Antonios, AVGERINOS Nikos, TSANGARIS Manolis and KOSTKA Kasia. Detecting Criminal Networks via Non-Content Communication Data Analysis Techniques from the TRACY Project. In: Proceedings of the 15th EAI International Conference on Digital Forensics & Cyber Crime (EAI-ICDF2C24). Dubrovnik, 2024, pp. 1-15. Detail
HAN Jiangyu, LANDINI Federico Nicolás, ROHDIN Johan A., DIEZ Sánchez Mireia, BURGET Lukáš, CAO Yuhang, LU Heng and ČERNOCKÝ Jan. Diacorrect: Error Correction Back-End for Speaker Diarization. In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul: IEEE Signal Processing Society, 2024, pp. 11181-11185. ISBN 979-8-3503-4485-1. Detail
LANDINI Federico Nicolás, DIEZ Sánchez Mireia, STAFYLAKIS Themos and BURGET Lukáš. DiaPer: End-to-End Neural Diarization With Perceiver-Based Attractors. IEEE Transactions on Audio, Speech, and Language Processing, vol. 32, no. 7, 2024, pp. 3450-3465. ISSN 1558-7916. Detail
KLEMENT Dominik, DIEZ Sánchez Mireia, LANDINI Federico Nicolás, BURGET Lukáš, SILNOVA Anna, DELCROIX Marc and TAWARA Naohiro. Discriminative Training of VBx Diarization. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 11871-11875. ISBN 979-8-3503-4485-1. Detail
VYKOPAL Ivan, PIKULIAK Matúš, SRBA Ivan, MÓRO Róbert, MACKO Dominik and BIELIKOVÁ Mária. Disinformation Capabilities of Large Language Models. In: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Bangkok: Association for Computational Linguistics, 2024, pp. 14830-14847. ISBN 979-8-8917-6094-3. Detail
ZHANG Lin, STAFYLAKIS Themos, LANDINI Federico Nicolás, DIEZ Sánchez Mireia, SILNOVA Anna and BURGET Lukáš. Do End-to-End Neural Diarization Attractors Need to Encode Speaker Characteristic Information?. In: Proceedings of Odyssey 2024: The Speaker and Language Recognition Workshop. Québec City: International Speech Communication Association, 2024, pp. 123-130. Detail
NOVÁK Jiří and CHUDÝ Peter. Dynamic Soaring in Uncertain Wind Conditions: Polynomial Chaos Expansion Approach. In: Machine Learning, Optimization, and Data Science. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Grasmere: Springer Nature Switzerland AG, 2024, pp. 104-115. ISBN 978-3-031-53968-8. ISSN 0302-9743. Detail
ČEGIŇ Ján, PECHER Branislav, ŠIMKO Jakub, SRBA Ivan and BIELIKOVÁ Mária. Effects of diversity incentives on sample diversity and downstream model performance in LLM-based text augmentation. In: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Bangkok: Association for Computational Linguistics, 2024, pp. 13148-13171. ISBN 979-8-8917-6094-3. Detail
CHLUBNA Tomáš, ZEMČÍK Pavel and MILET Tomáš. Efficient Random-Access GPU Video Decoding for Light-Field Rendering. Journal of Visual Communication and Image Representation, vol. 2024, no. 102, pp. 1-14. ISSN 1047-3203. Detail
DEKEL Shay, KELLER Yosi and ČADÍK Martin. Estimating Extreme 3D Image Rotations using Cascaded Attention. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Seattle: IEEE Computer Society, 2024, pp. 2588-2598. ISBN 979-8-3503-5301-3. Detail
PECHER Branislav, ČEGIŇ Ján, BELANEC Róbert, SRBA Ivan, ŠIMKO Jakub and BIELIKOVÁ Mária. Fighting Randomness With Randomness: Mitigating Optimisation Instability of Fine-Tuning Using Ensemble and Noise Regularisation. In: Findings of the Association for Computational Linguistics: EMNLP 2024. Miami: Association for Computational Linguistics, 2024, pp. 11005-11044. ISBN 979-8-8917-6168-1. Detail
PRASAD Amrutha, CAROFILIS Andrés, VANDERREYDT Geoffroy, KHALIL Driss, MADIKERI Srikanth, MOTLÍČEK Petr and SCHUEPBACH Christof. Fine-Tuning Self-Supervised Models for Language Identification Using Orthonormal Constraint. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 11921-11925. ISBN 979-8-3503-4485-1. Detail
LOJDA Jakub, STRNADEL Josef, SMRŽ Pavel and ŠIMEK Václav. First Steps Towards Unified Low-Power IoT Design: The "DYNAMIC" Framework. In: 2024 IEEE East-West Design and Test Symposium, EWDTS 2024 - Proceedings. Yerevan: Institute of Electrical and Electronics Engineers, 2024, pp. 1-6. ISBN 979-8-3315-1576-8. Detail
CHLUBNA Tomáš, MILET Tomáš and ZEMČÍK Pavel. How Capturing Camera Trajectory Distortion Affects User Experience on Looking Glass 3D Display. Multimedia Tools and Applications, vol. 2024, no. 83, pp. 20265-20287. ISSN 1573-7721. Detail
NOVÁK Jiří, HANÁK Jiří and CHUDÝ Peter. Hybrid Modeling Approach for Optimization Based Control of Multirotor Unmanned Aerial Vehicles. In: ICAS Proceedings. Florence: International Council of the Aeronautical Sciences, 2024, pp. 1-10. ISSN 2958-4647. Detail
BENEŠ Karel, KOCOUR Martin and BURGET Lukáš. Hystoc: Obtaining Word Confidences for Fusion of End-To-End ASR Systems. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 11276-11280. ISBN 979-8-3503-4485-1. Detail
STAFYLAKIS Themos, SILNOVA Anna, ROHDIN Johan A., PLCHOT Oldřich and BURGET Lukáš. Challenging margin-based speaker embedding extractors by using the variational information bottleneck. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, pp. 3220-3224. ISSN 1990-9772. Detail
ČIEF Matej. Learning Action Embeddings for Off-Policy Evaluation. In: ECIR 2024: Advances in Information Retrieval. Glasgow: Springer Nature Switzerland AG, 2024, pp. 108-122. Detail
CHLUBNA Tomáš, MILET Tomáš and ZEMČÍK Pavel. Lightweight All-Focused Light Field Rendering. Computer Vision and Image Understanding, vol. 244, no. 7, 2024, pp. 7-8. ISSN 1077-3142. Detail
KUBÍK Tibor and ŠPANĚL Michal. LMVSegRNN and Poseidon3D: Addressing Challenging Teeth Segmentation Cases in 3D Dental Surface Orthodontic Scans. Bioengineering, vol. 11, no. 10, 2024, pp. 1-18. ISSN 2306-5354. Detail
STRNADEL Josef, LOJDA Jakub, SMRŽ Pavel and ŠIMEK Václav. Machine Learning in Context of IoT/Edge Devices and LoLiPoP-IoT Project. In: Proceedings of 32nd Austrian Workshop on Microelectronics (Austrochip 2024). Vienna: Institute of Electrical and Electronics Engineers, 2024, pp. 1-4. ISBN 979-8-3315-1617-8. Detail
NOVÁK Jiří, CHUDÝ Peter and HANÁK Jiří. Model Predictive Control Driven Aerial Grasping with Soft Operational Constraints. In: ICAS Proceedings. Florence: International Council of the Aeronautical Sciences, 2024, pp. 1-15. ISSN 2958-4647. Detail
MOŠNER Ladislav, SERIZEL Romain, BURGET Lukáš, PLCHOT Oldřich, VINCENT Emmanuel, PENG Junyi and ČERNOCKÝ Jan. Multi-Channel Extension of Pre-trained Models for Speaker Verification. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Kos: International Speech Communication Association, 2024, pp. 2135-2139. ISSN 1990-9772. Detail
KUMAR Sashi, MADIKERI Srikanth, NIGMATULINA Iuliia, VILLATORO-TELLO Esaú, MOTLÍČEK Petr, PANDIA Karthick, DUBAGUNTA S. Pavankumar and GANAPATHIRAJU Aravind. Multitask Speech Recognition and Speaker Change Detection for Unknown Number of Speakers. In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul: IEEE Signal Processing Society, 2024, pp. 12592-12596. ISBN 979-8-3503-4485-1. Detail
ESPUNA Fontcuberta Aleix, PRASAD Amrutha, MOTLÍČEK Petr, MADIKERI Srikanth and SCHUEPBACH Christof. Normalising Flows for Speaker and Language Recognition Backend. In: Proceedings of Odyssey 2024: The Speaker and Language Recognition Workshop. Quebec: International Speech Communication Association, 2024, pp. 74-80. Detail
PECHER Branislav, SRBA Ivan and BIELIKOVÁ Mária. On Sensitivity of Learning with Limited Labelled Data to the Effects of Randomness: Impact of Interactions and Systematic Choices. In: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. Miami: Association for Computational Linguistics, 2024, pp. 522-556. ISBN 979-8-8917-6164-3. Detail
STRNADEL Josef, LOJDA Jakub, SMRŽ Pavel and ŠIMEK Václav. On SMC-Based Dependability Analysis in LoLiPoP-IoT Project. In: Steffen, B. (eds) Bridging the Gap Between AI and Reality (AISolA 2024). Lecture Notes in Computer Science, vol. 15217. Limenas Hersonissou: Springer Nature Switzerland AG, 2024, pp. 420-445. ISBN 978-3-031-75434-0. ISSN 0302-9743. Detail
CHLUBNA Tomáš, MILET Tomáš and ZEMČÍK Pavel. Out-of-Focus Artifacts Mitigation and Autofocus Methods for 3D Displays. Visual Informatics, vol. 9, no. 1, 2024, pp. 31-42. ISSN 2468-502X. Detail
ČIEF Matej and KOMPAN Michal. Pessimistic Off-Policy Optimization for Learning to Rank. In: 27TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE. Frontiers in Artificial Intelligence and Applications. Santiago de Compostela, 2024, pp. 1896-1903. ISBN 978-1-64368-548-9. Detail
NOVÁK Jiří, HANÁK Jiří and CHUDÝ Peter. Predictive Control Driven Tactical Maneuvering. In: ICAS Proceedings. Florence: International Council of the Aeronautical Sciences, 2024, pp. 1-12. ISSN 2958-4647. Detail
YUSUF Bolaji, ČERNOCKÝ Jan and SARAÇLAR Murat. Pretraining End-to-End Keyword Search with Automatically Discovered Acoustic Units. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Kos: International Speech Communication Association, 2024, pp. 5068-5072. ISSN 1990-9772. Detail
VILLATORO-TELLO Esaú, MADIKERI Srikanth, SHARMA Bidisha, KHALIL Driss, KUMAR Sashi, NIGMATULINA Iuliia, MOTLÍČEK Petr and GANAPATHIRAJU Aravind. Probability-Aware Word-Confusion-Network-to-Text Alignment Approach for Intent Classification. In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul: IEEE Signal Processing Society, 2024, pp. 12617-12621. ISBN 979-8-3503-4485-1. Detail
PENG Junyi, DELCROIX Marc, OCHIAI Tsubasa, ASHIHARA Takanori, PLCHOT Oldřich, ARAKI Shoko and ČERNOCKÝ Jan. Probing Self-Supervised Learning Models With Target Speech Extraction. In: 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 535-539. ISBN 979-8-3503-7451-3. Detail
KAŠPÁREK Tomáš and CHUDÝ Peter. Pulsar Signal Adaptive Surrogate Modeling. Aerospace, vol. 11, no. 10, 2024, pp. 1-22. ISSN 2226-4310. Detail
BOBÁK Petr, ČMOLÍK Ladislav and ČADÍK Martin. Reinforced Labels: Multi-Agent Deep Reinforcement Learning for Point-Feature Label Placement. IEEE Transactions on Visualization and Computer Graphics, vol. 30, no. 9, 2024, pp. 5908-5922. ISSN 1077-2626. Detail
NOVÁK Jiří, HANÁK Jiří and CHUDÝ Peter. Reliability-Based Control System Optimization in Uncertain Conditions. In: AIAA Aviation Forum and ASCEND, 2024. Las Vegas: American Institute of Aeronautics and Astronautics, 2024, pp. 1-15. ISBN 978-1-62410-716-0. Detail
MOTLÍČEK Petr, DIKICI Erinç, MADIKERI Srikanth, RANGAPPA Pradeep, BACKFRIED Gerhard, ROHDIN Johan A., SCHWARZ Petr, KOVÁČ Marek, MALÝ Květoslav, BOBOŠ Dominik, KLAKOW Dietrich and SERGIDOU Eleni Konstantina et al. ROXSD: The ROXANNE Multimodal and Simulated Dataset for Advancing Criminal Investigations. In: Proceedings of Odyssey 2024: The Speaker and Language Recognition Workshop. Québec City: International Speech Communication Association, 2024, pp. 17-24. Detail
KIŠŠ Martin and HRADIŠ Michal. Self-supervised Pre-training of Text Recognizers. In: Barney Smith, E.H., Liwicki, M., Peng, L. (eds) Document Analysis and Recognition - ICDAR 2024. Lecture Notes in Computer Science, vol. 14807. Athens: Springer Nature Switzerland AG, 2024, pp. 218-235. ISBN 978-3-031-70545-8. Detail
KUBÍK Tibor, ŠILLING Petr and ŠPANĚL Michal. Souhrnná výzkumná zpráva k projektu TESCAN 3DIM - Automatizace zpracování obrazových a 3D dat pomocí hlubokého učení. Brno: TESCAN 3DIM, s.r.o., 2024. Detail
YUSUF Bolaji, BASKAR Karthick Murali, ROSENBERG Andrew and RAMABHADRAN Bhuvana. Speculative Speech Recognition by Audio-Prefixed Low-Rank Adaptation of Language Models. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, pp. 792-796. ISSN 1990-9772. Detail
PRASAD Amrutha, MADIKERI Srikanth, KHALIL Driss, MOTLÍČEK Petr and SCHUEPBACH Christof. Speech and Language Recognition with Low-rank Adaptation of Pretrained Models. In: Proceedings of Interspeech. Kos Island: International Speech Communication Association, 2024, pp. 2825-2829. ISSN 1990-9772. Detail
PEŠÁN Jan, JUŘÍK Vojtěch, RŮŽIČKOVÁ Alexandra, SVOBODA Vojtěch, JANOUŠEK Oto, NĚMCOVÁ Andrea, BOJANOVSKÁ Hana, ALDABAGHOVÁ Jasmína, KYSLÍK Filip, VODIČKOVÁ Kateřina, SODOMOVÁ Adéla, BARTYS Patrik, CHUDÝ Peter and ČERNOCKÝ Jan. Speech production under stress for machine learning: multimodal dataset of 79 cases and 8 signals. Nature Scientific Data, vol. 11, no. 1, 2024, pp. 1-9. ISSN 2052-4463. Detail
ZHANG Lin, WANG Xin, COOPER Erica, DIEZ Sánchez Mireia, LANDINI Federico Nicolás, EVANS Nicholas and YAMAGISHI Junichi. Spoof Diarization: "What Spoofed When" in Partially Spoofed Audio. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, pp. 502-506. ISSN 1990-9772. Detail
WANNER Leo, ČERNOCKÝ Jan, EGOROVA Ekaterina, KLUSCH Matthias and MAVROPOULOS Athanasios et al. Support of Migrant Reception, Integration, and Social Inclusion by Intelligent Technologies. Information, vol. 15, no. 11, 2024, pp. 1-33. ISSN 2078-2489. Detail
HANÁK Jiří, NOVÁK Jiří and CHUDÝ Peter. Tactical Scenario Adaptation for Pilot Training. In: AIAA/IEEE Digital Avionics Systems Conference - Proceedings. San Diego: Institute of Electrical and Electronics Engineers, 2024, pp. 1-7. ISBN 979-8-3503-4961-0. ISSN 2155-7195. Detail
PENG Junyi, DELCROIX Marc, OCHIAI Tsubasa, PLCHOT Oldřich, ARAKI Shoko and ČERNOCKÝ Jan. Target Speech Extraction with Pre-Trained Self-Supervised Learning Models. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 10421-10425. ISBN 979-8-3503-4485-1. Detail
LOJDA Jakub, STRNADEL Josef, ŠIMEK Václav, SMRŽ Pavel, HAYES Michael and POPP Ralf. The LoLiPoP-IoT Project: Long Life Power Platforms for Internet of Things. In: Proceedings - 2024 27th Euromicro Conference on Digital System Design, DSD 2024. Paris: Institute of Electrical and Electronics Engineers, 2024, pp. 604-611. ISBN 979-8-3503-8038-5. Detail
DE Leon Martinez Santiago Jose. Understanding User Behavior in Carousel Recommendation Systems for Click Modeling and Learning to Rank. In: Proceedings of the Seventeenth ACM International Conference on Web Search and Data Mining. New York: Association for Computing Machinery, 2024, pp. 1139-1141. ISBN 979-8-4007-0371-3. Detail
ADAMEC Vilém, BERGLOWIEC Petr, SVATOŇ Václav, SCHWARZ Petr and MÜLLER Luděk. Využití umělé inteligence v systému příjmu tísňových volání v podmínkách České republiky. Časopis SPEKTRUM, vol. 2024, no. 2, pp. 3-8. ISSN 1804-1639. Detail
YUSUF Bolaji and SARAÇLAR Murat. Written Term Detection Improves Spoken Term Detection. IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 32, no. 06, 2024, pp. 3213-3223. ISSN 2329-9290. Detail