Department of Computer Graphics and Multimedia
2025
- HANÁK, J.; NOVÁK, J.; CHUDÝ, P.; BEN-ASHER, J. Cross-Entropy Method for Laser Defense Applications. Journal of Aerospace Information Systems, 2025, vol. 22, no. 1,
p. 53-58. ISSN: 2327-3097. Detail - NOVÁK, J.; CHUDÝ, P.; HANÁK, J. Weight-varying Model Predictive Control for Coupled Cyber-Physical Systems: Aerial Grasping Study. Machine Learning, Optimization, and Data Science. Lecture Notes in Computer Science. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Castiglione della Pescaia: Springer Nature Switzerland AG, 2025.
p. 1-15. ISSN: 0302-9743. Detail - SKOG, K.; KOHOUT, T.; KAŠPÁREK, T.; WOLFMAYR, M. Lossless Hyperspectral Image Compression in Comet Interceptor and Hera Missions with Restricted Bandwith. Remote Sensing, 2025,
p. 1-15. ISSN: 2072-4292. Detail
2024
- ADAMEC, V.; BERGLOWIEC, P.; SVATOŇ, V.; SCHWARZ, P.; MÜLLER, L. Využití umělé inteligence v systému příjmu tísňových volání v podmínkách České republiky. SPEKTRUM, 2024, roč. 2024, č. 2,
s. 3-8. ISSN: 1804-1639. Detail - ALAM, J.; BARAHONA QUIRÓS, S.; BOBOŠ, D.; BURGET, L.; CUMANI, S.; DAHMANE, M.; HAN, J.; HLAVÁČEK, M.; KODOVSKÝ, M.; LANDINI, F.; MOŠNER, L.; PÁLKA, P.; PAVLÍČEK, T.; PENG, J.; PLCHOT, O.; RAJASEKHAR, P.; ROHDIN, J.; SILNOVA, A.; STAFYLAKIS, T.; ZHANG, L. ABC SYSTEM DESCRIPTION FOR NIST SRE 2024. Proceedings of NIST SRE 2024. San Juan: National Institute of Standards and Technology, 2024.
p. 1-9. Detail - BENEŠ, K.; KOCOUR, M.; BURGET, L. Hystoc: Obtaining Word Confidences for Fusion of End-To-End ASR Systems. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024.
p. 11276-11280. ISBN: 979-8-3503-4485-1. Detail - BHATTACHARJEE, M.; NIGMATULINA, I.; PRASAD, A.; RANGAPPA, P.; MADIKERI, S.; MOTLÍČEK, P.; HELMKE, H.; KLEINERT, M. Contextual Biasing Methods for Improving Rare Word Detection in Automatic Speech Recognition. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024.
p. 12652-12656. ISBN: 979-8-3503-4485-1. Detail - BOBÁK, P.; ČMOLÍK, L.; ČADÍK, M. Reinforced Labels: Multi-Agent Deep Reinforcement Learning for Point-Feature Label Placement. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, vol. 30, no. 9,
p. 5908-5922. ISSN: 1077-2626. Detail - ČEGIŇ, J.; PECHER, B.; ŠIMKO, J.; SRBA, I.; BIELIKOVÁ, M. Effects of diversity incentives on sample diversity and downstream model performance in LLM-based text augmentation. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Bangkok: Association for Computational Linguistics, 2024.
p. 13148-13171. ISBN: 979-8-8917-6094-3. Detail - CHLUBNA, T.; MILET, T.; ZEMČÍK, P. Automatic 3D-Display-Friendly Scene Extraction from Video Sequences and Optimal Focusing Distance Identification. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, vol. 83, no. 7,
p. 1-29. ISSN: 1573-7721. Detail - CHLUBNA, T.; MILET, T.; ZEMČÍK, P. How Capturing Camera Trajectory Distortion Affects User Experience on Looking Glass 3D Display. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, vol. 2024, no. 83,
p. 20265-20287. ISSN: 1573-7721. Detail - CHLUBNA, T.; MILET, T.; ZEMČÍK, P. Lightweight All-Focused Light Field Rendering. COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, vol. 244, no. 7,
p. 7-8. ISSN: 1077-3142. Detail - CHLUBNA, T.; ZEMČÍK, P.; MILET, T. Efficient Random-Access GPU Video Decoding for Light-Field Rendering. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, vol. 2024, no. 102,
p. 1-14. ISSN: 1047-3203. Detail - DE LEON MARTINEZ, S. Understanding User Behavior in Carousel Recommendation Systems for Click Modeling and Learning to Rank. Proceedings of the Seventeenth ACM International Conference on Web Search and Data Mining. New York: Association for Computing Machinery, 2024.
p. 1139-1141. ISBN: 979-8-4007-0371-3. Detail - DEKEL, S.; KELLER, Y.; ČADÍK, M. Estimating Extreme 3D Image Rotations using Cascaded Attention. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Seattle: IEEE Computer Society, 2024.
p. 2588-2598. ISBN: 979-8-3503-5301-3. Detail - ESPUNA, A.; PRASAD, A.; MOTLÍČEK, P.; MADIKERI, S.; SCHUEPBACH, C. Normalising Flows for Speaker and Language Recognition Backend. Proceedings of Odyssey 2024: The Speaker and Language Recognition Workshop. Quebec: International Speech Communication Association, 2024.
p. 74-80. Detail - HAN, J.; LANDINI, F.; ROHDIN, J.; DIEZ SÁNCHEZ, M.; BURGET, L.; CAO, Y.; LU, H.; ČERNOCKÝ, J. Diacorrect: Error Correction Back-End for Speaker Diarization. In ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul: IEEE Signal Processing Society, 2024.
p. 11181-11185. ISBN: 979-8-3503-4485-1. Detail - HANÁK, J.; NOVÁK, J.; CHUDÝ, P. Cognitive Modeling Approach for Generating Authentic Tactical Agent Behavior. In AIAA/IEEE Digital Avionics Systems Conference - Proceedings. San Diego: Institute of Electrical and Electronics Engineers, 2024.
p. 1-15. ISBN: 979-8-3503-4961-0. ISSN: 2155-7195. Detail - HANÁK, J.; NOVÁK, J.; CHUDÝ, P. Tactical Scenario Adaptation for Pilot Training. In AIAA/IEEE Digital Avionics Systems Conference - Proceedings. San Diego: Institute of Electrical and Electronics Engineers, 2024.
p. 1-7. ISBN: 979-8-3503-4961-0. ISSN: 2155-7195. Detail - KAPINUS, M.; BERAN, V.; MATERNA, Z.; BAMBUŠEK, D. Augmented Reality Spatial Programming Paradigm Applied to End-User Robot Programming. ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2024, vol. 89, no. 89,
p. 1-13. ISSN: 0736-5845. Detail - KAŠPÁREK, T.; CHUDÝ, P. Pulsar Signal Adaptive Surrogate Modeling. Aerospace, 2024, vol. 11, no. 10,
p. 1-22. ISSN: 2226-4310. Detail - KIŠŠ, M.; HRADIŠ, M. Self-supervised Pre-training of Text Recognizers. In Barney Smith, E.H., Liwicki, M., Peng, L. (eds) Document Analysis and Recognition - ICDAR 2024. Lecture Notes in Computer Science. Atény: Springer Nature Switzerland AG, 2024.
p. 218-235. ISBN: 978-3-031-70545-8. Detail - KLEMENT, D.; DIEZ SÁNCHEZ, M.; LANDINI, F.; BURGET, L.; SILNOVA, A.; DELCROIX, M.; TAWARA, N. Discriminative Training of VBx Diarization. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024.
p. 11871-11875. ISBN: 979-8-3503-4485-1. Detail - KUBÍK, T.; ŠILLING, P.; ŠPANĚL, M. Souhrnná výzkumná zpráva k projektu TESCAN 3DIM - Automatizace zpracování obrazových a 3D dat pomocí hlubokého učení. Brno: TESCAN 3DIM, s.r.o., 2024.
s. 0-0. Detail - KUBÍK, T.; ŠPANĚL, M. LMVSegRNN and Poseidon3D: Addressing Challenging Teeth Segmentation Cases in 3D Dental Surface Orthodontic Scans. Bioengineering, 2024, vol. 11, no. 10,
p. 1-18. ISSN: 2306-5354. Detail - KUMAR, S.; MADIKERI, S.; NIGMATULINA, I.; VILLATORO-TELLO, E.; MOTLÍČEK, P.; PANDIA, K.; DUBAGUNTA, P.; GANAPATHIRAJU, A. Multitask Speech Recognition and Speaker Change Detection for Unknown Number of Speakers. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul: IEEE Signal Processing Society, 2024.
p. 12592-12596. ISBN: 979-8-3503-4485-1. Detail - KUNEŠOVÁ, M.; ZAJÍC, Z.; ŠMÍDL, L.; KARAFIÁT, M. Comparison of wav2vec 2.0 models on three speech processing tasks. International Journal of Speech Technology, 2024, vol. 27, no. 4,
p. 847-859. ISSN: 1572-8110. Detail - LANDINI, F.; DIEZ SÁNCHEZ, M.; STAFYLAKIS, T.; BURGET, L. DiaPer: End-to-End Neural Diarization With Perceiver-Based Attractors. IEEE Transactions on Audio, Speech, and Language Processing, 2024, vol. 32, no. 7,
p. 3450-3465. ISSN: 1558-7916. Detail - MACIEJEWSKI, M.; KLEMENT, D.; HUANG, R.; WIESNER, M.; KHUDANPUR, S. Evaluating the Santa Barbara Corpus: Challenges of the Breadth of Conversational Spoken Language. In Proceedings of Interspeech 2024. Proceedings of Interspeech. Kos: International Speech Communication Association, 2024.
p. 2155-2160. ISSN: 1990-9772. Detail - MOŠNER, L.; SERIZEL, R.; BURGET, L.; PLCHOT, O.; VINCENT, E.; PENG, J.; ČERNOCKÝ, J. Multi-Channel Extension of Pre-trained Models for Speaker Verification. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Kos: International Speech Communication Association, 2024.
p. 2135-2139. ISSN: 1990-9772. Detail - MOTLÍČEK, P.; DIKICI, E.; MADIKERI, S.; RANGAPPA, P.; BACKFRIED, G.; ROHDIN, J.; SCHWARZ, P.; KOVÁČ, M.; MALÝ, K.; BOBOŠ, D.; KLAKOW, D.; SERGIDOU, E. ROXSD: The ROXANNE Multimodal and Simulated Dataset for Advancing Criminal Investigations. Proceedings of Odyssey 2024: The Speaker and Language Recognition Workshop. Québec City: International Speech Communication Association, 2024.
p. 17-24. Detail - NOVÁK, J.; CHUDÝ, P. Dynamic Soaring in Uncertain Wind Conditions: Polynomial Chaos Expansion Approach. In Machine Learning, Optimization, and Data Science. Lecture Notes in Computer Science. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Grasmere: Springer Nature Switzerland AG, 2024.
p. 104-115. ISBN: 978-3-031-53968-8. ISSN: 0302-9743. Detail - NOVÁK, J.; CHUDÝ, P.; HANÁK, J. Model Predictive Control Driven Aerial Grasping with Soft Operational Constraints. In ICAS Proceedings. ICAS Proceedings. Florence: International Council of the Aeronautical Sciences, 2024.
p. 1-15. ISSN: 2958-4647. Detail - NOVÁK, J.; HANÁK, J.; CHUDÝ, P. Hybrid Modeling Approach for Optimization Based Control of Multirotor Unmanned Aerial Vehicles. In ICAS Proceedings. ICAS Proceedings. Florence: International Council of the Aeronautical Sciences, 2024.
p. 1-10. ISSN: 2958-4647. Detail - NOVÁK, J.; HANÁK, J.; CHUDÝ, P. Predictive Control Driven Tactical Maneuvering. In ICAS Proceedings. ICAS Proceedings. Florence: International Council of the Aeronautical Sciences, 2024.
p. 1-12. ISSN: 2958-4647. Detail - NOVÁK, J.; HANÁK, J.; CHUDÝ, P. Reliability-Based Control System Optimization in Uncertain Conditions. In AIAA Aviation Forum and ASCEND, 2024. Las Vegas: American Institute of Aeronautics and Astronautics, 2024.
p. 1-15. ISBN: 978-1-62410-716-0. Detail - PECHER, B.; ČEGIŇ, J.; BELANEC, R.; SRBA, I.; ŠIMKO, J.; BIELIKOVÁ, M. Fighting Randomness With Randomness: Mitigating Optimisation Instability of Fine-Tuning Using Ensemble and Noise Regularisation. Findings of the Association for Computational Linguistics: EMNLP 2024. Miami: Association for Computational Linguistics, 2024.
p. 11005-11044. ISBN: 979-8-8917-6168-1. Detail - PECHER, B.; SRBA, I.; BIELIKOVÁ, M. On Sensitivity of Learning with Limited Labelled Data to the Effects of Randomness: Impact of Interactions and Systematic Choices. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. Miami: Association for Computational Linguistics, 2024.
p. 522-556. ISBN: 979-8-8917-6164-3. Detail - PECHER, B.; SRBA, I.; BIELIKOVÁ, M. A Survey on Stability of Learning with Limited Labelled Data and its Sensitivity to the Effects of Randomness. ACM COMPUTING SURVEYS, 2024, vol. 57, no. 1,
p. 1-40. ISSN: 0360-0300. Detail - PENG, J.; DELCROIX, M.; OCHIAI, T.; ASHIHARA, T.; PLCHOT, O.; ARAKI, S.; ČERNOCKÝ, J. Probing Self-Supervised Learning Models With Target Speech Extraction. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024.
p. 535-539. ISBN: 979-8-3503-7451-3. Detail - PENG, J.; DELCROIX, M.; OCHIAI, T.; PLCHOT, O.; ARAKI, S.; ČERNOCKÝ, J. Target Speech Extraction with Pre-Trained Self-Supervised Learning Models. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024.
p. 10421-10425. ISBN: 979-8-3503-4485-1. Detail - PEŠÁN, J.; JUŘÍK, V.; KARAFIÁT, M.; ČERNOCKÝ, J. BESST Dataset: A Multimodal Resource for Speech-based Stress Detection and Analysis. In Proceedings of Interspeech 2024. Proceedings of Interspeech. Kos: International Speech Communication Association, 2024.
p. 1355-1359. ISSN: 1990-9772. Detail - PEŠÁN, J.; JUŘÍK, V.; RŮŽIČKOVÁ, A.; SVOBODA, V.; JANOUŠEK, O.; NĚMCOVÁ, A.; BOJANOVSKÁ, H.; ALDABAGHOVÁ, J.; KYSLÍK, F.; VODIČKOVÁ, K.; SODOMOVÁ, A.; BARTYS, P.; CHUDÝ, P.; ČERNOCKÝ, J. Speech production under stress for machine learning: multimodal dataset of 79 cases and 8 signals. Scientific data, 2024, vol. 11, no. 1,
p. 1-9. ISSN: 2052-4463. Detail - POLOK, A.; KLEMENT, D.; HAN, J.; SEDLÁČEK, Š.; YUSUF, B.; MACIEJEWSKI, M.; WIESNER, M.; BURGET, L. BUT/JHU System Description for CHiME-8 NOTSOFAR-1 Challenge. Proceedings of CHiME 2024 Workshop. Kos Island: International Speech Communication Association, 2024.
p. 18-22. Detail - PRASAD, A.; CAROFILIS, A.; VANDERREYDT, G.; KHALIL, D.; MADIKERI, S.; MOTLÍČEK, P.; SCHUEPBACH, C. Fine-Tuning Self-Supervised Models for Language Identification Using Orthonormal Constraint. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024.
p. 11921-11925. ISBN: 979-8-3503-4485-1. Detail - PRASAD, A.; MADIKERI, S.; KHALIL, D.; MOTLÍČEK, P.; SCHUEPBACH, C. Speech and Language Recognition with Low-rank Adaptation of Pretrained Models. In Proceedings of Interspeech. Proceedings of Interspeech. Kos Island: International Speech Communication Association, 2024.
p. 2825-2829. ISSN: 1990-9772. Detail - RANGAPPA, P.; MUSCAT, A.; SANCHEZ-LARA, A.; MOTLÍČEK, P.; ANTONOPOULOU, M.; FOURFOURIS, I.; SKARLATOS, A.; AVGERINOS, N.; TSANGARIS, M.; KOSTKA, K. Detecting Criminal Networks via Non-Content Communication Data Analysis Techniques from the TRACY Project. Proceedings of the15th EAI International Conference on Digital Forensics & Cyber Crime (EAI-ICDF2C24). Dubrovnik: 2024.
p. 1-15. Detail - ROHDIN, J.; ZHANG, L.; PLCHOT, O.; STANĚK, V.; MIHOLA, D.; PENG, J.; STAFYLAKIS, T.; BEVERAKI, D.; SILNOVA, A.; BRUKNER, J.; BURGET, L. BUT systems and analyses for the ASVspoof 5 Challenge. Proceedings of ASV spoof 2024 Workshop. Kos Island: International Speech Communication Association, 2024.
p. 24-31. Detail - STAFYLAKIS, T.; SILNOVA, A.; ROHDIN, J.; PLCHOT, O.; BURGET, L. Challenging margin-based speaker embedding extractors by using the variational information bottleneck. In Proceedings of Interspeech 2024. Proceedings of Interspeech. Kos: International Speech Communication Association, 2024.
p. 3220-3224. ISSN: 1990-9772. Detail - VILLATORO-TELLO, E.; MADIKERI, S.; SHARMA, B.; KHALIL, D.; KUMAR, S.; NIGMATULINA, I.; MOTLÍČEK, P.; GANAPATHIRAJU, A. Probability-Aware Word-Confusion-Network-to-Text Alignment Approach for Intent Classification. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul: IEEE Signal Processing Society, 2024.
p. 12617-12621. ISBN: 979-8-3503-4485-1. Detail - VINCENT, J.; KOHOUT, T.; KAŠPÁREK, T. Macroscale Roughness Reveals the Complex History of Asteroids Didymos and Dimorphos. The Planetary Science Journal, 2024, vol. 5, no. 10,
p. 1-29. ISSN: 2632-3338. Detail - VYKOPAL, I.; PIKULIAK, M.; SRBA, I.; MÓRO, R.; MACKO, D.; BIELIKOVÁ, M. Disinformation Capabilities of Large Language Models. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Bangkok: Association for Computational Linguistics, 2024.
p. 14830-14847. ISBN: 979-8-8917-6094-3. Detail - WANG, S.; CHEN, Z.; HAN, B.; WANG, H.; XIANG, X.; ROHDIN, J.; SILNOVA, A.; QIAN, Y.; LI, H. Advancing speaker embedding learning: Wespeaker toolkit for research and production. Speech Communication, 2024, vol. 162, no. 103104,
p. 1-12. ISSN: 0167-6393. Detail - WANNER, L.; ČERNOCKÝ, J.; EGOROVA, E.; KLUSCH, M.; MAVROPOULOS, A. Support of Migrant Reception, Integration, and Social Inclusion by Intelligent Technologies. Information (Switzerland), 2024, vol. 15, no. 11,
p. 1-33. ISSN: 2078-2489. Detail - YUSUF, B.; BASKAR, M.; ROSENBERG, A.; RAMABHADRAN, B. Speculative Speech Recognition by Audio-Prefixed Low-Rank Adaptation of Language Models. In Proceedings of Interspeech 2024. Proceedings of Interspeech. Kos: International Speech Communication Association, 2024.
p. 792-796. ISSN: 1990-9772. Detail - YUSUF, B.; ČERNOCKÝ, J.; SARAÇLAR, M. Pretraining End-to-End Keyword Search with Automatically Discovered Acoustic Units. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Kos: International Speech Communication Association, 2024.
p. 5068-5072. ISSN: 1990-9772. Detail - YUSUF, B.; SARAÇLAR, M. Written Term Detection Improves Spoken Term Detection. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, 2024, vol. 32, no. 06,
p. 3213-3223. ISSN: 2329-9290. Detail - ZHANG, L.; STAFYLAKIS, T.; LANDINI, F.; DIEZ SÁNCHEZ, M.; SILNOVA, A.; BURGET, L. Do End-to-End Neural Diarization Attractors Need to Encode Speaker Characteristic Information?. Proceedings of Odyssey 2024: The Speaker and Language Recognition Workshop. Québec City: International Speech Communication Association, 2024.
p. 123-130. Detail - ZHANG, L.; WANG, X.; COOPER, E.; DIEZ SÁNCHEZ, M.; LANDINI, F.; EVANS, N.; YAMAGISHI, J. Spoof Diarization: "What Spoofed When" in Partially Spoofed Audio. In Proceedings of Interspeech 2024. Proceedings of Interspeech. Kos: International Speech Communication Association, 2024.
p. 502-506. ISSN: 1990-9772. Detail