Speech Data Mining Research Group BUT Speech@FIT

2025

HAN Jiangyu, LANDINI Federico Nicolás, ROHDIN Johan A., SILNOVA Anna, DIEZ Sánchez Mireia and BURGET Lukáš. Leveraging Self-Supervised Learning for Speaker Diarization. In: Proceedings of ICASSP 2025. Hyderabad: IEEE Biometric Council, 2025, pp. 1-5. ISBN 979-8-3503-6874-1.
Detail

PENG Junyi, MOšNER Ladislav, ZHANG Lin, PLCHOT Oldřich, STAFYLAKIS Themos, BURGET Lukáš and ČERNOCKý Jan. CA-MHFA: A Context-Aware Multi-Head Factorized Attentive Pooling for SSL-Based Speaker Verification. In: Proceedings of ICASSP 2025. Hyderabad: IEEE Biometric Council, 2025, pp. 1-5. ISBN 979-8-3503-6874-1.
Detail

HORI Takaaki, KOCOUR Martin, HAIDER Adnan, MCDERMOTT Erik and ZHUANG Xiaodan. Delayed Fusion: Integrating Large Language Models into First-Pass Decoding in End-to-end Speech Recognition. In: Proceedings of ICASSP 2025. Hyderabad: IEEE Biometric Council, 2025, pp. 1-5. ISBN 979-8-3503-6874-1.
Detail

POLOK Alexander, KLEMENT Dominik, KOCOUR Martin, HAN Jiangyu, LANDINI Federico Nicolás, YUSUF Bolaji, WIESNER Matthew, KHUDANPUR Sanjeev, ČERNOCKý Jan and BURGET Lukáš. DiCoW: Diarization-Conditioned Whisper for Target Speaker Automatic Speech Recognition. Computer Speech and Language, 2025, pp. 1-39. ISSN 0885-2308.
Detail

2024

BENEš Karel, KOCOUR Martin and BURGET Lukáš. Hystoc: Obtaining Word Confidences for Fusion of End-To-End ASR Systems. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 11276-11280. ISBN 979-8-3503-4485-1.
Detail

HAN Jiangyu, LANDINI Federico Nicolás, ROHDIN Johan A., DIEZ Sánchez Mireia, BURGET Lukáš, CAO Yuhang, LU Heng and ČERNOCKý Jan. Diacorrect: Error Correction Back-End for Speaker Diarization. In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul: IEEE Signal Processing Society, 2024, pp. 11181-11185. ISBN 979-8-3503-4485-1.
Detail

PENG Junyi, DELCROIX Marc, OCHIAI Tsubasa, PLCHOT Oldřich, ARAKI Shoko and ČERNOCKý Jan. Target Speech Extraction with Pre-Trained Self-Supervised Learning Models. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 10421-10425. ISBN 979-8-3503-4485-1.
Detail

PENG Junyi, DELCROIX Marc, OCHIAI Tsubasa, ASHIHARA Takanori, PLCHOT Oldřich, ARAKI Shoko and ČERNOCKý Jan. Probing Self-Supervised Learning Models With Target Speech Extraction. In: 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 535-539. ISBN 979-8-3503-7451-3.
Detail

KLEMENT Dominik, DIEZ Sánchez Mireia, LANDINI Federico Nicolás, BURGET Lukáš, SILNOVA Anna, DELCROIX Marc and TAWARA Naohiro. Discriminative Training of VBx Diarization. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 11871-11875. ISBN 979-8-3503-4485-1.
Detail

LANDINI Federico Nicolás, DIEZ Sánchez Mireia, STAFYLAKIS Themos and BURGET Lukáš. DiaPer: End-to-End Neural Diarization With Perceiver-Based Attractors. IEEE Transactions on Audio, Speech, and Language Processing, vol. 32, no. 7, 2024, pp. 3450-3465. ISSN 1558-7916.
Detail

PRASAD Amrutha, CAROFILIS Andrés, VANDERREYDT Geoffroy, KHALIL Driss, MADIKERI Srikanth, MOTLíčEK Petr and SCHUEPBACH Christof. Fine-Tuning Self-Supervised Models for Language Identification Using Orthonormal Constraint. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 11921-11925. ISBN 979-8-3503-4485-1.
Detail

BHATTACHARJEE Mrinmoy, NIGMATULINA Iuliia, PRASAD Amrutha, RANGAPPA Pradeep, MADIKERI Srikanth, MOTLíčEK Petr, HELMKE Hartmut and KLEINERT Matthias. Contextual Biasing Methods for Improving Rare Word Detection in Automatic Speech Recognition. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 12652-12656. ISBN 979-8-3503-4485-1.
Detail

WANNER Leo, ČERNOCKý Jan, EGOROVA Ekaterina, KLUSCH Matthias and MAVROPOULOS Athanasios et al. Support of Migrant Reception, Integration, and Social Inclusion by Intelligent Technologies. Information, vol. 15, no. 11, 2024, pp. 1-33. ISSN 2078-2489.
Detail

ESPUNA Fontcuberta Aleix, PRASAD Amrutha, MOTLíčEK Petr, MADIKERI Srikanth and SCHUEPBACH Christof. Normalising Flows for Speaker and Language Recognition Backend. In: Proceedings of Odyssey 2024: The Speaker and Language Recognition Workshop. Quebec: International Speech Communication Association, 2024, pp. 74-80.
Detail

PRASAD Amrutha, MADIKERI Srikanth, KHALIL Driss, MOTLíčEK Petr and SCHUEPBACH Christof. Speech and Language Recognition with Low-rank Adaptation of Pretrained Models. In: Proceedings of Interspeech. Kos Island: International Speech Communication Association, 2024, pp. 2825-2829. ISSN 1990-9772.
Detail

YUSUF Bolaji and SARAçLAR Murat. Written Term Detection Improves Spoken Term Detection. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, vol. 32, no. 06, 2024, pp. 3213-3223. ISSN 2329-9290.
Detail

ZHANG Lin, STAFYLAKIS Themos, LANDINI Federico Nicolás, DIEZ Sánchez Mireia, SILNOVA Anna and BURGET Lukáš. Do End-to-End Neural Diarization Attractors Need to Encode Speaker Characteristic Information?. In: Proceedings of Odyssey 2024: The Speaker and Language Recognition Workshop. Québec City: International Speech Communication Association, 2024, pp. 123-130.
Detail

MOTLíčEK Petr, DIKICI Erinç, MADIKERI Srikanth, RANGAPPA Pradeep, BACKFRIED Gerhard, ROHDIN Johan A., SCHWARZ Petr, KOVáč Marek, MALý Květoslav, BOBOš Dominik, KLAKOW Dietrich and SERGIDOU Eleni Konstantina et al. ROXSD: The ROXANNE Multimodal and Simulated Dataset for Advancing Criminal Investigations. In: Proceedings of Odyssey 2024: The Speaker and Language Recognition Workshop. Québec City: International Speech Communication Association, 2024, pp. 17-24.
Detail

PEšáN Jan, JUříK Vojtěch, RůžIčKOVá Alexandra, SVOBODA Vojtěch, JANOUšEK Oto, NěMCOVá Andrea, BOJANOVSKá Hana, ALDABAGHOVá Jasmína, KYSLíK Filip, VODIčKOVá Kateřina, SODOMOVá Adéla, BARTYS Patrik, CHUDý Peter and ČERNOCKý Jan. Speech production under stress for machine learning: multimodal dataset of 79 cases and 8 signals. Nature Scientific Data, vol. 11, no. 1, 2024, pp. 1-9. ISSN 2052-4463.
Detail

KUNEšOVá Marie, ZAJíC Zbyněk, ŠMíDL Luboš and KARAFIáT Martin. Comparison of wav2vec 2.0 models on three speech processing tasks. International Journal of Speech Technology, vol. 27, no. 4, 2024, pp. 847-859. ISSN 1572-8110.
Detail

STAFYLAKIS Themos, SILNOVA Anna, ROHDIN Johan A., PLCHOT Oldřich and BURGET Lukáš. Challenging margin-based speaker embedding extractors by using the variational information bottleneck. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, pp. 3220-3224. ISSN 1990-9772.
Detail

YUSUF Bolaji, ČERNOCKý Jan and SARAçLAR Murat. Pretraining End-to-End Keyword Search with Automatically Discovered Acoustic Units. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Kos: International Speech Communication Association, 2024, pp. 5068-5072. ISSN 1990-9772.
Detail

YUSUF Bolaji, BASKAR Karthick Murali, ROSENBERG Andrew and RAMABHADRAN Bhuvana. Speculative Speech Recognition by Audio-Prefixed Low-Rank Adaptation of Language Models. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, pp. 792-796. ISSN 1990-9772.
Detail

MOšNER Ladislav, SERIZEL Romain, BURGET Lukáš, PLCHOT Oldřich, VINCENT Emmanuel, PENG Junyi and ČERNOCKý Jan. Multi-Channel Extension of Pre-trained Models for Speaker Verification. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Kos: International Speech Communication Association, 2024, pp. 2135-2139. ISSN 1990-9772.
Detail

ZHANG Lin, WANG Xin, COOPER Erica, DIEZ Sánchez Mireia, LANDINI Federico Nicolás, EVANS Nicholas and YAMAGISHI Junichi. Spoof Diarization: "What Spoofed When" in Partially Spoofed Audio. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, pp. 502-506. ISSN 1990-9772.
Detail

PEšáN Jan, JUříK Vojtěch, KARAFIáT Martin and ČERNOCKý Jan. BESST Dataset: A Multimodal Resource for Speech-based Stress Detection and Analysis. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, pp. 1355-1359. ISSN 1990-9772.
Detail

MACIEJEWSKI Matthew, KLEMENT Dominik, HUANG Ruizhe, WIESNER Matthew and KHUDANPUR Sanjeev. Evaluating the Santa Barbara Corpus: Challenges of the Breadth of Conversational Spoken Language. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, pp. 2155-2160. ISSN 1990-9772.
Detail

WANG Shuai, CHEN Zhengyang, HAN Bing, WANG Hongji, XIANG Xu, ROHDIN Johan A., SILNOVA Anna, QIAN Yanmin and LI Haizhou et al. Advancing speaker embedding learning: Wespeaker toolkit for research and production. Speech Communication, vol. 162, no. 103104, 2024, pp. 1-12. ISSN 0167-6393.
Detail

POLOK Alexander, KLEMENT Dominik, HAN Jiangyu, SEDLáčEK Šimon, YUSUF Bolaji, MACIEJEWSKI Matthew, WIESNER Matthew and BURGET Lukáš. BUT/JHU System Description for CHiME-8 NOTSOFAR-1 Challenge. In: Proceedings of CHiME 2024 Workshop. Kos Island: International Speech Communication Association, 2024, pp. 18-22.
Detail

ROHDIN Johan A., ZHANG Lin, PLCHOT Oldřich, STANěK Vojtěch, MIHOLA David, PENG Junyi, STAFYLAKIS Themos, BEVERAKI Dmitriy, SILNOVA Anna, BRUKNER Jan and BURGET Lukáš. BUT systems and analyses for the ASVspoof 5 Challenge. In: Proceedings of ASV spoof 2024 Workshop. Kos Island: International Speech Communication Association, 2024, pp. 24-31.
Detail

ALAM Jahangir, BARAHONA Quirós Sara, BOBOš Dominik, BURGET Lukáš, CUMANI Sandro, DAHMANE Mohamed, HAN Jiangyu, HLAVáčEK Miroslav, KODOVSKý Martin, LANDINI Federico Nicolás, MOšNER Ladislav, PáLKA Petr, PAVLíčEK Tomáš, PENG Junyi, PLCHOT Oldřich, RAJASEKHAR Gnana Praveen, ROHDIN Johan A., SILNOVA Anna, STAFYLAKIS Themos and ZHANG Lin. ABC SYSTEM DESCRIPTION FOR NIST SRE 2024. In: Proceedings of NIST SRE 2024. San Juan: National Institute of Standards and Technology, 2024, pp. 1-9.
Detail

BURDISSO Sergio, RAMIREZ Reyes Ernesto Antonio, VILLATORO-TELLO Esaú, SáNCHEZ-VEGA Fernando, LóPEZ-MONROY A. Pastor and MOTLíčEK Petr. DAIC-WOZ: On the Validity of Using the Therapist's prompts in Automatic Depression Detection from Clinical Interviews. In: Proceedings of the 6th Clinical Natural Language Processing Workshop. Association for Computational Linguistics. Mexico City: Association for Computational Linguistics, 2024, pp. 82-90.
Detail

RANGAPPA Pradeep, MUSCAT Amanda, SANCHEZ-LARA Alejandra, MOTLíčEK Petr, ANTONOPOULOU Michaela, FOURFOURIS Ioannis, SKARLATOS Antonios, AVGERINOS Nikos, TSANGARIS Manolis and KOSTKA Kasia. Detecting Criminal Networks via Non-Content Communication Data Analysis Techniques from the TRACY Project. In: Proceedings of the15th EAI International Conference on Digital Forensics & Cyber Crime (EAI-ICDF2C24). Dubrovnik, 2024, pp. 1-15.
Detail

KUMAR Sashi, MADIKERI Srikanth, NIGMATULINA Iuliia, VILLATORO-TELLO Esaú, MOTLíčEK Petr, PANDIA Karthick, DUBAGUNTA S. Pavankumar and GANAPATHIRAJU Aravind. Multitask Speech Recognition and Speaker Change Detection for Unknown Number of Speakers. In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul: IEEE Signal Processing Society, 2024, pp. 12592-12596. ISBN 979-8-3503-4485-1.
Detail

VILLATORO-TELLO Esaú, MADIKERI Srikanth, SHARMA Bidisha, KHALIL Driss, KUMAR Sashi, NIGMATULINA Iuliia, MOTLíčEK Petr and GANAPATHIRAJU Aravind. Probability-Aware Word-Confusion-Network-to-Text Alignment Approach for Intent Classification. In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Seoul: IEEE Signal Processing Society, 2024, pp. 12617-12621. ISBN 979-8-3503-4485-1.
Detail

ADAMEC Vilém, BERGLOWIEC Petr, SVATOň Václav, SCHWARZ Petr and MüLLER Luděk. Využití umělé inteligence v systému příjmu tísňových volání v podmínkách České republiky. Časopis SPEKTRUM, vol. 2024, no. 2, pp. 3-8. ISSN 1804-1639.
Detail

2023

PENG Junyi, PLCHOT Oldřich, STAFYLAKIS Themos, MOšNER Ladislav, BURGET Lukáš and ČERNOCKý Jan. An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification. In: 2022 IEEE Spoken Language Technology Workshop, SLT 2022 - Proceedings. Doha: IEEE Signal Processing Society, 2023, pp. 555-562. ISBN 978-1-6654-7189-3.
Detail

STAFYLAKIS Themos, MOšNER Ladislav, KAKOUROS Sofoklis, PLCHOT Oldřich, BURGET Lukáš and ČERNOCKý Jan. Extracting speaker and emotion information from self-supervised speech models via channel-wise correlations. In: 2022 IEEE Spoken Language Technology Workshop, SLT 2022 - Proceedings. Doha: IEEE Signal Processing Society, 2023, pp. 1136-1143. ISBN 978-1-6654-7189-3.
Detail

SILNOVA Anna, SLAVíčEK Josef, MOšNER Ladislav, KLčO Michal, PLCHOT Oldřich, MATěJKA Pavel, PENG Junyi, STAFYLAKIS Themos and BURGET Lukáš. ABC System Description for NIST LRE 2022. In: Proceedings of NIST LRE 2022 Workshop. Washington DC: National Institute of Standards and Technology, 2023, pp. 1-5.
Detail

ZULUAGA-GOMEZ Juan, SARFJOO Seyyed Saeed, PRASAD Amrutha, NIGMATULINA Iuliia, MOTLíčEK Petr, ONDřEJ Karel, OHNEISER Oliver and HELMKE Hartmut. BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications. In: IEEE Spoken Language Technology Workshop, SLT 2022 - Proceedings. Doha: IEEE Signal Processing Society, 2023, pp. 633-640. ISBN 978-1-6654-7189-3.
Detail

ZULUAGA-GOMEZ Juan, PRASAD Amrutha, NIGMATULINA Iuliia, SARFJOO Seyyed Saeed, MOTLíčEK Petr, KLEINERT Matthias, HELMKE Hartmut, OHNEISER Oliver and ZHAN Qingran. How Does Pre-Trained Wav2Vec 2.0 Perform on Domain-Shifted ASR? an Extensive Benchmark on Air Traffic Control Communications. In: IEEE Spoken Language Technology Workshop, SLT 2022 - Proceedings. Doha: IEEE Signal Processing Society, 2023, pp. 205-212. ISBN 978-1-6654-7189-3.
Detail

YUSUF Bolaji, GOURAV Aditya, GANDHE Ankur and BULYKO Ivan. On-the-Fly Text Retrieval for end-to-end ASR Adaptation. In: Proceedings of ICASSP 2023. Rhodes Island: IEEE Signal Processing Society, 2023, pp. 1-5. ISBN 978-1-7281-6327-7.
Detail

LANDINI Federico Nicolás, DIEZ Sánchez Mireia, LOZANO Díez Alicia and BURGET Lukáš. Multi-Speaker and Wide-Band Simulated Conversations as Training Data for End-to-End Neural Diarization. In: Proceedings of ICASSP 2023. Rhodes Island: IEEE Signal Processing Society, 2023, pp. 1-5. ISBN 978-1-7281-6327-7.
Detail

SILNOVA Anna, BRUMMER Johan Nikolaas Langenhoven, SWART Albert du Preez and BURGET Lukáš. Toroidal Probabilistic Spherical Discriminant Analysis. In: Proceedings of ICASSP 2023. Rhodes Island: IEEE Signal Processing Society, 2023, pp. 1-5. ISBN 978-1-7281-6327-7.
Detail

PENG Junyi, STAFYLAKIS Themos, GU Rongzhi, PLCHOT Oldřich, MOšNER Ladislav, BURGET Lukáš and ČERNOCKý Jan. Parameter-Efficient Transfer Learning of Pre-Trained Transformer Models for Speaker Verification Using Adapters. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Rhodes Island: IEEE Signal Processing Society, 2023, pp. 1-5. ISBN 978-1-7281-6327-7.
Detail

KAKOUROS Sofoklis, STAFYLAKIS Themos, MOšNER Ladislav and BURGET Lukáš. Speech-Based Emotion Recognition with Self-Supervised Models Using Attentive Channel-Wise Correlations and Label Smoothing. In: Proceedings of ICASSP 2023. Rhodes Island: IEEE Signal Processing Society, 2023, pp. 1-5. ISBN 978-1-7281-6327-7.
Detail

KESIRAJU Santosh, BENEš Karel, TIKHONOV Maksim and ČERNOCKý Jan. BUT Systems for IWSLT 2023 Marathi - Hindi Low Resource Speech Translation Task. In: 20th International Conference on Spoken Language Translation, IWSLT 2023 - Proceedings of the Conference. Toronto (in-person and online): Association for Computational Linguistics, 2023, pp. 227-234. ISBN 978-1-959429-84-5.
Detail

YUSUF Bolaji, ČERNOCKý Jan and SARAçLAR Murat. End-to-End Open Vocabulary Keyword Search With Multilingual Neural Representations. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, vol. 31, no. 08, 2023, pp. 3070-3080. ISSN 2329-9290.
Detail

YU Dong, GONG Yifan, PICHENY Michael Alan, RAMABHADRAN Bhuvana, HAKKANI-TüR Dilek, PRASAD Rohit, ZEN Heiga, SKOGLUND Jan, ČERNOCKý Jan, BURGET Lukáš and MOHAMED Abdelrahman. Twenty-Five Years of Evolution in Speech and Language Processing. IEEE Signal Processing Magazine, vol. 40, no. 5, 2023, pp. 27-39. ISSN 1558-0792.
Detail

ŽMOLíKOVá Kateřina, DELCROIX Marc, OCHIAI Tsubasa, ČERNOCKý Jan, KINOSHITA Keisuke and YU Dong. Neural Target Speech Extraction: An overview. IEEE Signal Processing Magazine, vol. 40, no. 3, 2023, pp. 8-29. ISSN 1558-0792.
Detail

SKOWRON Marcin, BACKFRIED Gerhard, NAVAS Eva, BERZINš Aivars, VAN Den Bogaert Joachim, DE Jong Franciska, DEMARCO Andrea, POLáK Peter, KOVáč Marek, POLáK Peter, ROHDIN Johan A., ROSNER Michael, SANCHEZ Jon, SARATXAGA Ibon and SCHWARZ Petr. Deep Dive Speech Technology. European Language Equality. Cham: Springer Nature Switzerland AG, 2023, pp. 289-312. ISBN 978-3-031-28819-7.
Detail

MOšNER Ladislav, PLCHOT Oldřich, PENG Junyi, BURGET Lukáš and ČERNOCKý Jan. Multi-Channel Speech Separation with Cross-Attention and Beamforming. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Dublin: International Speech Communication Association, 2023, pp. 1693-1697. ISSN 1990-9772.
Detail

KESIRAJU Santosh, SARVAš Marek, PAVLíčEK Tomáš, MACAIRE Cécile and CIUBA Alejandro. Strategies for Improving Low Resource Speech to Text Translation Relying on Pre-trained ASR Models. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Dublin: International Speech Communication Association, 2023, pp. 2148-2152. ISSN 1990-9772.
Detail

DELCROIX Marc, TAWARA Naohiro, DIEZ Sánchez Mireia, LANDINI Federico Nicolás, SILNOVA Anna, OGAWA Atsunori, NAKATANI Tomohiro, BURGET Lukáš and ARAKI Shoko. Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Dublin: International Speech Communication Association, 2023, pp. 3477-3481. ISSN 1990-9772.
Detail

MATěJKA Pavel, SILNOVA Anna, SLAVíčEK Josef, MOšNER Ladislav, PLCHOT Oldřich, KLčO Michal, PENG Junyi, STAFYLAKIS Themos and BURGET Lukáš. Description and Analysis of ABC Submission to NIST LRE 2022. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Dublin: International Speech Communication Association, 2023, pp. 511-515. ISSN 1990-9772.
Detail

PENG Junyi, PLCHOT Oldřich, STAFYLAKIS Themos, MOšNER Ladislav, BURGET Lukáš and ČERNOCKý Jan. Improving Speaker Verification with Self-Pretrained Transformer Models. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Dublin: International Speech Communication Association, 2023, pp. 5361-5365. ISSN 1990-9772.
Detail

ZULUAGA-GOMEZ Juan, NIGMATULINA Iuliia, PRASAD Amrutha, MOTLíčEK Petr, KHALIL Driss, MADIKERI Srikanth, TART Allan, SZőKE Igor, LENDERS Vincent, RIGAULT Mickael and CHOUKRI Khalid. Lessons Learned in Transcribing 5000 h of Air Traffic Control Communications for Robust Automatic Speech Understanding. Aerospace, vol. 2023, no. 10, pp. 1-33. ISSN 2226-4310.
Detail

ŘIHáčEK Tomáš, NEHYBA Jan, ČEVELíčEK Michal, POLOK Alexander, MATěJKA Pavel and DOLEžAL Petr. DeePsy: Představení online nástroje pro zpětnou vazbu v psychoterapii. Psychoterapie. Masarykova univerzita AN FL, vol. 17, no. 1, 2023, pp. 1-11. ISSN 1802-3983.
Detail

KHALIL Driss, PRASAD Amrutha, MOTLíčEK Petr, ZULUAGA-GOMEZ Juan, NIGMATULINA Iuliia, MADIKERI Srikanth and SCHUEPBACH Christof. An Automatic Speaker Clustering Pipeline for the Air Traffic Communication Domain. Aerospace, vol. 10, no. 10, 2023, pp. 1-14. ISSN 2226-4310.
Detail

NIGMATULINA Iuliia, MADIKERI Srikanth, VILLATORO-TELLO Esaú, MOTLíčEK Petr, ZULUAGA-GOMEZ Juan, PANDIA Karthick and GANAPATHIRAJU Aravind. Implementing contextual biasing in GPU decoder for online ASR. In: Proceedings of the Annual Conference of International Speech Communication Association, INTERSPEECH. Dublin: International Speech Communication Association, 2023, pp. 4494-4498. ISSN 1990-9772.
Detail

BURDISSO Sergio, VILLATORO-TELLO Esaú, MADIKERI Srikanth and MOTLíčEK Petr. Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews. In: Proceedings of the Annual Conference of International Speech Communication Association, INTERSPEECH. Dublin: International Speech Communication Association, 2023, pp. 3617-3621. ISSN 1990-9772.
Detail

MAI Florian, ZULUAGA-GOMEZ Juan, PARCOLLET Titouan and MOTLíčEK Petr. HyperConformer: Multi-head HyperMixer for Efficient Speech Recognition. In: Proceedings of the Annual Conference of International Speech Communication Association, INTERSPEECH. Dublin: International Speech Communication Association, 2023, pp. 2213-2217. ISSN 1990-9772.
Detail

VILLATORO-TELLO Esaú, MADIKERI Srikanth, ZULUAGA-GOMEZ Juan, SHARMA Bidisha, SARFJOO Seyyed Saeed, NIGMATULINA Iuliia, MOTLíčEK Petr, IVANOV Alexei V. and GANAPATHIRAJU Aravind. Effectiveness of Text, Acoustic, and Lattice-Based Representations in Spoken Language Understanding Tasks. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Rhodes Island: IEEE Signal Processing Society, 2023, pp. 1-5. ISBN 978-1-7281-6327-7.
Detail

VANDERREYDT Geoffroy, PRASAD Amrutha, KHALIL Driss, MADIKERI Srikanth, DEMUYNCK Kris and MOTLíčEK Petr. Parameter-Efficient Tuning With Adaptive Bottlenecks For Automatic Speech Recognition. In: Proceedings of IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). Taipei: IEEE Signal Processing Society, 2023, pp. 1-7. ISBN 979-8-3503-0689-7.
Detail

MOTLíčEK Petr, PRASAD Amrutha, NIGMATULINA Iuliia, HELMKE Hartmut, OHNEISER Oliver and KLEINERT Matthias. Automatic Speech Analysis Framework for ATC Communication in HAAWAII. In: SESAR Innovation Days. Seville: SESAR Joint Undertaking, 2023, pp. 1-9. ISSN 0770-1268.
Detail

HELMKE Hartmut, KLEINERT Matthias, AHRENHOLD Nils, EHR Heiko, MüHLHAUSEN Thorsten, PINSKA Chauvin Ella, OHNEISER Oliver, KLAMERT Lucas, MOTLíčEK Petr, PRASAD Amrutha, ZULUAGA-GOMEZ Juan and DOKIC Jelena. Automatic Speech Recognition and Understanding for Radar Label Maintenance Support Increases Safety and Reduces Air Traffic Controllers' Workload. In: Proceedings of ATM Seminar. Savannah, Georgia: EUROPEAN ORGANISATION FOR THE SAFETY OF AIR NAVIGATION, 2023, pp. 1-11.
Detail

BHATTACHARJEE Mrinmoy, MOTLíčEK Petr, NIGMATULINA Iuliia, HELMKE Hartmut, OHNEISER Oliver, KLEINERT Matthias and EHR Heiko. Customization of Automatic Speech Recognition Engines for Rare Word Detection Without Costly Model Re-Training. In: 13th SESAR Innovation Days 2023, SIDS 2023. Seville: SESAR Joint Undertaking, 2023, pp. 1-8. ISSN 0770-1268.
Detail

ZULUAGA-GOMEZ Juan, PRASAD Amrutha, NIGMATULINA Iuliia, MOTLíčEK Petr and KLEINERT Matthias. A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers. Aerospace, vol. 10, no. 5, 2023, pp. 1-25. ISSN 2226-4310.
Detail

2022

LANDINI Federico Nicolás, PROFANT Ján, DIEZ Sánchez Mireia and BURGET Lukáš. Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: Theory, implementation and analysis on standard tasks. Computer Speech and Language, vol. 71, no. 101254, 2022, pp. 1-16. ISSN 0885-2308.
Detail

BURGET Lukáš and BOJAR Ondřej. NEUREM3 Interim Research Report. Brno: Department of Computer Graphics and Multimedia FIT BUT, 2022.
Detail

KIšš Martin, KOHúT Jan, BENEš Karel and HRADIš Michal. Importance of Textlines in Historical Document Classification. In: Uchida, S., Barney, E., Eglin, V. (eds) Document Analysis Systems. Lecture Notes in Computer Science, vol. 13237. La Rochelle: Springer Nature Switzerland AG, 2022, pp. 158-170. ISBN 978-3-031-06554-5.
Detail

YUSUF Bolaji, GANDHE Ankur and SOKOLOV Alex. Usted: Improving ASR with a Unified Speech and Text Encoder-Decoder. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022, pp. 8297-8301. ISBN 978-1-6654-0540-9.
Detail

MOšNER Ladislav, PLCHOT Oldřich, BURGET Lukáš and ČERNOCKý Jan. Multisv: Dataset for Far-Field Multi-Channel Speaker Verification. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022, pp. 7977-7981. ISBN 978-1-6654-0540-9.
Detail

MOšNER Ladislav, PLCHOT Oldřich, BURGET Lukáš and ČERNOCKý Jan. Multi-Channel Speaker Verification with Conv-Tasnet Based Beamformer. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022, pp. 7982-7986. ISBN 978-1-6654-0540-9.
Detail

HAN Jiangyu, LONG Yanhua, BURGET Lukáš and ČERNOCKý Jan. DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation and Extraction. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022, pp. 7292-7296. ISBN 978-1-6654-0540-9.
Detail

ONDEL Yang Lucas Antoine Francois, LAM-YEE-MUI L'ea-Marie, KOCOUR Martin, CORRO Caio Filippo and BURGET Lukáš. GPU-Accelerated Forward-Backward Algorithm with Application to Lattice-Free MMI. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022, pp. 8417-8421. ISBN 978-1-6654-0540-9.
Detail

BLATT Alexander, KOCOUR Martin, VESELý Karel, SZőKE Igor and KLAKOW Dietrich. Call-Sign Recognition and Understanding for Noisy Air-Traffic Transcripts Using Surveillance Information. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022, pp. 8357-8361. ISBN 978-1-6654-0540-9.
Detail

NIGMATULINA Iuliia, ZULUAGA-GOMEZ Juan, PRASAD Amrutha, SARFJOO Saeed and MOTLíčEK Petr. A Two-Step Approach to Leverage Contextual Data: Speech Recognition in Air-Traffic Communications. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Singapore: IEEE Signal Processing Society, 2022, pp. 6282-6286. ISBN 978-1-6654-0540-9.
Detail

ONDEL Yang Lucas Antoine Francois, YUSUF Bolaji, BURGET Lukáš and SARAçLAR Murat. Non-Parametric Bayesian Subspace Models for Acoustic Unit Discovery. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, vol. 30, no. 5, 2022, pp. 1902-1917. ISSN 2329-9290.
Detail

EGOROVA Ekaterina, VYDANA Hari K., BURGET Lukáš and ČERNOCKý Jan. Spelling-Aware Word-Based End-to-End ASR. IEEE Signal Processing Letters, vol. 29, no. 29, 2022, pp. 1729-1733. ISSN 1558-2361.
Detail

SILNOVA Anna, STAFYLAKIS Themos, MOšNER Ladislav, PLCHOT Oldřich, ROHDIN Johan A., MATěJKA Pavel, BURGET Lukáš, GLEMBEK Ondřej and BRUMMER Johan Nikolaas Langenhoven. Analyzing speaker verification embedding extractors and back-ends under language and channel mismatch. In: Proceedings of The Speaker and Language Recognition Workshop (Odyssey 2022). Beijing: International Speech Communication Association, 2022, pp. 9-16.
Detail

PENG Junyi, ZHANG Chunlei, ČERNOCKý Jan and YU Dong. Progressive contrastive learning for self-supervised text-independent speaker verification. In: Proceedings of The Speaker and Language Recognition Workshop (Odyssey 2022). Beijing: International Speech Communication Association, 2022, pp. 17-24.
Detail

ALAM Jahangir, BURGET Lukáš, GLEMBEK Ondřej, MATěJKA Pavel, MOšNER Ladislav, PLCHOT Oldřich, ROHDIN Johan A., SILNOVA Anna and STAFYLAKIS Themos et al. Development of ABC systems for the 2021 edition of NIST Speaker Recognition evaluation. In: Proceedings of The Speaker and Language Recognition Workshop (Odyssey 2022). Beijing: International Speech Communication Association, 2022, pp. 346-353.
Detail

SOLEWICZ Yosef, COHEN Noa, ROHDIN Johan A., MADIKERI Srikanth and ČERNOCKý Jan. Speaker recognition on mono-channel telephony recordings. In: Proceedings of Odyssey 2022. Beijing: International Speech Communication Association, 2022, pp. 193-199.
Detail

BRUMMER Johan Nikolaas Langenhoven, SWART Albert du Preez, MOšNER Ladislav, SILNOVA Anna, PLCHOT Oldřich, STAFYLAKIS Themos and BURGET Lukáš. Probabilistic Spherical Discriminant Analysis: An Alternative to PLDA for length-normalized embeddings. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, pp. 1446-1450. ISSN 1990-9772.
Detail

LANDINI Federico Nicolás, LOZANO Díez Alicia, DIEZ Sánchez Mireia and BURGET Lukáš. From Simulated Mixtures to Simulated Conversations as Training Data for End-to-End Neural Diarization. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, pp. 5095-5099. ISSN 1990-9772.
Detail

STAFYLAKIS Themos, MOšNER Ladislav, PLCHOT Oldřich, ROHDIN Johan A., SILNOVA Anna, BURGET Lukáš and ČERNOCKý Jan. Training Speaker Embedding Extractors Using Multi-Speaker Audio with Unknown Speaker Boundaries. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, pp. 605-609. ISSN 1990-9772.
Detail

PENG Junyi, GU Rongzhi, MOšNER Ladislav, PLCHOT Oldřich, BURGET Lukáš and ČERNOCKý Jan. Learnable Sparse Filterbank for Speaker Verification. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, pp. 5110-5114. ISSN 1990-9772.
Detail

KOCOUR Martin, ŽMOLíKOVá Kateřina, ONDEL Yang Lucas Antoine Francois, ŠVEC Ján, DELCROIX Marc, OCHIAI Tsubasa, BURGET Lukáš and ČERNOCKý Jan. Revisiting joint decoding based multi-talker speech recognition with DNN acoustic model. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, pp. 4955-4959. ISSN 1990-9772.
Detail

BASKAR Murali K., ROSENBERG Andrew, RAMABHADRAN Bhuvana and ZHANG Yu. Reducing Domain mismatch in Self-supervised speech pre-training. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, pp. 3028-3032. ISSN 1990-9772.
Detail

BASKAR Murali K., HERZIG Tim, NGUYEN Diana, DIEZ Sánchez Mireia, POLZEHL Tim, BURGET Lukáš and ČERNOCKý Jan. Speaker adaptation for Wav2vec2 based dysarthric ASR. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, pp. 3403-3407. ISSN 1990-9772.
Detail

DELCROIX Marc, KINOSHITA Keisuke, OCHIAI Tsubasa, ŽMOLíKOVá Kateřina, SATO Hiroshi and NAKATANI Tomohiro. Listen only to me! How well can target speech extraction handle false alarms?. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Incheon: International Speech Communication Association, 2022, pp. 216-220. ISSN 1990-9772.
Detail

ŠVEC Ján, ŽMOLíKOVá Kateřina, KOCOUR Martin, DELCROIX Marc, OCHIAI Tsubasa, MOšNER Ladislav and ČERNOCKý Jan. Analysis of impact of emotions on target speech extraction and speech separation. In: Proceedings of The 17th International Workshop on Acoustic Signal Enhancement (IWAENC 2022). Bamberg: IEEE Signal Processing Society, 2022, pp. 1-5. ISBN 978-1-6654-6867-1.
Detail

DE Benito Gorron Diego, ŽMOLíKOVá Kateřina and TORRE Toledano Doroteo. Source Separation for Sound Event Detection in domestic environments using jointly trained models. In: Proceedings of The 17th International Workshop on Acoustic Signal Enhancement (IWAENC 2022). Bamberg: IEEE Signal Processing Society, 2022, pp. 1-5. ISBN 978-1-6654-6867-1.
Detail

KOCOUR Martin, UMESH Jahnavi, KARAFIáT Martin, ŠVEC Ján, LOPEZ Fernando, BENEš Karel, DIEZ Sánchez Mireia, SZőKE Igor, LUQUE Jordi, VESELý Karel, BURGET Lukáš and ČERNOCKý Jan. BCN2BRNO: ASR System Fusion for Albayzin 2022 Speech to Text Challenge. In: Proceedings of IberSpeech 2022. Granada: International Speech Communication Association, 2022, pp. 276-280.
Detail

DVOřáKOVá Martina, HRADIš Michal, ŽABIčKA Petr, KOHúT Jan, KIšš Martin and BENEš Karel. Využití PERO OCR při přepisu rukopisů. Archivní časopis, vol. 72, no. 1, 2022, pp. 14-27. ISSN 0004-0398.
Detail

HELMKE Hartmut, ONDřEJ Karel, SHETTY Shruthi, KLEINERT Matthias, OHNEISER Oliver, EHR Heiko, ZULUAGA-GOMEZ Juan and SMRž Pavel et al. Readback Error Detection by Automatic Speech Recognition and Understanding - Results of HAAWAII project for Isavia's Enroute Airspace. In: SESAR Innovation Days 2022. Budapest, 2022, pp. 1-9.
Detail

NADIMPALLI Vijaya Lakshmi V., KESIRAJU Santosh, BANKA Rohith, KETHIREDDY Rashmi and GANGASHETTY Suryakanth V. Resources and Benchmarks for Keyword Search in Spoken Audio From Low-Resource Indian Languages. IEEE Access, vol. 10, no. 2022, 2022, pp. 34789-34799. ISSN 2169-3536.
Detail

BASKAR Murali K., ROSENBERG Andrew, RAMABHADRAN Bhuvana, ZHANG Yu and MORENO Pedro. Ask2Mask: Guided Data Selection for Masked Speech Modeling. IEEE Journal of Selected Topics in Signal Processing, vol. 16, no. 6, 2022, pp. 1357-1366. ISSN 1932-4553.
Detail

PRASAD Amrutha, ZULUAGA-GOMEZ Juan, MOTLíčEK Petr, SARFJOO Seyyed Saeed, NIGMATULINA Iuliia and VESELý Karel. Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator. In: Proceedings of the 12th SESAR Innovation Days. Budapest, 2022, pp. 1-9.
Detail

PRASAD Amrutha, ZULUAGA-GOMEZ Juan, MOTLíčEK Petr, SARFJOO Seyyed Saeed, NIGMATULINA Iuliia, OHNEISER Oliver and HELMKE Hartmut. Grammar Based Speaker Role Identification for Air Traffic Control Speech Recognition. In: Proceedings of the 12th SESAR Innovation Days. Budapest, 2022, pp. 1-9.
Detail

BOITO Marcely Z., YUSUF Bolaji, ONDEL Yang Lucas Antoine Francois, VILLAVICENCIO Aline and BESACIER Laurent. Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings. In: Proceedings of the the 1st Annual Meeting of the ELRA/ISCA Special Interest Group on Under-Resourced Languages. Marseile: European Language Resources Association, 2022, pp. 1-9. ISBN 979-10-95546-91-7.
Detail

2021

KIšš Martin, BENEš Karel and HRADIš Michal. AT-ST: Self-Training Adaptation Strategy for OCR in Domains with Limited Transcriptions. In: Lladós J., Lopresti D., Uchida S. (eds) Document Analysis and Recognition - ICDAR 2021. Lecture Notes in Computer Science, vol. 12824. Lausanne: Springer Nature Switzerland AG, 2021, pp. 463-477. ISBN 978-3-030-86336-4.
Detail

LANDINI Federico Nicolás, LOZANO Díez Alicia, BURGET Lukáš, DIEZ Sánchez Mireia, SILNOVA Anna, ŽMOLíKOVá Kateřina, GLEMBEK Ondřej, MATěJKA Pavel, STAFYLAKIS Themos and BRUMMER Johan Nikolaas Langenhoven. BUT System Description for The Third DIHARD Speech Diarization Challenge. In: Proceedings available at Dihard Challenge Github. on-line by LDC and University of Pennsylvania, 2021, pp. 1-5.
Detail

DELCROIX Marc, ŽMOLíKOVá Kateřina, OCHIAI Tsubasa, KINOSHITA Keisuke and NAKATANI Tomohiro. Speaker activity driven neural speech extraction. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Toronto: IEEE Signal Processing Society, 2021, pp. 6099-6103. ISBN 978-1-7281-7605-5.
Detail

LANDINI Federico Nicolás, GLEMBEK Ondřej, MATěJKA Pavel, ROHDIN Johan A., BURGET Lukáš, DIEZ Sánchez Mireia and SILNOVA Anna. Analysis of the BUT Diarization System for Voxconverse Challenge. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, Ontario: IEEE Signal Processing Society, 2021, pp. 5819-5823. ISBN 978-1-7281-7605-5.
Detail

VYDANA Hari K., KARAFIáT Martin, ŽMOLíKOVá Kateřina, BURGET Lukáš and ČERNOCKý Jan. Jointly Trained Transformers Models for Spoken Language Translation. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, Ontario: IEEE Signal Processing Society, 2021, pp. 7513-7517. ISBN 978-1-7281-7605-5.
Detail

YUSUF Bolaji, ONDEL Yang Lucas Antoine Francois, BURGET Lukáš, ČERNOCKý Jan and SARAçLAR Murat. A Hierarchical Subspace Model for Language-Attuned Acoustic Unit Discovery. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, Ontario: IEEE Signal Processing Society, 2021, pp. 3710-3714. ISBN 978-1-7281-7605-5.
Detail

BASKAR Murali K., BURGET Lukáš, WATANABE Shinji, ASTUDILLO Ramon and ČERNOCKý Jan. Eat: Enhanced ASR-TTS for Self-Supervised Speech Recognition. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, Ontario: IEEE Signal Processing Society, 2021, pp. 6753-6757. ISBN 978-1-7281-7605-5.
Detail

KARAFIáT Martin, VESELý Karel, ČERNOCKý Jan, PROFANT Ján, NYTRA Jiří, HLAVáčEK Miroslav and PAVLíčEK Tomáš. Analysis of X-Vectors for Low-Resource Speech Recognition. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, Ontario: IEEE Signal Processing Society, 2021, pp. 6998-7002. ISBN 978-1-7281-7605-5.
Detail

ŽMOLíKOVá Kateřina, DELCROIX Marc, BURGET Lukáš, NAKATANI Tomohiro and ČERNOCKý Jan. Integration of Variational Autoencoder and Spatial Clustering for Adaptive Multi-Channel Neural Speech Separation. In: 2021 IEEE Spoken Language Technology Workshop, SLT 2021 - Proceedings. Shenzhen - virtual : IEEE Signal Processing Society, 2021, pp. 889-896. ISBN 978-1-7281-7066-4.
Detail

KOCOUR Martin, CáMBARA Guillermo, LUQUE Jordi, BONET David, FARRúS Mireia, KARAFIáT Martin, VESELý Karel and ČERNOCKý Jan. BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge. In: Proceedings of IberSPEECH 2021. Vallaloid: International Speech Communication Association, 2021, pp. 113-117.
Detail

STAFYLAKIS Themos, ROHDIN Johan A. and BURGET Lukáš. Speaker embeddings by modeling channel-wise correlations. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Brno: International Speech Communication Association, 2021, pp. 501-505. ISSN 1990-9772.
Detail

PENG Junyi, QU Xiaoyang, WANG Jianzong, GU Rongzhi, XIAO Jing, BURGET Lukáš and ČERNOCKý Jan. ICSpk: Interpretable Complex Speaker Embedding Extractor from Raw Waveform. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Brno: International Speech Communication Association, 2021, pp. 511-515. ISSN 1990-9772.
Detail

ŽMOLíKOVá Kateřina, DELCROIX Marc, RAJ Desh, WATANABE Shinji and ČERNOCKý Jan. Auxiliary Loss Function for Target Speech Extraction and Recognition with Weak Supervision Based on Speaker Characteristics. In: Proceedings of 2021 Interspeech. Brno: International Speech Communication Association, 2021, pp. 1464-1468. ISSN 1990-9772.
Detail

BENEš Karel and BURGET Lukáš. Text Augmentation for Language Models in High Error Recognition Scenario. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Brno: International Speech Communication Association, 2021, pp. 1872-1876. ISSN 1990-9772.
Detail

EGOROVA Ekaterina, VYDANA Hari K., BURGET Lukáš and ČERNOCKý Jan. Out-of-Vocabulary Words Detection with Attention and CTC Alignments in an End-to-End ASR System. In: Proceedings Interspeech 2021. Brno: International Speech Communication Association, 2021, pp. 2901-2905. ISSN 1990-9772.
Detail

SZőKE Igor, KESIRAJU Santosh, NOVOTNý Ondřej, KOCOUR Martin, VESELý Karel and ČERNOCKý Jan. Detecting English Speech in the Air Traffic Control Voice Communication. In: Proceedings Interspeech 2021. Brno: International Speech Communication Association, 2021, pp. 3286-3290. ISSN 1990-9772.
Detail

KOCOUR Martin, VESELý Karel, BLATT Alexander, ZULUAGA-GOMEZ Juan, SZőKE Igor, ČERNOCKý Jan, KLAKOW Dietrich and MOTLíčEK Petr. Boosting of Contextual Information in ASR for Air-Traffic Call-Sign Recognition. In: Proceedings Interspeech 2021. Brno: International Speech Communication Association, 2021, pp. 3301-3305. ISSN 1990-9772.
Detail

ZULUAGA-GOMEZ Juan, NIGMATULINA Iuliia, PRASAD Amrutha, MOTLíčEK Petr, VESELý Karel, KOCOUR Martin and SZőKE Igor. Contextual Semi-Supervised Learning: An Approach to Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems. In: Proceedings Interspeech 2021. Brno: International Speech Communication Association, 2021, pp. 3296-3300. ISSN 1990-9772.
Detail

YUSUF Bolaji, GOK Alican, GUNDOGDU Batuhan and SARAçLAR Murat. End-to-End Open Vocabulary Keyword Search. In: Proceedings Interspeech 2021. Brno: International Speech Communication Association, 2021, pp. 4388-4392. ISSN 1990-9772.
Detail

WANNER Leo, KLUSCH Matthias, MAVROPOULOS Athanasios, JAMIN Emmanuel, MARIN Puchades Victor, CASAMAYOR Gerard, ČERNOCKý Jan and EGOROVA Ekaterina et al. Towards a Versatile Intelligent Conversational Agent as Personal Assistant for Migrants. In: The PAAMS Collection. PAAMS 2021: Advances in Practical Applications of Agents, Multi-Agent Systems, and Social Good. . Lecture Notes in Computer Science book series , vol. 12946. Salamanca: Springer International Publishing, 2021, pp. 316-327. ISBN 978-3-030-85739-4. ISSN 0302-9743.
Detail

HELMKE Hartmut, KLEINERT Matthias, SHETTY Shruthi, OHNEISER Oliver, EHR Heiko, PRASAD Amrutha, MOTLíčEK Petr, VESELý Karel, ONDřEJ Karel, SMRž Pavel, HARFMANN Julia and WINDISCH Christian et al. Readback Error Detection by Automatic Speech Recognition to Increase ATM Safety. In: Proceedings of ATM Seminar. on-line: EUROPEAN ORGANISATION FOR THE SAFETY OF AIR NAVIGATION, 2021, pp. 1-10.
Detail

KLEINERT Matthias, HELMKE Hartmut, SHETTY Shruthi, OHNEISER Oliver, EHR Heiko, PRASAD Amrutha, MOTLíčEK Petr and HARFMANN Julia. Automated Interpretation of Air Traffic Control Communication: The Journey from Spoken Words to a Deeper Understanding of the Meaning. In: Proceedings of DASC 2021. San Antonio, Texas: Institute of Electrical and Electronics Engineers, 2021, pp. 1-9. ISBN 978-1-6654-3420-1.
Detail

HELMKE Hartmut, SHETTY Shruthi, KLEINERT Matthias, OHNEISER Oliver, EHR Heiko, MOTLíčEK Petr, PRASAD Amrutha and WINDISCH Christian et al. Measuring Speech Recognition And Understanding Performance in Air Traffic Control Domain Beyond Word Error Rates. In: Proceedings of 11th SESAR Innovation Days 2021. Belgie, 2021, pp. 1-8.
Detail

KOCOUR Martin, VESELý Karel, SZőKE Igor, KESIRAJU Santosh, ZULUAGA-GOMEZ Juan, BLATT Alexander, PRASAD Amrutha, NIGMATULINA Iuliia, MOTLíčEK Petr, KLAKOW Dietrich, TART Allan, KOLčáREK Pavel, ČERNOCKý Jan, CEVENINI Claudia, CHOUKRI Khalid, RIGAULT Mickael, LANDIS Fabian and SARFJOO Saeed et al. Automatic Processing Pipeline for Collecting and Annotating Air-Traffic Voice Communication Data. In: Proceedings of 9th OpenSky Symposium 2021, OpenSky Network, Brussels, Belgium. Brussels: MDPI, 2021, pp. 1-10. ISSN 2504-3900.
Detail

VYDANA Hari K., KARAFIáT Martin, BURGET Lukáš and ČERNOCKý Jan. The IWSLT 2021 BUT Speech Translation Systems. In: Proceedings of 18th International Conference on Spoken Language Translation (IWSLT) . Bangkok, on-line: Association for Computational Linguistics, 2021, pp. 75-83. ISBN 978-1-7138-3378-9.
Detail

ŘIHáčEK Tomáš and MATěJKA Pavel. Deep learning v psychoterapii: Strojová analýza nahrávek terapeutických sezení. E-psychologie, vol. 15, no. 3, 2021, pp. 35-37. ISSN 1802-8853.
Detail

2020

ROHDIN Johan A., SILNOVA Anna, DIEZ Sánchez Mireia, PLCHOT Oldřich, MATěJKA Pavel, BURGET Lukáš and GLEMBEK Ondřej. End-to-end DNN based text-independent speaker recognition for long and short utterances. Computer Speech and Language, vol. 2020, no. 59, pp. 22-35. ISSN 0885-2308.
Detail

DIEZ Sánchez Mireia, BURGET Lukáš, LANDINI Federico Nicolás and ČERNOCKý Jan. Analysis of Speaker Diarization based on Bayesian HMM with Eigenvoice Priors. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, vol. 28, no. 1, 2020, pp. 355-368. ISSN 2329-9290.
Detail

MATěJKA Pavel, PLCHOT Oldřich, GLEMBEK Ondřej, BURGET Lukáš, ROHDIN Johan A., ZEINALI Hossein, MOšNER Ladislav, SILNOVA Anna, NOVOTNý Ondřej, DIEZ Sánchez Mireia and ČERNOCKý Jan. 13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE. Computer Speech and Language, vol. 2020, no. 63, pp. 1-15. ISSN 0885-2308.
Detail

WANG Shuai, ROHDIN Johan A., PLCHOT Oldřich, BURGET Lukáš, YU Kai and ČERNOCKý Jan. Investigation of Specaugment for Deep Speaker Embedding Learning. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Barcelona: IEEE Signal Processing Society, 2020, pp. 7139-7143. ISBN 978-1-5090-6631-5.
Detail

DELCROIX Marc, OCHIAI Tsubasa, ŽMOLíKOVá Kateřina, KINOSHITA Keisuke, TAWARA Naohiro, NAKATANI Tomohiro and ARAKI Shoko. Improving Speaker Discrimination of Target Speech Extraction With Time-Domain Speakerbeam. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Barcelona: IEEE Signal Processing Society, 2020, pp. 691-695. ISBN 978-1-5090-6631-5.
Detail

LANDINI Federico Nicolás, WANG Shuai, DIEZ Sánchez Mireia, BURGET Lukáš, MATěJKA Pavel, ŽMOLíKOVá Kateřina, MOšNER Ladislav, SILNOVA Anna, PLCHOT Oldřich, NOVOTNý Ondřej, ZEINALI Hossein and ROHDIN Johan A. But System for the Second Dihard Speech Diarization Challenge. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Barcelona: IEEE Signal Processing Society, 2020, pp. 6529-6533. ISBN 978-1-5090-6631-5.
Detail

DIEZ Sánchez Mireia, BURGET Lukáš, LANDINI Federico Nicolás, WANG Shuai and ČERNOCKý Jan. Optimizing Bayesian Hmm Based X-Vector Clustering for the Second Dihard Speech Diarization Challenge. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Barcelona: IEEE Signal Processing Society, 2020, pp. 6519-6523. ISBN 978-1-5090-6631-5.
Detail

ŽMOLíKOVá Kateřina, KOCOUR Martin, LANDINI Federico Nicolás, BENEš Karel, KARAFIáT Martin, VYDANA Hari K., LOZANO Díez Alicia, PLCHOT Oldřich, BASKAR Murali K., ŠVEC Ján, MOšNER Ladislav, MALENOVSKý Vladimír, BURGET Lukáš, YUSUF Bolaji, NOVOTNý Ondřej, GRéZL František, SZőKE Igor and ČERNOCKý Jan. BUT System for CHiME-6 Challenge. In: Proceedings of CHiME 2020 Virtual Workshop. Barcelona: University of Sheffield, 2020, pp. 1-3.
Detail

SILNOVA Anna, BRUMMER Johan Nikolaas Langenhoven, ROHDIN Johan A., STAFYLAKIS Themos and BURGET Lukáš. Probabilistic embeddings for speaker diarization. In: Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop. Tokyo: International Speech Communication Association, 2020, pp. 24-31. ISSN 2312-2846.
Detail

MOšNER Ladislav, PLCHOT Oldřich, ROHDIN Johan A. and ČERNOCKý Jan. Utilizing VOiCES dataset for multichannel speaker verification with beamforming. In: Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop. Tokyo: International Speech Communication Association, 2020, pp. 187-193. ISSN 2312-2846.
Detail

ALAM Jahangir, BOULIANNE Gilles, BURGET Lukáš, DAHMANE Mohamed, DIEZ Sánchez Mireia, GLEMBEK Ondřej, LALONDE Marc, LOZANO Díez Alicia, MATěJKA Pavel, MIZERA Petr, MOšNER Ladislav, NOISEUX Cédric, MONTEIRO Joao, NOVOTNý Ondřej, PLCHOT Oldřich, ROHDIN Johan A., SILNOVA Anna, SLAVíčEK Josef, STAFYLAKIS Themos, ST-CHARLES Pierre-Luc, WANG Shuai and ZEINALI Hossein. Analysis of ABC Submission to NIST SRE 2019 CMN and VAST Challenge. In: Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop. Tokyo: International Speech Communication Association, 2020, pp. 289-295. ISSN 2312-2846.
Detail

KESIRAJU Santosh, PLCHOT Oldřich, BURGET Lukáš and GANGASHETTY Suryakanth V. Learning Document Embeddings Along With Their Uncertainties. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, vol. 2020, no. 28, pp. 2319-2332. ISSN 2329-9290.
Detail

KOSIBA Matěj and BURGET Lukáš et al. Multiwavelength classification of X-ray selected galaxy cluster candidates using convolutional neural networks. Monthly Notices of the Royal Astronomical Society, vol. 496, no. 4, 2020, pp. 4141-4153. ISSN 1365-2966.
Detail

LOZANO Díez Alicia, SILNOVA Anna, PULUGUNDLA Bhargav, ROHDIN Johan A., VESELý Karel, BURGET Lukáš, PLCHOT Oldřich, GLEMBEK Ondřej, NOVOTNý Ondřej and MATěJKA Pavel. BUT Text-Dependent Speaker Verification System for SdSV Challenge 2020. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Shanghai: International Speech Communication Association, 2020, pp. 761-765. ISSN 1990-9772.
Detail

ZEINALI Hossein, LEE Kong Aik, ALAM Jahangir and BURGET Lukáš. SdSV Challenge 2020: Large-Scale Evaluation of Short-duration Speaker Verification. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Shanghai: International Speech Communication Association, 2020, pp. 731-735. ISSN 1990-9772.
Detail

DUNBAR Ewan, KARADAYI Julien, BERNARD Mathieu, CAO Xuan-Nga, ALGAYRES Robin, ONDEL Lucas Antoine Francois, BESACIER Laurent, SAKTI Sakriani and DUPOUX Emmanuel. The Zero Resource Speech Challenge 2020: Discovering discrete subword and word units. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Shanghai: International Speech Communication Association, 2020, pp. 4831-4835. ISSN 1990-9772.
Detail

ZULUAGA-GOMEZ Juan, VESELý Karel, BLATT Alexander, MOTLíčEK Petr, KLAKOW Dietrich, TART Allan, SZőKE Igor, PRASAD Amrutha, SARFJOO Saeed, KOLčáREK Pavel, KOCOUR Martin, ČERNOCKý Jan, CEVENINI Claudia, CHOUKRI Khalid, RIGAULT Mickael and LANDIS Fabian. Automatic Call Sign Detection: Matching Air Surveillance Data with Air Traffic Spoken Communications. In: Proceedings of the 8th OpenSky Symposium 2020. Brusel: MDPI, 2020, pp. 1-10. ISSN 2504-3900.
Detail

ZULUAGA-GOMEZ Juan, MOTLíčEK Petr, ZHAN Qingran, VESELý Karel and BRAUN Rudolf. Automatic Speech Recognition Benchmark for Air-Traffic Communications. In: Proceedings of Interspeech 2020. Shanghai: International Speech Communication Association, 2020, pp. 2297-2301. ISSN 1990-9772.
Detail

SCHARENBORG Odette, BESACIER Laurent, BLACK Alan, HASEGAWA-JOHNSON Mark, METZE Florian, NEUBIG Graham, STüKER Sebastian, GODARD Pierre, MüLLER Markus, ONDEL Yang Lucas Antoine Francois, PALASKAR Shruti, ARTHUR Philip, CIANNELLA Francesco, DU Mingxing, LARSEN Elin, MERKX Danny, RIAD Rachid, WANG Liming and DUPOUX Emmanuel. Speech Technology for Unwritten Languages. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, vol. 2020, no. 28, pp. 964-975. ISSN 2329-9290.
Detail

BURGET Lukáš, GLEMBEK Ondřej, LOZANO Díez Alicia, MATěJKA Pavel, NOVOTNý Ondřej, PLCHOT Oldřich, PULUGUNDLA Bhargav, ROHDIN Johan A., SILNOVA Anna and VESELý Karel. BUT System Description to SdSV Challenge 2020. In: Proceedings of Short-duration Speaker Verification Challenge 2020 Workshop. Shanghai, on-line event of Interspeech 2020 Conference, 2020, pp. 1-5.
Detail

2019

CARTAS Alejandro, KOCOUR Martin, RAMAN Aravindh, LEONTIADIS Ilias, LUQUE Jordi, SASTRY Nishanth, NUNEZ-MARTINEZ Leon, PERINO Diego and PERALES Carlos Segura. A Reality Check on Inference at Mobile Networks Edge. In: Proceedings of the 2nd ACM International Workshop on Edge Systems, Analytics and Networking (EDGESYS '19). Dressden: Association for Computing Machinery, 2019, pp. 54-59. ISBN 978-1-4503-6275-7.
Detail

SZőKE Igor, SKáCEL Miroslav, MOšNER Ladislav, PALIESEK Jakub and ČERNOCKý Jan. Building and Evaluation of a Real Room Impulse Response Dataset. IEEE Journal of Selected Topics in Signal Processing, vol. 13, no. 4, 2019, pp. 863-876. ISSN 1932-4553.
Detail

ROHDIN Johan A., STAFYLAKIS Themos, SILNOVA Anna, ZEINALI Hossein, BURGET Lukáš and PLCHOT Oldřich. Speaker Verification Using End-To-End Adversarial Language Adaptation. In: Proceedings of ICASSP 2019. Brighton: IEEE Signal Processing Society, 2019, pp. 6006-6010. ISBN 978-1-5386-4658-8.
Detail

ZEINALI Hossein, BURGET Lukáš, ROHDIN Johan A., STAFYLAKIS Themos and ČERNOCKý Jan. How To Improve Your Speaker Embeddings Extractor in Generic Toolkits. In: Proceedings of 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP). Brighton: IEEE Signal Processing Society, 2019, pp. 6141-6145. ISBN 978-1-5386-4658-8.
Detail

NOVOTNý Ondřej, PLCHOT Oldřich, GLEMBEK Ondřej, ČERNOCKý Jan and BURGET Lukáš. Analysis of DNN Speech Signal Enhancement for Robust Speaker Recognition. Computer Speech and Language, vol. 2019, no. 58, pp. 403-421. ISSN 0885-2308.
Detail

MAGHSOODI Nooshin, SAMETI Hossein, ZEINALI Hossein and STAFYLAKIS Themos. Speaker Recognition With Random Digit Strings Using Uncertainty Normalized HMM-Based i-Vectors. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, vol. 2019, no. 11, pp. 1815-1825. ISSN 2329-9290.
Detail

ŽMOLíKOVá Kateřina, DELCROIX Marc, KINOSHITA Keisuke, OCHIAI Tsubasa, NAKATANI Tomohiro, BURGET Lukáš and ČERNOCKý Jan. SpeakerBeam: Speaker Aware Neural Network for Target Speaker Extraction in Speech Mixtures. IEEE Journal of Selected Topics in Signal Processing, vol. 13, no. 4, 2019, pp. 800-814. ISSN 1932-4553.
Detail

ONDEL Yang Lucas Antoine Francois, VYDANA Hari K., BURGET Lukáš and ČERNOCKý Jan. Bayesian Subspace Hidden Markov Model for Acoustic Unit Discovery. In: Proceedings of Interspeech 2019. Graz: International Speech Communication Association, 2019, pp. 261-265. ISSN 1990-9772.
Detail

DIEZ Sánchez Mireia, BURGET Lukáš, WANG Shuai, ROHDIN Johan A. and ČERNOCKý Jan. Bayesian HMM based x-vector clustering for Speaker Diarization. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, pp. 346-350. ISSN 1990-9772.
Detail

ZEINALI Hossein, STAFYLAKIS Themos, ATHANASOPOULOU Georgia, ROHDIN Johan A., GKINIS Ioanis, BURGET Lukáš and ČERNOCKý Jan. Detecting Spoofing Attacks Using VGG and SincNet: BUT-Omilia Submission to ASVspoof 2019 Challenge. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, pp. 1073-1077. ISSN 1990-9772.
Detail

WANG Shuai, ROHDIN Johan A., BURGET Lukáš, PLCHOT Oldřich, QIAN Yanmin, YU Kai and ČERNOCKý Jan. On the Usage of Phonetic Information for Text-independent Speaker Embedding Extraction. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, pp. 1148-1152. ISSN 1990-9772.
Detail

KARAFIáT Martin, BASKAR Murali K., WATANABE Shinji, HORI Takaaki, WIESNER Matthew and ČERNOCKý Jan. Analysis of Multilingual Sequence-to-Sequence Speech Recognition Systems. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, pp. 2220-2224. ISSN 1990-9772.
Detail

BASKAR Murali K., WATANABE Shinji, ASTUDILLO Ramon, HORI Takaaki, BURGET Lukáš and ČERNOCKý Jan. Semi-supervised Sequence-to-sequence ASR using Unpaired Speech and Text. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, pp. 3790-3794. ISSN 1990-9772.
Detail

MATěJKA Pavel, PLCHOT Oldřich, ZEINALI Hossein, MOšNER Ladislav, SILNOVA Anna, BURGET Lukáš, NOVOTNý Ondřej and GLEMBEK Ondřej. Analysis of BUT Submission in Far-Field Scenarios of VOiCES 2019 Challenge. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, pp. 2448-2452. ISSN 1990-9772.
Detail

NOVOTNý Ondřej, PLCHOT Oldřich, GLEMBEK Ondřej and BURGET Lukáš. Factorization of Discriminatively Trained i-Vector Extractor for Speaker Recognition. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, pp. 4330-4334. ISSN 1990-9772.
Detail

STAFYLAKIS Themos, ROHDIN Johan A., PLCHOT Oldřich, MIZERA Petr and BURGET Lukáš. Self-supervised speaker embeddings. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, pp. 2863-2867. ISSN 1990-9772.
Detail

NOVOTNý Ondřej, PLCHOT Oldřich, GLEMBEK Ondřej, BURGET Lukáš and MATěJKA Pavel. Discriminatively Re-trained i-Vector Extractor For Speaker Recognition. In: Proceedings of 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP). Brighton: IEEE Signal Processing Society, 2019, pp. 6031-6035. ISBN 978-1-5386-4658-8.
Detail

BASKAR Murali K., BURGET Lukáš, WATANABE Shinji, KARAFIáT Martin, HORI Takaaki and ČERNOCKý Jan. Promising Accurate Prefix Boosting For Sequence-to-sequence ASR. In: Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019, pp. 5646-5650. ISBN 978-1-5386-4658-8.
Detail

INAGUMA Hirofumi, CHO Jaejin, BASKAR Murali K., KAWAHARA Tatsuya and WATANABE Shinji. Transfer Learning Of Language-independent End-to-end ASR With Language Model Fusion. In: Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019, pp. 6096-6100. ISBN 978-1-5386-4658-8.
Detail

DELCROIX Marc, ŽMOLíKOVá Kateřina, OCHIAI Tsubasa, KINOSHITA Keisuke, ARAKI Shoko and NAKATANI Tomohiro. Compact Network for Speakerbeam Target Speaker Extraction. In: Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019, pp. 6965-6969. ISBN 978-1-5386-4658-8.
Detail

ONDEL Yang Lucas Antoine Francois, LI Ruizhi, SELL Gregory and HEřMANSKý Hynek. Deriving Spectro-temporal Properties of Hearing from Speech Data. In: Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019, pp. 411-415. ISBN 978-1-5386-4658-8.
Detail

MOšNER Ladislav, WU Minhua, RAJU Anirudh, PARTHASARATHI Sree Hari Krishnan, KUMATANI Kenichi, SUNDARAM Shiva, MAAS Roland and HOFFMEISTER Björn. Improving Noise Robustness of Automatic Speech Recognition via Parallel Data and Teacher-student Learning. In: Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019, pp. 6475-6479. ISBN 978-1-5386-4658-8.
Detail

YANG Jinyi, ONDEL Yang Lucas Antoine Francois, MANOHAR Vimal and HEřMANSKý Hynek. Towards Automatic Methods to Detect Errors in Transcriptions of Speech Recordings. In: Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019, pp. 3747-3751. ISBN 978-1-5386-4658-8.
Detail

BENEš Karel, IRIE Kazuki, BECK Eugen, SCHLüTER Ralf and NEY Hermann. Unsupervised Language Model Adaptation for Speech Recognition with no Extra Resources. In: Proceedings of DAGA 2019. Rostock: DEGA Head office, Deutsche Gesellschaft für Akustik, 2019, pp. 954-957. ISBN 978-3-939296-14-0.
Detail

DELCROIX Marc, ŽMOLíKOVá Kateřina, OCHIAI Tsubasa, KINOSHITA Keisuke, ARAKI Shoko and NAKATANI Tomohiro. Evaluation of SpeakerBeam target speech extraction in real noisy and reverberant conditions. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF JAPAN, vol. 2019, no. 2, pp. 1-2. ISSN 0369-4232.
Detail

MOšNER Ladislav, PLCHOT Oldřich, ROHDIN Johan A., BURGET Lukáš and ČERNOCKý Jan. Speaker Verification with Application-Aware Beamforming. In: IEEE Automatic Speech Recognition and Understanding Workshop - Proceedings (ASRU). Sentosa, Singapore: IEEE Signal Processing Society, 2019, pp. 411-418. ISBN 978-1-7281-0306-8.
Detail

ZEINALI Hossein, ČERNOCKý Jan and BURGET Lukáš. A multi purpose and large scale speech corpus in Persian and English for speaker and speech Recognition: the DeepMine database. In: IEEE Automatic Speech Recognition and Understanding Workshop - Proceedings (ASRU). Sentosa, Singapore: IEEE Signal Processing Society, 2019, pp. 397-402. ISBN 978-1-7281-0306-8.
Detail

ALAM Jahangir, BOULIANNE Gilles, GLEMBEK Ondřej, LOZANO Díez Alicia, MATěJKA Pavel, MIZERA Petr, MONTEIRO Joao, MOšNER Ladislav, NOVOTNý Ondřej, PLCHOT Oldřich, ROHDIN Johan A., SILNOVA Anna, SLAVíčEK Josef, STAFYLAKIS Themos, WANG Shuai and ZEINALI Hossein. ABC NIST SRE 2019 CTS System Description. In: Proceedings of NIST. Sentosa, Singapore: National Institute of Standards and Technology, 2019, pp. 1-6.
Detail

ALAM Jahangir, BOULIANNE Gilles, BURGET Lukáš, GLEMBEK Ondřej, LOZANO Díez Alicia, MATěJKA Pavel, MIZERA Petr, MOšNER Ladislav, NOVOTNý Ondřej, PLCHOT Oldřich, ROHDIN Johan A., SILNOVA Anna, SLAVíčEK Josef, STAFYLAKIS Themos, WANG Shuai, ZEINALI Hossein, DAHMANE Mohamed, ST-CHARLES Pierre-Luc, LALONDE Marc, NOISEUX Cédric and MONTEIRO Joao. ABC System Description for NIST Multimedia Speaker Recognition Evaluation 2019. In: Proceedings of NIST 2019 SRE Workshop. Sentosa, Singapore: National Institute of Standards and Technology, 2019, pp. 1-7.
Detail

ZEINALI Hossein, WANG Shuai, SILNOVA Anna, MATěJKA Pavel and PLCHOT Oldřich. BUT System Description to VoxCeleb Speaker Recognition Challenge 2019. In: Proceedings of The VoxCeleb Challange Workshop 2019. Graz, 2019, pp. 1-4.
Detail

CHO Jaejin, WATANABE Shinji, HORI Takaaki, BASKAR Murali K., INAGUMA Hirofumi, VILLALBA Lopez Jesus Antonio and DEHAK Najim. Language Model Integration Based on Memory Control for Sequence to Sequence Speech Recognition. In: Proceedings of 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP). Brighton: IEEE Signal Processing Society, 2019, pp. 6191-6195. ISBN 978-1-5386-4658-8.
Detail

SUBRAMANIAN Aswin S., WANG Xiaofei, BASKAR Murali K., WATANABE Shinji, TANIGUCHI Toru, TRAN Dung and FUJITA Yuya. Speech Enhancement Using End-to-End Speech Recognition Objectives. In: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. New Paltz, NY: IEEE Signal Processing Society, 2019, pp. 234-238. ISBN 978-1-7281-1123-0.
Detail

2018

BARTOS Anthony L., CIPR Tomáš, NELSON Douglas J., SCHWARZ Petr, BANOWETZ John and JERABEK Ladislav. Noise-robust speech triage. Journal of the Acoustical Society of America, vol. 143, no. 4, 2018, pp. 2313-2320. ISSN 1520-8524.
Detail

ONDEL Yang Lucas Antoine Francois, GODARD Pierre, BESACIER Laurent, LARSEN Elin, HASEGAWA-JOHNSON Mark, SCHARENBORG Odette, DUPOUX Emmanuel, BURGET Lukáš, YVON Francois and KHUDANPUR Sanjeev. Bayesian Models for Unit Discovery on a Very Low Resource Language. In: Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018, pp. 5939-5943. ISBN 978-1-5386-4658-8.
Detail

KARAFIáT Martin, BASKAR Murali K., VESELý Karel, GRéZL František, BURGET Lukáš and ČERNOCKý Jan. Analysis of Multilingual BLSTM Acoustic Model on Low and High Resource Languages. In: Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018, pp. 5789-5793. ISBN 978-1-5386-4658-8.
Detail

DELCROIX Marc, ŽMOLíKOVá Kateřina, KINOSHITA Keisuke, OGAWA Atsunori and NAKATANI Tomohiro. Single Channel Target Speaker Extraction and Recognition with Speaker Beam. In: Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018, pp. 5554-5558. ISBN 978-1-5386-4658-8.
Detail

ŽMOLíKOVá Kateřina, DELCROIX Marc, KINOSHITA Keisuke, HIGUCHI Takuya, NAKATANI Tomohiro and ČERNOCKý Jan. Optimization of Speaker-aware Multichannel Speech Extraction with ASR Criterion. In: Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018, pp. 6702-6706. ISBN 978-1-5386-4658-8.
Detail

LOZANO Díez Alicia, PLCHOT Oldřich, MATěJKA Pavel and GONZALEZ-RODRIGUEZ Joaquin. DNN Based Embeddings for Language Recognition. In: Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018, pp. 5184-5188. ISBN 978-1-5386-4658-8.
Detail

ROHDIN Johan A., SILNOVA Anna, DIEZ Sánchez Mireia, PLCHOT Oldřich, MATěJKA Pavel and BURGET Lukáš. End-to-End DNN Based Speaker Recognition Inspired by i-Vector and PLDA. In: Proceedings of ICASSP. Calgary: IEEE Signal Processing Society, 2018, pp. 4874-4878. ISBN 978-1-5386-4658-8.
Detail

EGOROVA Ekaterina and BURGET Lukáš. Out-of-Vocabulary Word Recovery Using FST-Based Subword Unit Clustering in a Hybrid ASR System. In: Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018, pp. 5919-5923. ISBN 978-1-5386-4658-8.
Detail

RYANT Neville, BERGELSON Elika, CHURCH Kenneth, CRISTIA Alejandrina, DU Jun, GANAPATHY Sriram, KHUDANPUR Sanjeev, KOWALSKI Diana, KRISHNAMOORTHY Mahesh, KULSHRESHTA Rajat, LIBERMAN Mark, LU Yu-Ding, MACIEJEWSKI Matthew, METZE Florian, PROFANT Ján, SUN Lei, TSAO Yu and YU Zhou. Enhancement and Analysis of Conversational Speech: JSALT 2017. In: Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018, pp. 5154-5158. ISBN 978-1-5386-4658-8.
Detail

LOZANO Díez Alicia, PLCHOT Oldřich, MATěJKA Pavel, NOVOTNý Ondřej and GONZALEZ-RODRIGUEZ Joaquin. Analysis of DNN-based Embeddings for Language Recognition on the NIST LRE 2017. In: Proceedings of Odyssey 2018 The Speaker and Language Recognition Workshop. Les Sables d'Olonne: International Speech Communication Association, 2018, pp. 39-46. ISSN 2312-2846.
Detail

PLCHOT Oldřich, MATěJKA Pavel, NOVOTNý Ondřej, CUMANI Sandro, LOZANO Díez Alicia, SLAVíčEK Josef, DIEZ Sánchez Mireia, GRéZL František, GLEMBEK Ondřej, KAMSALI Veera Mounika, SILNOVA Anna, BURGET Lukáš, ONDEL Yang Lucas Antoine Francois, KESIRAJU Santosh and ROHDIN Johan A. Analysis of BUT-PT Submission for NIST LRE 2017. In: Proceedings of Odyssey 2018 The Speaker and Language Recognition Workshop. Les Sables d'Olonne: International Speech Communication Association, 2018, pp. 47-53. ISSN 2312-2846.
Detail

DIEZ Sánchez Mireia, BURGET Lukáš and MATěJKA Pavel. Speaker Diarization based on Bayesian HMM with Eigenvoice Priors. In: Proceedings of Odyssey 2018. Les Sables d´Olonne: International Speech Communication Association, 2018, pp. 147-154. ISSN 2312-2846.
Detail

NOVOTNý Ondřej, PLCHOT Oldřich, MATěJKA Pavel, MOšNER Ladislav and GLEMBEK Ondřej. On the use of X-vectors for Robust Speaker Recognition. In: Proceedings of Odyssey 2018. Les Sables d´Olonne: International Speech Communication Association, 2018, pp. 168-175. ISSN 2312-2846.
Detail

SILNOVA Anna, MATěJKA Pavel, GLEMBEK Ondřej, PLCHOT Oldřich, NOVOTNý Ondřej, GRéZL František, SCHWARZ Petr and ČERNOCKý Jan. BUT/Phonexia Bottleneck Feature Extractor. In: Proceedings of Odyssey 2018. Les Sables d´Olonne: International Speech Communication Association, 2018, pp. 283-287. ISSN 2312-2846.
Detail

BRUMMER Johan Nikolaas Langenhoven, SILNOVA Anna, BURGET Lukáš and STAFYLAKIS Themos. Gaussian meta-embeddings for efficient scoring of a heavy-tailed PLDA model. In: Proceedings of Odyssey 2018. Les Sables d'Olonne: International Speech Communication Association, 2018, pp. 349-356. ISSN 2312-2846.
Detail

ZEINALI Hossein, BURGET Lukáš, SAMETI Hossein and ČERNOCKý Jan. Spoken Pass-Phrase Verification in the i-vector Space. In: Proceedings of Odyssey 2018. Les Sables d´Olonne: International Speech Communication Association, 2018, pp. 372-377. ISSN 2312-2846.
Detail

SILNOVA Anna, BRUMMER Johan Nikolaas Langenhoven, GARCíA-ROMERO Daniel, SNYDER David and BURGET Lukáš. Fast variational Bayes for heavy-tailed PLDA applied to i-vectors and x-vectors. In: Proceedings of Interspeech 2018. Hyderabad: International Speech Communication Association, 2018, pp. 72-76. ISSN 1990-9772.
Detail

KARAFIáT Martin, BASKAR Murali K., SZőKE Igor, MALENOVSKý Vladimír, VESELý Karel, GRéZL František, BURGET Lukáš and ČERNOCKý Jan. BUT OpenSAT 2017 speech recognition system. In: Proceedings of Interspeech 2018. Hyderabad: International Speech Communication Association, 2018, pp. 2638-2642. ISSN 1990-9772.
Detail

DIEZ Sánchez Mireia, LANDINI Federico Nicolás, BURGET Lukáš, ROHDIN Johan A., SILNOVA Anna, ŽMOLíKOVá Kateřina, NOVOTNý Ondřej, VESELý Karel, GLEMBEK Ondřej, PLCHOT Oldřich, MOšNER Ladislav and MATěJKA Pavel. BUT system for DIHARD Speech Diarization Challenge 2018. In: Proceedings of Interspeech 2018. Hyderabad: International Speech Communication Association, 2018, pp. 2798-2802. ISSN 1990-9772.
Detail

PULUGUNDLA Bhargav, BASKAR Murali K., KESIRAJU Santosh, EGOROVA Ekaterina, KARAFIáT Martin, BURGET Lukáš and ČERNOCKý Jan. BUT system for low resource Indian language ASR. In: Proceedings of Interspeech 2018. Hyderabad: International Speech Communication Association, 2018, pp. 3182-3186. ISSN 1990-9772.
Detail

BENEš Karel, KESIRAJU Santosh and BURGET Lukáš. i-vectors in language modeling: An efficient way of domain adaptation for feed-forward models. In: Proceedings of Interspeech 2018. Hyderabad: International Speech Communication Association, 2018, pp. 3383-3387. ISSN 1990-9772.
Detail

MOšNER Ladislav, PLCHOT Oldřich, MATěJKA Pavel, NOVOTNý Ondřej and ČERNOCKý Jan. Dereverberation and Beamforming in Robust Far-Field Speaker Recognition. In: Proceedings of Interspeech 2018. Hyderabad: International Speech Communication Association, 2018, pp. 1334-1338. ISSN 1990-9772.
Detail

VESELý Karel, PERALES Carlos Segura, SZőKE Igor, LUQUE Jordi and ČERNOCKý Jan. Lightly supervised vs. semi-supervised training of acoustic model on Luxembourgish for low-resource automatic speech recognition. In: Proceedings of Interspeech 2018. Hyderabad: International Speech Communication Association, 2018, pp. 2883-2887. ISSN 1990-9772.
Detail

NOVOTNý Ondřej, MATěJKA Pavel, PLCHOT Oldřich and GLEMBEK Ondřej. On the use of DNN Autoencoder for Robust Speaker Recognition. Brno: Faculty of Information Technology BUT, 2018.
Detail

ZEINALI Hossein, BURGET Lukáš and ČERNOCKý Jan. Convolutional Neural Networks and X-Vector Embedding for DCASE2018 Acoustic Scene Classification Challenge. In: Proceedings of DCASE 2018 Workshop. Surrey: Tampere University of Technology, 2018, pp. 1-5. ISBN 978-952-15-4262-6.
Detail

ALAM Jahangir, BHATTACHARYA Gautam, BRUMMER Johan Nikolaas Langenhoven, BURGET Lukáš, DIEZ Sánchez Mireia, GLEMBEK Ondřej, KENNY Patrick, KLčO Michal, LANDINI Federico Nicolás, LOZANO Díez Alicia, MATěJKA Pavel, MONTEIRO Joao, MOšNER Ladislav, NOVOTNý Ondřej, PLCHOT Oldřich, PROFANT Ján, ROHDIN Johan A., SILNOVA Anna, SLAVíčEK Josef, STAFYLAKIS Themos and ZEINALI Hossein. ABC NIST SRE 2018 SYSTEM DESCRIPTION. In: Proceedings of 2018 NIST SRE Workshop. Athens: National Institute of Standards and Technology, 2018, pp. 1-10.
Detail

SZőKE Igor. Souhrnná zpráva k výzkumnému projektu "Škoda auto - Digital Minutes". Brno: ŠKODA AUTO a.s., 2018.
Detail

WIESNER Matthew, LIU Chunxi, ONDEL Yang Lucas Antoine Francois, HARMAN Craig, MANOHAR Vimal, TRMAL Jan, HUANG Zhongqiang, DEHAK Najim and KHUDANPUR Sanjeev. Automatic Speech Recognition and Topic Identification for Almost-Zero-Resource Languages. In: Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018, pp. 2052-2056. ISSN 1990-9772.
Detail

GODARD Pierre, BOITO Marcely Z., ONDEL Yang Lucas Antoine Francois, BERARD Alexandre, YVON Francois, VILLAVICENCIO Aline and BESACIER Laurent. Unsupervised Word Segmentation from Speech with Attention. In: Proceeding of Interspeech 2018. Hyderabad: International Speech Communication Association, 2018, pp. 2678-2682. ISSN 1990-9772.
Detail

CHO Jaejin, BASKAR Murali K., LI Ruizhi, WIESNER Matthew, MALLIDI Sri Harish, YALTA Nelson, KARAFIáT Martin, WATANABE Shinji and HORI Takaaki. Multilingual Sequence-to-Sequence Speech Recognition: Architecture, Transfer Learning, and Language Modeling. In: Proceedings of 2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018). Athens: IEEE Signal Processing Society, 2018, pp. 521-527. ISBN 978-1-5386-4334-1.
Detail

DELCROIX Marc, ŽMOLíKOVá Kateřina, KINOSHITA Keisuke, ARAKI Shoko, OGAWA Atsunori and NAKATANI Tomohiro. SpeakerBeam: A New Deep Learning Technology for Extracting Speech of a Target Speaker Based on the Speaker's Voice Characteristics. NTT Technical Review, vol. 16, no. 11, 2018, pp. 19-24. ISSN 1348-3447.
Detail

2017

ZEINALI Hossein, SAMETI Hossein and BURGET Lukáš. HMM-Based Phrase-Independent i-Vector Extractor for Text-Dependent Speaker Verification. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, vol. 25, no. 7, 2017, pp. 1421-1435. ISSN 2329-9290.
Detail

BASKAR Murali K., KARAFIáT Martin, BURGET Lukáš, VESELý Karel, GRéZL František and ČERNOCKý Jan. Residual Memory Networks: Feed-forward approach to learn long-term temporal dependencies. In: Proceedings of ICASSP 2017. New Orleans: IEEE Signal Processing Society, 2017, pp. 4810-4814. ISBN 978-1-5090-4117-6.
Detail

HANNEMANN Mirko, TRMAL Jan, ONDEL Yang Lucas Antoine Francois, KESIRAJU Santosh and BURGET Lukáš. Bayesian joint-sequence models for grapheme-to-phoneme conversion. In: Proceedings of ICASSP 2017. New Orleans: IEEE Signal Processing Society, 2017, pp. 2836-2840. ISBN 978-1-5090-4117-6.
Detail

KESIRAJU Santosh, PAPPAGARI Raghavendra, ONDEL Yang Lucas Antoine Francois, BURGET Lukáš, DEHAK Najim, KHUDANPUR Sanjeev, ČERNOCKý Jan and GANGASHETTY Suryakanth V. Topic identification of spoken documents using unsupervised acoustic unit discovery. In: Proceedings of ICASSP 2017. New Orleans: IEEE Signal Processing Society, 2017, pp. 5745-5749. ISBN 978-1-5090-4117-6.
Detail

LIU Chunxi, YANG Jinyi, SUN Ming, KESIRAJU Santosh, ROTT Alena, ONDEL Yang Lucas Antoine Francois, GHAHREMANI Pegah, DEHAK Najim, BURGET Lukáš and KHUDANPUR Sanjeev. An Empirical evaluation of zero resource acoustic unit discovery. In: Proceedings of ICASSP 2017. New Orleans: IEEE Signal Processing Society, 2017, pp. 5305-5309. ISBN 978-1-5090-4117-6.
Detail

ONDEL Yang Lucas Antoine Francois, BURGET Lukáš, ČERNOCKý Jan and KESIRAJU Santosh. Bayesian phonotactic language model for acoustic unit discovery. In: Proceedings of ICASSP 2017. New Orleans: IEEE Signal Processing Society, 2017, pp. 5750-5754. ISBN 978-1-5090-4117-6.
Detail

FéR Radek, MATěJKA Pavel, GRéZL František, PLCHOT Oldřich, VESELý Karel and ČERNOCKý Jan. Multilingually Trained Bottleneck Features in Spoken Language Recognition. Computer Speech and Language, vol. 2017, no. 46, pp. 252-267. ISSN 0885-2308.
Detail

ZEINALI Hossein, SAMETI Hossein, BURGET Lukáš and ČERNOCKý Jan. Text-dependent speaker verification based on i-vectors, Neural Networks and Hidden Markov Models. Computer Speech and Language, vol. 2017, no. 46, pp. 53-71. ISSN 0885-2308.
Detail

BENEš Karel, BASKAR Murali K. and BURGET Lukáš. Residual Memory Networks in Language Modeling: Improving the Reputation of Feed-Forward Networks. In: Proceedings of Interspeeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 284-288. ISSN 1990-9772.
Detail

KARAFIáT Martin, BASKAR Murali K., MATěJKA Pavel, VESELý Karel, GRéZL František, BURGET Lukáš and ČERNOCKý Jan. 2016 BUT Babel system: Multilingual BLSTM acoustic model with i-vector based adaptation. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 719-723. ISSN 1990-9772.
Detail

MATěJKA Pavel, NOVOTNý Ondřej, PLCHOT Oldřich, BURGET Lukáš, DIEZ Sánchez Mireia and ČERNOCKý Jan. Analysis of Score Normalization in Multilingual Speaker Recognition. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 1567-1571. ISSN 1990-9772.
Detail

PLCHOT Oldřich, MATěJKA Pavel, SILNOVA Anna, NOVOTNý Ondřej, DIEZ Sánchez Mireia, ROHDIN Johan A., GLEMBEK Ondřej, BRüMMER Niko, SWART Albert du Preez, PRIETO Jesús J., GARCIA Perera Leibny Paola, BUERA Luis, KENNY Patrick, ALAM Jahangir and BHATTACHARYA Gautam. Analysis and Description of ABC Submission to NIST SRE 2016. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 1348-1352. ISSN 1990-9772.
Detail

SILNOVA Anna, BURGET Lukáš and ČERNOCKý Jan. Alternative Approaches to Neural Network based Speaker Verification. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 1572-1575. ISSN 1990-9772.
Detail

PAPADOPOULOS Pavlos, TRAVADI Ruchir, VAZ Colin, MALANDRAKIS Nikolaos, HERMJAKOB Ulf, POURDAMGHANI Nima, PUST Michael, ZHANG Boliang, PAN Xiaoman, LU Di, LIN Ying, GLEMBEK Ondřej, BASKAR Murali K., KARAFIáT Martin, BURGET Lukáš, HASEGAWA-JOHNSON Mark, JI Heng, MAY Jonathan, KNIGHT Kevin and NARAYANAN Shrikanth. Team ELISA System for DARPA LORELEI Speech Evaluation 2016. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 2053-2057. ISSN 1990-9772.
Detail

VESELý Karel, BURGET Lukáš and ČERNOCKý Jan. Semi-supervised DNN training with word selection for ASR. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 3687-3691. ISSN 1990-9772.
Detail

DAS Amit, HASEGAWA-JOHNSON Mark and VESELý Karel. Deep Auto-encoder Based Multi-task Learning Using Probabilistic Transcriptions. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 2073-2077. ISSN 1990-9772.
Detail

HIGUCHI Takuya, KINOSHITA Keisuke, DELCROIX Marc, ŽMOLíKOVá Kateřina and NAKATANI Tomohiro. Deep clustering-based beamforming for separation with unknown number of sources. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 1183-1187. ISSN 1990-9772.
Detail

ŽMOLíKOVá Kateřina, DELCROIX Marc, KINOSHITA Keisuke, HIGUCHI Takuya, OGAWA Atsunori and NAKATANI Tomohiro. Speaker-aware neural network based beamformer for speaker extraction in speech mixtures. In: Proceedings of Interspeech 2017. Stocholm: International Speech Communication Association, 2017, pp. 2655-2659. ISSN 1990-9772.
Detail

VESELý Karel, BASKAR Murali K., DIEZ Sánchez Mireia and BENEš Karel. MGB-3 BUT System: Low-resource ASR on Egyptian YOUTUBE data. In: Proceedings of ASRU 2017. Okinawa: IEEE Signal Processing Society, 2017, pp. 368-373. ISBN 978-1-5090-4788-8.
Detail

ŽMOLíKOVá Kateřina, DELCROIX Marc, KINOSHITA Keisuke, HIGUCHI Takuya, OGAWA Atsunori and NAKATANI Tomohiro. Learning Speaker Representation for Neural Network Based Multichannel Speaker Extraction. In: Proceedings of ASRU 2017. Okinawa: IEEE Signal Processing Society, 2017, pp. 8-15. ISBN 978-1-5090-4788-8.
Detail

ŽMOLíKOVá Kateřina. Summary report of project "Speech enhancement front-end for robust automatic speech recognition with large amount of training data" for Year 2017. Brno: NTT Corporation, 2017.
Detail

MATěJKA Pavel, PLCHOT Oldřich, NOVOTNý Ondřej, CUMANI Sandro, LOZANO Díez Alicia, SLAVíčEK Josef, DIEZ Sánchez Mireia, GRéZL František, GLEMBEK Ondřej, KAMSALI Veera Mounika, SILNOVA Anna, BURGET Lukáš, ONDEL Yang Lucas Antoine Francois, KESIRAJU Santosh and ROHDIN Johan A. BUT- PT System Description for NIST LRE 2017. In: Proceedings of NIST Language Recognition Workshop 2017. Orlando, Florida: National Institute of Standards and Technology, 2017, pp. 1-6.
Detail

MATěJKA Pavel. Souhrnná zpráva k projektu "Speaker REcognition" za rok 2017. Brno: Phonexia s.r.o., 2017.
Detail

GLEMBEK Ondřej. Summary report for project Exploiting Language Information for Situational Awareness (ELISA) For year 2017. Brno: University of Southern California, 2017.
Detail

MATěJKA Pavel. Summary report for project "Robust Automatic Speech Transcription" in Year 2017. Brno: Raytheon BBN Technologies, 2017.
Detail

MALANDRAKIS Nikolaos, GLEMBEK Ondřej and NARAYANAN Shrikanth. Extracting Situation Frames from non-English Speech: Evaluation Framework and Pilot Results. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 2123-2127. ISSN 1990-9772.
Detail

2016

PLCHOT Oldřich, BURGET Lukáš, ARONOWITZ Hagai and MATěJKA Pavel. Audio Enhancing With DNN Autoencoder For Speaker Recognition. In: Proceedings of the 41th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016. Shanghai: IEEE Signal Processing Society, 2016, pp. 5090-5094. ISBN 978-1-4799-9988-0.
Detail

MATěJKA Pavel, GLEMBEK Ondřej, NOVOTNý Ondřej, PLCHOT Oldřich, GRéZL František, BURGET Lukáš and ČERNOCKý Jan. Analysis Of DNN Approaches To Speaker Identification. In: Proceedings of the 41th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016. Shanghai: IEEE Signal Processing Society, 2016, pp. 5100-5104. ISBN 978-1-4799-9988-0.
Detail

VESELý Karel, WATANABE Shinji, ŽMOLíKOVá Kateřina, KARAFIáT Martin, BURGET Lukáš and ČERNOCKý Jan. Sequence Summarizing Neural Network for Speaker Adaptation. In: Proceedings of the 41th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016. Shanghai: IEEE Signal Processing Society, 2016, pp. 5315-5319. ISBN 978-1-4799-9988-0.
Detail

KARAFIáT Martin, BURGET Lukáš, GRéZL František, VESELý Karel and ČERNOCKý Jan. Multilingual Region-Dependent Transforms. In: Proceedings of the 41th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016. Shanghai: IEEE Signal Processing Society, 2016, pp. 5430-5434. ISBN 978-1-4799-9988-0.
Detail

LOPEZ-MORENO Ignacio, GONZALEZ-DOMINGUEZ Javier, MARTíNEZ González David, PLCHOT Oldřich, GONZALEZ-RODRIGUEZ Joaquin and MORENO Pedro. On the use of deep feedforward neural networks for automatic language identification. Computer Speech and Language, vol. 2016, no. 40, pp. 46-59. ISSN 0885-2308.
Detail

GRéZL František and KARAFIáT Martin. Bottle-Neck Feature Extraction Structures for Multilingual Training and Porting. In: Procedia Computer Science. Yogyakarta: Elsevier Science, 2016, pp. 144-151. ISSN 1877-0509.
Detail

LOZANO Díez Alicia, SILNOVA Anna, MATěJKA Pavel, GLEMBEK Ondřej, PLCHOT Oldřich, PEšáN Jan, BURGET Lukáš and GONZALEZ-RODRIGUEZ Joaquin. Analysis and Optimization of Bottleneck Features for Speaker Recognition. In: Proceedings of Odyssey 2016. Bilbao: International Speech Communication Association, 2016, pp. 352-357. ISSN 2312-2846.
Detail

ZEINALI Hossein, BURGET Lukáš, SAMETI Hossein, GLEMBEK Ondřej and PLCHOT Oldřich. Deep Neural Networks and Hidden Markov Models in i-vector-based Text-Dependent Speaker Verification. In: Proceedings of Odyssey 2016, The Speaker and Language Recognition Workshop. Bilbao: International Speech Communication Association, 2016, pp. 24-30. ISSN 2312-2846.
Detail

PLCHOT Oldřich, MATěJKA Pavel, FéR Radek, GLEMBEK Ondřej, NOVOTNý Ondřej, PEšáN Jan, VESELý Karel, ONDEL Yang Lucas Antoine Francois, KARAFIáT Martin, GRéZL František, KESIRAJU Santosh, BURGET Lukáš, BRUMMER Johan Nikolaas Langenhoven, SWART Albert du Preez, CUMANI Sandro, MALLIDI Sri Harish and LI Ruizhi. BAT System Description for NIST LRE 2015. In: Proceedings of Odyssey 2016, The Speaker and Language Recognition Workshop. Bilbao: International Speech Communication Association, 2016, pp. 166-173. ISSN 2312-2846.
Detail

GRéZL František, EGOROVA Ekaterina and KARAFIáT Martin. Study of Large Data Resources for Multilingual Training and System Porting. In: Procedia Computer Science. Yogyakarta: Elsevier Science, 2016, pp. 15-22. ISSN 1877-0509.
Detail

EGOROVA Ekaterina and SERRANO Jordi Lugue. Semi-Supervised Training of Language Model on Spanish Conversational Telephone Speech Data. In: Procedia Computer Science. Yogyakarta: Elsevier Science, 2016, pp. 114-120. ISSN 1877-0509.
Detail

ONDEL Yang Lucas Antoine Francois, BURGET Lukáš and ČERNOCKý Jan. Variational Inference for Acoustic Unit Discovery. In: Procedia Computer Science. Yogyakarta: Elsevier Science, 2016, pp. 80-86. ISSN 1877-0509.
Detail

ZEINALI Hossein, SAMETI Hossein, BURGET Lukáš, ČERNOCKý Jan, MAGHSOODI Nooshin and MATěJKA Pavel. i-vector/HMM Based Text-dependent Speaker Verification System for RedDots Challenge. In: Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016, pp. 440-444. ISBN 978-1-5108-3313-5.
Detail

KESIRAJU Santosh, BURGET Lukáš, SZőKE Igor and ČERNOCKý Jan. Learning document representations using subspace multinomial model. In: Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016, pp. 700-704. ISBN 978-1-5108-3313-5.
Detail

NOVOTNý Ondřej, MATěJKA Pavel, PLCHOT Oldřich, GLEMBEK Ondřej, BURGET Lukáš and ČERNOCKý Jan. Analysis of Speaker Recognition Systems in Realistic Scenarios of the SITW 2016 Challenge. In: Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016, pp. 828-832. ISBN 978-1-5108-3313-5.
Detail

ŽMOLíKOVá Kateřina, KARAFIáT Martin, VESELý Karel, DELCROIX Marc, WATANABE Shinji, BURGET Lukáš and ČERNOCKý Jan. Data selection by sequence summarizing neural network in mismatch condition training. In: Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016, pp. 2354-2358. ISBN 978-1-5108-3313-5.
Detail

LI Ruizhi, MALLIDI Sri Harish, PLCHOT Oldřich, BURGET Lukáš and DEHAK Najim. Exploiting Hidden-Layer Responses of Deep Neural Networks for Language Recognition. In: Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016, pp. 3265-3269. ISBN 978-1-5108-3313-5.
Detail

PEšáN Jan, BURGET Lukáš and ČERNOCKý Jan. Sequence Summarizing Neural Networks for Spoken Language Recognition. In: Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016, pp. 3285-3289. ISBN 978-1-5108-3313-5.
Detail

NOVOTNý Ondřej, MATěJKA Pavel, GLEMBEK Ondřej, PLCHOT Oldřich, GRéZL František, BURGET Lukáš and ČERNOCKý Jan. Analysis of the DNN-Based SRE Systems in Multi-language Conditions. In: Proceedings of SLT 2016. San Diego: IEEE Signal Processing Society, 2016, pp. 199-204. ISBN 978-1-5090-4903-5.
Detail

KARAFIáT Martin, BASKAR Murali K., MATěJKA Pavel, VESELý Karel, GRéZL František and ČERNOCKý Jan. Multilingual BLSTM and Speaker-Specific Vector Adaptation in 2016 BUT BABEL SYSTEM. In: Proceedings of SLT 2016. San Diego: IEEE Signal Processing Society, 2016, pp. 637-643. ISBN 978-1-5090-4903-5.
Detail

GRéZL František and KARAFIáT Martin. Boosting Performance on Low-resource Languages by Standard Corpora: AN ANALYSIS. In: Proceeding of SLT 2016. San Diego: IEEE Signal Processing Society, 2016, pp. 629-636. ISBN 978-1-5090-4903-5.
Detail

POVOLNý Filip, MATěJKA Pavel, HRADIš Michal, POPKOVá Anna, OTRUSINA Lubomír, SMRž Pavel, WOOD Ian, ROBIN Cécile and LAMEL Lori. Multimodal Emotion Recognition for AVEC 2016 Challenge. In: AVEC '16 Proceedings of the 6th International Workshop on Audio/Visual Emotion Challenge. Amsterdam: Association for Computing Machinery, 2016, pp. 75-82. ISBN 978-1-4503-4516-3.
Detail

GLEMBEK Ondřej. Summary report for project Exploiting Language Information for Situational Awareness (ELISA) For year 2016. Brno: University of Southern California, 2016.
Detail

MATěJKA Pavel. Summary report for project "Robust Automatic Speech Transcription" in Year 2016. Brno: Raytheon BBN Technologies, 2016.
Detail

SKáCEL Miroslav, KARAFIáT Martin, ONDEL Yang Lucas Antoine Francois, UCHYTIL Albert and SZőKE Igor. BUT Zero-Cost Speech Recognition 2016 System Description. In: CEUR Workshop Proceedings. Hilversum: CEUR-WS.org, 2016, pp. 1-3. ISSN 1613-0073.
Detail

POPKOVá Anna, POVOLNý Filip, MATěJKA Pavel, GLEMBEK Ondřej, GRéZL František and ČERNOCKý Jan. Investigation of Bottle-Neck Features for Emotion Recognition. In: 19th International Conference, TSD 2016, Brno , Czech Republic, September 12-16, 2016, Proceedings. Lecture Notes in Computer Science, Lecture Notes in Artificial Intelligence, vol. 9924. Brno: International Speech Communication Association, 2016, pp. 426-434. ISSN 0302-9743.
Detail

SAGHA Hesam, MATěJKA Pavel, GAVRYUOKOVA Maryna, POVOLNý Filip, MARCHI Erik and SCHULLER Björn W. Enhancing multilingual recognition of emotion in speech by language identification. In: 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION - Proceedings (INTERSPEECH 2016). San Francisco: International Speech Communication Association, 2016, pp. 2949-2953. ISSN 1990-9772.
Detail

SZőKE Igor and ANGUERA Xavier. Zero-Cost Speech Recognition Task at Mediaeval 2016. In: CEUR Workshop Proceedings. Hilversum: CEUR-WS.org, 2016, pp. 1-3. ISSN 1613-0073.
Detail

2015

MOTLíčEK Petr, DEY Subhadeep, MADIKERI Srikanth and BURGET Lukáš. Employment of Subspace Gaussian Mixture Models in Speaker Recognition. In: Proceedings of 2015 IEEE International Conference on Acoustics, Speech and Signal Processing. South Brisbane, Queensland: IEEE Signal Processing Society, 2015, pp. 4445-4449. ISBN 978-1-4673-6997-8.
Detail

HEřMANSKý Hynek, BURGET Lukáš, COHEN Jordan, DUPOUX Emmanuel, FELDMAN Naomi, GODFREY John, KHUDANPUR Sanjeev, MACIEJEWSKI Matthew, MALLIDI Sri Harish, MENON Anjali, OGAWA Tetsuji, PEDDINTI Vijayaditya, ROSE Richard, STERN Richard, WIESNER Matthew and VESELý Karel. Towards Machines That Know When They Do Not Know: Summary of Work Done at 2014 FREDERICK JELINEK MEMORIAL WORKSHOP. In: Proceedings of 2015 IEEE International Conference on Acoustics, Speech and Signal Processing. South Brisbane, Queensland: IEEE Signal Processing Society, 2015, pp. 5009-5013. ISBN 978-1-4673-6997-8.
Detail

SZőKE Igor, SKáCEL Miroslav, ČERNOCKý Jan and BURGET Lukáš. Coping with Channel Mismatch in Query-By-Example - BUT QUESST 2014. In: Proceedings of 2015 IEEE International Conference on Acoustics, Speech and Signal Processing. South Brisbane, Queensland: IEEE Signal Processing Society, 2015, pp. 5838-5842. ISBN 978-1-4673-6997-8.
Detail

ANGUERA Xavier, RODRIGUEZ-FUENTES Luis J., BUZO Andi, METZE Florian, SZőKE Igor and PENAGARIKANO Mikel. QUESST 2014: Evaluating Query-By-Example Speech Search in a Zero-Resource. In: Proceedings of 2015 IEEE International Conference on Acoustics, Speech and Signal Processing. South Brisbane, Queensland: IEEE Signal Processing Society, 2015, pp. 5833-5837. ISBN 978-1-4673-6997-8.
Detail

ONDEL Yang Lucas Antoine Francois, ANGUERA Xavier and LUQUE Jordi. MASK+:Data-Driven Regions Selection for Acoustic Fingerprinting. In: Proceedings of 2015 IEEE International Conference on Acoustics, Speech and Signal Processing. South Brisbane, Queensland: IEEE Signal Processing Society, 2015, pp. 335-339. ISBN 978-1-4673-6997-8.
Detail

FéR Radek, MATěJKA Pavel, GRéZL František, PLCHOT Oldřich and ČERNOCKý Jan. Multilingual Bottleneck Features for Language Recognition. In: Proceedings of Interspeech 2015. Dresden: International Speech Communication Association, 2015, pp. 389-393. ISBN 978-1-5108-1790-6. ISSN 1990-9772.
Detail

CUMANI Sandro, PLCHOT Oldřich and FéR Radek. Exploiting i-vector posterior covariances for short-duration language recognition. In: Proceedings of Interspeech 2015. Dresden: International Speech Communication Association, 2015, pp. 1002-1006. ISBN 978-1-5108-1790-6. ISSN 1990-9772.
Detail

GLEMBEK Ondřej, MATěJKA Pavel, PLCHOT Oldřich, PEšáN Jan, BURGET Lukáš and SCHWARZ Petr. Migrating i-vectors Between Speaker Recognition Systems Using Regression Neural Networks. In: Proceedings of Interspeech 2015. Dresden: International Speech Communication Association, 2015, pp. 2327-2331. ISBN 978-1-5108-1790-6. ISSN 1990-9772.
Detail

PEšáN Jan, BURGET Lukáš, HEřMANSKý Hynek and VESELý Karel. DNN derived filters for processing of modulation spectrum of speech. In: Proceedings of Interspeech 2015. Dresden: International Speech Communication Association, 2015, pp. 1908-1911. ISBN 978-1-5108-1790-6. ISSN 1990-9772.
Detail

MALLIDI Sri Harish, OGAWA Tetsuji, VESELý Karel, NIDADAVOLU Phani S. and HEřMANSKý Hynek. Autoencoder based multi-stream combination for noise robust speech recognition. In: Proceeding of Interspeech 2015. Dresden: International Speech Communication Association, 2015, pp. 3551-3555. ISBN 978-1-5108-1790-6. ISSN 1990-9772.
Detail

SILNOVA Anna, GLEMBEK Ondřej, KINNUNEN Tomi and MATěJKA Pavel. Exploring ANN Back-Ends for i-Vector Based Speaker Age Estimation. In: Proceedings of Interspeech 2015. Dresden: International Speech Communication Association, 2015, pp. 3036-3040. ISBN 978-1-5108-1790-6. ISSN 1990-9772.
Detail

KARAFIáT Martin, GRéZL František, BURGET Lukáš, SZőKE Igor and ČERNOCKý Jan. Three ways to adapt a CTS recognizer to unseen reverberated speech in BUT system for the ASpIRE challenge. In: Proceedings of Interspeech 2015. Dresden: International Speech Communication Association, 2015, pp. 2454-2458. ISBN 978-1-5108-1790-6. ISSN 1990-9772.
Detail

GLEMBEK Ondřej, MATěJKA Pavel, BURGET Lukáš, SCHWARZ Petr, PEšáN Jan and PLCHOT Oldřich. Voice-print transformation for migration between automatic speaker identification systems. Abstract book of the 7th European Academy of Forensic Science Conference. Praha: Criminal Police Department Prague, 2015. ISBN 978-80-260-8659-8.
Detail

HSIAO Roger, MA Jeff, HARTMANN William, KARAFIáT Martin, GRéZL František, BURGET Lukáš, SZőKE Igor, ČERNOCKý Jan, WATANABE Shinji, CHEN Zhuo, MALLIDI Sri Harish, HEřMANSKý Hynek, TSAKALIDIS Stavros and SCHWARTZ Richard. Robust Speech Recognition in Unknown Reverberant and Noisy Conditions. In: Proceedings of 2015 IEEE Automatic Speech Recognition and Understanding Workshop. Scottsdale, Arizona: IEEE Signal Processing Society, 2015, pp. 533-538. ISBN 978-1-4799-7291-3.
Detail

SKáCEL Miroslav and SZőKE Igor. BUT QUESST 2015 System Description. In: CEUR Workshop Proceedings. Wurzen: CEUR-WS.org, 2015, pp. 1-3. ISSN 1613-0073.
Detail

GRéZL František, KARAFIáT Martin, VESELý Karel and ŽIžKA Josef. Souhrnná zpráva k projektu "Zpracování audiovizuálních dat pro Superlectures.com" za rok 2015. Brno: ReplayWell, s. r. o., 2015.
Detail

GLEMBEK Ondřej, KESIRAJU Santosh and ONDEL Yang Lucas Antoine Francois. Summary report for project "ELISA" in Year 2015. Brno: University of Southern California, 2015.
Detail

MATěJKA Pavel, PLCHOT Oldřich, NOVOTNý Ondřej and FéR Radek. Summary report for project "Robust Automatic Speech Transcription" in Year 2015. Brno: Raytheon BBN Technologies, 2015.
Detail

KARAFIáT Martin, GRéZL František, HANNEMANN Mirko and VESELý Karel. Summary report for project "Multilingual speech recognition" in Year 2015. Brno: Raytheon BBN Technologies, 2015.
Detail

KARAFIáT Martin and GRéZL František. Souhrnná zpráva k projektu "ASR-FR" za rok 2015. Brno: Phonexia s.r.o., 2015.
Detail

KARAFIáT Martin and GRéZL František. Souhrnná zpráva k projektu "Dodání anotací akustických dat, akustického modelu, jazykového modelu a výslovnostního slovníku pro francouzský jazyk" za rok 2015. Brno: Phonexia s.r.o., 2015.
Detail

SZőKE Igor, METZE Florian, RODRIGUEZ-FUENTES Luis J., PROENCA Jorge, BUZO Andi, LOJKA Martin, ANGUERA Xavier and XIONG Xiao. Query by Example Search on Speech at Mediaeval 2015. In: CEUR Workshop Proceedings. Wurzen: CEUR-WS.org, 2015, pp. 1-3. ISSN 1613-0073.
Detail

2014

KARAFIáT Martin, GRéZL František, HANNEMANN Mirko and ČERNOCKý Jan. BUT Neural Network Features for Spontaneous Vietnamese in BABEL. In: Proceedings of ICASSP 2014. Florencie: IEEE Signal Processing Society, 2014, pp. 5659-5663. ISBN 978-1-4799-2892-7.
Detail

GLEMBEK Ondřej, MA Jeff, MATěJKA Pavel, ZHANG Bing, PLCHOT Oldřich, BURGET Lukáš and MATSOUKAS Spyros. Domain Adaptation Via Within-class Covariance Correction in I-Vector Based Speaker Recognition Systerms. In: Proceedings of ICASSP 2014. Florencie: IEEE Signal Processing Society, 2014, pp. 4060-4064. ISBN 978-1-4799-2892-7.
Detail

GRéZL František, KARAFIáT Martin and VESELý Karel. Adaptation of Multilingual Stacked Bottle-neck Neural Network Structure for New Language. In: Proceedings of ICASSP 2014. Florencie: IEEE Signal Processing Society, 2014, pp. 7704-7708. ISBN 978-1-4799-2892-7.
Detail

SZőKE Igor, BURGET Lukáš, GRéZL František, ČERNOCKý Jan and ONDEL Yang Lucas Antoine Francois. Calibration and Fusion of Query-by-example Systems - BUT SWS 2013. In: Proceedings of ICASSP 2014. Florencie: IEEE Signal Processing Society, 2014, pp. 7899-7903. ISBN 978-1-4799-2892-7.
Detail

LOPEZ-MORENO Ignacio, GONZALEZ-DOMINGUEZ Javier, MARTíNEZ González David, PLCHOT Oldřich, GONZALEZ-RODRIGUEZ Joaquin and MORENO Pedro. Automatic Language Identification Using Deep Neural Networks. In: Proceeding of ICASSP 2014. Florencie: IEEE Signal Processing Society, 2014, pp. 5374-5378. ISBN 978-1-4799-2892-7.
Detail

GRéZL František and KARAFIáT Martin. Adapting Multilingual Neural Network Hierarchy to a New Language. In: Proceedings of the 4th International Workshop on Spoken Language Technologies for Under- resourced Languages SLTU-2014. St. Petersburg, Russia, 2014. St. Petersburg: International Speech Communication Association, 2014, pp. 39-45. ISBN 978-5-8088-0908-6.
Detail

MARTíNEZ González David, BURGET Lukáš, STAFYLAKIS Themos, LEI Yun, KENNY Patrick and LLEIDA Eduardo. Unscented Transform For Ivector-based Noisy Speaker Recognition. In: Proceedings of ICASSP 2014. Florencie: IEEE Signal Processing Society, 2014, pp. 4070-4074. ISBN 978-1-4799-2892-7.
Detail

ANGUERA Xavier, RODRIGUEZ-FUENTES Luis J., SZőKE Igor, BUZO Andi and METZE Florian et al. Query-by-example Spoken Term Detection Evaluation on Low-resource Languages. In: Proceedings of the 4th International Workshop on Spoken Language Technologies for Under- resourced Languages SLTU-2014. St. Petersburg, Russia. St. Petersburg: International Speech Communication Association, 2014, pp. 24-31. ISBN 978-5-8088-0908-6.
Detail

EGOROVA Ekaterina. Multi-task Neural Networks For Speech Recognition. In: Proceedings of the 20th Student Conference, EEICT 2014. Volume 2. Brno: Brno University of Technology, 2014, pp. 24-26. ISBN 978-80-214-4923-7.
Detail

CUMANI Sandro, LAFACE Pietro and PLCHOT Oldřich. On the use of i-vector posterior distributions in Probabilistic Linear Discriminant Analysis. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, vol. 22, no. 4, 2014, pp. 846-857. ISSN 2329-9290.
Detail

MATěJKA Pavel, ZHANG Le, NG Tim, MALLIDI Sri Harish, GLEMBEK Ondřej, MA Jeff and ZHANG Bing. Neural Network Bottleneck Features for Language Identification. In: Proceedings of Odyssey 2014. Joensuu: International Speech Communication Association, 2014, pp. 299-304. ISSN 2312-2846.
Detail

GRéZL František and KARAFIáT Martin. Combination of Multilingual and Semi-Supervised Training for Under-Resourced Languages. In: Proceedings of Interspeech 2014. Singapore: International Speech Communication Association, 2014, pp. 820-824. ISBN 978-1-63439-435-2.
Detail

BAHARI Mohamad H., DEHAK Najim, VAN hamme Hugo, BURGET Lukáš, ALI Ahmed M. and GLASS Jim. Non-Negative Factor Analysis of Gaussian Mixture Model Weight Adaptation for Language and Dialect Recognition. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, vol. 2014, no. 7, pp. 1117-1129. ISSN 2329-9290.
Detail

NG Tim, HSIAO Roger, ZHANG Le, KARAKOS Damianos, MALLIDI Sri Harish, KARAFIáT Martin, VESELý Karel, SZőKE Igor, ZHANG Bing, NGUYEN Long and SCHWARTZ Richard. Progress in the BBN Keyword Search System for the DARPA RATS Program. In: Proceedings of Interspeech 2014. Singapore: International Speech Communication Association, 2014, pp. 959-963. ISBN 978-1-63439-435-2.
Detail

KARAFIáT Martin, GRéZL František, VESELý Karel, HANNEMANN Mirko, SZőKE Igor and ČERNOCKý Jan. BUT 2014 Babel System: Analysis of adaptation in NN based systems. In: Proceedings of Interspeech 2014. Singapore: International Speech Communication Association, 2014, pp. 3002-3006. ISBN 978-1-63439-435-2.
Detail

PLCHOT Oldřich, DIEZ Sánchez Mireia, SOUFIFAR Mehdi and BURGET Lukáš. PLLR Features in Language Recognition System for RATS. In: Proceedings of Interspeech 2014. Singapore: International Speech Communication Association, 2014, pp. 3048-3051. ISBN 978-1-63439-435-2.
Detail

GRéZL František, EGOROVA Ekaterina and KARAFIáT Martin. Further Investigation into Multilingual Training and Adaptation of Stacked Bottle-neck Neural Network Structure. In: Proceedings of 2014 Spoken Language Technology Workshop. South Lake Tahoe, Nevada: IEEE Signal Processing Society, 2014, pp. 48-53. ISBN 978-1-4799-7129-9.
Detail

KARAFIáT Martin, VESELý Karel, SZőKE Igor, BURGET Lukáš, GRéZL František, HANNEMANN Mirko and ČERNOCKý Jan. BUT ASR System for BABEL Surprise Evaluation 2014. In: Proceedings of 2014 Spoken Language Technology Workshop. South Lake Tahoe, Nevada: IEEE Signal Processing Society, 2014, pp. 501-506. ISBN 978-1-4799-7129-9.
Detail

SZőKE Igor, SKáCEL Miroslav and BURGET Lukáš. BUT QUESST 2014 System Description. In: CEUR Workshop Proceedings. Barcelona: CEUR-WS.org, 2014, pp. 1-2. ISSN 1613-0073.
Detail

ANGUERA Xavier, RODRIGUEZ-FUENTES Luis J., SZőKE Igor, BUZO Andi and METZE Florian. Query by Example Search on Speech at Mediaeval 2014. In: CEUR Workshop Proceedings. Barcelona: CEUR-WS.org, 2014, pp. 1-2. ISSN 1613-0073.
Detail

KARAFIáT Martin and GRéZL František. Souhrnná zpráva k projektu "Dodání anotací akustických dat, akustického modelu, jazykového modelu a výslovnostního slovníku pro španělský jazyk" za rok 2014. Brno: Phonexia s.r.o., 2014.
Detail

2013

CUMANI Sandro, PLCHOT Oldřich and LAFACE Pietro. Probabilistic Linear Discriminant Analysis Of I-Vector Posterior Distributions. In: Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013, pp. 7644-7648. ISBN 978-1-4799-0355-9.
Detail

PLCHOT Oldřich, MATSOUKAS Spyros, MATěJKA Pavel, DEHAK Najim, MA Jeff, CUMANI Sandro, GLEMBEK Ondřej, HEřMANSKý Hynek, MESGARANI Nima, SOUFIFAR Mehdi Mohammad, THOMAS Samuel, ZHANG Bing and ZHOU Xinhui et al. Developing A Speaker Identification System For The DARPA RATS Project. In: Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013, pp. 6768-6772. ISBN 978-1-4799-0355-9.
Detail

EGOROVA Ekaterina, VESELý Karel, KARAFIáT Martin, JANDA Miloš and ČERNOCKý Jan. Manual and Semi-Automatic Approaches to Building a Multilingual Phoneme Set. In: Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013, pp. 7324-7328. ISBN 978-1-4799-0355-9.
Detail

HANNEMANN Mirko, POVEY Daniel and ZWEIG Geoffrey. Combining Forward and Backward Search in Decoding. In: Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013, pp. 6739-6743. ISBN 978-1-4799-0355-9.
Detail

LEI Yun, BURGET Lukáš and SCHEFFER Nicolas. A Noise Robust I-Vector Extractor Using Vector Taylor Series For Speaker Recognition. In: Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013, pp. 6788-6791. ISBN 978-1-4799-0355-9.
Detail

AKBACAK Murat, BURGET Lukáš, WENG Wan and VAN Hout Julien. Rich System Combination For Keyword Spotting In Noisy and Acoustically Heterogenous Audio Streams. In: Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013, pp. 8267-8271. ISBN 978-1-4799-0355-9.
Detail

JANDA Miloš. Automatic Generation Of Pronunciation Dictionaries Based On Diarization. In: Proceedings of the 19th Conference Student EEICT 2013. Brno: Brno University of Technology, 2013, pp. 228-232. ISBN 978-80-214-4695-3.
Detail

MOTLíčEK Petr, POVEY Daniel and KARAFIáT Martin. Feature And Score Level Combination Of Subspace Gaussians In LVCSR Task. In: Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013, pp. 7604-7608. ISBN 978-1-4799-0355-9.
Detail

VESELý Karel, GHOSHAL Arnab, BURGET Lukáš and POVEY Daniel. Sequence-discriminative Training of Deep Neural Networks. In: Proceedings of Interspeech 2013. Lyon: International Speech Communication Association, 2013, pp. 2345-2349. ISBN 978-1-62993-443-3. ISSN 2308-457X.
Detail

KARAFIáT Martin, GRéZL František, HANNEMANN Mirko, VESELý Karel and ČERNOCKý Jan. BUT BABEL System for Spontaneous Cantonese. In: Proceedings of Interspeech 2013. Lyon: International Speech Communication Association, 2013, pp. 2589-2593. ISBN 978-1-62993-443-3. ISSN 2308-457X.
Detail

RATH Shakti P., BURGET Lukáš, KARAFIáT Martin, GLEMBEK Ondřej and ČERNOCKý Jan. A Region-specific Feature-space Transformation for Speaker Adaptation and Singularity Analysis of Jacobian Matrix. In: Proceedings of Interspeeech 2013. Lyon: International Speech Communication Association, 2013, pp. 1228-1232. ISBN 978-1-62993-443-3. ISSN 2308-457X.
Detail

RATH Shakti P., POVEY Daniel, VESELý Karel and ČERNOCKý Jan. Improved Feature Processing for Deep Neural Networks. In: Proceedings of Interspeech 2013. Lyon: International Speech Communication Association, 2013, pp. 109-113. ISBN 978-1-62993-443-3. ISSN 2308-457X.
Detail

SOUFIFAR Mehdi Mohammad, BURGET Lukáš, PLCHOT Oldřich, CUMANI Sandro and ČERNOCKý Jan. Regularized Subspace n-Gram Model for Phonotactic iVector Extraction. In: Proceedings of Interspeech 2013. Lyon: International Speech Communication Association, 2013, pp. 74-78. ISBN 978-1-62993-443-3. ISSN 2308-457X.
Detail

CUMANI Sandro, BRUMMER Johan Nikolaas Langenhoven, BURGET Lukáš, LAFACE Pietro, PLCHOT Oldřich and VASILAKAKIS Vasileios. Pairwise Discriminative Speaker Verification in the I -Vector Space. IEEE Transactions on Audio, Speech, and Language Processing, vol. 2013, no. 6, pp. 1217-1227. ISSN 1558-7916.
Detail

TRESADERN Phil, COOTES Timothy F., POH Norman, MATěJKA Pavel, HADID Abdenour, LéVY Christophe, MCCOOL Christopher S. and MARCEL Sebastien. Mobile Biometrics: Combined Face and Voice Verification for a Mobile Platform. Pervasive Computing, vol. 12, no. 1, 2013, pp. 79-87. ISSN 1536-1268.
Detail

GRéZL František and KARAFIáT Martin. Semi-Supervised Bootstrapping Approach For Neural Network Feature Extractor Training. In: Proceedings of ASRU 2013. Olomouc: IEEE Signal Processing Society, 2013, pp. 470-475. ISBN 978-1-4799-2755-5.
Detail

SZőKE Igor, BURGET Lukáš, GRéZL František and ONDEL Yang Lucas Antoine Francois. BUT SWS 2013 - Massive Parallel Approach. In: Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop. Barcelona: CEUR-WS.org, 2013, pp. 1-2. ISSN 1613-0073.
Detail

ANGUERA Xavier, METZE Florian, BUZO Andi, SZőKE Igor and RODRIGUEZ-FUENTES Luis J. The Spoken Web Search Task. In: CEUR Workshop Proceedings. Barcelona: CEUR-WS.org, 2013, pp. 1-2. ISSN 1613-0073.
Detail

KARAKOS Damianos, SCHWARTZ Richard, TSAKALIDIS Stavros, ZHANG Le, RANJAN Shivesh, NG Tim, HSIAO Roger, NGUYEN Long, GRéZL František, HANNEMANN Mirko, KARAFIáT Martin, SZőKE Igor and VESELý Karel et al. Score Normalization and System Combination for Improved Keyword Spotting. In: Proceedings of ASRU 2013. Olomouc: IEEE Signal Processing Society, 2013, pp. 210-215. ISBN 978-1-4799-2755-5.
Detail

HSIAO Roger, NG Tim, GRéZL František, KARAKOS Damianos, TSAKALIDIS Stavros, NGUYEN Long and SCHWARTZ Richard. Discriminative Semi-supervised Training for Keyword Search in Low Resource Languages. In: Proceedings of ASRU 2013. Olomouc: IEEE Signal Processing Society, 2013, pp. 440-445. ISBN 978-1-4799-2755-5.
Detail

VESELý Karel, HANNEMANN Mirko and BURGET Lukáš. Semi-supervised Training of Deep Neural Networks. In: Proceedings of ASRU 2013. Olomouc: IEEE Signal Processing Society, 2013, pp. 267-272. ISBN 978-1-4799-2755-5.
Detail

ZHILA Alisa, YIH Wen-tau, MEEK Christopher, MIKOLOV Tomáš and ZWEIG Geoffrey. Combining Heterogeneous Models for Measuring Relational Similarity. In: Proceedings of NAACL-HLT 2013. Atlanata, Georgia: Association for Computational Linguistics, 2013, pp. 1000-1009. ISBN 978-1-937284-47-3.
Detail

KHOURY Elie S., VESNICER Boštjan, FRANCO-PEDROSO Javier, DIEZ Sánchez Mireia, CIPR Tomáš, SCHWARZ Petr, VAN Leeuwen David, PETROVSKA-DELACRETAZ Dijana, MATěJKA Pavel, RODRIGUEZ-FUENTES Luis J., CHOLLET Gerard and MARCEL Sebastien et al. The 2013 Speaker Recognition Evaluation in Mobile Environment. In: Proceedings of Biometrics (ICB), 2013 International Conference on. Madrid: IEEE Biometric Council, 2013, pp. 1-8. ISBN 978-1-4799-0310-8.
Detail

GRéZL František, CHALUPNíčEK Kamil, KARAFIáT Martin and VESELý Karel. Souhrnná zpráva k projektu "Dodání anotací akustických dat, akustického modelu, jazykového modelu a výslovnostního slovníku pro arabský jazyk" za rok 2013. Brno: Phonexia s.r.o., 2013.
Detail

GRéZL František, KARAFIáT Martin, VESELý Karel and ŽIžKA Josef. Souhrnná zpráva k projektu "Zpracování audiovizuálních dat pro Superlectures.com" za rok 2013. Brno: ReplayWell, s. r. o., 2013.
Detail

BURGET Lukáš, PLCHOT Oldřich and SZőKE Igor. 2013 Summary report of project "Processing and analysis of speech, automatic speaker identification". Brno: Raytheon BBN Technologies, 2013.
Detail

MCLAREN Mitchell, ABRASH Victor, GRACIARENA Martin, LEI Yun and PEšáN Jan. Improving Robustness to Compressed Speech in Speaker Recognition. In: Proceedings of Interspeech 2013. Lyon: International Speech Communication Association, 2013, pp. 3698-3702. ISBN 978-1-62993-443-3.
Detail

MATěJKA Pavel, GLEMBEK Ondřej, PLCHOT Oldřich, SCHWARZ Milan, CIPR Tomáš, CUMANI Sandro, KUDLA Radim, SZőKE Igor, SVOBODOVá Marie, MALý Květoslav and ČERNOCKý Jan. BUT HASR'12 Experience: Are developers of SRE Systems naive listeners?. Brno: Faculty of Information Technology BUT, 2013.
Detail

2012

SOUFIFAR Mehdi Mohammad, CUMANI Sandro, BURGET Lukáš and ČERNOCKý Jan. Discriminative Classifiers for Phonotactic Language Recognition with iVectors. In: Proc. International Conference on Acoustics, Speech, and Signal Processing 2012. Kyoto: IEEE Signal Processing Society, 2012, pp. 4853-4856. ISBN 978-1-4673-0044-5.
Detail

POVEY Daniel, HANNEMANN Mirko, BOULIANNE Gilles, BURGET Lukáš, GHOSHAL Arnab, JANDA Miloš, KARAFIáT Martin, KOMBRINK Stefan, MOTLíčEK Petr, QIAN Yanmin, RIEDHAMMER Korbinian, VESELý Karel and VU Ngoc Thang. Generating Exact Lattices in The WFST Framework. In: Proceedings of 2012 IEEE International Conference on Acoustics, Speech and Signal Processing. Kyoto: IEEE Signal Processing Society, 2012, pp. 4213-4216. ISBN 978-1-4673-0044-5.
Detail

KOMBRINK Stefan, MIKOLOV Tomáš, KARAFIáT Martin and BURGET Lukáš. Improving Language Models for ASR Using Translated In-domain Data. In: Proceedings of 2012 IEEE International Conference on Acoustics, Speech and Signal Processing. Kyoto: IEEE Signal Processing Society, 2012, pp. 4405-4408. ISBN 978-1-4673-0044-5.
Detail

KARAFIáT Martin, JANDA Miloš, ČERNOCKý Jan and BURGET Lukáš. Region Dependent Linear Transforms in Multilingual Speech Recognition. In: Proc. International Conference on Acoustics, Speech, and Signal Processing 2012. Kyoto: IEEE Signal Processing Society, 2012, pp. 4885-4888. ISBN 978-1-4673-0044-5.
Detail

CUMANI Sandro, PLCHOT Oldřich and KARAFIáT Martin. Independent Component Analysis and MLLR Transforms for Speaker Identification. In: Proc. International Conference on Acoustics, Speech, and Signal P. Kyoto: IEEE Signal Processing Society, 2012, pp. 4365-4368. ISBN 978-1-4673-0044-5.
Detail

CUMANI Sandro, GLEMBEK Ondřej, BRUMMER Johan Nikolaas Langenhoven, DE Villiers Edward and LAFACE Pietro. Gender Independent Discriminative Speaker Recognition in I-Vector Space. In: Proc. International Conference on Acoustics, Speech, and Signal P. Kyoto: IEEE Signal Processing Society, 2012, pp. 4361-4364. ISBN 978-1-4673-0044-5.
Detail

KOMBRINK Stefan, HANNEMANN Mirko and BURGET Lukáš. Out-of-Vocabulary Word Detection and Beyond. Detection and Identification of Rare Audiovisual Cues. Studies in Computational Intelligence, 384. Springer-Verlag Berlin Heidelberg: Springer Verlag, 2012, pp. 57-65. ISBN 978-3-642-24033-1.
Detail

HAIN Thomas, BURGET Lukáš, DINES John, GARNER Phillip N., GRéZL František, EL Hannani Asmaa, HUIJBREGTS Marijn, KARAFIáT Martin, LINCOLN Mike and WAN Vincent. Transcribing Meetings with the AMIDA System. IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 2, 2012, pp. 486-498. ISSN 1558-7916.
Detail

MOTLíčEK Petr, VALENTE Fabio and SZőKE Igor. Improving Acoustic Based Keyword Spotting Using LVCSR Lattices. In: Proc. International Conference on Acoustics, Speech, and Signal Processing 2012. Kyoto: IEEE Signal Processing Society, 2012, pp. 4413-4416. ISBN 978-1-4673-0044-5.
Detail

METZE Florian, RAJPUT Nitendra, ANGUERA Xavier, DAVEL Marelie H., GRAVIER Guillaume, HEERDEN Charl van, MANTENA Gautam V., MUSCARIELLO Armando, PRAHALLAD Kishore, SZőKE Igor and TEJEDOR Javier. The Spoken WEB Search Task At Mediaeval 2011. In: Proc. International Conference on Acoustics, Speech, and Signal P. Kyoto: IEEE Signal Processing Society, 2012, pp. 5165-5168. ISBN 978-1-4673-0044-5.
Detail

LEI Yun, BURGET Lukáš, FERRER Luciana, GRACIARENA Martin and SCHEFFER Nicolas. Towards Noise-Robust Speaker Recognition Using Probabilistic Linear Discriminant Analysis. In: Proc. International Conference on Acoustics, Speech, and Signal P. Kyoto: IEEE Signal Processing Society, 2012, pp. 4253-4256. ISBN 978-1-4673-0044-5.
Detail

MARTíNEZ González David, BURGET Lukáš, FERRER Luciana and SCHEFFER Nicolas. Ivector-Based Prosodic System For Language Identification. In: Proc. International Conference on Acoustics, Speec. Kyoto: IEEE Signal Processing Society, 2012, pp. 4861-4864. ISBN 978-1-4673-0044-5.
Detail

JANDA Miloš. Grapheme Based Speech Recognition. In: Proceedings of the 18th Conference STUDENT EEICT 2012. Brno: Brno University of Technology, 2012, pp. 441-445. ISBN 978-80-214-4460-7.
Detail

FERRER Luciana, BURGET Lukáš, PLCHOT Oldřich and SCHEFFER Nicolas. A Unified Approach for Audio Characterization and its Application to Speaker Recognition. In: Proceedings of Odyssey 2012, The Speaker and Language Recognition Workshop. Singapur: International Speech Communication Association, 2012, pp. 317-323. ISBN 978-981-07-3093-2.
Detail

BOUSQUET Pierre-Michel, LARCHER Anthony, MATROUF Driss, BONASTRE Jean-Francois and PLCHOT Oldřich. Variance-Spectra based Normalization for I-vector Standard and Probabilistic Linear Discriminant Analysis. In: Proceedings of Odyssey 2012: The Speaker and Language Recognition Workshop. Singapur: International Speech Communication Association, 2012, pp. 157-164. ISBN 978-981-07-3093-2.
Detail

BRUMMER Johan Nikolaas Langenhoven, CUMANI Sandro, GLEMBEK Ondřej, KARAFIáT Martin, MATěJKA Pavel, PEšáN Jan, PLCHOT Oldřich, SOUFIFAR Mehdi Mohammad, DE Villiers Edward and ČERNOCKý Jan. Description and analysis of the Brno276 system for LRE2011. In: Proceedings of Odyssey 2012: The Speaker and Language Recognition Workshop. Singapur: International Speech Communication Association, 2012, pp. 216-223. ISBN 978-981-07-3093-2.
Detail

PLCHOT Oldřich, KARAFIáT Martin, BRUMMER Johan Nikolaas Langenhoven, GLEMBEK Ondřej, MATěJKA Pavel, DE Villiers Edward and ČERNOCKý Jan. Speaker vectors from Subspace Gaussian Mixture Model as complementary features for Language Identification. In: Proceedings of Odyssey 2012, The Speaker and Language Recognition Workshop. Singapur: International Speech Communication Association, 2012, pp. 330-333. ISBN 978-981-07-3093-2.
Detail

RATH Shakti P., KARAFIáT Martin, GLEMBEK Ondřej and ČERNOCKý Jan. A factorized representation of FMLLR transform based on QR-decomposition. In: Proceedings of Interspeech 2012. Portland, Oregon: International Speech Communication Association, 2012, pp. 1-4. ISBN 978-1-62276-759-5. ISSN 1990-9772.
Detail

D'HARO Luis Fernando, GLEMBEK Ondřej, PLCHOT Oldřich, MATěJKA Pavel, SOUFIFAR Mehdi Mohammad, CORDOBA Ricardo and ČERNOCKý Jan. Phonotactic Language Recognition using i-vectors and Phoneme Posteriogram Counts. In: Proceedings of Interspeech 2012. Portland, Oregon: International Speech Communication Association, 2012, pp. 1-4. ISBN 978-1-62276-759-5. ISSN 1990-9772.
Detail

MATěJKA Pavel, PLCHOT Oldřich, SOUFIFAR Mehdi Mohammad, GLEMBEK Ondřej, D'HARO Luis Fernando, VESELý Karel, GRéZL František, MA Jeff, MATSOUKAS Spyros and DEHAK Najim. Patrol Team Language Identification System for DARPA RATS P1 Evaluation. In: Proceedings of Interspeech 2012. Portland, Oregon: International Speech Communication Association, 2012, pp. 1-4. ISBN 978-1-62276-759-5. ISSN 1990-9772.
Detail

NG Tim, ZHANG Bing, NGUYEN Long, MATSOUKAS Spyros, ZHOU Xinhui, MESGARANI Nima, VESELý Karel and MATěJKA Pavel. Developing a Speech Activity Detection System for the DARPA RATS Program. In: Proceedings of Interspeech 2012. Portland, Oregon: International Speech Communication Association, 2012, pp. 1-4. ISBN 978-1-62276-759-5. ISSN 1990-9772.
Detail

VESELý Karel, KARAFIáT Martin, GRéZL František, JANDA Miloš and EGOROVA Ekaterina. The Language-Independent Bottleneck Features. In: Proceedings of IEEE 2012 Workshop on Spoken Language Technology. Miami: IEEE Signal Processing Society, 2012, pp. 336-341. ISBN 978-1-4673-5124-9.
Detail

JANDA Miloš, KARAFIáT Martin and ČERNOCKý Jan. Dealing with Numbers in Grapheme-Based Speech Recognition. In: Proceedings of 15th International Conference on Text, Speech and Dialogue. Lecture Notes in Computer Science, 2012, Volume 7499, vol. 2012. Springer-Verlag Berlin Heidelberg 2012: Springer Verlag, 2012, pp. 438-445. ISBN 978-3-642-32789-6. ISSN 0302-9743.
Detail

MCCOOL Christopher S., MARCEL Sebastien, MATěJKA Pavel, ČERNOCKý Jan, KITTLER Joseph, LARCHER Anthony, LéVY Christophe, MATROUF Driss and BONASTRE Jean-Francois et al. Bi-Modal Person Recognition on a Mobile Phone: using mobile phone data. In: 2012 IEEE International Conference on Multimedia and Expo Workshops. Melbourne, Victoria: IEEE Computer Society, 2012, pp. 635-640. ISBN 978-1-4673-2027-6.
Detail

DEORAS Anoop, MIKOLOV Tomáš, KOMBRINK Stefan and CHURCH Kenneth. Approximate inference: A sampling based modeling technique to capture complex dependencies in a language model. Speech Communication, vol. 2012, no. 8, pp. 1-16. ISSN 0167-6393.
Detail

SZőKE Igor, FAPšO Michal, ŽIžKA Josef, BERAN Vítězslav and ČERNOCKý Jan. Efektivní přístup ke znalostem v audio-vizuálních záznamech. In: Proceedings of the Annual Database Conference. Praha: The University of Technology Košice, 2012, pp. 57-74. ISBN 978-80-553-1049-7.
Detail

SZőKE Igor, FAPšO Michal and VESELý Karel. BUT2012 Approaches for Spoken Web Search - MediaEval 2012. In: Working Notes Proceedings of the MediaEval 2012 Workshop. Pisa: CEUR-WS.org, 2012, pp. 1-2. ISSN 1613-0073.
Detail

TEJEDOR Javier, FAPšO Michal, SZőKE Igor, ČERNOCKý Jan and GRéZL František. Comparison of methods for language-dependent and language-independent query-by-example spoken term detection. ACM Transactions on Information Systems (TOIS), vol. 2012, no. 30, pp. 1-34. ISSN 1046-8188.
Detail

ČERNOCKý Jan. Dolování informací z mluvené řeči v BUT Speech@FIT. In: Hovory s informatiky 2012. Praha: Academy of Sciences of the Czech Republic, 2012, pp. 113-114. ISBN 978-80-87136-14-0.
Detail

LEI Yun, BURGET Lukáš and SCHEFFER Nicolas. Bilinear Factor Analysis for iVector Based Speaker Verification. In: Proceedings of Interspeech. Portland, Oregon: International Speech Communication Association, 2012, pp. 1-4. ISBN 978-1-62276-759-5.
Detail

2011

ČERNOCKý Jan. MOBIO D1.3 - Annual Report. Martigny: Information and Communication Technologies (ICT) 7th Framework programme, 2011.
Detail

POVEY Daniel, KARAFIáT Martin, GHOSHAL Arnab and SCHWARZ Petr. A Symmetrization of the Subspace Gaussian Mixture Model. In: Proceedings of 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing. Praha: IEEE Signal Processing Society, 2011, pp. 4504-4507. ISBN 978-1-4577-0537-3.
Detail

BURGET Lukáš, PLCHOT Oldřich, CUMANI Sandro, GLEMBEK Ondřej, MATěJKA Pavel and BRüMMER Niko. Discriminatively Trained Probabilistic Linear Discriminant Analysis for Speaker Verification. In: Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011, pp. 4832-4835. ISBN 978-1-4577-0537-3.
Detail

CUMANI Sandro, BRüMMER Niko, BURGET Lukáš and LAFACE Pietro. Fast Discriminative Speaker Verification in the I-Vector Space. In: Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011, pp. 4852-4855. ISBN 978-1-4577-0537-3.
Detail

GLEMBEK Ondřej, BURGET Lukáš, KENNY Patrick, KARAFIáT Martin and MATěJKA Pavel. Simplification and optimization of I-Vector Extraction. In: Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011, pp. 4516-4519. ISBN 978-1-4577-0537-3.
Detail

KOCKMANN Marcel, FERRER Luciana, BURGET Lukáš, SHRIBERG Elisabeth and ČERNOCKý Jan. Recent Progress in Prosodic Speaker Verification. In: Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011, pp. 4556-4559. ISBN 978-1-4577-0537-3.
Detail

MATěJKA Pavel, GLEMBEK Ondřej, CASTALDO Fabio, ALAM Jahangir, PLCHOT Oldřich, KENNY Patrick, BURGET Lukáš and ČERNOCKý Jan. Full-covariance UBM and Heavy-tailed PLDA in I-Vector Speaker Verification. In: Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011, pp. 4828-4831. ISBN 978-1-4577-0537-3.
Detail

MIKOLOV Tomáš, KOMBRINK Stefan, BURGET Lukáš, ČERNOCKý Jan and KHUDANPUR Sanjeev. Extensions of Recurrent Neural Network Language Model. In: Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011, pp. 5528-5531. ISBN 978-1-4577-0537-3.
Detail

DEORAS Anoop, MIKOLOV Tomáš, KOMBRINK Stefan, KARAFIáT Martin and KHUDANPUR Sanjeev. Variational Approximation of Long-span Language Models for LVCSR. In: Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011, pp. 5532-5535. ISBN 978-1-4577-0537-3.
Detail

PEšáN Jan. Rozpoznávání mluvčího na mobilním telefonu. In: Proceedings of the 17th Conference Student EEICT 2011. Volume 2. Brno: Brno University of Technology, 2011, pp. 341-343. ISBN 978-80-214-4272-6.
Detail

POVEY Daniel, BURGET Lukáš, AGARWAL Mohit, AKYAZI Pinar, GHOSHAL Arnab, GLEMBEK Ondřej, GOEL Nagendra K., KARAFIáT Martin, RASTROW Ariya, ROSE Richard, SCHWARZ Petr and THOMAS Samuel et al. The subspace Gaussian mixture model-A structured model for speech recognition. Computer Speech and Language, vol. 25, no. 2, 2011, pp. 404-439. ISSN 0885-2308.
Detail

KOCKMANN Marcel, BURGET Lukáš and ČERNOCKý Jan. Application of speaker- and language identification state-of-the-art techniques for emotion recognition. Speech Communication, vol. 53, no. 9, 2011, pp. 1172-1185. ISSN 0167-6393.
Detail

DEORAS Anoop, MIKOLOV Tomáš and CHURCH Kenneth. A Fast Re-scoring Strategy to Capture Long-Distance Dependencies. In: Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing July 2011 Edinburgh, Scotland, UK. Edinburgh: Association for Computational Linguistics, 2011, pp. 1116-1127. ISBN 978-1-937284-11-4.
Detail

KOMBRINK Stefan and MIKOLOV Tomáš. Recurrent Neural Network Language Modeling Applied to the Brno AMI/AMIDA 2009 Meeting Recognizer Setup. In: Proceedings of the 17th Conference STUDENT EEICT 2011. Volume 3. Brno: Brno University of Technology, 2011, pp. 527-531. ISBN 978-80-214-4273-3.
Detail

GRéZL František. The Role of Neural Network Size in TRAP/HATS Feature Extraction. In: Proceedings Text, Speech and Dialogue 2011. LNAI 6836, vol. 2011. Plzeň: Springer Verlag, 2011, pp. 315-322. ISBN 978-3-642-23537-5. ISSN 0302-9743.
Detail

GLEMBEK Ondřej, BURGET Lukáš, BRüMMER Niko, PLCHOT Oldřich and MATěJKA Pavel. Discriminatively Trained i-vector Extractor for Speaker Verification. In: Proceedings of Interspeech 2011. Florence: International Speech Communication Association, 2011, pp. 137-140. ISBN 978-1-61839-270-1. ISSN 1990-9772.
Detail

KOCKMANN Marcel, FERRER Luciana, BURGET Lukáš and ČERNOCKý Jan. iVector Fusion of Prosodic and Cepstral Features for Speaker Verification. In: Proceedings of Interspeech 2011. Florence: International Speech Communication Association, 2011, pp. 265-268. ISBN 978-1-61839-270-1. ISSN 1990-9772.
Detail

MARTíNEZ González David, PLCHOT Oldřich, BURGET Lukáš, GLEMBEK Ondřej and MATěJKA Pavel. Language Recognition in iVectors Space. In: Proceedings of Interspeech 2011. Florence: International Speech Communication Association, 2011, pp. 861-864. ISBN 978-1-61839-270-1. ISSN 1990-9772.
Detail

GRéZL František and KARAFIáT Martin. Integrating recent MLP feature extraction techniques into TRAP architecture. In: Proceedings of Interspeech 2011. Florence: International Speech Communication Association, 2011, pp. 1229-1232. ISBN 978-1-61839-270-1. ISSN 1990-9772.
Detail

BOřIL Hynek, GRéZL František and HANSEN John H. Front-End Compensation Methods for LVCSR Under Lombard Effect. In: Proceedings of Interspeech 2011. Florence: International Speech Communication Association, 2011, pp. 1257-1260. ISBN 978-1-61839-270-1. ISSN 1990-9772.
Detail

SOUFIFAR Mehdi, KOCKMANN Marcel, BURGET Lukáš, PLCHOT Oldřich, GLEMBEK Ondřej and SVENDSEN Torbjorn. iVector Approach to Phonotactic Language Recognition. In: Proceedings of Interspeech 2011. Florence: International Speech Communication Association, 2011, pp. 2913-2916. ISBN 978-1-61839-270-1. ISSN 1990-9772.
Detail

MIKOLOV Tomáš, DEORAS Anoop, KOMBRINK Stefan, BURGET Lukáš and ČERNOCKý Jan. Empirical Evaluation and Combination of Advanced Language Modeling Techniques. In: Proceedings of Interspeech 2011. Florence: International Speech Communication Association, 2011, pp. 605-608. ISBN 978-1-61839-270-1. ISSN 1990-9772.
Detail

KOMBRINK Stefan, MIKOLOV Tomáš, KARAFIáT Martin and BURGET Lukáš. Recurrent Neural Network based Language Modeling in Meeting Recognition. In: Proceedings of Interspeech 2011. Florence: International Speech Communication Association, 2011, pp. 2877-2880. ISBN 978-1-61839-270-1. ISSN 1990-9772.
Detail

KARAFIáT Martin, BURGET Lukáš, MATěJKA Pavel, GLEMBEK Ondřej and ČERNOCKý Jan. iVector-Based Discriminative Adaptation for Automatic Speech Recognition. In: Proceedings of ASRU 2011. Hilton Waikoloa Village, Big Island, Hawaii: IEEE Signal Processing Society, 2011, pp. 152-157. ISBN 978-1-4673-0366-8.
Detail

VESELý Karel, KARAFIáT Martin and GRéZL František. Convolutive Bottleneck Network Features for LVCSR. In: Proceedings of ASRU 2011. Big Island, Hawaii: IEEE Signal Processing Society, 2011, pp. 42-47. ISBN 978-1-4673-0366-8.
Detail

GRéZL František, KARAFIáT Martin and JANDA Miloš. Study of Probabilistic and Bottle-Neck Features in Multilingual Environment. In: Proceedings of ASRU 2011. Hilton Waikoloa Village, Big Island, Hawaii: IEEE Signal Processing Society, 2011, pp. 359-364. ISBN 978-1-4673-0366-8.
Detail

MIKOLOV Tomáš, DEORAS Anoop, POVEY Daniel, BURGET Lukáš and ČERNOCKý Jan. Strategies for Training Large Scale Neural Network Language Models. In: Proceedings of ASRU 2011. Hilton Waikoloa Village, Big Island, Hawaii: IEEE Signal Processing Society, 2011, pp. 196-201. ISBN 978-1-4673-0366-8.
Detail

ČERNOCKý Jan, SZőKE Igor, HANNEMANN Mirko, KOMBRINK Stefan and FAPšO Michal. Hybrid Word-Subword Speech Recognition - a Powerful Tool to Search in Speech. Proceedings of 21st International Conference Radioelektronika 2011. Brno: Department of Radioelectronics FEEC BUT, 2011. ISBN 978-1-61284-322-3.
Detail

MIKOLOV Tomáš, KOMBRINK Stefan, DEORAS Anoop, BURGET Lukáš and ČERNOCKý Jan. RNNLM - Recurrent Neural Network Language Modeling Toolkit. In: Proceedings of ASRU 2011. Hilton Waikoloa Village, Big Island, Hawaii: IEEE Signal Processing Society, 2011, pp. 1-4. ISBN 978-1-4673-0366-8.
Detail

FERRER Luciana, BRATT Harry, BURGET Lukáš, ČERNOCKý Jan, GLEMBEK Ondřej, GRACIARENA Martin, LAWSON Aaron, LEI Yun, MATěJKA Pavel, PLCHOT Oldřich and SCHEFFER Nicolas. Promoting robustness for speaker modeling in the community: the PRISM evaluation set. In: Proceedings of SRE11 Analysis Workshop in 2011. Atlanta, Georga, 2011, pp. 1-7.
Detail

POVEY Daniel, GHOSHAL Arnab, BOULIANNE Gilles, BURGET Lukáš, GLEMBEK Ondřej, GOEL Nagendra K., HANNEMANN Mirko, MOTLíčEK Petr, QIAN Yanmin, SCHWARZ Petr, SILOVSKý Jan, STEMMER Georg and VESELý Karel. The Kaldi Speech Recognition Toolkit. In: Proceedings of ASRU 2011. Hilton Waikoloa Village Resort, Hawaii: IEEE Signal Processing Society, 2011, pp. 1-4. ISBN 978-1-4673-0366-8.
Detail

2010

ČERNOCKý Jan and ŠEVEčKOVá Michaela. Korpusové a hlasové technologie v nové generaci elektronických slovníků - závěrečná technická zpráva. Brno: Ministry of Industry and Trade of the Czech Republic, 2010.
Detail

ŽIžKA Josef, ČERNOCKý Jan, FAPšO Michal and SZőKE Igor. Web-Based Lecture Browser with Speech Search. In: Znalosti 2010. Sborník příspěvků 9. ročníku konference. Jindřichův Hradec: Fakulty of management and information, 2010, pp. 287-290. ISBN 978-80-245-1636-3.
Detail

SANTHOSH Kumar Chellappan Pillai, LI Haizhou, TONG Rong, MATěJKA Pavel, BURGET Lukáš and ČERNOCKý Jan. Tuning phone decoders for language identification. In: Proc. International Conference on Acoustics, Speech, and Signal Processing 2010. Dallas: IEEE Signal Processing Society, 2010, pp. 5010-5013. ISBN 978-1-4244-4296-6. ISSN 1520-6149.
Detail

BURGET Lukáš, SCHWARZ Petr, AGARWAL Mohit, AKYAZI Pinar, FENG Kai, GHOSHAL Arnab, GLEMBEK Ondřej, GOEL Nagendra K., KARAFIáT Martin, POVEY Daniel, RASTROW Ariya, ROSE Richard and THOMAS Samuel. Multilingual acoustic modeling for speech recognition based on Subspace Gaussian Mixture Models. In: Proc. International Conference on Acoustictics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010, pp. 4334-4337. ISBN 978-1-4244-4296-6. ISSN 1520-6149.
Detail

GHOSHAL Arnab, POVEY Daniel, AGARWAL Mohit, AKYAZI Pinar, BURGET Lukáš, FENG Kai, GLEMBEK Ondřej, GOEL Nagendra K., KARAFIáT Martin, RASTROW Ariya, ROSE Richard, SCHWARZ Petr and THOMAS Samuel. A novel estimation of feature-space MLLR for full_covariance models. In: Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010, pp. 4310-4313. ISBN 978-1-4244-4296-6. ISSN 1520-6149.
Detail

GOEL Nagendra K., THOMAS Samuel, AGARWAL Mohit, AKYAZI Pinar, BURGET Lukáš, FENG Kai, GHOSHAL Arnab, GLEMBEK Ondřej, KARAFIáT Martin, POVEY Daniel, RASTROW Ariya, ROSE Richard and SCHWARZ Petr. Approaches to automatic lexicon learning with limited training examples. In: Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010, pp. 5094-5097. ISBN 978-1-4244-4296-6. ISSN 1520-6149.
Detail

KOCKMANN Marcel, BURGET Lukáš and ČERNOCKý Jan. Investigations into prosodic syllable contour features for speaker recognition. In: Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010, pp. 4418-4421. ISBN 978-1-4244-4296-6. ISSN 1520-6149.
Detail

POVEY Daniel, BURGET Lukáš, AGARWAL Mohit, AKYAZI Pinar, FENG Kai, GHOSHAL Arnab, GLEMBEK Ondřej, GOEL Nagendra K., KARAFIáT Martin, RASTROW Ariya, ROSE Richard, SCHWARZ Petr and THOMAS Samuel. Subspace Gaussian mixture models for speech recognition. In: Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010, pp. 4330-4333. ISBN 978-1-4244-4296-6. ISSN 1520-6149.
Detail

ROSE Richard, NOROUZIAN Atta, REDDY Aarthi, COY Andre, GUPTA Vishwa and KARAFIáT Martin. Subword-based spoken term detection in audio course lectures. In: Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010, pp. 5282-5285. ISBN 978-1-4244-4296-6. ISSN 1520-6149.
Detail

MIKOLOV Tomáš, PLCHOT Oldřich, GLEMBEK Ondřej, MATěJKA Pavel, BURGET Lukáš and ČERNOCKý Jan. PCA-based Feature Extraction for Phonotactic Language Recognition. In: Proc. Odyssey 2010 - The Speaker and Language Recognition Workshop. Brno: International Speech Communication Association, 2010, pp. 251-255. ISBN 978-80-214-4114-9.
Detail

JANčíK Zdeněk, PLCHOT Oldřich, BRUMMER Johan Nikolaas Langenhoven, BURGET Lukáš, GLEMBEK Ondřej, HUBEIKA Valiantsina, KARAFIáT Martin, MATěJKA Pavel, MIKOLOV Tomáš, STRASHEIM Albert and ČERNOCKý Jan. Data selection and calibration issues in automatic language recognition - investigation with BUT-AGNITIO NIST LRE 2009 system. In: Proc. Odyssey 2010 - The Speaker and Language Recognition Workshop. Brno: International Speech Communication Association, 2010, pp. 215-221. ISBN 978-80-214-4114-9.
Detail

VESELý Karel, BURGET Lukáš and GRéZL František. Parallel Training of Neural Networks for Speech Recognition. In: Prof. Text, Speech and Dialogue 2010. LNAI 6231, vol. 2010. Brno: Springer Verlag, 2010, pp. 439-446. ISBN 978-3-642-15759-2. ISSN 0302-9743.
Detail

KARAFIáT Martin, SZőKE Igor and ČERNOCKý Jan. Using Gradient Descent Optimization for Acoustics Training from Heterogeneous Data. In: Proc. Text, Speech and Dialog 2010. LNAI 6231, vol. 2010. Brno: Springer Verlag, 2010, pp. 322-329. ISBN 978-3-642-15759-2. ISSN 0302-9743.
Detail

KOMBRINK Stefan, HANNEMANN Mirko, BURGET Lukáš and HEřMANSKý Hynek. Recovery of Rare Words in Lecture Speech. In: Proc. Text, Speech and Dialogue 2010. Brno: Springer Verlag, 2010, pp. 330-337. ISBN 978-3-642-15759-2. ISSN 0302-9743.
Detail

VESELý Karel. Parallel training of neural networks for speech recognition. In: Proceedings of the 16th Conference STUDENT EEICT 2010. Volume 3. Brno: Brno University of Technology, 2010, pp. 74-76. ISBN 978-80-214-4078-4.
Detail

BRUMMER Johan Nikolaas Langenhoven, BURGET Lukáš, KENNY Patrick, MATěJKA Pavel, DE Villiers Edward, KARAFIáT Martin, KOCKMANN Marcel, GLEMBEK Ondřej, PLCHOT Oldřich, BAUM Doris and SENOUSSAUOI Mohammed. ABC System description for NIST SRE 2010. In: Proc. NIST 2010 Speaker Recognition Evaluation. Brno: National Institute of Standards and Technology, 2010, pp. 1-20.
Detail

HANNEMANN Mirko, KOMBRINK Stefan, KARAFIáT Martin and BURGET Lukáš. Similarity Scoring for Recognizing Repeated Out-of-VocabularyWords. In: Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Makuhari, Chiba: International Speech Communication Association, 2010, pp. 897-900. ISBN 978-1-61782-123-3. ISSN 1990-9772.
Detail

KOCKMANN Marcel, BURGET Lukáš, GLEMBEK Ondřej, FERRER Luciana and ČERNOCKý Jan. Prosodic Speaker Verification using Subspace Multinomial Models with Intersession Compensation. In: Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Makuhari, Chiba, Japan: International Speech Communication Association, 2010, pp. 1061-1064. ISBN 978-1-61782-123-3. ISSN 1990-9772.
Detail

KOCKMANN Marcel, BURGET Lukáš and ČERNOCKý Jan. Brno University of Technology System for Interspeech 2010 Paralinguistic Challenge. In: Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Makuhari, Chiba: International Speech Communication Association, 2010, pp. 2822-2825. ISBN 978-1-61782-123-3. ISSN 1990-9772.
Detail

MIKOLOV Tomáš, KARAFIáT Martin, BURGET Lukáš, ČERNOCKý Jan and KHUDANPUR Sanjeev. Recurrent neural network based language model. In: Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Makuhari, Chiba: International Speech Communication Association, 2010, pp. 1045-1048. ISBN 978-1-61782-123-3. ISSN 1990-9772.
Detail

GRéZL František and KARAFIáT Martin. Hierarchical Neural Net Architectures for Feature Extraction in ASR. In: Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Makuhari, Chiba: International Speech Communication Association, 2010, pp. 1201-1204. ISBN 978-1-61782-123-3. ISSN 1990-9772.
Detail

VESELý Karel, BURGET Lukáš and GRéZL František. Parallel Training of Neural Networks for Speech Recognition. In: Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Makuhari, Chiba: International Speech Communication Association, 2010, pp. 2934-2937. ISSN 1990-9772.
Detail

HAIN Thomas, BURGET Lukáš, DINES John, GARNER Phillip N., EL Hannani Asmaa, HUIJBREGTS Marijn, KARAFIáT Martin, LINCOLN Mike and WAN Vincent. The AMIDA 2009 Meeting Transcription System. In: Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Makuhari, Chiba: International Speech Communication Association, 2010, pp. 358-361. ISBN 978-1-61782-123-3. ISSN 1990-9772.
Detail

KOMBRINK Stefan, HANNEMANN Mirko and BURGET Lukáš. Out-of-vocabulary word detection and beyond. In: ECML PKDD 2010 Proceedings and Journal Content. Barcelona, 2010, pp. 1-8.
Detail

ČERNOCKý Jan, SZőKE Igor, HANNEMANN Mirko and KOMBRINK Stefan. Word-subword based keyword spotting with implications in OOV detection. Pacific Grove: Institute of Electrical and Electronics Engineers, 2010.
Detail

MARCEL Sebastien, MCCOOL Christopher S., MATěJKA Pavel, ČERNOCKý Jan, KITTLER Joseph, GLEMBEK Ondřej, PLCHOT Oldřich, JANčíK Zdeněk, LARCHER Anthony and LéVY Christophe et al. On the Results of the First Mobile Biometry (MOBIO) Face and Speaker Verification Evaluation. In: Recognizing Patterns in Signals, Speech, Images, and Videos. Lecture Notes in Computer Science, vol. 6388. Istanbul: Springer Verlag, 2010, pp. 210-225. ISBN 978-3-642-17710-1. ISSN 0302-9743.
Detail

SZőKE Igor, GRéZL František, ČERNOCKý Jan and FAPšO Michal. Acoustic keyword spotter - optimization from end-user perspective. In: Proceedings of the 2010 IEEE Spoken Language Technology Workshop. IEEE Catalog Number: CFP 10SLT-USB. Berkeley, California: IEEE Signal Processing Society, 2010, pp. 177-181. ISBN 978-1-4244-7902-3.
Detail

SZőKE Igor, ČERNOCKý Jan, FAPšO Michal and ŽIžKA Josef. SPEECH@FIT LECTURE BROWSER. In: Proceedings of the 2010 IEEE Spoken Language Technology Workshop. IEEE Catalog Number: CFP 10SLT-USB. Berkeley, California: IEEE Signal Processing Society, 2010, pp. 157-158. ISBN 978-1-4244-7902-3.
Detail

TEJEDOR Javier, SZőKE Igor and FAPšO Michal. Novel Methods for Query Selection and Query Combination in Query-By-Example Spoken Term Detection. In: Proceedings of the ACM Multimedia 2010 International Conference. Copyright 2010 ACM 978-1-4503-0162-6/10/10. Florencie: Association for Computing Machinery, 2010, pp. 15-20. ISBN 978-1-60558-933-6.
Detail

KOMBRINK Stefan and HANNEMANN Mirko. Final system for identifying unexpected acoustic inputs (BUT). Brno: The Information Society Technologies (IST) 6th Framework programme, 2010.
Detail

MARCEL Sebastien and MATěJKA Pavel. MOBIO D6.6: Report on the MOBIO Final Prototypes. Martigny: Information and Communication Technologies (ICT) 7th Framework programme, 2010.
Detail

ČERNOCKý Jan. MOBIO D7.4: Second report on dissemination activities. Martigny: Information and Communication Technologies (ICT) 7th Framework programme, 2010.
Detail

MARCEL Sebastien, MCCOOL Christopher S., ČERNOCKý Jan, LéVY Christophe and LARCHER Anthony et al. MOBIO D1.2: Annual Report. Martigny: Information and Communication Technologies (ICT) 7th Framework programme, 2010.
Detail

2009

DEHAK Najim, KENNY Patrick, DEHAK Reda, GLEMBEK Ondřej, DUMOUCHEL Pierre, BURGET Lukáš, HUBEIKA Valiantsina and CASTALDO Fabio. Support vector machines and joint factor analysis for speaker verification. In: Proc. ICASSP 2009. Taiwan: IEEE Signal Processing Society, 2009, pp. 1-4. ISBN 978-1-4244-2354-5.
Detail

MIKOLOV Tomáš, KOPECKý Jiří, BURGET Lukáš, GLEMBEK Ondřej and ČERNOCKý Jan. Neural network based language models for highly inflective languages. In: Proc. ICASSP 2009. Taipei: IEEE Signal Processing Society, 2009, p. 4. ISBN 978-1-4244-2354-5.
Detail

GLEMBEK Ondřej, BURGET Lukáš, DEHAK Najim, BRüMMER Niko and KENNY Patrick. Comparison of Scoring Methods used in Speaker Recognition with Joint Factor Analysis. In: Proc. ICASSP 2009. Taipei: IEEE Signal Processing Society, 2009, p. 4. ISBN 978-1-4244-2354-5.
Detail

HUBEIKA Valiantsina. Speaker verification as a target-nontarget trial task. In: Proceedings of the 15th Conference and Competition STUDENT EEICT 2009. Brno: Faculty of Electrical Engineering and Communication BUT, 2009, p. 5. ISBN 978-80-214-3870-5.
Detail

KOMBRINK Stefan, BURGET Lukáš, MATěJKA Pavel, KARAFIáT Martin and HEřMANSKý Hynek. Posterior-based Out of Vocabulary Word Detection in Telephone Speech. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, pp. 80-83. ISSN 1990-9772.
Detail

GRéZL František, KARAFIáT Martin and BURGET Lukáš. Investigation into bottle-neck features for meeting speech recognition. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, pp. 2947-2950. ISBN 978-1-61567-692-7. ISSN 1990-9772.
Detail

GARNER Phillip N., DINES John, HAIN Thomas, EL Hannani Asmaa, KARAFIáT Martin, KORCHAGIN Danil, LINCOLN Mike, WAN Vincent and ZHANG Le. Real-Time ASR from Meetings. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, pp. 2119-2122. ISSN 1990-9772.
Detail

BURGET Lukáš, MATěJKA Pavel, HUBEIKA Valiantsina and ČERNOCKý Jan. Investigation into variants of Joint Factor Analysis for speaker recognition. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, pp. 1263-1266. ISBN 978-1-61567-692-7. ISSN 1990-9772.
Detail

BURGET Lukáš, FAPšO Michal, HUBEIKA Valiantsina, GLEMBEK Ondřej, KARAFIáT Martin, KOCKMANN Marcel, MATěJKA Pavel, SCHWARZ Petr and ČERNOCKý Jan. BUT system for NIST 2008 speaker recognition evaluation. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, pp. 2335-2338. ISBN 978-1-61567-692-7. ISSN 1990-9772.
Detail

BRüMMER Niko, STRASHEIM Albert, HUBEIKA Valiantsina, MATěJKA Pavel, BURGET Lukáš and GLEMBEK Ondřej. Discriminative Acoustic Language Recognition via Channel-Compensated GMM Statistics. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, pp. 2187-2190. ISBN 978-1-61567-692-7. ISSN 1990-9772.
Detail

KOCKMANN Marcel, BURGET Lukáš and ČERNOCKý Jan. Brno University of Technology System for Interspeech 2009 Emotion Challenge. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, pp. 348-351. ISSN 1990-9772.
Detail

VILLALBA Lopez Jesus Antonio. Segmentation Experiments for NIST SRE. Brno: Faculty of Information Technology BUT, 2009.
Detail

GRéZL František and ČERNOCKý Jan. Audio Surveillance through Known Event Classification. Radioengineering, vol. 18, no. 4, 2009, pp. 671-675. ISSN 1210-2512.
Detail

KAšPAR Michal, ŠEVEčKOVá Michaela, CHALUPNíčEK Kamil and ČERNOCKý Jan. Textové a řečové korpusy. Brno, 2009.
Detail

FAPšO Michal, SZőKE Igor and ČERNOCKý Jan. Hlasový přístup ke korpusům - experimenty. Brno: Ministry of Industry and Trade of the Czech Republic, 2009.
Detail

KAšPAR Michal, PEšáN Jan, SZőKE Igor, CHALUPNíčEK Kamil and ČERNOCKý Jan. Technická zpráva k MPO projektu FT-TA3/006: Práce na Etapě 6: Integrace. Brno: Ministry of Industry and Trade of the Czech Republic, 2009.
Detail

ČERNOCKý Jan, MATěJKA Pavel and GLEMBEK Ondřej. MOBIO D3.4: Description and evaluation of advanced algorithms for uni-modal authentication. Martigny: Information and Communication Technologies (ICT) 7th Framework programme, 2009.
Detail

ČERNOCKý Jan. MOBIO D7.3: First report on dissemination activities. Martigny: Information and Communication Technologies (ICT) 7th Framework programme, 2009.
Detail

BRüMMER Niko, BURGET Lukáš, GLEMBEK Ondřej, HUBEIKA Valiantsina, JANčíK Zdeněk, KARAFIáT Martin, MATěJKA Pavel, MIKOLOV Tomáš, PLCHOT Oldřich and STRASHEIM Albert. BUT-AGNITIO System Description for NIST Language Recognition Evaluation 2009. In: Proceedings NIST 2009 Language Recognition Evaluation Workshop. Baltimore, Maryland, USA: National Institute of Standards and Technology, 2009, pp. 1-7.
Detail

2008

BURGET Lukáš, SCHWARZ Petr, MATěJKA Pavel, HANNEMANN Mirko, RASTROW Ariya, WHITE Christopher, KHUDANPUR Sanjeev, HEřMANSKý Hynek and ČERNOCKý Jan. Combination of strongly and weakly constrained recognizers for reliable detection of OOVs. In: Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Las Vegas: IEEE Signal Processing Society, 2008, p. 4. ISBN 1-4244-1484-9.
Detail

GRéZL František and FOUSEK Petr. Optimizing bottle-neck features for LVCSR. In: 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing. Las Vegas, Nevada: IEEE Signal Processing Society, 2008, pp. 4729-4732. ISBN 1-4244-1484-9.
Detail

PINTO Joel, SZőKE Igor, PRASANNA S.R.M. and HEřMANSKý Hynek. Fast Approximate Spoken Term Detection from Sequence of Phonemes. In: The 31st Annual International ACM SIGIR Conference 20-24 July 2008, Singapore. Singapore: Association for Computing Machinery, 2008, pp. 28-33. ISBN 978-90-365-2697-5.
Detail

JANčíK Zdeněk. Modelování dynamiky prosodie pro rozpoznání řečníka. In: Proceedings of the 14th Conference STUDENT EEICT 2008. Volume 2. Brno: Faculty of Electrical Engineering and Communication BUT, 2008, pp. 67-69. ISBN 978-80-214-3615-2.
Detail

WHITE Christopher, ZWEIG Geoffrey, BURGET Lukáš, SCHWARZ Petr and HEřMANSKý Hynek. Confidence estimation, OOV detection and language ID using phone-to-word transduction and phone-level alignments. In: Proc. 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing. Las Vegas: IEEE Signal Processing Society, 2008, p. 4. ISBN 1-4244-1484-9.
Detail

PLCHOT Oldřich, HUBEIKA Valiantsina, BURGET Lukáš, SCHWARZ Petr and MATěJKA Pavel. Acquisition of Telephone Data from Radio Broadcasts with Applications to Language Recognition. In: Proc. 11th International Conference on Text, Speech and Dialogue. Berlin: Springer Verlag, 2008, pp. 477-483. ISBN 978-3-540-87390-7.
Detail

KOPECKý Jiří, GLEMBEK Ondřej and KARAFIáT Martin. Advances in Acoustic Modeling for the Recognition of Czech. In: Proc. 11th International Conference on Text, Speech and Dialogue. Lecture Notes in Computer Science, vol. 5246. Berlin: Springer Verlag, 2008, pp. 357-363. ISBN 978-3-540-87390-7.
Detail

MATěJKA Pavel, BURGET Lukáš, GLEMBEK Ondřej, SCHWARZ Petr, HUBEIKA Valiantsina, FAPšO Michal, MIKOLOV Tomáš, PLCHOT Oldřich and ČERNOCKý Jan. BUT language recognition system for NIST 2007 evaluations. In: Proc. Interspeech 2008. Brisbane, Australia: International Speech Communication Association, 2008, p. 4. ISSN 1990-9772.
Detail

HUBEIKA Valiantsina, BURGET Lukáš, MATěJKA Pavel and SCHWARZ Petr. Discriminative Training and Channel Compensation for Acoustic Language Recognition. In: Proc. Interspeech 2008. Brisbane: International Speech Communication Association, 2008, p. 4. ISSN 1990-9772.
Detail

GLEMBEK Ondřej, MATěJKA Pavel, BURGET Lukáš and MIKOLOV Tomáš. Advances in Phonotactic Language Recognition. In: Proc. Interspeech 2008. Brisbane: International Speech Communication Association, 2008, p. 4. ISSN 1990-9772.
Detail

KARAFIáT Martin, BURGET Lukáš, HAIN Thomas and ČERNOCKý Jan. Discrimininative training of narrow band - wide band adaptated systems for meeting recognition. In: Proc. Interspeech 2008. Brisbane: International Speech Communication Association, 2008, p. 4. ISSN 1990-9772.
Detail

SZőKE Igor, FAPšO Michal, BURGET Lukáš and ČERNOCKý Jan. Hybrid word-subword decoding for spoken term detection. In: Proc. SSCS 2008: Speech search workshop at SIGIR. Singapore: Association for Computing Machinery, 2008, p. 4. ISBN 978-90-365-2697-5.
Detail

KOCKMANN Marcel and BURGET Lukáš. Syllable based Feature-Contours for Speaker Recognition. In: Proc. 14th International Workshop on Advances in Speech Technology. Maribor, 2008, p. 4.
Detail

BURGET Lukáš, FAPšO Michal, HUBEIKA Valiantsina, GLEMBEK Ondřej, KARAFIáT Martin, KOCKMANN Marcel, MATěJKA Pavel, SCHWARZ Petr and ČERNOCKý Jan. BUT system description: NIST SRE 2008. In: Proc. 2008 NIST Speaker Recognition Evaluation Workshop. Montreal: National Institute of Standards and Technology, 2008, pp. 1-4.
Detail

BURGET Lukáš, FAPšO Michal, HUBEIKA Valiantsina, GLEMBEK Ondřej, KARAFIáT Martin, KOCKMANN Marcel, MATěJKA Pavel, SCHWARZ Petr and ČERNOCKý Jan. Brno University Of Technology - NIST 2008 SRE. Montreal, 2008.
Detail

MIKOLOV Tomáš. LANGUAGE MODELS FOR AUTOMATIC SPEECH RECOGNITION OF CZECH LECTURES. In: Proc. STUDENT EEICT 2008. Brno: Faculty of Electrical Engineering and Communication BUT, 2008, pp. 1-5. ISBN 978-80-214-3617-6.
Detail

SZőKE Igor, BURGET Lukáš, ČERNOCKý Jan and FAPšO Michal. Sub-word modeling of out of vocabulary words in spoken term detection. In: Proc. 2008 IEEE Workshop on Spoken Language Technology. Goa: IEEE Signal Processing Society, 2008, p. 4. ISBN 978-1-4244-3472-5.
Detail

KOCKMANN Marcel and BURGET Lukáš. Contour modeling of prosodic and acoustic features for speaker recognition. In: Proc. 2008 IEEE Workshop on Spoken Language Technology. Goa: IEEE Signal Processing Society, 2008, p. 4. ISBN 978-1-4244-3472-5.
Detail

OPARIN Ilya, GLEMBEK Ondřej, BURGET Lukáš and ČERNOCKý Jan. Morphological random forests for language modeling of inflectional languages. In: Proc. 2008 IEEE Workshop on Spoken Language Technology. Goa: IEEE Signal Processing Society, 2008, p. 4. ISBN 978-1-4244-3472-5.
Detail

BURGET Lukáš, BRüMMER Niko, REYNOLDS Douglas, KENNY Patrick, PELECANOS Jason, VOGT Robbie, CASTALDO Fabio, DEHAK Najim, DEHAK Reda, GLEMBEK Ondřej, KARAM Zahi, NOECKER John Jr., NA Hye Young, COSTIN Ciprian C., HUBEIKA Valiantsina, KAJAREKAR Sachin, SCHEFFER Nicolas and ČERNOCKý Jan. Robust Speaker Recognition Over Varying Channels. Baltimore: Johns Hopkins University, 2008.
Detail

KOMBRINK Stefan. OOV detection in LVCSR using neural networks. In: Proc. STUDENT EEICT 2008. Brno: Faculty of Electrical Engineering and Communication BUT, 2008, p. 3. ISBN 978-80-214-3617-6.
Detail

SZőKE Igor, FAPšO Michal and ČERNOCKý Jan. Hlasový přístup ke korpusům - studie. Brno: Ministry of Industry and Trade of the Czech Republic, 2008.
Detail

ČERNOCKý Jan and MATěJKA Pavel. MOBIO D3.2: Report on the description and evaluation of baseline algorithms for unimodal authentication. Martigny: Information and Communication Technologies (ICT) 7th Framework programme, 2008.
Detail

ČERNOCKý Jan and MATěJKA Pavel. MOBIO D7.1: Planning of evaluation campaigns. Martigny: Information and Communication Technologies (ICT) 7th Framework programme, 2008.
Detail

2007

GRéZL František, KARAFIáT Martin, KONTáR Stanislav and ČERNOCKý Jan. Probabilistic and bottle-neck features for LVCSR of meetings. In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007). Hononulu: IEEE Signal Processing Society, 2007, pp. 757-760. ISBN 1-4244-0728-1.
Detail

MATěJKA Pavel, BURGET Lukáš, SCHWARZ Petr, GLEMBEK Ondřej, KARAFIáT Martin, GRéZL František, ČERNOCKý Jan, VAN Leeuwen David, BRüMMER Niko and STRASHEIM Albert. STBU system for the NIST 2006 speaker recognition evaluation. In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007). Honolulu: IEEE Signal Processing Society, 2007, pp. 221-224. ISBN 1-4244-0728-1.
Detail

BURGET Lukáš, MATěJKA Pavel, SCHWARZ Petr, GLEMBEK Ondřej and ČERNOCKý Jan. Analysis of feature extraction and channel compensation in GMM speaker recognition system. IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 7, 2007, pp. 1979-1986. ISSN 1558-7916.
Detail

GRéZL František, KARAFIáT Martin and ČERNOCKý Jan. Neural network topologies and bottle neck features in speech recognition. Brno, 2007.
Detail

GRéZL František and ČERNOCKý Jan. TRAP-based Techniques for Recognition of Noisy Speech. In: Proc. 10th International Conference on Text Speech and Dialogue (TSD 2007). LNCS. Berlin: Springer Verlag, 2007, pp. 270-277. ISBN 978-3-540-74627-0.
Detail

KARAFIáT Martin, BURGET Lukáš, ČERNOCKý Jan and HAIN Thomas. Real-Time ASR from Meetings. In: Proc. INTERSPEECH 2007. Antwerpen: International Speech Communication Association, 2007, p. 4. ISSN 1990-9772.
Detail

SINISCALCHI Sabato M., SCHWARZ Petr and LEE Chin-Hui. High-accuracy phone recognition by combining high performance lattice generation and knowledge based rescoring. In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007). Hononulu: IEEE Signal Processing Society, 2007, pp. 869-872. ISBN 1-4244-0728-1.
Detail

HAIN Thomas, WAN Vincent, BURGET Lukáš, KARAFIáT Martin, DINES John, VEPA Jithendra, GARAU Giulia and LINCOLN Mike. The AMI System for the Transcription of Speech in Meetings. In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007). Hononulu: IEEE Signal Processing Society, 2007, pp. 357-360. ISBN 1-4244-0728-1.
Detail

ČERNOCKý Jan, SZőKE Igor, FAPšO Michal, KARAFIáT Martin, BURGET Lukáš, KOPECKý Jiří, GRéZL František, SCHWARZ Petr, GLEMBEK Ondřej, OPARIN Ilya, SMRž Pavel and MATěJKA Pavel. Search in speech for public security and defense. In: Proc. IEEE Workshop on Signal Processing Applications for Public Security and Forensics, 2007 (SAFE '07). Washington D.C.: IEEE Signal Processing Society, 2007, pp. 1-7. ISBN 1-4244-1226-9.
Detail

ČERNOCKý Jan, BURGET Lukáš, SCHWARZ Petr, MATěJKA Pavel, KARAFIáT Martin, GLEMBEK Ondřej, KOPECKý Jiří, SZőKE Igor, FAPšO Michal, GRéZL František, HUBEIKA Valiantsina and OPARIN Ilya. Search in speech, language identification and speaker recognition in Speech@FIT. In: Proc. 17th International Conference Radioelektronika, 2007. Brno: Department of Radioelectronics FEEC BUT, 2007, pp. 1-6. ISBN 978-80-214-3390-8.
Detail

SZőKE Igor, FAPšO Michal, KARAFIáT Martin, BURGET Lukáš, GRéZL František, SCHWARZ Petr, GLEMBEK Ondřej, MATěJKA Pavel, KOPECKý Jiří and ČERNOCKý Jan. Spoken Term Detection System Based on a Combination of LVCSR and Phonetic Search. Brno, 2007.
Detail

SZőKE Igor, BURGET Lukáš and KARAFIáT Martin. Combination of Word and Phoneme Approach for Spoken Term Detection. Brno, 2007.
Detail

HUBEIKA Valiantsina, BURGET Lukáš, MATěJKA Pavel and ČERNOCKý Jan. Channel Compensation for Speaker Recognition. Brno, 2007.
Detail

HUBEIKA Valiantsina, SZőKE Igor, BURGET Lukáš and ČERNOCKý Jan. Maximum Likelihood and Maximum Mutual Information Training in Gender and Age Recognition System. In: Proc. 10th International Conference on Text Speech and Dialogue (TSD 2007). Pilsen: Springer Verlag, 2007, pp. 1-6. ISBN 978-3-540-74627-0.
Detail

BRüMMER Niko, BURGET Lukáš, ČERNOCKý Jan, GLEMBEK Ondřej, GRéZL František, KARAFIáT Martin, VAN Leeuwen David, MATěJKA Pavel, SCHWARZ Petr and STRASHEIM Albert. Fusion of heterogeneous speaker recognition systems in the STBU submission for the NIST speaker recognition evaluation 2006. IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 7, 2007, pp. 2072-2084. ISSN 1558-7916.
Detail

HUBEIKA Valiantsina. Estimation of gender and age. In: Proc. 13th Conference STUDENT EEICT 2007. Brno: Faculty of Electrical Engineering and Communication BUT, 2007, pp. 1-3. ISBN 9788021434103.
Detail

FAPšO Michal. Search in speech records. In: Proc. 13th Conference STUDENT EEICT 2007. Brno: Faculty of Electrical Engineering and Communication BUT, 2007, pp. 1-3. ISBN 978-80-214-3410-3.
Detail

VESELý Karel. Hybrid recognizer of isolated words. In: Proc. 13th Conference STUDENT EEICT 2007. Brno: Faculty of Electrical Engineering and Communication BUT, 2007, pp. 1-3. ISBN 9788021434103.
Detail

HRDLIčKA Pavel. Isolated word recognition. In: Proc. 13th Conference STUDENT EEICT 2007. Brno: Faculty of Electrical Engineering and Communication BUT, 2007, pp. 1-3. ISBN 9788021434103.
Detail

MIKOLOV Tomáš. Language modeling of Czech using neural networks. In: Proc. 13th Conference STUDENT EEICT 2007. Brno: Faculty of Electrical Engineering and Communication BUT, 2007, pp. 1-3. ISBN 9788021434103.
Detail

MIKOLOV Tomáš, OPARIN Ilya, GLEMBEK Ondřej, BURGET Lukáš, KARAFIáT Martin and ČERNOCKý Jan. Použití mluvených korpusů ve vývoji systému pro rozpoznávání českých přednášek. Praha: Charles University, 2007.
Detail

MATěJKA Pavel, BURGET Lukáš, GLEMBEK Ondřej, SCHWARZ Petr, HUBEIKA Valiantsina, FAPšO Michal, MIKOLOV Tomáš and PLCHOT Oldřich. BUT system description for NIST LRE 2007. In: Proc. 2007 NIST Language Recognition Evaluation Workshop. Orlando: National Institute of Standards and Technology, 2007, pp. 1-5.
Detail

HEřMANSKý Hynek, BURGET Lukáš, SCHWARZ Petr, MATěJKA Pavel, HANNEMANN Mirko, RASTROW Ariya, WHITE Christopher, KHUDANPUR Sanjeev and ČERNOCKý Jan. Recovery from Model Inconsistency in Multilingual Speech Recognition. Baltimore: Johns Hopkins University, 2007.
Detail

CHALUPNíčEK Kamil, ČERNOCKý Jan, KOSTKA Martin, PAVELEK Tomáš and VšIANSKý Jan. Automatické hodnocení výslovnosti. Brno: Ministry of Industry and Trade of the Czech Republic, 2007.
Detail

GRéZL František, HRDLIčKA Pavel, VESELý Karel, CHALUPNíčEK Kamil, ČERNOCKý Jan, KOSTKA Martin, PAVELEK Tomáš and VšIANSKý Jan. Vyhledávání slovníkových hesel hlasem. Brno: Ministry of Industry and Trade of the Czech Republic, 2007.
Detail

2006

FAPšO Michal, SMRž Pavel, SCHWARZ Petr, SZőKE Igor, SCHWARZ Milan, ČERNOCKý Jan, KARAFIáT Martin and BURGET Lukáš. Information Retrieval from Spoken Documents. In: Proceedings of the Seventh International Conference on Intelligent Text Processing and Computational Linguistics (CICLING 2006). Mexico City: Springer Verlag, 2006, pp. 410-416. ISBN 3-540-32205-1.
Detail

FAPšO Michal, SCHWARZ Petr, SZőKE Igor, SMRž Pavel, SCHWARZ Milan, ČERNOCKý Jan, KARAFIáT Martin and BURGET Lukáš. Search Engine for Information Retrieval from Speech Records. In: Proceedings of the Third International Seminar on Computer Treatment of Slavic and East European Languages. Bratislava, 2006, pp. 100-101.
Detail

SZőKE Igor. Keyword Spotting in Meeting Data. In: Proceedings of the 12th Conference Student EEICT 2006 Volume 4. Brno: Faculty of Electrical Engineering and Communication BUT, 2006, pp. 440-444. ISBN 80-214-3163-6.
Detail

BURGET Lukáš, ČERNOCKý Jan, FAPšO Michal, KARAFIáT Martin, MATěJKA Pavel, SCHWARZ Petr, SMRž Pavel and SZőKE Igor. Indexing and search methods for spoken documents. In: Proceedings of the Ninth International Conference on Text, Speech and Dialogue, TSD 2006. LNCS. Berlin: Springer Verlag, 2006, pp. 351-358. ISSN 0302-9743.
Detail

MATěJKA Pavel, SCHWARZ Petr, BURGET Lukáš and ČERNOCKý Jan. Use of anti-models to furher improve state-of-the-art PRLM language recognition system. In: Proceedings of ICASSP 2006. Toulouse, 2006, pp. 197-200.
Detail

BURGET Lukáš, MATěJKA Pavel and ČERNOCKý Jan. Discriminative Training Techniques for Acoustic Language Identification. In: Proceedings of ICASSP 2006. Toulouse, 2006, pp. 209-212.
Detail

SCHWARZ Petr, MATěJKA Pavel and ČERNOCKý Jan. Hierarchical structures of neural networks for phoneme recognition. In: Proceedings of ICASSP 2006. Toulouse, 2006, pp. 325-328.
Detail

MATěJKA Pavel, BURGET Lukáš, SCHWARZ Petr and ČERNOCKý Jan. Brno University of Technology System for NIST 2005 Language Recognition Evaluation. In: Proceedings of Odyssey 2006: The Speaker and Language Recognition Workshop. San Juan, 2006, pp. 57-64. ISBN 1-4244-0472-X.
Detail

MATěJKA Pavel, BURGET Lukáš, SCHWARZ Petr and ČERNOCKý Jan. NIST 2005 Language Recognition Evaluation. In: Proceedings of NIST LRE 2005. Washington DC: National Institute of Standards and Technology, 2006, pp. 1-37.
Detail

MATěJKA Pavel, BURGET Lukáš, SCHWARZ Petr and ČERNOCKý Jan. NIST Speaker Recognition Evaluation 2006. In: Proceedings of NIST Speaker Recognition Evaluation 2006. San Juan: National Institute of Standards and Technology, 2006, pp. 1-40.
Detail

KONTáR Stanislav. Parallel training of neural networks for speech recognition. In: Proc. 12th International Conference on Soft Computing MENDEL'06. Brno: Brno University of Technology, 2006, p. 6. ISBN 80-214-3195-4.
Detail

GLEMBEK Ondřej, KARAFIáT Martin, BURGET Lukáš and ČERNOCKý Jan. Czech Speech Recognizer for Multiple Environments. In: Radioeletronika 2006. Bratislava, 2006, pp. 1-4.
Detail

ČERNOCKý Jan, MATěJKA Pavel, BURGET Lukáš and SCHWARZ Petr. Automatic Language Identification System. In: Sborník příspěvků z odborného semináře "Nové technologie v radiokomunikacích". Brno: University of Defence in Brno, 2006, pp. 1-6.
Detail

HUBEIKA Valiantsina. Estimation of Gender and Age from Recorded Speech. In: Proc. ACM Student Research competition 2006. Prague: Czech Technical University, 2006, pp. 25-32. ISBN 80-01-03595-6.
Detail

KARAFIáT Martin, GRéZL František, SCHWARZ Petr, BURGET Lukáš and ČERNOCKý Jan. Robust heteroscedastic linear discriminant analysis and LCRC posterior features in large vocabulary continuous speech recognition. In: Proc. Fifth Slovenian and First International Language Technologies Conference. Ljubljana, 2006, pp. 1-4.
Detail

SMRž Pavel. Uncertainty Extensions to Ontologies as a Tool for Semantic Interpretation in Audiovisual Systems. In: Proceedings of the 1st International Conference on Semantic and Digital Media Technologies, Poster and Demo. Athens, 2006, pp. 27-28.
Detail

KARAFIáT Martin, GRéZL František, SCHWARZ Petr, BURGET Lukáš and ČERNOCKý Jan. Robust heteroscedastic linear discriminant analysis and LCRC posterior features in meeting data recognition. In: Proc. 3nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006). Lecture Notes in Computer Science, vol. 4299. Berlin: Springer Verlag, 2006, pp. 275-284. ISBN 3-540-69267-3.
Detail

AL-HAMES Marc, HAIN Thomas, ČERNOCKý Jan, SCHREIBER Sascha, POEL Mannes, MüLLER Ronald, MARCEL Sebastien, VAN Leeuwen David, ODOBEZ Jean-Marc, BA Sileye, BOURLARD Herve, CARDINAUX Fabien, GATICA-PEREZ Daniel, JANIN Adam, MOTLíčEK Petr, REITER Stephan, RENALS Steve, VAN Rest Jeroen, RIENKS Rutger, RIGOLL Gerhard, SMITH Kevin, THEAN Andrew and ZEMčíK Pavel. Audio-Visual Processing in Meetings: Seven Questions and Current AMI Answers. In: Proc. 3nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006). Washington D.C., 2006, p. 12.
Detail

HAIN Thomas, BURGET Lukáš, DINES John, GARAU Giulia, KARAFIáT Martin, LINCOLN Mike and WAN Vincent. The AMI Meeting Transcription System. In: Proc. NIST Rich Transcription 2006 Spring Meeting Recognition Evaluation Worskhop. Washington D.C.: National Institute of Standards and Technology, 2006, p. 12.
Detail

SZőKE Igor, FAPšO Michal, KARAFIáT Martin, BURGET Lukáš, GRéZL František, SCHWARZ Petr, GLEMBEK Ondřej, MATěJKA Pavel, KONTáR Stanislav and ČERNOCKý Jan. BUT System for NIST STD 2006 - English. In: Proc. NIST SPoken Term Detection Evaluation workshop (STD 2006). Washington D.C.: National Institute of Standards and Technology, 2006, p. 26.
Detail

KOPECKý Jiří, SZőKE Igor, FAPšO Michal, KARAFIáT Martin, BURGET Lukáš, OPARIN Ilya, SCHWARZ Petr, MATěJKA Pavel, ČERNOCKý Jan and GLEMBEK Ondřej. BUT System for NIST STD 2006 - Arabic. In: Proc. NIST SPoken Term Detection Evaluation workshop (STD 2006). Washington D.C.: National Institute of Standards and Technology, 2006, p. 15.
Detail

ČERNOCKý Jan, POTúčEK Igor, SUMEC Stanislav and ZEMčíK Pavel et al. AMI Mobile Meeting Capture and Analysis System. Washington, 2006.
Detail

STOLCKE Andreas, GRéZL František, HWANG Mei-Yuh, LEI Xin, MORGAN Nelson and VERGYRI Dimitra. Cross-Domain and Cross-Language Portability of Acoustic Features Estimated by Multilayer Perceptrons. In: 2006 IEEE International Conference on Acoustic, Speech, and Signal Processing. Toulouse: IEEE Signal Processing Society, 2006, pp. 321-324. ISBN 978-3-540-74627-0.
Detail

2005

MATěJKA Pavel, SCHWARZ Petr, ČERNOCKý Jan and CHYTIL Pavel. Tuning Phonotactic Language Identificaion System. Brno: Faculty of Information Technology BUT, 2005.
Detail

MATěJKA Pavel. Phoneme Recognition Tuning for Language Identification System. In: Proceedings of the 11th conference STUDENT EEICT 2005. Brno: Faculty of Electrical Engineering and Communication BUT, 2005, pp. 658-653. ISBN 80-214-2890-2.
Detail

MATěJKA Pavel, SCHWARZ Petr, ČERNOCKý Jan and CHYTIL Pavel. Phonotactic Language Identification. In: Proceedings of Radioelektronika 2005. Brno: Faculty of Electrical Engineering and Communication BUT, 2005, pp. 140-143. ISBN 80-214-2904-6.
Detail

MATěJKA Pavel, SCHWARZ Petr, ČERNOCKý Jan and CHYTIL Pavel. Phonotactic Language Identification using High Quality Phoneme Recognition. In: Interspeech'2005 - Eurospeech - 9th European Conference on Speech Communication and Technology. Lisbon: International Speech Communication Association, 2005, pp. 2237-2240. ISSN 1018-4074.
Detail

SZőKE Igor. Smooth Pitch Tracker Based on Harmonic and Noise Model. In: STUDENT EEICT 2005. Brno: Faculty of Information Technology BUT, 2005, pp. 673-677. ISBN 80-214-2890-2.
Detail

SZőKE Igor, SCHWARZ Petr, BURGET Lukáš, KARAFIáT Martin and ČERNOCKý Jan. Phoneme based acoustics keyword spotting in informal continuous speech. In: Radioelektronika 2005. Brno: Faculty of Electrical Engineering and Communication BUT, 2005, pp. 195-198. ISBN 80-214-2904-6.
Detail

MOTLíčEK Petr, BURGET Lukáš and ČERNOCKý Jan. VISUAL FEATURES FOR MULTIMODAL SPEECH RECOGNITION. In: Radioelektronika 2005. Brno: Faculty of Electrical Engineering and Communication BUT, 2005, pp. 187-190. ISBN 80-214-2904-6.
Detail

SMRž Pavel and FAPšO Michal. Vyhledávání v záznamech přednášek. In: Sborník semináře Technologie pro e-vzdělávání. Praha: Czech Technical University, 2005, pp. 21-26. ISBN 80-01-03274-4.
Detail

SZőKE Igor, SCHWARZ Petr, BURGET Lukáš, KARAFIáT Martin, MATěJKA Pavel and ČERNOCKý Jan. Phoneme Based Acoustics Keyword Spotting in Informal Continuous Speech. Lecture Notes in Computer Science, vol. 2005, no. 3658, p. 8. ISSN 0302-9743.
Detail

SZőKE Igor, SCHWARZ Petr, BURGET Lukáš, FAPšO Michal, KARAFIáT Martin, ČERNOCKý Jan and MATěJKA Pavel. Comparison of Keyword Spotting Approaches for Informal Continuous Speech. In: Interspeech'2005 - Eurospeech - 9th European Conference on Speech Communication and Technology. Lisabon, 2005, pp. 633-636. ISSN 1018-4074.
Detail

SZőKE Igor, SCHWARZ Petr, MATěJKA Pavel, BURGET Lukáš, FAPšO Michal, KARAFIáT Martin and ČERNOCKý Jan. Comparison of Keyword Spotting Approaches for Informal Continuous Speech. In: 2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms. Edinburgh, 2005, p. 12.
Detail

ZHU Qifeng, CHEN Barry, GRéZL František and MORGAN Nelson. Improved MLP Structures for Data-Driven Feature Extraction for ASR. In: Interspeech'2005 - Eurospeech - 9th European Conference on Speech Communication and Technology. Lisabon, 2005, p. 4. ISSN 1018-4074.
Detail

STOLCKE Andreas, ANGUERA Xavier, BOAKYE Kofi, CETIN Özgür, GRéZL František, JANIN Adam, MANDAL Arindam, PESKIN Barbara, WOOTERS Chuck and ZHENG Jing. Further Progress in Meeting Recognition: The ICSI-SRI Spring 2005 Speech-to-Text Evaluation System. In: Machine Learning for Multimodal Interaction, Second International Workshop, MLMI 2005, Edinburgh, UK, July 11-13, 2005, Revised Selected Papers. Lecture Notes in Computer Science 3869, Springer 2006. Edinburgh, Scotland: University of Edinburgh, 2005, pp. 463-475. ISBN 978-3-540-32549-9.
Detail

GRéZL František. Spectral plane investigation for probabilistic features for ASR. Edinburgh, 2005.
Detail

FAPšO Michal, SCHWARZ Petr, SZőKE Igor, ČERNOCKý Jan, SMRž Pavel, BURGET Lukáš and KARAFIáT Martin. Search Engine for Information Retrieval from Multi-modal Records. Edinburgh, 2005.
Detail

KARAFIáT Martin, BURGET Lukáš and ČERNOCKý Jan. Using Smoothed Heteroscedastic Linear Discriminant Analysis in Large Vocabulary Continuous Speech Recognition System. In: 2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms. tento článek nebyl zařazen mezi Revised Selected Papers, nevyšel v LNCS 3869. Edinbourgh, Scotland: University of Edinburgh, 2005, p. 8.
Detail

HAIN Thomas, KARAFIáT Martin, DINES John, MCCOWAN Iain, LINCOLN Mike, GARAU Giulia, WAN Vincent, ORDELMAN Roeland and RENALS Steve. The Development of the AMI System for the Transcription of Speech in Meetings. In: Machine Learning for Multimodal Interaction, Second International Workshop, MLMI 2005, Edinburgh, UK, July 11-13, 2005, Revised Selected Papers. Lecture Notes in Computer Science Volume 3869, Springer 2006. Edinburgh: University of Edinburgh, 2005, pp. 344-356. ISBN 978-3-540-32549-9.
Detail

HAIN Thomas, KARAFIáT Martin, GARAU Giulia, MOORE Darren, WAN Vincent, ORDELMAN Roeland and RENALS Steve. Transcription of Conference Room Meetings: an Investigation. In: Interspeech'2005 - Eurospeech - 9th European Conference on Speech Communication and Technology. Lisabon: International Speech Communication Association, 2005, p. 4. ISSN 1018-4074.
Detail

HAIN Thomas, BURGET Lukáš, DINES John, GARAU Giulia, KARAFIáT Martin, LINCOLN Mike, MCCOWAN Iain, MOORE Darren, WAN Vincent, ORDELMAN Roeland and RENALS Steve. The 2005 AMI System for the Transcription of Speech in Meetings. In: Machine Learning for Multimodal Interaction, Second International Workshop, MLMI 2005, Edinburgh, UK, July 11-13, 2005, Revised Selected Papers. Lecture Notes in Computer Science Volume 3869, Springer 2006. Edinburgh: University of Edinburgh, 2005, pp. 450-462. ISBN 978-3-540-32549-9.
Detail

ASHBY Simone, BOURBAN Sebastien, CARLETTA Jean, FLYNN Mike, GUILLEMOT Mael, HAIN Thomas, KADLEC Jaroslav, KARAISKOS Vasilis, KRAAIJ Wessel, KRONENTHAL Melissa, LATHOUD Guillaume, LINCOLN Mike, LISOWSKA Agnes, MCCOWAN Iain, POST Wilfried, REIDSMA Dennis and WELLNER Pierre. The AMI Meeting Corpus: A Pre-Announcement. In: Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI). Edinburgh, 2005, p. 4.
Detail

MOTLíčEK Petr, BURGET Lukáš and ČERNOCKý Jan. Non-parametric Speaker Turn Segmentation of Meeting Data. In: Interspeech'2005 - Eurospeech - 9th European Conference on Speech Communication and Technology. Lisabon: International Speech Communication Association, 2005, pp. 657-660. ISSN 1018-4074.
Detail

ASHBY Simone, BOURBAN Sebastien, CARLETTA Jean, FLYNN Mike, GUILLEMOT Mael, HAIN Thomas, KADLEC Jaroslav, KARAISKOS Vasilis, KRAAIJ Wessel, KRONENTHAL Melissa, LATHOUD Guillaume, LINCOLN Mike, LISOWSKA Agnes, MCCOWAN Iain, POST Wilfried, REIDSMA Dennis and WELLNER Pierre. The AMI Meeting Corpus. In: Measuring Behavior 2005 Proceedings Book. Wageningen, 2005, p. 4.
Detail

ČERNOCKý Jan and LAMPA Petr. Teaching signals - making it automatic, making it fun. In: Proc. Radioelektronika 2005. Brno: Faculty of Electrical Engineering and Communication BUT, 2005, p. 4. ISBN 80-214-2904-6.
Detail

GRéZL František. Adaptation of Unknown Data to Already Trained Speech Recognition System. In: Sborník prací konference a souteze Student EEICT 2005. Brno: Faculty of Information Technology BUT, 2005, p. 4. ISBN 80-214-2890-2.
Detail

ČERNOCKý Jan and CHALUPNíčEK Kamil. Checks of speech transcriptions for AMI meeting database. In: 10th International Conference SPEECH and COMPUTER. Moscow, 2005, pp. 587-590. ISBN 5-7452-0110-1.
Detail

CHALUPNíčEK Kamil. Checks of speech annotation of AMI meetings. In: Sborník prací konference a souteze Student EEICT 2005. Brno: Faculty of Information Technology BUT, 2005, pp. 612-616. ISBN 80-214-2890-2.
Detail

2004

GRéZL František. Combinations of TRAP-based systems. In: Proc. Seventh International conference on Text, Speech and Dialogue. Brno: Faculty of Informatics MU, 2004, pp. 323-330. ISBN 3-540-23049-1.
Detail

MOTLíčEK Petr. Modelování spektra a časových trajektorií v rozpoznávání řeči. GACR 102/02/0124 "Hlasové technologie v podpoře informační společnosti", souhrnný přehled aktivit řešitelských kolektivů. Praha, 2004. ISBN 80-01-02957-3.
Detail

SZőKE Igor and MOTLíčEK Petr. Kódování řeči na velmi nízkých bitových rychlostech. GACR 102/02/0124 "Hlasové technologie v podpoře informační společnosti", souhrnný přehled aktivit řešitelských klektivů. Praha: Faculty of Electrical Engineering, Czech Technical University, 2004. ISBN 80-01-02957-3.
Detail

SZőKE Igor. Speech units automatically generated by ergodic hidden Markov model. In: Proceedings of 10th Conference and Competition STUDENT EEICT 2004. Brno: Faculty of Electrical Engineering and Communication BUT, 2004, p. 5.
Detail

MATěJKA Pavel, SZőKE Igor, SCHWARZ Petr and ČERNOCKý Jan. Automatic Language Identification using Phoneme and Automatically Derived Unit Strings. In: Proceedings of 7th International Conference Text,Speech and Dialoque 2004. Brno: Springer Verlag, 2004, p. 8. ISBN 3-540-23049-1.
Detail

SCHWARZ Petr, MATěJKA Pavel and ČERNOCKý Jan. Towards Lower Error Rates In Phoneme Recognition. Lecture Notes in Computer Science, vol. 2004, no. 3206, pp. 465-472. ISBN 3-540-23049-1. ISSN 0302-9743.
Detail

MOTLíčEK Petr, BURGET Lukáš and ČERNOCKý Jan. PHONEME RECOGNITION OF MEETINGS USING AUDIO-VISUAL DATA. AMI Workshop. Martigny, 2004.
Detail

KARAFIáT Martin, GRéZL František and ČERNOCKý Jan. TRAP based features for LVCSR of meeting data. In: Proc. 8th International Conference on Spoken Language Processing. Jeju Island: Sunjin Printing Co,, 2004, pp. 437-440. ISSN 1225-4111.
Detail

BURGET Lukáš. Combination of Speech Features Using Smoothed Heteroscedastic Linear Discriminant Analysis. In: Proc. 8th International Conference on Spoken Language Processing. Jeju island: Sunjin Printing Co,, 2004, pp. 2549-2552.
Detail

MOTLíčEK Petr and ČERNOCKý Jan. Multimodal Phoneme Recognition of Meeting Data. In: 7th International Conference, TSD 2004 Brno, Czech Republic, September 2004 Proceedings. Brno: Springer Verlag, 2004, pp. 379-384. ISBN 3-540-23049-1. ISSN 0302-9743.
Detail

BURGET Lukáš. Measurement of Complementarity of Recognition Systems. In: Proc. Seventh International conference on Text, Speech and Dialogue. Lecture Notes in Artificial Intelligence (LNAI) subseries of LNCS series as Volume 3206. Brno: Springer Verlag, 2004, pp. 283-290. ISBN 3-540-23049-1.
Detail

FOUSEK Petr, SVOJANOVSKý Petr, GRéZL František and HEřMANSKý Hynek. New Nonsense Syllables Database - Analyses and Preliminary ASR Experiments. In: Proc. 8th International Conference on Spoken Language Processing. Jeju Island: Sunjin Printing Co,, 2004, pp. 348-351. ISSN 1225-4111.
Detail

MOTLíčEK Petr. Visual Feature Extreaction for Phoneme Recognition of Meetings. Brno: Department of Computer Graphics and Multimedia FIT BUT, 2004.
Detail

MATěJKA Pavel, SZőKE Igor, SCHWARZ Petr and ČERNOCKý Jan. Automatic Language Identification using Phoneme and Automatically Derived Unit Strings. Lecture Notes in Computer Science, vol. 2004, no. 3206, p. 8. ISSN 0302-9743.
Detail

MATěJKA Pavel, ČERNOCKý Jan and SIGMUND Milan. Introduction to Automatic Language Identification. In: Conference Proceedings of Radioelektronika 2004. Brno: Slovak University of Technology in Bratislava, 2004, p. 4. ISBN 80-227-2017-8.
Detail

MATěJKA Pavel. Review of Automatic Language Identification. In: Proceedings of 10th Conference and Competition STUDENT EEICTT 2004 Volume 2. Brno, 2004, p. 5. ISBN 80-214-2635-7.
Detail

MOTLíčEK Petr. Segmentace nahrávek živých jednání podle mluvčího. In: Sborník příspěvků a prezentací akce Odborné semináře 2004. REL03V. Brno: Department of Radioelectronics FEEC BUT, 2004, p. 28.
Detail

SCHWARZ Petr, MATěJKA Pavel and ČERNOCKý Jan. Towards Lower Error Rates in Phoneme Recognition. In: Proceedings of 7th International Conference Text,Speech and Dialoque 2004. Brno: Springer Verlag, 2004, p. 8. ISBN 3-540-23049-1.
Detail

SCHWARZ Petr, MATěJKA Pavel and ČERNOCKý Jan. Towards Lower Error Rates in Phoneme Recognition. Lecture Notes in Computer Science, vol. 2004, no. 3206, p. 8. ISSN 0302-9743.
Detail

SCHWARZ Petr, MATěJKA Pavel and ČERNOCKý Jan. Phoneme Recognition from a Long Temporal Context. In: poster at JOINT AMI/PASCAL/IM2/M4 Workshop on Multimodal Interaction and Related Machine Learning Algorithms. Martigny: Institute for Perceptual Artificial Intelligence, 2004, pp. 1-1.
Detail

MOTLíčEK Petr and ČERNOCKý Jan. Multimodal Phoneme Recognition of Meeting Data. Lecture Notes in Computer Science, vol. 2004, no. 3206, p. 6. ISSN 0302-9743.
Detail

2003

MOTLíčEK Petr. Derivation of TRAPs in Auditory Domain. In: Proceedings of 9th Conference and Competition STUDENT EEICT 2003. Brno: Dean Office of FEEC BUT, 2003, pp. 598-602. ISBN 80-214-2379-X.
Detail

JENDERKA Petr and VíCHA Tomáš. Voice Activity Detection in Multimodal Meeting Manager. In: Proceedings of 9th Conference and Competition STUDENT EEICT 2003 Volume 3. Brno: Faculty of Electrical Engineering and Communication BUT, 2003, pp. 588-592. ISBN 80-214-2379-X.
Detail

SCHWARZ Petr, MATěJKA Pavel and ČERNOCKý Jan. Recognition of Phoneme Strings using TRAP Technique. In: Proceedings of 8th International Conference Eurospeech. Geneve: International Speech Communication Association, 2003, pp. 1-4. ISSN 1018-4074.
Detail

MOTLíčEK Petr. Derivation of TRAPs in Auditory Domain. In: Proceedings of the International Conference and Competition. Brno: Faculty of Electrical Engineering and Communication BUT, 2003, pp. 315-319. ISBN 80-214-2401-X.
Detail

MOTLíčEK Petr and ČERNOCKý Jan. Time-domain based Temporal Processing with Application of. In: Proc. EUROSPEECH 2003. Geneva: Institute for Perceptual Artificial Intelligence, 2003, pp. 821-824. ISSN 1018-4074.
Detail

MOTLíčEK Petr and ČERNOCKý Jan. Autoregressive Modeling based Feature Extraction for Aurora3 DSR Task. In: Proc. EUROSPEECH 2003. Geneva: Institute for Perceptual Artificial Intelligence, 2003, pp. 1801-1804. ISSN 1018-4074.
Detail

MOTLíčEK Petr and ČERNOCKý Jan. All-Pole Modeling for Definition of Speech Features in Aurora3 DSR Task. In: 6th International Conference, TSD 2003 České Budějovice, Czech Republic, September 2003 Proceedings. České Budějovice: University of West Bohemia in Pilsen, 2003, pp. 295-300. ISBN 3-540-20024-X. ISSN 0302-9743.
Detail

SCHWARZ Petr. Would You Like To Make Your Programs Understand Human Voice?. In: Proceedings of 9th Conference STUDENT EEICT 2003. Brno: Faculty of Electrical Engineering and Communication BUT, 2003, pp. 231-235. ISBN 80-214-2379-X.
Detail

ČERNOCKý Jan. Temporal processing for feature extraction in speech recognition. Vědecké spisy VUT. Edice Habilitační a inaugurační spisy, sv. 112. Brno: Publishing house of Brno University of Technology VUTIUM, 2003, pp. 1-30. ISBN 80-214-2395-1.
Detail

MATěJKA Pavel, SCHWARZ Petr, HEřMANSKý Hynek and ČERNOCKý Jan. Phoneme Recognition using Temporal Patterns. In: Proc. 6th International Conference Text, Speech and Dialogue, TSD2003. Ceske Budejovice: Springer Verlag, 2003, pp. 465-472. ISBN 3-540-20024-X.
Detail

MATěJKA Pavel, SCHWARZ Petr, GRéZL František and ČERNOCKý Jan. Phoneme Classification using Temporal Patterns. In: Proc. 13th International scientific conference Radioelektronika 2003. Brno: Faculty of Electrical Engineering and Communication BUT, 2003, pp. 1-4. ISBN 80-214-2383-8.
Detail

GRéZL František. Local time-frequency operators in TRAPs for speech recognition. In: 6th International Conference, TSD 2003 České Budějovice, Czech Republic, September 2003 Proceedings. České Budějovice: University of West Bohemia in Pilsen, 2003, pp. 269-274. ISBN 3-540-20024-X. ISSN 0302-9743.
Detail

GRéZL František and HEřMANSKý Hynek. Local averaging and differentiating of spectral plane for TRAP-based ASR. In: Proc. EUROSPEECH 2003. Geneva: Institute for Perceptual Artificial Intelligence, 2003, p. 4. ISSN 1018-4074.
Detail

GRéZL František. Effect of normalization on TRAP based systems in ASR. In: Proc. 13th International scientific conference Radioelektronika 2003. Brno: Department of Radioelectronics FEEC BUT, 2003, pp. 128-131. ISBN 80-214-2383-8.
Detail

KARAFIáT Martin and GRéZL František. Using MATLAB for Analysis of TRAP system. Radioengineering, vol. 2003, no. 4, pp. 38-41. ISSN 1210-2512.
Detail

MOTLíčEK Petr. Modeling of Spectra and Temporal Trajectories in Speech Processing. In: Sborník příspěvků a prezentací akce Odborné semináře 2003 . REL02V. Brno: Department of Radioelectronics FEEC BUT, 2003, p. 28.
Detail

BURGET Lukáš and ČERNOCKý Jan. Recognition of Speech with Non-random Attributes. In: 6th International Conference, TSD 2003 České Budějovice, Czech Republic, September 2003 Proceedings. České Budějovice: Springer Verlag, 2003, p. 6. ISBN 3-540-20024-X. ISSN 0302-9743.
Detail

2002

BAUDOIN Genevieve, CAPMAN Francois, ČERNOCKý Jan, EL Chami Fadi, CHARBIT Maurice, CHOLLET Gerard and PETROVSKA-DELACRETAZ Dijana. Advances in very low bit-rate speech coding using recognition and synthesis techniques. Lecture Notes in Computer Science, vol. 2002, no. 2448, pp. 269-276. ISBN 3-540-44129-8. ISSN 0302-9743.
Detail

MATěJKA Pavel, SCHWARZ Petr, KARAFIáT Martin and ČERNOCKý Jan. Some like it Gaussian... In: Proc. 5th International Conference Text, Speech and Dialogue, TSD2002. Lecture notes in artificial intelligence 2448. Berlin: Springer Verlag, 2002, pp. 321-324. ISBN 3-540-44129-8.
Detail

SCHWARZ Petr and ČERNOCKý Jan. Keyword detection in Czech fluent speech. In: Proc. 12th International scientific conference Radioelektronika 2002. Bratislava: Slovak University of Technology in Bratislava, 2002, p. 4. ISBN 80-227-1700-2.
Detail

KARAFIáT Martin and ČERNOCKý Jan. Context dependent Hidden Markov models in recognition of Czech. In: Proc. 12th International scientific conference Radioelektronika 2002. Bratislava: Slovak University of Technology in Bratislava, 2002, p. 4. ISBN 80-227-1700-2.
Detail

GRéZL František, BURGET Lukáš, JAIN Pratibha and ČERNOCKý Jan. Improving TRAPS features using LDA. In: Proc. 12th International scientific conference Radioelektronika 2002. Bratislava: Slovak University of Technology in Bratislava, 2002, p. 4. ISBN 80-227-1700-2.
Detail

ČERNOCKý Jan. Units for automatic language independent speech processing. In: Proc. LREC 2002 - workshop on Portability issues in human language technologies. Las Palmas: European Language Resources Association, 2002, pp. 7-13.
Detail

SCHWARZ Petr. Modifications of Viterbi algorithms for keyword detection. In: Proceedings of 8th Conference STUDENT EEICT 2002. Brno: Faculty of Electrical Engineering and Communication BUT, 2002, p. 4. ISBN 80-214-2116-9.
Detail

MOTLíčEK Petr and BURGET Lukáš. Noise estimation for efficient speech enhancement and robust speech recognition. In: Proc. 7th International Conference on Spoken Language Processing. Denver: International Speech Communication Association, 2002, pp. 1033-1036. ISBN 1-876346-42-6.
Detail

MOTLíčEK Petr. Application of Mel-scale Filter bank for Noise Estimation in Speech Processing. In: 12th International Czech-Slovak Scientific conference Radioelektronika 2002. Bratislava: Slovak University of Technology in Bratislava, 2002, p. 4. ISBN 80-227-1700-2.
Detail

MOTLíčEK Petr and BURGET Lukáš. Efficient Noise Estimation and its Application for Robust Speech Recognition. In: 5th International Conference, TSD 2002 Brno, Czech Republic, September 2002 Proceedings. Berlin: Springer Verlag, 2002, pp. 229-236. ISBN 3-540-44129-8.
Detail

MOTLíčEK Petr. Noise Estimation for Spectral Subtraction in Speech Processing. In: Proceedings of 8th Conference STUDENT EEICT 2002. Brno: Faculty of Electrical Engineering and Communication BUT, 2002, p. 4. ISBN 80-214-2116-9.
Detail

BURGET Lukáš, MOTLíčEK Petr, GRéZL František and JAIN Pratibha. Distributed speech recognition. Radioengineering, vol. 2002, no. 4, pp. 12-16. ISSN 1210-2512.
Detail

GARUDADRI Harinath, HEřMANSKý Hynek, MORGAN Nelson, BENITEZ Carmen, BURGET Lukáš, KAJAREKAR Sachin, GRéZL František, JAIN Pratibha and MOTLíčEK Petr. Distributed Voice Recognition System Utilizing Multistream Network Feature Processing. San Diego: Qualcomm, 2002.
Detail

ČERNOCKý Jan and KARAFIáT Martin. Differences between context dependent and context independent Hidden Markov Models for recognition of Czech. In: Proc. of 8th student conference STUDENT EEICT 2002. Brno: Faculty of Electrical Engineering TUB, 2002, p. 5. ISBN 80-214-2116-9.
Detail

GRéZL František. Classifiers in speech recognition systems based on TRAPS. In: Proceedings of 8th Conference STUDENT EEICT 2002. Brno: Faculty of Electrical Engineering and Communication BUT, 2002, pp. 74-77. ISBN 80-214-2116-9.
Detail

MOTLíčEK Petr. Feature Extraction in Speech Coding and Recognition. Portland: Oregon Graduate Institute of Science and Technology, 2002.
Detail

BURGET Lukáš, DUPONT Stephane, GARUDADRI Harinath, GRéZL František, HEřMANSKý Hynek, JAIN Pratibha, KAJAREKAR Sachin and MORGAN Nelson. QUALCOMM-ICSI-OGI Features for ASR. In: Proc. 7th International Conference on Spoken Language Processing. Denver: International Speech Communication Association, 2002, p. 4. ISBN 1-876346-42-6.
Detail

MATěJKA Pavel and ČERNOCKý Jan. Feature gaussianization in speech recognition. In: Proc. 12th International scientific conference Radioelektronika 2002. Bratislava: Slovak University of Technology in Bratislava, 2002, p. 4. ISBN 80-227-1700-2.
Detail

ČERNOCKý Jan. Temporal processing for feature extraction in speech recognition, habilitation thesis. Brno, 2002.
Detail

2001

ČERNOCKý Jan, BAUDOIN Genevieve, PETROVSKA-DELACRETAZ Dijana and CHOLLET Gerard. Vers une analyse acoustico-phonetique de la parole independante de la langue, basee sur ALISP. Revue Parole, vol. 2001, no. 17, pp. 191-226. ISSN 1373-1955.
Detail

HEUVEL Henk, BOUDY Jerome, BAKCSI Zoltan, ČERNOCKý Jan, GALUNOV Valerij, KOCHANINA Julia, MAJEWSKI Wojciech, POLLáK Petr, RUSKO Milan, SADOWSKI Jerzy, STARONIEWICZ Piotr and TROPF Herbert. SpeechDat-East: Five multilingual speech databases for voice-operated teleservices completed. In: Proc. EUROSPEECH 2001. Aalborg: International Speech Communication Association, 2001, p. 4. ISBN 87-90834-09-7.
Detail

MOTLíčEK Petr, BAUDOIN Genevieve, ČERNOCKý Jan and CHOLLET Gerard. Minimization of transition noise and HNM synthesis in very low bit rate speech coding. In: 4th International Conference, TSD 2001 Železná Ruda, Czech Republic, September 2001 Proceedings. Berlin: Springer Verlag, 2001, pp. 305-312. ISBN 3-540-42557-8.
Detail

MOTLíčEK Petr. Application of Re-segmentation in Very Low Bit Rate Speech Coding. In: Proceedings of 7th Conference STUDENT EEICT 2001. Brno: Faculty of Electrical Engineering and Communication BUT, 2001, pp. 269-274. ISBN 80-214-1860-5.
Detail

MOTLíčEK Petr, GOURNAY Philipe, CHOLLET Gerard and BAUDOIN Genevieve. Codeur tres bas debit par indexation d'unites de parole de taille variable. In: GRETSI'01 on signal and image processing. Toulouse, 2001, p. 4.
Detail

2000

PETROVSKA-DELACRETAZ Dijana, ČERNOCKý Jan, HENNEBERT Jean and CHOLLET Gerard. Segmental Approaches for Automatic Speaker Verification. Digital signal processing, vol. 2000, no. 1, pp. 198-212. ISSN 1052-2004.
Detail

BAUDOIN Genevieve, ČERNOCKý Jan, GOURNAY Philipe and CHOLLET Gerard. Codage de la parole a bas et tres bas debits. Annales des Telecommunications, vol. 2000, no. 9, pp. 1-19. ISSN 0003-4347.
Detail

MOTLíčEK Petr and ČERNOCKý Jan. Optimal Pitch Path Tracking for more reliable Pitch Detection. In: 3th International Conference, TSD 2000 Brno, Czech Republic, September 2000 Proceedings. Berlin: Springer Verlag, 2000, pp. 183-188. ISBN 3-540-41042-2.
Detail

MOTLíčEK Petr and BURGET Lukáš. RELIABILITY IMPROVEMENT OF SPEECH PITCH DETECTION USING PATHS. In: Volume of the Works written by Students and Postgraduate Students. Brno: Faculty of Electrical Engineering and Communication BUT, 2000, pp. 348-351. ISBN 80-7204-155-X.
Detail

1999

ČERNOCKý Jan, POLLáK Petr, HANžL Václav, RUSKO Milan and TRNKA Marián. Recording of Czech and Slovak telephone databases within SpeechDat-E. Proc. Workshop on TEXT, SPEECH and DIALOG (TSD'99). Lecture Notes in Artificial Intelligence No. 1692. Berlin: Springer Verlag, 1999, pp. 388-391. ISBN 3-540-66494-7.
Detail

1996

ČERNOCKý Jan. Multigram-based speech coding - concepts of the dissertation. Brno: Faculty of Electrical Engineering and Computer Science BUT, 1996.
Detail

Study Department

Speech Data Mining Research Group BUT Speech@FIT

https://speech.fit.vutbr.cz/

Publications