Tento projekt je spolufinancován se státní podporou Technologické gentury ČR
v rámci Programu na podporu aplikovaného výzkumu ZÉTA
www.tacr.cz
Výzkum užitečný pro společnost.
Výzkum užitečný pro společnost.
Detail projektu
Neuronové sítě pro zpracování signálu a dolování informací v řeči - NOSIČI
Období řešení: 1. 1. 2018 – 31. 12. 2019
Typ projektu: grant
Kód: TJ01000208
Agentura: Technologická agentura ČR
Název anglicky
Neural networks for signal processing and speech data mining
Typ
grant
Klíčová slova
neuronové sítě
Abstrakt
Projekt se zabývá neuronovými sítěmi pro zpracování signálu a dolování informací v řeči.
Řešitelé
Žmolíková Kateřina, Ing., Ph.D.
(FIT)
– hlavní řešitel
Beneš Karel, Ing. (UPGM)
Egorova Ekaterina, Ing., Ph.D.
Silnova Anna, M.Sc., Ph.D. (UPGM)
Veselý Karel, Ing., Ph.D. (UPGM)
Beneš Karel, Ing. (UPGM)
Egorova Ekaterina, Ing., Ph.D.
Silnova Anna, M.Sc., Ph.D. (UPGM)
Veselý Karel, Ing., Ph.D. (UPGM)
Publikace
2020
- MATĚJKA, P.; PLCHOT, O.; GLEMBEK, O.; BURGET, L.; ROHDIN, J.; ZEINALI, H.; MOŠNER, L.; SILNOVA, A.; NOVOTNÝ, O.; DIEZ SÁNCHEZ, M.; ČERNOCKÝ, J. 13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE. COMPUTER SPEECH AND LANGUAGE, 2020, vol. 2020, no. 63,
p. 1-15. ISSN: 0885-2308. Detail - ROHDIN, J.; SILNOVA, A.; DIEZ SÁNCHEZ, M.; PLCHOT, O.; MATĚJKA, P.; BURGET, L.; GLEMBEK, O. End-to-end DNN based text-independent speaker recognition for long and short utterances. COMPUTER SPEECH AND LANGUAGE, 2020, vol. 2020, no. 59,
p. 22-35. ISSN: 0885-2308. Detail
2019
- ALAM, J.; BOULIANNE, G.; GLEMBEK, O.; LOZANO DÍEZ, A.; MATĚJKA, P.; MIZERA, P.; MONTEIRO, J.; MOŠNER, L.; NOVOTNÝ, O.; PLCHOT, O.; ROHDIN, J.; SILNOVA, A.; SLAVÍČEK, J.; STAFYLAKIS, T.; WANG, S.; ZEINALI, H. ABC NIST SRE 2019 CTS System Description. Proceedings of NIST. Sentosa, Singapore: National Institute of Standards and Technology, 2019.
p. 1-6. Detail - DELCROIX, M.; ŽMOLÍKOVÁ, K.; OCHIAI, T.; KINOSHITA, K.; ARAKI, S.; NAKATANI, T. Compact Network for Speakerbeam Target Speaker Extraction. In Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019.
p. 6965-6969. ISBN: 978-1-5386-4658-8. Detail - MATĚJKA, P.; PLCHOT, O.; ZEINALI, H.; MOŠNER, L.; SILNOVA, A.; BURGET, L.; NOVOTNÝ, O.; GLEMBEK, O. Analysis of BUT Submission in Far-Field Scenarios of VOiCES 2019 Challenge. In Proceedings of Interspeech. Proceedings of Interspeech. Graz: International Speech Communication Association, 2019.
p. 2448-2452. ISSN: 1990-9772. Detail - ŽMOLÍKOVÁ, K.; DELCROIX, M.; KINOSHITA, K.; OCHIAI, T.; NAKATANI, T.; BURGET, L.; ČERNOCKÝ, J. SpeakerBeam: Speaker Aware Neural Network for Target Speaker Extraction in Speech Mixtures. IEEE J-STSP, 2019, vol. 13, no. 4,
p. 800-814. ISSN: 1932-4553. Detail
2018
- ALAM, J.; BHATTACHARYA, G.; BRUMMER, J.; BURGET, L.; DIEZ SÁNCHEZ, M.; GLEMBEK, O.; KENNY, P.; KLČO, M.; LANDINI, F.; LOZANO DÍEZ, A.; MATĚJKA, P.; MONTEIRO, J.; MOŠNER, L.; NOVOTNÝ, O.; PLCHOT, O.; PROFANT, J.; ROHDIN, J.; SILNOVA, A.; SLAVÍČEK, J.; STAFYLAKIS, T.; ZEINALI, H. ABC NIST SRE 2018 SYSTEM DESCRIPTION. Proceedings of 2018 NIST SRE Workshop. Athens: National Institute of Standards and Technology, 2018.
p. 1-10. Detail - BENEŠ, K.; KESIRAJU, S.; BURGET, L. i-vectors in language modeling: An efficient way of domain adaptation for feed-forward models. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018.
p. 3383-3387. ISSN: 1990-9772. Detail - BRUMMER, J.; SILNOVA, A.; BURGET, L.; STAFYLAKIS, T. Gaussian meta-embeddings for efficient scoring of a heavy-tailed PLDA model. In Proceedings of Odyssey 2018. Proceedings of Odyssey: The Speaker and Language Recognition Workshop Odyssey 2014, Joensuu, Finland. Les Sables d'Olonne: International Speech Communication Association, 2018.
p. 349-356. ISSN: 2312-2846. Detail - DIEZ SÁNCHEZ, M.; LANDINI, F.; BURGET, L.; ROHDIN, J.; SILNOVA, A.; ŽMOLÍKOVÁ, K.; NOVOTNÝ, O.; VESELÝ, K.; GLEMBEK, O.; PLCHOT, O.; MOŠNER, L.; MATĚJKA, P. BUT system for DIHARD Speech Diarization Challenge 2018. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018.
p. 2798-2802. ISSN: 1990-9772. Detail - EGOROVA, E.; BURGET, L. Out-of-Vocabulary Word Recovery Using FST-Based Subword Unit Clustering in a Hybrid ASR System. In Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018.
p. 5919-5923. ISBN: 978-1-5386-4658-8. Detail - KARAFIÁT, M.; BASKAR, M.; SZŐKE, I.; MALENOVSKÝ, V.; VESELÝ, K.; GRÉZL, F.; BURGET, L.; ČERNOCKÝ, J. BUT OpenSAT 2017 speech recognition system. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018.
p. 2638-2642. ISSN: 1990-9772. Detail - KARAFIÁT, M.; BASKAR, M.; VESELÝ, K.; GRÉZL, F.; BURGET, L.; ČERNOCKÝ, J. Analysis of Multilingual BLSTM Acoustic Model on Low and High Resource Languages. In Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018.
p. 5789-5793. ISBN: 978-1-5386-4658-8. Detail - PULUGUNDLA, B.; BASKAR, M.; KESIRAJU, S.; EGOROVA, E.; KARAFIÁT, M.; BURGET, L.; ČERNOCKÝ, J. BUT system for low resource Indian language ASR. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018.
p. 3182-3186. ISSN: 1990-9772. Detail - ROHDIN, J.; SILNOVA, A.; DIEZ SÁNCHEZ, M.; PLCHOT, O.; MATĚJKA, P.; BURGET, L. End-to-End DNN Based Speaker Recognition Inspired by i-Vector and PLDA. In Proceedings of ICASSP. Calgary: IEEE Signal Processing Society, 2018.
p. 4874-4878. ISBN: 978-1-5386-4658-8. Detail - SILNOVA, A.; BRUMMER, J.; GARCÍA-ROMERO, D.; SNYDER, D.; BURGET, L. Fast variational Bayes for heavy-tailed PLDA applied to i-vectors and x-vectors. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018.
p. 72-76. ISSN: 1990-9772. Detail - SILNOVA, A.; MATĚJKA, P.; GLEMBEK, O.; PLCHOT, O.; NOVOTNÝ, O.; GRÉZL, F.; SCHWARZ, P.; ČERNOCKÝ, J. BUT/Phonexia Bottleneck Feature Extractor. In Proceedings of Odyssey 2018. Proceedings of Odyssey: The Speaker and Language Recognition Workshop Odyssey 2014, Joensuu, Finland. Les Sables d´Olonne: International Speech Communication Association, 2018.
p. 283-287. ISSN: 2312-2846. Detail - VESELÝ, K.; PERALES, C.; SZŐKE, I.; LUQUE, J.; ČERNOCKÝ, J. Lightly supervised vs. semi-supervised training of acoustic model on Luxembourgish for low-resource automatic speech recognition. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018.
p. 2883-2887. ISSN: 1990-9772. Detail - ŽMOLÍKOVÁ, K.; DELCROIX, M.; KINOSHITA, K.; HIGUCHI, T.; NAKATANI, T.; ČERNOCKÝ, J. Optimization of Speaker-aware Multichannel Speech Extraction with ASR Criterion. In Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018.
p. 6702-6706. ISBN: 978-1-5386-4658-8. Detail