Publication Details
Analysis of ABC Submission to NIST SRE 2019 CMN and VAST Challenge
Boulianne Gilles
Burget Lukáš, doc. Ing., Ph.D. (DCGM)
DAHMANE, M.
DIEZ SÁNCHEZ, M.
Glembek Ondřej, Ing., Ph.D.
LALONDE, M.
LOZANO DÍEZ, A.
Matějka Pavel, Ing., Ph.D.
MIZERA, P.
Mošner Ladislav, Ing. (DCGM)
NOISEUX, C.
MONTEIRO, J.
Novotný Ondřej, Ing., Ph.D.
Plchot Oldřich, Ing., Ph.D. (DCGM)
Rohdin Johan Andréas, M.Sc., Ph.D. (DCGM)
Silnova Anna, M.Sc., Ph.D. (DCGM)
SLAVÍČEK, J.
Stafylakis Themos
ST-CHARLES, P.
Wang Shuai
Zeinali Hossein, Ph.D. (DCGM)
speaker verification, NIST SRE, CMN, VAST, system fusion.
We present a condensed description and analysis of the joint submission of ABC team for NIST SRE 2019, by BUT, CRIM, Phonexia, Omilia and UAM. We concentrate on challenges that arose during development and we analyze the results obtained on the evaluation data and on our development sets. The conversational telephone speech (CMN2) condition is challenging for current state-of-the-art systems, mainly due to the language mismatch between training and test data. We show that a combination of adversarial domain adaptation, backend adaptation and score normalization can mitigate this mismatch. On the VAST condition, we demonstrate the importance of deploying diarization when dealing with multi-speaker utterances and the drastic improvements that can be obtained by combining audio and visual modalities.
@inproceedings{BUT164070,
author="ALAM, J. and BOULIANNE, G. and BURGET, L. and DAHMANE, M. and DIEZ SÁNCHEZ, M. and GLEMBEK, O. and LALONDE, M. and LOZANO DÍEZ, A. and MATĚJKA, P. and MIZERA, P. and MOŠNER, L. and NOISEUX, C. and MONTEIRO, J. and NOVOTNÝ, O. and PLCHOT, O. and ROHDIN, J. and SILNOVA, A. and SLAVÍČEK, J. and STAFYLAKIS, T. and ST-CHARLES, P. and WANG, S. and ZEINALI, H.",
title="Analysis of ABC Submission to NIST SRE 2019 CMN and VAST Challenge",
booktitle="Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop",
year="2020",
journal="Proceedings of Odyssey: The Speaker and Language Recognition Workshop Odyssey 2014, Joensuu, Finland",
volume="2020",
number="11",
pages="289--295",
publisher="International Speech Communication Association",
address="Tokyo",
doi="10.21437/Odyssey.2020-41",
issn="2312-2846",
url="https://www.isca-speech.org/archive/Odyssey_2020/abstracts/73.html"
}