Publication Details

Analysis of ABC Submission to NIST SRE 2019 CMN and VAST Challenge

ALAM, J.; BOULIANNE, G.; BURGET, L.; DAHMANE, M.; DIEZ SÁNCHEZ, M.; GLEMBEK, O.; LALONDE, M.; LOZANO DÍEZ, A.; MATĚJKA, P.; MIZERA, P.; MOŠNER, L.; NOISEUX, C.; MONTEIRO, J.; NOVOTNÝ, O.; PLCHOT, O.; ROHDIN, J.; SILNOVA, A.; SLAVÍČEK, J.; STAFYLAKIS, T.; ST-CHARLES, P.; WANG, S.; ZEINALI, H. Analysis of ABC Submission to NIST SRE 2019 CMN and VAST Challenge. In Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop. Proceedings of Odyssey: The Speaker and Language Recognition Workshop Odyssey 2014, Joensuu, Finland. Tokyo: International Speech Communication Association, 2020. p. 289-295. ISSN: 2312-2846.
Czech title
Analýza systému ABC pro evaluaci NIST SRE 2019 v kategoriích CMN a VAST
Type
conference paper
Language
English
Authors
Alam Jahangir
Boulianne Gilles
Burget Lukáš, doc. Ing., Ph.D. (DCGM)
DAHMANE, M.
DIEZ SÁNCHEZ, M.
Glembek Ondřej, Ing., Ph.D.
LALONDE, M.
LOZANO DÍEZ, A.
Matějka Pavel, Ing., Ph.D. (DCGM)
MIZERA, P.
Mošner Ladislav, Ing. (DCGM)
NOISEUX, C.
MONTEIRO, J.
Novotný Ondřej, Ing., Ph.D.
Plchot Oldřich, Ing., Ph.D. (DCGM)
Rohdin Johan Andréas, M.Sc., Ph.D. (DCGM)
Silnova Anna, M.Sc., Ph.D. (DCGM)
SLAVÍČEK, J.
Stafylakis Themos
ST-CHARLES, P.
Wang Shuai
Zeinali Hossein, Ph.D. (DCGM)
URL
https://www.isca-speech.org/archive/Odyssey_2020/abstracts/73.html
Keywords

speaker verification, NIST SRE, CMN, VAST, system fusion.

Abstract

We present a condensed description and analysis of the joint submission of the ABC team to NIST SRE 2019, prepared by BUT, CRIM, Phonexia, Omilia and UAM. We concentrate on the challenges that arose during development and analyze the results obtained on the evaluation data and on our development sets. The conversational telephone speech (CMN2) condition is challenging for current state-of-the-art systems, mainly because of the language mismatch between training and test data. We show that a combination of adversarial domain adaptation, backend adaptation and score normalization can mitigate this mismatch. On the VAST condition, we demonstrate the importance of deploying diarization when dealing with multi-speaker utterances and the drastic improvements that can be obtained by combining audio and visual modalities.

Published
2020
Pages
289–295
Journal
Proceedings of Odyssey: The Speaker and Language Recognition Workshop Odyssey 2014, Joensuu, Finland, vol. 2020, no. 11, ISSN 2312-2846
Proceedings
Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop
Publisher
International Speech Communication Association
Place
Tokyo
DOI
10.21437/Odyssey.2020-41
BibTeX
@inproceedings{BUT164070,
  author="ALAM, J. and BOULIANNE, G. and BURGET, L. and DAHMANE, M. and DIEZ SÁNCHEZ, M. and GLEMBEK, O. and LALONDE, M. and LOZANO DÍEZ, A. and MATĚJKA, P. and MIZERA, P. and MOŠNER, L. and NOISEUX, C. and MONTEIRO, J. and NOVOTNÝ, O. and PLCHOT, O. and ROHDIN, J. and SILNOVA, A. and SLAVÍČEK, J. and STAFYLAKIS, T. and ST-CHARLES, P. and WANG, S. and ZEINALI, H.",
  title="Analysis of ABC Submission to NIST SRE 2019 CMN and VAST Challenge",
  booktitle="Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop",
  year="2020",
  journal="Proceedings of Odyssey: The Speaker and Language Recognition Workshop Odyssey 2014, Joensuu, Finland",
  volume="2020",
  number="11",
  pages="289--295",
  publisher="International Speech Communication Association",
  address="Tokyo",
  doi="10.21437/Odyssey.2020-41",
  issn="2312-2846",
  url="https://www.isca-speech.org/archive/Odyssey_2020/abstracts/73.html"
}