Publication Details

Analysis of ABC Submission to NIST SRE 2019 CMN and VAST Challenge

ALAM, J.; BOULIANNE, G.; BURGET, L.; DAHMANE, M.; DIEZ SÁNCHEZ, M.; GLEMBEK, O.; LALONDE, M.; LOZANO DÍEZ, A.; MATĚJKA, P.; MIZERA, P.; MOŠNER, L.; NOISEUX, C.; MONTEIRO, J.; NOVOTNÝ, O.; PLCHOT, O.; ROHDIN, J.; SILNOVA, A.; SLAVÍČEK, J.; STAFYLAKIS, T.; ST-CHARLES, P.; WANG, S.; ZEINALI, H. Analysis of ABC Submission to NIST SRE 2019 CMN and VAST Challenge. In Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop. Proceedings of Odyssey: The Speaker and Language Recognition Workshop Odyssey 2014, Joensuu, Finland. Tokyo: International Speech Communication Association, 2020. p. 289-295. ISSN: 2312-2846.
Czech title
Analýza systému ABC pro evaluaci NIST SRE 2019 v kategoriích CMN a VAST
Type
conference paper
Language
English
Authors
Alam Jahangir
Boulianne Gilles
Burget Lukáš, doc. Ing., Ph.D. (DCGM)
DAHMANE, M.
DIEZ SÁNCHEZ, M.
Glembek Ondřej, Ing., Ph.D.
LALONDE, M.
LOZANO DÍEZ, A.
Matějka Pavel, Ing., Ph.D. (DCGM)
MIZERA, P.
Mošner Ladislav, Ing. (DCGM)
NOISEUX, C.
MONTEIRO, J.
Novotný Ondřej, Ing., Ph.D.
Plchot Oldřich, Ing., Ph.D. (DCGM)
Rohdin Johan Andréas, M.Sc., Ph.D. (DCGM)
Silnova Anna, M.Sc., Ph.D. (DCGM)
SLAVÍČEK, J.
Stafylakis Themos
ST-CHARLES, P.
Wang Shuai
Zeinali Hossein, Ph.D. (DCGM)
URL
Keywords

speaker verification, NIST SRE, CMN, VAST, system fusion.

Abstract

We present a condensed description and analysis of the joint submission of ABC
team for NIST SRE 2019, by BUT, CRIM, Phonexia, Omilia and UAM. We concentrate on
challenges that arose during development and we analyze the results obtained on
the evaluation data and on our development sets. The conversational telephone
speech (CMN2) condition is challenging for current state-of-the-art systems,
mainly due to the language mismatch between training and test data. We show that
a combination of adversarial domain adaptation, backend adaptation and score
normalization can mitigate this mismatch. On the VAST condition, we demonstrate
the importance of deploying diarization when dealing with multi-speaker
utterances and the drastic improvements that can be obtained by combining audio
and visual modalities.

Published
2020
Pages
289–295
Journal
Proceedings of Odyssey: The Speaker and Language Recognition Workshop Odyssey 2014, Joensuu, Finland, vol. 2020, no. 11, ISSN 2312-2846
Proceedings
Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop
Conference
Odyssey 2020: The Speaker and Language Recognition Workshop, Tokyo, JP
Publisher
International Speech Communication Association
Place
Tokyo
DOI
BibTeX
@inproceedings{BUT164070,
  author="ALAM, J. and BOULIANNE, G. and BURGET, L. and DAHMANE, M. and DIEZ SÁNCHEZ, M. and GLEMBEK, O. and LALONDE, M. and LOZANO DÍEZ, A. and MATĚJKA, P. and MIZERA, P. and MOŠNER, L. and NOISEUX, C. and MONTEIRO, J. and NOVOTNÝ, O. and PLCHOT, O. and ROHDIN, J. and SILNOVA, A. and SLAVÍČEK, J. and STAFYLAKIS, T. and ST-CHARLES, P. and WANG, S. and ZEINALI, H.",
  title="Analysis of ABC Submission to NIST SRE 2019 CMN and VAST Challenge",
  booktitle="Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop",
  year="2020",
  journal="Proceedings of Odyssey: The Speaker and Language Recognition Workshop Odyssey 2014, Joensuu, Finland",
  volume="2020",
  number="11",
  pages="289--295",
  publisher="International Speech Communication Association",
  address="Tokyo",
  doi="10.21437/Odyssey.2020-41",
  issn="2312-2846",
  url="https://www.isca-speech.org/archive/Odyssey_2020/abstracts/73.html"
}
Files
Back to top