Publication Details

But System for the Second Dihard Speech Diarization Challenge

LANDINI, F.; WANG, S.; DIEZ SÁNCHEZ, M.; BURGET, L.; MATĚJKA, P.; ŽMOLÍKOVÁ, K.; MOŠNER, L.; SILNOVA, A.; PLCHOT, O.; NOVOTNÝ, O.; ZEINALI, H.; ROHDIN, J. But System for the Second Dihard Speech Diarization Challenge. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Barcelona: IEEE Signal Processing Society, 2020. p. 6529-6533. ISBN: 978-1-5090-6631-5.
Czech title
Systém VUT pro druhou soutěž DIHARD v diarizaci řeči
Type
conference paper
Language
English
Authors
URL
Keywords

Speaker Diarization, Variational Bayes, HMM, DIHARD, CHiME

Abstract

This paper describes the winning systems developed by the BUT team for the four
tracks of the Second DIHARD Speech Diarization Challenge. For tracks 1 and 2 the
systems were mainly based on performing agglomerative hierarchical clustering
(AHC) of x-vectors, followed by another x-vector clustering based on Bayes hidden
Markov model and variational Bayes inference. We provide a comparison of the
improvement given by each step and share the implementation of the core of the
system. For tracks 3 and 4 with recordings from the Fifth CHiME Challenge, we
explored different approaches for doing multi-channel diarization and our best
performance was obtained when applying AHC on the fusion of per channel
probabilistic linear discriminant analysis scores.

Published
2020
Pages
6529–6533
Proceedings
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Conference
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), Barcelona, ES
ISBN
978-1-5090-6631-5
Publisher
IEEE Signal Processing Society
Place
Barcelona
DOI
UT WoS
000615970406158
EID Scopus
BibTeX
@inproceedings{BUT163962,
  author="Federico Nicolás {Landini} and Shuai {Wang} and Mireia {Diez Sánchez} and Lukáš {Burget} and Pavel {Matějka} and Kateřina {Žmolíková} and Ladislav {Mošner} and Anna {Silnova} and Oldřich {Plchot} and Ondřej {Novotný} and Hossein {Zeinali} and Johan Andréas {Rohdin}",
  title="But System for the Second Dihard Speech Diarization Challenge",
  booktitle="ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings",
  year="2020",
  pages="6529--6533",
  publisher="IEEE Signal Processing Society",
  address="Barcelona",
  doi="10.1109/ICASSP40776.2020.9054251",
  isbn="978-1-5090-6631-5",
  url="https://ieeexplore.ieee.org/document/9054251"
}
Files
Back to top