Publication Details

BUT system for DIHARD Speech Diarization Challenge 2018

DIEZ SÁNCHEZ, M.; LANDINI, F.; BURGET, L.; ROHDIN, J.; SILNOVA, A.; ŽMOLÍKOVÁ, K.; NOVOTNÝ, O.; VESELÝ, K.; GLEMBEK, O.; PLCHOT, O.; MOŠNER, L.; MATĚJKA, P. BUT system for DIHARD Speech Diarization Challenge 2018. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018. p. 2798-2802. ISSN: 1990-9772.
Czech title
VUT systém pro DIHARD Speech Diarization Challenge 2018
Type
conference paper
Language
English
Authors
URL
Keywords

Speaker Diarization, Variational Bayes, HMM, i-vector, x-vector, Overlapped speech, DIHARD

Abstract

This paper presents the approach developed by the BUT team for the first DIHARD speech diarization challenge, which is based on our Bayesian Hidden Markov Model with eigenvoice priors system. Besides the description of the approach, we provide a brief analysis of different techniques and data processing methods tested on the development set. We also introduce a simple attempt for overlapped speech detection that we used for attaining cleaner speaker models and reassigning overlapped speech to multiple speakers. Finally, we present results obtained on the evaluation set and discuss findings we made during the development phase and with the help of the DIHARD leaderboard feedback.

Published
2018
Pages
2798–2802
Journal
Proceedings of Interspeech, vol. 2018, no. 9, ISSN 1990-9772
Proceedings
Proceedings of Interspeech 2018
Publisher
International Speech Communication Association
Place
Hyderabad
DOI
UT WoS
000465363900585
EID Scopus
BibTeX
@inproceedings{BUT155100,
  author="Mireia {Diez Sánchez} and Federico Nicolás {Landini} and Lukáš {Burget} and Johan Andréas {Rohdin} and Anna {Silnova} and Kateřina {Žmolíková} and Ondřej {Novotný} and Karel {Veselý} and Ondřej {Glembek} and Oldřich {Plchot} and Ladislav {Mošner} and Pavel {Matějka}",
  title="BUT system for DIHARD Speech Diarization Challenge 2018",
  booktitle="Proceedings of Interspeech 2018",
  year="2018",
  journal="Proceedings of Interspeech",
  volume="2018",
  number="9",
  pages="2798--2802",
  publisher="International Speech Communication Association",
  address="Hyderabad",
  doi="10.21437/Interspeech.2018-1749",
  issn="1990-9772",
  url="https://www.isca-speech.org/archive/Interspeech_2018/abstracts/1749.html"
}
Files
Back to top