Publication Details

Dereverberation and Beamforming in Robust Far-Field Speaker Recognition

MOŠNER, L.; PLCHOT, O.; MATĚJKA, P.; NOVOTNÝ, O.; ČERNOCKÝ, J. Dereverberation and Beamforming in Robust Far-Field Speaker Recognition. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018. p. 1334-1338. ISSN: 1990-9772.
Czech title
Dereverberace a směrování paprsku v robustním rozpoznávání mluvčího ze vzdálených mikrofonů
Type
conference paper
Language
English
Authors
URL
Keywords

speaker verification, beamforming, dereverberation,autoencoder

Abstract

This paper deals with robust speaker verification (SV) in farfieldsensing. The robustness is verified on a subset of NISTSRE 2010 corpus retransmitted in multiple real rooms of differentacoustics and captured with multiple microphones. Weexperimented with various data preprocessing steps includingdifferent approaches to dereverberation and beamforming appliedto ad-hoc microphone arrays. We found that significantimprovements in accuracy can be achieved with neural networkbased generalized eigenvalue beamformer preceded byweighted prediction error dereverberation. We also exploredthe effect of data augmentation by adding various real or simulatedroom acoustic properties to the Probabilistic Linear DiscriminantAnalysis (PLDA) training dataset. As a result, wedeveloped a speaker recognition system whose performanceis stable across different room acoustic conditions. It yields41.4% relative improvement in performance over the systemwithout multi-channel processing tested on the cleanest microphonedata. With the best combination of data preprocessingand augmentation, we obtained a performance close to the onewe achieved with the original clean test data.

Published
2018
Pages
1334–1338
Journal
Proceedings of Interspeech, vol. 2018, no. 9, ISSN 1990-9772
Proceedings
Proceedings of Interspeech 2018
Conference
Interspeech Conference, Hyderabad, India, IN
Publisher
International Speech Communication Association
Place
Hyderabad
DOI
UT WoS
000465363900279
EID Scopus
BibTeX
@inproceedings{BUT155103,
  author="Ladislav {Mošner} and Oldřich {Plchot} and Pavel {Matějka} and Ondřej {Novotný} and Jan {Černocký}",
  title="Dereverberation and Beamforming in Robust Far-Field Speaker Recognition",
  booktitle="Proceedings of Interspeech 2018",
  year="2018",
  journal="Proceedings of Interspeech",
  volume="2018",
  number="9",
  pages="1334--1338",
  publisher="International Speech Communication Association",
  address="Hyderabad",
  doi="10.21437/Interspeech.2018-2306",
  issn="1990-9772",
  url="https://www.isca-speech.org/archive/Interspeech_2018/abstracts/2306.html"
}
Files
Back to top