Result detail

Dereverberation and Beamforming in Robust Far-Field Speaker Recognition

MOŠNER, L.; PLCHOT, O.; MATĚJKA, P.; NOVOTNÝ, O.; ČERNOCKÝ, J. Dereverberation and Beamforming in Robust Far-Field Speaker Recognition. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018. no. 9, p. 1334-1338. ISSN: 1990-9772.
Type
conference proceedings paper
Language
English
Authors
Mošner Ladislav, Ing., UPGM (FIT)
Plchot Oldřich, Ing., Ph.D., UPGM (FIT)
Matějka Pavel, Ing., Ph.D., UPGM (FIT)
Novotný Ondřej, Ing., Ph.D., UPGM (FIT)
Černocký Jan, prof. Dr. Ing., UPGM (FIT)
Abstract

This paper deals with robust speaker verification (SV) in far-field sensing. The robustness is verified on a subset of NIST SRE 2010 corpus retransmitted in multiple real rooms of different acoustics and captured with multiple microphones. We experimented with various data preprocessing steps including different approaches to dereverberation and beamforming applied to ad-hoc microphone arrays. We found that significant improvements in accuracy can be achieved with neural network based generalized eigenvalue beamformer preceded by weighted prediction error dereverberation. We also explored the effect of data augmentation by adding various real or simulated room acoustic properties to the Probabilistic Linear Discriminant Analysis (PLDA) training dataset. As a result, we developed a speaker recognition system whose performance is stable across different room acoustic conditions. It yields 41.4% relative improvement in performance over the system without multi-channel processing tested on the cleanest microphone data. With the best combination of data preprocessing and augmentation, we obtained a performance close to the one we achieved with the original clean test data.

Keywords

speaker verification, beamforming, dereverberation, autoencoder

URL
https://www.isca-speech.org/archive/Interspeech_2018/abstracts/2306.html
Year
2018
Pages
1334–1338
Journal
Proceedings of Interspeech, vol. 2018, no. 9, ISSN 1990-9772
Proceedings
Proceedings of Interspeech 2018
Conference
Interspeech Conference
Publisher
International Speech Communication Association
Place
Hyderabad
DOI
10.21437/Interspeech.2018-2306
UT WoS
000465363900279
EID Scopus
BibTeX
@inproceedings{BUT155103,
  author="Ladislav {Mošner} and Oldřich {Plchot} and Pavel {Matějka} and Ondřej {Novotný} and Jan {Černocký}",
  title="Dereverberation and Beamforming in Robust Far-Field Speaker Recognition",
  booktitle="Proceedings of Interspeech 2018",
  year="2018",
  journal="Proceedings of Interspeech",
  volume="2018",
  number="9",
  pages="1334--1338",
  publisher="International Speech Communication Association",
  address="Hyderabad",
  doi="10.21437/Interspeech.2018-2306",
  issn="1990-9772",
  url="https://www.isca-speech.org/archive/Interspeech_2018/abstracts/2306.html"
}
Projects
Dolování infoRmAcí z řeči Pořízené vzdÁlenými miKrofony, MV, Security Research of the Czech Republic 2015-2020, VI20152020025, start: 2015-10-01, end: 2020-09-30, completed
IT4Innovations excellence in science, MŠMT, National Sustainability Programme II, LQ1602, start: 2016-01-01, end: 2020-12-31, completed
Zvýšení spolehlivosti v automatickém rozpoznávání řečníka, GAČR, Junior grants, GJ17-23870Y, start: 2017-01-01, end: 2019-12-31, completed