Publication Details

Towards Noise-Robust Speaker Recognition Using Probabilistic Linear Discriminant Analysis

LEI, Y.; BURGET, L.; FERRER, L.; GRACIARENA, M.; SCHEFFER, N. Towards Noise-Robust Speaker Recognition Using Probabilistic Linear Discriminant Analysis. Proc. International Conference on Acoustics, Speech, and Signal P. Kyoto: IEEE Signal Processing Society, 2012. p. 4253-4256. ISBN: 978-1-4673-0044-5.
Czech title
Posun k rozpoznávání mluvčího robustnímu vůči šumu pomocí pravděpodobnostní lineární diskriminační analýzy
Type
conference paper
Language
English
Authors
Lei Yun
Burget Lukáš, doc. Ing., Ph.D. (DCGM)
Ferrer Luciana
Graciarena Martin
Scheffer Nicolas
URL
Keywords

Speaker Recognition, noise, robustness, i-vector, PLDA

Abstract

We show results on a newly designed noisy corpus for speaker recognition where real recordings of babble noise were added to original NIST SRE clean speech data.

Annotation

This work addresses the problem of speaker verification where additive noise is present in the enrollment and testing utterances. We show how the current state-of-the-art framework can be effectively used to mitigate this effect. We first look at the degradation a standard speaker verification system is subjected to when presented with noisy speech waveforms. We designed and generated a corpus with noisy conditions, based on the NIST SRE 2008 and 2010 data, built using open-source tools and freely available noise samples. We then show how adding noisy training data in the current i-vectorbased approach followed by probabilistic linear discriminant analysis (PLDA) can bring significant gains in accuracy at various signal-to-noise ratio (SNR) levels. We demonstrate that this improvement is not feature-specific as we present positive results for three disparate sets of features: standard mel frequency cepstral coefficients, prosodic polynomial coefficients and maximum likelihood linear regression (MLLR) transforms.

Published
2012
Pages
4253–4256
Proceedings
Proc. International Conference on Acoustics, Speech, and Signal P
ISBN
978-1-4673-0044-5
Publisher
IEEE Signal Processing Society
Place
Kyoto
DOI
BibTeX
@inproceedings{BUT91503,
  author="Yun {Lei} and Lukáš {Burget} and Luciana {Ferrer} and Martin {Graciarena} and Nicolas {Scheffer}",
  title="Towards Noise-Robust Speaker Recognition Using Probabilistic Linear Discriminant Analysis",
  booktitle="Proc. International Conference on Acoustics, Speech, and Signal P",
  year="2012",
  pages="4253--4256",
  publisher="IEEE Signal Processing Society",
  address="Kyoto",
  doi="10.1109/ICASSP.2012.6288858",
  isbn="978-1-4673-0044-5",
  url="http://www.fit.vutbr.cz/research/groups/speech/publi/2012/lei_icassp2012_0004253.pdf"
}
Back to top