Publication Details

Gender Independent Discriminative Speaker Recognition in I-Vector Space

CUMANI, S.; GLEMBEK, O.; BRUMMER, J.; DE VILLIERS, E.; LAFACE, P. Gender Independent Discriminative Speaker Recognition in I-Vector Space. Proc. International Conference on Acoustics, Speech, and Signal P. Kyoto: IEEE Signal Processing Society, 2012. p. 4361-4364. ISBN: 978-1-4673-0044-5.
Czech title
Diskriminativní rozpoznávání mluvčího v i-vektorovém prostoru nezávislé na pohlaví
Type
conference paper
Language
English
Authors
Cumani Sandro, Ph.D.
Glembek Ondřej, Ing., Ph.D.
Brummer Johan Nikolaas Langenhoven, Dr.
de Villiers Edward
Laface Pietro
URL
Keywords

speaker recognition, gender recognition, PLDA models, GI Pairwise SVM

Abstract

This paper describes speaker recognition systems that are trained with gender dependent features and tested with known gender trails.

Annotation

Speaker recognition systems attain their best accuracy when trained with gender dependent features and tested with known gender trials. In real applications, howevcer, gender labels are often not given. In this work we illustrate the design of a system that does not make use of the gender labels both in training and in test, i.e. a completely Gender Independent (GI) system. It relies on discriminative training, where the trials are i-vector pairs, and the discrimination is between the hypothesis that the pair of feature vectors in the trial belong to the same speaker or to different speakers. We demonstrate that this pairwise discriminative training can be interpreted as a procedure that estimates the parameters of the best (second order) approximation of the log-likelihood ratio score function, and that a pairwise SVM can be used for training a gender independent system. Our results show that a pairwise GI SVM, saving memory and execution time, achieves on the last NIST evaluationscomplet state-of-the-art performance, comparable to a Gender Dependent(GD) system.

Published
2012
Pages
4361–4364
Proceedings
Proc. International Conference on Acoustics, Speech, and Signal P
ISBN
978-1-4673-0044-5
Publisher
IEEE Signal Processing Society
Place
Kyoto
DOI
BibTeX
@inproceedings{BUT91484,
  author="Sandro {Cumani} and Ondřej {Glembek} and Johan Nikolaas Langenhoven {Brummer} and Edward {de Villiers} and Pietro {Laface}",
  title="Gender Independent Discriminative Speaker Recognition in I-Vector Space",
  booktitle="Proc. International Conference on Acoustics, Speech, and Signal P",
  year="2012",
  pages="4361--4364",
  publisher="IEEE Signal Processing Society",
  address="Kyoto",
  doi="10.1109/ICASSP.2012.6288885",
  isbn="978-1-4673-0044-5",
  url="http://www.fit.vutbr.cz/research/groups/speech/publi/2012/cumani_icassp2012_0004361.pdf"
}
Back to top