Publication Details

STBU system for the NIST 2006 speaker recognition evaluation

MATĚJKA, P.; BURGET, L.; SCHWARZ, P.; GLEMBEK, O.; KARAFIÁT, M.; GRÉZL, F.; ČERNOCKÝ, J.; VAN LEEUWEN, D.; BRÜMMER, N.; STRASHEIM, A. STBU system for the NIST 2006 speaker recognition evaluation. Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007). Honolulu: IEEE Signal Processing Society, 2007. p. 221-224. ISBN: 1-4244-0728-1.
Czech title
STBU systém pro NIST evaluaci rozpoznávání mluvčího 2006
Type
conference paper
Language
English
Authors
Matějka Pavel, Ing., Ph.D. (DCGM)
Burget Lukáš, doc. Ing., Ph.D. (DCGM)
Schwarz Petr, Ing., Ph.D. (DCGM)
Glembek Ondřej, Ing., Ph.D.
Karafiát Martin, Ing., Ph.D. (DCGM)
Grézl František, Ing., Ph.D. (DCGM)
Černocký Jan, prof. Dr. Ing. (DCGM)
van Leeuwen David
Brümmer Niko
Strasheim Albeert
URL
Keywords

Speaker recognition,GMM, SVM, eigenchannel, NAP.

Abstract

This paper describes STBU 2006 speaker recognition system, which performed well in the NIST 2006 speaker recognition evaluation. STBU is consortium of 4 partners: Spescom DataVoice (South Africa), TNO (Netherlands), BUT (Czech Republic) and University of Stellenbosch (South Africa). The primary system is a combination of three main kinds of systems: (1) GMM, with short-time MFCC or PLP features, (2) GMM-SVM, using GMM mean supervectors as input and (3) MLLR-SVM, using MLLR speaker adaptation coefficients derived from English LVCSR system. In this paper, we describe these sub-systems and present results for each system alone and in combination on the NIST Speaker Recognition Evaluation (SRE) 2006 development and evaluation data sets.

Annotation

This paper describes STBU 2006 speaker recognition system, which performed well in the NIST 2006 speaker recognition evaluation. STBU is consortium of 4 partners: Spescom DataVoice (South Africa), TNO (Netherlands), BUT (Czech Republic) and University of Stellenbosch (South Africa). The primary system is a combination of three main kinds of systems: (1) GMM, with short-time MFCC or PLP features, (2) GMM-SVM, using GMM mean supervectors as input and (3) MLLR-SVM, using MLLR speaker adaptation coefficients derived from English LVCSR system. In this paper, we describe these sub-systems and present results for each system alone and in combination on the NIST Speaker Recognition Evaluation (SRE) 2006 development and evaluation data sets.

Published
2007
Pages
221–224
Proceedings
Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007)
ISBN
1-4244-0728-1
Publisher
IEEE Signal Processing Society
Place
Honolulu
BibTeX
@inproceedings{BUT28575,
  author="Pavel {Matějka} and Lukáš {Burget} and Petr {Schwarz} and Ondřej {Glembek} and Martin {Karafiát} and František {Grézl} and Jan {Černocký} and David {van Leeuwen} and Niko {Brümmer} and Albeert {Strasheim}",
  title="STBU system  for the NIST 2006 speaker recognition evaluation",
  booktitle="Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007)",
  year="2007",
  pages="221--224",
  publisher="IEEE Signal Processing Society",
  address="Honolulu",
  isbn="1-4244-0728-1",
  url="http://www.fit.vutbr.cz/research/groups/speech/publi/2007/matejka_stbu_icassp_2007.pdf"
}
Back to top