Publication Details

STBU system for the NIST 2006 speaker recognition evaluation

MATĚJKA, P.; BURGET, L.; SCHWARZ, P.; GLEMBEK, O.; KARAFIÁT, M.; GRÉZL, F.; ČERNOCKÝ, J.; VAN LEEUWEN, D.; BRÜMMER, N.; STRASHEIM, A. STBU system for the NIST 2006 speaker recognition evaluation. Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007). Honolulu: IEEE Signal Processing Society, 2007. p. 221-224. ISBN: 1-4244-0728-1.

Czech title

STBU systém pro NIST evaluaci rozpoznávání mluvčího 2006

Type

conference paper

Language

English

Authors

Matějka Pavel, Ing., Ph.D.
Burget Lukáš, doc. Ing., Ph.D. (DCGM)
Schwarz Petr, Ing., Ph.D. (DCGM)
Glembek Ondřej, Ing., Ph.D.
Karafiát Martin, Ing., Ph.D. (DCGM)
Grézl František, Ing., Ph.D. (DCGM)
Černocký Jan, prof. Dr. Ing. (DCGM)
van Leeuwen David
Brümmer Niko
Strasheim Albeert

URL

http://www.fit.vutbr.cz/research/groups/speech/publi/2007/matejka_stbu_icassp_2007.pdf

Keywords

Speaker recognition,GMM, SVM, eigenchannel, NAP.

Abstract

This paper describes STBU 2006 speaker recognition system, whichperformed well in the NIST 2006 speaker recognition evaluation. STBU isconsortium of 4 partners: Spescom DataVoice (South Africa), TNO(Netherlands), BUT (Czech Republic) and University of Stellenbosch(South Africa). The primary system is a combination of three main kindsof systems: (1) GMM, with short-time MFCC or PLP features, (2) GMM-SVM,using GMM mean supervectors as input and (3) MLLR-SVM, using MLLRspeaker adaptation coefficients derived from English LVCSR system. Inthis paper, we describe these sub-systems and present results for eachsystem alone and in combination on the NIST Speaker RecognitionEvaluation (SRE) 2006 development and evaluation data sets.

Annotation

This paper describes STBU 2006 speaker recognition system, which performed well in the NIST 2006 speaker recognition evaluation. STBU is consortium of 4 partners: Spescom DataVoice (South Africa), TNO (Netherlands), BUT (Czech Republic) and University of Stellenbosch (South Africa). The primary system is a combination of three main kinds of systems: (1) GMM, with short-time MFCC or PLP features, (2) GMM-SVM, using GMM mean supervectors as input and (3) MLLR-SVM, using MLLR speaker adaptation coefficients derived from English LVCSR system. In this paper, we describe these sub-systems and present results for each system alone and in combination on the NIST Speaker Recognition Evaluation (SRE) 2006 development and evaluation data sets.

Published

2007

Pages

221–224

Proceedings

Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007)

Conference

32nd IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Honolulu, US

ISBN

1-4244-0728-1

Publisher

IEEE Signal Processing Society

Place

Honolulu

BibTeX

@inproceedings{BUT28575,
  author="Pavel {Matějka} and Lukáš {Burget} and Petr {Schwarz} and Ondřej {Glembek} and Martin {Karafiát} and František {Grézl} and Jan {Černocký} and David {van Leeuwen} and Niko {Brümmer} and Albeert {Strasheim}",
  title="STBU system for the NIST 2006 speaker recognition evaluation",
  booktitle="Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007)",
  year="2007",
  pages="221--224",
  publisher="IEEE Signal Processing Society",
  address="Honolulu",
  isbn="1-4244-0728-1",
  url="http://www.fit.vutbr.cz/research/groups/speech/publi/2007/matejka_stbu_icassp_2007.pdf"
}