Publication Details

Speaker vectors from Subspace Gaussian Mixture Model as complementary features for Language Identification

PLCHOT, O.; KARAFIÁT, M.; BRUMMER, J.; GLEMBEK, O.; MATĚJKA, P.; DE VILLIERS, E.; ČERNOCKÝ, J. Speaker vectors from Subspace Gaussian Mixture Model as complementary features for Language Identification. In Proceedings of Odyssey 2012, The Speaker and Language Recognition Workshop. Singapur: International Speech Communication Association, 2012. p. 330-333. ISBN: 978-981-07-3093-2.

Czech title

Adaptační vektory mluvčího ze Subspace Gaussian Mixture modelu jako komplementární příznaky pro identifikaci jazyka

Type

conference paper

Language

English

Authors

Plchot Oldřich, Ing., Ph.D. (DCGM)
Karafiát Martin, Ing., Ph.D. (DCGM)
Brummer Johan Nikolaas Langenhoven, Dr.
Glembek Ondřej, Ing., Ph.D.
Matějka Pavel, Ing., Ph.D. (DCGM)
de Villiers Edward
Černocký Jan, prof. Dr. Ing. (DCGM)

URL

http://www.fit.vutbr.cz/research/groups/speech/publi/2012/plchot_odyssey2012_330-333-41.pdf

Keywords

speaker recognition, Gaussian Mixture Model, speaker vectors, language identification

Abstract

In this paper we have presented new features for language identification, based on speaker adaptation vectors from sub-space Gaussian Mixture Models.

Annotation

In this paper, we explore new high-level features for language identification. The recently introduced Subspace Gaussian Mixture Models (SGMM) provide an elegant and efficient way for GMM acoustic modelling, with mean supervectors represented in a low-dimensional representative subspace. SGMMs also provide an efficient way of speaker adaptation by means of lowdimensional vectors. In our framework, these vectors are used as features for language identification. They are compared with our acoustic iVector system, which architecture is currently considered state-of-the-art for Language Identification and Speaker Verification. The results of both systems and their fusion are reported on the NIST LRE2009 dataset.

Published

2012

Pages

330–333

Proceedings

Proceedings of Odyssey 2012, The Speaker and Language Recognition Workshop

ISBN

978-981-07-3093-2

Publisher

International Speech Communication Association

Place

Singapur

EID Scopus

2-s2.0-85073253015

BibTeX

@inproceedings{BUT96995,
  author="Oldřich {Plchot} and Martin {Karafiát} and Johan Nikolaas Langenhoven {Brummer} and Ondřej {Glembek} and Pavel {Matějka} and Edward {de Villiers} and Jan {Černocký}",
  title="Speaker vectors from Subspace Gaussian Mixture Model as complementary features for Language Identification",
  booktitle="Proceedings of Odyssey 2012, The Speaker and Language Recognition Workshop",
  year="2012",
  pages="330--333",
  publisher="International Speech Communication Association",
  address="Singapur",
  isbn="978-981-07-3093-2",
  url="http://www.fit.vutbr.cz/research/groups/speech/publi/2012/plchot_odyssey2012_330-333-41.pdf"
}