Publication Details

Factorization of Discriminatively Trained i-Vector Extractor for Speaker Recognition

NOVOTNÝ, O.; PLCHOT, O.; GLEMBEK, O.; BURGET, L. Factorization of Discriminatively Trained i-Vector Extractor for Speaker Recognition. In Proceedings of Interspeech. Proceedings of Interspeech. Graz: International Speech Communication Association, 2019. p. 4330-4334. ISSN: 1990-9772.
Czech title
Faktorizace diskriminativně trénovaného extraktoru i-vektorů pro rozpoznávání mluvčího
Type
conference paper
Language
English
Authors
Novotný Ondřej, Ing., Ph.D.
Plchot Oldřich, Ing., Ph.D. (DCGM)
Glembek Ondřej, Ing., Ph.D.
Burget Lukáš, doc. Ing., Ph.D. (DCGM)
URL
Keywords

SRE

Abstract

In this work, we continue in our research on i-vector extractorfor speaker verification (SV) and we optimize its architecturefor fast and effective discriminative training. We were motivatedby computational and memory requirements caused bythe large number of parameters of the original generative ivectormodel. Our aim is to preserve the power of the originalgenerative model, and at the same time focus the model towardsextraction of speaker-related information. We show that it ispossible to represent a standard generative i-vector extractor bya model with significantly less parameters and obtain similarperformance on SV tasks. We can further refine this compactmodel by discriminative training and obtain i-vectors that leadto better performance on various SV benchmarks representingdifferent acoustic domains.

Published
2019
Pages
4330–4334
Journal
Proceedings of Interspeech, vol. 2019, no. 9, ISSN 1990-9772
Proceedings
Proceedings of Interspeech
Conference
Interspeech Conference, Graz, AT
Publisher
International Speech Communication Association
Place
Graz
DOI
UT WoS
000831796404095
EID Scopus
BibTeX
@inproceedings{BUT159998,
  author="Ondřej {Novotný} and Oldřich {Plchot} and Ondřej {Glembek} and Lukáš {Burget}",
  title="Factorization of Discriminatively Trained i-Vector Extractor for Speaker Recognition",
  booktitle="Proceedings of Interspeech",
  year="2019",
  journal="Proceedings of Interspeech",
  volume="2019",
  number="9",
  pages="4330--4334",
  publisher="International Speech Communication Association",
  address="Graz",
  doi="10.21437/Interspeech.2019-1757",
  issn="1990-9772",
  url="https://www.isca-speech.org/archive/Interspeech_2019/pdfs/1757.pdf"
}
Files
Back to top