Publication Details

Factorization of Discriminatively Trained i-Vector Extractor for Speaker Recognition

NOVOTNÝ, O.; PLCHOT, O.; GLEMBEK, O.; BURGET, L. Factorization of Discriminatively Trained i-Vector Extractor for Speaker Recognition. In Proceedings of Interspeech. Graz: International Speech Communication Association, 2019. p. 4330-4334. ISSN: 1990-9772.

Czech title

Faktorizace diskriminativně trénovaného extraktoru i-vektorů pro rozpoznávání mluvčího

Type

conference paper

Language

English

Authors

Novotný Ondřej, Ing., Ph.D.
Plchot Oldřich, Ing., Ph.D. (DCGM)
Glembek Ondřej, Ing., Ph.D.
Burget Lukáš, doc. Ing., Ph.D. (DCGM)

URL

Keywords

SRE

Abstract

In this work, we continue our research on the i-vector extractor for speaker verification (SV) and optimize its architecture for fast and effective discriminative training. We were motivated by the computational and memory requirements caused by the large number of parameters of the original generative i-vector model. Our aim is to preserve the power of the original generative model while focusing the model on the extraction of speaker-related information. We show that it is possible to represent a standard generative i-vector extractor by a model with significantly fewer parameters and obtain similar performance on SV tasks. We can further refine this compact model by discriminative training and obtain i-vectors that lead to better performance on various SV benchmarks representing different acoustic domains.

Published

2019

Pages

4330–4334

Journal

Proceedings of Interspeech, vol. 2019, no. 9, ISSN 1990-9772

Proceedings

Proceedings of Interspeech

Conference

Interspeech Conference, Graz, AT

Publisher

International Speech Communication Association

Place

Graz

DOI

10.21437/Interspeech.2019-1757

UT WoS

000831796404095

EID Scopus

2-s2.0-85074713812

BibTeX

@inproceedings{BUT159998,
  author="Ondřej {Novotný} and Oldřich {Plchot} and Ondřej {Glembek} and Lukáš {Burget}",
  title="Factorization of Discriminatively Trained i-Vector Extractor for Speaker Recognition",
  booktitle="Proceedings of Interspeech",
  year="2019",
  journal="Proceedings of Interspeech",
  volume="2019",
  number="9",
  pages="4330--4334",
  publisher="International Speech Communication Association",
  address="Graz",
  doi="10.21437/Interspeech.2019-1757",
  issn="1990-9772",
  url="https://www.isca-speech.org/archive/Interspeech_2019/pdfs/1757.pdf"
}

Files

pdf novotny_is2019_191757.pdf 279 kB