Result Details
Tensorflow implementation of speaker recognition with x-vector topology
Created: 2019
Type
software
Language
English
Authors
Zeinali Hossein, Ph.D., DCGM (FIT)
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Rohdin Johan Andréas, M.Sc., Ph.D., FIT (FIT), DCGM (FIT)
Stafylakis Themos
Černocký Jan, prof. Dr. Ing., DCGM (FIT)
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Rohdin Johan Andréas, M.Sc., Ph.D., FIT (FIT), DCGM (FIT)
Stafylakis Themos
Černocký Jan, prof. Dr. Ing., DCGM (FIT)
Description
This is a Tensorflow implementation of x-vector topology (speaker embedding). It uses Kaldi toolkit for data processing. We train the model using Tensorflow and also extract speaker embeddings (x-vectors) using it. This allow to train or retrain the system to the particular customer specific domain or provides the ability to modify the topology or training schema to achieve better performance for the specific domain.
This software is a result of Czech Ministry of Interior project "Dolování infoRmAcí z řeči Pořízené vzdÁlenými miKrofony - DRAPÁK", No. VI20152020025

Keywords
Speaker recognition, speaker embedding, DNN, x-vectors, retraining
URL
License
Use of the result by another entity is possible without acquiring a license (the result is not licensed)
License Fee
The licensor does not require a license fee for the result
Files
Projects
Information mining in speech acquired by distant microphones, MV, Bezpečnostní výzkum České republiky 2015-2020, VI20152020025, start: 2015-10-01, end: 2020-09-30, completed
Research groups
Speech Data Mining Research Group BUT Speech@FIT (RG SPEECH)
Departments