Publication Details

A Hierarchical Subspace Model for Language-Attuned Acoustic Unit Discovery

YUSUF, B.; ONDEL YANG, L.; BURGET, L.; ČERNOCKÝ, J.; SARAÇLAR, M. A Hierarchical Subspace Model for Language-Attuned Acoustic Unit Discovery. In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, Ontario: IEEE Signal Processing Society, 2021. p. 3710-3714. ISBN: 978-1-7281-7605-5.

Czech title

Jazykově adaptovaný hierarchický podprostorový model pro objevování akustických jednotek

Type

conference paper

Language

English

Authors

Yusuf Bolaji (DCGM)
ONDEL YANG, L.
Burget Lukáš, doc. Ing., Ph.D. (DCGM)
Černocký Jan, prof. Dr. Ing. (DCGM)
SARAÇLAR, M.

URL

http://www.fit.vutbr.cz/research/groups/speech/publi/2021/yusuf_icassp2021_09414899.pdf

Keywords

acoustic unit discovery, hierarchical subspace model, unsupervised learning

Abstract

In this work, we propose a hierarchical subspace model for acoustic unit discovery. In this approach, we frame the task as one of learning embeddings on a low-dimensional phonetic subspace, and simultaneously specify the subspace itself as an embedding on a hyper-subspace. We train the hyper-subspace on a set of transcribed languages and transfer it to the target language. In the target language, we infer both the language and unit embeddings in an unsupervised manner, and in so doing, we simultaneously learn a subspace of units specific to that language and the units that dwell on it. We conduct experiments on TIMIT and two low-resource languages: Mboshi and Yoruba. Results show that our model outperforms major acoustic unit discovery techniques, both in terms of clustering quality and segmentation accuracy.

Published

2021

Pages

3710–3714

Proceedings

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Conference

2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, Canada

ISBN

978-1-7281-7605-5

Publisher

IEEE Signal Processing Society

Place

Toronto, Ontario

DOI

10.1109/ICASSP39728.2021.9414899

UT WoS

000704288403193

EID Scopus

2-s2.0-85115178909

BibTeX

@inproceedings{BUT175792,
  author="YUSUF, B. and ONDEL YANG, L. and BURGET, L. and ČERNOCKÝ, J. and SARAÇLAR, M.",
  title="A Hierarchical Subspace Model for Language-Attuned Acoustic Unit Discovery",
  booktitle="ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)",
  year="2021",
  pages="3710--3714",
  publisher="IEEE Signal Processing Society",
  address="Toronto, Ontario",
  doi="10.1109/ICASSP39728.2021.9414899",
  isbn="978-1-7281-7605-5",
  url="https://www.fit.vut.cz/research/publication/12523/"
}

Files

pdf yusuf_icassp2021_09414899.pdf 2 MB