Publication Details

Topic identification of spoken documents using unsupervised acoustic unit discovery

KESIRAJU, S.; PAPPAGARI, R.; ONDEL YANG, L.; BURGET, L.; DEHAK, N.; KHUDANPUR, S.; ČERNOCKÝ, J.; GANGASHETTY, S. Topic identification of spoken documents using unsupervised acoustic unit discovery. In Proceedings of ICASSP 2017. New Orleans: IEEE Signal Processing Society, 2017. p. 5745-5749. ISBN: 978-1-5090-4117-6.

Czech title

Identifikace témat z mluvených dokumentů pomocí automatického hledání řečových jednotek

Type

conference paper

Language

English

Authors

Kesiraju Santosh, Ph.D. (DCGM)
Pappagari Raghavendra
Ondel Lucas Antoine Francois, Mgr., Ph.D. (SSDIT)
Burget Lukáš, doc. Ing., Ph.D. (DCGM)
Dehak Najim
Khudanpur Sanjeev
Černocký Jan, prof. Dr. Ing. (DCGM)
Gangashetty Suryakanth V

URL

http://www.fit.vutbr.cz/research/groups/speech/publi/2017/kesiraju_icassp2017_0005745.pdf PDF

Keywords

topic identification, acoustic unit discovery, unsupervisedlearning, non-parametric Bayesian models

Abstract

This paper investigates the application of unsupervised acoustic unitdiscovery for topic identification (topic ID) of spoken audio documents.The acoustic unit discovery method is based on a nonparametricBayesian phone-loop model that segments a speech utteranceinto phone-like categories. The discovered phone-like (acoustic)units are further fed into the conventional topic ID framework.Using multilingual bottleneck features for the acoustic unit discovery,we show that the proposed method outperforms other systemsthat are based on cross-lingual phoneme recognizer.

Annotation

This paper investigates the application of unsupervised acoustic unit discovery for topic identification (topic ID) of spoken audio documents. The acoustic unit discovery method is based on a nonparametric Bayesian phone-loop model that segments a speech utterance into phone-like categories. The discovered phone-like (acoustic) units are further fed into the conventional topic ID framework. Using multilingual bottleneck features for the acoustic unit discovery, we show that the proposed method outperforms other systems that are based on cross-lingual phoneme recognizer.

Published

2017

Pages

5745–5749

Proceedings

Proceedings of ICASSP 2017

Conference

2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), New Orleans, USA, US

ISBN

978-1-5090-4117-6

Publisher

IEEE Signal Processing Society

Place

New Orleans

DOI

10.1109/ICASSP.2017.7953257

UT WoS

000414286205181

EID Scopus

2-s2.0-85023738949

BibTeX

@inproceedings{BUT144450,
  author="Santosh {Kesiraju} and Raghavendra {Pappagari} and Lucas Antoine Francois {Ondel} and Lukáš {Burget} and Najim {Dehak} and Sanjeev {Khudanpur} and Jan {Černocký} and Suryakanth V {Gangashetty}",
  title="Topic identification of spoken documents using unsupervised acoustic unit discovery",
  booktitle="Proceedings of ICASSP 2017",
  year="2017",
  pages="5745--5749",
  publisher="IEEE Signal Processing Society",
  address="New Orleans",
  doi="10.1109/ICASSP.2017.7953257",
  isbn="978-1-5090-4117-6",
  url="https://www.fit.vut.cz/research/publication/11470/"
}

Files

pdf kesiraju_icassp2017_0005745.pdf 205 kB