Publication Details

Linguistic Unit Discovery from Multi-Modal Inputs in Unwritten Languages: Summary of the 'Speaking Rosetta' JSALT 2017 Workshop

SCHARENBORG, O.; BESACIER, L.; BLACK, A.; HASEGAWA-JOHNSON, M.; METZE, F.; NEUBIG, G.; STÜKER, S.; GODARD, P.; MÜLLER, M.; ONDEL YANG, L.; PALASKAR, S.; ARTHUR, P.; CIANNELLA, F.; DU, M.; LARSEN, E.; MERKX, D.; RIAD, R.; WANG, L.; DUPOUX, E. Linguistic Unit Discovery from Multi-Modal Inputs in Unwritten Languages: Summary of the 'Speaking Rosetta' JSALT 2017 Workshop. In Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018. p. 4979-4983. ISBN: 978-1-5386-4658-8.
Czech title
Objevování lingvistických jednotek z multi-modálních vstupů v nepsaných jazycích - souhrn JSALT 2017 workshopu "řečová Rosettská deska"
Type
conference paper
Language
English
Authors
SCHARENBORG, O.
BESACIER, L.
BLACK, A.
Hasegawa-Johnson Mark (FIT)
Metze Florian
NEUBIG, G.
STÜKER, S.
GODARD, P.
MÜLLER, M.
ONDEL YANG, L.
PALASKAR, S.
ARTHUR, P.
CIANNELLA, F.
DU, M.
LARSEN, E.
MERKX, D.
RIAD, R.
WANG, L.
Dupoux Emmanuel (FIT)
URL
http://www.fit.vutbr.cz/research/groups/speech/publi/2018/scharenborg_icassp2018_0004979.pdf
Keywords

unwritten languages, multi-modal data, unsupervised unit discovery, image retrieval, machine translation.

Abstract

We summarize the accomplishments of a multi-disciplinary workshop exploring the computational and scientific issues surrounding the discovery of linguistic units (subwords and words) in a language without orthography. We study the replacement of orthographic transcriptions by images and/or translated text in a well-resourced language to help unsupervised discovery from raw speech.

Published
2018
Pages
4979–4983
Proceedings
Proceedings of ICASSP 2018
ISBN
978-1-5386-4658-8
Publisher
IEEE Signal Processing Society
Place
Calgary
DOI
10.1109/ICASSP.2018.8461761
UT WoS
000446384605030
EID Scopus
BibTeX
@inproceedings{BUT155040,
  author="SCHARENBORG, O. and BESACIER, L. and BLACK, A. and HASEGAWA-JOHNSON, M. and METZE, F. and NEUBIG, G. and STÜKER, S. and GODARD, P. and MÜLLER, M. and ONDEL YANG, L. and PALASKAR, S. and ARTHUR, P. and CIANNELLA, F. and DU, M. and LARSEN, E. and MERKX, D. and RIAD, R. and WANG, L. and DUPOUX, E.",
  title="Linguistic Unit Discovery from Multi-Modal Inputs in Unwritten Languages: Summary of the 'Speaking Rosetta' JSALT 2017 Workshop",
  booktitle="Proceedings of ICASSP 2018",
  year="2018",
  pages="4979--4983",
  publisher="IEEE Signal Processing Society",
  address="Calgary",
  doi="10.1109/ICASSP.2018.8461761",
  isbn="978-1-5386-4658-8",
  url="http://www.fit.vutbr.cz/research/groups/speech/publi/2018/scharenborg_icassp2018_0004979.pdf"
}