Publication Details

Automatic Speech Analysis Framework for ATC Communication in HAAWAII

MOTLÍČEK, P.; PRASAD, A.; NIGMATULINA, I.; HELMKE, H.; OHNEISER, O.; KLEINERT, M. Automatic Speech Analysis Framework for ATC Communication in HAAWAII. In SESAR Innovation Days. Seville: SESAR Joint Undertaking, 2023. p. 1-9. ISSN: 0770-1268.
Czech title
Systém pro automatickou analýzu řeči pro letecké komunikace v projektu HAAWAII
Type
conference paper
Language
English
Authors
Motlíček Petr, doc. Ing., Ph.D. (DCGM)
Prasad Amrutha (DCGM)
NIGMATULINA, I.
HELMKE, H.
OHNEISER, O.
KLEINERT, M.
URL
Keywords

HAAWAII project, Speech activity detection, Speaker segmentation, Speaker role
classification, Automatic Speech Recognition.

Abstract

Over the past years, several SESAR funded ex- ploratory projects focused on
bringing speech and language technologies to the Air Traffic Management (ATM)
domain and demonstrating their added value through successful applications.
Recently ended HAAWAII project developed a generic archi- tecture and framework,
which was validated through several tasks such as callsign highlighting,
pre-filling radar labels, and readback error detection. The primary goal was to
support pilot and air traffic controller communication by deploying Automatic
Speech Recognition (ASR) engines. Contextual information (if available) extracted
from surveillance data, flight plan data, or previous communication can be
exploited via entity boosting to further improve the recognition performance.
HAAWAII proposed various design attributes to integrate the ASR engine into the
ATM framework, often depending on concrete technical specifics of target air
navigation service providers (ANSPs). This paper gives a brief overview and
provides an objective assessment of speech processing components developed and
integrated into the HAAWAII framework. Specifically, the following tasks are
evaluated w.r.t. application domain: (i) speech activity detection, (ii) speaker
segmentation and speaker role classification, as well as (iii) ASR. To our best
knowledge, HAAWAII framework offers the best performing speech technologies for
ATM, reaching high recognition accuracy (i.e., error-correction done by
exploiting additional contextual data), robustness (i.e., models developed using
large training corpora) and support for rapid domain transfer (i.e., to new ATM
sector with minimum investment). Two scenarios provided by ANSPs were used for
testing, achieving callsign detection accuracy of about 96% and 95% for NATS and
ISAVIA, respectively.

Published
2023
Pages
1–9
Proceedings
SESAR Innovation Days
Volume
2023
Number
11
Conference
13th SESAR Innovation Days, Seville, ES
Publisher
SESAR Joint Undertaking
Place
Seville
EID Scopus
BibTeX
@inproceedings{BUT187933,
  author="MOTLÍČEK, P. and PRASAD, A. and NIGMATULINA, I. and HELMKE, H. and OHNEISER, O. and KLEINERT, M.",
  title="Automatic Speech Analysis Framework for ATC Communication in HAAWAII",
  booktitle="SESAR Innovation Days",
  year="2023",
  volume="2023",
  number="11",
  pages="1--9",
  publisher="SESAR Joint Undertaking",
  address="Seville",
  issn="0770-1268",
  url="https://www.sesarju.eu/sites/default/files/documents/sid/2023/Papers/SIDs_2023_paper_72%20final.pdf"
}
Files
Back to top