NTRS - NASA Technical Reports Server

Towards Understanding Data Requirements for Developing Automatic Speech Recognition Systems for Air Traffic Control

In recent years, automatic speech recognition has gained popularity across diverse industries, including aviation. Given the many applications that focus on transcribing air traffic control and management communication, this paper explores training OpenAI's Whisper model across multiple existing public and private air traffic control voice datasets in an effort to improve robustness. Combining more than 60 hours of audio from various air traffic datasets, our goal is to train a unified Whisper model, which we expect to reduce the average word error rate across the testing datasets. Furthermore, this work aims to understand the data quantity requirements for achieving state-of-the-art results by training Whisper on a range of dataset sizes. This work has the potential to improve automatic speech recognition performance across the domain, to improve understanding of the quantity of data required by an aviation speech recognition system, and to provide metrics that future research can compare against and improve upon.
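The abstract's evaluation metric, average word error rate (WER) across testing datasets, can be illustrated with a minimal sketch. This is not the authors' evaluation code: the text normalization (lowercasing and whitespace splitting), the unweighted averaging across datasets, the function names, and the sample transcripts are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple


def word_edit_distance(ref: List[str], hyp: List[str]) -> int:
    """Levenshtein distance between two word sequences
    (substitutions, deletions, and insertions each cost 1)."""
    prev = list(range(len(hyp) + 1))  # distances for an empty reference prefix
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1]


def word_error_rate(pairs: List[Tuple[str, str]]) -> float:
    """Corpus-level WER: total word errors divided by total reference words.
    Each pair is (reference transcript, model hypothesis)."""
    errors = 0
    ref_words = 0
    for reference, hypothesis in pairs:
        ref = reference.lower().split()  # assumed normalization
        hyp = hypothesis.lower().split()
        errors += word_edit_distance(ref, hyp)
        ref_words += len(ref)
    if ref_words == 0:
        raise ValueError("reference transcripts contain no words")
    return errors / ref_words


def average_wer(datasets: Dict[str, List[Tuple[str, str]]]) -> float:
    """Unweighted mean of per-dataset WER (an assumed averaging scheme)."""
    return sum(word_error_rate(p) for p in datasets.values()) / len(datasets)


# Hypothetical transcripts, invented for this example only.
datasets = {
    "dataset_a": [(
        "delta one two three climb flight level three five zero",
        "delta one two tree climb level three five zero",
    )],
    "dataset_b": [("contact tower", "contact tower")],
}
per_dataset = {name: word_error_rate(p) for name, p in datasets.items()}
mean_wer = average_wer(datasets)
```

Here `dataset_a` has one substitution and one deletion against ten reference words, while `dataset_b` is transcribed exactly. Averaging per-dataset WER without weighting keeps a large dataset from dominating the summary, though weighting by reference word count is an equally reasonable choice.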
Document ID
20240014494
Acquisition Source
Ames Research Center
Document Type
Extended Abstract
Authors
Stephen S B Clarke
(Ames Research Center Mountain View, United States)
David Nielsen
(KBR (United States) Houston, Texas, United States)
Charles I Cutler
(Metis Technology Solutions)
Aida Sharif Rohani
(Ames Research Center Mountain View, United States)
Krishna M Kalyanam
(Ames Research Center Mountain View, United States)
Date Acquired
November 14, 2024
Subject Category
Air Transportation and Safety
Meeting Information
Meeting: Aviation Forum
Location: Las Vegas, NV
Country: US
Start Date: July 21, 2025
End Date: July 25, 2025
Sponsors: American Institute of Aeronautics and Astronautics AIAA
Funding Number(s)
WBS: 533127.02.70.01.07
Distribution Limits
Public
Copyright
Public Use Permitted.
Technical Review
Single Expert
Keywords
ATC
Whisper
ML
Machine Learning
STT
Speech-To-Text
ASR
Automatic Speech Recognition
Air Traffic Control
ATCo