NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
Spoken Language Processing in the Clarissa Procedure BrowserClarissa, an experimental voice enabled procedure browser that has recently been deployed on the International Space Station, is as far as we know the first spoken dialog system in space. We describe the objectives of the Clarissa project and the system's architecture. In particular, we focus on three key problems: grammar-based speech recognition using the Regulus toolkit; methods for open mic speech recognition; and robust side-effect free dialogue management for handling undos, corrections and confirmations. We first describe the grammar-based recogniser we have build using Regulus, and report experiments where we compare it against a class N-gram recogniser trained off the same 3297 utterance dataset. We obtained a 15% relative improvement in WER and a 37% improvement in semantic error rate. The grammar-based recogniser moreover outperforms the class N-gram version for utterances of all lengths from 1 to 9 words inclusive. The central problem in building an open-mic speech recognition system is being able to distinguish between commands directed at the system, and other material (cross-talk), which should be rejected. Most spoken dialogue systems make the accept/reject decision by applying a threshold to the recognition confidence score. NASA shows how a simple and general method, based on standard approaches to document classification using Support Vector Machines, can give substantially better performance, and report experiments showing a relative reduction in the task-level error rate by about 25% compared to the baseline confidence threshold method. Finally, we describe a general side-effect free dialogue management architecture that we have implemented in Clarissa, which extends the "update semantics'' framework by including task as well as dialogue information in the information state. We show that this enables elegant treatments of several dialogue management problems, including corrections, confirmations, querying of the environment, and regression testing.
Document ID
20050241713
Acquisition Source
Ames Research Center
Document Type
Preprint (Draft being sent to journal)
Authors
Rayner, M.
(NASA Ames Research Center Moffett Field, CA, United States)
Hockey, B. A.
(NASA Ames Research Center Moffett Field, CA, United States)
Renders, J.-M.
(Xerox Research Center Europe Meylan, France)
Chatzichrisafis, N.
(Research Inst. for Advanced Computer Science Moffett Field, CA, United States)
Farrell, K.
(Research Inst. for Advanced Computer Science Moffett Field, CA, United States)
Date Acquired
August 23, 2013
Publication Date
January 1, 2005
Subject Category
Cybernetics, Artificial Intelligence And Robotics
Funding Number(s)
CONTRACT_GRANT: IST-2002-506778
Distribution Limits
Public
Copyright
Other

Available Downloads

There are no available downloads for this record.
No Preview Available