DESIGN OF SPEECH-BASED CONTROLS
Introduction: Design of speech-based controls refers to systems that recognize human speech and treat speech commands as inputs to the ATIS system. As discussed in Reference 1, automatic speech recognition (ASR) systems may be characterized with respect to three sets of design characteristics. First, speaker-dependent systems recognize speech from only one speaker that has been calibrated to the system; speaker-independent systems can recognize speech from many speakers. Second, isolated word recognition systems require that speakers provide a pause or gap between words in a message; continuous speech recognition systems do not require any pause between words. Third, ASR systems vary with respect to the size of the vocabulary that they recognize.
|
Design Guidelines**
|
|
Issues to Consider When Designing ASR Systems
|
Task-Related Issues |
Environment-Related Issues |
Operator-Related Issues |
|
|
|
Supporting Rationale: Reference 2 provides considerable discussion of
issues and research related to speech controls; the guidelines presented above
have been adapted from design principles presented in Reference 2 and, to a
lesser extent, Reference 1. The guidelines presented above reflect limited experience
in the use of speech as a control device from two technical domains: (1) military
information systems and flight control, and (2) the telecommunications field.
Case studies and anecdotal results from several applications of speechcontrols
can be found in References 1 and 2. Although various commercial speech recognition
systems have been developed for automotive applications, published empirical
results are few and have not always provided consistent design guidance.
Special Design Considerations: As noted in Reference 2, key issues in the design and implementation of ASR systems include:
Cross References:
ATIS Design for Special Populations
Key References:
1. McMillan, G. R., Eggleston, R. G., & Anderson, T. R. (1997). Nonconventional controls. In G. Salvendy (Ed.), Handbook of human factors and ergonomics (pp. 729- 771). New York: J. Wiley & Sons.
2. Simpson, C. A., McCauley, M. E., Roland, E. F., Ruth, J. C., & Williges, B. H. (1987). Speech controls and displays. In G. Salvendy (Ed.), Handbook of human factors (pp. 549-574). New York: J. Wiley & Sons.
*Primarily expert judgement
** Expert judgement with supporting empirical data
***Empirical data with supporting expert judgement
****Primarily empirical data