CHAPTER 4: GENERAL GUIDELINES FOR ADVANCED TRAVELER
INFORMATION SYSTEM (ATIS) CONTROLS

DESIGN OF SPEECH-BASED CONTROLS

Introduction: Design of speech-based controls refers to systems that recognize human speech and treat speech commands as inputs to the ATIS system. As discussed in Reference 1, automatic speech recognition (ASR) systems may be characterized with respect to three sets of design characteristics. First, speaker-dependent systems recognize speech from only one speaker that has been calibrated to the system; speaker-independent systems can recognize speech from many speakers. Second, isolated word recognition systems require that speakers provide a pause or gap between words in a message; continuous speech recognition systems do not require any pause between words. Third, ASR systems vary with respect to the size of the vocabulary that they recognize.

Design Guidelines**
  • Speech controls should be used to aid complex tasks that involve high cognitive, visual, and/or manual requirements.
  • Vocabulary sets for ASR systems should be familiar to drivers and should avoid using similar-sounding words or phrases.
  • Drivers should be provided with immediate feedback of the recognition results or the system's response to the speech input.


Issues to Consider When Designing ASR Systems

Task-Related Issues

Environment-Related Issues

Operator-Related Issues

  • Single versus Dual Task
  • Workload
  • Head Movement Requirements
  • Driving Situation (e.g., effects of stress)
  • Requirements for Feedback
  • Vocabulary Requirements
  • External Noise (e.g., traffic, road noise)
  • Internal Noise (e.g., entertainment system, conversation)
  • Vibration
  • Acceleration/Deceleration G-forces
  • Age
  • Articulation
  • Regional Accents
  • Level of Training
  • Gender


Supporting Rationale: Reference 2 provides considerable discussion of issues and research related to speech controls; the guidelines presented above have been adapted from design principles presented in Reference 2 and, to a lesser extent, Reference 1. The guidelines presented above reflect limited experience in the use of speech as a control device from two technical domains: (1) military information systems and flight control, and (2) the telecommunications field. Case studies and anecdotal results from several applications of speechcontrols can be found in References 1 and 2. Although various commercial speech recognition systems have been developed for automotive applications, published empirical results are few and have not always provided consistent design guidance.

Special Design Considerations: As noted in Reference 2, key issues in the design and implementation of ASR systems include:

Cross References:

ATIS Design for Special Populations

Key References:

1. McMillan, G. R., Eggleston, R. G., & Anderson, T. R. (1997). Nonconventional controls. In G. Salvendy (Ed.), Handbook of human factors and ergonomics (pp. 729- 771). New York: J. Wiley & Sons.

2. Simpson, C. A., McCauley, M. E., Roland, E. F., Ruth, J. C., & Williges, B. H. (1987). Speech controls and displays. In G. Salvendy (Ed.), Handbook of human factors (pp. 549-574). New York: J. Wiley & Sons.

*Primarily expert judgement
** Expert judgement with supporting empirical data
***Empirical data with supporting expert judgement
****Primarily empirical data

Go to previous page
Go to next page