Technical Program

Paper Detail

Paper:SP-P2.8
Session:Speaker Adaptation
Time:Tuesday, May 18, 13:00 - 15:00
Presentation: Poster
Topic: Speech Processing: Adaptation/Normalization
Title: ENROLLMENT IN LOW-RESOURCE SPEECH RECOGNITION SYSTEMS
Authors: Sabine Deligne; IBM T. J. Watson Research Center 
 Satya Dharanipragada; IBM T. J. Watson Research Center 
Abstract: In this paper we consider the problem of enrollment for low-resource speech recoginition systems designed for noisy environments. Noise robustness concerns, memory and computational constraints along with the use of compact acoustic models for fast Gaussian computation make adaptation especially challenging. We derive a Maximum A Posteriori (MAP) algorithm especially designed for the fast off-line adaptation of these compact acoustic models.It requires less computation and memory than standard Feature-space Maximum Likelihood Linear Regression (FMLLR) which is another technique well suited for compact acoustic models. In our experiments of speaker enrollment for speech recognition in the car, we present a computationally efficient procedure to simulate noisy conditions with the adaptation data. In these experiments, MAP compares favorably with FMLLR in terms of recognition accuracy. Besides, combining FMLLR and MAP significantly outperforms each technique individually, thus providing an efficient alternative for systems with larger resources.
 
           Back


Home -||- Organizing Committee -||- Technical Committee -||- Technical Program -||- Plenaries
Paper Submission -||- Special Sessions -||- ITT -||- Paper Review -||- Exhibits -||- Tutorials
Information -||- Registration -||- Travel Insurance -||- Housing -||- Workshops

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004