Technical Program

Paper Detail

Paper:SP-P12.6
Session:Acoustic Modeling: Model Complexity, General Topics
Time:Thursday, May 20, 09:30 - 11:30
Presentation: Poster
Topic: Speech Processing: Acoustic Modeling for Speech Recognition
Title: OPTIMIZING ACOUSTIC MODELS FOR COMMERCIAL SPEECH RECOGNITION USING FOREGROUND SCORES AND DATA WEIGHTING
Authors: Daniel Boies; Nuance Communications 
 Brian Strope; Nuance Communications 
 Mitchel Weintraub; Nuance Communications 
 Su-Lin Wu; Nuance Communications 
Abstract: This paper describes a data-driven technique for optimizing the acoustic models for speech recognition systems that target commercial applications over telephones. Frame-averaged foreground log-likelihoods (foreground scores) correlate to recognition errors. These scores are used together with gender to optimize data weighting for the acoustic model. This process is interpreted as increasing the priors and associated parameters for poorly modeled data. The score-based optimization leads to about 7% fewer semantic errors on a live evaluation set collected after the last data used to estimate the acoustic model.
 
           Back


Home -||- Organizing Committee -||- Technical Committee -||- Technical Program -||- Plenaries
Paper Submission -||- Special Sessions -||- ITT -||- Paper Review -||- Exhibits -||- Tutorials
Information -||- Registration -||- Travel Insurance -||- Housing -||- Workshops

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004