Technical Program

Paper Detail

Paper:	SP-P4.6
Session:	Topics in Speech Understanding Systems
Time:	Tuesday, May 18, 15:30 - 17:30
Presentation:	Poster
Topic:	Speech Processing: Spoken Language Systems and Dialog
Title:	UNSUPERVISED AND ACTIVE LEARNING IN AUTOMATIC SPEECH RECOGNITION FOR CALL CLASSIFICATION
Authors:	Dilek Hakkani-Tür; AT&T Labs - Research
	Gokhan Tur; AT&T Labs - Research
	Mazin Rahim; AT&T Labs - Research
	Giuseppe Riccardi; AT&T Labs - Research
Abstract:	A key challenge in rapidly bootstrapping spoken natural language dialog applications is minimizing the manual effort required in transcribing and labeling speech data. This task is not only expensive but it also delays the application creation process. In this paper, we present a novel approach that aim at reducing the amount of manually transcribed in-domain data required for building automatic speech recognition (ASR) models in spoken language dialog. Our method is based on mining relevant text from various conversational systems and web sites. An iterative process is employed where the performance of the models can be improved through both unsupervised and active learning of the ASR models. We have evaluated the robustness of our approach on a call classification task that has been selected from AT&T VoiceTone customer care. Our results indicate that with unsupervised learning it is possible to achieve a call classification performance that is only 1.5% lower than the upper bound set when using all available in-domain transcribed data.

Back

Home -||- Organizing Committee -||- Technical Committee -||- Technical Program -||- Plenaries
Paper Submission -||- Special Sessions -||- ITT -||- Paper Review -||- Exhibits -||- Tutorials
Information -||- Registration -||- Travel Insurance -||- Housing -||- Workshops

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004