Technical Program

Paper Detail

Paper:SP-L11.3
Session:Language Modeling and Search
Time:Friday, May 21, 16:10 - 16:30
Presentation: Lecture
Topic: Speech Processing: Large Vocabulary Recognition/Search
Title: DEVELOPMENT OF THE 2003 CU-HTK CONVERSATIONAL TELEPHONE SPEECH TRANSCRIPTION SYSTEM
Authors: Gunnar Evermann; Cambridge University 
 H. Y. Chan; Cambridge University 
 Mark J. F. Gales; Cambridge University 
 Thomas Hain; Cambridge University 
 Xunying Liu; Cambridge University 
 David Mrva; Cambridge University 
 Lan Wang; Cambridge University 
 Phil Woodland; Cambridge University 
Abstract: This paper describes the development of the 2003 CU-HTK large vocabulary speech recognition system for Conversational Telephone Speech (CTS). The system was designed based on a multi-pass, multi-branch structure where the output of all branches is combined using system combination. A number of advanced modelling techniques such as Speaker Adaptive Training, Heteroscedastic Linear Discriminant Analysis, Minimum Phone Error estimation and specially constructed Single Pronunciation dictionaries were employed. The effectiveness of each of these techniques and their potential contribution to the result of system combination was evaluated in the framework of a state-of-the-art LVCSR system with sophisticated adaptation. The final 2003 CU-HTK CTS system constructed from some of these models is described and its performance on the DARPA/NIST 2003 Rich Transcription (RT-03) evaluation test set is discussed.
 
           Back


Home -||- Organizing Committee -||- Technical Committee -||- Technical Program -||- Plenaries
Paper Submission -||- Special Sessions -||- ITT -||- Paper Review -||- Exhibits -||- Tutorials
Information -||- Registration -||- Travel Insurance -||- Housing -||- Workshops

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004