Technical Program

Paper Detail

Paper:SP-L3.2
Session:Distributed Speech Recognition
Time:Wednesday, May 19, 13:20 - 13:40
Presentation: Lecture
Topic: Speech Processing: Robust Speech Recognition
Title: THE ETSI EXTENDED DISTRIBUTED SPEECH RECOGNITION (DSR) STANDARDS: SERVER-SIDE SPEECH RECONSTRUCTION
Authors: Tenkasi Ramabadran; Motorola 
 Alexander Sorin; IBM 
 Michael McLaughlin; Motorola Labs 
 Dan Chazan; IBM 
 David Pearce; Motorola 
 Ron Hoory; IBM 
Abstract: In this paper we present work that has been carried out in developing the ETSI Extended DSR standards ES 202 211 and ES 202 212 [1][2]. These standards extend the previous ETSI DSR standards: basic front-end ES 201 108 and advanced (noise robust) front-end ES 202 050 respectively. The extensions enable enhanced tonal language recognition as well as server-side speech reconstruction capability. This paper discusses the server-side speech reconstruction whereas a companion paper discusses the front-end extension and tonal language recognition. Experimental results show that the reconstructed speech produced by the standards is highly intelligible under clean and noisy background conditions with the DRT (Diagnostic Rhyme Test) and TT (Transcription Test) scores meeting or exceeding the objective values corresponding to the US DoD (Department of Defence) Federal standard MELP (Mixed-Excitation Linear Predictive) coder operating at 2400 bps.
 
           Back


Home -||- Organizing Committee -||- Technical Committee -||- Technical Program -||- Plenaries
Paper Submission -||- Special Sessions -||- ITT -||- Paper Review -||- Exhibits -||- Tutorials
Information -||- Registration -||- Travel Insurance -||- Housing -||- Workshops

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004