Technical Program

Paper Detail

Paper:SP-P14.6
Session:Acoustic Modeling: Tone, Prosody, and Features
Time:Thursday, May 20, 15:30 - 17:30
Presentation: Poster
Topic: Speech Processing: Acoustic Modeling for Speech Recognition
Title: A STUDY ON ROBUST SEGMENTATION AND LOCATION OF TONE NUCLEI IN CHINESE CONTINUOUS SPEECH
Authors: Jinsong Zhang; ATR, Spoken Language Translation Laboratories 
 Keikichi Hirose; University of Tokyo 
Abstract: Tone nuclei in continuous speech are regarded as efficient targets for either tone recognition or intonation function decomposition. This paper presents our statistically robust method to segment and locate tone nuclei in continuous speech. The method includes: an iterative segmental K-means segmentation of the tonal F0 contours, which is further aided with T-Test based segment almalgamation. And a linear discriminant function based tone nucleus discriminator, whose features are selected by the sequential feature selection method. The developed system achieved 97.5% tone nuclei correct rate on a speaker dependent task. The tone recognizer based on the detected tone nuclei improved tone recognition rate by more than 6% than the baseline ones using the full tonal syllable features.
 
           Back


Home -||- Organizing Committee -||- Technical Committee -||- Technical Program -||- Plenaries
Paper Submission -||- Special Sessions -||- ITT -||- Paper Review -||- Exhibits -||- Tutorials
Information -||- Registration -||- Travel Insurance -||- Housing -||- Workshops

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004