Technical Program

Paper Detail

Paper:SP-L6.1
Session:Feature Analysis for Speech Recognition
Time:Thursday, May 20, 13:00 - 13:20
Presentation: Lecture
Topic: Speech Processing: Speech Analysis
Title: NON-UNIFORM SPEAKER NORMALIZATION USING AFFINE-TRANSFORMATION
Authors: Bharath Kumar SV; General Electric - Global Research 
 Umesh S.; Indian Institute of Technology 
 Rohit Sinha; Indian Institute of Technology 
Abstract: In this paper, we propose a mathematical model to describe therelation between the formant frequencies of speakers and show that with the proposed affine model, speaker differences separate out as translation factors when a ''mel-like'' warping is performed. Using speech data we estimate the parameters of this warping function and show that it is close to the usual mel-formula. This model is motivated by Rohit et al.'s [1] shift-based non-uniform speaker-normalization method, which provides improvement over the conventional maximum-likelihood based speaker normalization methods. We therefore provide a unified framework that relates the relationship between formants of speakers and method of removing speakers difference (which involves mel-warping) in a neat mathematical framework which is substantiated by our recognition experiments.
 
           Back


Home -||- Organizing Committee -||- Technical Committee -||- Technical Program -||- Plenaries
Paper Submission -||- Special Sessions -||- ITT -||- Paper Review -||- Exhibits -||- Tutorials
Information -||- Registration -||- Travel Insurance -||- Housing -||- Workshops

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004