Technical Program

Paper Detail

Paper:	SP-P12.12
Session:	Acoustic Modeling: Model Complexity, General Topics
Time:	Thursday, May 20, 09:30 - 11:30
Presentation:	Poster
Topic:	Speech Processing: Acoustic Modeling for Speech Recognition
Title:	TRAINING FOR POLYNOMIAL SEGMENT MODEL USING THE EXPECTATION MAXIMIZATION ALGORITHM
Authors:	Chak-Fai Li; Hong Kong University of Science and Technology
	Man-Hung Siu; Hong Kong University of Science and Technology
Abstract:	One of the difficulties in using polynomial segment model (PSM) to capture the temporal correlations within a phonetic segment is the lack of an efficient training algorithm comparable with the Baum-Welch algorithm in HMM. In our previous paper, we introduced a recursive likelihood computation algorithm for PSM recognition and can perform Viterbi-style training. In this paper, we extend the recurrsive likelihood computation into a fast forward-backward PSM training algorithm that maximizes PSM likelihood. In addition, we introduce an improved PSM, dynamic multi-segment PSM, that allows a data-driven alignment betweens observations and the segment trajectory. The dynamic multi-segment PSM model outperforms HMM and traditional PSM in both phone classification and phone recognition tasks on the TIMIT corpus.

Back

Home -||- Organizing Committee -||- Technical Committee -||- Technical Program -||- Plenaries
Paper Submission -||- Special Sessions -||- ITT -||- Paper Review -||- Exhibits -||- Tutorials
Information -||- Registration -||- Travel Insurance -||- Housing -||- Workshops

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004