Technical Program

Paper Detail

Paper:SP-P13.5
Session:General Topics in Robust Speech Recognition
Time:Thursday, May 20, 13:00 - 15:00
Presentation: Poster
Topic: Speech Processing: Robust Speech Recognition
Title: A FACTORIAL HMM APPROACH TO SIMULTANEOUS RECOGNITION OF ISOLATED DIGITS SPOKEN BY MULTIPLE TALKERS ON ONE AUDIO CHANNEL
Authors: Ameya Deoras; University of Illinois at Urbana-Champaign 
 Mark Hasegawa-Johnson; University of Illinois at Urbana-Champaign 
Abstract: This paper addresses the novel problem of recognizing digits spoken simultaneously by two different talkers. A Factorial Hidden Markov Model architecture is proposed to accurately model the simultaneous utterance of two digits. Nadas’ MIXMAX approximation is extended to a mixture of Gaussians observation PDF which enables the implementation of the proposed system. The multiple digit recognizer is found to successfully recognize pairs of simultaneous utterances of digits at 0db SNR with up to 89% accuracy.
 
           Back


Home -||- Organizing Committee -||- Technical Committee -||- Technical Program -||- Plenaries
Paper Submission -||- Special Sessions -||- ITT -||- Paper Review -||- Exhibits -||- Tutorials
Information -||- Registration -||- Travel Insurance -||- Housing -||- Workshops

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004