Technical Program

Paper Detail

Paper: SP-P2.11
Session: Speaker Adaptation
Time: Tuesday, May 18, 13:00 - 15:00
Presentation: Poster
Topic: Speech Processing: Speaker Recognition
Title: SPEAKER INDEXING AND ADAPTATION USING SPEAKER CLUSTERING BASED ON STATISTICAL MODEL SELECTION
Authors: Masafumi Nishida; Chiba University 
 Tatsuya Kawahara; Kyoto University 
Abstract: This paper addresses unsupervised speaker indexing and automatic speech recognition of discussions. Speaker indexing has two cases: the number of speakers is either unknown or known beforehand. When the number of speakers is unknown, indexing is difficult to apply to varied data because several parameters, such as thresholds, must be determined. In addition, applying a uniform model causes serious problems because utterance durations vary widely across speakers. We therefore propose a method that robustly performs speaker indexing in both cases using a flexible framework in which an optimal speaker model (GMM or VQ) is selected based on the BIC. Moreover, we propose a method that combines speaker adaptation based on speaker selection with the indexing method. On real discussion archives, we demonstrate that indexing performance is higher than that of conventional methods in both cases and that the combination method improves speech recognition performance.
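
The abstract does not spell out how the BIC is used to choose between a GMM and a VQ model, so the following is only a minimal sketch of generic BIC-based model selection, not the authors' formulation. It assumes the common "higher is better" form, BIC = log L - (lambda / 2) * k * log N, where log L is the log-likelihood of the data under a candidate model, k is its number of free parameters, and N is the number of frames. The function names, the penalty weight, and all numbers in the usage lines are hypothetical and chosen purely for illustration.

```python
import math


def bic(log_likelihood, num_params, num_frames, penalty_weight=1.0):
    """Return log L - (penalty_weight / 2) * k * log N (higher is better)."""
    return log_likelihood - 0.5 * penalty_weight * num_params * math.log(num_frames)


def select_model(candidates, num_frames, penalty_weight=1.0):
    """Pick the candidate with the highest BIC.

    candidates maps a model name to (log_likelihood, num_params).
    """
    return max(
        candidates,
        key=lambda name: bic(*candidates[name], num_frames, penalty_weight),
    )


# Hypothetical numbers: a small model (labelled "vq") and a larger one ("gmm").
# With few frames the parameter penalty outweighs the likelihood gain.
short_utterance = {"vq": (-5200.0, 64), "gmm": (-5000.0, 264)}
choice_short = select_model(short_utterance, num_frames=200)

# With many frames the likelihood gain of the larger model dominates.
long_utterance = {"vq": (-520000.0, 64), "gmm": (-500000.0, 264)}
choice_long = select_model(long_utterance, num_frames=20000)
```

With these illustrative values the smaller model is chosen for the short utterance and the larger one for the long utterance, which mirrors the abstract's point that a single uniform model fits poorly when utterance durations vary widely.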
 



©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004