Technical Program

Paper Detail

Paper:SP-P8.3
Session:Voice Activity Detection and Speech Segmentation
Time:Wednesday, May 19, 13:00 - 15:00
Presentation: Poster
Topic: Speech Processing: Speech Analysis
Title: CLUSTERING AND SEGMENTING SPEAKERS AND THEIR LOCATIONS IN MEETINGS
Authors: Jitendra Ajmera; IDIAP 
 Guillaume Lathoud; IDIAP 
 Iain McCowan; IDIAP 
Abstract: This paper presents a new approach toward automatic annotation of meetings in terms of speaker identities and their locations. This is achieved by segmenting the audio recordings using two independent sources of information: magnitude spectrum analysis and sound source localization. We combine the two in an appropriate HMM framework. There are three main advantages of this approach. First, it is completely unsupervised, i.e. speaker identities and number of speakers and locations are automatically inferred. Second, it is threshold-free, i.e. the decisions are made without the need of athreshold value which generally requires an additional development dataset. The third advantage is that the joint segmentation improves over the speaker segmentation derived using only acoustic features. Experiments on a series of meetings recorded in the IDIAP Smart Meeting Room demonstrate the effectiveness of this approach.
 
           Back


Home -||- Organizing Committee -||- Technical Committee -||- Technical Program -||- Plenaries
Paper Submission -||- Special Sessions -||- ITT -||- Paper Review -||- Exhibits -||- Tutorials
Information -||- Registration -||- Travel Insurance -||- Housing -||- Workshops

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004