Technical Program

Paper Detail

Paper: SS-12.2
Session: Information Fusion for Multimedia Annotation and Retrieval
Time: Friday, May 21, 13:17 - 13:34
Presentation: Special Session Lecture
Topic: Special Sessions: Information Fusion for Multimedia Annotation and Retrieval
Title: EXPLOITING MULTIPLE MODALITIES FOR INTERACTIVE VIDEO RETRIEVAL
Authors: Michael Christel; Carnegie Mellon University 
 Neema Moraveji; Carnegie Mellon University 
 Chang Huang; Carnegie Mellon University 
 Norman Papernick; Carnegie Mellon University 
Abstract: Aural and visual cues can be automatically extracted from video and used to index its contents. This paper explores the relative merits of the cues extracted from the different modalities for locating relevant shots in video, specifically reporting on the indexing and interface strategies used to retrieve information from the Video TREC 2002 and 2003 data sets, and the evaluation of the interactive search runs. For the documentary and news material in these sets, automated speech recognition produces rich textual descriptions derived from the narrative, with visual descriptions and depictions offering additional browsing functionality. Through speech and visual processing, storyboard interfaces with query-based filtering provide an effective interactive retrieval interface. Examples drawn from the Video TREC 2002 and 2003 search topics and results using these topics illustrate the utility of multiple-document storyboards and other interfaces incorporating the results of multimodal processing.
 

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004