Technical Program

Paper Detail

Session:Topics in Speaker and Langauge Recognition
Time:Tuesday, May 18, 15:30 - 17:30
Presentation: Poster
Topic: Speech Processing: Language Identification
Authors: Nagarajan Thangavelu; Indian Institute of Technology, Madras 
 Hema Murthy; Indian Institute of Technology, Madras 
Abstract: Automatic spoken language identification (LID) is the task of identifying the language from a short utterance of the speech signal. The most successful approach to LID uses phone recognizers of several languages in parallel. The basic requirement to build Parallel Phone recognition (PPR) system is annotated corpora. In this paper, a novel approach is proposed for the LID task which uses parallelsyllable-like unit recognizers, in a frame work similar to PPR approach in the literature. The difference is that unsupervised syllable models are built from the training data. The data is first segmented into syllable-like units. The syllable segments are then clustered using an incremental approach. This results in a set of syllable models for each language. Our initial results on OGI_MLTS corpora show that the performance is 69.5%. We further show that if only a subsetof syllable models that are unique (in some sense), are considered, the performance improves to 75.9%.

Home -||- Organizing Committee -||- Technical Committee -||- Technical Program -||- Plenaries
Paper Submission -||- Special Sessions -||- ITT -||- Paper Review -||- Exhibits -||- Tutorials
Information -||- Registration -||- Travel Insurance -||- Housing -||- Workshops

©2015 Conference Management Services, Inc. -||- email: -||- Last updated Wednesday, April 07, 2004