Technical Program

Paper Detail

Paper:SP-P14.10
Session:Acoustic Modeling: Tone, Prosody, and Features
Time:Thursday, May 20, 15:30 - 17:30
Presentation: Poster
Topic: Speech Processing: Acoustic Modeling for Speech Recognition
Title: PROSODY-BASED RECOGNITION OF SPOKEN GERMAN VARIETIES
Authors: Vedran Dizdarevic; Graz University of Technology 
 Franz Pernkopf; Graz University of Technology 
 Micha Baum; SPEX 
 Martin Hagmüller; Graz University of Technology 
 Gernot Kubin; Graz University of Technology 
Abstract: An approach to the recognition of regional language varieties is presented. The algorithm is tested on utterances of 3 to 6 seconds duration taken from large speech databases (SpeechDat) of Austrian and German German. The features are based only on the prosody of the speech and include parameters derived from the Fujisaki model and statistics of the fundamental frequency. Classification is performed using a multi layer perceptron and yielded a rate of 64% correct identification of the regional ariety. Those results are then further evaluated for the use of a regional variety recognizer as a front-end of an automatic speech recognizer for different regional varieties. In case there is no a priori information of the distribution of the regional varieties spoken by the users, this approach yields a considerable improvement in the robustness of the speech recognition rates.
 
           Back


Home -||- Organizing Committee -||- Technical Committee -||- Technical Program -||- Plenaries
Paper Submission -||- Special Sessions -||- ITT -||- Paper Review -||- Exhibits -||- Tutorials
Information -||- Registration -||- Travel Insurance -||- Housing -||- Workshops

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004