Paper: | SP-P4.10 | ||
Session: | Topics in Speech Understanding Systems | ||
Time: | Tuesday, May 18, 15:30 - 17:30 | ||
Presentation: | Poster | ||
Topic: | Speech Processing: Spoken Language Systems and Dialog | ||
Title: | IMPROVING PHONEME RECOGNITION OF TELEPHONE QUALITY SPEECH | ||
Authors: | Qiang Huang; University of East Anglia | ||
Stephen Cox; University of East Anglia | |||
Abstract: | There are some speech understanding applications in which training transcriptions are unavailable, and hence the vocabulary is unknown, but the task is to recognise key words and phrases within an utterance rather than to attempt a complete, accurate transcription. An example of such a task is call-routing, when transcriptions of training utterances (which are very expensive to produce) are unavailable. In such cases, phoneme rather than word recognition is appropriate. However, phoneme recognition of spontaneous speech spoken by a large multi-accent population over telephone connections is very inaccurate. To improve accuracy, we describe a technique in which we segment the waveform into subword-like units and use clustering and iteratively refined language model to correct the errors in the recognised phonemes. The results show a (46.76-28.06) reduction in phoneme error-rate | ||
Back |
Home -||-
Organizing Committee -||-
Technical Committee -||-
Technical Program -||-
Plenaries
Paper Submission -||-
Special Sessions -||-
ITT -||-
Paper Review -||-
Exhibits -||-
Tutorials
Information -||-
Registration -||-
Travel Insurance -||-
Housing -||-
Workshops