Loading...
2009
A Novel Integration Scheme for Audio Visual Speech Recognition
A Novel Integration Scheme for Audio Visual Speech Recognition
한국음향학회
김진영 외 1명
논문정보
- Publisher
- 한국음향학회지
- Issue Date
- 2009-11-30
- Keywords
- -
- Citation
- -
- Source
- -
- Journal Title
- -
- Volume
- 28
- Number
- 8
- Start Page
- 832
- End Page
- 842
- DOI
- ISSN
- 12254428
Abstract
Automatic speech recognition (ASR) has been successfully applied to many real human computer interaction (HCI) applications; however, its performance tends to be significantly decreased under noisy environments. The invention of audio visual speech recognition (AVSR) using an acoustic signal and lip motion has recently attracted more attention due to its noise-robustness characteristic. In this paper, we describe our novel integration scheme for AVSR based on a late integration approach. Firstly, we introduce the robust reliability measurement for audio and visual modalities using model based information and signal based information. The model based sources measure the confusability of vocabulary while the signal is used to estimate the noise level. Secondly, the output probabilities of audio and visual speech recognizers are normalized respectively before applying the final integration step using normalized output space and estimated weights. We evaluate the performance of our proposed method via Korean isolated word recognition system. The experimental results demonstrate the effectiveness and feasibility of our proposed system compared to the conventional systems.
- 전남대학교
- KCI
- 한국음향학회지
저자 정보
| 이름 | 소속 |
|---|---|
| 김진영 | 지능전자컴퓨터공학과 |