SCOPUS 정보 검색 플랫폼

Volumn 5, Issue , 2006, Pages 2458-2461

Adaptive multimodal fusion by uncertainty compensation

Author keywords

Active appearance models; Audiovisual speech recognition; Multimodal fusion; Product HMMs; Stream weights; Uncertainty compensation

Indexed keywords

DEEP NEURAL NETWORKS; PATTERN RECOGNITION; SPEECH ANALYSIS; UNCERTAINTY ANALYSIS;

ACTIVE APPEARANCE MODELS; AUDIO VISUAL SPEECH RECOGNITION; MULTI-MODAL FUSION; PRODUCT HMMS; STREAM WEIGHTS;

SPEECH RECOGNITION;

EID: 44949227080 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (23)

References (18)

1
- 4544290191
- Automatic recognition of audio-visual speech: Recent progress and challenges
- G. Potamianos, C. Neti. G. Gravier, and A. Garg, "Automatic recognition of audio-visual speech: Recent progress and challenges," Proc. of the IEEE, vol. 91, no. 9, pp. 1306-1326, 2003.
- (2003) Proc. of the IEEE , vol.91 , Issue.9 , pp. 1306-1326
- Potamianos, G.¹ Neti, C.² Gravier, G.³ Garg, A.⁴

2
- 0034825241
- Multi-stream adaptive evidence combination for noise robust ASR
- A. Morris, A. Hagen, H. Glotin, and H. Bourlard, "Multi-stream adaptive evidence combination for noise robust ASR," Speech Communication, vol. 34, pp. 25-40, 2001.
- (2001) Speech Communication , vol.34 , pp. 25-40
- Morris, A.¹ Hagen, A.² Glotin, H.³ Bourlard, H.⁴

3
- 33947622692
- Stream weight computation for multi-stream classifiers
- A. Potamianos, E. Sanchez-Soto, and K. Daoudi, "Stream weight computation for multi-stream classifiers," in Proc. ICASSP, 2006.
- (2006) Proc. ICASSP
- Potamianos, A.¹ Sanchez-Soto, E.² Daoudi, K.³

4
- 0034842451
- Weighting schemes for audio-visual fusion in speech recognition
- H. Glotin, D. Vergyri, C. Neti, G. Potamianos, and J. Luettin, "Weighting schemes for audio-visual fusion in speech recognition," in Proc. ICASSP, 2001.
- (2001) Proc. ICASSP
- Glotin, H.¹ Vergyri, D.² Neti, C.³ Potamianos, G.⁴ Luettin, J.⁵

5
- 0027681974
- ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition
- V. Digalakis, J.R. Rohlicek, and M. Ostendorf, "ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition," IEEE TSAP, pp. 431-442, 1993.
- (1993) IEEE TSAP , pp. 431-442
- Digalakis, V.¹ Rohlicek, J.R.² Ostendorf, M.³

6
- 0028420014
- Integrated models of signal and background with application to speaker identification in noise
- R. C. Rose, E. M. Hofstetter, and D. A. Reynolds, "Integrated models of signal and background with application to speaker identification in noise," IEEE TSAP, vol. 2, no. 2, pp. 245-257, 1994.
- (1994) IEEE TSAP , vol.2 , Issue.2 , pp. 245-257
- Rose, R.C.¹ Hofstetter, E.M.² Reynolds, D.A.³

7
- 0036508276
- Speaker verification in noise using a stochastic version of the weighted viterbi algorithm
- N.B Yoma and M. Villar, "Speaker verification in noise using a stochastic version of the weighted viterbi algorithm," IEEE TSAP, vol. 10, no. 3, pp. 158-166, 2002.
- (2002) IEEE TSAP , vol.10 , Issue.3 , pp. 158-166
- Yoma, N.B.¹ Villar, M.²

8
- 18744401086
- Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion
- L. Deng, J. Dropo, and A. Acero, "Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion," IEEE TSAP, vol. 13, no. 3, pp. 412-421, 2005.
- (2005) IEEE TSAP , vol.13 , Issue.3 , pp. 412-421
- Deng, L.¹ Dropo, J.² Acero, A.³

9
- 0034270644
- Audio-visual speech modeling for continuous speech recognition
- S. Dupont and J. Luettin, "Audio-visual speech modeling for continuous speech recognition," IEEE Trans. on Multimedia, vol. 2, no. 3, pp. 141-151, 2000.
- (2000) IEEE Trans. on Multimedia , vol.2 , Issue.3 , pp. 141-151
- Dupont, S.¹ Luettin, J.²

10
- 0034842342
- Asynchronous stream modeling for large vocabulary audio-visual speech recognition
- J. Luettin, G. Potamianos, and C. Neti, "Asynchronous stream modeling for large vocabulary audio-visual speech recognition," in Proc. ICASSP, 2001.
- (2001) Proc. ICASSP
- Luettin, J.¹ Potamianos, G.² Neti, C.³

11
- 0033900150
- A bayesian predictive approach to robust speech recognition
- Q. Huo and C. Lee, "A bayesian predictive approach to robust speech recognition," IEEE TSAP, pp. 200-204, 2000.
- (2000) IEEE TSAP , pp. 200-204
- Huo, Q.¹ Lee, C.²

12
- 0036874999
- Dynamic bayesian networks for audio-visual speech recognition
- A.V. Nefian, L. Liang, X. Pi, X. Liu, and K. Murphy, "Dynamic bayesian networks for audio-visual speech recognition," EURASIP Journal on Applied Signal Processing, vol. 11, pp. 1-15, 2002.
- (2002) EURASIP Journal on Applied Signal Processing , vol.11 , pp. 1-15
- Nefian, A.V.¹ Liang, L.² Pi, X.³ Liu, X.⁴ Murphy, K.⁵

13
- 0035363218
- Active appearance models
- T.F. Cootes, G.J. Edwards, and Taylor C.J., "Active appearance models," IEEE PAMI, vol. 23, no. 6, pp. 681-685, 2001.
- (2001) IEEE PAMI , vol.23 , Issue.6 , pp. 681-685
- Cootes, T.F.¹ Edwards, G.J.² Taylor, C.J.³

14
- 0036472941
- Extraction of visual features for lipreading
- I. Matthews, T. F. Cootes, J. A. Bangham, S. Cox, and R. Harvey, "Extraction of visual features for lipreading," IEEE PAMI, vol. 24, no. 2, pp. 198-213, 2002.
- (2002) IEEE PAMI , vol.24 , Issue.2 , pp. 198-213
- Matthews, I.¹ Cootes, T.F.² Bangham, J.A.³ Cox, S.⁴ Harvey, R.⁵

15
- 0003474751
- Cambridge Univ. Press
- W. Press, S. Teukolsky, W. Vetterling, and B. Flannery, Numerical Recipes, Cambridge Univ. Press, 1992.
- (1992) Numerical Recipes
- Press, W.¹ Teukolsky, S.² Vetterling, W.³ Flannery, B.⁴

16
- 0003608314
- Springer
- A. Blake and M. Isard, Active contours : the application of techniques from graphics, vision, control theory and statistics to visual tracking of shapes in motion, Springer, 1998.
- (1998) Active contours : The application of techniques from graphics, vision, control theory and statistics to visual tracking of shapes in motion
- Blake, A.¹ Isard, M.²

17
- 0036299249
- CUAVE: A new audio-visual database for multimodal human-computer interface research
- E. K. Patterson, S. Gurbuz, Z. Tufekci, and J. N. Gowdy, "CUAVE: A new audio-visual database for multimodal human-computer interface research," in Proc. ICASSP, 2002.
- (2002) Proc. ICASSP
- Patterson, E.K.¹ Gurbuz, S.² Tufekci, Z.³ Gowdy, J.N.⁴

18
- 0027623210
- Assessment for automatic speech recognition: II. noisex-92: A database and an experiment to study the effect of additive noise on speech recognition systems
- A. Varga and H.J.M. Steeneken, "Assessment for automatic speech recognition: II. noisex-92: A database and an experiment to study the effect of additive noise on speech recognition systems," Speech Communication, vol. 12, no. 3, pp. 247-252, 1993.
- (1993) Speech Communication , vol.12 , Issue.3 , pp. 247-252
- Varga, A.¹ Steeneken, H.J.M.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.