메뉴 건너뛰기




Volumn 5, Issue , 2006, Pages 2458-2461

Adaptive multimodal fusion by uncertainty compensation

Author keywords

Active appearance models; Audiovisual speech recognition; Multimodal fusion; Product HMMs; Stream weights; Uncertainty compensation

Indexed keywords

DEEP NEURAL NETWORKS; PATTERN RECOGNITION; SPEECH ANALYSIS; UNCERTAINTY ANALYSIS;

EID: 44949227080     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (23)

References (18)
  • 1
    • 4544290191 scopus 로고    scopus 로고
    • Automatic recognition of audio-visual speech: Recent progress and challenges
    • G. Potamianos, C. Neti. G. Gravier, and A. Garg, "Automatic recognition of audio-visual speech: Recent progress and challenges," Proc. of the IEEE, vol. 91, no. 9, pp. 1306-1326, 2003.
    • (2003) Proc. of the IEEE , vol.91 , Issue.9 , pp. 1306-1326
    • Potamianos, G.1    Neti, C.2    Gravier, G.3    Garg, A.4
  • 2
    • 0034825241 scopus 로고    scopus 로고
    • Multi-stream adaptive evidence combination for noise robust ASR
    • A. Morris, A. Hagen, H. Glotin, and H. Bourlard, "Multi-stream adaptive evidence combination for noise robust ASR," Speech Communication, vol. 34, pp. 25-40, 2001.
    • (2001) Speech Communication , vol.34 , pp. 25-40
    • Morris, A.1    Hagen, A.2    Glotin, H.3    Bourlard, H.4
  • 5
    • 0027681974 scopus 로고
    • ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition
    • V. Digalakis, J.R. Rohlicek, and M. Ostendorf, "ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition," IEEE TSAP, pp. 431-442, 1993.
    • (1993) IEEE TSAP , pp. 431-442
    • Digalakis, V.1    Rohlicek, J.R.2    Ostendorf, M.3
  • 6
    • 0028420014 scopus 로고
    • Integrated models of signal and background with application to speaker identification in noise
    • R. C. Rose, E. M. Hofstetter, and D. A. Reynolds, "Integrated models of signal and background with application to speaker identification in noise," IEEE TSAP, vol. 2, no. 2, pp. 245-257, 1994.
    • (1994) IEEE TSAP , vol.2 , Issue.2 , pp. 245-257
    • Rose, R.C.1    Hofstetter, E.M.2    Reynolds, D.A.3
  • 7
    • 0036508276 scopus 로고    scopus 로고
    • Speaker verification in noise using a stochastic version of the weighted viterbi algorithm
    • N.B Yoma and M. Villar, "Speaker verification in noise using a stochastic version of the weighted viterbi algorithm," IEEE TSAP, vol. 10, no. 3, pp. 158-166, 2002.
    • (2002) IEEE TSAP , vol.10 , Issue.3 , pp. 158-166
    • Yoma, N.B.1    Villar, M.2
  • 8
    • 18744401086 scopus 로고    scopus 로고
    • Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion
    • L. Deng, J. Dropo, and A. Acero, "Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion," IEEE TSAP, vol. 13, no. 3, pp. 412-421, 2005.
    • (2005) IEEE TSAP , vol.13 , Issue.3 , pp. 412-421
    • Deng, L.1    Dropo, J.2    Acero, A.3
  • 9
    • 0034270644 scopus 로고    scopus 로고
    • Audio-visual speech modeling for continuous speech recognition
    • S. Dupont and J. Luettin, "Audio-visual speech modeling for continuous speech recognition," IEEE Trans. on Multimedia, vol. 2, no. 3, pp. 141-151, 2000.
    • (2000) IEEE Trans. on Multimedia , vol.2 , Issue.3 , pp. 141-151
    • Dupont, S.1    Luettin, J.2
  • 10
    • 0034842342 scopus 로고    scopus 로고
    • Asynchronous stream modeling for large vocabulary audio-visual speech recognition
    • J. Luettin, G. Potamianos, and C. Neti, "Asynchronous stream modeling for large vocabulary audio-visual speech recognition," in Proc. ICASSP, 2001.
    • (2001) Proc. ICASSP
    • Luettin, J.1    Potamianos, G.2    Neti, C.3
  • 11
    • 0033900150 scopus 로고    scopus 로고
    • A bayesian predictive approach to robust speech recognition
    • Q. Huo and C. Lee, "A bayesian predictive approach to robust speech recognition," IEEE TSAP, pp. 200-204, 2000.
    • (2000) IEEE TSAP , pp. 200-204
    • Huo, Q.1    Lee, C.2
  • 13
    • 0035363218 scopus 로고    scopus 로고
    • Active appearance models
    • T.F. Cootes, G.J. Edwards, and Taylor C.J., "Active appearance models," IEEE PAMI, vol. 23, no. 6, pp. 681-685, 2001.
    • (2001) IEEE PAMI , vol.23 , Issue.6 , pp. 681-685
    • Cootes, T.F.1    Edwards, G.J.2    Taylor, C.J.3
  • 14
    • 0036472941 scopus 로고    scopus 로고
    • Extraction of visual features for lipreading
    • I. Matthews, T. F. Cootes, J. A. Bangham, S. Cox, and R. Harvey, "Extraction of visual features for lipreading," IEEE PAMI, vol. 24, no. 2, pp. 198-213, 2002.
    • (2002) IEEE PAMI , vol.24 , Issue.2 , pp. 198-213
    • Matthews, I.1    Cootes, T.F.2    Bangham, J.A.3    Cox, S.4    Harvey, R.5
  • 17
    • 0036299249 scopus 로고    scopus 로고
    • CUAVE: A new audio-visual database for multimodal human-computer interface research
    • E. K. Patterson, S. Gurbuz, Z. Tufekci, and J. N. Gowdy, "CUAVE: A new audio-visual database for multimodal human-computer interface research," in Proc. ICASSP, 2002.
    • (2002) Proc. ICASSP
    • Patterson, E.K.1    Gurbuz, S.2    Tufekci, Z.3    Gowdy, J.N.4
  • 18
    • 0027623210 scopus 로고
    • Assessment for automatic speech recognition: II. noisex-92: A database and an experiment to study the effect of additive noise on speech recognition systems
    • A. Varga and H.J.M. Steeneken, "Assessment for automatic speech recognition: II. noisex-92: A database and an experiment to study the effect of additive noise on speech recognition systems," Speech Communication, vol. 12, no. 3, pp. 247-252, 1993.
    • (1993) Speech Communication , vol.12 , Issue.3 , pp. 247-252
    • Varga, A.1    Steeneken, H.J.M.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.