메뉴 건너뛰기




Volumn 1, Issue , 2004, Pages

A stream-weight optimization method for audio-visual speech recognition using multi-stream HMMs

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTICS; MAN MACHINE SYSTEMS; OPTIMIZATION; REGRESSION ANALYSIS; VECTORS; VIDEO SIGNAL PROCESSING;

EID: 4544224863     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (22)

References (6)
  • 1
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C.J. Leggetter and P.C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Computer Speech and Language, pp.171-185, 1995.
    • (1995) Computer Speech and Language , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 2
    • 85133531952 scopus 로고    scopus 로고
    • Speaker independent audio-visual database for bimodal ASR
    • G. Potamianos and E. Cosatto and H.P. Gref and D.B. Roe, "Speaker independent audio-visual database for bimodal ASR," Proc. AVSP'97, pp.65-68, 1997.
    • (1997) Proc. AVSP'97 , pp. 65-68
    • Potamianos, G.1    Cosatto, E.2    Gref, H.P.3    Roe, D.B.4
  • 3
    • 85009091822 scopus 로고    scopus 로고
    • Audio-visual speech recognition using MCE-based HMMs and model-dependent stream weights
    • C. Miyajima and K. Tokuda and T. Kitamura, "Audio-visual speech recognition using MCE-based HMMs and model-dependent stream weights," Proc. ICSLP2000, vol.2, pp.1023-1026, 2000.
    • (2000) Proc. ICSLP2000 , vol.2 , pp. 1023-1026
    • Miyajima, C.1    Tokuda, K.2    Kitamura, T.3
  • 4
    • 85009154155 scopus 로고    scopus 로고
    • Stream weight optimization of speech and lip image sequence for audio-visual speech recognition
    • S. Nakamura and H. Ito and K. Shikano, "Stream weight optimization of speech and lip image sequence for audio-visual speech recognition," Proc. ICSLP2000, vol.3, pp.20-24, 2000.
    • (2000) Proc. ICSLP2000 , vol.3 , pp. 20-24
    • Nakamura, S.1    Ito, H.2    Shikano, K.3
  • 5
    • 4544283723 scopus 로고    scopus 로고
    • A robust multi-modal speech recognition method using optical-flow analysis
    • Closter Irsee, Germany
    • S. Tamura, K. Iwano and S. Furui, "A robust multi-modal speech recognition method using optical-flow analysis," Proc. IDS02, Closter Irsee, Germany, pp.2-4, 2002.
    • (2002) Proc. IDS02 , pp. 2-4
    • Tamura, S.1    Iwano, K.2    Furui, S.3
  • 6
    • 85133593587 scopus 로고    scopus 로고
    • Audio-visual speech recognition using lip movement extracted from side-face images
    • St Jorioz, France
    • T. Yoshinaga, S. Tamura, K. Iwano and S. Furui, "Audio-visual speech recognition using lip movement extracted from side-face images," Proc. AVSP2003, St Jorioz, France, pp.117-120, 2003.
    • (2003) Proc. AVSP2003 , pp. 117-120
    • Yoshinaga, T.1    Tamura, S.2    Iwano, K.3    Furui, S.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.