2001, Pages 409-412

State synchronous modeling of audio-visual information for bi-modal speech recognition

Author keywords

[No Author keywords available]

Indexed keywords

AUDIO SYSTEMS; SPEECH

EID: 84962816226     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ASRU.2001.1034671     Document Type: Conference Paper
Times cited: 6

References (4)
  • 1
    • 33646906672
    • Improved bimodal speech recognition using tied-mixture HMMs and 5000 word Audio-Visual Synchronous database
    • Satoshi Nakamura, Ron Nagai and Kiyohiro Shikano, "Improved bimodal speech recognition using tied-mixture HMMs and 5000 word Audio-Visual Synchronous database", Proc. Eurospeech, Rhodes, pp. 1623-1626, 1997.
    • (1997) Proc. Eurospeech , pp. 1623-1626
    • Nakamura, S.1    Nagai, R.2    Shikano, K.3
  • 2
    • 85009154155
    • Stream weight optimization of speech and lip image sequence for Audio-Visual speech recognition
    • Satoshi Nakamura, Hidetoshi Ito and Kiyohiro Shikano, "Stream weight optimization of speech and lip image sequence for Audio-Visual speech recognition", Proc. ICSLP2000, Vol. 3, pp. 20-23, 2000.
    • (2000) Proc. ICSLP2000 , vol.3 , pp. 20-23
    • Nakamura, S.1    Ito, H.2    Shikano, K.3
  • 4
    • 0029747053
    • Integrating audio and visual information to provide highly robust speech recognition
    • M.J. Tomlinson, M.J. Russell and N.M. Brooke, "Integrating audio and visual information to provide highly robust speech recognition", Proc. ICASSP-96, Vol. 2, pp. 821-824, May 1996.
    • (1996) Proc ICASSP-96 , vol.2 , pp. 821-824
    • Tomlinson, M.J.1    Russell, M.J.2    Brooke, N.M.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.