Volume , Issue , 2011, Pages 2109-2113

Viseme definitions comparison for visual-only speech recognition

Author keywords

[No Author keywords available]

Indexed keywords

AUDIO AND VISUAL CUES; AUDIO VISUAL SPEECH RECOGNITION; RECOGNITION RATES; SPEECH RECOGNITION SYSTEMS; VISEMES; VISUAL FEATURE

EID: 84862215808     PISSN: 22195491     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 37

References (27)
  • 1
    • 4544290191
    • Recent advances in the automatic recognition of audio-visual speech
    • G. Potamianos et al., "Recent advances in the automatic recognition of audio-visual speech," Proceedings of the IEEE, vol. 91, no. 9, 2003.
    • (2003) Proceedings of the IEEE , vol.91 , Issue.9
    • Potamianos, G.1
  • 2
    • 78649537390
    • Visual speech recognition using motion features and hidden Markov models
    • S.-V. B. Heidelberg, Ed.
    • W. C. Yau et al., "Visual Speech Recognition Using Motion Features and Hidden Markov Models," in CAIP 2007, S.-V. B. Heidelberg, Ed., 2007.
    • (2007) CAIP 2007
    • Yau, W.C.1
  • 8
    • 85032752352
    • Audiovisual speech processing
    • T. Chen, "Audiovisual speech processing," Signal Processing Magazine, IEEE, vol. 18, no. 1, pp. 9-21, 2001.
    • (2001) Signal Processing Magazine, IEEE , vol.18 , Issue.1 , pp. 9-21
    • Chen, T.1
  • 10
    • 33750368310
    • An audio-visual corpus for speech perception and automatic speech recognition
    • M. Cooke et al., "An audio-visual corpus for speech perception and automatic speech recognition," The Journal of the Acoustical Society of America, vol. 120, no. 5, pp. 2421-2424, 2006.
    • (2006) The Journal of the Acoustical Society of America , vol.120 , Issue.5 , pp. 2421-2424
    • Cooke, M.1
  • 11
    • 70149086972
    • The realistic multi-modal VALID database and visual speaker identification comparison experiments
    • New York
    • N. Fox, B. O'Mullane, and R. Reilly, "The Realistic Multi-Modal VALID database and Visual Speaker Identification Comparison Experiments," in AVBPA, New York, 2005.
    • (2005) AVBPA
    • Fox, N.1    O'Mullane, B.2    Reilly, R.3
  • 14
    • 14944353581
    • A segment-based audio-visual speech recognizer: Data collection, development, and initial experiments
    • State College, PA, USA: ACM
    • T. J. Hazen et al., "A segment-based audio-visual speech recognizer: data collection, development, and initial experiments," in Proceedings of the 6th international conference on Multimodal interfaces. State College, PA, USA: ACM, 2004, pp. 235-242.
    • (2004) Proceedings of the 6th International Conference on Multimodal Interfaces , pp. 235-242
    • Hazen, T.J.1
  • 17
    • 47949087133
    • Comparison of phoneme and viseme based acoustic units for speech driven realistic lip animation
    • E. Bozkurt, E. Çigdem Eroglu, E. Erzin, T. Erdem, and M. Ozkan, "Comparison of phoneme and viseme based acoustic units for speech driven realistic lip animation," in 3DTV Conference, 2007, 2007, pp. 1-4.
    • (2007) 3DTV Conference, 2007 , pp. 1-4
    • Bozkurt, E.1    Çigdem Eroglu, E.2    Erzin, E.3    Erdem, T.4    Ozkan, M.5
  • 21
    • 85009254391
    • MikeTalk: A talking facial display based on morphing visemes
    • T. Ezzat and T. Poggio, "MikeTalk: a talking facial display based on morphing visemes," in Computer Animation 98. Proceedings, 1998, pp. 96-102.
    • (1998) Computer Animation 98. Proceedings , pp. 96-102
    • Ezzat, T.1    Poggio, T.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.