메뉴 건너뛰기




Volumn 2, Issue , 2012, Pages 322-329

Phoneme-to-viseme mapping for visual speech recognition

Author keywords

AVSR; DCT; Optical flow; PCA; Viseme

Indexed keywords

AUDIO VISUAL SPEECH RECOGNITION; AVSR; CONTINUOUS SPEECH; DATA-DRIVEN METHODS; DCT; LINGUISTIC METHODS; PCA; VISEME; VISEMES; VISUAL FEATURE; VISUAL SPEECH RECOGNITION;

EID: 84862178164     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (57)

References (19)
  • 1
    • 84862219678 scopus 로고    scopus 로고
    • Pyramidal implementation of lucas kanade feature tracker
    • Bouguet
    • Bouguet (2002). Pyramidal Implementation of Lucas Kanade Feature Tracker. Description of the algorithm.
    • (2002) Description of the Algorithm
  • 2
    • 47949087133 scopus 로고    scopus 로고
    • Comparison of phoneme and viseme based acoustic units for speech driven realistic lip animation
    • Bozkurt, Eroglu, Q., Erzin, Erdem, and Ozkan (2007). Comparison of phoneme and viseme based acoustic units for speech driven realistic lip animation. In 3DTV Conference, 2007, pages 1-4.
    • (2007) 3DTV Conference, 2007 , pp. 1-4
    • Bozkurt, E.Q.1    Erzin, E.2    Ozkan3
  • 4
    • 78649613221 scopus 로고    scopus 로고
    • Nostril detection for robust mouth tracking
    • Cork
    • Cappelletta and Harte (2010). Nostril detection for robust mouth tracking. In Irish Signals and Systems Conference, pages 239 - 244, Cork.
    • (2010) Irish Signals and Systems Conference , pp. 239-244
    • Cappelletta1    Harte2
  • 6
    • 85009254391 scopus 로고    scopus 로고
    • Miketalk: A talking facial display based on morphing visemes
    • Ezzat and Poggio (1998). Miketalk: a talking facial display based on morphing visemes. In Computer Animation 98. Proceedings, pages 96-102.
    • (1998) Computer Animation 98. Proceedings , pp. 96-102
    • Ezzat1    Poggio2
  • 8
    • 34047263009 scopus 로고    scopus 로고
    • Visual model structures and synchrony constraints for audio-visual speech recognition
    • Hazen (2006). Visual model structures and synchrony constraints for audio-visual speech recognition. Audio, Speech, and Language Processing, IEEE Transactions on, 14(3):1082-1089.
    • (2006) Audio, Speech, and Language Processing, IEEE Transactions on , vol.14 , Issue.3 , pp. 1082-1089
    • Hazen1
  • 9
    • 14944353581 scopus 로고    scopus 로고
    • A segment-based audio-visual speech recognizer: Data collection, development, and initial experiments
    • State College, PA, USA. ACM
    • Hazen, Saenko, La, and Glass (2004). A segment-based audio-visual speech recognizer: data collection, development, and initial experiments. In Proceedings of the 6th international conference on Multimodal interfaces, pages 235-242, State College, PA, USA. ACM.
    • (2004) Proceedings of the 6th International Conference on Multimodal Interfaces , pp. 235-242
    • Hazen, S.1    La, G.2
  • 10
    • 85009284526 scopus 로고    scopus 로고
    • DCT-Based video features for audio-visual speech recognition
    • Denver, CO, USA
    • Heckmann, Kroschel, Savariaux, and Berthommier (2002). DCT-Based Video Features for Audio-Visual Speech Recognition. In International Conference on Spoken Language Processing, volume 1, pages 1925-1928, Denver, CO, USA.
    • (2002) International Conference on Spoken Language Processing , vol.1 , pp. 1925-1928
    • Heckmann, K.1    Savariaux, B.2
  • 14
    • 0019647180 scopus 로고
    • An iterative image registration technique with an application to stereo vision
    • Lucas and Kanade (1981). An iterative image registration technique with an application to stereo vision. In Proceedings of Imaging Understanding Workshop.
    • (1981) Proceedings of Imaging Understanding Workshop
    • Lucas, K.1
  • 17
    • 4544290191 scopus 로고    scopus 로고
    • Recent advances in the automatic recognition of audio-visual speech
    • Senior
    • Potamianos, Neti, Gravier, Garg, and Senior (2003). Recent advances in the automatic recognition of audio-visual speech. Proceeding of the IEEE, 91(9):1306-1326.
    • (2003) Proceeding of the IEEE , vol.91 , Issue.9 , pp. 1306-1326
    • Neti, P.1    Garg, G.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.