SCOPUS information search platform

Volume , Issue , 2011, Pages 2109-2113

Viseme definitions comparison for visual-only speech recognition

Author keywords

[No Author keywords available]

Indexed keywords

AUDIO AND VISUAL CUES; AUDIO VISUAL SPEECH RECOGNITION; RECOGNITION RATES; SPEECH RECOGNITION SYSTEMS; VISEMES; VISUAL FEATURE;

SIGNAL PROCESSING;

SPEECH RECOGNITION;

EID: 84862215808 PISSN: 22195491 EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (37)

References (27)

1
- 4544290191
- Recent advances in the automatic recognition of audio-visual speech
- G. Potamianos et al., "Recent advances in the automatic recognition of audio-visual speech," Proceedings of the IEEE, vol. 91, no. 9, 2003.
- (2003) Proceedings of the IEEE , vol.91 , Issue.9
- Potamianos, G.¹

2
- 78649537390
- Visual speech recognition using motion features and hidden Markov models
- S.-V. B. Heidelberg, Ed.
- W. C. Yau et al., "Visual Speech Recognition Using Motion Features and Hidden Markov Models," in CAIP 2007, S.-V. B. Heidelberg, Ed., 2007.
- (2007) CAIP 2007
- Yau, W.C.¹

3
- 74849097139
- Automatic visual feature extraction for Mandarin audio-visual speech recognition
- P. Tsang-Long et al., "Automatic visual feature extraction for Mandarin audio-visual speech recognition," in Systems, Man and Cybernetics, 2009. SMC 2009. IEEE International Conference on, 2009, pp. 2936-2940.
- (2009) Systems, Man and Cybernetics, 2009. SMC 2009 IEEE International Conference on , pp. 2936-2940
- Tsang-Long, P.¹

4
- 67249132517
- Lip feature extraction and reduction for hmm-based visual speech recognition systems
- S. Alizadeh et al., "Lip feature extraction and reduction for hmm-based visual speech recognition systems," in Signal Processing, 2008. ICSP 2008. 9th International Conference on, 2008, pp. 561-564.
- (2008) Signal Processing, 2008. ICSP 2008. 9th International Conference on , pp. 561-564
- Alizadeh, S.¹

5
- 85097982114
- Comparing visual features for lipreading
- Y. Lan et al., "Comparing visual features for lipreading," in International Conference on Auditory-Visual Speech Processing, 2009, pp. 102-106.
- (2009) International Conference on Auditory-Visual Speech Processing , pp. 102-106
- Lan, Y.¹

6
- 0002810240
- J. Luettin et al., "Visual speech recognition using active shape models and hidden markov models," 1996.
- (1996) Visual Speech Recognition Using Active Shape Models and Hidden Markov Models
- Luettin, J.¹

7
- 0036299249
- CUAVE: A new audio-visual database for multimodal human-computer interface research
- E. Patterson et al., "CUAVE: a new audio-visual database for multimodal human-computer interface research," in Acoustics, Speech, and Signal Processing, 2002. Proceedings. (ICASSP '02). IEEE International Conference on, vol. 2, 2002, pp. 2017-2020.
- (2002) Acoustics, Speech, and Signal Processing, 2002. Proceedings. (ICASSP '02). IEEE International Conference on , vol.2 , pp. 2017-2020
- Patterson, E.¹

8
- 85032752352
- Audiovisual speech processing
- T. Chen, "Audiovisual speech processing," Signal Processing Magazine, IEEE, vol. 18, no. 1, pp. 9-21, 2001.
- (2001) Signal Processing Magazine, IEEE , vol.18 , Issue.1 , pp. 9-21
- Chen, T.¹

9
- 0001935972
- XM2VTSDB: The extended M2VTS database
- K. Messer et al., "XM2VTSDB: The Extended M2VTS Database," in Second International Conference on Audio and Video-based Biometric Person Authentication, 1999.
- (1999) Second International Conference on Audio and Video-based Biometric Person Authentication
- Messer, K.¹

10
- 33750368310
- An audio-visual corpus for speech perception and automatic speech recognition
- M. Cooke et al., "An audio-visual corpus for speech perception and automatic speech recognition," The Journal of the Acoustical Society of America, vol. 120, no. 5, pp. 2421-2424, 2006.
- (2006) The Journal of the Acoustical Society of America , vol.120 , Issue.5 , pp. 2421-2424
- Cooke, M.¹

11
- 70149086972
- The realistic multi-modal VALID database and visual speaker identification comparison experiments
- New York
- N. Fox, B. O'Mullane, and R. Reilly, "The Realistic Multi-Modal VALID database and Visual Speaker Identification Comparison Experiments," in AVBPA, New York, 2005.
- (2005) AVBPA
- Fox, N.¹ O'mullane, B.² Reilly, R.³

12
- 56749108674
- VDM-Verlag
- C. Sanderson, Biometric Person Recognition: Face, Speech and Fusion. VDM-Verlag, 2008.
- (2008) Biometric Person Recognition: Face, Speech and Fusion
- Sanderson, C.¹

13
- 84908265391
- A comparison of model and transform-based visual features for audio-visual LVCSR
- I. Matthews et al., "A comparison of model and transform-based visual features for audio-visual LVCSR," in Multimedia and Expo, 2001. ICME 2001. IEEE International Conference on, 2001, pp. 825-828.
- (2001) Multimedia and Expo, 2001. ICME 2001 IEEE International Conference on , pp. 825-828
- Matthews, I.¹

14
- 14944353581
- A segment-based audio-visual speech recognizer: Data collection, development, and initial experiments
- State College, PA, USA: ACM
- T. J. Hazen et al., "A segment-based audio-visual speech recognizer: data collection, development, and initial experiments," in Proceedings of the 6th international conference on Multimodal interfaces. State College, PA, USA: ACM, 2004, pp. 235-242.
- (2004) Proceedings of the 6th International Conference on Multimodal Interfaces , pp. 235-242
- Hazen, T.J.¹

15
- 84862162638
- Master Thesis, Massachusetts Institute of Technology
- K. Saenko, "Articulatory features for robust visual speech recognition," Master Thesis, Massachusetts Institute of Technology, 2004.
- (2004) Articulatory Features for Robust Visual Speech Recognition
- Saenko, K.¹

16
- 0004052871
- Tech. Rep., Oct. 12
- C. Neti, G. Potamianos, J. Luettin, I. Matthews, H. Glotin, D. Vergyri, S. Sison, A. Mashari, and J. Zhou, "Audio-visual speech recognition," Tech. Rep., Oct. 12 2000.
- (2000) Audio-visual Speech Recognition
- Neti, C.¹ Potamianos, G.² Luettin, J.³ Matthews, I.⁴ Glotin, H.⁵ Vergyri, D.⁶ Sison, S.⁷ Mashari, A.⁸ Zhou, J.⁹

17
- 47949087133
- Comparison of phoneme and viseme based acoustic units for speech driven realistic lip animation
- E. Bozkurt, E. Qigdem Eroglu, E. Erzin, T. Erdem, and M. Ozkan, "Comparison of phoneme and viseme based acoustic units for speech driven realistic lip animation," in 3DTV Conference, 2007, 2007, pp. 1-4.
- (2007) 3DTV Conference, 2007 , pp. 1-4
- Bozkurt, E.¹ Qigdem Eroglu, E.² Erzin, E.³ Erdem, T.⁴ Ozkan, M.⁵

18
- 0344212675
- New York, NY, USA: John Wiley & Sons, Inc.
- I. S. Pandzic and R. Forchheimer, MPEG-4 Facial Animation: The Standard, Implementation and Applications. New York, NY, USA: John Wiley & Sons, Inc., 2003.
- (2003) MPEG-4 Facial Animation: The Standard, Implementation and Applications
- Pandzic, I.S.¹ Forchheimer, R.²

19
- 84875584220
- vol.1, oct-2 nov
- A. Goldschen, O. Garcia, and E. Petajan, "Continuous optical automatic speech recognition by lipreading," vol. 1, pp. 572-577 vol.1, oct-2 nov 1994.
- (1994) Continuous Optical Automatic Speech Recognition by Lipreading , vol.1 , pp. 572-577
- Goldschen, A.¹ Garcia, O.² Petajan, E.³

20
- 0004266328
- Charles C Thomas Pub Ltd
- J. Jeffers and M. Barley, Speechreading (Lipreading). Charles C Thomas Pub Ltd, 1971.
- (1971) Speechreading (Lipreading)
- Jeffers, J.¹ Barley, M.²

21
- 85009254391
- Miketalk: A talking facial display based on morphing visemes
- T. Ezzat and T. Poggio, "Miketalk: a talking facial display based on morphing visemes," in Computer Animation 98. Proceedings, 1998, pp. 96-102.
- (1998) Computer Animation 98. Proceedings , pp. 96-102
- Ezzat, T.¹ Poggio, T.²

22
- 78649613221
- Nostril detection for robust mouth tracking
- Cork
- L. Cappelletta and N. Harte, "Nostril detection for robust mouth tracking," in Irish Signals and Systems Conference, Cork, 2010, pp. 239-244.
- (2010) Irish Signals and Systems Conference , pp. 239-244
- Cappelletta, L.¹ Harte, N.²

23
- 84870243209
- Audiovisual and visual-only speech and speaker recognition: Issues about theory, system design and implementation
- D. Shiell, L. Terry, P. Aleksic, and A. K. Katsaggelos, "Audiovisual and visual-only speech and speaker recognition: Issues about theory, system design and implementation," in Visual Speech Recognition: Lip Segmentation and Mapping, 2008, pp. 1-38.
- (2008) Visual Speech Recognition: Lip Segmentation and Mapping , pp. 1-38
- Shiell, D.¹ Terry, L.² Aleksic, P.³ Katsaggelos, A.K.⁴

24
- 85013597845
- Eigenlips for robust speech recognition
- vol.2
- C. Bregler and Y. Konig, "Eigenlips for robust speech recognition," in Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on, vol. ii, 1994, pp. II/669-II/672 vol.2.
- (1994) Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on , vol.2
- Bregler, C.¹ Konig, Y.²

25
- 84867731827
- Determining optical flow
- B. K. P. Horn and B. G. Schunck, "Determining optical flow," Artificial Intelligence, 1980.
- (1980) Artificial Intelligence
- Horn, B.K.P.¹ Schunck, B.G.²

26
- 2442456044
- J. Bouguet, "Pyramidal Implementation of the Lucas Kanade Feature Tracker: Description of the algorithm," 2002.
- (2002) Pyramidal Implementation of the Lucas Kanade Feature Tracker: Description of the Algorithm
- Bouguet, J.¹

27
- 0019647180
- An iterative image registration technique with an application to stereo vision
- B. D. Lucas and T. Kanade, "An iterative image registration technique with an application to stereo vision," in Proceedings of Image Understanding Workshop, 1981.
- (1981) Proceedings of Image Understanding Workshop
- Lucas, B.D.¹ Kanade, T.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.