SCOPUS Information Retrieval Platform

Volume 3, 2005, Pages 501-504

Comparison of MPEG-4 facial animation parameter groups with respect to audio-visual speech recognition performance

Author keywords

[No Author keywords available]

Indexed keywords

AUDIO VISUAL AUTOMATIC SPEECH RECOGNITION (AV-ASR); FACIAL ANIMATION PARAMETERS (FAP); WORD ERROR RATE (WER);

ANIMATION; DATABASE SYSTEMS; FEATURE EXTRACTION; GESTURE RECOGNITION; INFORMATION ANALYSIS; INTEGRATION; MARKOV PROCESSES; SPEECH RECOGNITION;

EID: 33749247429
PISSN: 15224880
EISSN: None
Source Type: Conference Proceeding
DOI: 10.1109/ICIP.2005.1530438
Document Type: Conference Paper

Times cited: 5

References (17)

2
- 0003544881
- D. G. Stork and M. E. Hennecke, editors, Springer-Verlag New York Inc.
- D. G. Stork and M. E. Hennecke, editors, Speechreading by Man and Machine, Springer-Verlag New York Inc., 1996.
- (1996) Speechreading by Man and Machine

5
- 0034270644
- Audio-visual speech modeling for continuous speech recognition
- S. Dupont, J. Luettin, "Audio-visual speech modeling for continuous speech recognition," IEEE Transactions on Multimedia, vol. 2(3), pp. 141-151, 2000.
- (2000) IEEE Transactions on Multimedia , vol.2 , Issue.3 , pp. 141-151
- Dupont, S.¹ Luettin, J.²

6
- 0036874915
- Audio-visual speech recognition using MPEG-4 compliant visual features
- P. S. Aleksic, J. J. Williams, Z. Wu, and A. K. Katsaggelos, "Audio-visual speech recognition using MPEG-4 compliant visual features," EURASIP Journal on Applied Signal Processing, pp. 1213-1227, 2002.
- (2002) EURASIP Journal on Applied Signal Processing , pp. 1213-1227
- Aleksic, P.S.¹ Williams, J.J.² Wu, Z.³ Katsaggelos, A.K.⁴

9
- 0032314380
- An image transform approach for HMM based automatic lipreading
- G. Potamianos, H.P. Graf, and E. Cosatto, "An image transform approach for HMM based automatic lipreading," Proc. of the Int. Conf. on Image Proc., vol. III, pp. 173-177, 1998.
- (1998) Proc. of the Int. Conf. on Image Proc. , vol.3 , pp. 173-177
- Potamianos, G.¹ Graf, H.P.² Cosatto, E.³

11
- 33749235580
- Text for ISO/IEC FDIS 14496-2 Visual, ISO/IEC JTC1/SC29/WG11 N2502, Nov. 1998

12
- 0344044794
- Gallaudet University, Washington, D.C.
- L. E. Bernstein, Lipreading Corpus V-VI: Disc 3, Gallaudet University, Washington, D.C., 1991.
- (1991) Lipreading Corpus V-VI: Disc 3
- Bernstein, L.E.¹

16
- 0003822743
- Entropic Ltd., Cambridge
- S. Young, D. Kershaw, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, "The HTK Book," Entropic Ltd., Cambridge, 2002.
- (2002) The HTK Book
- Young, S.¹ Kershaw, D.² Odell, J.³ Ollason, D.⁴ Valtchev, V.⁵ Woodland, P.⁶

17
- 85009135251
- AVICAR: An audiovisual speech corpus in a car environment
- B. Lee, M. Hasegawa-Johnson, C. Goudeseune, S. Kamdar, S. Borys, M. Liu, and T. Huang, "AVICAR: An Audiovisual Speech Corpus in a Car Environment," ICSLP 2004.
- (2004) ICSLP
- Lee, B.¹ Hasegawa-Johnson, M.² Goudeseune, C.³ Kamdar, S.⁴ Borys, S.⁵ Liu, M.⁶ Huang, T.⁷

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.