SCOPUS Information Retrieval Platform

Volume 10, Issue 6, 2008, Pages 969-981

Humanoid audio-visual avatar with emotive text-to-speech synthesis

Authors (5): Tang, Hao (a); Fu, Yun (a); Tu, Jilin (a); Hasegawa-Johnson, Mark (a); Huang, Thomas S. (a)

a UNIVERSITY OF ILLINOIS AT URBANA CHAMPAIGN (United States)

Author keywords

3-D face modeling and animation; Audio-visual avatar; Emotive speech synthesis; Human-computer interaction; Multimodal system; TTS

Indexed keywords

ANIMATION; FEATURE EXTRACTION; FLOW INTERACTIONS; HUMAN COMPUTER INTERACTION; KNOWLEDGE MANAGEMENT; SPEECH; SPEECH SYNTHESIS; THREE DIMENSIONAL; THREE DIMENSIONAL COMPUTER GRAPHICS; VIRTUAL REALITY;

3-D FACE MODELING AND ANIMATION; AUDIO-VISUAL AVATAR; EMOTIVE SPEECH SYNTHESIS; MULTIMODAL SYSTEM; TTS;

SPEECH COMMUNICATION;

EID: 54949115779 PISSN: 15209210 EISSN: None Source Type: Journal
DOI: 10.1109/TMM.2008.2001355 Document Type: Article

Times cited : (30)

References (36)

1
- 34249049489
- Human-centered computing: A multimedia perspective
- A. Jaimes, N. Sebe, and D. Gatica-Perez, "Human-centered computing: A multimedia perspective," in ACM Conf. on Multimedia, 2006, pp. 855-864.
- (2006) ACM Conf. on Multimedia , pp. 855-864
- Jaimes, A.¹ Sebe, N.² Gatica-Perez, D.³

2
- 50849136882
- EAVA: A 3D emotive audio-visual avatar
- Jan, Copper Mountain, CO
- H. Tang, Y. Fu, J. Tu, T. S. Huang, and M. Hasegawa-Johnson, "EAVA: A 3D emotive audio-visual avatar," in 2008 IEEE Workshop on Applications of Computer Vision (WACV'08), Jan. 2008, Copper Mountain, CO.
- (2008) 2008 IEEE Workshop on Applications of Computer Vision (WACV'08)
- Tang, H.¹ Fu, Y.² Tu, J.³ Huang, T.S.⁴ Hasegawa-Johnson, M.⁵

3
- 33746894562
- M-face: An appearance-based photorealistic model for multiple facial attributes rendering
- Jul
- Y. Fu and N. Zheng, "M-face: An appearance-based photorealistic model for multiple facial attributes rendering," IEEE Trans. Circuits Syst. Video Technol., vol. 16, no. 7, pp. 830-842, Jul. 2006.
- (2006) IEEE Trans. Circuits Syst. Video Technol , vol.16 , Issue.7 , pp. 830-842
- Fu, Y.¹ Zheng, N.²

4
- 54949109437
- Real-time multimodal human-avatar interaction
- to appear
- Y. Fu, R. Li, T. S. Huang, and M. Danielsen, "Real-time multimodal human-avatar interaction," IEEE Trans. Circuits Syst. Video Technol. 2008, to appear.
- (2008) IEEE Trans. Circuits Syst. Video Technol
- Fu, Y.¹ Li, R.² Huang, T.S.³ Danielsen, M.⁴

5
- 46449119257
- Real-time humanoid avatar for multimodal human-machine interaction
- Y. Fu, R. Li, T. S. Huang, and M. Danielsen, "Real-time humanoid avatar for multimodal human-machine interaction," in IEEE Conf. ICME'07, 2007, pp. 991-994.
- (2007) IEEE Conf. ICME'07 , pp. 991-994
- Fu, Y.¹ Li, R.² Huang, T.S.³ Danielsen, M.⁴

6
- 0036650837
- Real-time speech-driven face animation with expressions using neural networks
- P. Hong, Z. Wen, and T. S. Huang, "Real-time speech-driven face animation with expressions using neural networks," IEEE Trans. Neural Netw., vol. 13, no. 4, pp. 916-927, 2002.
- (2002) IEEE Trans. Neural Netw , vol.13 , Issue.4 , pp. 916-927
- Hong, P.¹ Wen, Z.² Huang, T.S.³

7
- 0001260696
- iFace: A 3D synthetic talking face
- P. Hong, Z. Wen, and T. S. Huang, "iFace: A 3D synthetic talking face," Int. J. Image Graph., vol. 1, no. 1, pp. 19-26, 2001.
- (2001) Int. J. Image Graph , vol.1 , Issue.1 , pp. 19-26
- Hong, P.¹ Wen, Z.² Huang, T.S.³

8
- 46449105244
- 1st ed. Berlin, Germany: Springer
- Z. Wen and T. S. Huang, 3D Face Processing: Modeling, Analysis and Synthesis, 1st ed. Berlin, Germany: Springer, 2004.
- (2004) 3D Face Processing: Modeling, Analysis and Synthesis
- Wen, Z.¹ Huang, T.S.²

9
- 54949145792
- A virtual head driven by music expressivity
- M. Mancini, R. Bresin, and C. Pelachaud, "A virtual head driven by music expressivity," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 6, pp. 1833-1841, 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.6 , pp. 1833-1841
- Mancini, M.¹ Bresin, R.² Pelachaud, C.³

10
- 85028639150
- A morphable model for the synthesis of 3D faces
- V. Blanz and T. Vetter, "A morphable model for the synthesis of 3D faces," Proc. SIGGRAPH'99, pp. 187-194, 1999.
- (1999) Proc. SIGGRAPH'99 , pp. 187-194
- Blanz, V.¹ Vetter, T.²

11
- 85018094829
- Computer generated animation of faces
- F. I. Parke, "Computer generated animation of faces," in Proc. ACM Nat. Conf., 1972, pp. 451-457.
- (1972) Proc. ACM Nat. Conf , pp. 451-457
- Parke, F.I.¹

12
- 0032643867
- PingPongPlus: Design of an athletic-tangible interface for computer-supported cooperative play
- H. Ishii, C. Wisneski, J. Orbanes, B. Chun, and J. Paradiso, "PingPongPlus: Design of an athletic-tangible interface for computer-supported cooperative play," Proc. ACM SIGCHI'99, pp. 394-401, 1999.
- (1999) Proc. ACM SIGCHI , pp. 394-401
- Ishii, H.¹ Wisneski, C.² Orbanes, J.³ Chun, B.⁴ Paradiso, J.⁵

13
- 0344212675
- I. Pandzic and R. Forchheimer, Eds, Chichester, U.K, Wiley
- I. Pandzic and R. Forchheimer, Eds., MPEG-4 Facial Animation, Chichester, U.K.: Wiley, 2002.
- (2002) MPEG-4 Facial Animation

14
- 54949085453
- "DAZ3D," [Online]. Available: http://www.daz3d.com/
- DAZ3D

15
- 2942596586
- Constrained optimization for audio-to-visual conversion
- Jun
- K.-H. Choi and J.-N. Hwang, "Constrained optimization for audio-to-visual conversion," IEEE Trans. Signal Process., vol. 52, no. 6, pp. 1783-1790, Jun. 2004.
- (2004) IEEE Trans. Signal Process , vol.52 , Issue.6 , pp. 1783-1790
- Choi, K.-H.¹ Hwang, J.-N.²

16
- 31344439475
- Accurate visible speech synthesis based on concatenating variable length motion capture data
- J. Ma, R. Cole, B. Pellom, W. Ward, and B. Wise, "Accurate visible speech synthesis based on concatenating variable length motion capture data," IEEE Trans. Vis. Comput. Graph., vol. 12, no. 2, pp. 266-276, 2006.
- (2006) IEEE Trans. Vis. Comput. Graph , vol.12 , Issue.2 , pp. 266-276
- Ma, J.¹ Cole, R.² Pellom, B.³ Ward, W.⁴ Wise, B.⁵

17
- 0001185920
- Communication without words
- A. Mehrabian, "Communication without words," Psychol. Today, vol. 2, pp. 53-56, 1968.
- (1968) Psychol. Today , vol.2 , pp. 53-56
- Mehrabian, A.¹

18
- 0003834176
- Dordrecht, The Netherlands: Kluwer
- T. Dutoit, An Introduction to Text-to-Speech Synthesis. Dordrecht, The Netherlands: Kluwer, 1997.
- (1997) An Introduction to Text-to-Speech Synthesis
- Dutoit, T.¹

19
- 9444257562
- Speech and Emotion Research: An Overview of Research Frameworks and a Dimensional Approach to Emotional Speech Synthesis,
- Ph.D. Thesis, Res. Rep., Institute of Phonetics, Saarland Univ., Saarland, Germany, of Phonus
- M. Schröder, "Speech and Emotion Research: An Overview of Research Frameworks and a Dimensional Approach to Emotional Speech Synthesis," Ph.D. Thesis, Res. Rep., Institute of Phonetics, Saarland Univ., Saarland, Germany, 2004, vol. 7 of Phonus.
- (2004) , vol.7
- Schröder, M.¹

20
- 54949130633
- M. Schröder, Can Emotions be Synthesized Without Controlling Voice Quality? Phonus 4, Res. Rep., Inst. Phonetics, Univ. Saarland, Germany, pp. 37-55, 2004.

21
- 9444268127
- Expressing vocal effort in concatenative synthesis
- Barcelona, Spain
- M. Schröder and M. Grice, "Expressing vocal effort in concatenative synthesis," in Proc. 15th Int. Conf. of Phonetic, Barcelona, Spain, 2003, pp. 2589-2592.
- (2003) Proc. 15th Int. Conf. of Phonetic , pp. 2589-2592
- Schröder, M.¹ Grice, M.²

22
- 0003833128
- Generating Expression in Synthesized Speech,
- Master's thesis, MIT Media Lab
- J. E. Cahn, "Generating Expression in Synthesized Speech," Master's thesis, MIT Media Lab, 1989.
- (1989)
- Cahn, J.E.¹

23
- 54949109821
- I. R. Murray, "Simulating Emotion in Synthetic Speech," Ph.D. thesis, Univ. Dundee, Dundee, U.K., 1989.

24
- 0242634024
- Simulation Emotionaler Sprechweise mit Sprachsyntheseverfahren,
- Ph.D. thesis, Tech. Univ. Berlin, Germany
- F. Burkhardt, "Simulation Emotionaler Sprechweise mit Sprachsyntheseverfahren," Ph.D. thesis, Tech. Univ. Berlin, Germany, 2000.
- (2000)
- Burkhardt, F.¹

25
- 54949120636
- Corpus-Based Speech Synthesis With Emotion,
- Ph.D. Thesis, Univ. Keio, Tokyo, Japan
- A. Iida, "Corpus-Based Speech Synthesis With Emotion," Ph.D. Thesis, Univ. Keio, Tokyo, Japan, 2002.
- (2002)
- Iida, A.¹

26
- 54949096620
- G. Hofer, "Emotional Speech Synthesis," Master thesis, Univ. Edinburgh, Edinburgh, U.K., 2004.

27
- 34047275265
- The IBM expressive text-to-speech synthesis system for American English
- J. F. Pitrelli, R. Bakis, E. M. Eide, R. Fernandez, W. Hamza, and M. A. Picheny, "The IBM expressive text-to-speech synthesis system for American English," IEEE Trans. Audio, Speech Lang. Process, vol. 14, no. 4, pp. 1099-1108, 2006.
- (2006) IEEE Trans. Audio, Speech Lang. Process , vol.14 , Issue.4 , pp. 1099-1108
- Pitrelli, J.F.¹ Bakis, R.² Eide, E.M.³ Fernandez, R.⁴ Hamza, W.⁵ Picheny, M.A.⁶

28
- 0032626647
- Explanation-based facial motion tracking using a piecewise Bézier volume deformation model
- H. Tao and T. S. Huang, "Explanation-based facial motion tracking using a piecewise Bézier volume deformation model," in IEEE Conf. CVPR'99, 1999, pp. 611-617.
- (1999) IEEE Conf. CVPR'99 , pp. 611-617
- Tao, H.¹ Huang, T.S.²

29
- 0004122446
- Palo Alto, CA: Psychological Press
- P. Ekman and W. V. Friesen, The Facial Action Coding System. Palo Alto, CA: Psychological Press, 1977.
- (1977) The Facial Action Coding System
- Ekman, P.¹ Friesen, W.V.²

30
- 54949087920
- The Festival Project [Online]. Available: http://www.cstr.ed.ac.uk/projects/festival/
- The Festival Project

31
- 54949147960
- The MBROLA Project [Online]. Available: http://mambo.ucsc.edu/psl/mbrola/
- The MBROLA Project

32
- 6644227591
- Development of an emotional speech synthesiser in Spanish
- Budapest, Hungary
- J. M. Montero, J. Gutiérrez-Arriola, J. Colás, J. Macías-Guarasa, E. Enriquez, and J. M. Pardo, "Development of an emotional speech synthesiser in Spanish," in Proc. Eurospeech '99, Budapest, Hungary, 1999.
- (1999) Proc. Eurospeech '99
- Montero, J.M.¹ Gutiérrez-Arriola, J.² Colás, J.³ Macías-Guarasa, J.⁴ Enriquez, E.⁵ Pardo, J.M.⁶

33
- 33745199181
- Emofilt: The simulation of emotional speech by prosody-transformation
- Lisbon, Portugal
- F. Burkhardt, "Emofilt: The simulation of emotional speech by prosody-transformation," in Proc. INTERSPEECH-2005, Lisbon, Portugal, 2005, pp. 509-512.
- (2005) Proc. INTERSPEECH-2005 , pp. 509-512
- Burkhardt, F.¹

34
- 33645777234
- Expressive speech-driven facial animation
- Y. Cao, P. Faloutsos, and F. Pighin, "Expressive speech-driven facial animation," ACM Trans. Graph., vol. 24, no. 4, 2005.
- (2005) ACM Trans. Graph , vol.24 , Issue.4
- Cao, Y.¹ Faloutsos, P.² Pighin, F.³

35
- 0034865155
- Principal components of expressive speech animation
- S. Kshirsagar, T. Molet, and N. Magnenat-Thalmann, "Principal components of expressive speech animation," in Proc. Int. Conf. on Computer Graphics, 2001, pp. 38-44.
- (2001) Proc. Int. Conf. on Computer Graphics , pp. 38-44
- Kshirsagar, S.¹ Molet, T.² Magnenat-Thalmann, N.³

36
- 54949145414
- San Francisco, CA, Available from HIL-0984, UCSF
- P. Ekman, T. S. Huang, T. Sejnowski, and J. Hager, in Final Report to NSF of the Planning Workshop on Facial Expression Understanding, San Francisco, CA, 1993, Available from HIL-0984, UCSF.
- (1993) Final Report to NSF of the Planning Workshop on Facial Expression Understanding
- Ekman, P.¹ Huang, T.S.² Sejnowski, T.³ Hager, J.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.