메뉴 건너뛰기




Volumn 10, Issue 6, 2008, Pages 969-981

Humanoid audio-visual avatar with emotive text-to-speech synthesis

Author keywords

3 D face modeling and animation; Audio visual avatar; Emotive speech synthesis; Human computer interaction; Multimodal system; TTS

Indexed keywords

ANIMATION; FEATURE EXTRACTION; FLOW INTERACTIONS; HUMAN COMPUTER INTERACTION; KNOWLEDGE MANAGEMENT; SPEECH; SPEECH SYNTHESIS; THREE DIMENSIONAL; THREE DIMENSIONAL COMPUTER GRAPHICS; VIRTUAL REALITY;

EID: 54949115779     PISSN: 15209210     EISSN: None     Source Type: Journal    
DOI: 10.1109/TMM.2008.2001355     Document Type: Article
Times cited : (30)

References (36)
  • 1
  • 3
    • 33746894562 scopus 로고    scopus 로고
    • M-face: An appearance-based photorealistic model for multiple facial attributes rendering
    • Jul
    • Y. Fu and N. Zheng, "M-face: An appearance-based photorealistic model for multiple facial attributes rendering," IEEE Trans. Circuits Syst. Video Technol., vol. 16, no. 7, pp. 830-842, Jul. 2006.
    • (2006) IEEE Trans. Circuits Syst. Video Technol , vol.16 , Issue.7 , pp. 830-842
    • Fu, Y.1    Zheng, N.2
  • 5
    • 46449119257 scopus 로고    scopus 로고
    • Real-time humanoid avatar for multimodal human-machine interaction
    • Y. Fu, R. Li, T. S. Huang, and M. Danielsen, "Real-time humanoid avatar for multimodal human-machine interaction," in IEEE Conf. ICME'07, 2007, pp. 991-994.
    • (2007) IEEE Conf. ICME'07 , pp. 991-994
    • Fu, Y.1    Li, R.2    Huang, T.S.3    Danielsen, M.4
  • 6
    • 0036650837 scopus 로고    scopus 로고
    • Real-time speech-driven face animation with expressions using neural networks
    • P. Hong, Z. Wen, and T. S. Huang, "Real-time speech-driven face animation with expressions using neural networks," IEEE Trans. Neural Netw., vol. 13, no. 4, pp. 916-927, 2002.
    • (2002) IEEE Trans. Neural Netw , vol.13 , Issue.4 , pp. 916-927
    • Hong, P.1    Wen, Z.2    Huang, T.S.3
  • 7
    • 0001260696 scopus 로고    scopus 로고
    • iFace: A 3D synthetic talkingn face
    • P. Hong, Z. Wen, and T. S. Huang, "iFace: A 3D synthetic talkingn face," Int. J. Image Graph., vol. 1, no. 1, pp. 19-26, 2001.
    • (2001) Int. J. Image Graph , vol.1 , Issue.1 , pp. 19-26
    • Hong, P.1    Wen, Z.2    Huang, T.S.3
  • 10
    • 85028639150 scopus 로고    scopus 로고
    • A morphable model for the synthesis of 3D faces
    • V. Blanz and T. Vetter, "A morphable model for the synthesis of 3D faces," Proc. SIGGRAPH'99, pp. 187-194, 1999.
    • (1999) Proc. SIGGRAPH'99 , pp. 187-194
    • Blanz, V.1    Vetter, T.2
  • 11
    • 85018094829 scopus 로고
    • Computer generated animation of faces
    • F. I. Parke, "Computer generated animation of faces," in Pioc. ACM Nat. Conf., 1972, pp. 451-457.
    • (1972) Pioc. ACM Nat. Conf , pp. 451-457
    • Parke, F.I.1
  • 12
    • 0032643867 scopus 로고    scopus 로고
    • PingPongPlus: Design of an athletic-tangible interface for computer-supported cooperative play
    • 99, pp
    • H. Ishii, C. Wisneski, J. Orbanes, B. Chun, and J. Paradiso, "PingPongPlus: Design of an athletic-tangible interface for computer-supported cooperative play," Proc. ACM SIGCHI'99, pp. 394-401, 1999.
    • (1999) Proc. ACM SIGCHI , pp. 394-401
    • Ishii, H.1    Wisneski, C.2    Orbanes, J.3    Chun, B.4    Paradiso, J.5
  • 13
    • 0344212675 scopus 로고    scopus 로고
    • I. Pandzic and R. Forchheimer, Eds, Chichester, U.K, Wiley
    • I. Pandzic and R. Forchheimer, Eds., MPEG-4 Facial Animation, Chichester, U.K.: Wiley, 2002.
    • (2002) MPEG-4 Facial Animation
  • 14
    • 54949085453 scopus 로고    scopus 로고
    • Online, Available
    • "DAZ3D," [Online]. Available: http://www.daz3d.com/
    • DAZ3D
  • 15
    • 2942596586 scopus 로고    scopus 로고
    • Constrained optimization for audio-to-visual conversion
    • Jun
    • K.-H. Choi and J.-N. Hwang, "Constrained optimization for audio-to-visual conversion," IEEE Trans. Signal Process., vol. 52, no. 6, pp. 1783-1790, Jun. 2004.
    • (2004) IEEE Trans. Signal Process , vol.52 , Issue.6 , pp. 1783-1790
    • Choi, K.-H.1    Hwang, J.-N.2
  • 16
    • 31344439475 scopus 로고    scopus 로고
    • Accurate visible speech synthesis based on concatenating variable length motion capture data
    • J. Ma, R. Cole, B. Pellom, W. Ward, and B. Wise, "Accurate visible speech synthesis based on concatenating variable length motion capture data," IEEE Trans. Vis. Comput. Graph., vol. 12, no. 2, pp. 266-276, 2006.
    • (2006) IEEE Trans. Vis. Comput. Graph , vol.12 , Issue.2 , pp. 266-276
    • Ma, J.1    Cole, R.2    Pellom, B.3    Ward, W.4    Wise, B.5
  • 17
    • 0001185920 scopus 로고
    • Communication without words
    • A. Mehrabian, "Communication without words," Psychol. Today, vol. 2, pp. 53-56, 1968.
    • (1968) Psychol. Today , vol.2 , pp. 53-56
    • Mehrabian, A.1
  • 19
    • 9444257562 scopus 로고    scopus 로고
    • Speech and Emotion Research: An Overview of Research Frameworks and a Dimensional Approach to Emotional Speech Synthesis,
    • Ph.D. Thesis, Res. Rep, Institute of Phonetics, Saarland Univ, Saarsland, Germany, of Phonus
    • M. Schröder, "Speech and Emotion Research: An Overview of Research Frameworks and a Dimensional Approach to Emotional Speech Synthesis," Ph.D. Thesis, Res. Rep., Institute of Phonetics, Saarland Univ., Saarsland, Germany, 2004, vol. 7 of Phonus.
    • (2004) , vol.7
    • Schröder, M.1
  • 20
    • 54949130633 scopus 로고    scopus 로고
    • M. Schröder, Can Emotions be Synthesized Without Controlling Voice Quality? Phonus 4, Res. Rep., Inst. Phonetics, Univ. Saarsland, Germany, pp. 37-55, 2004.
    • M. Schröder, Can Emotions be Synthesized Without Controlling Voice Quality? Phonus 4, Res. Rep., Inst. Phonetics, Univ. Saarsland, Germany, pp. 37-55, 2004.
  • 21
    • 9444268127 scopus 로고    scopus 로고
    • Expressing vocal effort in concatenative synthesis
    • Barcelona, Spain
    • M. Schröder and M. Grice, "Expressing vocal effort in concatenative synthesis," in Proc. 15th Int. Conf. of Phonetic, Barcelona, Spain, 2003, pp. 2589-2592.
    • (2003) Proc. 15th Int. Conf. of Phonetic , pp. 2589-2592
    • Schröder, M.1    Grice, M.2
  • 22
    • 0003833128 scopus 로고
    • Generating Expression in Synthesized Speech,
    • Master's thesis, MIT Media Lab
    • J. E. Cahn, "Generating Expression in Synthesized Speech," Master's thesis, MIT Media Lab, , 1989.
    • (1989)
    • Cahn, J.E.1
  • 23
    • 54949109821 scopus 로고    scopus 로고
    • I. R. Murray, Simulating Emotion in Synthetic Speech, Ph.D. thesis, Univ. Dundee, Dundee, U.K., 1989.
    • I. R. Murray, "Simulating Emotion in Synthetic Speech," Ph.D. thesis, Univ. Dundee, Dundee, U.K., 1989.
  • 24
    • 0242634024 scopus 로고    scopus 로고
    • Simulation Emotionaler Sprechweise mit Sprachsyntheseverfahren,
    • Ph.D. thesis, Tech. Univ. Berlin, Germany
    • F. Burkhardt, "Simulation Emotionaler Sprechweise mit Sprachsyntheseverfahren," Ph.D. thesis, Tech. Univ. Berlin, Germany, 2000.
    • (2000)
    • Burkhardt, F.1
  • 25
    • 54949120636 scopus 로고    scopus 로고
    • Corpus-Based Speech Synthesis With Emotion,
    • Ph.D. Thesis, Univ. Keio, Tokyo, Japan
    • A. Iida, "Corpus-Based Speech Synthesis With Emotion," Ph.D. Thesis, Univ. Keio, Tokyo, Japan, 2002.
    • (2002)
    • Iida, A.1
  • 26
    • 54949096620 scopus 로고    scopus 로고
    • G. Hofer, Emotional Speech Synthesis, Master thesis, Univ. Edinburgh, Edinburgh, U.K., 2004.
    • G. Hofer, "Emotional Speech Synthesis," Master thesis, Univ. Edinburgh, Edinburgh, U.K., 2004.
  • 28
    • 0032626647 scopus 로고    scopus 로고
    • Explanation-based facial motion tracking using a piecewise bezier volume deformation model
    • H. Tao and T. S. Huang, "Explanation-based facial motion tracking using a piecewise bezier volume deformation model," in IEEE Conf. CVPR'99, 1999, pp. 611-617.
    • (1999) IEEE Conf. CVPR'99 , pp. 611-617
    • Tao, H.1    Huang, T.S.2
  • 30
    • 54949087920 scopus 로고    scopus 로고
    • Online, Available
    • The Festival Project [Online]. Available: http://www.cstr.ed.ac.uk/ projects/festival/
    • The Festival Project
  • 31
    • 54949147960 scopus 로고    scopus 로고
    • Online, Available
    • TheMBROLA Project [Online]. Available: http://mambo.ucsc.edu/psl/mbrola/
    • TheMBROLA Project
  • 33
    • 33745199181 scopus 로고    scopus 로고
    • Emofilt: The simulation of emotional speech by prosody-transformation
    • Lisbon, Portugal
    • F. Burkhardt, "Emofilt: The simulation of emotional speech by prosody-transformation," in Proc. INTERSPEECH-2005, Lisbon, Portugal, 2005, pp. 509-512.
    • (2005) Proc. INTERSPEECH-2005 , pp. 509-512
    • Burkhardt, F.1
  • 34
    • 33645777234 scopus 로고    scopus 로고
    • Expressive speech-driven facial animation
    • Y. Cao, P. Faloutsos, and F. Pighin, "Expressive speech-driven facial animation," ACM Trans. Graph., vol. 24, no. 4, 2005.
    • (2005) ACM Trans. Graph , vol.24 , Issue.4
    • Cao, Y.1    Faloutsos, P.2    Pighin, F.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.