메뉴 건너뛰기




Volumn 6, Issue 4, 2003, Pages 331-346

Audiovisual Speech Synthesis

Author keywords

Audiovisual synthesis; Facial animation; Talking faces; Text to speech synthesis

Indexed keywords

ANIMATION; DATA ACQUISITION; DEFORMATION; SPEECH ANALYSIS;

EID: 0142216141     PISSN: 13812416     EISSN: None     Source Type: Journal    
DOI: 10.1023/A:1025700715107     Document Type: Conference Paper
Times cited : (70)

References (64)
  • 1
    • 0001785334 scopus 로고    scopus 로고
    • Towards an audiovisual virtual talking head: 3D articulatory modeling of tongue, lips and face based on MRI and video images
    • Germany: Kloster Seeon
    • Badin, P., Borel, P., Bailly, G., Revéret, L., Baciu, M., and Segebarth, C. (2000). Towards an audiovisual virtual talking head: 3D articulatory modeling of tongue, lips and face based on MRI and video images. Proceedings of the 5th Speech Production Seminar, Germany: Kloster Seeon, pp. 261-264.
    • (2000) Proceedings of the 5th Speech Production Seminar , pp. 261-264
    • Badin, P.1    Borel, P.2    Bailly, G.3    Revéret, L.4    Baciu, M.5    Segebarth, C.6
  • 2
    • 0031198820 scopus 로고    scopus 로고
    • Learning to speak. Sensori-motor control of speech movements
    • Bailly, G. (1998). Learning to speak. Sensori-motor control of speech movements. Speech Communication, 22(2/3):251-267.
    • (1998) Speech Communication , vol.22 , Issue.2-3 , pp. 251-267
    • Bailly, G.1
  • 3
    • 84966335540 scopus 로고    scopus 로고
    • Evaluation of movement generation systems using the point-light technique
    • Santa Monica, CA
    • Bailly, G., Gibert, G., and Odisio, M. (2002). Evaluation of movement generation systems using the point-light technique. IEEE Workshop on Speech Synthesis, Santa Monica, CA.
    • (2002) IEEE Workshop on Speech Synthesis
    • Bailly, G.1    Gibert, G.2    Odisio, M.3
  • 4
  • 5
    • 0013132012 scopus 로고
    • Controlling facial expression and body movements in the computer-generated short "Tony de Peltrie"
    • San Francisco, CA
    • Bergeron, P. and Lachapelle, P. (1985). Controlling facial expression and body movements in the computer-generated short "Tony de Peltrie". SIGGRAPH, Advanced Computer Animation Seminar Notes, San Francisco, CA.
    • (1985) SIGGRAPH, Advanced Computer Animation Seminar Notes
    • Bergeron, P.1    Lachapelle, P.2
  • 8
    • 84937437186 scopus 로고    scopus 로고
    • Voice pupperty
    • Los Angeles, CA
    • Brand, M. (1999). Voice pupperty. SIGGRAPH'99, Los Angeles, CA, pp. 21-28.
    • (1999) SIGGRAPH'99 , pp. 21-28
    • Brand, M.1
  • 9
    • 0030677313 scopus 로고    scopus 로고
    • VideoRewrite: Driving visual speech with audio
    • Los Angeles, CA
    • Bregler, C., Covell, M., and Slaney, M. (1997a). VideoRewrite: Driving visual speech with audio. SIGGRAPH'97, Los Angeles, CA, pp. 353-360.
    • (1997) SIGGRAPH'97 , pp. 353-360
    • Bregler, C.1    Covell, M.2    Slaney, M.3
  • 12
    • 84955535347 scopus 로고
    • Gestural specification using dynamically-defined articulatory structures
    • Browman, C.P. and Goldstein, L.M. (1990). Gestural specification using dynamically-defined articulatory structures. Journal of Phonetics, 18(3):299-320.
    • (1990) Journal of Phonetics , vol.18 , Issue.3 , pp. 299-320
    • Browman, C.P.1    Goldstein, L.M.2
  • 14
    • 0001514782 scopus 로고
    • Modeling coarticulation in synthetic visual speech
    • D. Thalmann and N. Magnenat-Thalmann (Eds.). Springer-Verlag: Tokyo
    • Cohen, M.M. and Massaro, D.W. (1993). Modeling coarticulation in synthetic visual speech. In D. Thalmann and N. Magnenat-Thalmann (Eds.), Models and Techniques in Computer Animation. Springer-Verlag: Tokyo, pp. 141-155.
    • (1993) Models and Techniques in Computer Animation , pp. 141-155
    • Cohen, M.M.1    Massaro, D.W.2
  • 16
    • 0142241582 scopus 로고    scopus 로고
    • Sample-based synthesis of photo-realistic talking-heads
    • Los Angeles, CA
    • Cosatto, E. and Graf, H.P. (1997). Sample-based synthesis of photo-realistic talking-heads. SIGGRAPH'97, Los Angeles, CA, pp. 353-360.
    • (1997) SIGGRAPH'97 , pp. 353-360
    • Cosatto, E.1    Graf, H.P.2
  • 17
    • 84872004031 scopus 로고    scopus 로고
    • Sample-based synthesis of photo-realistic talking heads
    • Philadelphia, Pennsylvania
    • Cosatto, E. and Graf, H.P. (1998). Sample-based synthesis of photo-realistic talking heads. Computer Animation, Philadelphia, Pennsylvania, pp. 103-110.
    • (1998) Computer Animation , pp. 103-110
    • Cosatto, E.1    Graf, H.P.2
  • 18
    • 0034070906 scopus 로고    scopus 로고
    • The Mesh-Matching algorithm: An automatic 3D mesh generator for finite element structures
    • Couteau, B., Payan, Y., and Lavallée, S. (2000). The Mesh-Matching algorithm: An automatic 3D mesh generator for finite element structures. Journal of Biomechanics, 35(8):1005-1009.
    • (2000) Journal of Biomechanics , vol.35 , Issue.8 , pp. 1005-1009
    • Couteau, B.1    Payan, Y.2    Lavallée, S.3
  • 21
    • 0004167520 scopus 로고
    • Palo Alto, California: Consulting Psychologists Press
    • Ekman, P. and Friesen, W.V. (1975). Unmasking the Face. Palo Alto, California: Consulting Psychologists Press.
    • (1975) Unmasking the Face
    • Ekman, P.1    Friesen, W.V.2
  • 25
  • 26
    • 85031438802 scopus 로고    scopus 로고
    • Visual speech synthesis with concatenative speech
    • Terrigal-Sydney, Australia
    • Hällgren, Å. and Lyberg, B. (1998). Visual speech synthesis with concatenative speech. Auditory-Visual Speech Processing Conference, Terrigal-Sydney, Australia, pp. 181-183.
    • (1998) Auditory-Visual Speech Processing Conference , pp. 181-183
    • Hällgren, Å.1    Lyberg, B.2
  • 27
    • 0002258223 scopus 로고
    • The PARAFAC model for three-way factor analysis and multidimensional scaling
    • H.G. Law, C.W. Snyder, J.A. Hattie, and R.P. MacDonald (Eds.). New-York: Praeger
    • Harshman, R.A. and Lundy, M.E. (1984). The PARAFAC model for three-way factor analysis and multidimensional scaling. In H.G. Law, C.W. Snyder, J.A. Hattie, and R.P. MacDonald (Eds.), Research Methods for Multimode Data Analysis. New-York: Praeger, pp. 122-215.
    • (1984) Research Methods for Multimode Data Analysis , pp. 122-215
    • Harshman, R.A.1    Lundy, M.E.2
  • 29
    • 0027607090 scopus 로고
    • 3D motion estimation in model-based facial image coding
    • Li, H., Roivanen, P., and Forchheimer, R. (1993). 3D motion estimation in model-based facial image coding. IEEE Transactions on PAMI, 15(6):545-555.
    • (1993) IEEE Transactions on PAMI , vol.15 , Issue.6 , pp. 545-555
    • Li, H.1    Roivanen, P.2    Forchheimer, R.3
  • 30
    • 0142179494 scopus 로고    scopus 로고
    • Illusions and issues in bimodal speech perception
    • Terrigal, Sydney, Australia
    • Massaro, D. (1998a). Illusions and issues in bimodal speech perception. Auditory-Visual Speech Processing Conference, Terrigal, Sydney, Australia, pp. 21-26.
    • (1998) Auditory-Visual Speech Processing Conference , pp. 21-26
    • Massaro, D.1
  • 33
    • 0017199877 scopus 로고
    • Hearing lips and seeing voices
    • McGurk, H. and MacDonald, J. (1976). Hearing lips and seeing voices. Nature, 26:746-748.
    • (1976) Nature , vol.26 , pp. 746-748
    • McGurk, H.1    Macdonald, J.2
  • 34
    • 85009080445 scopus 로고    scopus 로고
    • Modeling visual coarticulation in synthetic talking heads using a lip motion unit inventory with concatenative synthesis
    • Beijing, China
    • Minnis, S. and Breen, A.P. (1998). Modeling visual coarticulation in synthetic talking heads using a lip motion unit inventory with concatenative synthesis. ICSLP, Beijing, China, pp. 759-762.
    • (1998) ICSLP , pp. 759-762
    • Minnis, S.1    Breen, A.P.2
  • 40
    • 0033336969 scopus 로고    scopus 로고
    • Users evaluation: Synthetic talking faces for interactive services
    • Pandzic, I., Ostermann, J., and Millen, D. (1999). Users evaluation: Synthetic talking faces for interactive services. The Visual Computer, 15:330-340.
    • (1999) The Visual Computer , vol.15 , pp. 330-340
    • Pandzic, I.1    Ostermann, J.2    Millen, D.3
  • 41
    • 85018094829 scopus 로고
    • Computer generated animation of faces
    • Salt Lake City
    • Parke, F.I. (1972). Computer generated animation of faces. ACM National Conference, Salt Lake City, pp. 451-457.
    • (1972) ACM National Conference , pp. 451-457
    • Parke, F.I.1
  • 42
    • 50849153856 scopus 로고
    • A model for human faces that allows speech synchronized animation
    • Parke, F.I. (1975). A model for human faces that allows speech synchronized animation. Journal of Computers and Graphics, 1(1): 1-4.
    • (1975) Journal of Computers and Graphics , vol.1 , Issue.1 , pp. 1-4
    • Parke, F.I.1
  • 43
    • 0020202671 scopus 로고
    • A parametrized model for facial animation
    • Parke, F.I. (1982). A parametrized model for facial animation. IEEE Computer Graphics and Applications, 2(9):61-70.
    • (1982) IEEE Computer Graphics and Applications , vol.2 , Issue.9 , pp. 61-70
    • Parke, F.I.1
  • 45
  • 47
    • 0012433827 scopus 로고    scopus 로고
    • Perception of synthetic speech
    • J.P.H.V. Santen, R.W. Sproat, J.P. Olive, and J. Hirschberg (Eds.). Springer Verlag: New York
    • Pisoni, D.B. (1997). Perception of synthetic speech. In J.P.H.V. Santen, R.W. Sproat, J.P. Olive, and J. Hirschberg (Eds.), Progress in Speech Synthesis. Springer Verlag: New York. pp. 541-560.
    • (1997) Progress in Speech Synthesis , pp. 541-560
    • Pisoni, D.B.1
  • 48
    • 0019603077 scopus 로고
    • Animating facial expressions
    • Platt, S.M. and Badler, N.I. (1981). Animating facial expressions. Computer Graphics, 15(3):245-252.
    • (1981) Computer Graphics , vol.15 , Issue.3 , pp. 245-252
    • Platt, S.M.1    Badler, N.I.2
  • 50
    • 84870292720 scopus 로고    scopus 로고
    • MOTHER: A new generation of talking heads providing a flexible articulatory control for video-realistic speech animation
    • Beijing, China
    • Revéret, L., Bailly, G., and Badin, P. (2000). MOTHER: A new generation of talking heads providing a flexible articulatory control for video-realistic speech animation. International Conference on Speech and Language Processing, Beijing, China, pp. 755-758.
    • (2000) International Conference on Speech and Language Processing , pp. 755-758
    • Revéret, L.1    Bailly, G.2    Badin, P.3
  • 51
    • 0004222842 scopus 로고
    • Sweden, Dept. of Electrical Engineering, Linköping University: LiTH-ISY-I-866
    • Rydfalk, M. (1987). CANDIDE, a parameterized face. Sweden, Dept. of Electrical Engineering, Linköping University: LiTH-ISY-I-866.
    • (1987) CANDIDE, a Parameterized Face
    • Rydfalk, M.1
  • 52
    • 0030409654 scopus 로고    scopus 로고
    • View morphing
    • New Orleans, Louisiana
    • Seitz, S.M. and Dyer, C.R. (1996). View morphing. ACM SIGGRAPH, New Orleans, Louisiana, pp. 21-30.
    • (1996) ACM SIGGRAPH , pp. 21-30
    • Seitz, S.M.1    Dyer, C.R.2
  • 53
    • 0026348904 scopus 로고
    • Different phase-stable relationships of the upper lip and jaw for production of vowels and diphthongs
    • Shaiman, S. and Porter, R.J. (1991). Different phase-stable relationships of the upper lip and jaw for production of vowels and diphthongs. Journal of the Acoustical Society of America, 90:3000-3007.
    • (1991) Journal of the Acoustical Society of America , vol.90 , pp. 3000-3007
    • Shaiman, S.1    Porter, R.J.2
  • 54
    • 0003058857 scopus 로고
    • On the basic scheme and algorithms in non-uniform unit speech synthesis
    • G. Bailly and C. Benoît (Eds.). Elsevier B.V.
    • Takeda, K., Abe, K., and Sagisaka, Y. (1992). On the basic scheme and algorithms in non-uniform unit speech synthesis. In G. Bailly and C. Benoît (Eds.), Talking Machines: Theories, Models and Designs. Elsevier B.V., pp. 93-105.
    • (1992) Talking Machines: Theories, Models and Designs , pp. 93-105
    • Takeda, K.1    Abe, K.2    Sagisaka, Y.3
  • 61
    • 0142210579 scopus 로고    scopus 로고
    • A text-speech synchronization technique with applications to talking heads
    • Santa Cruz, California, USA
    • Vignoli, F. and Braccini, C. (1999). A text-speech synchronization technique with applications to talking heads. Auditory-Visual Speech Processing Conference, Santa Cruz, California, USA, pp. 128-132.
    • (1999) Auditory-Visual Speech Processing Conference , pp. 128-132
    • Vignoli, F.1    Braccini, C.2
  • 62
    • 0023379314 scopus 로고
    • A muscle model for animating three-dimensional facial expression
    • Waters, K. (1987). A muscle model for animating three-dimensional facial expression. Computer Graphics, 21(4):17-24.
    • (1987) Computer Graphics , vol.21 , Issue.4 , pp. 17-24
    • Waters, K.1
  • 64
    • 0032179320 scopus 로고    scopus 로고
    • Lip movement synthesis from speech based on Hidden Markov Models
    • Yamamoto, E., Nakamura, S., and Shikano, K. (1998). Lip movement synthesis from speech based on Hidden Markov Models. Speech Communication, 26(1-2):105-115.
    • (1998) Speech Communication , vol.26 , Issue.1-2 , pp. 105-115
    • Yamamoto, E.1    Nakamura, S.2    Shikano, K.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.