메뉴 건너뛰기




Volumn 44, Issue 1-4 SPEC. ISS., 2004, Pages 141-154

An articulation model for audiovisual speech synthesis - Determination, adjustment, evaluation

Author keywords

Articulation model; Audiovisual speech synthesis; Auditory visual speech perception; Talking head

Indexed keywords

ARTICULATION MODEL; AUDIOVISUAL SPEECH; TALKING HEAD;

EID: 10444283472     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2004.10.006     Document Type: Article
Times cited : (55)

References (25)
  • 1
    • 10444226718 scopus 로고    scopus 로고
    • On the production and the perception of audio-visual speech by man and machine
    • Bertoni, H., Wang, Y., Panwar, S. (Eds.), Plenum Press, New York
    • Benoît, C., 1996. On the production and the perception of audio-visual speech by man and machine. In: Bertoni, H., Wang, Y., Panwar, S. (Eds.), Multimedia and Video Coding. Plenum Press, New York.
    • (1996) Multimedia and Video Coding
    • Benoît, C.1
  • 2
    • 4143072802 scopus 로고    scopus 로고
    • Trainable articulatory control models for visual speech synthesis
    • Beskow, J., 2004. Trainable articulatory control models for visual speech synthesis. Int. J. Speech Technol.
    • (2004) Int. J. Speech Technol.
    • Beskow, J.1
  • 4
    • 0026200336 scopus 로고
    • Crossmodal integration in the identification of consonant segments
    • Braida, L., 1991. Crossmodal integration in the identification of consonant segments. Quarter. J. Exp. Psychol. 43, 647-677.
    • (1991) Quarter. J. Exp. Psychol. , vol.43 , pp. 647-677
    • Braida, L.1
  • 7
    • 0001514782 scopus 로고
    • Modeling co-articulation in synthetic visual speech
    • Magnenat Thalmann, N., Thalmann, D. (Eds.), Springer-Verlag, Tokyo
    • Cohen, M.M., Massaro, D.W., 1993. Modeling co-articulation in synthetic visual speech. In: Magnenat Thalmann, N., Thalmann, D. (Eds.), Models and Techniques in Computer Animation. Springer-Verlag, Tokyo, pp. 139-156.
    • (1993) Models and Techniques in Computer Animation , pp. 139-156
    • Cohen, M.M.1    Massaro, D.W.2
  • 9
    • 0014366349 scopus 로고
    • Confusions among visually perceived consonants
    • Fisher, C.G., 1968. Confusions among visually perceived consonants. J. Speech Hearing Res. 11, 769-804.
    • (1968) J. Speech Hearing Res. , vol.11 , pp. 769-804
    • Fisher, C.G.1
  • 10
    • 0031684026 scopus 로고    scopus 로고
    • Measures of auditory-visual integration in non-sense syllables and sentences
    • Grant, K.G., Seitz, P.F., 1998. Measures of auditory-visual integration in non-sense syllables and sentences. J. Acoust. Soc. Amer. 104, 2438-2450.
    • (1998) J. Acoust. Soc. Amer. , vol.104 , pp. 2438-2450
    • Grant, K.G.1    Seitz, P.F.2
  • 11
    • 85133504159 scopus 로고    scopus 로고
    • Automatic modelling of coarticulation in text-to-visual-speech synthesis
    • Rhodos
    • Le Goff, B., 1997. Automatic modelling of coarticulation in text-to-visual-speech synthesis. In: Proceedings of the 5th Eurospeech Conference, Rhodos, pp. 1667-1670.
    • (1997) Proceedings of the 5th Eurospeech Conference , pp. 1667-1670
    • Le Goff, B.1
  • 13
    • 0003116759 scopus 로고
    • Speech as audible gestures
    • Hardcastle, A., Marchal, A. (Eds.), Kluwer Academic Publishers, Dodrecht
    • Löfqvist, A., 1990. Speech as audible gestures. In: Hardcastle, A., Marchal, A. (Eds.), Speech Production and Speech Modeling. Kluwer Academic Publishers, Dodrecht, pp. 289-322.
    • (1990) Speech Production and Speech Modeling , pp. 289-322
    • Löfqvist, A.1
  • 15
    • 0017199877 scopus 로고
    • Hearing lips and seeing voices
    • McGurk, H., MacDonald, I., 1976. Hearing lips and seeing voices. Nature 264, 746-748.
    • (1976) Nature , vol.264 , pp. 746-748
    • McGurk, H.1    MacDonald, I.2
  • 17
    • 0041048870 scopus 로고    scopus 로고
    • Articulatory evidence for syllabic structure
    • Munhall, K.G., Jones, J., 1998. Articulatory evidence for syllabic structure. Behavioral Brain Sci. 21, 524-525.
    • (1998) Behavioral Brain Sci. , vol.21 , pp. 524-525
    • Munhall, K.G.1    Jones, J.2
  • 18
    • 0033336969 scopus 로고    scopus 로고
    • User evaluation: Synthetic talking faces for interactive services
    • Pandzic, I.S., Ostermann, J., Millen, D., 1999. User evaluation: Synthetic talking faces for interactive services. Visual Comput. J. 15, 330-340.
    • (1999) Visual Comput. J. , vol.15 , pp. 330-340
    • Pandzic, I.S.1    Ostermann, J.2    Millen, D.3
  • 19
    • 0022980515 scopus 로고
    • Ein Verfahren zur Messung von Fehlleistungen beim Sprachverstehen - Überlegungen und erste Ergebnisse
    • Sendlmeier, W.F., v. Wedel, H., 1986. Ein Verfahren zur Messung von Fehlleistungen beim Sprachverstehen - Überlegungen und erste Ergebnisse. Sprache-Stimme-Gehör 10, 164-169.
    • (1986) Sprache-stimme-gehör , vol.10 , pp. 164-169
    • Sendlmeier, W.F.1    Wedel, H.2
  • 20
    • 84955023977 scopus 로고
    • Visual contribution to speech intelligibility in noise
    • Sumby, W., Pollack, I., 1954. Visual contribution to speech intelligibility in noise. J. Acoust. Soc. Amer. 26, 212-215.
    • (1954) J. Acoust. Soc. Amer. , vol.26 , pp. 212-215
    • Sumby, W.1    Pollack, I.2
  • 21
    • 84873569922 scopus 로고    scopus 로고
    • The MBROLA Project, 2003. Available from: 〈http://tcts.fpms.ac.be/synthesis/mbrola.html〉.
    • (2003) The MBROLA Project
  • 22
    • 0013241487 scopus 로고    scopus 로고
    • The Web3D Consortium, 1997. VRML - Virtual Reality Modeling Language. Available from: 〈http://www.web3d.org/technicalinfo/specifications.htm〉.
    • (1997) VRML - Virtual Reality Modeling Language
  • 25
    • 84873568062 scopus 로고    scopus 로고
    • ZAS - Zentrum fÜr allgemeine Sprachwissenschaft, 2003. Available from: 〈http://www.zas.gwz-berlin.de〉.
    • (2003)


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.