메뉴 건너뛰기




Volumn 26, Issue 1-2, 1998, Pages 117-129

Audio-visual speech synthesis from French text: Eight years of models, designs and evaluation at the ICP

Author keywords

3D lip model; Coarticulation; Face animation; French visemes; Intelligibility; Loudness; Speaking rate; Speechreading; Text to audiovisual speech synthesis

Indexed keywords

SPEECH ANALYSIS; SPEECH COMMUNICATION; SPEECH INTELLIGIBILITY;

EID: 0032178686     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/S0167-6393(98)00045-4     Document Type: Article
Times cited : (50)

References (42)
  • 1
    • 0039820686 scopus 로고
    • Audibility and stability of articulatory movements: Deciphering two experiments on anticipatory rounding in French
    • Aix-en-Provence, France
    • Abry, C., Lallouache, M.T., 1991. Audibility and stability of articulatory movements: Deciphering two experiments on anticipatory rounding in French. In: Proceedings of the XIIth International Congress of Phonetic Sciences, Aix-en-Provence, France, Vol. 1, pp. 220-225.
    • (1991) Proceedings of the XIIth International Congress of Phonetic Sciences , vol.1 , pp. 220-225
    • Abry, C.1    Lallouache, M.T.2
  • 4
    • 0031198820 scopus 로고    scopus 로고
    • Learning to speak. Sensori-motor control of speech movements
    • Bailly, G., 1997. Learning to speak. Sensori-motor control of speech movements. Speech Communication 22, 251-267.
    • (1997) Speech Communication , vol.22 , pp. 251-267
    • Bailly, G.1
  • 6
    • 0040413185 scopus 로고
    • COMPOST: A server for multilingual text-to-speech system
    • Bailly, G., Alissali, M., 1992. COMPOST: a server for multilingual text-to-speech system. Traitement du Signal 9 (4), 359-366.
    • (1992) Traitement du Signal , vol.9 , Issue.4 , pp. 359-366
    • Bailly, G.1    Alissali, M.2
  • 7
    • 0016196060 scopus 로고
    • Coarticulation of upper lip protrusion in French
    • Benguerel, A.P., Cowan, H.A., 1974. Coarticulation of upper lip protrusion in French. Phonetica 30, 41-55.
    • (1974) Phonetica , vol.30 , pp. 41-55
    • Benguerel, A.P.1    Cowan, H.A.2
  • 8
    • 0002186602 scopus 로고
    • A set of French visemes for visual speech synthesis
    • Bailly, G., Benoît, C (Eds.), Elsevier, Amsterdam
    • Benoît, C., Lallouache, T., Mohamadi, T., Abry, C., 1992. A set of French visemes for visual speech synthesis. In: Bailly, G., Benoît, C (Eds.), Talking Machines: Theories, Models and Designs. Elsevier, Amsterdam, pp. 485-504.
    • (1992) Talking Machines: Theories, Models and Designs , pp. 485-504
    • Benoît, C.1    Lallouache, T.2    Mohamadi, T.3    Abry, C.4
  • 10
    • 4243879136 scopus 로고    scopus 로고
    • An investigation of hypo- and hyper-speech in the visual modality
    • Autrans, France
    • Benoît, C., Fuster-Duran, A., Le Goff, B., 1996a. An investigation of hypo- and hyper-speech in the visual modality. In: Proceedings of ETRW 96, Autrans, France, pp. 237-240.
    • (1996) Proceedings of ETRW , vol.96 , pp. 237-240
    • Benoît, C.1    Fuster-Duran, A.2    Le Goff, B.3
  • 11
    • 0001055701 scopus 로고    scopus 로고
    • Which components of the face do humans and machines best speechread?
    • Stork, D., Hennecke, M. (Eds.), NATO-ASI Series 150 Springer, Berlin, pp.
    • Benoît, C., Guiard-Marigny, T., Le Goff, B., Adjoudani, A., 1996b. Which components of the face do humans and machines best speechread?. In: Stork, D., Hennecke, M. (Eds.), Speechreading by Humans and Machines, NATO-ASI Series 150 Springer, Berlin, pp. 315-328.
    • (1996) Speechreading by Humans and Machines , pp. 315-328
    • Benoît, C.1    Guiard-Marigny, T.2    Le Goff, B.3    Adjoudani, A.4
  • 13
    • 84883424118 scopus 로고
    • Rule-based visual speech synthesis
    • Madrid, Spain
    • Beskow, J., 1995. Rule-based visual speech synthesis. In: Proceedings of Eurospeech'95, Madrid, Spain, Vol. 1, pp. 299-302.
    • (1995) Proceedings of Eurospeech'95 , vol.1 , pp. 299-302
    • Beskow, J.1
  • 14
    • 84926273209 scopus 로고
    • Analysis, synthesis and perception of visible articulatory movements
    • Brooke, N.M., Summerfield, A.Q., 1983. Analysis, synthesis and perception of visible articulatory movements. Journal of Phonetics 11, 63-76.
    • (1983) Journal of Phonetics , vol.11 , pp. 63-76
    • Brooke, N.M.1    Summerfield, A.Q.2
  • 15
    • 0009643164 scopus 로고
    • Pitch-synchronous wave-form processing technique for text-to-speech synthesis using diphones
    • ESCA, Paris, France
    • Charpentier, F., Moulines, E., 1989. Pitch-synchronous wave-form processing technique for text-to-speech synthesis using diphones. In: Proceedings of the First Eurospeech Conference, ESCA, Paris, France, Vol. 2, pp. 13-19.
    • (1989) Proceedings of the First Eurospeech Conference , vol.2 , pp. 13-19
    • Charpentier, F.1    Moulines, E.2
  • 17
    • 0001514782 scopus 로고
    • Modeling coarticulation in synthetic visual speech. Models and techniques
    • Thalmann, N.M., Thalmann, D. (Eds.), Springer, Tokyo
    • Cohen, M.M., Massaro, D.W., 1993. Modeling coarticulation in synthetic visual speech. Models and techniques. In: Thalmann, N.M., Thalmann, D. (Eds.), Computer Animation. Springer, Tokyo, pp. 139-156.
    • (1993) Computer Animation , pp. 139-156
    • Cohen, M.M.1    Massaro, D.W.2
  • 19
    • 0014529713 scopus 로고
    • Interaction of audition and vision in the recognition of oral speech stimuli
    • Erber, N.P., 1969. Interaction of audition and vision in the recognition of oral speech stimuli. Journal of Speech and Hearing Research 12, 423-425.
    • (1969) Journal of Speech and Hearing Research , vol.12 , pp. 423-425
    • Erber, N.P.1
  • 21
    • 0039820685 scopus 로고
    • Confusion among visually perceived consonants
    • Fisher, C.G., 1968. Confusion among visually perceived consonants. Journal of Speech and Hearing Research 15, 474-482.
    • (1968) Journal of Speech and Hearing Research , vol.15 , pp. 474-482
    • Fisher, C.G.1
  • 22
    • 0020823325 scopus 로고
    • Converging sources of evidence on spoken and perceived rhythms of speech: Cyclic production of vowels in monosyllabic stress feet
    • Fowler, C., 1983. Converging sources of evidence on spoken and perceived rhythms of speech: Cyclic production of vowels in monosyllabic stress feet. Journal of Experimental Psychology: Human Perception and Performance 112, 386-412.
    • (1983) Journal of Experimental Psychology: Human Perception and Performance , vol.112 , pp. 386-412
    • Fowler, C.1
  • 24
    • 0038473771 scopus 로고    scopus 로고
    • 3D models of the lips and jaw for visual speech synthesis
    • van Santen, J.P.H., Sproat, R., Olive, J., Hirshberg, J. (Eds.), Springer, New York
    • Guiard-Marigny, T., Adjoudani, A., Benoît, C., 1996. 3D models of the lips and jaw for visual speech synthesis. In: van Santen, J.P.H., Sproat, R., Olive, J., Hirshberg, J. (Eds.), Progress in Speech Synthesis. Springer, New York, pp. 247-258.
    • (1996) Progress in Speech Synthesis , pp. 247-258
    • Guiard-Marigny, T.1    Adjoudani, A.2    Benoît, C.3
  • 25
    • 0003009750 scopus 로고
    • Acoustic phonetics
    • Joos, M., 1948. Acoustic phonetics. Language 24, 1-136.
    • (1948) Language , vol.24 , pp. 1-136
    • Joos, M.1
  • 27
    • 85133504159 scopus 로고    scopus 로고
    • Automatic modeling of coarticulation in text-to-visual speech synthesis
    • ESCA, Rhodes, Greece
    • Le Goff, B., 1997a. Automatic modeling of coarticulation in text-to-visual speech synthesis. In: Proceedings of the 5th Eurospeech Conference, ESCA, Rhodes, Greece, Vol. 3, pp. 1667-1670.
    • (1997) Proceedings of the 5th Eurospeech Conference , vol.3 , pp. 1667-1670
    • Le Goff, B.1
  • 31
    • 0003762887 scopus 로고    scopus 로고
    • Analysis-synthesis and intelligibility of a talking face
    • Van Santen, J.P.H., Sproat, R.W., Olive, J.P., J. Hirschberg (Eds.), Springer. New York
    • Le Goff, B., Guiard-Marigny, T., Benoît, C., 1996. Analysis-synthesis and intelligibility of a talking face. In: Van Santen, J.P.H., Sproat, R.W., Olive, J.P., J. Hirschberg (Eds.), Progress in Speech Synthesis. Springer. New York, pp. 235-246.
    • (1996) Progress in Speech Synthesis , pp. 235-246
    • Le Goff, B.1    Guiard-Marigny, T.2    Benoît, C.3
  • 32
    • 0003116759 scopus 로고
    • Speech as audible gestures
    • Hardcastle, W.J., Marchal, A. (Eds.), Kluwer Academic Publishers, Dordrecht
    • Löfquist, A., 1990. Speech as audible gestures. In: Hardcastle, W.J., Marchal, A. (Eds.), Speech Production and Speech Modeling. Kluwer Academic Publishers, Dordrecht, pp. 289-322.
    • (1990) Speech Production and Speech Modeling , pp. 289-322
    • Löfquist, A.1
  • 34
    • 0017199877 scopus 로고
    • Hearing lips and seeing voices
    • McGurk, H., MacDonald, J., 1976. Hearing lips and seeing voices. Nature 264, 746-748.
    • (1976) Nature , vol.264 , pp. 746-748
    • McGurk, H.1    MacDonald, J.2
  • 36
    • 0003584841 scopus 로고
    • Ph.D Dissertation, University of Utah, Department of Computer Sciences
    • Parke, F.I., 1974. A parametric model for human faces. Ph.D Dissertation, University of Utah, Department of Computer Sciences.
    • (1974) A Parametric Model for Human Faces
    • Parke, F.I.1
  • 37
    • 0040413181 scopus 로고
    • Creation of a synthetic face speaking in real time with a synthetic voice
    • Benoît, C., Bailly, G. (Eds.), Autrans, France
    • Saintourens, M., Tramus, M.H., Huitric, H., Nahas, M., 1990. Creation of a synthetic face speaking in real time with a synthetic voice. In: Benoît, C., Bailly, G. (Eds.), Proceedings of the 1st ESCA Workshop on Speech Synthesis, Autrans, France, pp. 249-252.
    • (1990) Proceedings of the 1st ESCA Workshop on Speech Synthesis , pp. 249-252
    • Saintourens, M.1    Tramus, M.H.2    Huitric, H.3    Nahas, M.4
  • 38
    • 77956779481 scopus 로고
    • A dynamical approach to gestural patterning in speech production
    • Saltzman, E., Munhall, K., 1989. A dynamical approach to gestural patterning in speech production. Ecological Psychology 1, 333-382.
    • (1989) Ecological Psychology , vol.1 , pp. 333-382
    • Saltzman, E.1    Munhall, K.2
  • 40
    • 0002955163 scopus 로고
    • Lips, teeth, and the benefits of lipreading
    • Young, A.W., Ellis, H.D. (Eds.), Elsevier, Amsterdam
    • Summerfield, Q., MacLeod, A., McGrath, M., Brooke, M., 1989. Lips, teeth, and the benefits of lipreading. In: Young, A.W., Ellis, H.D. (Eds.), Handbook of Research on Face Processing. Elsevier, Amsterdam, pp. 223-233.
    • (1989) Handbook of Research on Face Processing , pp. 223-233
    • Summerfield, Q.1    MacLeod, A.2    McGrath, M.3    Brooke, M.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.