-
1
-
-
0001785334
-
Towards an audiovisual virtual talking head: 3D articulatory modeling of tongue, lips and face based on MRI and video images
-
Germany: Kloster Seeon
-
Badin, P., Borel, P., Bailly, G., Revéret, L., Baciu, M., and Segebarth, C. (2000). Towards an audiovisual virtual talking head: 3D articulatory modeling of tongue, lips and face based on MRI and video images. Proceedings of the 5th Speech Production Seminar, Germany: Kloster Seeon, pp. 261-264.
-
(2000)
Proceedings of the 5th Speech Production Seminar
, pp. 261-264
-
-
Badin, P.1
Borel, P.2
Bailly, G.3
Revéret, L.4
Baciu, M.5
Segebarth, C.6
-
2
-
-
0031198820
-
Learning to speak. Sensori-motor control of speech movements
-
Bailly, G. (1998). Learning to speak. Sensori-motor control of speech movements. Speech Communication, 22(2/3):251-267.
-
(1998)
Speech Communication
, vol.22
, Issue.2-3
, pp. 251-267
-
-
Bailly, G.1
-
3
-
-
84966335540
-
Evaluation of movement generation systems using the point-light technique
-
Santa Monica, CA
-
Bailly, G., Gibert, G., and Odisio, M. (2002). Evaluation of movement generation systems using the point-light technique. IEEE Workshop on Speech Synthesis, Santa Monica, CA.
-
(2002)
IEEE Workshop on Speech Synthesis
-
-
Bailly, G.1
Gibert, G.2
Odisio, M.3
-
4
-
-
0002186602
-
A set of French visemes for visual speech synthesis
-
G. Bailly and C. Benoît (Eds.). Elsevier B. V.
-
Benoît, C., Lallouache, T., Mohamadi, T., and Abry, C. (1992). A set of French visemes for visual speech synthesis. In G. Bailly and C. Benoît (Eds.), Talking Machines: Theories, Models and Designs. Elsevier B. V., pp. 485-501.
-
(1992)
Talking Machines: Theories, Models and Designs
, pp. 485-501
-
-
Benoît, C.1
Lallouache, T.2
Mohamadi, T.3
Abry, C.4
-
5
-
-
0013132012
-
Controlling facial expression and body movements in the computer-generated short "Tony de Peltrie"
-
San Francisco, CA
-
Bergeron, P. and Lachapelle, P. (1985). Controlling facial expression and body movements in the computer-generated short "Tony de Peltrie". SIGGRAPH, Advanced Computer Animation Seminar Notes, San Francisco, CA.
-
(1985)
SIGGRAPH, Advanced Computer Animation Seminar Notes
-
-
Bergeron, P.1
Lachapelle, P.2
-
7
-
-
47949123595
-
-
Rhodos, Greece: Eurospeech, 2003-2010
-
Beskow, J., Dahlquist, M., Granström, B., Lundeberg, M., Spens, K.-E., and Öhman, T. (1997). The Teleface project - multimodal speech communication for the hearing impaired. Rhodos, Greece: Eurospeech, 2003-2010.
-
(1997)
The Teleface Project - Multimodal Speech Communication for the Hearing Impaired
-
-
Beskow, J.1
Dahlquist, M.2
Granström, B.3
Lundeberg, M.4
Spens, K.-E.5
Öhman, T.6
-
8
-
-
84937437186
-
Voice pupperty
-
Los Angeles, CA
-
Brand, M. (1999). Voice pupperty. SIGGRAPH'99, Los Angeles, CA, pp. 21-28.
-
(1999)
SIGGRAPH'99
, pp. 21-28
-
-
Brand, M.1
-
9
-
-
0030677313
-
VideoRewrite: Driving visual speech with audio
-
Los Angeles, CA
-
Bregler, C., Covell, M., and Slaney, M. (1997a). VideoRewrite: Driving visual speech with audio. SIGGRAPH'97, Los Angeles, CA, pp. 353-360.
-
(1997)
SIGGRAPH'97
, pp. 353-360
-
-
Bregler, C.1
Covell, M.2
Slaney, M.3
-
10
-
-
84925678202
-
Video rewrite: Visual speech synthesis from video
-
Rhodes, Greece
-
Bregler, C., Covell, M., and Slaney, M. (1997b). Video rewrite: Visual speech synthesis from video. International Conference on Auditory-Visual Speech Processing, Rhodes, Greece, pp. 153-156.
-
(1997)
International Conference on Auditory-Visual Speech Processing
, pp. 153-156
-
-
Bregler, C.1
Covell, M.2
Slaney, M.3
-
12
-
-
84955535347
-
Gestural specification using dynamically-defined articulatory structures
-
Browman, C.P. and Goldstein, L.M. (1990). Gestural specification using dynamically-defined articulatory structures. Journal of Phonetics, 18(3):299-320.
-
(1990)
Journal of Phonetics
, vol.18
, Issue.3
, pp. 299-320
-
-
Browman, C.P.1
Goldstein, L.M.2
-
14
-
-
0001514782
-
Modeling coarticulation in synthetic visual speech
-
D. Thalmann and N. Magnenat-Thalmann (Eds.). Springer-Verlag: Tokyo
-
Cohen, M.M. and Massaro, D.W. (1993). Modeling coarticulation in synthetic visual speech. In D. Thalmann and N. Magnenat-Thalmann (Eds.), Models and Techniques in Computer Animation. Springer-Verlag: Tokyo, pp. 141-155.
-
(1993)
Models and Techniques in Computer Animation
, pp. 141-155
-
-
Cohen, M.M.1
Massaro, D.W.2
-
15
-
-
0035363218
-
Active appearance models
-
Cootes, T.F., Edwards, G.J., and Taylor, C.J. (2001). Active appearance models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 23(6):681-685.
-
(2001)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.23
, Issue.6
, pp. 681-685
-
-
Cootes, T.F.1
Edwards, G.J.2
Taylor, C.J.3
-
16
-
-
0142241582
-
Sample-based synthesis of photo-realistic talking-heads
-
Los Angeles, CA
-
Cosatto, E. and Graf, H.P. (1997). Sample-based synthesis of photo-realistic talking-heads. SIGGRAPH'97, Los Angeles, CA, pp. 353-360.
-
(1997)
SIGGRAPH'97
, pp. 353-360
-
-
Cosatto, E.1
Graf, H.P.2
-
17
-
-
84872004031
-
Sample-based synthesis of photo-realistic talking heads
-
Philadelphia, Pennsylvania
-
Cosatto, E. and Graf, H.P. (1998). Sample-based synthesis of photo-realistic talking heads. Computer Animation, Philadelphia, Pennsylvania, pp. 103-110.
-
(1998)
Computer Animation
, pp. 103-110
-
-
Cosatto, E.1
Graf, H.P.2
-
18
-
-
0034070906
-
The Mesh-Matching algorithm: An automatic 3D mesh generator for finite element structures
-
Couteau, B., Payan, Y., and Lavallée, S. (2000). The Mesh-Matching algorithm: An automatic 3D mesh generator for finite element structures. Journal of Biomechanics, 35(8):1005-1009.
-
(2000)
Journal of Biomechanics
, vol.35
, Issue.8
, pp. 1005-1009
-
-
Couteau, B.1
Payan, Y.2
Lavallée, S.3
-
19
-
-
0031140089
-
MPEG-4: Audio/video and synthetic graphics/audio for real-time, interactive media delivery
-
Doenges, P., Capin, T.K., Lavagetto, F., Ostermann, J., Pandzic, I., and Petajan, E. (1997). MPEG-4: audio/video and synthetic graphics/audio for real-time, interactive media delivery. Image Communications Journal, 9(4):433-463.
-
(1997)
Image Communications Journal
, vol.9
, Issue.4
, pp. 433-463
-
-
Doenges, P.1
Capin, T.K.2
Lavagetto, F.3
Ostermann, J.4
Pandzic, I.5
Petajan, E.6
-
21
-
-
0004167520
-
-
Palo Alto, California: Consulting Psychologists Press
-
Ekman, P. and Friesen, W.V. (1975). Unmasking the Face. Palo Alto, California: Consulting Psychologists Press.
-
(1975)
Unmasking the Face
-
-
Ekman, P.1
Friesen, W.V.2
-
23
-
-
0010466389
-
Creating and controlling video-realistic talking heads
-
Scheelsminde, Denmark
-
Elisei, F., Odisio, M., Bailly, G., and Badin, P. (2001). Creating and controlling video-realistic talking heads. Auditory-Visual Speech Processing Workshop, Scheelsminde, Denmark, pp. 90-97.
-
(2001)
Auditory-Visual Speech Processing Workshop
, pp. 90-97
-
-
Elisei, F.1
Odisio, M.2
Bailly, G.3
Badin, P.4
-
25
-
-
0036989560
-
Trainable videorealistic speech animation
-
Ezzat, T., Geiger, G., and Poggio, T. (2002). Trainable videorealistic speech animation. ACM Transactions on Graphics, 21(3):388-398.
-
(2002)
ACM Transactions on Graphics
, vol.21
, Issue.3
, pp. 388-398
-
-
Ezzat, T.1
Geiger, G.2
Poggio, T.3
-
26
-
-
85031438802
-
Visual speech synthesis with concatenative speech
-
Terrigal-Sydney, Australia
-
Hällgren, Å. and Lyberg, B. (1998). Visual speech synthesis with concatenative speech. Auditory-Visual Speech Processing Conference, Terrigal-Sydney, Australia, pp. 181-183.
-
(1998)
Auditory-Visual Speech Processing Conference
, pp. 181-183
-
-
Hällgren, Å.1
Lyberg, B.2
-
27
-
-
0002258223
-
The PARAFAC model for three-way factor analysis and multidimensional scaling
-
H.G. Law, C.W. Snyder, J.A. Hattie, and R.P. MacDonald (Eds.). New-York: Praeger
-
Harshman, R.A. and Lundy, M.E. (1984). The PARAFAC model for three-way factor analysis and multidimensional scaling. In H.G. Law, C.W. Snyder, J.A. Hattie, and R.P. MacDonald (Eds.), Research Methods for Multimode Data Analysis. New-York: Praeger, pp. 122-215.
-
(1984)
Research Methods for Multimode Data Analysis
, pp. 122-215
-
-
Harshman, R.A.1
Lundy, M.E.2
-
28
-
-
0142241581
-
Facial image reconstruction by estimated muscle parameter
-
Nara, Japan
-
Ishikawa, T., Sera, H., Morishima, S., and Terzopoulos, D. (1998). Facial image reconstruction by estimated muscle parameter. International Conference on Automatic Face and Gesture Recognition, Nara, Japan, pp. 342-347.
-
(1998)
International Conference on Automatic Face and Gesture Recognition
, pp. 342-347
-
-
Ishikawa, T.1
Sera, H.2
Morishima, S.3
Terzopoulos, D.4
-
29
-
-
0027607090
-
3D motion estimation in model-based facial image coding
-
Li, H., Roivanen, P., and Forchheimer, R. (1993). 3D motion estimation in model-based facial image coding. IEEE Transactions on PAMI, 15(6):545-555.
-
(1993)
IEEE Transactions on PAMI
, vol.15
, Issue.6
, pp. 545-555
-
-
Li, H.1
Roivanen, P.2
Forchheimer, R.3
-
30
-
-
0142179494
-
Illusions and issues in bimodal speech perception
-
Terrigal, Sydney, Australia
-
Massaro, D. (1998a). Illusions and issues in bimodal speech perception. Auditory-Visual Speech Processing Conference, Terrigal, Sydney, Australia, pp. 21-26.
-
(1998)
Auditory-Visual Speech Processing Conference
, pp. 21-26
-
-
Massaro, D.1
-
32
-
-
0036472941
-
Extraction of visual features for lipreading
-
Matthews, I., Cootes, T.F., and Bangham, J.A. (2002). Extraction of visual features for lipreading. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(2):198-213.
-
(2002)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.24
, Issue.2
, pp. 198-213
-
-
Matthews, I.1
Cootes, T.F.2
Bangham, J.A.3
-
33
-
-
0017199877
-
Hearing lips and seeing voices
-
McGurk, H. and MacDonald, J. (1976). Hearing lips and seeing voices. Nature, 26:746-748.
-
(1976)
Nature
, vol.26
, pp. 746-748
-
-
McGurk, H.1
Macdonald, J.2
-
34
-
-
85009080445
-
Modeling visual coarticulation in synthetic talking heads using a lip motion unit inventory with concatenative synthesis
-
Beijing, China
-
Minnis, S. and Breen, A.P. (1998). Modeling visual coarticulation in synthetic talking heads using a lip motion unit inventory with concatenative synthesis. ICSLP, Beijing, China, pp. 759-762.
-
(1998)
ICSLP
, pp. 759-762
-
-
Minnis, S.1
Breen, A.P.2
-
35
-
-
0142179495
-
3D talking clones for virtual teleconferencing
-
to appear
-
Odisio, M., Elisei, F., Bailly, G., and Badin, P. (to appear). 3D talking clones for virtual teleconferencing. Annals of Telecommunications.
-
Annals of Telecommunications
-
-
Odisio, M.1
Elisei, F.2
Bailly, G.3
Badin, P.4
-
36
-
-
0034224125
-
ProSynth: An integrated prosodic approach to device-independent, natural-sounding speech synthesis
-
Ogden, R., Hawkins, S., House, J., Huckvale, M., Local, J., Carter, P., Dankovicová, J. and Heid, S. (2000). ProSynth: An integrated prosodic approach to device-independent, natural-sounding speech synthesis. Computer Speech and Language, 14(3):177-210.
-
(2000)
Computer Speech and Language
, vol.14
, Issue.3
, pp. 177-210
-
-
Ogden, R.1
Hawkins, S.2
House, J.3
Huckvale, M.4
Local, J.5
Carter, P.6
Dankovicová, J.7
Heid, S.8
-
38
-
-
0033315043
-
Articulatory movement formation by kinematic triphone model
-
Tokyo, Japan
-
Okadome, T., Kaburagi, T., and Honda, M. (1999). Articulatory movement formation by kinematic triphone model. IEEE International Conference on Systems Man and Cybernetics, Tokyo, Japan, pp. 469-474.
-
(1999)
IEEE International Conference on Systems Man and Cybernetics
, pp. 469-474
-
-
Okadome, T.1
Kaburagi, T.2
Honda, M.3
-
39
-
-
0142179491
-
Audio-visual speech synthesis for finnish
-
Santa Cruz, CA
-
Olives, J.-L., Möttönen, R., Kulju, J., and Sams, M. (1999). Audio-visual speech synthesis for finnish. Auditory-Visual Speech Processing Workshop, Santa Cruz, CA, pp. 157-162.
-
(1999)
Auditory-Visual Speech Processing Workshop
, pp. 157-162
-
-
Olives, J.-L.1
Möttönen, R.2
Kulju, J.3
Sams, M.4
-
40
-
-
0033336969
-
Users evaluation: Synthetic talking faces for interactive services
-
Pandzic, I., Ostermann, J., and Millen, D. (1999). Users evaluation: Synthetic talking faces for interactive services. The Visual Computer, 15:330-340.
-
(1999)
The Visual Computer
, vol.15
, pp. 330-340
-
-
Pandzic, I.1
Ostermann, J.2
Millen, D.3
-
41
-
-
85018094829
-
Computer generated animation of faces
-
Salt Lake City
-
Parke, F.I. (1972). Computer generated animation of faces. ACM National Conference, Salt Lake City, pp. 451-457.
-
(1972)
ACM National Conference
, pp. 451-457
-
-
Parke, F.I.1
-
42
-
-
50849153856
-
A model for human faces that allows speech synchronized animation
-
Parke, F.I. (1975). A model for human faces that allows speech synchronized animation. Journal of Computers and Graphics, 1(1): 1-4.
-
(1975)
Journal of Computers and Graphics
, vol.1
, Issue.1
, pp. 1-4
-
-
Parke, F.I.1
-
43
-
-
0020202671
-
A parametrized model for facial animation
-
Parke, F.I. (1982). A parametrized model for facial animation. IEEE Computer Graphics and Applications, 2(9):61-70.
-
(1982)
IEEE Computer Graphics and Applications
, vol.2
, Issue.9
, pp. 61-70
-
-
Parke, F.I.1
-
44
-
-
0003608342
-
-
Wellesley, MA, USA, A.K. Peters
-
Parke, F.I. and Waters, K. (1996). Computer Facial Animation. Wellesley, MA, USA, A.K. Peters.
-
(1996)
Computer Facial Animation
-
-
Parke, F.I.1
Waters, K.2
-
45
-
-
0030123961
-
The equilibrium point hypothesis and its application to speech motor control
-
Perrier, P., Ostry, D.J., and Laboissière, R. (1996). The equilibrium point hypothesis and its application to speech motor control. Journal of Speech and Hearing Research, 39:365-377.
-
(1996)
Journal of Speech and Hearing Research
, vol.39
, pp. 365-377
-
-
Perrier, P.1
Ostry, D.J.2
Laboissière, R.3
-
46
-
-
0031631507
-
Synthesizing realistic facial expressions from photographs
-
Orlando, FL, USA
-
Pighin, F., Hecker, J., Lischinski, D., Szeliski, R., and Salesin, D.H. (1998). Synthesizing realistic facial expressions from photographs. Proceedings of Siggraph, Orlando, FL, USA, pp. 75-84.
-
(1998)
Proceedings of Siggraph
, pp. 75-84
-
-
Pighin, F.1
Hecker, J.2
Lischinski, D.3
Szeliski, R.4
Salesin, D.H.5
-
47
-
-
0012433827
-
Perception of synthetic speech
-
J.P.H.V. Santen, R.W. Sproat, J.P. Olive, and J. Hirschberg (Eds.). Springer Verlag: New York
-
Pisoni, D.B. (1997). Perception of synthetic speech. In J.P.H.V. Santen, R.W. Sproat, J.P. Olive, and J. Hirschberg (Eds.), Progress in Speech Synthesis. Springer Verlag: New York. pp. 541-560.
-
(1997)
Progress in Speech Synthesis
, pp. 541-560
-
-
Pisoni, D.B.1
-
48
-
-
0019603077
-
Animating facial expressions
-
Platt, S.M. and Badler, N.I. (1981). Animating facial expressions. Computer Graphics, 15(3):245-252.
-
(1981)
Computer Graphics
, vol.15
, Issue.3
, pp. 245-252
-
-
Platt, S.M.1
Badler, N.I.2
-
49
-
-
0002103452
-
MPEG-4 facial animation: An implementation
-
Santorini, Greece
-
Pockaj, R., Costa, M., Lavagetto, F., and Braccini, C. (1999). MPEG-4 facial animation: An implementation. International Workshop on Synthetic-Natural Hybrid Coding and Three Dimensional Imaging, Santorini, Greece, pp. 33-36.
-
(1999)
International Workshop on Synthetic-Natural Hybrid Coding and Three Dimensional Imaging
, pp. 33-36
-
-
Pockaj, R.1
Costa, M.2
Lavagetto, F.3
Braccini, C.4
-
50
-
-
84870292720
-
MOTHER: A new generation of talking heads providing a flexible articulatory control for video-realistic speech animation
-
Beijing, China
-
Revéret, L., Bailly, G., and Badin, P. (2000). MOTHER: A new generation of talking heads providing a flexible articulatory control for video-realistic speech animation. International Conference on Speech and Language Processing, Beijing, China, pp. 755-758.
-
(2000)
International Conference on Speech and Language Processing
, pp. 755-758
-
-
Revéret, L.1
Bailly, G.2
Badin, P.3
-
51
-
-
0004222842
-
-
Sweden, Dept. of Electrical Engineering, Linköping University: LiTH-ISY-I-866
-
Rydfalk, M. (1987). CANDIDE, a parameterized face. Sweden, Dept. of Electrical Engineering, Linköping University: LiTH-ISY-I-866.
-
(1987)
CANDIDE, a Parameterized Face
-
-
Rydfalk, M.1
-
52
-
-
0030409654
-
View morphing
-
New Orleans, Louisiana
-
Seitz, S.M. and Dyer, C.R. (1996). View morphing. ACM SIGGRAPH, New Orleans, Louisiana, pp. 21-30.
-
(1996)
ACM SIGGRAPH
, pp. 21-30
-
-
Seitz, S.M.1
Dyer, C.R.2
-
53
-
-
0026348904
-
Different phase-stable relationships of the upper lip and jaw for production of vowels and diphthongs
-
Shaiman, S. and Porter, R.J. (1991). Different phase-stable relationships of the upper lip and jaw for production of vowels and diphthongs. Journal of the Acoustical Society of America, 90:3000-3007.
-
(1991)
Journal of the Acoustical Society of America
, vol.90
, pp. 3000-3007
-
-
Shaiman, S.1
Porter, R.J.2
-
54
-
-
0003058857
-
On the basic scheme and algorithms in non-uniform unit speech synthesis
-
G. Bailly and C. Benoît (Eds.). Elsevier B.V.
-
Takeda, K., Abe, K., and Sagisaka, Y. (1992). On the basic scheme and algorithms in non-uniform unit speech synthesis. In G. Bailly and C. Benoît (Eds.), Talking Machines: Theories, Models and Designs. Elsevier B.V., pp. 93-105.
-
(1992)
Talking Machines: Theories, Models and Designs
, pp. 93-105
-
-
Takeda, K.1
Abe, K.2
Sagisaka, Y.3
-
55
-
-
84919370414
-
Text-to-audio-visual speech synthesis based on parameter generation from HMM
-
Budapest, Hungary
-
Tamura, M., Kondo, S., Masuko, T., and Kobayashi, T. (1999). Text-to-audio-visual speech synthesis based on parameter generation from HMM. European Conference on Speech Communication and Technology, Budapest, Hungary, pp. 959-962.
-
(1999)
European Conference on Speech Communication and Technology
, pp. 959-962
-
-
Tamura, M.1
Kondo, S.2
Masuko, T.3
Kobayashi, T.4
-
58
-
-
0142210581
-
Visual speech synthesis using statistical models of shape and appearance
-
Scheelsminde, Denmark
-
Theobald, B.J., Bangham, J.A., Matthews, I., and Cawley, G.C. (2001). Visual speech synthesis using statistical models of shape and appearance. Auditory-Visual Speech Processing Workshop, Scheelsminde, Denmark, pp. 78-83.
-
(2001)
Auditory-Visual Speech Processing Workshop
, pp. 78-83
-
-
Theobald, B.J.1
Bangham, J.A.2
Matthews, I.3
Cawley, G.C.4
-
59
-
-
0031380451
-
Model-based synthetic view generation from a monocular video sequence
-
Santa Barbara, California
-
Tsai, C.-J., Eisert, P., Girod, B., and Katsaggelos, A.K. (1997). Model-based synthetic view generation from a monocular video sequence. Proceedings of the International Conference on Image Processing, Santa Barbara, California, pp. 444-447.
-
(1997)
Proceedings of the International Conference on Image Processing
, pp. 444-447
-
-
Tsai, C.-J.1
Eisert, P.2
Girod, B.3
Katsaggelos, A.K.4
-
61
-
-
0142210579
-
A text-speech synchronization technique with applications to talking heads
-
Santa Cruz, California, USA
-
Vignoli, F. and Braccini, C. (1999). A text-speech synchronization technique with applications to talking heads. Auditory-Visual Speech Processing Conference, Santa Cruz, California, USA, pp. 128-132.
-
(1999)
Auditory-Visual Speech Processing Conference
, pp. 128-132
-
-
Vignoli, F.1
Braccini, C.2
-
62
-
-
0023379314
-
A muscle model for animating three-dimensional facial expression
-
Waters, K. (1987). A muscle model for animating three-dimensional facial expression. Computer Graphics, 21(4):17-24.
-
(1987)
Computer Graphics
, vol.21
, Issue.4
, pp. 17-24
-
-
Waters, K.1
-
64
-
-
0032179320
-
Lip movement synthesis from speech based on Hidden Markov Models
-
Yamamoto, E., Nakamura, S., and Shikano, K. (1998). Lip movement synthesis from speech based on Hidden Markov Models. Speech Communication, 26(1-2):105-115.
-
(1998)
Speech Communication
, vol.26
, Issue.1-2
, pp. 105-115
-
-
Yamamoto, E.1
Nakamura, S.2
Shikano, K.3
|