-
1
-
-
85032752352
-
Audiovisual Speech Processing
-
T. Chen, "Audiovisual Speech Processing," IEEE Signal Processing Magazine, vol. 18, pp. 9-21, 2001.
-
(2001)
IEEE Signal Processing Magazine
, vol.18
, pp. 9-21
-
-
Chen, T.1
-
2
-
-
0024900468
-
An intelligent Facial Image Coding Driven by Speech and Phoneme
-
89, pp
-
S. Morishima, K. Aizawa, and H. Harashima, "An intelligent Facial Image Coding Driven by Speech and Phoneme," Proc. Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '89), pp. 1795-1798, 1989.
-
(1989)
Proc. Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP
, pp. 1795-1798
-
-
Morishima, S.1
Aizawa, K.2
Harashima, H.3
-
3
-
-
0030677313
-
Video Rewrite: Driving Visual Speech with Audio
-
97, pp
-
C. Bregier, M. Covell, and M. Slaney, "Video Rewrite: Driving Visual Speech with Audio," Proc. ACM SIGGRAPH '97, pp. 353-360, 1997.
-
(1997)
Proc. ACM SIGGRAPH
, pp. 353-360
-
-
Bregier, C.1
Covell, M.2
Slaney, M.3
-
5
-
-
0032179320
-
Lip Movement Synthesis from Speech Based on Hidden Markov Models
-
E. Yamamoto, S. Nakamura, and K. ShiKano, "Lip Movement Synthesis from Speech Based on Hidden Markov Models," Speech Comm., pp. 105-115, 1998.
-
(1998)
Speech Comm
, pp. 105-115
-
-
Yamamoto, E.1
Nakamura, S.2
ShiKano, K.3
-
7
-
-
2542499812
-
Speech-to-Video Synthesis Using Facial Animation Parameters
-
P.S. Aleksic and A.K. Katsaggelos, "Speech-to-Video Synthesis Using Facial Animation Parameters," IEEE Trans. Circuits and Systems for Video Technology, vol. 14, no. 5, pp. 682-692, 2004.
-
(2004)
IEEE Trans. Circuits and Systems for Video Technology
, vol.14
, Issue.5
, pp. 682-692
-
-
Aleksic, P.S.1
Katsaggelos, A.K.2
-
8
-
-
33646752807
-
Learning Dynamic Audio-Visual Mapping with Inputoutput Hidden Markov Models
-
Y. Li and H.-Y. Shum, "Learning Dynamic Audio-Visual Mapping with Inputoutput Hidden Markov Models," IEEE Trans. Multimedia, vol. 8, no. 3, pp. 542-549, 2006.
-
(2006)
IEEE Trans. Multimedia
, vol.8
, Issue.3
, pp. 542-549
-
-
Li, Y.1
Shum, H.-Y.2
-
9
-
-
34247623168
-
Acoustically -Driven Talking Face Synthesis Using Dynamic Bayesian Networks
-
J. Xue, J. Borgstrom, J. Jiang, L. Bernstein, and A. Alwan, "Acoustically -Driven Talking Face Synthesis Using Dynamic Bayesian Networks," Proc. Int'l Conf. Multimedia and Expo (ICME '06), pp. 1165-1168, 2006.
-
(2006)
Proc. Int'l Conf. Multimedia and Expo (ICME '06)
, pp. 1165-1168
-
-
Xue, J.1
Borgstrom, J.2
Jiang, J.3
Bernstein, L.4
Alwan, A.5
-
10
-
-
84960898014
-
Multimodal Signal Analysis of Prosody and Hand Motion: Temporal Correlation of Speech and Gestures
-
L. Valbonesi, R. Ansari, D. McNeill, F. Quek, S. Duncan, K.E. McCullough, qnd R. Bryll, "Multimodal Signal Analysis of Prosody and Hand Motion: Temporal Correlation of Speech and Gestures," Proc. European Signal Processing Cont. (EUSIPCO '02), vol. 1, pp. 75-78, 2002.
-
(2002)
Proc. European Signal Processing Cont. (EUSIPCO '02)
, vol.1
, pp. 75-78
-
-
Valbonesi, L.1
Ansari, R.2
McNeill, D.3
Quek, F.4
Duncan, S.5
McCullough, K.E.6
qnd, R.7
Bryll8
-
11
-
-
1642405348
-
Visual Prosody and Speech Intelligibility: Head Movement Improves Auditory Speech Perception
-
K. Munhall, J.A. Jones, D.E. Callan, T. Kuratate, and E. Vatikiotis-Bateson, "Visual Prosody and Speech Intelligibility: Head Movement Improves Auditory Speech Perception," Psychological Science vol. 15, no. 2, pp. 133-137, 2004.
-
(2004)
Psychological Science
, vol.15
, Issue.2
, pp. 133-137
-
-
Munhall, K.1
Jones, J.A.2
Callan, D.E.3
Kuratate, T.4
Vatikiotis-Bateson, E.5
-
12
-
-
85037085294
-
Gesture Cues for Conversational Interaction in Monocular Video
-
F. Quek, D. McNeill, R. Ansari, X. Ma, R. Bryll, S. Duncan, and K. McCullough, "Gesture Cues for Conversational Interaction in Monocular Video," Proc. Int'l Workshop Recognition, Analysis, and Tracking of Faces and Gestures in Real-Time Systems, pp. 64-69, 1999.
-
(1999)
Proc. Int'l Workshop Recognition, Analysis, and Tracking of Faces and Gestures in Real-Time Systems
, pp. 64-69
-
-
Quek, F.1
McNeill, D.2
Ansari, R.3
Ma, X.4
Bryll, R.5
Duncan, S.6
McCullough, K.7
-
13
-
-
85034718268
-
Audio-Visual Syenthesis of Talking Faces from Speech Production Correllates
-
99, pp
-
T. Kuratate, K.G. Munhall, P.E. Rubin, E. Vatikiotis-Bateson, and H. Yehia, "Audio-Visual Syenthesis of Talking Faces from Speech Production Correllates," Proc. European Conf. Speech Comm. and Technology (EURUSPEECH '99), pp. 1279-1282, 1999.
-
(1999)
Proc. European Conf. Speech Comm. and Technology (EURUSPEECH
, pp. 1279-1282
-
-
Kuratate, T.1
Munhall, K.G.2
Rubin, P.E.3
Vatikiotis-Bateson, E.4
Yehia, H.5
-
14
-
-
78650465043
-
Visual Prosody: Facial Movements Accompanying Speech
-
H.P. Graf, E. Cosatto, V. Strom, and F.J. Huang, "Visual Prosody: Facial Movements Accompanying Speech," Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition, pp. 381-386, 2002.
-
(2002)
Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition
, pp. 381-386
-
-
Graf, H.P.1
Cosatto, E.2
Strom, V.3
Huang, F.J.4
-
15
-
-
33645764471
-
Mood Swings: Expressive Speech Animation
-
E. Chuang and C. Bregler, "Mood Swings: Expressive Speech Animation," ACM Trans. Graphics, vol. 24, no. 2, pp. 331-347, 2005
-
(2005)
ACM Trans. Graphics
, vol.24
, Issue.2
, pp. 331-347
-
-
Chuang, E.1
Bregler, C.2
-
16
-
-
14944376450
-
Audio-Based Head Motion Synthesis for Avatar-Based Telepresence Systems
-
Z. Deng, C. Busso, S. Narayanan, and U. Neumann, "Audio-Based Head Motion Synthesis for Avatar-Based Telepresence Systems," Proc. ACM SIGMM Workshop Effective Telepresence (ETP '04), pp. 244-30, 2004.
-
(2004)
Proc. ACM SIGMM Workshop Effective Telepresence (ETP '04)
, pp. 244-330
-
-
Deng, Z.1
Busso, C.2
Narayanan, S.3
Neumann, U.4
-
17
-
-
34547499478
-
Gesture-Speech Correlation Analysis and Speech Driven Gesture Synthesis
-
M.E. Sargin, F. Ofli, Y. Yasinnik, O. Aran, A. Karpov, S. Wilson, E. Erzin, Y. Yemez, qnd A.M. Tekalp, "Gesture-Speech Correlation Analysis and Speech Driven Gesture Synthesis," Proc. Int'l Conf. Multimedia and Expo (ICME '06), 2006.
-
(2006)
Proc. Int'l Conf. Multimedia and Expo (ICME '06)
-
-
Sargin, M.E.1
Ofli, F.2
Yasinnik, Y.3
Aran, O.4
Karpov, A.5
Wilson, S.6
Erzin, E.7
Yemez, Y.8
qnd, A.M.9
Tekalp10
-
20
-
-
17744406666
-
An Extended Set of Haar-Like Features for Rapid Object Detection
-
R. Lienhart and J. Maydt, "An Extended Set of Haar-Like Features for Rapid Object Detection," Proc. Int'l Conf. Image Processing (ICIP '02), vol. 1, pp. 900-903, 2002.
-
(2002)
Proc. Int'l Conf. Image Processing (ICIP '02)
, vol.1
, pp. 900-903
-
-
Lienhart, R.1
Maydt, J.2
-
22
-
-
0041972413
-
Advances in Computational Stereo
-
Aug
-
M. Brown, D. Burschka, and G. Hager, "Advances in Computational Stereo," IEEE Trans. Pattern Anallysis and Machine Intelligence, vol. 25, no. 8, pp. 993-1008, Aug. 2003.
-
(2003)
IEEE Trans. Pattern Anallysis and Machine Intelligence
, vol.25
, Issue.8
, pp. 993-1008
-
-
Brown, M.1
Burschka, D.2
Hager, G.3
-
23
-
-
0003009946
-
Combining Stereo and Monocular information to Computer Dense Depth Maps that Preserve Depth Discontinuities
-
P. Fua, "Combining Stereo and Monocular information to Computer Dense Depth Maps that Preserve Depth Discontinuities," Proc. 12th Int'l Joint Conf. Artificial Intelligence, pp. 1292-1298, 1997.
-
(1997)
Proc. 12th Int'l Joint Conf. Artificial Intelligence
, pp. 1292-1298
-
-
Fua, P.1
-
26
-
-
0001835850
-
Accurate Short-Term Analysis of the Fundamental Frequency and the Harmonics-to-Noise Ratio of a Sampled Sound
-
P. Boersma, "Accurate Short-Term Analysis of the Fundamental Frequency and the Harmonics-to-Noise Ratio of a Sampled Sound," Proc. Inst. Phonetic Sciences, vol. 17, pp. 97-110, 1993.
-
(1993)
Proc. Inst. Phonetic Sciences
, vol.17
, pp. 97-110
-
-
Boersma, P.1
-
27
-
-
33646806777
-
An Automatic Prosody Recognizer Using a Coupled Multi-Stream Acoustic Model and a Syntactic-Prosodic Language Model
-
S. Ananthakrishnan and S. Narayanan, "An Automatic Prosody Recognizer Using a Coupled Multi-Stream Acoustic Model and a Syntactic-Prosodic Language Model," Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '05), vol. 1, 2005.
-
(2005)
Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '05)
, vol.1
-
-
Ananthakrishnan, S.1
Narayanan, S.2
-
28
-
-
46149102414
-
-
Point Grey Research Inc
-
Point Grey Research Inc., http://www.ptgrey.com/, 2008.
-
(2008)
-
-
-
29
-
-
85119213703
-
Tobi: A Standard for Labeling English Prosody
-
92, pp
-
K. Silverman, M. Beckman, J. Pitrelli, M. Ostendorf, C. Wightman, P. Price, J. Pierrehumbert, and J. Hirschberg, "Tobi: A Standard for Labeling English Prosody," Proc. Int'l Conf. Spoken Language Processing (ICSLP '92), pp. 867-870, 1992.
-
(1992)
Proc. Int'l Conf. Spoken Language Processing (ICSLP
, pp. 867-870
-
-
Silverman, K.1
Beckman, M.2
Pitrelli, J.3
Ostendorf, M.4
Wightman, C.5
Price, P.6
Pierrehumbert, J.7
Hirschberg, J.8
-
30
-
-
46049121456
-
-
Momentum Inc, Speech-Driven Talking Head Avatar
-
Momentum Inc., Speech-Driven Talking Head Avatar, http:// www.momentum-dmt.com/, 2008.
-
(2008)
-
-
-
31
-
-
0030242097
-
Input-Output HMMs for Sequence Processing
-
Y. Bengio and P. Frasconi, "Input-Output HMMs for Sequence Processing," IEEE Trans. Neural Networks, vol. 77, no. 5, pp. 1231-1249, 1996.
-
(1996)
IEEE Trans. Neural Networks
, vol.77
, Issue.5
, pp. 1231-1249
-
-
Bengio, Y.1
Frasconi, P.2
-
32
-
-
4544244082
-
Torch: A Modular Machine Learning Software Library,
-
R. Collobert, S. Bengio, and J. Mariethoz, "Torch: A Modular Machine Learning Software Library," IDIAP Research Report, vol. 2, p. 46, 2002.
-
(2002)
IDIAP Research Report
, vol.2
, pp. 46
-
-
Collobert, R.1
Bengio, S.2
Mariethoz, J.3
-
34
-
-
0036503069
-
Optimisation Algorithms Exploiting Unitary Constraints
-
Mar
-
J.H. Manton, "Optimisation Algorithms Exploiting Unitary Constraints," IEEE Trans. Signal Processing, vol. 50, no. 3, pp. 635-650, Mar. 2002.
-
(2002)
IEEE Trans. Signal Processing
, vol.50
, Issue.3
, pp. 635-650
-
-
Manton, J.H.1
|