SCOPUS 정보 검색 플랫폼

IEEE Transactions on Pattern Analysis and Machine Intelligence

Volumn 30, Issue 8, 2008, Pages 1330-1345

Analysis of head gesture and prosody patterns for prosody-driven head-gesture animation

(4) Sargin, Mehmet E a Yemez, Yucel b Erzin, Engin b Tekalp, Ahmet M b

a University of California (United States)

Author keywords

Animation; Face and gesture recognition; Multimedia information systems; Pattern analysis and recognition; Speech analysis

Indexed keywords

ANIMATION; CONFORMAL MAPPING; HIDDEN MARKOV MODELS; MARKOV PROCESSES; SPEECH; STAGES; TESTING;

APPLIED (CO); AUDIO-VISUAL; EULER ANGLES; GESTURE ANIMATION; HEAD MODELING; HIDDEN MARKOV MODEL (HMM); JOINT ANALYSIS; MULTI-STREAM; SPEECH PROSODY; STAGE ANALYSIS; SUBJECTIVE EVALUATIONS; SYNTHESIS (OF CHIRAL IONIC LIQUIDS); TEMPORAL SEGMENTATIONS; TWO STAGES;

MODAL ANALYSIS;

ALGORITHM; ARTICLE; AUTOMATED PATTERN RECOGNITION; CLUSTER ANALYSIS; COMPUTER ASSISTED DIAGNOSIS; FACE; HEAD; HUMAN; IMAGE ENHANCEMENT; METHODOLOGY; PHYSIOLOGY; REPRODUCIBILITY; SENSITIVITY AND SPECIFICITY; SPEECH; THREE DIMENSIONAL IMAGING;

ALGORITHMS; CLUSTER ANALYSIS; FACE; HEAD; HUMANS; IMAGE ENHANCEMENT; IMAGE INTERPRETATION, COMPUTER-ASSISTED; IMAGING, THREE-DIMENSIONAL; PATTERN RECOGNITION, AUTOMATED; REPRODUCIBILITY OF RESULTS; SENSITIVITY AND SPECIFICITY; SPEECH;

EID: 46149109647 PISSN: 01628828 EISSN: None Source Type: Journal
DOI: 10.1109/TPAMI.2007.70797 Document Type: Article

Times cited : (67)

References (35)

1
- 85032752352
- Audiovisual Speech Processing
- T. Chen, "Audiovisual Speech Processing," IEEE Signal Processing Magazine, vol. 18, pp. 9-21, 2001.
- (2001) IEEE Signal Processing Magazine , vol.18 , pp. 9-21
- Chen, T.¹

2
- 0024900468
- An intelligent Facial Image Coding Driven by Speech and Phoneme
- 89, pp
- S. Morishima, K. Aizawa, and H. Harashima, "An intelligent Facial Image Coding Driven by Speech and Phoneme," Proc. Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '89), pp. 1795-1798, 1989.
- (1989) Proc. Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP , pp. 1795-1798
- Morishima, S.¹ Aizawa, K.² Harashima, H.³

3
- 0030677313
- Video Rewrite: Driving Visual Speech with Audio
- 97, pp
- C. Bregier, M. Covell, and M. Slaney, "Video Rewrite: Driving Visual Speech with Audio," Proc. ACM SIGGRAPH '97, pp. 353-360, 1997.
- (1997) Proc. ACM SIGGRAPH , pp. 353-360
- Bregier, C.¹ Covell, M.² Slaney, M.³

4
- 85017188218
- Real-Time Lip-Synch Face Animation Driven by Human Voice
- F. Huang and T. Chen, "Real-Time Lip-Synch Face Animation Driven by Human Voice," Proc. IEEE Second Workshop Multimedia Signal Processing, pp. 352-357, 1998.
- (1998) Proc. IEEE Second Workshop Multimedia Signal Processing , pp. 352-357
- Huang, F.¹ Chen, T.²

5
- 0032179320
- Lip Movement Synthesis from Speech Based on Hidden Markov Models
- E. Yamamoto, S. Nakamura, and K. ShiKano, "Lip Movement Synthesis from Speech Based on Hidden Markov Models," Speech Comm., pp. 105-115, 1998.
- (1998) Speech Comm , pp. 105-115
- Yamamoto, E.¹ Nakamura, S.² ShiKano, K.³

6
- 84937437186
- Voice Puppetry
- M. Brand, "Voice Puppetry," Proc. 26th Ann. Conf. Computer Graphics and Interactive Techniques, pp. 21-288, 1999.
- (1999) Proc. 26th Ann. Conf. Computer Graphics and Interactive Techniques , pp. 21-288
- Brand, M.¹

7
- 2542499812
- Speech-to-Video Synthesis Using Facial Animation Parameters
- P.S. Aleksic and A.K. Katsaggelos, "Speech-to-Video Synthesis Using Facial Animation Parameters," IEEE Trans. Circuits and Systems for Video Technology, vol. 14, no. 5, pp. 682-692, 2004.
- (2004) IEEE Trans. Circuits and Systems for Video Technology , vol.14 , Issue.5 , pp. 682-692
- Aleksic, P.S.¹ Katsaggelos, A.K.²

8
- 33646752807
- Learning Dynamic Audio-Visual Mapping with Inputoutput Hidden Markov Models
- Y. Li and H.-Y. Shum, "Learning Dynamic Audio-Visual Mapping with Inputoutput Hidden Markov Models," IEEE Trans. Multimedia, vol. 8, no. 3, pp. 542-549, 2006.
- (2006) IEEE Trans. Multimedia , vol.8 , Issue.3 , pp. 542-549
- Li, Y.¹ Shum, H.-Y.²

9
- 34247623168
- Acoustically -Driven Talking Face Synthesis Using Dynamic Bayesian Networks
- J. Xue, J. Borgstrom, J. Jiang, L. Bernstein, and A. Alwan, "Acoustically -Driven Talking Face Synthesis Using Dynamic Bayesian Networks," Proc. Int'l Conf. Multimedia and Expo (ICME '06), pp. 1165-1168, 2006.
- (2006) Proc. Int'l Conf. Multimedia and Expo (ICME '06) , pp. 1165-1168
- Xue, J.¹ Borgstrom, J.² Jiang, J.³ Bernstein, L.⁴ Alwan, A.⁵

10
- 84960898014
- Multimodal Signal Analysis of Prosody and Hand Motion: Temporal Correlation of Speech and Gestures
- L. Valbonesi, R. Ansari, D. McNeill, F. Quek, S. Duncan, K.E. McCullough, qnd R. Bryll, "Multimodal Signal Analysis of Prosody and Hand Motion: Temporal Correlation of Speech and Gestures," Proc. European Signal Processing Cont. (EUSIPCO '02), vol. 1, pp. 75-78, 2002.
- (2002) Proc. European Signal Processing Cont. (EUSIPCO '02) , vol.1 , pp. 75-78
- Valbonesi, L.¹ Ansari, R.² McNeill, D.³ Quek, F.⁴ Duncan, S.⁵ McCullough, K.E.⁶ qnd, R.⁷ Bryll⁸

11
- 1642405348
- Visual Prosody and Speech Intelligibility: Head Movement Improves Auditory Speech Perception
- K. Munhall, J.A. Jones, D.E. Callan, T. Kuratate, and E. Vatikiotis-Bateson, "Visual Prosody and Speech Intelligibility: Head Movement Improves Auditory Speech Perception," Psychological Science vol. 15, no. 2, pp. 133-137, 2004.
- (2004) Psychological Science , vol.15 , Issue.2 , pp. 133-137
- Munhall, K.¹ Jones, J.A.² Callan, D.E.³ Kuratate, T.⁴ Vatikiotis-Bateson, E.⁵

12
- 85037085294
- Gesture Cues for Conversational Interaction in Monocular Video
- F. Quek, D. McNeill, R. Ansari, X. Ma, R. Bryll, S. Duncan, and K. McCullough, "Gesture Cues for Conversational Interaction in Monocular Video," Proc. Int'l Workshop Recognition, Analysis, and Tracking of Faces and Gestures in Real-Time Systems, pp. 64-69, 1999.
- (1999) Proc. Int'l Workshop Recognition, Analysis, and Tracking of Faces and Gestures in Real-Time Systems , pp. 64-69
- Quek, F.¹ McNeill, D.² Ansari, R.³ Ma, X.⁴ Bryll, R.⁵ Duncan, S.⁶ McCullough, K.⁷

13
- 85034718268
- Audio-Visual Syenthesis of Talking Faces from Speech Production Correllates
- 99, pp
- T. Kuratate, K.G. Munhall, P.E. Rubin, E. Vatikiotis-Bateson, and H. Yehia, "Audio-Visual Syenthesis of Talking Faces from Speech Production Correllates," Proc. European Conf. Speech Comm. and Technology (EURUSPEECH '99), pp. 1279-1282, 1999.
- (1999) Proc. European Conf. Speech Comm. and Technology (EURUSPEECH , pp. 1279-1282
- Kuratate, T.¹ Munhall, K.G.² Rubin, P.E.³ Vatikiotis-Bateson, E.⁴ Yehia, H.⁵

14
- 78650465043
- Visual Prosody: Facial Movements Accompanying Speech
- H.P. Graf, E. Cosatto, V. Strom, and F.J. Huang, "Visual Prosody: Facial Movements Accompanying Speech," Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition, pp. 381-386, 2002.
- (2002) Proc. IEEE Int'l Conf. Automatic Face and Gesture Recognition , pp. 381-386
- Graf, H.P.¹ Cosatto, E.² Strom, V.³ Huang, F.J.⁴

15
- 33645764471
- Mood Swings: Expressive Speech Animation
- E. Chuang and C. Bregler, "Mood Swings: Expressive Speech Animation," ACM Trans. Graphics, vol. 24, no. 2, pp. 331-347, 2005
- (2005) ACM Trans. Graphics , vol.24 , Issue.2 , pp. 331-347
- Chuang, E.¹ Bregler, C.²

16
- 14944376450
- Audio-Based Head Motion Synthesis for Avatar-Based Telepresence Systems
- Z. Deng, C. Busso, S. Narayanan, and U. Neumann, "Audio-Based Head Motion Synthesis for Avatar-Based Telepresence Systems," Proc. ACM SIGMM Workshop Effective Telepresence (ETP '04), pp. 244-30, 2004.
- (2004) Proc. ACM SIGMM Workshop Effective Telepresence (ETP '04) , pp. 244-330
- Deng, Z.¹ Busso, C.² Narayanan, S.³ Neumann, U.⁴

17
- 34547499478
- Gesture-Speech Correlation Analysis and Speech Driven Gesture Synthesis
- M.E. Sargin, F. Ofli, Y. Yasinnik, O. Aran, A. Karpov, S. Wilson, E. Erzin, Y. Yemez, qnd A.M. Tekalp, "Gesture-Speech Correlation Analysis and Speech Driven Gesture Synthesis," Proc. Int'l Conf. Multimedia and Expo (ICME '06), 2006.
- (2006) Proc. Int'l Conf. Multimedia and Expo (ICME '06)
- Sargin, M.E.¹ Ofli, F.² Yasinnik, Y.³ Aran, O.⁴ Karpov, A.⁵ Wilson, S.⁶ Erzin, E.⁷ Yemez, Y.⁸ qnd, A.M.⁹ Tekalp¹⁰

18
- 0036452478
- Discovering Recurrent Events in Video Using Unsupervised Methods
- M. Naphade and T. Huang, "Discovering Recurrent Events in Video Using Unsupervised Methods," Proc. Int'l Conf. Image Processing (ICIP '02) 2, pp. 13-16, 2002.
- (2002) Proc. Int'l Conf. Image Processing (ICIP '02) , vol.2 , pp. 13-16
- Naphade, M.¹ Huang, T.²

19
- 0035680116
- Rapid Object Detection Using a Boosted Cascade of Simple Features
- P. Viola and M. Jones, "Rapid Object Detection Using a Boosted Cascade of Simple Features," Proc. IEEE Computer Vision and Pattern Recognition (CVPR '01), pp. 511-518, 2001.
- (2001) Proc. IEEE Computer Vision and Pattern Recognition (CVPR '01) , pp. 511-518
- Viola, P.¹ Jones, M.²

20
- 17744406666
- An Extended Set of Haar-Like Features for Rapid Object Detection
- R. Lienhart and J. Maydt, "An Extended Set of Haar-Like Features for Rapid Object Detection," Proc. Int'l Conf. Image Processing (ICIP '02), vol. 1, pp. 900-903, 2002.
- (2002) Proc. Int'l Conf. Image Processing (ICIP '02) , vol.1 , pp. 900-903
- Lienhart, R.¹ Maydt, J.²

21
- 2442456044
- OpenCVDocuments, Intel Corp, Microprocessor Research Labs
- J.Y. Bouguet, Pyramidal Implementation of the Lucas Kanade Feature Trackerdescription of the Algorithm, OpenCVDocuments, Intel Corp., Microprocessor Research Labs, 1999.
- (1999) Pyramidal Implementation of the Lucas Kanade Feature Trackerdescription of the Algorithm
- Bouguet, J.Y.¹

22
- 0041972413
- Advances in Computational Stereo
- Aug
- M. Brown, D. Burschka, and G. Hager, "Advances in Computational Stereo," IEEE Trans. Pattern Anallysis and Machine Intelligence, vol. 25, no. 8, pp. 993-1008, Aug. 2003.
- (2003) IEEE Trans. Pattern Anallysis and Machine Intelligence , vol.25 , Issue.8 , pp. 993-1008
- Brown, M.¹ Burschka, D.² Hager, G.³

23
- 0003009946
- Combining Stereo and Monocular information to Computer Dense Depth Maps that Preserve Depth Discontinuities
- P. Fua, "Combining Stereo and Monocular information to Computer Dense Depth Maps that Preserve Depth Discontinuities," Proc. 12th Int'l Joint Conf. Artificial Intelligence, pp. 1292-1298, 1997.
- (1997) Proc. 12th Int'l Joint Conf. Artificial Intelligence , pp. 1292-1298
- Fua, P.¹

24
- 0003782380
- Description of Rotation in Terms of the Euler Angles
- World Scientific
- D. Varshalovich, A. Moskalev, and V. Khersonskii, "Description of Rotation in Terms of the Euler Angles," Quantom Theory of Angular Momentum, World Scientific, 1988.
- (1988) Quantom Theory of Angular Momentum
- Varshalovich, D.¹ Moskalev, A.² Khersonskii, V.³

25
- 0022095776
- Animating Rotation with Quaternion Curves
- K. Shoemake, "Animating Rotation with Quaternion Curves," Proc. 12th Ann. Conf. Computer Graphics and Interactive Techniques, pp. 245-254, 1985.
- (1985) Proc. 12th Ann. Conf. Computer Graphics and Interactive Techniques , pp. 245-254
- Shoemake, K.¹

26
- 0001835850
- Accurate Short-Term Analysis of the Fundamental Frequency and the Harmonics-to-Noise Ratio of a Sampled Sound
- P. Boersma, "Accurate Short-Term Analysis of the Fundamental Frequency and the Harmonics-to-Noise Ratio of a Sampled Sound," Proc. Inst. Phonetic Sciences, vol. 17, pp. 97-110, 1993.
- (1993) Proc. Inst. Phonetic Sciences , vol.17 , pp. 97-110
- Boersma, P.¹

27
- 33646806777
- An Automatic Prosody Recognizer Using a Coupled Multi-Stream Acoustic Model and a Syntactic-Prosodic Language Model
- S. Ananthakrishnan and S. Narayanan, "An Automatic Prosody Recognizer Using a Coupled Multi-Stream Acoustic Model and a Syntactic-Prosodic Language Model," Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '05), vol. 1, 2005.
- (2005) Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '05) , vol.1
- Ananthakrishnan, S.¹ Narayanan, S.²

28
- 46149102414
- Point Grey Research Inc
- Point Grey Research Inc., http://www.ptgrey.com/, 2008.
- (2008)

29
- 85119213703
- Tobi: A Standard for Labeling English Prosody
- 92, pp
- K. Silverman, M. Beckman, J. Pitrelli, M. Ostendorf, C. Wightman, P. Price, J. Pierrehumbert, and J. Hirschberg, "Tobi: A Standard for Labeling English Prosody," Proc. Int'l Conf. Spoken Language Processing (ICSLP '92), pp. 867-870, 1992.
- (1992) Proc. Int'l Conf. Spoken Language Processing (ICSLP , pp. 867-870
- Silverman, K.¹ Beckman, M.² Pitrelli, J.³ Ostendorf, M.⁴ Wightman, C.⁵ Price, P.⁶ Pierrehumbert, J.⁷ Hirschberg, J.⁸

30
- 46049121456
- Momentum Inc, Speech-Driven Talking Head Avatar
- Momentum Inc., Speech-Driven Talking Head Avatar, http:// www.momentum-dmt.com/, 2008.
- (2008)

31
- 0030242097
- Input-Output HMMs for Sequence Processing
- Y. Bengio and P. Frasconi, "Input-Output HMMs for Sequence Processing," IEEE Trans. Neural Networks, vol. 77, no. 5, pp. 1231-1249, 1996.
- (1996) IEEE Trans. Neural Networks , vol.77 , Issue.5 , pp. 1231-1249
- Bengio, Y.¹ Frasconi, P.²

32
- 4544244082
- Torch: A Modular Machine Learning Software Library,
- R. Collobert, S. Bengio, and J. Mariethoz, "Torch: A Modular Machine Learning Software Library," IDIAP Research Report, vol. 2, p. 46, 2002.
- (2002) IDIAP Research Report , vol.2 , pp. 46
- Collobert, R.¹ Bengio, S.² Mariethoz, J.³

33
- 46149106085
- Prosody-Driven Head Gesture Animation, http://mvgl.ku.edu.tr/ prosodygesture/, 2008.
- (2008) Prosody-Driven Head Gesture Animation

34
- 0036503069
- Optimisation Algorithms Exploiting Unitary Constraints
- Mar
- J.H. Manton, "Optimisation Algorithms Exploiting Unitary Constraints," IEEE Trans. Signal Processing, vol. 50, no. 3, pp. 635-650, Mar. 2002.
- (2002) IEEE Trans. Signal Processing , vol.50 , Issue.3 , pp. 635-650
- Manton, J.H.¹

35
- 0034857181
- Motion Estimation from Disparity Images
- D. Demirdjian and T. Darrell, "Motion Estimation from Disparity Images," Proc. Eighth IEEE Int'l Conf. Computer Vision, vol. 1, pp. 213-218, 2001.
- (2001) Proc. Eighth IEEE Int'l Conf. Computer Vision , vol.1 , pp. 213-218
- Demirdjian, D.¹ Darrell, T.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.