SCOPUS Information Retrieval Platform

Signal Processing

Volume 86, Issue 10, 2006, Pages 2932-2951

Speaker-independent 3D face synthesis driven by speech and text

Authors (3): Savran, Arman (a); Arslan, Levent M. (a); Akarun, Lale (a)

(a) BOGAZICI UNIVERSITY (Turkey)

Author keywords

3D facial motion capture; Audio visual codebook; MPEG 4 facial animation; Speaker independent; Visual speech synthesis

Indexed keywords

ANIMATION; CORRELATION METHODS; DATA PROCESSING; GESTURE RECOGNITION; INFORMATION RETRIEVAL; NATURAL LANGUAGE PROCESSING SYSTEMS; RECURRENT NEURAL NETWORKS; THREE DIMENSIONAL;

3D FACIAL MOTION CAPTURE; AUDIO-VISUAL CODEBOOKS; MPEG-4 FACIAL ANIMATION; SPEAKER INDEPENDENT; VISUAL SPEECH SYNTHESIS;

SPEECH RECOGNITION;

EID: 33745712098 PISSN: 01651684 EISSN: None Source Type: Journal
DOI: 10.1016/j.sigpro.2005.12.007 Document Type: Article

Times cited: 8

References (25)

1
- 0017199877
- Hearing lips and seeing voices
- McGurk H., and MacDonald J. Hearing lips and seeing voices. Nature 264 (1976) 746-748
- (1976) Nature , vol.264 , pp. 746-748
- McGurk, H.¹ MacDonald, J.²

2
- 33745687385
- J. Beskow, Rule-based visual speech synthesis, in: Proceedings of the Fourth European Conference on Speech Communication and Technology (Eurospeech '95), Madrid, Spain, 1995, pp. 299-302.

3
- 0001514782
- Modeling coarticulation in synthetic visual speech
- Thalmann N.M., and Thalmann D. (Eds), Springer, Tokyo
- Cohen M.M., and Massaro D.W. Modeling coarticulation in synthetic visual speech. In: Thalmann N.M., and Thalmann D. (Eds). Models and Techniques in Computer Animation (1993), Springer, Tokyo 139-156
- (1993) Models and Techniques in Computer Animation , pp. 139-156
- Cohen, M.M.¹ Massaro, D.W.²

4
- 33745701612
- C. Bregler, M. Covell, M. Slaney, Video rewrite: visual speech synthesis from video, in: Proceedings of the Workshop on Audio-Visual Speech Processing, Rhodes, Greece, 1997, pp. 153-156.

5
- 33745687391
- J.P. Lewis, F.I. Parke, Automatic lip-synch and speech synthesis for character animation, in: Proceedings of the Graphics Interface '86, Canadian Information Processing Society, Calgary, 1986, pp. 136-140.

6
- 0032179320
- Lip movement synthesis from speech based on Hidden Markov Models
- Yamamoto E., Nakamura S., and Shikano K. Lip movement synthesis from speech based on Hidden Markov Models. J. Speech Commun. 28 (1998) 105-115
- (1998) J. Speech Commun. , vol.28 , pp. 105-115
- Yamamoto, E.¹ Nakamura, S.² Shikano, K.³

7
- 0029270677
- Converting speech into lip movements: a multimedia telephone for hard of hearing people
- Lavagetto F. Converting speech into lip movements: a multimedia telephone for hard of hearing people. IEEE Trans. Rehabil. Eng. 3 1 (1995) 90-102
- (1995) IEEE Trans. Rehabil. Eng. , vol.3 , Issue.1 , pp. 90-102
- Lavagetto, F.¹

8
- 33745729631
- D.W. Massaro, J. Beskow, M.M. Cohen, Picture my voice: audio to visual speech synthesis using artificial neural networks, in: Proceedings of the AVSP '99, 1999.

9
- 0036650837
- Real-time speech-driven face animation with expressions using neural networks
- Hong P., Wen Z., and Huang T.S. Real-time speech-driven face animation with expressions using neural networks. IEEE Trans. Neural Networks 13 1 (2002) 100-111
- (2002) IEEE Trans. Neural Networks , vol.13 , Issue.1 , pp. 100-111
- Hong, P.¹ Wen, Z.² Huang, T.S.³

10
- 33745715695
- Codebook based face point trajectory synthesis algorithm using speech input
- Arslan L.M., and Talkin D. Codebook based face point trajectory synthesis algorithm using speech input. Elsevier Sci. 953 (1998) 01-13
- (1998) Elsevier Sci. , vol.953 , pp. 01-13
- Arslan, L.M.¹ Talkin, D.²

11
- 0003626435
- Prentice-Hall, Englewood Cliffs, NJ (pp. 295-302, Chapter 6)
- Gonzalez R.C., and Woods R.E. Digital Image Processing (2002), Prentice-Hall, Englewood Cliffs, NJ (pp. 295-302, Chapter 6)
- (2002) Digital Image Processing
- Gonzalez, R.C.¹ Woods, R.E.²

12
- 0004285133
- Prentice-Hall, Englewood Cliffs, NJ (pp. 74-75, Chapter 3)
- Shapiro L.G., and Stockman G.C. Computer Vision (2001), Prentice-Hall, Englewood Cliffs, NJ (pp. 74-75, Chapter 3)
- (2001) Computer Vision
- Shapiro, L.G.¹ Stockman, G.C.²

13
- 84890517975
- Least-squares fitting of two 3-D point sets
- Arun K.S., Huang T.S., and Blostein S.D. Least-squares fitting of two 3-D point sets. IEEE Trans. Pattern Anal. Mach. Intell. 9 5 (1987) 698-700
- (1987) IEEE Trans. Pattern Anal. Mach. Intell. , vol.9 , Issue.5 , pp. 698-700
- Arun, K.S.¹ Huang, T.S.² Blostein, S.D.³

14
- 33745715696
- H. Dutagaci, Statistical language models for large vocabulary Turkish speech recognition, M.S. Thesis, Bogazici University, 2002.

15
- 33745701611
- T. Robinson, M. Hochberg, S. Renals, The use of recurrent neural networks in continuous speech recognition, 1995, 〈svr-www.eng.cam.ac.uk/~ajr/rnn4csr94/rnn4csr94.html〉.

16
- 0003425258
- Prentice-Hall, Englewood Cliffs, NJ
- Rabiner L.R., and Schafer R.W. Digital Processing of Speech Signals (1978), Prentice-Hall, Englewood Cliffs, NJ
- (1978) Digital Processing of Speech Signals
- Rabiner, L.R.¹ Schafer, R.W.²

17
- 0032634198
- J. Rothweiler, A root-finding algorithm for line spectral frequencies, in: Proceedings of the IEEE ICASSP 1999, Phoenix, AZ, USA, 1999, pp. II-661-II-664.

18
- 0032595174
- On polynomial reduction in the computation of LSP frequencies
- Rothweiler J. On polynomial reduction in the computation of LSP frequencies. IEEE Trans. Speech Audio Process. 7 5 (1999) 592-594
- (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.5 , pp. 592-594
- Rothweiler, J.¹

19
- 0024634603
- Phoneme recognition using time-delay neural networks
- Waibel A., Hanazawa T., Hinton G., Shikano K., and Lang K. Phoneme recognition using time-delay neural networks. IEEE Trans. Acoust. Speech Signal Process. 37 (1989) 328-339
- (1989) IEEE Trans. Acoust. Speech Signal Process. , vol.37 , pp. 328-339
- Waibel, A.¹ Hanazawa, T.² Hinton, G.³ Shikano, K.⁴ Lang, K.⁵

20
- 0037624007
- Simple recurrent network trained by RTRL and extended Kalman filter algorithms
- Cernansky M., and Benuskova L. Simple recurrent network trained by RTRL and extended Kalman filter algorithms. Neural Network World 13 3 (2003) 223-234
- (2003) Neural Network World , vol.13 , Issue.3 , pp. 223-234
- Cernansky, M.¹ Benuskova, L.²

21
- 0001202594
- A learning algorithm for continually running fully recurrent neural networks
- Williams R.J., and Zipser D. A learning algorithm for continually running fully recurrent neural networks. Neural Comput. 1 (1989) 270-280
- (1989) Neural Comput. , vol.1 , pp. 270-280
- Williams, R.J.¹ Zipser, D.²

22
- 0003822743
- Entropic Cambridge Engineering
- Young S., Evermann G., Kershaw D., Moore G., Odell J., Ollason D., Povey D., Valtchev V., and Woodland P. The HTK Book (2002), Entropic Cambridge Engineering
- (2002) The HTK Book
- Young, S.¹ Evermann, G.² Kershaw, D.³ Moore, G.⁴ Odell, J.⁵ Ollason, D.⁶ Povey, D.⁷ Valtchev, V.⁸ Woodland, P.⁹

23
- 0004056285
- Prentice-Hall PTR (pp. 316-318, Chapter 6)
- Huang X., Acero A., and Hon H.W. Spoken Language Processing: a Guide to Theory, Algorithm, and System Development (2001), Prentice-Hall PTR (pp. 316-318, Chapter 6)
- (2001) Spoken Language Processing: a Guide to Theory, Algorithm, and System Development
- Huang, X.¹ Acero, A.² Hon, H.W.³

24
- 0027541354
- B-spline signal processing: Part I-theory
- Unser M., Aldroubi A., and Eden M. B-spline signal processing: Part I-theory. IEEE Trans. Signal Process. 41 2 (1993) 821-833
- (1993) IEEE Trans. Signal Process. , vol.41 , Issue.2 , pp. 821-833
- Unser, M.¹ Aldroubi, A.² Eden, M.³

25
- 0033097911
- The facial animation engine: towards a high-level interface for the design of MPEG-4 compliant animated faces
- Lavagetto F., and Pockaj R. The facial animation engine: towards a high-level interface for the design of MPEG-4 compliant animated faces. IEEE Trans. Circuits Syst. Video Technol. 9 2 (1999) 277-289
- (1999) IEEE Trans. Circuits Syst. Video Technol. , vol.9 , Issue.2 , pp. 277-289
- Lavagetto, F.¹ Pockaj, R.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.