메뉴 건너뛰기




Volumn 86, Issue 10, 2006, Pages 2932-2951

Speaker-independent 3D face synthesis driven by speech and text

Author keywords

3D facial motion capture; Audio visual codebook; MPEG 4 facial animation; Speaker independent; Visual speech synthesis

Indexed keywords

ANIMATION; CORRELATION METHODS; DATA PROCESSING; GESTURE RECOGNITION; INFORMATION RETRIEVAL; NATURAL LANGUAGE PROCESSING SYSTEMS; RECURRENT NEURAL NETWORKS; THREE DIMENSIONAL;

EID: 33745712098     PISSN: 01651684     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.sigpro.2005.12.007     Document Type: Article
Times cited : (8)

References (25)
  • 1
    • 0017199877 scopus 로고
    • Hearing lips and seeing voices
    • McGurk H., and MacDonald J. Hearing lips and seeing voices. Nature 264 (1976) 746-748
    • (1976) Nature , vol.264 , pp. 746-748
    • McGurk, H.1    MacDonald, J.2
  • 2
    • 33745687385 scopus 로고    scopus 로고
    • J. Beskow, Rule-based visual speech synthesis, in: Proceedings of the Fourth European Conference on Speech Communication and Technology (Eurospeech '95), Madrid, Spain, 1995, pp. 299-302.
  • 3
    • 0001514782 scopus 로고
    • Modeling coarticulation in synthetic visual speech
    • Thalmann N.M., and Thalmann D. (Eds), Springer, Tokyo
    • Cohen M.M., and Massaro D.W. Modeling coarticulation in synthetic visual speech. In: Thalmann N.M., and Thalmann D. (Eds). Models and Techniques in Computer Animation (1993), Springer, Tokyo 139-156
    • (1993) Models and Techniques in Computer Animation , pp. 139-156
    • Cohen, M.M.1    Massaro, D.W.2
  • 4
    • 33745701612 scopus 로고    scopus 로고
    • C. Bregler, M. Covell, M. Slaney, Video rewrite: visual speech synthesis from video, in: Proceedings of the Workshop on Audio-Visual Speech Processing, Rhodes, Greece, 1997, pp. 153-156.
  • 5
    • 33745687391 scopus 로고    scopus 로고
    • J.P. Lewis, F.I. Parke, Automatic lip-synch and speech synthesis for character animation, in: Proceedings of the Graphics Interface '86, Canadian Information Processing Society, Calgary, 1986, pp. 136-140.
  • 6
    • 0032179320 scopus 로고    scopus 로고
    • Lip movement synthesis from speech based on Hidden Markov Models
    • Yamamoto E., Nakamura S., and Shikano K. Lip movement synthesis from speech based on Hidden Markov Models. J. Speech Commun. 28 (1998) 105-115
    • (1998) J. Speech Commun. , vol.28 , pp. 105-115
    • Yamamoto, E.1    Nakamura, S.2    Shikano, K.3
  • 7
    • 0029270677 scopus 로고
    • Converting speech into lip movements: a multimedia telephone for hard of hearing people
    • Lavagetto F. Converting speech into lip movements: a multimedia telephone for hard of hearing people. IEEE Trans. Rehabil. Eng. 3 1 (1995) 90-102
    • (1995) IEEE Trans. Rehabil. Eng. , vol.3 , Issue.1 , pp. 90-102
    • Lavagetto, F.1
  • 8
    • 33745729631 scopus 로고    scopus 로고
    • D.W. Massaro, J. Beskow, M.M. Cohen, Picture my voice: audio to visual speech synthesis using artificial neural networks, in: Proceedings of the AVSP '99, 1999.
  • 9
    • 0036650837 scopus 로고    scopus 로고
    • Real-time speech-driven face animation with expressions using neural networks
    • Hong P., Wen Z., and Huang T.S. Real-time speech-driven face animation with expressions using neural networks. IEEE Trans. Neural Networks 13 1 (2002) 100-111
    • (2002) IEEE Trans. Neural Networks , vol.13 , Issue.1 , pp. 100-111
    • Hong, P.1    Wen, Z.2    Huang, T.S.3
  • 10
    • 33745715695 scopus 로고    scopus 로고
    • Codebook based face point trajectory synthesis algorithm using speech input
    • Arslan L.M., and Talkin D. Codebook based face point trajectory synthesis algorithm using speech input. Elsevier Sci. 953 (1998) 01-13
    • (1998) Elsevier Sci. , vol.953 , pp. 01-13
    • Arslan, L.M.1    Talkin, D.2
  • 11
    • 0003626435 scopus 로고    scopus 로고
    • Prentice-Hall, Englewood Cliffs, NJ (pp. 295-302, Chapter 6)
    • Gonzalez R.C., and Woods R.E. Digital Image Processing (2002), Prentice-Hall, Englewood Cliffs, NJ (pp. 295-302, Chapter 6)
    • (2002) Digital Image Processing
    • Gonzalez, R.C.1    Woods, R.E.2
  • 12
    • 0004285133 scopus 로고    scopus 로고
    • Prentice-Hall, Englewood Cliffs, NJ (pp. 74-75, Chapter 3)
    • Shapiro L.G., and Stockman G.C. Computer Vision (2001), Prentice-Hall, Englewood Cliffs, NJ (pp. 74-75, Chapter 3)
    • (2001) Computer Vision
    • Shapiro, L.G.1    Stockman, G.C.2
  • 14
    • 33745715696 scopus 로고    scopus 로고
    • H. Dutagaci, Statistical language models for large vocabulary Turkish speech recognition, M.S. Thesis, Bogazici University, 2002.
  • 15
    • 33745701611 scopus 로고    scopus 로고
    • T. Robinson, M. Hochberg, S. Renals, The use of recurrent neural networks in continuous speech recognition, 1995, 〈svr-www.eng.cam.ac.uk/~ajr/rnn4csr94/rnn4csr94.html〉.
  • 17
    • 0032634198 scopus 로고    scopus 로고
    • J. Rothweiler, A root-finding algorithm for line spectral frequencies, in: Proceedings of the IEEE ICASSP 1999, Phoenix, AZ, USA, 1999, pp. II-661-II-664.
  • 18
    • 0032595174 scopus 로고    scopus 로고
    • On polynomial reduction in the computation of LSP frequencies
    • Rothweiler J. On polynomial reduction in the computation of LSP frequencies. IEEE Trans. Speech Audio Process. 7 5 (1999) 592-594
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.5 , pp. 592-594
    • Rothweiler, J.1
  • 20
    • 0037624007 scopus 로고    scopus 로고
    • Simple recurrent network trained by RTRL and extended Kalman filter algorithms
    • Cernansky M., and Benuskova L. Simple recurrent network trained by RTRL and extended Kalman filter algorithms. Neural Network World 13 3 (2003) 223-234
    • (2003) Neural Network World , vol.13 , Issue.3 , pp. 223-234
    • Cernansky, M.1    Benuskova, L.2
  • 21
    • 0001202594 scopus 로고
    • A learning algorithm for continually running fully recurrent neural networks
    • Williams R.J., and Zipser D. A learning algorithm for continually running fully recurrent neural networks. Neural Comput. 1 (1989) 270-280
    • (1989) Neural Comput. , vol.1 , pp. 270-280
    • Williams, R.J.1    Zipser, D.2
  • 25
    • 0033097911 scopus 로고    scopus 로고
    • The facial animation engine: towards a high-level interface for the design of MPEG-4 compliant animated faces
    • Lavagetto F., and Pockaj R. The facial animation engine: towards a high-level interface for the design of MPEG-4 compliant animated faces. IEEE Trans. Circuits Syst. Video Technol. 9 2 (1999) 277-289
    • (1999) IEEE Trans. Circuits Syst. Video Technol. , vol.9 , Issue.2 , pp. 277-289
    • Lavagetto, F.1    Pockaj, R.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.