SCOPUS 정보 검색 플랫폼

Volumn 9, Issue 3, 2007, Pages 500-510

Realistic mouth-synching for speech-driven talking face using articulatory modelling

b CITY UNIVERSITY OF HONG KONG (Hong Kong)

Author keywords

Articulatory model; Baum Welch DBN inversion (DBNI); Dynamic Bayesian networks (DBNs); Facial animation; Mouth synching; Talking face

Indexed keywords

ARTICULATORY MODELS; BAUM-WELCH DBN INVERSION (DBNI); DYNAMIC BAYESIAN NETWORKS (DBNS); FACIAL ANIMATION; TALKING FACES;

ALGORITHMS; BAYESIAN NETWORKS; COMPUTER SIMULATION; FACE RECOGNITION; HIDDEN MARKOV MODELS; MATHEMATICAL MODELS; MAXIMUM LIKELIHOOD ESTIMATION; SPEECH PROCESSING;

ANIMATION;

EID: 33947583073 PISSN: 15209210 EISSN: None Source Type: Journal
DOI: 10.1109/TMM.2006.888009 Document Type: Article

Times cited : (90)

References (34)

1
- 10044221981
- Talking faces-technologies and applications
- Aug
- J. Ostermann and A. Weissenfeld, "Talking faces-technologies and applications," in Proc. of ICPR'04, Aug. 2004, vol. 3, pp. 826-833.
- (2004) Proc. of ICPR'04 , vol.3 , pp. 826-833
- Ostermann, J.¹ Weissenfeld, A.²

2
- 10044281988
- Lifelike talking faces for interactive services
- Sep
- E. Cosatto, J. Ostermann, H. P. Graf, and J. Schroeter, "Lifelike talking faces for interactive services," Proc. IEEE, vol. 91, no. 9, pp. 1406-1428, Sep. 2003.
- (2003) Proc. IEEE , vol.91 , Issue.9 , pp. 1406-1428
- Cosatto, E.¹ Ostermann, J.² Graf, H.P.³ Schroeter, J.⁴

3
- 0031631507
- Synthesizing realistic facial expressions from photographs
- F. Pighin, J. Hecker, D. Lischinski, R. Szeliski, and D. H. Salesin, "Synthesizing realistic facial expressions from photographs," in Proc. ACM SIGGRAPH'98, 1998, vol. 3, pp. 75-84.
- (1998) Proc. ACM SIGGRAPH'98 , vol.3 , pp. 75-84
- Pighin, F.¹ Hecker, J.² Lischinski, D.³ Szeliski, R.⁴ Salesin, D.H.⁵

4
- 0036949796
- Head shop: Generating animated head models with anatomical structure
- K. Kaehler, J. Haber, H. Yamauchi, and HP Seidel, "Head shop: Generating animated head models with anatomical structure," in Proc. ACM SIGGRAPH'02, 2002, pp. 55-63.
- (2002) Proc. ACM SIGGRAPH'02 , pp. 55-63
- Kaehler, K.¹ Haber, J.² Yamauchi, H.³ Seidel, H.P.⁴

5
- 0030677313
- Video rewrite: Driving visual speech with audio
- C. Bregler, M. Covell, and M. Slaney, "Video rewrite: Driving visual speech with audio," in Proc. ACM SIGGRAPH'97, 1997.
- (1997) Proc. ACM SIGGRAPH'97
- Bregler, C.¹ Covell, M.² Slaney, M.³

6
- 77953828868
- Trainable videorealistic speech animation
- T. Ezzat, G. Geiger, and T. Poggio, "Trainable videorealistic speech animation," in Proc. ACM SIGGRAPH, 2002, pp. 388-397.
- (2002) Proc. ACM SIGGRAPH , pp. 388-397
- Ezzat, T.¹ Geiger, G.² Poggio, T.³

7
- 84872004031
- Sample-based synthesis of photo-realistic talking heads
- E. Cosatto and H. Graf, "Sample-based synthesis of photo-realistic talking heads," in Proc. IEEE Computer Animation, 1998, pp. 103-110.
- (1998) Proc. IEEE Computer Animation , pp. 103-110
- Cosatto, E.¹ Graf, H.²

8
- 0034271782
- Photo-realistic talking heads from image samples
- _. "Photo-realistic talking heads from image samples," IEEE Trans. Multimedia, vol. 2, pp. 152-163, 2000.
- (2000) IEEE Trans. Multimedia , vol.2 , pp. 152-163
- Cosatto, E.¹ Graf, H.²

9
- 0141702290
- Recent improvements to the IBM trainable speech synthesis system
- E. Eide, A. Aaron, R. Bakis, P. Cohen, R. Donovan, W. Hamza, T. Mathes, M. Picheny, M. Polkosky, M. Smith, and M. Viswanathan, "Recent improvements to the IBM trainable speech synthesis system," in Proc. ICASSP'03, 2003, vol. 1, pp. 708-711.
- (2003) Proc. ICASSP'03 , vol.1 , pp. 708-711
- Eide, E.¹ Aaron, A.² Bakis, R.³ Cohen, P.⁴ Donovan, R.⁵ Hamza, W.⁶ Mathes, T.⁷ Picheny, M.⁸ Polkosky, M.⁹ Smith, M.¹⁰ Viswanathan, M.¹¹

10
- 0001514782
- Modeling coarticulation in synthetic visual speech
- M. Magnenat-Thalmann and D. Thalmann, Eds. Tokyo, Japan: Springer-Verlag
- M. M. Cohen and D. W. Massaro, "Modeling coarticulation in synthetic visual speech," in Models and Techniques in Computer Animation, M. Magnenat-Thalmann and D. Thalmann, Eds. Tokyo, Japan: Springer-Verlag, 1993, pp. 139-156.
- (1993) Models and Techniques in Computer Animation , pp. 139-156
- Cohen, M.M.¹ Massaro, D.W.²

11
- 0036650837
- Real-time speech-driven face animation with expressions using neural networks
- P. Hong, Z. Wen, and T. S. Huang, "Real-time speech-driven face animation with expressions using neural networks," IEEE Trans. Neural Networks, vol. 13, no. 4, pp. 916-927, 2002.
- (2002) IEEE Trans. Neural Networks , vol.13 , Issue.4 , pp. 916-927
- Hong, P.¹ Wen, Z.² Huang, T.S.³

12
- 85133709259
- Picture my voice: Audio to visual speech synthesis using artificial neural networks
- D. W. Massaro, J. Beskow, M. M. Cohen, C. L. Fry, and T. Rodriguez, "Picture my voice: Audio to visual speech synthesis using artificial neural networks," in Proc. AVSP'99, 1999, pp. 133-138.
- (1999) Proc. AVSP'99 , pp. 133-138
- Massaro, D.W.¹ Beskow, J.² Cohen, M.M.³ Fry, C.L.⁴ Rodriguez, T.⁵

13
- 0004244302
- Englewood Cliffs, NJ: Prentice-Hall
- L. R. Rabiner and B. H. Juang, Fundamentals of Speech Recognition. Englewood Cliffs, NJ: Prentice-Hall, 1993.
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.R.¹ Juang, B.H.²

14
- 0032179320
- Lip movement synthesis from speech based on hidden Markov models
- E. Yamamoto, S. Nakamura, and K. Shikano, "Lip movement synthesis from speech based on hidden Markov models," Speech Commun., vol. 26, no. 1-2, pp. 105-115, 1998.
- (1998) Speech Commun , vol.26 , Issue.1-2 , pp. 105-115
- Yamamoto, E.¹ Nakamura, S.² Shikano, K.³

15
- 84937437186
- Voice puppetry
- M. Brand, "Voice puppetry," in Proc. ACM SIGGRAPH'99, 1999, pp. 21-28.
- (1999) Proc. ACM SIGGRAPH'99 , pp. 21-28
- Brand, M.¹

16
- 85032752352
- Audiovisual speech processing: Lip reading and lip synchronization
- T. Chen, "Audiovisual speech processing: Lip reading and lip synchronization," IEEE Signal Process. Mag., vol. 18, no. 1. pp. 9-21, 2001.
- (2001) IEEE Signal Process. Mag , vol.18 , Issue.1 , pp. 9-21
- Chen, T.¹

17
- 85008058913
- Real-time speech motion synthesis from recorded motions
- Y. Cao, P. Faloutsos, E. Kohler, and F. Pighin, "Real-time speech motion synthesis from recorded motions," in Eurographics/ACM SIGGRAPH Symp. Computer Animation, 2004, pp. 347-355.
- (2004) Eurographics/ACM SIGGRAPH Symp. Computer Animation , pp. 347-355
- Cao, Y.¹ Faloutsos, P.² Kohler, E.³ Pighin, F.⁴

18
- 0000497160
- Baum-Weich hidden Markov model inversion for reliable audio-to-visual conversion
- K. Choi and J. N. Hwang, "Baum-Weich hidden Markov model inversion for reliable audio-to-visual conversion," in Proc. IEEE 3rd Workshop Multimedia Signal Processing, 1999, pp. 175-180.
- (1999) Proc. IEEE 3rd Workshop Multimedia Signal Processing , pp. 175-180
- Choi, K.¹ Hwang, J.N.²

19
- 0028996864
- Noisy speech recognition using robust inversion of hidden Markov models
- S. Y. Moon and J. N. Hwang, "Noisy speech recognition using robust inversion of hidden Markov models," in Proc. ICASSP'95, 1995, pp. 145-148.
- (1995) Proc. ICASSP'95 , pp. 145-148
- Moon, S.Y.¹ Hwang, J.N.²

20
- 0035426641
- Hidden Markov model inversion for audio-to-visual conversion in an MPEG-4 facial animation system
- K. Choi, Y. Luo, and J. Hwang, "Hidden Markov model inversion for audio-to-visual conversion in an MPEG-4 facial animation system," J. VLSI Signal Process., no. 29, pp. 51-61, 2001.
- (2001) J. VLSI Signal Process , Issue.29 , pp. 51-61
- Choi, K.¹ Luo, Y.² Hwang, J.³

21
- 16244385915
- Audio/visual mapping with cross-modal hidden Markov models
- S. Fu, R. Gutierrez-Osuna, A. Esposito, K. P. Kakumanu, and O. N. Garcia, "Audio/visual mapping with cross-modal hidden Markov models," IEEE Trans. Multimedia, vol. 7, pp. 243-251, 2005.
- (2005) IEEE Trans. Multimedia , vol.7 , pp. 243-251
- Fu, S.¹ Gutierrez-Osuna, R.² Esposito, A.³ Kakumanu, K.P.⁴ Garcia, O.N.⁵

22
- 0003773641
- W. Hardcastle and N. Hewlett, Eds. New York: Basil Blackwell
- J. Goldsmith, Autosegmental and Metrical Phonology, W. Hardcastle and N. Hewlett, Eds. New York: Basil Blackwell, 1990.
- (1990) Autosegmental and Metrical Phonology
- Goldsmith, J.¹

23
- 28444470028
- Research on Key Issues of Audio Visual Speech Recognition,
- Ph.D. dissertation, Northwestern Polytechnical Univ, Xian, China
- L. Xie, "Research on Key Issues of Audio Visual Speech Recognition," Ph.D. dissertation, Northwestern Polytechnical Univ., Xian, China, 2004.
- (2004)
- Xie, L.¹

24
- 85128370668
- Combining articulatory and acoustic information for speech recognition in noisy and reverberant environments
- K. Kirchhoff, "Combining articulatory and acoustic information for speech recognition in noisy and reverberant environments," in Proc. ICSLP'98, 1998, pp. 891-894.
- (1998) Proc. ICSLP'98 , pp. 891-894
- Kirchhoff, K.¹

25
- 0003448310
- New York: Springer-Verlag
- F. V. Jensen, Bayesian Networks and Decision Graphs. New York: Springer-Verlag, 2001.
- (2001) Bayesian Networks and Decision Graphs
- Jensen, F.V.¹

26
- 0038784279
- Bayesian network structures and inference techniques for automatic speech recognition
- G. G. Zweig, "Bayesian network structures and inference techniques for automatic speech recognition," Comput. Speech Lang., vol. 17, pp. 173-193, 2003.
- (2003) Comput. Speech Lang , vol.17 , pp. 173-193
- Zweig, G.G.¹

27
- 0037697284
- Hidden-articulator Markov models for speech recognition
- M. Richardson, J. Bilmes, and C. Diorio, "Hidden-articulator Markov models for speech recognition," Speech Commun., vol. 41, pp. 511-529, 2003.
- (2003) Speech Commun , vol.41 , pp. 511-529
- Richardson, M.¹ Bilmes, J.² Diorio, C.³

28
- 0141587250
- Discriminatively structured graphical models for speech recognition
- J. A. Bilmes, G. Zweig, T. Richardson, K. Filali, K. Livescu, P. Xu, K. Jackson, Y. Brandman, E. Sandness, E. Holtz, J. Torres, and B. Byrne, "Discriminatively structured graphical models for speech recognition," in Tech. Rep. JHU 2001 Summer Workshop, 2001.
- (2001) Tech. Rep. JHU 2001 Summer Workshop
- Bilmes, J.A.¹ Zweig, G.² Richardson, T.³ Filali, K.⁴ Livescu, K.⁵ Xu, P.⁶ Jackson, K.⁷ Brandman, Y.⁸ Sandness, E.⁹ Holtz, E.¹⁰ Torres, J.¹¹ Byrne, B.¹²

29
- 84972571328
- Growth functions for transformations on manifolds
- L. E. Baum and G. R. Sell, "Growth functions for transformations on manifolds," Pacific J. Math., vol. 27, no. 2, pp. 211-227, 1968.
- (1968) Pacific J. Math , vol.27 , Issue.2 , pp. 211-227
- Baum, L.E.¹ Sell, G.R.²

30
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- A. Dempster, A. N. Laird, and D. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc. B, vol. 39, pp. 89-111, 1977.
- (1977) J. R. Statist. Soc. B , vol.39 , pp. 89-111
- Dempster, A.¹ Laird, A.N.² Rubin, D.³

31
- 0013288412
- Dynamic Bayesian Networks: Representation, Inference and Learning,
- Ph.D. dissertation, Univ. California, Berkeley
- K. Murphy, "Dynamic Bayesian Networks: Representation, Inference and Learning," Ph.D. dissertation, Univ. California, Berkeley, 2002.
- (2002)
- Murphy, K.¹

32
- 33947603303
- L. Xie and Z. Ye, The JEWEL Audio-Visual Dataset for Facial Animation 2005, Tech. Rep. RCMT 05-11.
- L. Xie and Z. Ye, The JEWEL Audio-Visual Dataset for Facial Animation 2005, Tech. Rep. RCMT 05-11.

33
- 33947610805
- S. Young, G. Evermann, D. Kershaw, J. Odell, D. Ollason, D. Povey, V. Valtchev, and P. Woodland, The HTK Book Eng. Dept., Cambridge Univ., Cambridge, U.K., 2002 [Online]. Available: http://htk.eng.cam.ac.uk/, 3.2
- S. Young, G. Evermann, D. Kershaw, J. Odell, D. Ollason, D. Povey, V. Valtchev, and P. Woodland, The HTK Book Eng. Dept., Cambridge Univ., Cambridge, U.K., 2002 [Online]. Available: http://htk.eng.cam.ac.uk/, 3.2

34
- 4644303413
- Poisson image editing
- P. Pèrez, M. Gangnet, and A. Blake, "Poisson image editing," ACM Trans. Graphics (SIGGRAPH'03), vol. 22, no. 3, pp. 313-318, 2003.
- (2003) ACM Trans. Graphics (SIGGRAPH'03) , vol.22 , Issue.3 , pp. 313-318
- Pèrez, P.¹ Gangnet, M.² Blake, A.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.