SCOPUS Information Search Platform

Signal Processing

Volume 86, Issue 12, 2006, Pages 3657-3673

Design, implementation and evaluation of the Czech realistic audio-visual speech synthesis

(4) Železný, Miloš ᵃ; Krňoul, Zdeněk ᵃ; Císař, Petr ᵃ; Matoušek, Jindřich ᵃ

ᵃ UNIVERSITY OF WEST BOHEMIA (Czech Republic)

Author keywords

Audio visual speech processing; Facial animation; Talking head

Indexed keywords

ANIMATION CONTROL; AUDIO VISUAL SPEECH PROCESSING; FACIAL ANIMATION; TALKING HEAD;

ALGORITHMS; ANIMATION; DATABASE SYSTEMS; LINGUISTICS; MATHEMATICAL MODELS; SPEECH RECOGNITION;

SPEECH SYNTHESIS;

EID: 33749437734 PISSN: 01651684 EISSN: None Source Type: Journal
DOI: 10.1016/j.sigpro.2006.02.039 Document Type: Article

Times cited (26)

References (42)

1
- 85009291900
- Design of an audio-visual speech corpus for the Czech audio-visual speech synthesis
- Denver, USA
- Železný M., Císař P., Krňoul Z., and Novák J. Design of an audio-visual speech corpus for the Czech audio-visual speech synthesis. Proceedings of ICSLP 2002. Denver, USA (2002)
- (2002) Proceedings of ICSLP 2002
- Železný, M.¹ Císař, P.² Krňoul, Z.³ Novák, J.⁴

2
- 0020202671
- F.I. Parke, Parameterized models for facial animation, IEEE Comput. Graph. Appl. (November 1982) 61-68.

3
- 33749431482
- S. Basu, A. Pentland, A three-dimensional model of human lip motions trained from video, M.I.T. Media Laboratory Perceptual Computing Section, Technical Report No. 441, MIT Media Laboratory, Cambridge, USA, 1997.

4
- 33749440372
- V. Strnadová, Hádej, co říkám aneb Odezírání je nejisté umění. - Guess What I Am Talking or Lip-Reading is Uncertain Art, Ministerstvo zdravotnictví České republiky, Prague, Czech Republic, 1998.

5
- 33749433016
- Springer, Berlin
- Kricos P.B. Differences in Visual Intelligibility Across Talkers, Speechreading by Humans and Machines (1996), Springer, Berlin
- (1996) Differences in Visual Intelligibility Across Talkers, Speechreading by Humans and Machines
- Kricos, P.B.¹

6
- 85009071398
- J. Matoušek, J. Romportl, D. Tihelka, Z. Tychtl, Recent improvements on ARTIC: Czech text-to-speech system, in: Proceedings of ICSLP 2004, vol. 3, Jeju, Korea, 2004, pp. 1933-1936.

7
- 0032651722
- A hidden Markov-model-based trainable speech synthesizer
- Donovan R.E., and Woodland P.C. A hidden Markov-model-based trainable speech synthesizer. Comput. Speech Language 13 (1999) 223-241
- (1999) Comput. Speech Language , vol.13 , pp. 223-241
- Donovan, R.E.¹ Woodland, P.C.²

8
- 85009132058
- Design of speech corpus for text-to-speech synthesis
- Ålborg, Denmark
- Matoušek J., Psutka J., and Krůta J. Design of speech corpus for text-to-speech synthesis. Proceedings of EUROSPEECH 2001 vol. 3 (2001), Ålborg, Denmark 2047-2050
- (2001) Proceedings of EUROSPEECH 2001 , vol.3 , pp. 2047-2050
- Matoušek, J.¹ Psutka, J.² Krůta, J.³

9
- 85009152114
- Automatic segmentation for Czech concatenative speech synthesis using statistical approach with boundary-specific correction
- Geneva, Switzerland
- Matoušek J., Tihelka D., and Psutka J. Automatic segmentation for Czech concatenative speech synthesis using statistical approach with boundary-specific correction. Proceedings of EUROSPEECH 2003. Geneva, Switzerland (2003) 301-304
- (2003) Proceedings of EUROSPEECH 2003 , pp. 301-304
- Matoušek, J.¹ Tihelka, D.² Psutka, J.³

10
- 22944490076
- Prosody model and its application to Czech TTS system
- Kijiv, Ukraine
- Romportl J., Matoušek J., and Tihelka D. Prosody model and its application to Czech TTS system. Proceedings of UKROBRAZ 2002. Kijiv, Ukraine (2002) 93-96
- (2002) Proceedings of UKROBRAZ 2002 , pp. 93-96
- Romportl, J.¹ Matoušek, J.² Tihelka, D.³

11
- 22944437142
- Advanced prosody modelling
- Bonn, Heidelberg, Springer, Berlin
- Romportl J., and Matoušek J. Advanced prosody modelling. Proceedings of TSD 2004. Bonn, Heidelberg (2004), Springer, Berlin 441-447
- (2004) Proceedings of TSD 2004 , pp. 441-447
- Romportl, J.¹ Matoušek, J.²

12
- 84936862571
- The design of Czech language formal listening tests for the evaluation of TTS systems
- Lisbon, Portugal
- Tihelka D., and Matoušek J. The design of Czech language formal listening tests for the evaluation of TTS systems. Proceedings of LREC 2004. Lisbon, Portugal (2004) 2099-2102
- (2004) Proceedings of LREC 2004 , pp. 2099-2102
- Tihelka, D.¹ Matoušek, J.²

13
- 33745213550
- Symbolic prosody driven unit selection for highly natural synthetic speech
- Lisbon, Portugal
- Tihelka D. Symbolic prosody driven unit selection for highly natural synthetic speech. Proceedings of EUROSPEECH 2005. Lisbon, Portugal (2005) 2525-2528
- (2005) Proceedings of EUROSPEECH 2005 , pp. 2525-2528
- Tihelka, D.¹

14
- 0031625721
- Text-to-visual speech synthesis based on parameter generation from HMM
- Seattle, USA
- Masuko T., Kobayashi T., Tamura M., Masubuchi J., and Tokuda K. Text-to-visual speech synthesis based on parameter generation from HMM. Proceedings of ICASSP 1998. Seattle, USA (1998)
- (1998) Proceedings of ICASSP 1998
- Masuko, T.¹ Kobayashi, T.² Tamura, M.³ Masubuchi, J.⁴ Tokuda, K.⁵

15
- 85133460248
- Visual speech synthesis based on parameter generation from HMM: speech-driven and text-and-speech-driven approaches
- Sydney, Australia
- Tamura M., Masuko T., Kobayashi T., and Tokuda K. Visual speech synthesis based on parameter generation from HMM: speech-driven and text-and-speech-driven approaches. Proceedings of AVSP 1998. Sydney, Australia (1998)
- (1998) Proceedings of AVSP 1998
- Tamura, M.¹ Masuko, T.² Kobayashi, T.³ Tokuda, K.⁴

16
- 0006455820
- Generation of lip-synched synthetic faces from phonetically clustered face movement data
- Sydney, Australia
- Galanes F.M., Unverferth J., Arslan L., and Talkin D. Generation of lip-synched synthetic faces from phonetically clustered face movement data. Proceedings of AVSP 1998. Sydney, Australia (1998)
- (1998) Proceedings of AVSP 1998
- Galanes, F.M.¹ Unverferth, J.² Arslan, L.³ Talkin, D.⁴

17
- 85009089413
- HMM-based text-to-audio-visual speech synthesis
- Beijing, China
- Sako S., Tokuda K., Masuko T., Kobayashi T., and Kitamura T. HMM-based text-to-audio-visual speech synthesis. Proceedings of ICSLP 2000. Beijing, China (2000)
- (2000) Proceedings of ICSLP 2000
- Sako, S.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

18
- 0003822743
- Cambridge University Press, Cambridge, UK
- Young S.J., Jansen J., Odell J.J., Ollason D., and Woodland P.C. The HTK Book (1999), Cambridge University Press, Cambridge, UK
- (1999) The HTK Book
- Young, S.J.¹ Jansen, J.² Odell, J.J.³ Ollason, D.⁴ Woodland, P.C.⁵

19
- 0020068630
- Anticipatory labial coarticulation: experimental, biological, and linguistic variables
- Lubker J., and Gay T. Anticipatory labial coarticulation: experimental, biological, and linguistic variables. J. Acoust. Soc. Amer. 71 (1982) 437-448
- (1982) J. Acoust. Soc. Amer. , vol.71 , pp. 437-448
- Lubker, J.¹ Gay, T.²

20
- 0025687878
- Coarticulatory organization for lip rounding in Turkish and English
- Boyce S.E. Coarticulatory organization for lip rounding in Turkish and English. J. Acoust. Soc. Amer. 88 6 (1990) 2584-2595
- (1990) J. Acoust. Soc. Amer. , vol.88 , Issue.6 , pp. 2584-2595
- Boyce, S.E.¹

21
- 33749442733
- Coarticulation modeling for the Czech audio-visual speech synthesis
- Liberec, Czech Republic
- Krňoul Z., and Železný M. Coarticulation modeling for the Czech audio-visual speech synthesis. Sixth International Workshop on Electronics, Control, Measurement and Signals, ECMS 2003. Liberec, Czech Republic (2003) 64-68
- (2003) Sixth International Workshop on Electronics, Control, Measurement and Signals, ECMS 2003 , pp. 64-68
- Krňoul, Z.¹ Železný, M.²

22
- 0003116759
- Speech as audible gestures
- Kluwer Academic Press, Dordrecht, Netherlands
- Löfqvist A. Speech as audible gestures. Speech Production and Speech Modelling (1990), Kluwer Academic Press, Dordrecht, Netherlands 289-322
- (1990) Speech Production and Speech Modelling , pp. 289-322
- Löfqvist, A.¹

23
- 0001514782
- Modeling coarticulation in synthetic visual speech
- Springer, Tokyo, Japan
- Cohen M.M., and Massaro D.W. Modeling coarticulation in synthetic visual speech. Models and Techniques in Computer Animation (1993), Springer, Tokyo, Japan 139-156
- (1993) Models and Techniques in Computer Animation , pp. 139-156
- Cohen, M.M.¹ Massaro, D.W.²

24
- 33749452937
- Neural Network Simulator 1.1, University of Tübingen 〈http://www-ra.informatik.uni-tuebingen.de/SNNS/〉.

25
- 29844454693
- C.H. Beck, Prague, Czech Republic
- Novák M., et al. Umělé neuronové sítě: teorie a aplikace (1998), C.H. Beck, Prague, Czech Republic
- (1998) Umělé neuronové sítě: teorie a aplikace
- Novák, M.¹

26
- 0000892665
- Abstract muscle action procedures for human face animation
- Magnenat-Thalmann N., Primeau E., and Thalmann D. Abstract muscle action procedures for human face animation. Visual Comput. 3 5 (1988) 290-297
- (1988) Visual Comput. , vol.3 , Issue.5 , pp. 290-297
- Magnenat-Thalmann, N.¹ Primeau, E.² Thalmann, D.³

27
- 0029182694
- Realistic modeling for facial animation
- ACM Press, New York
- Lee Y., Terzopoulos D., and Waters K. Realistic modeling for facial animation. Proceedings of the 22nd Annual Conference on Computer Graphics and Interactive Techniques (1995), ACM Press, New York 55-62
- (1995) Proceedings of the 22nd Annual Conference on Computer Graphics and Interactive Techniques , pp. 55-62
- Lee, Y.¹ Terzopoulos, D.² Waters, K.³

28
- 0033879010
- Fast head modeling for animation
- Lee W., and Magnenat-Thalmann N. Fast head modeling for animation. Image Vision Comput. 18 4 (2000) 355-364
- (2000) Image Vision Comput. , vol.18 , Issue.4 , pp. 355-364
- Lee, W.¹ Magnenat-Thalmann, N.²

29
- 0030702311
- M. Escher, N. Magnenat Thalmann, Automatic 3D cloning and real-time animation of a human face, Comput. Animation (1997) 58.

30
- 33749435880
- Cyberware Scanning Products 〈http://www.cyberware.com/products/index.html〉.

31
- 0031337918
- Reading between the lines - a method for extracting dynamic 3D with texture
- Lausanne, Switzerland, ACM Press, New York
- Proesmans M., and Van Gool L. Reading between the lines - a method for extracting dynamic 3D with texture. Proceedings of the ACM Symposium on Virtual Reality Software and Technology. Lausanne, Switzerland (1997), ACM Press, New York 95-102
- (1997) Proceedings of the ACM Symposium on Virtual Reality Software and Technology , pp. 95-102
- Proesmans, M.¹ Van Gool, L.²

32
- 0010946384
- T. Miyasaka, K. Kuroda, M. Hirose, K. Araki, Reconstruction of realistic 3D surface model and 3D animation from range images obtained by real time 3D measurement system, in: IEEE International Conference on Pattern Recognition (ICPR'00), vol. 4, Barcelona, Spain, September 2000, p. 4594.

33
- 33749426466
- Face models from uncalibrated video sequences
- Springer, Berlin
- Fua P. Face models from uncalibrated video sequences. Proceedings of the International Workshop on Modelling and Motion Capture Techniques for Virtual Environments (1998), Springer, Berlin
- (1998) Proceedings of the International Workshop on Modelling and Motion Capture Techniques for Virtual Environments
- Fua, P.¹

34
- 0345180807
- Automated modelling of real human faces for 3D animation
- Nagel B., Wingbermühle J., Weik S., and Liedtke C.E. Automated modelling of real human faces for 3D animation. ICPR 98 (1998) 693-696
- (1998) ICPR 98 , pp. 693-696
- Nagel, B.¹ Wingbermühle, J.² Weik, S.³ Liedtke, C.E.⁴

35
- 84944381462
- Automatic creation of 3D facial models
- Akimoto T., Suenaga Y., and Wallace R.S. Automatic creation of 3D facial models. IEEE Comput. Graph. Appl. 13 5 (1993) 16-22
- (1993) IEEE Comput. Graph. Appl. , vol.13 , Issue.5 , pp. 16-22
- Akimoto, T.¹ Suenaga, Y.² Wallace, R.S.³

36
- 0001027507
- Model based face reconstruction for animation
- World Scientific Press, Singapore
- Lee W., Kalra P., and Magnenat-Thalmann N. Model based face reconstruction for animation. Proceedings of the MMM'97 (1997), World Scientific Press, Singapore 323-338
- (1997) Proceedings of the MMM'97 , pp. 323-338
- Lee, W.¹ Kalra, P.² Magnenat-Thalmann, N.³

37
- 0030644092
- L. Moccozet, N. Magnenat Thalmann, Dirichlet free-form deformations and their application to hand simulation, in: Computer Animation '97, Geneva, Switzerland, June 1997.

38
- 33749453587
- Using Dirichlet free form deformation to fit deformable models to noisy 3-D data
- Springer, Berlin
- Ilic S., and Fua P. Using Dirichlet free form deformation to fit deformable models to noisy 3-D data. ECCV vol. 2351 (2002), Springer, Berlin 704-717
- (2002) ECCV , vol.2351 , pp. 704-717
- Ilic, S.¹ Fua, P.²

39
- 33749431045
- Z. Krňoul, M. Železný, P. Císař, Face model reconstruction for Czech audio-visual speech synthesis, in: Proceedings of SPECOM 2004, Saint Petersburg, Russian Federation, 2004, pp. 47-51, SPIIRAS.

40
- 22944440070
- Z. Krňoul, M. Železný, Realistic face animation for a Czech Talking Head, in: Conference on TEXT, SPEECH and DIALOGUE, TSD 2004, Springer, Berlin, 2004, pp. 603-610.

41
- 33749429528
- B. Le Goff, T. Guiard-Marigny, M. Cohen, C. Benoit, Real-time analysis-synthesis and intelligibility of talking faces, in: Second International Conference on Speech Synthesis, Newark (NY), September 1994.

42
- 33749447929
- D.W. Massaro, J. Beskow, M.M. Cohen, C.L. Fry, T. Rodriguez, Picture my voice: audio to visual speech synthesis using artificial neural networks, in: AVSP'99, Santa Cruz, CA, USA, 1999.

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.