메뉴 건너뛰기




Volumn , Issue , 2013, Pages 109-122

Computational aspects of visual speech: Machines that can speechread and simulate talking faces

Author keywords

[No Author keywords available]

Indexed keywords


EID: 0039891027     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.4324/9780203098752-13     Document Type: Chapter
Times cited : (4)

References (37)
  • 1
    • 0001432664 scopus 로고    scopus 로고
    • On the integration of auditory and visual parameters in an HMM-based ASR
    • D.G. Stork & M.E. Hennecke (Eds), Berlin: Springer
    • Adjoudani, A. & Benoît, C. (1996). On the integration of auditory and visual parameters in an HMM-based ASR. In D.G. Stork & M.E. Hennecke (Eds), Speechreading by Humans and Machines. Berlin: Springer, 461-72.
    • (1996) Speechreading by Humans and Machines. , pp. 461-472
    • Adjoudani, A.1    Benoît, C.2
  • 2
    • 0002186602 scopus 로고
    • A set of French visemes for speech synthesis
    • G. Bailly, C. Benoit, & T.R. Sawallis (Eds.), Amsterdam: Elsevier
    • Benoît, C, Lallouache, T., Mohamadi, T., & Abry, C. (1992). A set of French visemes for speech synthesis. In G. Bailly, C. Benoit, & T.R. Sawallis (Eds.), Talking machines: Theories, models and designs (pp. 485-504). Amsterdam: Elsevier.
    • (1992) Talking machines: Theories, models and designs , pp. 485-504
    • Benoît, C.1    Lallouache, T.2    Mohamadi, T.3    Abry, C.4
  • 5
    • 84925642898 scopus 로고
    • Computer graphics animations of speech production
    • London: JA1 Press
    • Brooke, N.M. (1992a). Computer graphics animations of speech production. Advances in Language, Speech and Hearing, Volume 2. London: JA1 Press, 87-134.
    • (1992) Advances in Language, Speech and Hearing , vol.2 , pp. 87-134
    • Brooke, N.M.1
  • 6
    • 0012744585 scopus 로고
    • Computer graphics synthesis of talking faces
    • Bailly, G., Benoit, C. & Sawallis, T.R. (Eds), Amsterdam: Elsevier
    • Brooke, N.M. (1992b). Computer graphics synthesis of talking faces. In Bailly, G., Benoit, C. & Sawallis, T.R. (Eds), Talking Machines: Theories, Models and Designs. Amsterdam: Elsevier, 504-22.
    • (1992) Talking Machines: Theories, Models and Designs. , pp. 504-522
    • Brooke, N.M.1
  • 7
    • 0000813366 scopus 로고    scopus 로고
    • Talking heads and speech recognisers that can see: The computer processing of visual speech signals
    • D.G. Stork & M.E. Hennecke (Eds), Berlin: Springer
    • Brooke, N.M. (1996). Talking heads and speech recognisers that can see: the computer processing of visual speech signals. In D.G. Stork & M.E. Hennecke (Eds), Speechreading by Humans and Machines. Berlin: Springer, 351-73
    • (1996) Speechreading by Humans and Machines. , pp. 351-373
    • Brooke, N.M.1
  • 10
    • 84926273209 scopus 로고
    • Analysis, synthesis and perception of visible articulatory movements
    • Brooke, N.M. & Summerfield, A.Q. (1983). Analysis, synthesis and perception of visible articulatory movements. Journal of Phonetics, 11, 63-76.
    • (1983) Journal of Phonetics , vol.11 , pp. 63-76
    • Brooke, N.M.1    Summerfield, A.Q.2
  • 14
    • 0001514782 scopus 로고
    • Modeling coarticulation in synthetic visual speech
    • Thalmann, N.M. & Thalmann, D. (Eds), Tokyo: Springer-Verlag, Berlin
    • Cohen, M.M. & Massaro, D.W. (1993). Modeling coarticulation in synthetic visual speech. In Thalmann, N.M. & Thalmann, D. (Eds), Computer Animation, 93, Tokyo: Springer-Verlag, Berlin, 139-56.
    • (1993) Computer Animation , vol.93 , pp. 139-156
    • Cohen, M.M.1    Massaro, D.W.2
  • 15
    • 0002142055 scopus 로고
    • About brows: Emotional and conversational signals
    • von Cranach, M., Foppa, K., Lepenies, W. & Ploog, D. (Eds), Cambridge: Cambridge University Press
    • Ekman, P. (1979). About brows: emotional and conversational signals. In von Cranach, M., Foppa, K., Lepenies, W. & Ploog, D. (Eds), Human Ethology. Cambridge: Cambridge University Press, 169-202.
    • (1979) Human Ethology. , pp. 169-202
    • Ekman, P.1
  • 17
    • 0002750845 scopus 로고
    • An investigation of visible lip information to be used in automatic speech recognition
    • Washington D.C.: Georgetown University
    • Finn, K.I. (1986). An investigation of visible lip information to be used in automatic speech recognition. PhD. Thesis. Washington D.C.: Georgetown University.
    • (1986) PhD. Thesis.
    • Finn, K.I.1
  • 18
    • 0003544881 scopus 로고    scopus 로고
    • Visionary Speech: Looking ahead to practical speechreading systems
    • D.G. Stork & M.E. Hennecke (Eds), Berlin: Springer
    • Hennecke, M.E., Stork, D.G. & Prasad, K.V. (1996). Visionary Speech: looking ahead to practical speechreading systems. In D.G. Stork & M.E. Hennecke (Eds), Speechreading by Humans and Machines. Berlin: Springer, 331-50.
    • (1996) Speechreading by Humans and Machines. , pp. 331-350
    • Hennecke, M.E.1    Stork, D.G.2    Prasad, K.V.3
  • 19
    • 84925669906 scopus 로고    scopus 로고
    • Time delay neural networks for articulatory estimation from speech: Suitable subjective evaluation protocols
    • D.G. Stork & M.E. Hennecke (Eds), Berlin: Springer
    • Lavagetto, F. & Lavagetto, P. (1996). Time delay neural networks for articulatory estimation from speech: suitable subjective evaluation protocols. In D.G. Stork & M.E. Hennecke (Eds), Speechreading by Humans and Machines. Berlin: Springer, 437-44.
    • (1996) Speechreading by Humans and Machines. , pp. 437-444
    • Lavagetto, F.1    Lavagetto, P.2
  • 20
    • 0023237267 scopus 로고
    • Quantifying the contribution of vision to speech perception in noise
    • MacLeod, A. & Summerfield, A.Q. (1987). Quantifying the contribution of vision to speech perception in noise. British Journal of Audiology, 21, 131-41.
    • (1987) British Journal of Audiology , vol.21 , pp. 131-141
    • MacLeod, A.1    Summerfield, A.Q.2
  • 21
    • 10644227100 scopus 로고    scopus 로고
    • Bimodal speech perception: A progress report
    • D.G. Stork & M.E. Hennecke (Eds), Berlin: Springer
    • Massaro, D.W. (1996). Bimodal speech perception: a progress report. In D.G. Stork & M.E. Hennecke (Eds), Speechreading by Humans and Machines. Berlin: Springer, 79-102.
    • (1996) Speechreading by Humans and Machines. , pp. 79-102
    • Massaro, D.W.1
  • 22
    • 85029619676 scopus 로고
    • Visual speech recognition with stochastic networks
    • Tesauro, G., Touretzky, D. & Leen, T. (Eds,), Cambridge, Mass.: MIT Press
    • Movellan, J.R. (1995). Visual speech recognition with stochastic networks. In Tesauro, G., Touretzky, D. & Leen, T. (Eds,), Advances in neural information processing systems, (7). Cambridge, Mass.: MIT Press, 851-8.
    • (1995) Advances in neural information processing systems , Issue.7 , pp. 851-858
    • Movellan, J.R.1
  • 23
    • 84921138344 scopus 로고
    • Speech recognition enhancement by lip information
    • Nishida, S. (1986). Speech recognition enhancement by lip information. Proceedings of CHI86 (ACM), 198-204.
    • (1986) Proceedings of CHI86 (ACM) , pp. 198-204
    • Nishida, S.1
  • 27
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Rabiner, L.R. (1989). A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE, 77, 257-86.
    • (1989) Proceedings of the IEEE , vol.77 , pp. 257-286
    • Rabiner, L.R.1
  • 28
    • 0004762797 scopus 로고    scopus 로고
    • Exploiting sensor fusion architectures and stimuli complementarity in AV speech recognition
    • D.G. Stork & M.E. Hennecke (Eds), Berlin: Springer
    • Robert-Ribes, J., Piquemal, M., Schwartz, J-L. & Escudier, P. (1996). Exploiting sensor fusion architectures and stimuli complementarity in AV speech recognition. In D.G. Stork & M.E. Hennecke (Eds), Speechreading by Humans and Machines. Berlin: Springer, 193-210.
    • (1996) Speechreading by Humans and Machines. , pp. 193-210
    • Robert-Ribes, J.1    Piquemal, M.2    Schwartz, J.-L.3    Escudier, P.4
  • 29
    • 0002022487 scopus 로고
    • Combining visual and acoustic speech signals with a neural network improves intelligibility
    • Touretzky, D.S. (Ed.), San Mateo, California: Morgan-Kaufman Publishers
    • Sejnowski, T.J., Yuhas, B P., Goldstein, M.H. & Jenkins, R E. (1990). Combining visual and acoustic speech signals with a neural network improves intelligibility. In Touretzky, D.S. (Ed.), Advances in Neural Information Processing Systems (2). San Mateo, California: Morgan-Kaufman Publishers.
    • (1990) Advances in Neural Information Processing Systems , Issue.2
    • Sejnowski, T.J.1    Yuhas, B.P.2    Goldstein, M.H.3    Jenkins, R.E.4
  • 33
    • 0002028032 scopus 로고
    • Some preliminaries to a comprehensive account of audio-visual speech perception
    • Campbell, R. & Dodd, B. (Eds), Hove, UK: Lawrence Erlbaum Associates Ltd
    • Summerfield, A.Q. (1987). Some preliminaries to a comprehensive account of audio-visual speech perception. In Campbell, R. & Dodd, B. (Eds), Hearing by Eye: The Psychology of Lipreading. Hove, UK: Lawrence Erlbaum Associates Ltd.
    • (1987) Hearing by Eye: The Psychology of Lipreading.
    • Summerfield, A.Q.1
  • 34
    • 0001653029 scopus 로고
    • Visual perception of phonetic gestures
    • Mattingley, I.G. & Studdert-Kennedy, M. (Eds), Hillsdale, NJ: Lawrence Erlbaum Associates
    • Summerfield, A.Q. (1991). Visual perception of phonetic gestures. In Mattingley, I.G. & Studdert-Kennedy, M. (Eds), Modularity and the Motor Theory of Speech Perception. Hillsdale, NJ: Lawrence Erlbaum Associates, 117-38.
    • (1991) Modularity and the Motor Theory of Speech Perception. , pp. 117-138
    • Summerfield, A.Q.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.