SCOPUS 정보 검색 플랫폼

Hearing Eye II: Advances in the Psychology of Speechreading and Auditory-Visual Speech

Volumn , Issue , 2013, Pages 109-122

Computational aspects of visual speech: Machines that can speechread and simulate talking faces

(1) Brooke, N Michael a

a University of Bath (United Kingdom)

Author keywords

[No Author keywords available]

Indexed keywords

EID: 0039891027 PISSN: None EISSN: None Source Type: Book
DOI: 10.4324/9780203098752-13 Document Type: Chapter

Times cited : (4)

References (37)

1
- 0001432664
- On the integration of auditory and visual parameters in an HMM-based ASR
- D.G. Stork & M.E. Hennecke (Eds), Berlin: Springer
- Adjoudani, A. & Benoît, C. (1996). On the integration of auditory and visual parameters in an HMM-based ASR. In D.G. Stork & M.E. Hennecke (Eds), Speechreading by Humans and Machines. Berlin: Springer, 461-72.
- (1996) Speechreading by Humans and Machines. , pp. 461-472
- Adjoudani, A.¹ Benoît, C.²

2
- 0002186602
- A set of French visemes for speech synthesis
- G. Bailly, C. Benoit, & T.R. Sawallis (Eds.), Amsterdam: Elsevier
- Benoît, C, Lallouache, T., Mohamadi, T., & Abry, C. (1992). A set of French visemes for speech synthesis. In G. Bailly, C. Benoit, & T.R. Sawallis (Eds.), Talking machines: Theories, models and designs (pp. 485-504). Amsterdam: Elsevier.
- (1992) Talking machines: Theories, models and designs , pp. 485-504
- Benoît, C.¹ Lallouache, T.² Mohamadi, T.³ Abry, C.⁴

3
- 84991199954
- 3D position, attitude and shape input using video tracking of hands and lips
- Blake, A. & Isard, M. (1994). 3D position, attitude and shape input using video tracking of hands and lips. Computer Graphics Proceedings, Annual Conference Series (ACM), 185-92.
- (1994) Computer Graphics Proceedings, Annual Conference Series (ACM) , pp. 185-192
- Blake, A.¹ Isard, M.²

4
- 85069225711
- Surface learning with applications to lip reading
- Berkeley, California
- Bregler, C. & Omohundro, S. (1994). Surface learning with applications to lip reading. International Computer Science Institute Report TR-94-001. Berkeley, California.
- (1994) International Computer Science Institute Report TR-94-001.
- Bregler, C.¹ Omohundro, S.²

5
- 84925642898
- Computer graphics animations of speech production
- London: JA1 Press
- Brooke, N.M. (1992a). Computer graphics animations of speech production. Advances in Language, Speech and Hearing, Volume 2. London: JA1 Press, 87-134.
- (1992) Advances in Language, Speech and Hearing , vol.2 , pp. 87-134
- Brooke, N.M.¹

6
- 0012744585
- Computer graphics synthesis of talking faces
- Bailly, G., Benoit, C. & Sawallis, T.R. (Eds), Amsterdam: Elsevier
- Brooke, N.M. (1992b). Computer graphics synthesis of talking faces. In Bailly, G., Benoit, C. & Sawallis, T.R. (Eds), Talking Machines: Theories, Models and Designs. Amsterdam: Elsevier, 504-22.
- (1992) Talking Machines: Theories, Models and Designs. , pp. 504-522
- Brooke, N.M.¹

7
- 0000813366
- Talking heads and speech recognisers that can see: The computer processing of visual speech signals
- D.G. Stork & M.E. Hennecke (Eds), Berlin: Springer
- Brooke, N.M. (1996). Talking heads and speech recognisers that can see: the computer processing of visual speech signals. In D.G. Stork & M.E. Hennecke (Eds), Speechreading by Humans and Machines. Berlin: Springer, 351-73
- (1996) Speechreading by Humans and Machines. , pp. 351-373
- Brooke, N.M.¹

8
- 84925636076
- Animated computer graphics of talking faces based on stochastic models
- Brooke, N.M. & Scott, S.D. (1994a). Animated computer graphics of talking faces based on stochastic models. Proceedings of the International Symposium on Speech, Image-processing and Neural Networks, Hong Kong (IEEE), 73-6.
- (1994) Proceedings of the International Symposium on Speech, Image-processing and Neural Networks, Hong Kong (IEEE) , pp. 73-76
- Brooke, N.M.¹ Scott, S.D.²

9
- 0008571982
- PCA image coding schemes and visual speech intelligibility
- Brooke, N.M. & Scott, S.D. (1994b). PCA image coding schemes and visual speech intelligibility. Proceedings of the Institute of Acoustics (Autumn Meeting, Windermere), 16(5), 123-9.
- (1994) Proceedings of the Institute of Acoustics (Autumn Meeting, Windermere) , vol.16 , Issue.5 , pp. 123-129
- Brooke, N.M.¹ Scott, S.D.²

10
- 84926273209
- Analysis, synthesis and perception of visible articulatory movements
- Brooke, N.M. & Summerfield, A.Q. (1983). Analysis, synthesis and perception of visible articulatory movements. Journal of Phonetics, 11, 63-76.
- (1983) Journal of Phonetics , vol.11 , pp. 63-76
- Brooke, N.M.¹ Summerfield, A.Q.²

11
- 0040917422
- Visual speech intelligibility of digitally processed facial images
- Brooke, N.M. & Templeton, P.D. (1990). Visual speech intelligibility of digitally processed facial images. Proceedings of the Institute of Acoustics (Autumn Meeting, Windermere), 12(10), 483-90.
- (1990) Proceedings of the Institute of Acoustics (Autumn Meeting, Windermere) , vol.12 , Issue.10 , pp. 483-490
- Brooke, N.M.¹ Templeton, P.D.²

12
- 0002001464
- Automatic speech recognition that includes visual speech cues
- Brooke, N.M., Tomlinson, M.J. & Moore, R.K. (1994). Automatic speech recognition that includes visual speech cues. Proceedings of the Institute of Acoustics (Autumn Meeting, Windermere), 16(5), 15-22.
- (1994) Proceedings of the Institute of Acoustics (Autumn Meeting, Windermere) , vol.16 , Issue.5 , pp. 15-22
- Brooke, N.M.¹ Tomlinson, M.J.² Moore, R.K.³

13
- 0003515753
- London: Chapman & Hall
- Chatfield, C. & Collins, A.J. (1980). Introduction to Multivariate Analysis. London: Chapman & Hall.
- (1980) Introduction to Multivariate Analysis.
- Chatfield, C.¹ Collins, A.J.²

14
- 0001514782
- Modeling coarticulation in synthetic visual speech
- Thalmann, N.M. & Thalmann, D. (Eds), Tokyo: Springer-Verlag, Berlin
- Cohen, M.M. & Massaro, D.W. (1993). Modeling coarticulation in synthetic visual speech. In Thalmann, N.M. & Thalmann, D. (Eds), Computer Animation, 93, Tokyo: Springer-Verlag, Berlin, 139-56.
- (1993) Computer Animation , vol.93 , pp. 139-156
- Cohen, M.M.¹ Massaro, D.W.²

15
- 0002142055
- About brows: Emotional and conversational signals
- von Cranach, M., Foppa, K., Lepenies, W. & Ploog, D. (Eds), Cambridge: Cambridge University Press
- Ekman, P. (1979). About brows: emotional and conversational signals. In von Cranach, M., Foppa, K., Lepenies, W. & Ploog, D. (Eds), Human Ethology. Cambridge: Cambridge University Press, 169-202.
- (1979) Human Ethology. , pp. 169-202
- Ekman, P.¹

16
- 0023936027
- Learning the hidden structure of speech
- Elman, J.L. & Zipser, D. (1986). Learning the hidden structure of speech. Journal of the Acoustical Society of America, 83, 1615-26.
- (1986) Journal of the Acoustical Society of America , vol.83 , pp. 1615-1626
- Elman, J.L.¹ Zipser, D.²

17
- 0002750845
- An investigation of visible lip information to be used in automatic speech recognition
- Washington D.C.: Georgetown University
- Finn, K.I. (1986). An investigation of visible lip information to be used in automatic speech recognition. PhD. Thesis. Washington D.C.: Georgetown University.
- (1986) PhD. Thesis.
- Finn, K.I.¹

18
- 0003544881
- Visionary Speech: Looking ahead to practical speechreading systems
- D.G. Stork & M.E. Hennecke (Eds), Berlin: Springer
- Hennecke, M.E., Stork, D.G. & Prasad, K.V. (1996). Visionary Speech: looking ahead to practical speechreading systems. In D.G. Stork & M.E. Hennecke (Eds), Speechreading by Humans and Machines. Berlin: Springer, 331-50.
- (1996) Speechreading by Humans and Machines. , pp. 331-350
- Hennecke, M.E.¹ Stork, D.G.² Prasad, K.V.³

19
- 84925669906
- Time delay neural networks for articulatory estimation from speech: Suitable subjective evaluation protocols
- D.G. Stork & M.E. Hennecke (Eds), Berlin: Springer
- Lavagetto, F. & Lavagetto, P. (1996). Time delay neural networks for articulatory estimation from speech: suitable subjective evaluation protocols. In D.G. Stork & M.E. Hennecke (Eds), Speechreading by Humans and Machines. Berlin: Springer, 437-44.
- (1996) Speechreading by Humans and Machines. , pp. 437-444
- Lavagetto, F.¹ Lavagetto, P.²

20
- 0023237267
- Quantifying the contribution of vision to speech perception in noise
- MacLeod, A. & Summerfield, A.Q. (1987). Quantifying the contribution of vision to speech perception in noise. British Journal of Audiology, 21, 131-41.
- (1987) British Journal of Audiology , vol.21 , pp. 131-141
- MacLeod, A.¹ Summerfield, A.Q.²

21
- 10644227100
- Bimodal speech perception: A progress report
- D.G. Stork & M.E. Hennecke (Eds), Berlin: Springer
- Massaro, D.W. (1996). Bimodal speech perception: a progress report. In D.G. Stork & M.E. Hennecke (Eds), Speechreading by Humans and Machines. Berlin: Springer, 79-102.
- (1996) Speechreading by Humans and Machines. , pp. 79-102
- Massaro, D.W.¹

22
- 85029619676
- Visual speech recognition with stochastic networks
- Tesauro, G., Touretzky, D. & Leen, T. (Eds,), Cambridge, Mass.: MIT Press
- Movellan, J.R. (1995). Visual speech recognition with stochastic networks. In Tesauro, G., Touretzky, D. & Leen, T. (Eds,), Advances in neural information processing systems, (7). Cambridge, Mass.: MIT Press, 851-8.
- (1995) Advances in neural information processing systems , Issue.7 , pp. 851-858
- Movellan, J.R.¹

23
- 84921138344
- Speech recognition enhancement by lip information
- Nishida, S. (1986). Speech recognition enhancement by lip information. Proceedings of CHI86 (ACM), 198-204.
- (1986) Proceedings of CHI86 (ACM) , pp. 198-204
- Nishida, S.¹

24
- 0020202671
- Parametrized models for facial animation
- Parke, F.I. (1975). Parametrized models for facial animation. IEEE Computer Graphics and Applications, 2, 61-8.
- (1975) IEEE Computer Graphics and Applications , vol.2 , pp. 61-68
- Parke, F.I.¹

25
- 33749913277
- The multilayer perceptron as a tool for speech pattern processing research
- Peeling, S.M., Moore R.K. & Tomlinson, M.J. (1986). The multilayer perceptron as a tool for speech pattern processing research. Proceedings of the Institute of Acoustics (Autumn Meeting, Windermere), 8(7), 307-14.
- (1986) Proceedings of the Institute of Acoustics (Autumn Meeting, Windermere) , vol.8 , Issue.7 , pp. 307-314
- Peeling, S.M.¹ Moore, R.K.² Tomlinson, M.J.³

26
- 0021541159
- Automatic lipreading to enhance speech recognition
- Petajan, E.D. (1984). Automatic lipreading to enhance speech recognition. Proceedings of the Global Telecommunications Conference, Atlanta, Georgia (IEEE Communication Society), 265-72
- (1984) Proceedings of the Global Telecommunications Conference, Atlanta, Georgia (IEEE Communication Society) , pp. 265-272
- Petajan, E.D.¹

27
- 0024610919
- A tutorial on hidden Markov models and selected applications in speech recognition
- Rabiner, L.R. (1989). A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE, 77, 257-86.
- (1989) Proceedings of the IEEE , vol.77 , pp. 257-286
- Rabiner, L.R.¹

28
- 0004762797
- Exploiting sensor fusion architectures and stimuli complementarity in AV speech recognition
- D.G. Stork & M.E. Hennecke (Eds), Berlin: Springer
- Robert-Ribes, J., Piquemal, M., Schwartz, J-L. & Escudier, P. (1996). Exploiting sensor fusion architectures and stimuli complementarity in AV speech recognition. In D.G. Stork & M.E. Hennecke (Eds), Speechreading by Humans and Machines. Berlin: Springer, 193-210.
- (1996) Speechreading by Humans and Machines. , pp. 193-210
- Robert-Ribes, J.¹ Piquemal, M.² Schwartz, J.-L.³ Escudier, P.⁴

29
- 0002022487
- Combining visual and acoustic speech signals with a neural network improves intelligibility
- Touretzky, D.S. (Ed.), San Mateo, California: Morgan-Kaufman Publishers
- Sejnowski, T.J., Yuhas, B P., Goldstein, M.H. & Jenkins, R E. (1990). Combining visual and acoustic speech signals with a neural network improves intelligibility. In Touretzky, D.S. (Ed.), Advances in Neural Information Processing Systems (2). San Mateo, California: Morgan-Kaufman Publishers.
- (1990) Advances in Neural Information Processing Systems , Issue.2
- Sejnowski, T.J.¹ Yuhas, B.P.² Goldstein, M.H.³ Jenkins, R.E.⁴

30
- 0000051247
- Generation of mouthshapes for a synthetic talking head
- Simons, A.D. & Cox, S.J. (1990). Generation of mouthshapes for a synthetic talking head. Proceedings of the Institute of Acoustics (Autumn Meeting, Windermere), 12(10), 475-82.
- (1990) Proceedings of the Institute of Acoustics (Autumn Meeting, Windermere) , vol.12 , Issue.10 , pp. 475-482
- Simons, A.D.¹ Cox, S.J.²

31
- 0003544881
- Berlin: Springer
- Stork, D.G. & Hennecke, M.E. (Eds) (1996). Speechreading by Humans and Machines. Berlin: Springer.
- (1996) Speechreading by Humans and Machines.
- Stork, D.G.¹ Hennecke, M.E.²

32
- 85132038963
- Neural network lipreading system for improved speech recognition
- Stork, D.G., Wolff, G. & Levine, E. (1992). Neural network lipreading system for improved speech recognition. Proceedings of the International Joint Conference on Neural Networks, Baltimore (IEEE), 2, 285-95.
- (1992) Proceedings of the International Joint Conference on Neural Networks, Baltimore (IEEE) , vol.2 , pp. 285-295
- Stork, D.G.¹ Wolff, G.² Levine, E.³

33
- 0002028032
- Some preliminaries to a comprehensive account of audio-visual speech perception
- Campbell, R. & Dodd, B. (Eds), Hove, UK: Lawrence Erlbaum Associates Ltd
- Summerfield, A.Q. (1987). Some preliminaries to a comprehensive account of audio-visual speech perception. In Campbell, R. & Dodd, B. (Eds), Hearing by Eye: The Psychology of Lipreading. Hove, UK: Lawrence Erlbaum Associates Ltd.
- (1987) Hearing by Eye: The Psychology of Lipreading.
- Summerfield, A.Q.¹

34
- 0001653029
- Visual perception of phonetic gestures
- Mattingley, I.G. & Studdert-Kennedy, M. (Eds), Hillsdale, NJ: Lawrence Erlbaum Associates
- Summerfield, A.Q. (1991). Visual perception of phonetic gestures. In Mattingley, I.G. & Studdert-Kennedy, M. (Eds), Modularity and the Motor Theory of Speech Perception. Hillsdale, NJ: Lawrence Erlbaum Associates, 117-38.
- (1991) Modularity and the Motor Theory of Speech Perception. , pp. 117-138
- Summerfield, A.Q.¹

35
- 84995132396
- Physically-based facial modelling, analysis and animation
- Terzopoulos, D. & Waters, K. (1990). Physically-based facial modelling, analysis and animation. Journal of Visualisation and Computer Animation, I, 73-80.
- (1990) Journal of Visualisation and Computer Animation , vol.1 , pp. 73-80
- Terzopoulos, D.¹ Waters, K.²

36
- 0029747053
- Integrating audio and visual information to provide highly robust speech recognition
- Tomlinson, M.J., Russell, M.J. & Brooke, N.M. (1996). Integrating audio and visual information to provide highly robust speech recognition. Proceedings of ICASSP, Atlanta, Georgia (IEEE), 821-1.
- (1996) Proceedings of ICASSP, Atlanta, Georgia (IEEE) , pp. 821
- Tomlinson, M.J.¹ Russell, M.J.² Brooke, N.M.³

37
- 0026065565
- Eigenfaces for recognition
- Turk, M. & Pentland, A. (1991). Eigenfaces for recognition. Journal of Cognitive Neuroscience, 3, 71-86.
- (1991) Journal of Cognitive Neuroscience , vol.3 , pp. 71-86
- Turk, M.¹ Pentland, A.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.