-
1
-
-
0001432664
-
On the integration of auditory and visual parameters in an HMM-based ASR
-
D.G. Stork & M.E. Hennecke (Eds), Berlin: Springer
-
Adjoudani, A. & Benoît, C. (1996). On the integration of auditory and visual parameters in an HMM-based ASR. In D.G. Stork & M.E. Hennecke (Eds), Speechreading by Humans and Machines. Berlin: Springer, 461-72.
-
(1996)
Speechreading by Humans and Machines.
, pp. 461-472
-
-
Adjoudani, A.1
Benoît, C.2
-
2
-
-
0002186602
-
A set of French visemes for speech synthesis
-
G. Bailly, C. Benoit, & T.R. Sawallis (Eds.), Amsterdam: Elsevier
-
Benoît, C., Lallouache, T., Mohamadi, T., & Abry, C. (1992). A set of French visemes for speech synthesis. In G. Bailly, C. Benoit, & T.R. Sawallis (Eds.), Talking machines: Theories, models and designs (pp. 485-504). Amsterdam: Elsevier.
-
(1992)
Talking machines: Theories, models and designs
, pp. 485-504
-
-
Benoît, C.1
Lallouache, T.2
Mohamadi, T.3
Abry, C.4
-
3
-
-
84991199954
-
3D position, attitude and shape input using video tracking of hands and lips
-
Blake, A. & Isard, M. (1994). 3D position, attitude and shape input using video tracking of hands and lips. Computer Graphics Proceedings, Annual Conference Series (ACM), 185-92.
-
(1994)
Computer Graphics Proceedings, Annual Conference Series (ACM)
, pp. 185-192
-
-
Blake, A.1
Isard, M.2
-
5
-
-
84925642898
-
Computer graphics animations of speech production
-
London: JAI Press
-
Brooke, N.M. (1992a). Computer graphics animations of speech production. Advances in Language, Speech and Hearing, Volume 2. London: JAI Press, 87-134.
-
(1992)
Advances in Language, Speech and Hearing
, vol.2
, pp. 87-134
-
-
Brooke, N.M.1
-
6
-
-
0012744585
-
Computer graphics synthesis of talking faces
-
Bailly, G., Benoit, C. & Sawallis, T.R. (Eds), Amsterdam: Elsevier
-
Brooke, N.M. (1992b). Computer graphics synthesis of talking faces. In Bailly, G., Benoit, C. & Sawallis, T.R. (Eds), Talking Machines: Theories, Models and Designs. Amsterdam: Elsevier, 504-22.
-
(1992)
Talking Machines: Theories, Models and Designs.
, pp. 504-522
-
-
Brooke, N.M.1
-
7
-
-
0000813366
-
Talking heads and speech recognisers that can see: The computer processing of visual speech signals
-
D.G. Stork & M.E. Hennecke (Eds), Berlin: Springer
-
Brooke, N.M. (1996). Talking heads and speech recognisers that can see: the computer processing of visual speech signals. In D.G. Stork & M.E. Hennecke (Eds), Speechreading by Humans and Machines. Berlin: Springer, 351-73.
-
(1996)
Speechreading by Humans and Machines.
, pp. 351-373
-
-
Brooke, N.M.1
-
8
-
-
84925636076
-
Animated computer graphics of talking faces based on stochastic models
-
Brooke, N.M. & Scott, S.D. (1994a). Animated computer graphics of talking faces based on stochastic models. Proceedings of the International Symposium on Speech, Image-processing and Neural Networks, Hong Kong (IEEE), 73-6.
-
(1994)
Proceedings of the International Symposium on Speech, Image-processing and Neural Networks, Hong Kong (IEEE)
, pp. 73-76
-
-
Brooke, N.M.1
Scott, S.D.2
-
9
-
-
0008571982
-
PCA image coding schemes and visual speech intelligibility
-
Brooke, N.M. & Scott, S.D. (1994b). PCA image coding schemes and visual speech intelligibility. Proceedings of the Institute of Acoustics (Autumn Meeting, Windermere), 16(5), 123-9.
-
(1994)
Proceedings of the Institute of Acoustics (Autumn Meeting, Windermere)
, vol.16
, Issue.5
, pp. 123-129
-
-
Brooke, N.M.1
Scott, S.D.2
-
10
-
-
84926273209
-
Analysis, synthesis and perception of visible articulatory movements
-
Brooke, N.M. & Summerfield, A.Q. (1983). Analysis, synthesis and perception of visible articulatory movements. Journal of Phonetics, 11, 63-76.
-
(1983)
Journal of Phonetics
, vol.11
, pp. 63-76
-
-
Brooke, N.M.1
Summerfield, A.Q.2
-
11
-
-
0040917422
-
Visual speech intelligibility of digitally processed facial images
-
Brooke, N.M. & Templeton, P.D. (1990). Visual speech intelligibility of digitally processed facial images. Proceedings of the Institute of Acoustics (Autumn Meeting, Windermere), 12(10), 483-90.
-
(1990)
Proceedings of the Institute of Acoustics (Autumn Meeting, Windermere)
, vol.12
, Issue.10
, pp. 483-490
-
-
Brooke, N.M.1
Templeton, P.D.2
-
12
-
-
0002001464
-
Automatic speech recognition that includes visual speech cues
-
Brooke, N.M., Tomlinson, M.J. & Moore, R.K. (1994). Automatic speech recognition that includes visual speech cues. Proceedings of the Institute of Acoustics (Autumn Meeting, Windermere), 16(5), 15-22.
-
(1994)
Proceedings of the Institute of Acoustics (Autumn Meeting, Windermere)
, vol.16
, Issue.5
, pp. 15-22
-
-
Brooke, N.M.1
Tomlinson, M.J.2
Moore, R.K.3
-
14
-
-
0001514782
-
Modeling coarticulation in synthetic visual speech
-
Thalmann, N.M. & Thalmann, D. (Eds), Tokyo: Springer-Verlag, Berlin
-
Cohen, M.M. & Massaro, D.W. (1993). Modeling coarticulation in synthetic visual speech. In Thalmann, N.M. & Thalmann, D. (Eds), Computer Animation, 93, Tokyo: Springer-Verlag, Berlin, 139-56.
-
(1993)
Computer Animation
, vol.93
, pp. 139-156
-
-
Cohen, M.M.1
Massaro, D.W.2
-
15
-
-
0002142055
-
About brows: Emotional and conversational signals
-
von Cranach, M., Foppa, K., Lepenies, W. & Ploog, D. (Eds), Cambridge: Cambridge University Press
-
Ekman, P. (1979). About brows: emotional and conversational signals. In von Cranach, M., Foppa, K., Lepenies, W. & Ploog, D. (Eds), Human Ethology. Cambridge: Cambridge University Press, 169-202.
-
(1979)
Human Ethology.
, pp. 169-202
-
-
Ekman, P.1
-
17
-
-
0002750845
-
An investigation of visible lip information to be used in automatic speech recognition
-
Washington D.C.: Georgetown University
-
Finn, K.I. (1986). An investigation of visible lip information to be used in automatic speech recognition. PhD. Thesis. Washington D.C.: Georgetown University.
-
(1986)
PhD. Thesis.
-
-
Finn, K.I.1
-
18
-
-
0003544881
-
Visionary Speech: Looking ahead to practical speechreading systems
-
D.G. Stork & M.E. Hennecke (Eds), Berlin: Springer
-
Hennecke, M.E., Stork, D.G. & Prasad, K.V. (1996). Visionary Speech: looking ahead to practical speechreading systems. In D.G. Stork & M.E. Hennecke (Eds), Speechreading by Humans and Machines. Berlin: Springer, 331-50.
-
(1996)
Speechreading by Humans and Machines.
, pp. 331-350
-
-
Hennecke, M.E.1
Stork, D.G.2
Prasad, K.V.3
-
19
-
-
84925669906
-
Time delay neural networks for articulatory estimation from speech: Suitable subjective evaluation protocols
-
D.G. Stork & M.E. Hennecke (Eds), Berlin: Springer
-
Lavagetto, F. & Lavagetto, P. (1996). Time delay neural networks for articulatory estimation from speech: suitable subjective evaluation protocols. In D.G. Stork & M.E. Hennecke (Eds), Speechreading by Humans and Machines. Berlin: Springer, 437-44.
-
(1996)
Speechreading by Humans and Machines.
, pp. 437-444
-
-
Lavagetto, F.1
Lavagetto, P.2
-
20
-
-
0023237267
-
Quantifying the contribution of vision to speech perception in noise
-
MacLeod, A. & Summerfield, A.Q. (1987). Quantifying the contribution of vision to speech perception in noise. British Journal of Audiology, 21, 131-41.
-
(1987)
British Journal of Audiology
, vol.21
, pp. 131-141
-
-
MacLeod, A.1
Summerfield, A.Q.2
-
21
-
-
10644227100
-
Bimodal speech perception: A progress report
-
D.G. Stork & M.E. Hennecke (Eds), Berlin: Springer
-
Massaro, D.W. (1996). Bimodal speech perception: a progress report. In D.G. Stork & M.E. Hennecke (Eds), Speechreading by Humans and Machines. Berlin: Springer, 79-102.
-
(1996)
Speechreading by Humans and Machines.
, pp. 79-102
-
-
Massaro, D.W.1
-
22
-
-
85029619676
-
Visual speech recognition with stochastic networks
-
Tesauro, G., Touretzky, D. & Leen, T. (Eds), Cambridge, Mass.: MIT Press
-
Movellan, J.R. (1995). Visual speech recognition with stochastic networks. In Tesauro, G., Touretzky, D. & Leen, T. (Eds), Advances in neural information processing systems, (7). Cambridge, Mass.: MIT Press, 851-8.
-
(1995)
Advances in neural information processing systems
, Issue.7
, pp. 851-858
-
-
Movellan, J.R.1
-
23
-
-
84921138344
-
Speech recognition enhancement by lip information
-
Nishida, S. (1986). Speech recognition enhancement by lip information. Proceedings of CHI86 (ACM), 198-204.
-
(1986)
Proceedings of CHI86 (ACM)
, pp. 198-204
-
-
Nishida, S.1
-
25
-
-
33749913277
-
The multilayer perceptron as a tool for speech pattern processing research
-
Peeling, S.M., Moore, R.K. & Tomlinson, M.J. (1986). The multilayer perceptron as a tool for speech pattern processing research. Proceedings of the Institute of Acoustics (Autumn Meeting, Windermere), 8(7), 307-14.
-
(1986)
Proceedings of the Institute of Acoustics (Autumn Meeting, Windermere)
, vol.8
, Issue.7
, pp. 307-314
-
-
Peeling, S.M.1
Moore, R.K.2
Tomlinson, M.J.3
-
27
-
-
0024610919
-
A tutorial on hidden Markov models and selected applications in speech recognition
-
Rabiner, L.R. (1989). A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE, 77, 257-86.
-
(1989)
Proceedings of the IEEE
, vol.77
, pp. 257-286
-
-
Rabiner, L.R.1
-
28
-
-
0004762797
-
Exploiting sensor fusion architectures and stimuli complementarity in AV speech recognition
-
D.G. Stork & M.E. Hennecke (Eds), Berlin: Springer
-
Robert-Ribes, J., Piquemal, M., Schwartz, J-L. & Escudier, P. (1996). Exploiting sensor fusion architectures and stimuli complementarity in AV speech recognition. In D.G. Stork & M.E. Hennecke (Eds), Speechreading by Humans and Machines. Berlin: Springer, 193-210.
-
(1996)
Speechreading by Humans and Machines.
, pp. 193-210
-
-
Robert-Ribes, J.1
Piquemal, M.2
Schwartz, J.-L.3
Escudier, P.4
-
29
-
-
0002022487
-
Combining visual and acoustic speech signals with a neural network improves intelligibility
-
Touretzky, D.S. (Ed.), San Mateo, California: Morgan Kaufmann Publishers
-
Sejnowski, T.J., Yuhas, B.P., Goldstein, M.H. & Jenkins, R.E. (1990). Combining visual and acoustic speech signals with a neural network improves intelligibility. In Touretzky, D.S. (Ed.), Advances in Neural Information Processing Systems (2). San Mateo, California: Morgan Kaufmann Publishers.
-
(1990)
Advances in Neural Information Processing Systems
, Issue.2
-
-
Sejnowski, T.J.1
Yuhas, B.P.2
Goldstein, M.H.3
Jenkins, R.E.4
-
30
-
-
0000051247
-
Generation of mouthshapes for a synthetic talking head
-
Simons, A.D. & Cox, S.J. (1990). Generation of mouthshapes for a synthetic talking head. Proceedings of the Institute of Acoustics (Autumn Meeting, Windermere), 12(10), 475-82.
-
(1990)
Proceedings of the Institute of Acoustics (Autumn Meeting, Windermere)
, vol.12
, Issue.10
, pp. 475-482
-
-
Simons, A.D.1
Cox, S.J.2
-
32
-
-
85132038963
-
Neural network lipreading system for improved speech recognition
-
Stork, D.G., Wolff, G. & Levine, E. (1992). Neural network lipreading system for improved speech recognition. Proceedings of the International Joint Conference on Neural Networks, Baltimore (IEEE), 2, 285-95.
-
(1992)
Proceedings of the International Joint Conference on Neural Networks, Baltimore (IEEE)
, vol.2
, pp. 285-295
-
-
Stork, D.G.1
Wolff, G.2
Levine, E.3
-
33
-
-
0002028032
-
Some preliminaries to a comprehensive account of audio-visual speech perception
-
Campbell, R. & Dodd, B. (Eds), Hove, UK: Lawrence Erlbaum Associates Ltd
-
Summerfield, A.Q. (1987). Some preliminaries to a comprehensive account of audio-visual speech perception. In Campbell, R. & Dodd, B. (Eds), Hearing by Eye: The Psychology of Lipreading. Hove, UK: Lawrence Erlbaum Associates Ltd.
-
(1987)
Hearing by Eye: The Psychology of Lipreading.
-
-
Summerfield, A.Q.1
-
34
-
-
0001653029
-
Visual perception of phonetic gestures
-
Mattingly, I.G. & Studdert-Kennedy, M. (Eds), Hillsdale, NJ: Lawrence Erlbaum Associates
-
Summerfield, A.Q. (1991). Visual perception of phonetic gestures. In Mattingly, I.G. & Studdert-Kennedy, M. (Eds), Modularity and the Motor Theory of Speech Perception. Hillsdale, NJ: Lawrence Erlbaum Associates, 117-38.
-
(1991)
Modularity and the Motor Theory of Speech Perception.
, pp. 117-138
-
-
Summerfield, A.Q.1
-
35
-
-
84995132396
-
Physically-based facial modelling, analysis and animation
-
Terzopoulos, D. & Waters, K. (1990). Physically-based facial modelling, analysis and animation. Journal of Visualisation and Computer Animation, 1, 73-80.
-
(1990)
Journal of Visualisation and Computer Animation
, vol.1
, pp. 73-80
-
-
Terzopoulos, D.1
Waters, K.2
-
36
-
-
0029747053
-
Integrating audio and visual information to provide highly robust speech recognition
-
Tomlinson, M.J., Russell, M.J. & Brooke, N.M. (1996). Integrating audio and visual information to provide highly robust speech recognition. Proceedings of ICASSP, Atlanta, Georgia (IEEE), 821-1.
-
(1996)
Proceedings of ICASSP, Atlanta, Georgia (IEEE)
, pp. 821
-
-
Tomlinson, M.J.1
Russell, M.J.2
Brooke, N.M.3