-
1
-
-
82055174896
-
Audio-visual speech recognition compared across two architectures
-
Madrid, Spain
-
Adjoudani, A. & Benoit, C., 1995. "Audio-visual speech recognition compared across two architectures", Proceedings of the Eurospeech Conference, Madrid, Spain, vol. 2, 1563-1566.
-
(1995)
Proceedings of the Eurospeech Conference
, vol.2
, pp. 1563-1566
-
-
Adjoudani, A.1
Benoit, C.2
-
2
-
-
0030093965
-
Acoustic profiles in vocal emotion expression
-
Banse, R., & Scherer, K., 1996. "Acoustic profiles in vocal emotion expression", Journal of Personality and Social Psychology, 70(3), 614-636.
-
(1996)
Journal of Personality and Social Psychology
, vol.70
, Issue.3
, pp. 614-636
-
-
Banse, R.1
Scherer, K.2
-
3
-
-
0032178686
-
Audio-visual speech synthesis from French text: Eight years of models, designs and evaluation at the ICP
-
Benoit, C. & Le Goff, B., 1998. "Audio-visual speech synthesis from French text: Eight years of models, designs and evaluation at the ICP", Speech Communication, 26, 117-129.
-
(1998)
Speech Communication
, vol.26
, pp. 117-129
-
-
Benoit, C.1
Le Goff, B.2
-
4
-
-
0030362791
-
For speech perception by humans or machines, three senses are better than one
-
Bernstein, L. & Benoit, C., 1996. "For speech perception by humans or machines, three senses are better than one", Proceedings of the International Conference on Spoken Language Processing, vol. 3, 1477-1480.
-
(1996)
Proceedings of the International Conference on Spoken Language Processing
, vol.3
, pp. 1477-1480
-
-
Bernstein, L.1
Benoit, C.2
-
6
-
-
0002267306
-
Multimodal person recognition using unconstrained audio and video
-
March, Wash., DC
-
Choudhury, T., Clarkson, B., Jebara, T. & Pentland, S., March 1999. "Multimodal person recognition using unconstrained audio and video", Proceedings of the 2nd International Conference on Audio-and-Video-based Biometric Person Authentication, Wash., DC, 176-81.
-
(1999)
Proceedings of the 2nd International Conference on Audio-and-video-based Biometric Person Authentication
, pp. 176-181
-
-
Choudhury, T.1
Clarkson, B.2
Jebara, T.3
Pentland, S.4
-
7
-
-
0031380441
-
Quickset: Multimodal interaction for distributed applications
-
ACM Press: New York
-
Cohen, P. R., Johnston, M., McGee, D., Oviatt, S., Pittman, J., Smith, I., Chen, L., & Clow, J., 1997. "Quickset: Multimodal interaction for distributed applications", Proceedings of the Fifth ACM International Multimedia Conference, ACM Press: New York, 31-40.
-
(1997)
Proceedings of the Fifth ACM International Multimedia Conference
, pp. 31-40
-
-
Cohen, P.R.1
Johnston, M.2
McGee, D.3
Oviatt, S.4
Pittman, J.5
Smith, I.6
Chen, L.7
Clow, J.8
-
8
-
-
0029288202
-
Speech recognition in noisy environments
-
Gong, Y., 1995. "Speech recognition in noisy environments", Speech Communication, 16, 261-291.
-
(1995)
Speech Communication
, vol.16
, pp. 261-291
-
-
Gong, Y.1
-
10
-
-
0032179207
-
Modeling the interaction of phonemic intelligibility and lexical structure in audiovisual word recognition
-
North Holland
-
Iverson, P., Bernstein, L., & Auer, E., 1998. "Modeling the interaction of phonemic intelligibility and lexical structure in audiovisual word recognition", Speech Communication, 26(1-2), 45-63. North Holland.
-
(1998)
Speech Communication
, vol.26
, Issue.1-2
, pp. 45-63
-
-
Iverson, P.1
Bernstein, L.2
Auer, E.3
-
11
-
-
0027465491
-
The Lombard reflex and its role on human listeners and automatic speech recognizers
-
Junqua, J. C., 1993. "The Lombard reflex and its role on human listeners and automatic speech recognizers", Journal of the Acoustical Society of America, 93(1), 510-24.
-
(1993)
Journal of the Acoustical Society of America
, vol.93
, Issue.1
, pp. 510-524
-
-
Junqua, J.C.1
-
12
-
-
0023237267
-
Quantifying the contribution of vision to speech perception in noise
-
McLeod, A. & Summerfield, Q., 1987. "Quantifying the contribution of vision to speech perception in noise", British Journal of Audiology, 21, 131-141.
-
(1987)
British Journal of Audiology
, vol.21
, pp. 131-141
-
-
McLeod, A.1
Summerfield, Q.2
-
13
-
-
0022019614
-
Intermodal timing relations and audio-visual speech recognition by normalhearing adults
-
McGrath, M. & Summerfield, Q., 1985. "Intermodal timing relations and audio-visual speech recognition by normalhearing adults", Journal of the Acoustical Society of America, 77(2), 678-685.
-
(1985)
Journal of the Acoustical Society of America
, vol.77
, Issue.2
, pp. 678-685
-
-
McGrath, M.1
Summerfield, Q.2
-
14
-
-
80053435138
-
Studies of audiovisual speech perception using production-based animation
-
Oct, Beijing China, to be presented in special session on Multimodal and Transmodal Human-Computer Interaction
-
Munhall, K., Oct. 2000. "Studies of audiovisual speech perception using production-based animation", International Conference on Spoken Language Processing, Beijing China, to be presented in special session on Multimodal and Transmodal Human-Computer Interaction.
-
(2000)
International Conference on Spoken Language Processing
-
-
Munhall, K.1
-
15
-
-
85009154155
-
Stream weight optimization of speech and lip image sequence for audio-visual speech recognition
-
Oct, Beijing China, to be presented in special session on Multimodal and Transmodal Human-Computer Interaction
-
Nakamura, S., Ito, H. & Shikano, K., Oct. 2000. "Stream weight optimization of speech and lip image sequence for audio-visual speech recognition", International Conference on Spoken Language Processing, Beijing China, to be presented in special session on Multimodal and Transmodal Human-Computer Interaction.
-
(2000)
International Conference on Spoken Language Processing
-
-
Nakamura, S.1
Ito, H.2
Shikano, K.3
-
16
-
-
85009060634
-
Perceptual interfaces for information interaction: Joint processing of audio and visual information for human-computer interaction
-
Oct, Beijing China, to be presented in special session on Multimodal and Transmodal Human-Computer Interaction
-
Neti, C., Iyengar, G., Potamianos, G. & Senior, A., Oct. 2000. "Perceptual interfaces for information interaction: Joint processing of audio and visual information for human-computer interaction", International Conference on Spoken Language Processing, Beijing China, to be presented in special session on Multimodal and Transmodal Human-Computer Interaction.
-
(2000)
International Conference on Spoken Language Processing
-
-
Neti, C.1
Iyengar, G.2
Potamianos, G.3
Senior, A.4
-
19
-
-
0032075546
-
Predicting hyperarticulate speech during human-computer error resolution
-
Oviatt, S. L., MacEachern, M., & Levow, G., 1998. "Predicting hyperarticulate speech during human-computer error resolution", Speech Communication, 24, 87-110.
-
(1998)
Speech Communication
, vol.24
, pp. 87-110
-
-
Oviatt, S.L.1
MacEachern, M.2
Levow, G.3
-
20
-
-
0034448810
-
Designing the user interface for multimodal speech and gesture applications: State-of-the-art systems and research directions
-
in press, to be reprinted in J. Carroll ed. Human-Computer Interaction in the New Millennium, Addison-Wesley Press: Boston
-
Oviatt, S. L., Cohen, P. R., Wu, L., Vergo, J., Duncan, L., Suhm, B., Bers, J., Holzman, T., Winograd, T., Landay, J., Larson, J. & Ferro, D., in press (2000). "Designing the user interface for multimodal speech and gesture applications: State-of-the-art systems and research directions", Human Computer Interaction, (to be reprinted in J. Carroll (ed.) Human-Computer Interaction in the New Millennium, Addison-Wesley Press: Boston).
-
(2000)
Human Computer Interaction
-
-
Oviatt, S.L.1
Cohen, P.R.2
Wu, L.3
Vergo, J.4
Duncan, L.5
Suhm, B.6
Bers, J.7
Holzman, T.8
Winograd, T.9
Landay, J.10
Larson, J.11
Ferro, D.12
-
21
-
-
0033879165
-
Biometrics: The future of identification
-
Pankanti, S., Bolle, R. M., & Jain, A. (Eds.), 2000. "Biometrics: The future of identification", Computer, 33(2), 46-80.
-
(2000)
Computer
, vol.33
, Issue.2
, pp. 46-80
-
-
Pankanti, S.1
Bolle, R.M.2
Jain, A.3
-
23
-
-
0024534402
-
Inhibiting the lombard effect
-
Pick, H. L., Siegel, G. M., Fox, P. W., Garber, S. R. & Kearney, J. K., 1989. "Inhibiting the Lombard effect", Journal of the Acoustical Society of America, 85(2), 894-900.
-
(1989)
Journal of the Acoustical Society of America
, vol.85
, Issue.2
, pp. 894-900
-
-
Pick, H.L.1
Siegel, G.M.2
Fox, P.W.3
Garber, S.R.4
Kearney, J.K.5
-
24
-
-
0031747741
-
Complementarity and synergy in bimodal speech: Auditory, visual, and audio-visual identification of French oral vowels in noise
-
Robert-Ribes, J. Schwartz, J.-L., Lallouache, T. & Escudier, P., 1998. "Complementarity and synergy in bimodal speech: Auditory, visual, and audio-visual identification of French oral vowels in noise", Journal of the Acoustical Society of America, 103(6), 3677-3689.
-
(1998)
Journal of the Acoustical Society of America
, vol.103
, Issue.6
, pp. 3677-3689
-
-
Robert-Ribes, J.1
Schwartz, J.-L.2
Lallouache, T.3
Escudier, P.4
-
25
-
-
0032180188
-
Adaptive fusion of acoustic and visual sources for automatic speech recognition
-
Rogozan, A. & Deglise, P. "Adaptive fusion of acoustic and visual sources for automatic speech recognition", Speech Communication, 26(1-2), 149-161.
-
Speech Communication
, vol.26
, Issue.1-2
, pp. 149-161
-
-
Rogozan, A.1
Deglise, P.2
-
26
-
-
0004986359
-
Special issue on audio-visual speech processing
-
Rubin, P., Vatikiotis-Bateson, E., & Benoit, C. (eds.), 1998. "Special issue on audio-visual speech processing", Speech Communication, 26, 1-2.
-
(1998)
Speech Communication
, vol.26
, pp. 1-2
-
-
Rubin, P.1
Vatikiotis-Bateson, E.2
Benoit, C.3
-
28
-
-
0001048664
-
Visual contribution to speech intelligibility in noise
-
Sumby, W. H. & Pollack, I., 1954. "Visual contribution to speech intelligibility in noise", Journal of the Acoustical Society of America, 26, 212-215.
-
(1954)
Journal of the Acoustical Society of America
, vol.26
, pp. 212-215
-
-
Sumby, W.H.1
Pollack, I.2
-
29
-
-
0029747053
-
Integrating audio and visual information to provide highly robust speech recognition
-
Tomlinson, M. J., Russell, M. J. & Brooke, N. M., 1996. "Integrating audio and visual information to provide highly robust speech recognition", Proceedings of the IEEE ICASSP, 821-824.
-
(1996)
Proceedings of the IEEE ICASSP
, pp. 821-824
-
-
Tomlinson, M.J.1
Russell, M.J.2
Brooke, N.M.3
-
30
-
-
0041827542
-
Perceptual user interfaces
-
special issue
-
Turk, M. & Robertson, G. (Eds.), 2000. "Perceptual user interfaces", Communications of the ACM (special issue), 43(3), 32-70.
-
(2000)
Communications of the ACM
, vol.43
, Issue.3
, pp. 32-70
-
-
Turk, M.1
Robertson, G.2
-
31
-
-
0001259029
-
Multimodal integration: A statistical view
-
Wu, L., Oviatt, S., & Cohen, P., 1999. "Multimodal integration: A statistical view", IEEE Transactions on Multimedia, 1 (4) 334-342.
-
(1999)
IEEE Transactions on Multimedia
, vol.1
, Issue.4
, pp. 334-342
-
-
Wu, L.1
Oviatt, S.2
Cohen, P.3
-
32
-
-
0032662263
-
Manual and gaze input cascaded (MAGIC) pointing
-
ACM Press: New York
-
Zhai, S., Morimoto, C., & Ihde, S., 1999. "Manual and gaze input cascaded (MAGIC) pointing", Proceedings of the Conference on Human Factors in Computing Systems (CHI'99). ACM Press: New York, 246-253.
-
(1999)
Proceedings of the Conference on Human Factors in Computing Systems (CHI'99)
, pp. 246-253
-
-
Zhai, S.1
Morimoto, C.2
Ihde, S.3
|