SCOPUS 정보 검색 플랫폼

6th International Conference on Spoken Language Processing, ICSLP 2000

Volumn , Issue , 2000, Pages

Multimodal interface research: A science without borders

a OREGON HEALTH AND SCIENCE UNIVERSITY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

COMPONENT TECHNOLOGIES; CROSS FERTILIZATION; INTERNATIONAL GROUP; MEDICAL COMMUNITY; MULTI-DISCIPLINARY RESEARCH; MULTI-MODAL INTERFACES; NOCV1; RESEARCH CHALLENGES; SCIENTIFIC PROGRESS;

EID: 85009060662 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (26)

References (32)

1
- 82055174896
- Audio-visual speech recognition compared across two architectures
- Madrid, Spain
- Adjoudani, A. & Benoit, C., 1995. "Audio-visual speech recognition compared across two architectures", Proceedings of the Eurospeech Conference, Madrid, Spain, vol. 2, 1563-1566.
- (1995) Proceedings of the Eurospeech Conference , vol.2 , pp. 1563-1566
- Adjoudani, A.¹ Benoit, C.²

2
- 0030093965
- Acoustic profiles in vocal emotion expression
- Banse, R., & Scherer, K., 1996. "Acoustic profiles in vocal emotion expression", Journal of Personality and Social Psychology, 70(3), 614-636.
- (1996) Journal of Personality and Social Psychology , vol.70 , Issue.3 , pp. 614-636
- Banse, R.¹ Scherer, K.²

3
- 0032178686
- Audio-visual speech synthesis from French text: Eight years of models, designs and evaluation at the ICP
- Benoit, C. & Le Goff, B., 1998. "Audio-visual speech synthesis from French text: Eight years of models, designs and evaluation at the ICP", Speech Communication, 26, 117-129.
- (1998) Speech Communication , vol.26 , pp. 117-129
- Benoit, C.¹ Le Goff, B.²

4
- 0030362791
- For speech perception by humans or machines, three senses are better than one
- Bernstein, L. & Benoit, C., 1996. "For speech perception by humans or machines, three senses are better than one", Proceedings of the International Conference on Spoken Language Processing, vol. 3, 1477-1480.
- (1996) Proceedings of the International Conference on Spoken Language Processing , vol.3 , pp. 1477-1480
- Bernstein, L.¹ Benoit, C.²

5
- 0030359792
- Using the visual component in automatic speech recognition
- Brooke, M., 1996. "Using the visual component in automatic speech recognition", Proceedings of the International Conference on Spoken Language Processing, vol. 3, 1656-1659.
- (1996) Proceedings of the International Conference on Spoken Language Processing , vol.3 , pp. 1656-1659
- Brooke, M.¹

6
- 0002267306
- Multimodal person recognition using unconstrained audio and video
- March, Wash., DC
- Choudhury, T., Clarkson, B., Jebara, T. & Pentland, S., March 1999. "Multimodal person recognition using unconstrained audio and video", Proceedings of the 2nd International Conference on Audio-and-Video-based Biometric Person Authentication, Wash., DC, 176-81.
- (1999) Proceedings of the 2nd International Conference on Audio-and-video-based Biometric Person Authentication , pp. 176-181
- Choudhury, T.¹ Clarkson, B.² Jebara, T.³ Pentland, S.⁴

7
- 0031380441
- Quickset: Multimodal interaction for distributed applications
- ACM Press: New York
- Cohen, P. R., Johnston, M., McGee, D., Oviatt, S., Pittman, J., Smith, I., Chen, L., & Clow, J., 1997. "Quickset: Multimodal interaction for distributed applications", Proceedings of the Fifth ACM International Multimedia Conference, ACM Press: New York, 31-40.
- (1997) Proceedings of the Fifth ACM International Multimedia Conference , pp. 31-40
- Cohen, P.R.¹ Johnston, M.² McGee, D.³ Oviatt, S.⁴ Pittman, J.⁵ Smith, I.⁶ Chen, L.⁷ Clow, J.⁸

8
- 0029288202
- Speech recognition in noisy environments
- Gong, Y., 1995. "Speech recognition in noisy environments", Speech Communication, 16, 261-291.
- (1995) Speech Communication , vol.16 , pp. 261-291
- Gong, Y.¹

9
- 85009121541
- Oct. 14-16, Beijing, China
- International Conference on Multimodal Interfaces (ICMI'2000), Oct. 14-16, 2000. Beijing, China, (URL: http://www.ia.ac.cn/nlpr/ICMI2000/)
- (2000) International Conference on Multimodal Interfaces (ICMI'2000)

10
- 0032179207
- Modeling the interaction of phonemic intelligibility and lexical structure in audiovisual word recognition
- North Holland
- Iverson, P., Bernstein, L., & Auer, E., 1998. "Modeling the interaction of phonemic intelligibility and lexical structure in audiovisual word recognition", Speech Communication, 26(1-2), 45-63. North Holland.
- (1998) Speech Communication , vol.26 , Issue.1-2 , pp. 45-63
- Iverson, P.¹ Bernstein, L.² Auer, E.³

11
- 0027465491
- The Lombard reflex and its role on human listeners and automatic speech recognizers
- Junqua, J. C., 1993. "The Lombard reflex and its role on human listeners and automatic speech recognizers", Journal of the Acoustical Society of America, 93(1), 510-24.
- (1993) Journal of the Acoustical Society of America , vol.93 , Issue.1 , pp. 510-524
- Junqua, J.C.¹

12
- 0023237267
- Quantifying the contribution of vision to speech perception in noise
- McLeod, A. & Summerfield, Q., 1987. "Quantifying the contribution of vision to speech perception in noise", British Journal of Audiology, 21, 131-141.
- (1987) British Journal of Audiology , vol.21 , pp. 131-141
- McLeod, A.¹ Summerfield, Q.²

13
- 0022019614
- Intermodal timing relations and audio-visual speech recognition by normalhearing adults
- McGrath, M. & Summerfield, Q., 1985. "Intermodal timing relations and audio-visual speech recognition by normalhearing adults", Journal of the Acoustical Society of America, 77(2), 678-685.
- (1985) Journal of the Acoustical Society of America , vol.77 , Issue.2 , pp. 678-685
- McGrath, M.¹ Summerfield, Q.²

14
- 80053435138
- Studies of audiovisual speech perception using production-based animation
- Oct, Beijing China, to be presented in special session on Multimodal and Transmodal Human-Computer Interaction
- Munhall, K., Oct. 2000. "Studies of audiovisual speech perception using production-based animation", International Conference on Spoken Language Processing, Beijing China, to be presented in special session on Multimodal and Transmodal Human-Computer Interaction.
- (2000) International Conference on Spoken Language Processing
- Munhall, K.¹

15
- 85009154155
- Stream weight optimization of speech and lip image sequence for audio-visual speech recognition
- Oct, Beijing China, to be presented in special session on Multimodal and Transmodal Human-Computer Interaction
- Nakamura, S., Ito, H. & Shikano, K., Oct. 2000. "Stream weight optimization of speech and lip image sequence for audio-visual speech recognition", International Conference on Spoken Language Processing, Beijing China, to be presented in special session on Multimodal and Transmodal Human-Computer Interaction.
- (2000) International Conference on Spoken Language Processing
- Nakamura, S.¹ Ito, H.² Shikano, K.³

16
- 85009060634
- Perceptual interfaces for information interaction: Joint processing of audio and visual information for human-computer interaction
- Oct, Beijing China, to be presented in special session on Multimodal and Transmodal Human-Computer Interaction
- Neti, C., Iyengar, G., Potamianos, G. & Senior, A., Oct. 2000. "Perceptual interfaces for information interaction: Joint processing of audio and visual information for human-computer interaction", International Conference on Spoken Language Processing, Beijing China, to be presented in special session on Multimodal and Transmodal Human-Computer Interaction.
- (2000) International Conference on Spoken Language Processing
- Neti, C.¹ Iyengar, G.² Potamianos, G.³ Senior, A.⁴

17
- 0032684957
- Mutual disambiguation of recognition errors in a multimodal architecture
- ACM Press: New York
- Oviatt, S. L., 1999. "Mutual disambiguation of recognition errors in a multimodal architecture", Proceedings of the Conference on Human Factors in Computing Systems (CHI'99), ACM Press: New York, 576-583.
- (1999) Proceedings of the Conference on Human Factors in Computing Systems (CHI'99) , pp. 576-583
- Oviatt, S.L.¹

18
- 85009088524
- Multimodal signal processing in naturalistic noisy environments
- Oct, to be presented in Beijing China
- Oviatt, S. L., Oct. 2000. "Multimodal signal processing in naturalistic noisy environments", Proceedings of the International Conference on Spoken Language Processing, to be presented in Beijing China.
- (2000) Proceedings of the International Conference on Spoken Language Processing
- Oviatt, S.L.¹

19
- 0032075546
- Predicting hyperarticulate speech during human-computer error resolution
- Oviatt, S. L., MacEachern, M., & Levow, G., 1998. "Predicting hyperarticulate speech during human-computer error resolution", Speech Communication, 24, 87-110.
- (1998) Speech Communication , vol.24 , pp. 87-110
- Oviatt, S.L.¹ MacEachern, M.² Levow, G.³

20
- 0034448810
- Designing the user interface for multimodal speech and gesture applications: State-of-the-art systems and research directions
- in press, to be reprinted in J. Carroll ed. Human-Computer Interaction in the New Millennium, Addison-Wesley Press: Boston
- Oviatt, S. L., Cohen, P. R., Wu, L., Vergo, J., Duncan, L., Suhm, B., Bers, J., Holzman, T., Winograd, T., Landay, J., Larson, J. & Ferro, D., in press (2000). "Designing the user interface for multimodal speech and gesture applications: State-of-the-art systems and research directions", Human Computer Interaction, (to be reprinted in J. Carroll (ed.) Human-Computer Interaction in the New Millennium, Addison-Wesley Press: Boston).
- (2000) Human Computer Interaction
- Oviatt, S.L.¹ Cohen, P.R.² Wu, L.³ Vergo, J.⁴ Duncan, L.⁵ Suhm, B.⁶ Bers, J.⁷ Holzman, T.⁸ Winograd, T.⁹ Landay, J.¹⁰ Larson, J.¹¹ Ferro, D.¹²

21
- 0033879165
- Biometrics: The future of identification
- Pankanti, S., Bolle, R. M., & Jain, A. (Eds.), 2000. "Biometrics: The future of identification", Computer, 33(2), 46-80.
- (2000) Computer , vol.33 , Issue.2 , pp. 46-80
- Pankanti, S.¹ Bolle, R.M.² Jain, A.³

22
- 85009125807
- Physicians without borders/Medecins sans frontiers (URL: http://www.dwb.org/).
- Physicians Without Borders/Medecins Sans Frontiers

23
- 0024534402
- Inhibiting the lombard effect
- Pick, H. L., Siegel, G. M., Fox, P. W., Garber, S. R. & Kearney, J. K., 1989. "Inhibiting the Lombard effect", Journal of the Acoustical Society of America, 85(2), 894-900.
- (1989) Journal of the Acoustical Society of America , vol.85 , Issue.2 , pp. 894-900
- Pick, H.L.¹ Siegel, G.M.² Fox, P.W.³ Garber, S.R.⁴ Kearney, J.K.⁵

24
- 0031747741
- Complementarity and synergy in bimodal speech: Auditory, visual, and audio-visual identification of French oral vowels in noise
- Robert-Ribes, J. Schwartz, J.-L., Lallouache, T. & Escudier, P., 1998. "Complementarity and synergy in bimodal speech: Auditory, visual, and audio-visual identification of French oral vowels in noise", Journal of the Acoustical Society of America, 103(6), 3677-3689.
- (1998) Journal of the Acoustical Society of America , vol.103 , Issue.6 , pp. 3677-3689
- Robert-Ribes, J.¹ Schwartz, J.-L.² Lallouache, T.³ Escudier, P.⁴

25
- 0032180188
- Adaptive fusion of acoustic and visual sources for automatic speech recognition
- Rogozan, A. & Deglise, P. "Adaptive fusion of acoustic and visual sources for automatic speech recognition", Speech Communication, 26(1-2), 149-161.
- Speech Communication , vol.26 , Issue.1-2 , pp. 149-161
- Rogozan, A.¹ Deglise, P.²

26
- 0004986359
- Special issue on audio-visual speech processing
- Rubin, P., Vatikiotis-Bateson, E., & Benoit, C. (eds.), 1998. "Special issue on audio-visual speech processing", Speech Communication, 26, 1-2.
- (1998) Speech Communication , vol.26 , pp. 1-2
- Rubin, P.¹ Vatikiotis-Bateson, E.² Benoit, C.³

27
- 0003544881
- New York: Springer Verlag
- Stork, D. G., & Hennecke, M. E. (Eds.), 1995. Speechreading by Humans and Machines. New York: Springer Verlag.
- (1995) Speechreading by Humans and Machines
- Stork, D.G.¹ Hennecke, M.E.²

28
- 0001048664
- Visual contribution to speech intelligibility in noise
- Sumby, W. H. & Pollack, I., 1954. "Visual contribution to speech intelligibility in noise", Journal of the Acoustical Society of America, 26, 212-215.
- (1954) Journal of the Acoustical Society of America , vol.26 , pp. 212-215
- Sumby, W.H.¹ Pollack, I.²

29
- 0029747053
- Integrating audio and visual information to provide highly robust speech recognition
- Tomlinson, M. J., Russell, M. J. & Brooke, N. M., 1996. "Integrating audio and visual information to provide highly robust speech recognition", Proceedings of the IEEE ICASSP, 821-824.
- (1996) Proceedings of the IEEE ICASSP , pp. 821-824
- Tomlinson, M.J.¹ Russell, M.J.² Brooke, N.M.³

30
- 0041827542
- Perceptual user interfaces
- special issue
- Turk, M. & Robertson, G. (Eds.), 2000. "Perceptual user interfaces", Communications of the ACM (special issue), 43(3), 32-70.
- (2000) Communications of the ACM , vol.43 , Issue.3 , pp. 32-70
- Turk, M.¹ Robertson, G.²

31
- 0001259029
- Multimodal integration: A statistical view
- Wu, L., Oviatt, S., & Cohen, P., 1999. "Multimodal integration: A statistical view", IEEE Transactions on Multimedia, 1 (4) 334-342.
- (1999) IEEE Transactions on Multimedia , vol.1 , Issue.4 , pp. 334-342
- Wu, L.¹ Oviatt, S.² Cohen, P.³

32
- 0032662263
- Manual and gaze input cascaded (MAGIC) pointing
- ACM Press: New York
- Zhai, S., Morimoto, C., & Ihde, S., 1999. "Manual and gaze input cascaded (MAGIC) pointing", Proceedings of the Conference on Human Factors in Computing Systems (CHI'99). ACM Press: New York, 246-253.
- (1999) Proceedings of the Conference on Human Factors in Computing Systems (CHI'99) , pp. 246-253
- Zhai, S.¹ Morimoto, C.² Ihde, S.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.