SCOPUS 정보 검색 플랫폼

Proceedings of the IEEE

Volumn 91, Issue 9, 2003, Pages 1457-1468

User-centered modeling and evaluation of multimodal interfaces

(1) Oviatt, Sharon a

a OREGON HEALTH AND SCIENCE UNIVERSITY (United States)

Author keywords

Evaluation; High fidelity simulations; Proactive interface design; Prototyping; Task analysis; User centered modeling

Indexed keywords

COMPUTER SIMULATION; ERROR ANALYSIS; EVALUATION; GESTURE RECOGNITION; HUMAN COMPUTER INTERACTION; INTERACTIVE COMPUTER SYSTEMS; MATHEMATICAL MODELS; SOFTWARE PROTOTYPING; SPEECH COMMUNICATION; SYNCHRONIZATION; WORLD WIDE WEB;

ERROR HANDLING; HIGH FIDELITY SIMULATIONS; MULTIMODAL SYSTEMS; PROACTIVE INTERFACE DESIGN; PROTOTYPING; TASK ANALYSIS; USER-CENTERED MODELING;

INTERFACES (COMPUTER);

EID: 21244476017 PISSN: 00189219 EISSN: None Source Type: Journal
DOI: 10.1109/JPROC.2003.817127 Document Type: Conference Paper

Times cited : (70)

References (88)

1
- 0141685005
- Audio-visual and multimodal speech-based systems
- D. Gibbon, I. Mertins, and R. Moore, Eds. Boston, MA, Kluwer
- J. Benoit, C. Martin, C. Pelachaud, L. Schomaker, and B. Suhm, "Audio-visual and multimodal speech-based systems," in Handbook of Multimodal and Spoken Dialogue Systems: Resources, Terminology and Product Evaluation, D. Gibbon, I. Mertins, and R. Moore, Eds. Boston, MA, Kluwer, 2000, pp. 102-203.
- (2000) Handbook of Multimodal and Spoken Dialogue Systems: Resources, Terminology and Product Evaluation , pp. 102-203
- Benoit, J.¹ Martin, C.² Pelachaud, C.³ Schomaker, L.⁴ Suhm, B.⁵

2
- 0034448810
- Designing the user interface for multimodal speech and gesture applications; State-of-the-art systems and research directions
- S. L. Oviatt, P. R. Cohen, L. Wu, J. Vergo, L. Dunean, B. Suhm, J. Bers, T. Holzman, T. Winograd, J. Landay, J. Larson, and D. Ferro, "Designing the user interface for multimodal speech and gesture applications; State-of-the-art systems and research directions," Human Comput. Interaction, vol. 15, no. 4, pp. 263-322, 2000.
- (2000) Human Comput. Interaction , vol.15 , Issue.4 , pp. 263-322
- Oviatt, S.L.¹ Cohen, P.R.² Wu, L.³ Vergo, J.⁴ Dunean, L.⁵ Suhm, B.⁶ Bers, J.⁷ Holzman, T.⁸ Winograd, T.⁹ Landay, J.¹⁰ Larson, J.¹¹ Ferro, D.¹²

3
- 0031380441
- Quickset: Multimodal interaction for distributed applications
- P. R. Cohen, M. Johnston, D. McGee, S. L. Oviatt, J. Pittman, I. Smith, L. Chen, and J. Clow, "Quickset: Multimodal interaction for distributed applications." in Proc. 5th ACM Int. Multimedia Conf., 1997, pp. 31-40.
- (1997) Proc. 5th ACM Int. Multimedia Conf. , pp. 31-40
- Cohen, P.R.¹ Johnston, M.² McGee, D.³ Oviatt, S.L.⁴ Pittman, J.⁵ Smith, I.⁶ Chen, L.⁷ Clow, J.⁸

4
- 84943647946
- MiPad: A next-generation PDA prototype
- X. Huang, A. Acero, C. Chelba, L. Deng, D. Duchene, J. Goodman, H. Hon, D. Jacoby, L. Jiang, R. Loynd, M. Mahajan, P. Mau, S. Meredith, S. Mughal, S. Neto, M. Plumpe, K. Wang, and Y. Wang, "MiPad: A next-generation PDA prototype," in Proc. ICSLP, vol. 3, 2000, pp. 33-36.
- (2000) Proc. ICSLP , vol.3 , pp. 33-36
- Huang, X.¹ Acero, A.² Chelba, C.³ Deng, L.⁴ Duchene, D.⁵ Goodman, J.⁶ Hon, H.⁷ Jacoby, D.⁸ Jiang, L.⁹ Loynd, R.¹⁰ Mahajan, M.¹¹ Mau, P.¹² Meredith, S.¹³ Mughal, S.¹⁴ Neto, S.¹⁵ Plumpe, M.¹⁶ Wang, K.¹⁷ Wang, Y.¹⁸

5
- 82055174896
- Audio-visual speech recognition compared across two architectures
- A. Adjoudani and C. Benoit, "Audio-visual speech recognition compared across two architectures," in Proc. Eurospeech, vol. 2, 1995, pp. 1563-1566.
- (1995) Proc. Eurospeech , vol.2 , pp. 1563-1566
- Adjoudani, A.¹ Benoit, C.²

6
- 0032178686
- Audio-visual speech synthesis from french text: Eight years of models, designs and evaluation
- C. Benoit and B. Le Goff, "Audio-visual speech synthesis from french text: Eight years of models, designs and evaluation," Speech Commun., vol. 26, pp. 117-129, 1998.
- (1998) Speech Commun. , vol.26 , pp. 117-129
- Benoit, C.¹ Le Goff, B.²

7
- 85013597845
- Eigenlips for robust speech recognition
- C. Bregler and Y. Konig, "Eigenlips for robust speech recognition," in Proc. ICASSP, vol. 2, 1994, pp. 669-672.
- (1994) Proc. ICASSP , vol.2 , pp. 669-672
- Bregler, C.¹ Konig, Y.²

8
- 85032752352
- Audiovisual speech processing
- Jan.
- T. Chen. "Audiovisual speech processing." IEEE Signal Processing Mag., vol. 18, pp. 9-21, Jan. 2001.
- (2001) IEEE Signal Processing Mag. , vol.18 , pp. 9-21
- Chen, T.¹

9
- 0034270644
- Audio-visual speech modeling for continuous speech recognition
- Sept.
- S. Dupont and J. Lueitin. "Audio-visual speech modeling for continuous speech recognition." IEEE Trans. Multimedia, vol. 2, pp. 141-151, Sept. 2000.
- (2000) IEEE Trans. Multimedia , vol.2 , pp. 141-151
- Dupont, S.¹ Lueitin, J.²

10
- 0003699540
- Ph.D. dissertation, Univ. Illinois, Urbana-Champaign
- E. D. Petajan, "Automatic lipreading to enhance speech recognition," Ph.D. dissertation, Univ. Illinois, Urbana-Champaign, 1984.
- (1984) Automatic Lipreading to Enhance Speech Recognition
- Petajan, E.D.¹

11
- 4544290191
- Recent advances in the automatic recognition of audio-visual speech
- Sept.
- G. Potamianos, C. Neti, G. Gravier, A. Garg, and A. Senior. "Recent advances in the automatic recognition of audio-visual speech," Proc. IEEE, vol. 91, pp. 1306-1326, Sept. 2003.
- (2003) Proc. IEEE , vol.91 , pp. 1306-1326
- Potamianos, G.¹ Neti, C.² Gravier, G.³ Garg, A.⁴ Senior, A.⁵

12
- 0010070142
- Audiovisual sensory intergration using hidden Markov models
- D. G. Stork and M. E. Hennecke, Eds. New York: Springer-Verlag
- P. L. Silsbee and Q. Su, "Audiovisual sensory intergration using hidden Markov models," in Speechreading by Humana and Machines: Models, Systems and Applications, D. G. Stork and M. E. Hennecke, Eds. New York: Springer-Verlag, 1996, pp. 489-504.
- (1996) Speechreading by Humana and Machines: Models, Systems and Applications , pp. 489-504
- Silsbee, P.L.¹ Su, Q.²

13
- 0003544881
- New York: Springer-Verlag
- D. G. Stork and M. E. Hennecke. Eds., Speechreading by Humans and Machines. New York: Springer-Verlag, 1996.
- (1996) Speechreading by Humans and Machines
- Stork, D.G.¹ Hennecke, M.E.²

14
- 0029747053
- Integrating audio and visual information to provide highly robust speech recognition
- M. J. Tomlinson, M. J. Russell, and N. M. Brooke, "Integrating audio and visual information to provide highly robust speech recognition," in Proc: ICASSP, vol. 2, 1996, pp. 821-824.
- (1996) Proc: ICASSP , vol.2 , pp. 821-824
- Tomlinson, M.J.¹ Russell, M.J.² Brooke, N.M.³

15
- 0004986359
- Audio-visual speech processing
- P. Rubin, E. Vatikiotis-Bateson, and C. Benoit, Eds., "Audio-visual speech processing," in Speech Commun. (Special Issue), 1998, vol. 26.
- (1998) Speech Commun. (Special Issue) , vol.26
- Rubin, P.¹ Vatikiotis-Bateson, E.² Benoit, C.³

16
- 33845911698
- Enhancing virtual maintenance environments with speech understanding
- L. Duncan, W. Brown, C. Esposito, H. Holmback, and P. Xue, "Enhancing virtual maintenance environments with speech understanding." Boeing M&CT TechNet, 1999.
- (1999) Boeing M&CT TechNet
- Duncan, L.¹ Brown, W.² Esposito, C.³ Holmback, H.⁴ Xue, P.⁵

17
- 0032075723
- Toward multimodal human-computer interface
- May
- R. Sharma, V. I. Pavlovic, and T. S. Huang, "Toward multimodal human-computer interface," Proc. IEEE, vol. 86, pp. 853-860, May 1998.
- (1998) Proc. IEEE , vol.86 , pp. 853-860
- Sharma, R.¹ Pavlovic, V.I.² Huang, T.S.³

18
- 84882783850
- Architecture Machine Group, Massachusetts Inst. Technol., Cambridge
- N. Negroponte, "Report for ONR and DARPA." Architecture Machine Group, Massachusetts Inst. Technol., Cambridge, 1978.
- (1978) Report for ONR and DARPA.
- Negroponte, N.¹

19
- 0031193007
- Visual interpretation of hand gestures for human-computer interaction: A review
- July
- V. Pavlovic, R. Sharma, and T. Huang, "Visual interpretation of hand gestures for human-computer interaction: A review," IEEE Trans. Pattern Anal. Machine Intell., vol. 19, pp. 677-695, July 1997.
- (1997) IEEE Trans. Pattern Anal. Machine Intell. , vol.19 , pp. 677-695
- Pavlovic, V.¹ Sharma, R.² Huang, T.³

20
- 0032662263
- Manual and gaze input cascaded (MAGIC) pointing
- S. Zhai, C. Morimoto, and S. Ihde, "Manual and gaze input cascaded (MAGIC) pointing," in Proc. Conf Human Factors Computing Systems (CHI'99), 1999, pp. 246-253.
- (1999) Proc. Conf Human Factors Computing Systems (CHI'99) , pp. 246-253
- Zhai, S.¹ Morimoto, C.² Ihde, S.³

21
- 0005056813
- Designing conversational interfaces with multimodal interaction
- J. Bers, S. Miller, and J. Makhoul, "Designing conversational interfaces with multimodal interaction." in Proc. DARPA Workshop Broadcast News Understanding Systems, 1998, pp. 319-321.
- (1998) Proc. DARPA Workshop Broadcast News Understanding Systems , pp. 319-321
- Bers, J.¹ Miller, S.² Makhoul, J.³

22
- 0031685853
- MVIEWS: Multimodal tools for the video analyst
- A. Cheyer. "MVIEWS: Multimodal tools for the video analyst," in Proc. Int. Conf. Intelligent User Interfaces (IUI'98), 1998, pp. 55-62.
- (1998) Proc. Int. Conf. Intelligent User Interfaces (IUI'98) , pp. 55-62
- Cheyer, A.¹

23
- 0002064205
- Computer-human interface solutions for emergency medical care
- T. G. Holzman, "Computer-human interface solutions for emergency medical care." Interactions, vol. 6, no. 3, pp. 13-24, 1999.
- (1999) Interactions , vol.6 , Issue.3 , pp. 13-24
- Holzman, T.G.¹

24
- 85009060634
- Perceptual interfaces for information interaction: Joint processing of audio and visual information for human-computer interaction
- C. Neti, G. Iyengar, G. Putamianos, and A. Senior, "Perceptual interfaces for information interaction: Joint processing of audio and visual information for human-computer interaction," in Proc: ICSLP, vol. 3, 2000, pp. 11-14.
- (2000) Proc: ICSLP , vol.3 , pp. 11-14
- Neti, C.¹ Iyengar, G.² Putamianos, G.³ Senior, A.⁴

25
- 0033879165
- Guest editors' introduction: Biometrics-the future of identification
- Feb.
- S. Pankanti, R. M. Bolle, and A. Jain, "Guest editors' introduction: Biometrics-the future of identification," IEEE Computer, vol. 33, pp. 46-80, Feb. 2000.
- (2000) IEEE Computer , vol.33 , pp. 46-80
- Pankanti, S.¹ Bolle, R.M.² Jain, A.³

26
- 0035279096
- Language-based interfaces and their application for cultural tourism
- O. Slock, "Language-based interfaces and their application for cultural tourism," AI Mag., pp. 85-97, 2001.
- (2001) AI Mag. , pp. 85-97
- Slock, O.¹

27
- 21244445225
- SmartKom: Multimodal dialogs with mobile Web users
- International Forum
- W. Wahlster, "SmartKom: Multimodal dialogs with mobile Web users," in Proc. Cyber Assist Int. Symp., International Forum, 2001, pp. 33-34.
- (2001) Proc. Cyber Assist Int. Symp. , pp. 33-34
- Wahlster, W.¹

28
- 0030677453
- Multimodal interfaces for multimedia information agents
- A. Waibel, B. Suhm, M. T. Vo, and J. Yang, "Multimodal interfaces for multimedia information agents." in Proc. ICASSP, vol. 1, 1997, pp. 167-170.
- (1997) Proc. ICASSP , vol.1 , pp. 167-170
- Waibel, A.¹ Suhm, B.² Vo, M.T.³ Yang, J.⁴

29
- 0038377045
- Multimodal systems that process what comes naturally
- Mar.
- S. L. Oviatt and P. R. Cohen, "Multimodal systems that process what comes naturally," Commun. ACM, vol. 43, no. 3, pp. 45-53, Mar. 2000.
- (2000) Commun. ACM , vol.43 , Issue.3 , pp. 45-53
- Oviatt, S.L.¹ Cohen, P.R.²

30
- 85135134004
- A rapid semi-automatic simulation technique for investigating interactive speech and handwriting
- S. L. Oviatt, P. R. Cohen, M. W. Fong, and M. P. Frank, "A rapid semi-automatic simulation technique for investigating interactive speech and handwriting." in Proc. ICSLP, vol. 2, 1992, pp. 1351-1354.
- (1992) Proc. ICSLP , vol.2 , pp. 1351-1354
- Oviatt, S.L.¹ Cohen, P.R.² Fong, M.W.³ Frank, M.P.⁴

31
- 84928838853
- An analysis of behavioral organization
- W. S. Condon, "An analysis of behavioral organization," Sign Lang. Stud., vol. 58, pp. 55-88, 1988.
- (1988) Sign Lang. Stud. , vol.58 , pp. 55-88
- Condon, W.S.¹

32
- 85065273463
- Gesticulation and speech: Two aspects of the process of utterance
- M. Key, Ed. The Hague, The Netherlands: Mouton
- A. Kendon, "Gesticulation and speech: Two aspects of the process of utterance," in The Relationship of Verbal and Nonverbal Communication, M. Key, Ed. The Hague, The Netherlands: Mouton, 1980, pp. 207-227.
- (1980) The Relationship of Verbal and Nonverbal Communication , pp. 207-227
- Kendon, A.¹

33
- 0003520518
- Chicago, IL: Univ. of Chicago Press
- D. McNeill, Hand and Mind: What Gestures Reveal About Thought. Chicago, IL: Univ. of Chicago Press, 1992.
- (1992) Hand and Mind: What Gestures Reveal about Thought
- McNeill, D.¹

34
- 84888902058
- Gestural trajectory symmetries and discourse segmentation
- F. Quek, Y. Xiong, and D. McNeill, "Gestural trajectory symmetries and discourse segmentation." in Proc. ICSLP, vol. 1, 2002, pp. 185-188.
- (2002) Proc. ICSLP , vol.1 , pp. 185-188
- Quek, F.¹ Xiong, Y.² McNeill, D.³

35
- 85009265640
- Gestural spatialization in natural discourse segmentation
- F. Quek, D. McNeill, R. Bryll, and M. Harper, "Gestural spatialization in natural discourse segmentation." in Proc. ICSLP, vol. 1, 2002, pp. 189-192.
- (2002) Proc. ICSLP , vol.1 , pp. 189-192
- Quek, F.¹ McNeill, D.² Bryll, R.³ Harper, M.⁴

36
- 0032072433
- Sensory integration and specchreading by humans and machines
- D. W. Massaro and D. G. Stork, "Sensory integration and specchreading by humans and machines," Amer. Scientist, vol. 86, pp. 236-244, 1998.
- (1998) Amer. Scientist , vol.86 , pp. 236-244
- Massaro, D.W.¹ Stork, D.G.²

37
- 0022019614
- Intermodal timing relations and audio-visual speech recognition by normal-hearing adults
- M. McGrath and Q. Summerfield, "Intermodal timing relations and audio-visual speech recognition by normal-hearing adults," J. Acoust. Soc. Amer., vol. 77. no. 2. pp. 678-685, 1985.
- (1985) J. Acoust. Soc. Amer. , vol.77 , Issue.2 , pp. 678-685
- McGrath, M.¹ Summerfield, Q.²

38
- 0017199877
- Hearing lips and seeing voices
- H. McGurk and J. MacDonald, "Hearing lips and seeing voices," Nature, vol. 264, pp. 746-748, 1976.
- (1976) Nature , vol.264 , pp. 746-748
- McGurk, H.¹ MacDonald, J.²

39
- 0031747741
- Complementarity and synergy in bimodal speech: Auditory, visual, and auditory-visual identification of French oral vowels in noise
- J. Robert-Ribes, J.-L. Schwartz, T. Lallouache, and P. Escudier, "Complementarity and synergy in bimodal speech: Auditory, visual, and auditory-visual identification of French oral vowels in noise." J. Acoust. Soc. Amer., vol. 103, no. 6, pp. 3677-3689, 1998.
- (1998) J. Acoust. Soc. Amer. , vol.103 , Issue.6 , pp. 3677-3689
- Robert-Ribes, J.¹ Schwartz, J.-L.² Lallouache, T.³ Escudier, P.⁴

40
- 0041827542
- Perceptual user interfaces
- M. Turk and G. Robertson, Eds., "Perceptual user interfaces," in Commun. ACM, 2000, vol. 43, pp. 32-70.
- (2000) Commun. ACM , vol.43 , pp. 32-70
- Turk, M.¹ Robertson, G.²

41
- 23044521010
- Statistical sensor calibration for fusion of different classifiers in a biometric person recognition framework
- Heidelberg, Germany
- B. Fröba, C. Rothe, and C. Küblbeck, "Statistical sensor calibration for fusion of different classifiers in a biometric person recognition framework," in Lecture Notes in Computer Science, Multiple Classifier Systems Heidelberg, Germany, 2000, vol. 1857, pp. 362-371.
- (2000) Lecture Notes in Computer Science, Multiple Classifier Systems , vol.1857 , pp. 362-371
- Fröba, B.¹ Rothe, C.² Küblbeck, C.³

42
- 0003079516
- A multimodal biometric system using fingerprint, face and speech
- A. Jain, L. Hong, and Y. Kulkarni, "A multimodal biometric system using fingerprint, face and speech." in Proc. 2nd Int. Conf. Audio- and Video-Based Biometric Person Authentication, 1999. pp. 182-187.
- (1999) Proc. 2nd Int. Conf. Audio- and Video-based Biometric Person Authentication , pp. 182-187
- Jain, A.¹ Hong, L.² Kulkarni, Y.³

43
- 0036448934
- Learning user-specific parameters in a multibiometric system
- Rochester, NY
- A. Jain and A. Ross, "Learning user-specific parameters in a multibiometric system," presented at the Int. Conf. Image Processing (ICIP), Rochester, NY, 2002.
- (2002) Int. Conf. Image Processing (ICIP)
- Jain, A.¹ Ross, A.²

44
- 82055208315
- Information fusion in biometrics
- Heidelberg, Germany
- A. Ross, A. Jain, and J. Z. Qian, "Information fusion in biometrics," in Lecture Notes in Computer Science, Audio- and Video-Based Biometric Person Authentication Heidelberg, Germany, 2001, vol. 2091, pp. 354-359.
- (2001) Lecture Notes in Computer Science, Audio- and Video-based Biometric Person Authentication , vol.2091 , pp. 354-359
- Ross, A.¹ Jain, A.² Qian, J.Z.³

45
- 0030646107
- Integration and synchronization of input modes during multimodal human-computer interaction
- S. L. Oviatt, A. DeAngeli, and K. Kuhn, "Integration and synchronization of input modes during multimodal human-computer interaction." in Proc. Conf. Human Factors Computing Systems (CHI'97), 1997, pp. 415-422.
- (1997) Proc. Conf. Human Factors Computing Systems (CHI'97) , pp. 415-422
- Oviatt, S.L.¹ DeAngeli, A.² Kuhn, K.³

46
- 85024725304
- Wizard of Oz studies - Why and how
- N. Dahlbäck, A. Jëonsson, and L. Ahrenberg, "Wizard of Oz studies - why and how." in Proc. Int. Workshop Intelligent User Interfaces, 1992, pp. 193-200.
- (1992) Proc. Int. Workshop Intelligent User Interfaces , pp. 193-200
- Dahlbäck, N.¹ Jëonsson, A.² Ahrenberg, L.³

47
- 9444239110
- Toward a theory of organized multimodal integration patterns during human-computer interaction
- Vancouver, BC, Canada
- S. L. Oviatt, R. Collision, S. Shriver, B. Xiao, R. Wesson, R. Lunsford, and L. Carmichael, "Toward a theory of organized multimodal integration patterns during human-computer interaction," presented at the Int. Conf. Multimodal Interfaces, Vancouver, BC, Canada, 2003.
- (2003) Int. Conf. Multimodal Interfaces
- Oviatt, S.L.¹ Collision, R.² Shriver, S.³ Xiao, B.⁴ Wesson, R.⁵ Lunsford, R.⁶ Carmichael, L.⁷

48
- 84958900313
- Cambridge, MA: MIT Press
- W. J. M Levelt, Speaking: From Intentions to Articulation. Cambridge, MA: MIT Press, 1989.
- (1989) Speaking: from Intentions to Articulation
- Levelt, W.J.M.¹

49
- 0000886290
- Eye fixations and cognitive processes
- M. A. Just and P. A. Carpenter, "Eye fixations and cognitive processes," Cogn. Psychol., vol. 8, pp. 441-480, 1976.
- (1976) Cogn. Psychol. , vol.8 , pp. 441-480
- Just, M.A.¹ Carpenter, P.A.²

50
- 0032215040
- Eye movements in reading and information processing: Twenty years of research
- K. Rayner, "Eye movements in reading and information processing: Twenty years of research," Psychol. Bull., vol. 124, no. 3, pp. 372-422, 1998.
- (1998) Psychol. Bull. , vol.124 , Issue.3 , pp. 372-422
- Rayner, K.¹

51
- 84976686046
- The use of eye movements in human-computer interaction techniques: What you look at is what you get
- R. J. K. Jacob, "The use of eye movements in human-computer interaction techniques: What you look at is what you get," ACM Trans Inform. Syst., vol. 9, pp. 152-169, 1991.
- (1991) ACM Trans Inform. Syst. , vol.9 , pp. 152-169
- Jacob, R.J.K.¹

52
- 0033721958
- SUITOR: An attentive information system
- P. P. Maglio, R. Barrett, C. S. Campbell, and T. T. Selker, "SUITOR: An attentive information system." in Proc. Int. Conf. Intelligent User Interfaces (IUI 2000), 2000, pp. 169-176.
- (2000) Proc. Int. Conf. Intelligent User Interfaces (IUI 2000) , pp. 169-176
- Maglio, P.P.¹ Barrett, R.² Campbell, C.S.³ Selker, T.T.⁴

53
- 84901688143
- Speech and gestures for graphic image manipulation
- A. G. Hauptmann, "Speech and gestures for graphic image manipulation." in Proc. Conf. Human Factors Computing Systems (CHI'89), 1989, pp.241-245.
- (1989) Proc. Conf. Human Factors Computing Systems (CHI'89) , pp. 241-245
- Hauptmann, A.G.¹

54
- 0030687099
- Multimodal interactive maps: Designing for human performance
- S. L. Oviatt, "Multimodal interactive maps: Designing for human performance." Human Comput. Interaction, vol. 12, no. 1-2, pp. 93-129, 1997.
- (1997) Human Comput. Interaction , vol.12 , Issue.1-2 , pp. 93-129
- Oviatt, S.L.¹

55
- 0028783651
- The role of voice input for human-machine communication
- P. Cohen and S. L. Oviatt, "The role of voice input for human-machine communication," Proc. Nat. Acad. Sci., vol. 92, pp. 9921-9927, 1995.
- (1995) Proc. Nat. Acad. Sci. , vol.92 , pp. 9921-9927
- Cohen, P.¹ Oviatt, S.L.²

56
- 0026240713
- Discourse structure and performance efficiency in interactive and noninteractive spoken modalities
- S. L. Oviatt and P. R. Cohen, "Discourse structure and performance efficiency in interactive and noninteractive spoken modalities." Comput. Speech Lang., vol. 5, no. 4, pp. 297-326, 1991.
- (1991) Comput. Speech Lang. , vol.5 , Issue.4 , pp. 297-326
- Oviatt, S.L.¹ Cohen, P.R.²

57
- 85135322093
- Integration themes in multimodal human-computer interaction
- S. L. Oviatt and E. Olsen, "Integration themes in multimodal human-computer interaction." in Proc. ICSLP, vol. 2, 1994, pp. 551-554.
- (1994) Proc. ICSLP , vol.2 , pp. 551-554
- Oviatt, S.L.¹ Olsen, E.²

58
- 4243792067
- Ph.D. dissertation. Fredericiana University, Karlsruhe, Germany
- B. Suhm, "Multimodal interactive error recovery for non-conversational speech user interfaces," Ph.D. dissertation. Fredericiana University, Karlsruhe, Germany, 1998.
- (1998) Multimodal Interactive Error Recovery for Non-conversational Speech User Interfaces
- Suhm, B.¹

59
- 0019038072
- Put-that-there: Voice and gesture at the graphics interface
- R. A. Bolt, "Put-that-there: Voice and gesture at the graphics interface," Comput. Graph., vol. 14, no. 3, pp. 262-270, 1980.
- (1980) Comput. Graph. , vol.14 , Issue.3 , pp. 262-270
- Bolt, R.A.¹

60
- 0010128235
- Integrating simultaneous input from speech, gaze, and hand gestures
- M. Maybury, Ed. Cambridge, MA: MIT Press
- D. Koons, C. Sparrell, and K. Thorisson, "Integrating simultaneous input from speech, gaze, and hand gestures," in Intelligent Multimedia Interfaces, M. Maybury, Ed. Cambridge, MA: MIT Press, 1993, pp. 257-276.
- (1993) Intelligent Multimedia Interfaces , pp. 257-276
- Koons, D.¹ Sparrell, C.² Thorisson, K.³

61
- 85009285157
- Multimodal integration patterns in children
- B. Xiao, C. Girand, and S. L. Oviatt, "Multimodal integration patterns in children." in Proc. ICSLP, 2002. pp. 629-632.
- (2002) Proc. ICSLP , pp. 629-632
- Xiao, B.¹ Girand, C.² Oviatt, S.L.³

62
- 10844297765
- Modeling multimodal integration patterns and performance in seniors: Toward adaptive processing of individual differences
- Vancouver, BC, Canada
- B. Xiao, R. Lunsford, R. Coulston, M. Wesson, and S. L. Oviatt, "Modeling multimodal integration patterns and performance in seniors: toward adaptive processing of individual differences," presented at the Int. Conf. Multimodal Interfaces, Vancouver, BC, Canada, 2003.
- (2003) Int. Conf. Multimodal Interfaces
- Xiao, B.¹ Lunsford, R.² Coulston, R.³ Wesson, M.⁴ Oviatt, S.L.⁵

63
- 40649110141
- Spontaneous gesture and sign: A study of ASL signs co-occurring with speech
- K. Naughton, "Spontaneous gesture and sign: A study of ASL signs co-occurring with speech," in Proc. Workshop Integration Gesture Language and Speech, 1996, pp. 125-134.
- (1996) Proc. Workshop Integration Gesture Language and Speech , pp. 125-134
- Naughton, K.¹

64
- 0042401939
- How can coarticulation models account for speech sensitivity to audio-visual de synchronization?
- D. G. Stork and M. E. Hennecke, Eds. New York: Springer-Verlag
- C. Abry, M. T. Lallouache, and M. A. Cathiard, "How can coarticulation models account for speech sensitivity to audio-visual de synchronization?." in Speechreading by Humans and Machines: Models, Systemsand Applications, D. G. Stork and M. E. Hennecke, Eds. New York: Springer-Verlag, 1996, pp. 247-255.
- (1996) Speechreading by Humans and Machines: Models, Systemsand Applications , pp. 247-255
- Abry, C.¹ Lallouache, M.T.² Cathiard, M.A.³

65
- 0014036537
- Some functions of gaze direction in social interaction
- A. Kendon. "Some functions of gaze direction in social interaction," Acta Psychol., vol. 26, pp. 22-63, 1967.
- (1967) Acta Psychol. , vol.26 , pp. 22-63
- Kendon, A.¹

66
- 0034232298
- What the eyes say about speaking
- Z. M. Griffin and K. Bock, "What the eyes say about speaking." Psychol. Sci., vol. 11, no. 4, pp. 274-279, 2000.
- (2000) Psychol. Sci. , vol.11 , Issue.4 , pp. 274-279
- Griffin, Z.M.¹ Bock, K.²

67
- 0002126112
- Ten myths of multi modal interaction
- Nov.
- S. L. Oviatt, "Ten myths of multi modal interaction." Commun. ACM, vol. 42, no. 11, pp. 74-81. Nov. 1999.
- (1999) Commun. ACM , vol.42 , Issue.11 , pp. 74-81
- Oviatt, S.L.¹

68
- 21244437771
- Marina del Rey, CA
- Proc. 3rd International ACM Proceedings of the Conf. Assistive Technologies (ASSETS'98), A. I. Karshmer and M. Blattner. Eds., Marina del Rey, CA, 1998.
- (1998) Proc. 3rd International ACM Proceedings of the Conf. Assistive Technologies (ASSETS'98)
- Karshmer, A.I.¹ Blattner, M.²

69
- 21244466158
- private communication
- R. Markinson, private communication, 1993.
- (1993)
- Markinson, R.¹

70
- 0004544671
- Differences in visual intelligibility across talkers
- D. G. Stork and M. E. Hennecke, Eds. New York: Springer-Verlag
- P. B. Kricos, "Differences in visual intelligibility across talkers," in Speechreading by Humans and Machines: Models, Systems and Applications, D. G. Stork and M. E. Hennecke, Eds. New York: Springer-Verlag, 1996, pp. 43-53.
- (1996) Speechreading by Humans and Machines: Models, Systems and Applications , pp. 43-53
- Kricos, P.B.¹

71
- 0025935481
- Effect in nonenglish listeners: Few visual effects for Japanese subjects hearing Japanese syllables of high auditory intelligibility
- K. Sekiyama, Y. Tohkura, and Y. McGurk, "Effect in nonenglish listeners: Few visual effects for Japanese subjects hearing Japanese syllables of high auditory intelligibility," J. Acoust. Soc. Amer., vol. 90, pp. 1797-1805, 1991.
- (1991) J. Acoust. Soc. Amer. , vol.90 , pp. 1797-1805
- Sekiyama, K.¹ Tohkura, Y.² McGurk, Y.³

72
- 0005454347
- Perception of conflicting audio-visual speech: An examination across Spanish and German
- D. G. Stork and M. H. Hennecke, Eds. New York: Springer-Verlag
- A. Fuster-Duran, "Perception of conflicting audio-visual speech: An examination across Spanish and German," in Speechreading by Humans and Machines: Models, Systems and Applications, D. G. Stork and M. H. Hennecke, Eds. New York: Springer-Verlag, 1996, pp. 135-143.
- (1996) Speechreading by Humans and Machines: Models, Systems and Applications , pp. 135-143
- Fuster-Duran, A.¹

73
- 0003337251
- Nonverbal communication in human social interaction
- R. Hinde, Ed. Cambridge, MA: Cambridge Univ. Press
- M. Argyle, "Nonverbal communication in human social interaction," in Nonverbal Communication. R. Hinde, Ed. Cambridge, MA: Cambridge Univ. Press, 1972, pp. 243-267.
- (1972) Nonverbal Communication , pp. 243-267
- Argyle, M.¹

74
- 0020735295
- Compatibility and resource competition between modalities of input, central processing, and output
- C. D. Wickens, D. L. Sandry, and M. Vidulich, "Compatibility and resource competition between modalities of input, central processing, and output," Human Factors, vol. 25, pp. 227-248, 1983.
- (1983) Human Factors , vol.25 , pp. 227-248
- Wickens, C.D.¹ Sandry, D.L.² Vidulich, M.³

75
- 84884491122
- Synergistic use of direct manipulation and natural language
- P. Cohen, M. Dalrymple, D. Moran, and F. Pereira, "Synergistic use of direct manipulation and natural language," in Proc. Conf. Human Factors Computing Systems (CHI'89), 1989, pp. 227-234.
- (1989) Proc. Conf. Human Factors Computing Systems (CHI'89) , pp. 227-234
- Cohen, P.¹ Dalrymple, M.² Moran, D.³ Pereira, F.⁴

76
- 0028710004
- Toward interface design for human language technology: Modality and structure as determinants of linguistic complexity
- S. L. Oviatt, P. R. Cohen, P. R., and M. Q. Wang, "Toward interface design for human language technology: Modality and structure as determinants of linguistic complexity," Speech Commun., vol. 15, pp. 283-300, 1994.
- (1994) Speech Commun. , vol.15 , pp. 283-300
- Oviatt, S.L.¹ Cohen, P.R.² Wang, M.Q.³

77
- 21244444022
- Voice input as a replacement for keyboard accelerators in a mouse-based graphical editor: An empirical study
- July
- J. H. Leatherby and R. Pausch, "Voice input as a replacement for keyboard accelerators in a mouse-based graphical editor: An empirical study," J. Amer. Voice Input/Output Soc., vol. 11, no. 2, July 1992.
- (1992) J. Amer. Voice Input/Output Soc. , vol.11 , Issue.2
- Leatherby, J.H.¹ Pausch, R.²

78
- 0011391278
- The efficiency of multimodal interaction for a map-based task
- P. R. Cohen, D. R. McGee, and J. Clow. "The efficiency of multimodal interaction for a map-based task," in Proc. Language Technology-Joint Conf. (ANLP-NAACL 2000), 2000, pp. 331-338.
- (2000) Proc. Language Technology-joint Conf. (ANLP-NAACL 2000) , pp. 331-338
- Cohen, P.R.¹ McGee, D.R.² Clow, J.³

79
- 85128403506
- Referential features and linguistic indirection in multimodal language
- S. L. Oviatt and K. Kuhn, "Referential features and linguistic indirection in multimodal language," in Proc. ICSLP. vol. 2, 1998, pp. 227-280.
- (1998) Proc. ICSLP , vol.2 , pp. 227-280
- Oviatt, S.L.¹ Kuhn, K.²

80
- 0032684957
- Mutual disambiguation of recognition errors in a multimodal architecture
- S. L. Oviatt. "Mutual disambiguation of recognition errors in a multimodal architecture," in Proc. Conf. Human Factors Computing Systems (CHI'99), 1999, pp. 576-583.
- (1999) Proc. Conf. Human Factors Computing Systems (CHI'99) , pp. 576-583
- Oviatt, S.L.¹

81
- 0005073850
- Multimodal interactions in speech systems
- M. Blattner and R. Dannenberg, Eds. New York: ACM, Frontier Series
- A. Rudnicky and A. Hauptman. "Multimodal interactions in speech systems," in Multimedia Interface Design, M. Blattner and R. Dannenberg, Eds. New York: ACM. 1992, Frontier Series, pp. 147-172.
- (1992) Multimedia Interface Design , pp. 147-172
- Rudnicky, A.¹ Hauptman, A.²

82
- 77956782689
- Breaking the robustness barrier: Recent progress on the design of robust multimodal systems
- M. Zelkowitz, Ed. New York: Academic
- S. L. Oviatt, "Breaking the robustness barrier: Recent progress on the design of robust multimodal systems," in Advances in Computers, M. Zelkowitz, Ed. New York: Academic, 2002, vol. 56, pp. 305-341.
- (2002) Advances in Computers , vol.56 , pp. 305-341
- Oviatt, S.L.¹

83
- 0347663785
- Linguistic adaptations during spoken and multimodal error resolution
- S. L. Oviatt, J. Bernard, and G. Levow, "Linguistic adaptations during spoken and multimodal error resolution," Lang. Speech, vol. 41, no. 3-4, pp. 515-438, 1999.
- (1999) Lang. Speech , vol.41 , Issue.3-4 , pp. 515-1438
- Oviatt, S.L.¹ Bernard, J.² Levow, G.³

84
- 0023237267
- Quantifying the contribution of vision to speech perception in noise
- A. McLeod and Q. Summerfield, "Quantifying the contribution of vision to speech perception in noise," Br. J. Audiol., vol. 21, pp. 131-141, 1987.
- (1987) Br. J. Audiol. , vol.21 , pp. 131-141
- McLeod, A.¹ Summerfield, Q.²

85
- 0032179207
- Modeling the interaction of phonemic intelligibility and lexical structure in audiovisual word recognition
- P. Iverson, L. Bernstein, and E. Auer, "Modeling the interaction of phonemic intelligibility and lexical structure in audiovisual word recognition," Speech Commun., vol. 26, no. 1-2, pp. 45-63, 1998.
- (1998) Speech Commun. , vol.26 , Issue.1-2 , pp. 45-63
- Iverson, P.¹ Bernstein, L.² Auer, E.³

86
- 0141573559
- On the use of visual information for improving audio-based speaker recognition
- A. Senior, C. Neti, and B. Maison. "On the use of visual information for improving audio-based speaker recognition." in Proc. AuditoryVisual Speech Processing (AVSP) 1999, pp. 108-111.
- (1999) Proc. AuditoryVisual Speech Processing (AVSP) , pp. 108-111
- Senior, A.¹ Neti, C.² Maison, B.³

87
- 85009154155
- Stream weight optimization of speech and lip image sequence for audio-visual speech recognition
- S. Nakamura, H. Ito, and K. Shikano, "Stream weight optimization of speech and lip image sequence for audio-visual speech recognition," in Proc. ICSLP, vol. 3, 2000, pp. 20-24.
- (2000) Proc. ICSLP , vol.3 , pp. 20-24
- Nakamura, S.¹ Ito, H.² Shikano, K.³

88
- 85009153179
- Stream confidence estimation for audio-visual speech recognition
- G. Potamianos and C. Neti, "Stream confidence estimation for audio-visual speech recognition," in Proc. ICSLP. vol. 3, 2000, pp. 746-749.
- (2000) Proc. ICSLP , vol.3 , pp. 746-749
- Potamianos, G.¹ Neti, C.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.