



Volume 91, Issue 9, 2003, Pages 1457-1468

User-centered modeling and evaluation of multimodal interfaces

Author keywords

Evaluation; High fidelity simulations; Proactive interface design; Prototyping; Task analysis; User centered modeling

Indexed keywords

COMPUTER SIMULATION; ERROR ANALYSIS; EVALUATION; GESTURE RECOGNITION; HUMAN COMPUTER INTERACTION; INTERACTIVE COMPUTER SYSTEMS; MATHEMATICAL MODELS; SOFTWARE PROTOTYPING; SPEECH COMMUNICATION; SYNCHRONIZATION; WORLD WIDE WEB;

EID: 21244476017     PISSN: 00189219     EISSN: None     Source Type: Journal    
DOI: 10.1109/JPROC.2003.817127     Document Type: Conference Paper
Times cited: (70)

References (88)
  • 5
    • 82055174896
    • Audio-visual speech recognition compared across two architectures
    • A. Adjoudani and C. Benoit, "Audio-visual speech recognition compared across two architectures," in Proc. Eurospeech, vol. 2, 1995, pp. 1563-1566.
    • (1995) Proc. Eurospeech , vol.2 , pp. 1563-1566
    • Adjoudani, A.1    Benoit, C.2
  • 6
    • 0032178686
    • Audio-visual speech synthesis from French text: Eight years of models, designs and evaluation
    • C. Benoit and B. Le Goff, "Audio-visual speech synthesis from French text: Eight years of models, designs and evaluation," Speech Commun., vol. 26, pp. 117-129, 1998.
    • (1998) Speech Commun. , vol.26 , pp. 117-129
    • Benoit, C.1    Le Goff, B.2
  • 7
    • 85013597845
    • Eigenlips for robust speech recognition
    • C. Bregler and Y. Konig, "Eigenlips for robust speech recognition," in Proc. ICASSP, vol. 2, 1994, pp. 669-672.
    • (1994) Proc. ICASSP , vol.2 , pp. 669-672
    • Bregler, C.1    Konig, Y.2
  • 8
    • 85032752352
    • Audiovisual speech processing
    • Jan.
    • T. Chen, "Audiovisual speech processing," IEEE Signal Processing Mag., vol. 18, pp. 9-21, Jan. 2001.
    • (2001) IEEE Signal Processing Mag. , vol.18 , pp. 9-21
    • Chen, T.1
  • 9
    • 0034270644
    • Audio-visual speech modeling for continuous speech recognition
    • Sept.
    • S. Dupont and J. Luettin, "Audio-visual speech modeling for continuous speech recognition," IEEE Trans. Multimedia, vol. 2, pp. 141-151, Sept. 2000.
    • (2000) IEEE Trans. Multimedia , vol.2 , pp. 141-151
    • Dupont, S.1    Luettin, J.2
  • 11
    • 4544290191
    • Recent advances in the automatic recognition of audio-visual speech
    • Sept.
    • G. Potamianos, C. Neti, G. Gravier, A. Garg, and A. Senior, "Recent advances in the automatic recognition of audio-visual speech," Proc. IEEE, vol. 91, pp. 1306-1326, Sept. 2003.
    • (2003) Proc. IEEE , vol.91 , pp. 1306-1326
    • Potamianos, G.1    Neti, C.2    Gravier, G.3    Garg, A.4    Senior, A.5
  • 12
    • 0010070142
    • Audiovisual sensory integration using hidden Markov models
    • D. G. Stork and M. E. Hennecke, Eds. New York: Springer-Verlag
    • P. L. Silsbee and Q. Su, "Audiovisual sensory integration using hidden Markov models," in Speechreading by Humans and Machines: Models, Systems and Applications, D. G. Stork and M. E. Hennecke, Eds. New York: Springer-Verlag, 1996, pp. 489-504.
    • (1996) Speechreading by Humans and Machines: Models, Systems and Applications , pp. 489-504
    • Silsbee, P.L.1    Su, Q.2
  • 14
    • 0029747053
    • Integrating audio and visual information to provide highly robust speech recognition
    • M. J. Tomlinson, M. J. Russell, and N. M. Brooke, "Integrating audio and visual information to provide highly robust speech recognition," in Proc. ICASSP, vol. 2, 1996, pp. 821-824.
    • (1996) Proc. ICASSP , vol.2 , pp. 821-824
    • Tomlinson, M.J.1    Russell, M.J.2    Brooke, N.M.3
  • 17
    • 0032075723
    • Toward multimodal human-computer interface
    • May
    • R. Sharma, V. I. Pavlovic, and T. S. Huang, "Toward multimodal human-computer interface," Proc. IEEE, vol. 86, pp. 853-860, May 1998.
    • (1998) Proc. IEEE , vol.86 , pp. 853-860
    • Sharma, R.1    Pavlovic, V.I.2    Huang, T.S.3
  • 18
    • 84882783850
    • Architecture Machine Group, Massachusetts Inst. Technol., Cambridge
    • N. Negroponte, "Report for ONR and DARPA." Architecture Machine Group, Massachusetts Inst. Technol., Cambridge, 1978.
    • (1978) Report for ONR and DARPA.
    • Negroponte, N.1
  • 19
    • 0031193007
    • Visual interpretation of hand gestures for human-computer interaction: A review
    • July
    • V. Pavlovic, R. Sharma, and T. Huang, "Visual interpretation of hand gestures for human-computer interaction: A review," IEEE Trans. Pattern Anal. Machine Intell., vol. 19, pp. 677-695, July 1997.
    • (1997) IEEE Trans. Pattern Anal. Machine Intell. , vol.19 , pp. 677-695
    • Pavlovic, V.1    Sharma, R.2    Huang, T.3
  • 23
    • 0002064205
    • Computer-human interface solutions for emergency medical care
    • T. G. Holzman, "Computer-human interface solutions for emergency medical care," Interactions, vol. 6, no. 3, pp. 13-24, 1999.
    • (1999) Interactions , vol.6 , Issue.3 , pp. 13-24
    • Holzman, T.G.1
  • 24
    • 85009060634
    • Perceptual interfaces for information interaction: Joint processing of audio and visual information for human-computer interaction
    • C. Neti, G. Iyengar, G. Potamianos, and A. Senior, "Perceptual interfaces for information interaction: Joint processing of audio and visual information for human-computer interaction," in Proc. ICSLP, vol. 3, 2000, pp. 11-14.
    • (2000) Proc. ICSLP , vol.3 , pp. 11-14
    • Neti, C.1    Iyengar, G.2    Potamianos, G.3    Senior, A.4
  • 25
    • 0033879165
    • Guest editors' introduction: Biometrics-the future of identification
    • Feb.
    • S. Pankanti, R. M. Bolle, and A. Jain, "Guest editors' introduction: Biometrics-the future of identification," IEEE Computer, vol. 33, pp. 46-80, Feb. 2000.
    • (2000) IEEE Computer , vol.33 , pp. 46-80
    • Pankanti, S.1    Bolle, R.M.2    Jain, A.3
  • 26
    • 0035279096
    • Language-based interfaces and their application for cultural tourism
    • O. Stock, "Language-based interfaces and their application for cultural tourism," AI Mag., pp. 85-97, 2001.
    • (2001) AI Mag. , pp. 85-97
    • Stock, O.1
  • 27
    • 21244445225
    • SmartKom: Multimodal dialogs with mobile Web users
    • International Forum
    • W. Wahlster, "SmartKom: Multimodal dialogs with mobile Web users," in Proc. Cyber Assist Int. Symp., International Forum, 2001, pp. 33-34.
    • (2001) Proc. Cyber Assist Int. Symp. , pp. 33-34
    • Wahlster, W.1
  • 28
    • 0030677453
    • Multimodal interfaces for multimedia information agents
    • A. Waibel, B. Suhm, M. T. Vo, and J. Yang, "Multimodal interfaces for multimedia information agents," in Proc. ICASSP, vol. 1, 1997, pp. 167-170.
    • (1997) Proc. ICASSP , vol.1 , pp. 167-170
    • Waibel, A.1    Suhm, B.2    Vo, M.T.3    Yang, J.4
  • 29
    • 0038377045
    • Multimodal systems that process what comes naturally
    • Mar.
    • S. L. Oviatt and P. R. Cohen, "Multimodal systems that process what comes naturally," Commun. ACM, vol. 43, no. 3, pp. 45-53, Mar. 2000.
    • (2000) Commun. ACM , vol.43 , Issue.3 , pp. 45-53
    • Oviatt, S.L.1    Cohen, P.R.2
  • 30
    • 85135134004
    • A rapid semi-automatic simulation technique for investigating interactive speech and handwriting
    • S. L. Oviatt, P. R. Cohen, M. W. Fong, and M. P. Frank, "A rapid semi-automatic simulation technique for investigating interactive speech and handwriting," in Proc. ICSLP, vol. 2, 1992, pp. 1351-1354.
    • (1992) Proc. ICSLP , vol.2 , pp. 1351-1354
    • Oviatt, S.L.1    Cohen, P.R.2    Fong, M.W.3    Frank, M.P.4
  • 31
    • 84928838853
    • An analysis of behavioral organization
    • W. S. Condon, "An analysis of behavioral organization," Sign Lang. Stud., vol. 58, pp. 55-88, 1988.
    • (1988) Sign Lang. Stud. , vol.58 , pp. 55-88
    • Condon, W.S.1
  • 32
    • 85065273463
    • Gesticulation and speech: Two aspects of the process of utterance
    • M. Key, Ed. The Hague, The Netherlands: Mouton
    • A. Kendon, "Gesticulation and speech: Two aspects of the process of utterance," in The Relationship of Verbal and Nonverbal Communication, M. Key, Ed. The Hague, The Netherlands: Mouton, 1980, pp. 207-227.
    • (1980) The Relationship of Verbal and Nonverbal Communication , pp. 207-227
    • Kendon, A.1
  • 34
    • 84888902058
    • Gestural trajectory symmetries and discourse segmentation
    • F. Quek, Y. Xiong, and D. McNeill, "Gestural trajectory symmetries and discourse segmentation," in Proc. ICSLP, vol. 1, 2002, pp. 185-188.
    • (2002) Proc. ICSLP , vol.1 , pp. 185-188
    • Quek, F.1    Xiong, Y.2    McNeill, D.3
  • 35
    • 85009265640
    • Gestural spatialization in natural discourse segmentation
    • F. Quek, D. McNeill, R. Bryll, and M. Harper, "Gestural spatialization in natural discourse segmentation," in Proc. ICSLP, vol. 1, 2002, pp. 189-192.
    • (2002) Proc. ICSLP , vol.1 , pp. 189-192
    • Quek, F.1    McNeill, D.2    Bryll, R.3    Harper, M.4
  • 36
    • 0032072433
    • Sensory integration and speechreading by humans and machines
    • D. W. Massaro and D. G. Stork, "Sensory integration and speechreading by humans and machines," Amer. Scientist, vol. 86, pp. 236-244, 1998.
    • (1998) Amer. Scientist , vol.86 , pp. 236-244
    • Massaro, D.W.1    Stork, D.G.2
  • 37
    • 0022019614
    • Intermodal timing relations and audio-visual speech recognition by normal-hearing adults
    • M. McGrath and Q. Summerfield, "Intermodal timing relations and audio-visual speech recognition by normal-hearing adults," J. Acoust. Soc. Amer., vol. 77, no. 2, pp. 678-685, 1985.
    • (1985) J. Acoust. Soc. Amer. , vol.77 , Issue.2 , pp. 678-685
    • McGrath, M.1    Summerfield, Q.2
  • 38
    • 0017199877
    • Hearing lips and seeing voices
    • H. McGurk and J. MacDonald, "Hearing lips and seeing voices," Nature, vol. 264, pp. 746-748, 1976.
    • (1976) Nature , vol.264 , pp. 746-748
    • McGurk, H.1    MacDonald, J.2
  • 39
    • 0031747741
    • Complementarity and synergy in bimodal speech: Auditory, visual, and auditory-visual identification of French oral vowels in noise
    • J. Robert-Ribes, J.-L. Schwartz, T. Lallouache, and P. Escudier, "Complementarity and synergy in bimodal speech: Auditory, visual, and auditory-visual identification of French oral vowels in noise," J. Acoust. Soc. Amer., vol. 103, no. 6, pp. 3677-3689, 1998.
    • (1998) J. Acoust. Soc. Amer. , vol.103 , Issue.6 , pp. 3677-3689
    • Robert-Ribes, J.1    Schwartz, J.-L.2    Lallouache, T.3    Escudier, P.4
  • 40
    • 0041827542
    • Perceptual user interfaces
    • M. Turk and G. Robertson, Eds., "Perceptual user interfaces," in Commun. ACM, 2000, vol. 43, pp. 32-70.
    • (2000) Commun. ACM , vol.43 , pp. 32-70
    • Turk, M.1    Robertson, G.2
  • 41
    • 23044521010
    • Statistical sensor calibration for fusion of different classifiers in a biometric person recognition framework
    • Heidelberg, Germany
    • B. Fröba, C. Rothe, and C. Küblbeck, "Statistical sensor calibration for fusion of different classifiers in a biometric person recognition framework," in Lecture Notes in Computer Science, Multiple Classifier Systems. Heidelberg, Germany, 2000, vol. 1857, pp. 362-371.
    • (2000) Lecture Notes in Computer Science, Multiple Classifier Systems , vol.1857 , pp. 362-371
    • Fröba, B.1    Rothe, C.2    Küblbeck, C.3
  • 43
    • 0036448934
    • Learning user-specific parameters in a multibiometric system
    • Rochester, NY
    • A. Jain and A. Ross, "Learning user-specific parameters in a multibiometric system," presented at the Int. Conf. Image Processing (ICIP), Rochester, NY, 2002.
    • (2002) Int. Conf. Image Processing (ICIP)
    • Jain, A.1    Ross, A.2
  • 45
  • 49
    • 0000886290
    • Eye fixations and cognitive processes
    • M. A. Just and P. A. Carpenter, "Eye fixations and cognitive processes," Cogn. Psychol., vol. 8, pp. 441-480, 1976.
    • (1976) Cogn. Psychol. , vol.8 , pp. 441-480
    • Just, M.A.1    Carpenter, P.A.2
  • 50
    • 0032215040
    • Eye movements in reading and information processing: Twenty years of research
    • K. Rayner, "Eye movements in reading and information processing: Twenty years of research," Psychol. Bull., vol. 124, no. 3, pp. 372-422, 1998.
    • (1998) Psychol. Bull. , vol.124 , Issue.3 , pp. 372-422
    • Rayner, K.1
  • 51
    • 84976686046
    • The use of eye movements in human-computer interaction techniques: What you look at is what you get
    • R. J. K. Jacob, "The use of eye movements in human-computer interaction techniques: What you look at is what you get," ACM Trans. Inform. Syst., vol. 9, pp. 152-169, 1991.
    • (1991) ACM Trans. Inform. Syst. , vol.9 , pp. 152-169
    • Jacob, R.J.K.1
  • 54
    • 0030687099
    • Multimodal interactive maps: Designing for human performance
    • S. L. Oviatt, "Multimodal interactive maps: Designing for human performance," Human Comput. Interaction, vol. 12, no. 1-2, pp. 93-129, 1997.
    • (1997) Human Comput. Interaction , vol.12 , Issue.1-2 , pp. 93-129
    • Oviatt, S.L.1
  • 55
    • 0028783651
    • The role of voice input for human-machine communication
    • P. Cohen and S. L. Oviatt, "The role of voice input for human-machine communication," Proc. Nat. Acad. Sci., vol. 92, pp. 9921-9927, 1995.
    • (1995) Proc. Nat. Acad. Sci. , vol.92 , pp. 9921-9927
    • Cohen, P.1    Oviatt, S.L.2
  • 56
    • 0026240713
    • Discourse structure and performance efficiency in interactive and noninteractive spoken modalities
    • S. L. Oviatt and P. R. Cohen, "Discourse structure and performance efficiency in interactive and noninteractive spoken modalities," Comput. Speech Lang., vol. 5, no. 4, pp. 297-326, 1991.
    • (1991) Comput. Speech Lang. , vol.5 , Issue.4 , pp. 297-326
    • Oviatt, S.L.1    Cohen, P.R.2
  • 57
    • 85135322093
    • Integration themes in multimodal human-computer interaction
    • S. L. Oviatt and E. Olsen, "Integration themes in multimodal human-computer interaction," in Proc. ICSLP, vol. 2, 1994, pp. 551-554.
    • (1994) Proc. ICSLP , vol.2 , pp. 551-554
    • Oviatt, S.L.1    Olsen, E.2
  • 59
    • 0019038072
    • Put-that-there: Voice and gesture at the graphics interface
    • R. A. Bolt, "Put-that-there: Voice and gesture at the graphics interface," Comput. Graph., vol. 14, no. 3, pp. 262-270, 1980.
    • (1980) Comput. Graph. , vol.14 , Issue.3 , pp. 262-270
    • Bolt, R.A.1
  • 60
    • 0010128235
    • Integrating simultaneous input from speech, gaze, and hand gestures
    • M. Maybury, Ed. Cambridge, MA: MIT Press
    • D. Koons, C. Sparrell, and K. Thorisson, "Integrating simultaneous input from speech, gaze, and hand gestures," in Intelligent Multimedia Interfaces, M. Maybury, Ed. Cambridge, MA: MIT Press, 1993, pp. 257-276.
    • (1993) Intelligent Multimedia Interfaces , pp. 257-276
    • Koons, D.1    Sparrell, C.2    Thorisson, K.3
  • 61
    • 85009285157
    • Multimodal integration patterns in children
    • B. Xiao, C. Girand, and S. L. Oviatt, "Multimodal integration patterns in children," in Proc. ICSLP, 2002, pp. 629-632.
    • (2002) Proc. ICSLP , pp. 629-632
    • Xiao, B.1    Girand, C.2    Oviatt, S.L.3
  • 62
    • 10844297765
    • Modeling multimodal integration patterns and performance in seniors: Toward adaptive processing of individual differences
    • Vancouver, BC, Canada
    • B. Xiao, R. Lunsford, R. Coulston, M. Wesson, and S. L. Oviatt, "Modeling multimodal integration patterns and performance in seniors: toward adaptive processing of individual differences," presented at the Int. Conf. Multimodal Interfaces, Vancouver, BC, Canada, 2003.
    • (2003) Int. Conf. Multimodal Interfaces
    • Xiao, B.1    Lunsford, R.2    Coulston, R.3    Wesson, M.4    Oviatt, S.L.5
  • 63
    • 40649110141
    • Spontaneous gesture and sign: A study of ASL signs co-occurring with speech
    • K. Naughton, "Spontaneous gesture and sign: A study of ASL signs co-occurring with speech," in Proc. Workshop Integration Gesture Language and Speech, 1996, pp. 125-134.
    • (1996) Proc. Workshop Integration Gesture Language and Speech , pp. 125-134
    • Naughton, K.1
  • 64
    • 0042401939
    • How can coarticulation models account for speech sensitivity to audio-visual desynchronization?
    • D. G. Stork and M. E. Hennecke, Eds. New York: Springer-Verlag
    • C. Abry, M. T. Lallouache, and M. A. Cathiard, "How can coarticulation models account for speech sensitivity to audio-visual desynchronization?" in Speechreading by Humans and Machines: Models, Systems and Applications, D. G. Stork and M. E. Hennecke, Eds. New York: Springer-Verlag, 1996, pp. 247-255.
    • (1996) Speechreading by Humans and Machines: Models, Systems and Applications , pp. 247-255
    • Abry, C.1    Lallouache, M.T.2    Cathiard, M.A.3
  • 65
    • 0014036537
    • Some functions of gaze direction in social interaction
    • A. Kendon, "Some functions of gaze direction in social interaction," Acta Psychol., vol. 26, pp. 22-63, 1967.
    • (1967) Acta Psychol. , vol.26 , pp. 22-63
    • Kendon, A.1
  • 66
    • 0034232298
    • What the eyes say about speaking
    • Z. M. Griffin and K. Bock, "What the eyes say about speaking," Psychol. Sci., vol. 11, no. 4, pp. 274-279, 2000.
    • (2000) Psychol. Sci. , vol.11 , Issue.4 , pp. 274-279
    • Griffin, Z.M.1    Bock, K.2
  • 67
    • 0002126112
    • Ten myths of multimodal interaction
    • Nov.
    • S. L. Oviatt, "Ten myths of multimodal interaction," Commun. ACM, vol. 42, no. 11, pp. 74-81, Nov. 1999.
    • (1999) Commun. ACM , vol.42 , Issue.11 , pp. 74-81
    • Oviatt, S.L.1
  • 69
    • 21244466158
    • private communication
    • R. Markinson, private communication, 1993.
    • (1993)
    • Markinson, R.1
  • 70
    • 0004544671
    • Differences in visual intelligibility across talkers
    • D. G. Stork and M. E. Hennecke, Eds. New York: Springer-Verlag
    • P. B. Kricos, "Differences in visual intelligibility across talkers," in Speechreading by Humans and Machines: Models, Systems and Applications, D. G. Stork and M. E. Hennecke, Eds. New York: Springer-Verlag, 1996, pp. 43-53.
    • (1996) Speechreading by Humans and Machines: Models, Systems and Applications , pp. 43-53
    • Kricos, P.B.1
  • 71
    • 0025935481
    • McGurk effect in non-English listeners: Few visual effects for Japanese subjects hearing Japanese syllables of high auditory intelligibility
    • K. Sekiyama and Y. Tohkura, "McGurk effect in non-English listeners: Few visual effects for Japanese subjects hearing Japanese syllables of high auditory intelligibility," J. Acoust. Soc. Amer., vol. 90, pp. 1797-1805, 1991.
    • (1991) J. Acoust. Soc. Amer. , vol.90 , pp. 1797-1805
    • Sekiyama, K.1    Tohkura, Y.2
  • 72
    • 0005454347
    • Perception of conflicting audio-visual speech: An examination across Spanish and German
    • D. G. Stork and M. E. Hennecke, Eds. New York: Springer-Verlag
    • A. Fuster-Duran, "Perception of conflicting audio-visual speech: An examination across Spanish and German," in Speechreading by Humans and Machines: Models, Systems and Applications, D. G. Stork and M. E. Hennecke, Eds. New York: Springer-Verlag, 1996, pp. 135-143.
    • (1996) Speechreading by Humans and Machines: Models, Systems and Applications , pp. 135-143
    • Fuster-Duran, A.1
  • 73
    • 0003337251
    • Nonverbal communication in human social interaction
    • R. Hinde, Ed. Cambridge, MA: Cambridge Univ. Press
    • M. Argyle, "Nonverbal communication in human social interaction," in Nonverbal Communication, R. Hinde, Ed. Cambridge, MA: Cambridge Univ. Press, 1972, pp. 243-267.
    • (1972) Nonverbal Communication , pp. 243-267
    • Argyle, M.1
  • 74
    • 0020735295
    • Compatibility and resource competition between modalities of input, central processing, and output
    • C. D. Wickens, D. L. Sandry, and M. Vidulich, "Compatibility and resource competition between modalities of input, central processing, and output," Human Factors, vol. 25, pp. 227-248, 1983.
    • (1983) Human Factors , vol.25 , pp. 227-248
    • Wickens, C.D.1    Sandry, D.L.2    Vidulich, M.3
  • 76
    • 0028710004
    • Toward interface design for human language technology: Modality and structure as determinants of linguistic complexity
    • S. L. Oviatt, P. R. Cohen, and M. Q. Wang, "Toward interface design for human language technology: Modality and structure as determinants of linguistic complexity," Speech Commun., vol. 15, pp. 283-300, 1994.
    • (1994) Speech Commun. , vol.15 , pp. 283-300
    • Oviatt, S.L.1    Cohen, P.R.2    Wang, M.Q.3
  • 77
    • 21244444022
    • Voice input as a replacement for keyboard accelerators in a mouse-based graphical editor: An empirical study
    • July
    • J. H. Leatherby and R. Pausch, "Voice input as a replacement for keyboard accelerators in a mouse-based graphical editor: An empirical study," J. Amer. Voice Input/Output Soc., vol. 11, no. 2, July 1992.
    • (1992) J. Amer. Voice Input/Output Soc. , vol.11 , Issue.2
    • Leatherby, J.H.1    Pausch, R.2
  • 79
    • 85128403506
    • Referential features and linguistic indirection in multimodal language
    • S. L. Oviatt and K. Kuhn, "Referential features and linguistic indirection in multimodal language," in Proc. ICSLP, vol. 2, 1998, pp. 227-280.
    • (1998) Proc. ICSLP , vol.2 , pp. 227-280
    • Oviatt, S.L.1    Kuhn, K.2
  • 80
    • 0032684957
    • Mutual disambiguation of recognition errors in a multimodal architecture
    • S. L. Oviatt, "Mutual disambiguation of recognition errors in a multimodal architecture," in Proc. Conf. Human Factors Computing Systems (CHI'99), 1999, pp. 576-583.
    • (1999) Proc. Conf. Human Factors Computing Systems (CHI'99) , pp. 576-583
    • Oviatt, S.L.1
  • 81
    • 0005073850
    • Multimodal interactions in speech systems
    • M. Blattner and R. Dannenberg, Eds. New York: ACM, Frontier Series
    • A. Rudnicky and A. Hauptman, "Multimodal interactions in speech systems," in Multimedia Interface Design, M. Blattner and R. Dannenberg, Eds. New York: ACM, 1992, Frontier Series, pp. 147-172.
    • (1992) Multimedia Interface Design , pp. 147-172
    • Rudnicky, A.1    Hauptman, A.2
  • 82
    • 77956782689
    • Breaking the robustness barrier: Recent progress on the design of robust multimodal systems
    • M. Zelkowitz, Ed. New York: Academic
    • S. L. Oviatt, "Breaking the robustness barrier: Recent progress on the design of robust multimodal systems," in Advances in Computers, M. Zelkowitz, Ed. New York: Academic, 2002, vol. 56, pp. 305-341.
    • (2002) Advances in Computers , vol.56 , pp. 305-341
    • Oviatt, S.L.1
  • 83
    • 0347663785
    • Linguistic adaptations during spoken and multimodal error resolution
    • S. L. Oviatt, J. Bernard, and G. Levow, "Linguistic adaptations during spoken and multimodal error resolution," Lang. Speech, vol. 41, no. 3-4, pp. 515-438, 1999.
    • (1999) Lang. Speech , vol.41 , Issue.3-4 , pp. 515-1438
    • Oviatt, S.L.1    Bernard, J.2    Levow, G.3
  • 84
    • 0023237267
    • Quantifying the contribution of vision to speech perception in noise
    • A. McLeod and Q. Summerfield, "Quantifying the contribution of vision to speech perception in noise," Br. J. Audiol., vol. 21, pp. 131-141, 1987.
    • (1987) Br. J. Audiol. , vol.21 , pp. 131-141
    • McLeod, A.1    Summerfield, Q.2
  • 85
    • 0032179207
    • Modeling the interaction of phonemic intelligibility and lexical structure in audiovisual word recognition
    • P. Iverson, L. Bernstein, and E. Auer, "Modeling the interaction of phonemic intelligibility and lexical structure in audiovisual word recognition," Speech Commun., vol. 26, no. 1-2, pp. 45-63, 1998.
    • (1998) Speech Commun. , vol.26 , Issue.1-2 , pp. 45-63
    • Iverson, P.1    Bernstein, L.2    Auer, E.3
  • 86
    • 0141573559
    • On the use of visual information for improving audio-based speaker recognition
    • A. Senior, C. Neti, and B. Maison, "On the use of visual information for improving audio-based speaker recognition," in Proc. Auditory-Visual Speech Processing (AVSP), 1999, pp. 108-111.
    • (1999) Proc. Auditory-Visual Speech Processing (AVSP) , pp. 108-111
    • Senior, A.1    Neti, C.2    Maison, B.3
  • 87
    • 85009154155
    • Stream weight optimization of speech and lip image sequence for audio-visual speech recognition
    • S. Nakamura, H. Ito, and K. Shikano, "Stream weight optimization of speech and lip image sequence for audio-visual speech recognition," in Proc. ICSLP, vol. 3, 2000, pp. 20-24.
    • (2000) Proc. ICSLP , vol.3 , pp. 20-24
    • Nakamura, S.1    Ito, H.2    Shikano, K.3
  • 88
    • 85009153179
    • Stream confidence estimation for audio-visual speech recognition
    • G. Potamianos and C. Neti, "Stream confidence estimation for audio-visual speech recognition," in Proc. ICSLP, vol. 3, 2000, pp. 746-749.
    • (2000) Proc. ICSLP , vol.3 , pp. 746-749
    • Potamianos, G.1    Neti, C.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.