-
1
-
-
0141685005
-
Audio-visual and multimodal speech-based systems
-
D. Gibbon, I. Mertins, and R. Moore, Eds. Boston, MA, Kluwer
-
J. Benoit, C. Martin, C. Pelachaud, L. Schomaker, and B. Suhm, "Audio-visual and multimodal speech-based systems," in Handbook of Multimodal and Spoken Dialogue Systems: Resources, Terminology and Product Evaluation, D. Gibbon, I. Mertins, and R. Moore, Eds. Boston, MA, Kluwer, 2000, pp. 102-203.
-
(2000)
Handbook of Multimodal and Spoken Dialogue Systems: Resources, Terminology and Product Evaluation
, pp. 102-203
-
-
Benoit, J.1
Martin, C.2
Pelachaud, C.3
Schomaker, L.4
Suhm, B.5
-
2
-
-
0034448810
-
Designing the user interface for multimodal speech and gesture applications; State-of-the-art systems and research directions
-
S. L. Oviatt, P. R. Cohen, L. Wu, J. Vergo, L. Dunean, B. Suhm, J. Bers, T. Holzman, T. Winograd, J. Landay, J. Larson, and D. Ferro, "Designing the user interface for multimodal speech and gesture applications; State-of-the-art systems and research directions," Human Comput. Interaction, vol. 15, no. 4, pp. 263-322, 2000.
-
(2000)
Human Comput. Interaction
, vol.15
, Issue.4
, pp. 263-322
-
-
Oviatt, S.L.1
Cohen, P.R.2
Wu, L.3
Vergo, J.4
Dunean, L.5
Suhm, B.6
Bers, J.7
Holzman, T.8
Winograd, T.9
Landay, J.10
Larson, J.11
Ferro, D.12
-
3
-
-
0031380441
-
Quickset: Multimodal interaction for distributed applications
-
P. R. Cohen, M. Johnston, D. McGee, S. L. Oviatt, J. Pittman, I. Smith, L. Chen, and J. Clow, "Quickset: Multimodal interaction for distributed applications." in Proc. 5th ACM Int. Multimedia Conf., 1997, pp. 31-40.
-
(1997)
Proc. 5th ACM Int. Multimedia Conf.
, pp. 31-40
-
-
Cohen, P.R.1
Johnston, M.2
McGee, D.3
Oviatt, S.L.4
Pittman, J.5
Smith, I.6
Chen, L.7
Clow, J.8
-
4
-
-
84943647946
-
MiPad: A next-generation PDA prototype
-
X. Huang, A. Acero, C. Chelba, L. Deng, D. Duchene, J. Goodman, H. Hon, D. Jacoby, L. Jiang, R. Loynd, M. Mahajan, P. Mau, S. Meredith, S. Mughal, S. Neto, M. Plumpe, K. Wang, and Y. Wang, "MiPad: A next-generation PDA prototype," in Proc. ICSLP, vol. 3, 2000, pp. 33-36.
-
(2000)
Proc. ICSLP
, vol.3
, pp. 33-36
-
-
Huang, X.1
Acero, A.2
Chelba, C.3
Deng, L.4
Duchene, D.5
Goodman, J.6
Hon, H.7
Jacoby, D.8
Jiang, L.9
Loynd, R.10
Mahajan, M.11
Mau, P.12
Meredith, S.13
Mughal, S.14
Neto, S.15
Plumpe, M.16
Wang, K.17
Wang, Y.18
-
5
-
-
82055174896
-
Audio-visual speech recognition compared across two architectures
-
A. Adjoudani and C. Benoit, "Audio-visual speech recognition compared across two architectures," in Proc. Eurospeech, vol. 2, 1995, pp. 1563-1566.
-
(1995)
Proc. Eurospeech
, vol.2
, pp. 1563-1566
-
-
Adjoudani, A.1
Benoit, C.2
-
6
-
-
0032178686
-
Audio-visual speech synthesis from french text: Eight years of models, designs and evaluation
-
C. Benoit and B. Le Goff, "Audio-visual speech synthesis from french text: Eight years of models, designs and evaluation," Speech Commun., vol. 26, pp. 117-129, 1998.
-
(1998)
Speech Commun.
, vol.26
, pp. 117-129
-
-
Benoit, C.1
Le Goff, B.2
-
7
-
-
85013597845
-
Eigenlips for robust speech recognition
-
C. Bregler and Y. Konig, "Eigenlips for robust speech recognition," in Proc. ICASSP, vol. 2, 1994, pp. 669-672.
-
(1994)
Proc. ICASSP
, vol.2
, pp. 669-672
-
-
Bregler, C.1
Konig, Y.2
-
8
-
-
85032752352
-
Audiovisual speech processing
-
Jan.
-
T. Chen. "Audiovisual speech processing." IEEE Signal Processing Mag., vol. 18, pp. 9-21, Jan. 2001.
-
(2001)
IEEE Signal Processing Mag.
, vol.18
, pp. 9-21
-
-
Chen, T.1
-
9
-
-
0034270644
-
Audio-visual speech modeling for continuous speech recognition
-
Sept.
-
S. Dupont and J. Lueitin. "Audio-visual speech modeling for continuous speech recognition." IEEE Trans. Multimedia, vol. 2, pp. 141-151, Sept. 2000.
-
(2000)
IEEE Trans. Multimedia
, vol.2
, pp. 141-151
-
-
Dupont, S.1
Lueitin, J.2
-
11
-
-
4544290191
-
Recent advances in the automatic recognition of audio-visual speech
-
Sept.
-
G. Potamianos, C. Neti, G. Gravier, A. Garg, and A. Senior. "Recent advances in the automatic recognition of audio-visual speech," Proc. IEEE, vol. 91, pp. 1306-1326, Sept. 2003.
-
(2003)
Proc. IEEE
, vol.91
, pp. 1306-1326
-
-
Potamianos, G.1
Neti, C.2
Gravier, G.3
Garg, A.4
Senior, A.5
-
12
-
-
0010070142
-
Audiovisual sensory intergration using hidden Markov models
-
D. G. Stork and M. E. Hennecke, Eds. New York: Springer-Verlag
-
P. L. Silsbee and Q. Su, "Audiovisual sensory intergration using hidden Markov models," in Speechreading by Humana and Machines: Models, Systems and Applications, D. G. Stork and M. E. Hennecke, Eds. New York: Springer-Verlag, 1996, pp. 489-504.
-
(1996)
Speechreading by Humana and Machines: Models, Systems and Applications
, pp. 489-504
-
-
Silsbee, P.L.1
Su, Q.2
-
14
-
-
0029747053
-
Integrating audio and visual information to provide highly robust speech recognition
-
M. J. Tomlinson, M. J. Russell, and N. M. Brooke, "Integrating audio and visual information to provide highly robust speech recognition," in Proc: ICASSP, vol. 2, 1996, pp. 821-824.
-
(1996)
Proc: ICASSP
, vol.2
, pp. 821-824
-
-
Tomlinson, M.J.1
Russell, M.J.2
Brooke, N.M.3
-
15
-
-
0004986359
-
Audio-visual speech processing
-
P. Rubin, E. Vatikiotis-Bateson, and C. Benoit, Eds., "Audio-visual speech processing," in Speech Commun. (Special Issue), 1998, vol. 26.
-
(1998)
Speech Commun. (Special Issue)
, vol.26
-
-
Rubin, P.1
Vatikiotis-Bateson, E.2
Benoit, C.3
-
16
-
-
33845911698
-
Enhancing virtual maintenance environments with speech understanding
-
L. Duncan, W. Brown, C. Esposito, H. Holmback, and P. Xue, "Enhancing virtual maintenance environments with speech understanding." Boeing M&CT TechNet, 1999.
-
(1999)
Boeing M&CT TechNet
-
-
Duncan, L.1
Brown, W.2
Esposito, C.3
Holmback, H.4
Xue, P.5
-
17
-
-
0032075723
-
Toward multimodal human-computer interface
-
May
-
R. Sharma, V. I. Pavlovic, and T. S. Huang, "Toward multimodal human-computer interface," Proc. IEEE, vol. 86, pp. 853-860, May 1998.
-
(1998)
Proc. IEEE
, vol.86
, pp. 853-860
-
-
Sharma, R.1
Pavlovic, V.I.2
Huang, T.S.3
-
18
-
-
84882783850
-
-
Architecture Machine Group, Massachusetts Inst. Technol., Cambridge
-
N. Negroponte, "Report for ONR and DARPA." Architecture Machine Group, Massachusetts Inst. Technol., Cambridge, 1978.
-
(1978)
Report for ONR and DARPA.
-
-
Negroponte, N.1
-
19
-
-
0031193007
-
Visual interpretation of hand gestures for human-computer interaction: A review
-
July
-
V. Pavlovic, R. Sharma, and T. Huang, "Visual interpretation of hand gestures for human-computer interaction: A review," IEEE Trans. Pattern Anal. Machine Intell., vol. 19, pp. 677-695, July 1997.
-
(1997)
IEEE Trans. Pattern Anal. Machine Intell.
, vol.19
, pp. 677-695
-
-
Pavlovic, V.1
Sharma, R.2
Huang, T.3
-
20
-
-
0032662263
-
Manual and gaze input cascaded (MAGIC) pointing
-
S. Zhai, C. Morimoto, and S. Ihde, "Manual and gaze input cascaded (MAGIC) pointing," in Proc. Conf Human Factors Computing Systems (CHI'99), 1999, pp. 246-253.
-
(1999)
Proc. Conf Human Factors Computing Systems (CHI'99)
, pp. 246-253
-
-
Zhai, S.1
Morimoto, C.2
Ihde, S.3
-
23
-
-
0002064205
-
Computer-human interface solutions for emergency medical care
-
T. G. Holzman, "Computer-human interface solutions for emergency medical care." Interactions, vol. 6, no. 3, pp. 13-24, 1999.
-
(1999)
Interactions
, vol.6
, Issue.3
, pp. 13-24
-
-
Holzman, T.G.1
-
24
-
-
85009060634
-
Perceptual interfaces for information interaction: Joint processing of audio and visual information for human-computer interaction
-
C. Neti, G. Iyengar, G. Putamianos, and A. Senior, "Perceptual interfaces for information interaction: Joint processing of audio and visual information for human-computer interaction," in Proc: ICSLP, vol. 3, 2000, pp. 11-14.
-
(2000)
Proc: ICSLP
, vol.3
, pp. 11-14
-
-
Neti, C.1
Iyengar, G.2
Putamianos, G.3
Senior, A.4
-
25
-
-
0033879165
-
Guest editors' introduction: Biometrics-the future of identification
-
Feb.
-
S. Pankanti, R. M. Bolle, and A. Jain, "Guest editors' introduction: Biometrics-the future of identification," IEEE Computer, vol. 33, pp. 46-80, Feb. 2000.
-
(2000)
IEEE Computer
, vol.33
, pp. 46-80
-
-
Pankanti, S.1
Bolle, R.M.2
Jain, A.3
-
26
-
-
0035279096
-
Language-based interfaces and their application for cultural tourism
-
O. Slock, "Language-based interfaces and their application for cultural tourism," AI Mag., pp. 85-97, 2001.
-
(2001)
AI Mag.
, pp. 85-97
-
-
Slock, O.1
-
27
-
-
21244445225
-
SmartKom: Multimodal dialogs with mobile Web users
-
International Forum
-
W. Wahlster, "SmartKom: Multimodal dialogs with mobile Web users," in Proc. Cyber Assist Int. Symp., International Forum, 2001, pp. 33-34.
-
(2001)
Proc. Cyber Assist Int. Symp.
, pp. 33-34
-
-
Wahlster, W.1
-
28
-
-
0030677453
-
Multimodal interfaces for multimedia information agents
-
A. Waibel, B. Suhm, M. T. Vo, and J. Yang, "Multimodal interfaces for multimedia information agents." in Proc. ICASSP, vol. 1, 1997, pp. 167-170.
-
(1997)
Proc. ICASSP
, vol.1
, pp. 167-170
-
-
Waibel, A.1
Suhm, B.2
Vo, M.T.3
Yang, J.4
-
29
-
-
0038377045
-
Multimodal systems that process what comes naturally
-
Mar.
-
S. L. Oviatt and P. R. Cohen, "Multimodal systems that process what comes naturally," Commun. ACM, vol. 43, no. 3, pp. 45-53, Mar. 2000.
-
(2000)
Commun. ACM
, vol.43
, Issue.3
, pp. 45-53
-
-
Oviatt, S.L.1
Cohen, P.R.2
-
30
-
-
85135134004
-
A rapid semi-automatic simulation technique for investigating interactive speech and handwriting
-
S. L. Oviatt, P. R. Cohen, M. W. Fong, and M. P. Frank, "A rapid semi-automatic simulation technique for investigating interactive speech and handwriting." in Proc. ICSLP, vol. 2, 1992, pp. 1351-1354.
-
(1992)
Proc. ICSLP
, vol.2
, pp. 1351-1354
-
-
Oviatt, S.L.1
Cohen, P.R.2
Fong, M.W.3
Frank, M.P.4
-
31
-
-
84928838853
-
An analysis of behavioral organization
-
W. S. Condon, "An analysis of behavioral organization," Sign Lang. Stud., vol. 58, pp. 55-88, 1988.
-
(1988)
Sign Lang. Stud.
, vol.58
, pp. 55-88
-
-
Condon, W.S.1
-
32
-
-
85065273463
-
Gesticulation and speech: Two aspects of the process of utterance
-
M. Key, Ed. The Hague, The Netherlands: Mouton
-
A. Kendon, "Gesticulation and speech: Two aspects of the process of utterance," in The Relationship of Verbal and Nonverbal Communication, M. Key, Ed. The Hague, The Netherlands: Mouton, 1980, pp. 207-227.
-
(1980)
The Relationship of Verbal and Nonverbal Communication
, pp. 207-227
-
-
Kendon, A.1
-
34
-
-
84888902058
-
Gestural trajectory symmetries and discourse segmentation
-
F. Quek, Y. Xiong, and D. McNeill, "Gestural trajectory symmetries and discourse segmentation." in Proc. ICSLP, vol. 1, 2002, pp. 185-188.
-
(2002)
Proc. ICSLP
, vol.1
, pp. 185-188
-
-
Quek, F.1
Xiong, Y.2
McNeill, D.3
-
35
-
-
85009265640
-
Gestural spatialization in natural discourse segmentation
-
F. Quek, D. McNeill, R. Bryll, and M. Harper, "Gestural spatialization in natural discourse segmentation." in Proc. ICSLP, vol. 1, 2002, pp. 189-192.
-
(2002)
Proc. ICSLP
, vol.1
, pp. 189-192
-
-
Quek, F.1
McNeill, D.2
Bryll, R.3
Harper, M.4
-
36
-
-
0032072433
-
Sensory integration and specchreading by humans and machines
-
D. W. Massaro and D. G. Stork, "Sensory integration and specchreading by humans and machines," Amer. Scientist, vol. 86, pp. 236-244, 1998.
-
(1998)
Amer. Scientist
, vol.86
, pp. 236-244
-
-
Massaro, D.W.1
Stork, D.G.2
-
37
-
-
0022019614
-
Intermodal timing relations and audio-visual speech recognition by normal-hearing adults
-
M. McGrath and Q. Summerfield, "Intermodal timing relations and audio-visual speech recognition by normal-hearing adults," J. Acoust. Soc. Amer., vol. 77. no. 2. pp. 678-685, 1985.
-
(1985)
J. Acoust. Soc. Amer.
, vol.77
, Issue.2
, pp. 678-685
-
-
McGrath, M.1
Summerfield, Q.2
-
38
-
-
0017199877
-
Hearing lips and seeing voices
-
H. McGurk and J. MacDonald, "Hearing lips and seeing voices," Nature, vol. 264, pp. 746-748, 1976.
-
(1976)
Nature
, vol.264
, pp. 746-748
-
-
McGurk, H.1
MacDonald, J.2
-
39
-
-
0031747741
-
Complementarity and synergy in bimodal speech: Auditory, visual, and auditory-visual identification of French oral vowels in noise
-
J. Robert-Ribes, J.-L. Schwartz, T. Lallouache, and P. Escudier, "Complementarity and synergy in bimodal speech: Auditory, visual, and auditory-visual identification of French oral vowels in noise." J. Acoust. Soc. Amer., vol. 103, no. 6, pp. 3677-3689, 1998.
-
(1998)
J. Acoust. Soc. Amer.
, vol.103
, Issue.6
, pp. 3677-3689
-
-
Robert-Ribes, J.1
Schwartz, J.-L.2
Lallouache, T.3
Escudier, P.4
-
40
-
-
0041827542
-
Perceptual user interfaces
-
M. Turk and G. Robertson, Eds., "Perceptual user interfaces," in Commun. ACM, 2000, vol. 43, pp. 32-70.
-
(2000)
Commun. ACM
, vol.43
, pp. 32-70
-
-
Turk, M.1
Robertson, G.2
-
41
-
-
23044521010
-
Statistical sensor calibration for fusion of different classifiers in a biometric person recognition framework
-
Heidelberg, Germany
-
B. Fröba, C. Rothe, and C. Küblbeck, "Statistical sensor calibration for fusion of different classifiers in a biometric person recognition framework," in Lecture Notes in Computer Science, Multiple Classifier Systems Heidelberg, Germany, 2000, vol. 1857, pp. 362-371.
-
(2000)
Lecture Notes in Computer Science, Multiple Classifier Systems
, vol.1857
, pp. 362-371
-
-
Fröba, B.1
Rothe, C.2
Küblbeck, C.3
-
42
-
-
0003079516
-
A multimodal biometric system using fingerprint, face and speech
-
A. Jain, L. Hong, and Y. Kulkarni, "A multimodal biometric system using fingerprint, face and speech." in Proc. 2nd Int. Conf. Audio- and Video-Based Biometric Person Authentication, 1999. pp. 182-187.
-
(1999)
Proc. 2nd Int. Conf. Audio- and Video-based Biometric Person Authentication
, pp. 182-187
-
-
Jain, A.1
Hong, L.2
Kulkarni, Y.3
-
43
-
-
0036448934
-
Learning user-specific parameters in a multibiometric system
-
Rochester, NY
-
A. Jain and A. Ross, "Learning user-specific parameters in a multibiometric system," presented at the Int. Conf. Image Processing (ICIP), Rochester, NY, 2002.
-
(2002)
Int. Conf. Image Processing (ICIP)
-
-
Jain, A.1
Ross, A.2
-
44
-
-
82055208315
-
Information fusion in biometrics
-
Heidelberg, Germany
-
A. Ross, A. Jain, and J. Z. Qian, "Information fusion in biometrics," in Lecture Notes in Computer Science, Audio- and Video-Based Biometric Person Authentication Heidelberg, Germany, 2001, vol. 2091, pp. 354-359.
-
(2001)
Lecture Notes in Computer Science, Audio- and Video-based Biometric Person Authentication
, vol.2091
, pp. 354-359
-
-
Ross, A.1
Jain, A.2
Qian, J.Z.3
-
47
-
-
9444239110
-
Toward a theory of organized multimodal integration patterns during human-computer interaction
-
Vancouver, BC, Canada
-
S. L. Oviatt, R. Collision, S. Shriver, B. Xiao, R. Wesson, R. Lunsford, and L. Carmichael, "Toward a theory of organized multimodal integration patterns during human-computer interaction," presented at the Int. Conf. Multimodal Interfaces, Vancouver, BC, Canada, 2003.
-
(2003)
Int. Conf. Multimodal Interfaces
-
-
Oviatt, S.L.1
Collision, R.2
Shriver, S.3
Xiao, B.4
Wesson, R.5
Lunsford, R.6
Carmichael, L.7
-
49
-
-
0000886290
-
Eye fixations and cognitive processes
-
M. A. Just and P. A. Carpenter, "Eye fixations and cognitive processes," Cogn. Psychol., vol. 8, pp. 441-480, 1976.
-
(1976)
Cogn. Psychol.
, vol.8
, pp. 441-480
-
-
Just, M.A.1
Carpenter, P.A.2
-
50
-
-
0032215040
-
Eye movements in reading and information processing: Twenty years of research
-
K. Rayner, "Eye movements in reading and information processing: Twenty years of research," Psychol. Bull., vol. 124, no. 3, pp. 372-422, 1998.
-
(1998)
Psychol. Bull.
, vol.124
, Issue.3
, pp. 372-422
-
-
Rayner, K.1
-
51
-
-
84976686046
-
The use of eye movements in human-computer interaction techniques: What you look at is what you get
-
R. J. K. Jacob, "The use of eye movements in human-computer interaction techniques: What you look at is what you get," ACM Trans Inform. Syst., vol. 9, pp. 152-169, 1991.
-
(1991)
ACM Trans Inform. Syst.
, vol.9
, pp. 152-169
-
-
Jacob, R.J.K.1
-
52
-
-
0033721958
-
SUITOR: An attentive information system
-
P. P. Maglio, R. Barrett, C. S. Campbell, and T. T. Selker, "SUITOR: An attentive information system." in Proc. Int. Conf. Intelligent User Interfaces (IUI 2000), 2000, pp. 169-176.
-
(2000)
Proc. Int. Conf. Intelligent User Interfaces (IUI 2000)
, pp. 169-176
-
-
Maglio, P.P.1
Barrett, R.2
Campbell, C.S.3
Selker, T.T.4
-
54
-
-
0030687099
-
Multimodal interactive maps: Designing for human performance
-
S. L. Oviatt, "Multimodal interactive maps: Designing for human performance." Human Comput. Interaction, vol. 12, no. 1-2, pp. 93-129, 1997.
-
(1997)
Human Comput. Interaction
, vol.12
, Issue.1-2
, pp. 93-129
-
-
Oviatt, S.L.1
-
55
-
-
0028783651
-
The role of voice input for human-machine communication
-
P. Cohen and S. L. Oviatt, "The role of voice input for human-machine communication," Proc. Nat. Acad. Sci., vol. 92, pp. 9921-9927, 1995.
-
(1995)
Proc. Nat. Acad. Sci.
, vol.92
, pp. 9921-9927
-
-
Cohen, P.1
Oviatt, S.L.2
-
56
-
-
0026240713
-
Discourse structure and performance efficiency in interactive and noninteractive spoken modalities
-
S. L. Oviatt and P. R. Cohen, "Discourse structure and performance efficiency in interactive and noninteractive spoken modalities." Comput. Speech Lang., vol. 5, no. 4, pp. 297-326, 1991.
-
(1991)
Comput. Speech Lang.
, vol.5
, Issue.4
, pp. 297-326
-
-
Oviatt, S.L.1
Cohen, P.R.2
-
57
-
-
85135322093
-
Integration themes in multimodal human-computer interaction
-
S. L. Oviatt and E. Olsen, "Integration themes in multimodal human-computer interaction." in Proc. ICSLP, vol. 2, 1994, pp. 551-554.
-
(1994)
Proc. ICSLP
, vol.2
, pp. 551-554
-
-
Oviatt, S.L.1
Olsen, E.2
-
59
-
-
0019038072
-
Put-that-there: Voice and gesture at the graphics interface
-
R. A. Bolt, "Put-that-there: Voice and gesture at the graphics interface," Comput. Graph., vol. 14, no. 3, pp. 262-270, 1980.
-
(1980)
Comput. Graph.
, vol.14
, Issue.3
, pp. 262-270
-
-
Bolt, R.A.1
-
60
-
-
0010128235
-
Integrating simultaneous input from speech, gaze, and hand gestures
-
M. Maybury, Ed. Cambridge, MA: MIT Press
-
D. Koons, C. Sparrell, and K. Thorisson, "Integrating simultaneous input from speech, gaze, and hand gestures," in Intelligent Multimedia Interfaces, M. Maybury, Ed. Cambridge, MA: MIT Press, 1993, pp. 257-276.
-
(1993)
Intelligent Multimedia Interfaces
, pp. 257-276
-
-
Koons, D.1
Sparrell, C.2
Thorisson, K.3
-
61
-
-
85009285157
-
Multimodal integration patterns in children
-
B. Xiao, C. Girand, and S. L. Oviatt, "Multimodal integration patterns in children." in Proc. ICSLP, 2002. pp. 629-632.
-
(2002)
Proc. ICSLP
, pp. 629-632
-
-
Xiao, B.1
Girand, C.2
Oviatt, S.L.3
-
62
-
-
10844297765
-
Modeling multimodal integration patterns and performance in seniors: Toward adaptive processing of individual differences
-
Vancouver, BC, Canada
-
B. Xiao, R. Lunsford, R. Coulston, M. Wesson, and S. L. Oviatt, "Modeling multimodal integration patterns and performance in seniors: toward adaptive processing of individual differences," presented at the Int. Conf. Multimodal Interfaces, Vancouver, BC, Canada, 2003.
-
(2003)
Int. Conf. Multimodal Interfaces
-
-
Xiao, B.1
Lunsford, R.2
Coulston, R.3
Wesson, M.4
Oviatt, S.L.5
-
63
-
-
40649110141
-
Spontaneous gesture and sign: A study of ASL signs co-occurring with speech
-
K. Naughton, "Spontaneous gesture and sign: A study of ASL signs co-occurring with speech," in Proc. Workshop Integration Gesture Language and Speech, 1996, pp. 125-134.
-
(1996)
Proc. Workshop Integration Gesture Language and Speech
, pp. 125-134
-
-
Naughton, K.1
-
64
-
-
0042401939
-
How can coarticulation models account for speech sensitivity to audio-visual de synchronization?
-
D. G. Stork and M. E. Hennecke, Eds. New York: Springer-Verlag
-
C. Abry, M. T. Lallouache, and M. A. Cathiard, "How can coarticulation models account for speech sensitivity to audio-visual de synchronization?." in Speechreading by Humans and Machines: Models, Systemsand Applications, D. G. Stork and M. E. Hennecke, Eds. New York: Springer-Verlag, 1996, pp. 247-255.
-
(1996)
Speechreading by Humans and Machines: Models, Systemsand Applications
, pp. 247-255
-
-
Abry, C.1
Lallouache, M.T.2
Cathiard, M.A.3
-
65
-
-
0014036537
-
Some functions of gaze direction in social interaction
-
A. Kendon. "Some functions of gaze direction in social interaction," Acta Psychol., vol. 26, pp. 22-63, 1967.
-
(1967)
Acta Psychol.
, vol.26
, pp. 22-63
-
-
Kendon, A.1
-
66
-
-
0034232298
-
What the eyes say about speaking
-
Z. M. Griffin and K. Bock, "What the eyes say about speaking." Psychol. Sci., vol. 11, no. 4, pp. 274-279, 2000.
-
(2000)
Psychol. Sci.
, vol.11
, Issue.4
, pp. 274-279
-
-
Griffin, Z.M.1
Bock, K.2
-
67
-
-
0002126112
-
Ten myths of multi modal interaction
-
Nov.
-
S. L. Oviatt, "Ten myths of multi modal interaction." Commun. ACM, vol. 42, no. 11, pp. 74-81. Nov. 1999.
-
(1999)
Commun. ACM
, vol.42
, Issue.11
, pp. 74-81
-
-
Oviatt, S.L.1
-
69
-
-
21244466158
-
-
private communication
-
R. Markinson, private communication, 1993.
-
(1993)
-
-
Markinson, R.1
-
70
-
-
0004544671
-
Differences in visual intelligibility across talkers
-
D. G. Stork and M. E. Hennecke, Eds. New York: Springer-Verlag
-
P. B. Kricos, "Differences in visual intelligibility across talkers," in Speechreading by Humans and Machines: Models, Systems and Applications, D. G. Stork and M. E. Hennecke, Eds. New York: Springer-Verlag, 1996, pp. 43-53.
-
(1996)
Speechreading by Humans and Machines: Models, Systems and Applications
, pp. 43-53
-
-
Kricos, P.B.1
-
71
-
-
0025935481
-
Effect in nonenglish listeners: Few visual effects for Japanese subjects hearing Japanese syllables of high auditory intelligibility
-
K. Sekiyama, Y. Tohkura, and Y. McGurk, "Effect in nonenglish listeners: Few visual effects for Japanese subjects hearing Japanese syllables of high auditory intelligibility," J. Acoust. Soc. Amer., vol. 90, pp. 1797-1805, 1991.
-
(1991)
J. Acoust. Soc. Amer.
, vol.90
, pp. 1797-1805
-
-
Sekiyama, K.1
Tohkura, Y.2
McGurk, Y.3
-
72
-
-
0005454347
-
Perception of conflicting audio-visual speech: An examination across Spanish and German
-
D. G. Stork and M. H. Hennecke, Eds. New York: Springer-Verlag
-
A. Fuster-Duran, "Perception of conflicting audio-visual speech: An examination across Spanish and German," in Speechreading by Humans and Machines: Models, Systems and Applications, D. G. Stork and M. H. Hennecke, Eds. New York: Springer-Verlag, 1996, pp. 135-143.
-
(1996)
Speechreading by Humans and Machines: Models, Systems and Applications
, pp. 135-143
-
-
Fuster-Duran, A.1
-
73
-
-
0003337251
-
Nonverbal communication in human social interaction
-
R. Hinde, Ed. Cambridge, MA: Cambridge Univ. Press
-
M. Argyle, "Nonverbal communication in human social interaction," in Nonverbal Communication. R. Hinde, Ed. Cambridge, MA: Cambridge Univ. Press, 1972, pp. 243-267.
-
(1972)
Nonverbal Communication
, pp. 243-267
-
-
Argyle, M.1
-
74
-
-
0020735295
-
Compatibility and resource competition between modalities of input, central processing, and output
-
C. D. Wickens, D. L. Sandry, and M. Vidulich, "Compatibility and resource competition between modalities of input, central processing, and output," Human Factors, vol. 25, pp. 227-248, 1983.
-
(1983)
Human Factors
, vol.25
, pp. 227-248
-
-
Wickens, C.D.1
Sandry, D.L.2
Vidulich, M.3
-
75
-
-
84884491122
-
Synergistic use of direct manipulation and natural language
-
P. Cohen, M. Dalrymple, D. Moran, and F. Pereira, "Synergistic use of direct manipulation and natural language," in Proc. Conf. Human Factors Computing Systems (CHI'89), 1989, pp. 227-234.
-
(1989)
Proc. Conf. Human Factors Computing Systems (CHI'89)
, pp. 227-234
-
-
Cohen, P.1
Dalrymple, M.2
Moran, D.3
Pereira, F.4
-
76
-
-
0028710004
-
Toward interface design for human language technology: Modality and structure as determinants of linguistic complexity
-
S. L. Oviatt, P. R. Cohen, P. R., and M. Q. Wang, "Toward interface design for human language technology: Modality and structure as determinants of linguistic complexity," Speech Commun., vol. 15, pp. 283-300, 1994.
-
(1994)
Speech Commun.
, vol.15
, pp. 283-300
-
-
Oviatt, S.L.1
Cohen, P.R.2
Wang, M.Q.3
-
77
-
-
21244444022
-
Voice input as a replacement for keyboard accelerators in a mouse-based graphical editor: An empirical study
-
July
-
J. H. Leatherby and R. Pausch, "Voice input as a replacement for keyboard accelerators in a mouse-based graphical editor: An empirical study," J. Amer. Voice Input/Output Soc., vol. 11, no. 2, July 1992.
-
(1992)
J. Amer. Voice Input/Output Soc.
, vol.11
, Issue.2
-
-
Leatherby, J.H.1
Pausch, R.2
-
79
-
-
85128403506
-
Referential features and linguistic indirection in multimodal language
-
S. L. Oviatt and K. Kuhn, "Referential features and linguistic indirection in multimodal language," in Proc. ICSLP. vol. 2, 1998, pp. 227-280.
-
(1998)
Proc. ICSLP
, vol.2
, pp. 227-280
-
-
Oviatt, S.L.1
Kuhn, K.2
-
80
-
-
0032684957
-
Mutual disambiguation of recognition errors in a multimodal architecture
-
S. L. Oviatt. "Mutual disambiguation of recognition errors in a multimodal architecture," in Proc. Conf. Human Factors Computing Systems (CHI'99), 1999, pp. 576-583.
-
(1999)
Proc. Conf. Human Factors Computing Systems (CHI'99)
, pp. 576-583
-
-
Oviatt, S.L.1
-
81
-
-
0005073850
-
Multimodal interactions in speech systems
-
M. Blattner and R. Dannenberg, Eds. New York: ACM, Frontier Series
-
A. Rudnicky and A. Hauptman. "Multimodal interactions in speech systems," in Multimedia Interface Design, M. Blattner and R. Dannenberg, Eds. New York: ACM. 1992, Frontier Series, pp. 147-172.
-
(1992)
Multimedia Interface Design
, pp. 147-172
-
-
Rudnicky, A.1
Hauptman, A.2
-
82
-
-
77956782689
-
Breaking the robustness barrier: Recent progress on the design of robust multimodal systems
-
M. Zelkowitz, Ed. New York: Academic
-
S. L. Oviatt, "Breaking the robustness barrier: Recent progress on the design of robust multimodal systems," in Advances in Computers, M. Zelkowitz, Ed. New York: Academic, 2002, vol. 56, pp. 305-341.
-
(2002)
Advances in Computers
, vol.56
, pp. 305-341
-
-
Oviatt, S.L.1
-
83
-
-
0347663785
-
Linguistic adaptations during spoken and multimodal error resolution
-
S. L. Oviatt, J. Bernard, and G. Levow, "Linguistic adaptations during spoken and multimodal error resolution," Lang. Speech, vol. 41, no. 3-4, pp. 515-438, 1999.
-
(1999)
Lang. Speech
, vol.41
, Issue.3-4
, pp. 515-1438
-
-
Oviatt, S.L.1
Bernard, J.2
Levow, G.3
-
84
-
-
0023237267
-
Quantifying the contribution of vision to speech perception in noise
-
A. McLeod and Q. Summerfield, "Quantifying the contribution of vision to speech perception in noise," Br. J. Audiol., vol. 21, pp. 131-141, 1987.
-
(1987)
Br. J. Audiol.
, vol.21
, pp. 131-141
-
-
McLeod, A.1
Summerfield, Q.2
-
85
-
-
0032179207
-
Modeling the interaction of phonemic intelligibility and lexical structure in audiovisual word recognition
-
P. Iverson, L. Bernstein, and E. Auer, "Modeling the interaction of phonemic intelligibility and lexical structure in audiovisual word recognition," Speech Commun., vol. 26, no. 1-2, pp. 45-63, 1998.
-
(1998)
Speech Commun.
, vol.26
, Issue.1-2
, pp. 45-63
-
-
Iverson, P.1
Bernstein, L.2
Auer, E.3
-
87
-
-
85009154155
-
Stream weight optimization of speech and lip image sequence for audio-visual speech recognition
-
S. Nakamura, H. Ito, and K. Shikano, "Stream weight optimization of speech and lip image sequence for audio-visual speech recognition," in Proc. ICSLP, vol. 3, 2000, pp. 20-24.
-
(2000)
Proc. ICSLP
, vol.3
, pp. 20-24
-
-
Nakamura, S.1
Ito, H.2
Shikano, K.3
-
88
-
-
85009153179
-
Stream confidence estimation for audio-visual speech recognition
-
G. Potamianos and C. Neti, "Stream confidence estimation for audio-visual speech recognition," in Proc. ICSLP. vol. 3, 2000, pp. 746-749.
-
(2000)
Proc. ICSLP
, vol.3
, pp. 746-749
-
-
Potamianos, G.1
Neti, C.2
|