-
1
-
-
0141685005
-
Audio-visual and multimodal speech-based systems
-
Gibbon D., Mertins I., and Moore R. (Eds), Kluwer Academic, Boston
-
Benoit C., Martin J.C., Pelachaud C., Schomaker L., and Suhm B. Audio-visual and multimodal speech-based systems. In: Gibbon D., Mertins I., and Moore R. (Eds). Handbook of Multimodal and Spoken Dialogue Systems: Resources, Terminology and Product Evaluation (2000), Kluwer Academic, Boston 102-203
-
(2000)
Handbook of Multimodal and Spoken Dialogue Systems: Resources, Terminology and Product Evaluation
, pp. 102-203
-
-
Benoit, C.1
Martin, J.C.2
Pelachaud, C.3
Schomaker, L.4
Suhm, B.5
-
2
-
-
0034448810
-
Designing the user interface for multimodal speech and gesture applications: State-of-the-art systems and research direction
-
[Reprinted in. Chap. 19
-
[Reprinted in. Oviatt S.L., Cohen P.R., Wu L., Vergo J., Duncan L., Suhm B., Bers J., Holzman T., Winograd T., Landay J., Larson J., and Ferro D. Designing the user interface for multimodal speech and gesture applications: State-of-the-art systems and research direction. Chap. 19. Human Computer Interaction 15 4 (2000) 263-322
-
(2000)
Human Computer Interaction
, vol.15
, Issue.4
, pp. 263-322
-
-
Oviatt, S.L.1
Cohen, P.R.2
Wu, L.3
Vergo, J.4
Duncan, L.5
Suhm, B.6
Bers, J.7
Holzman, T.8
Winograd, T.9
Landay, J.10
Larson, J.11
Ferro, D.12
-
3
-
-
77956780878
-
Designing the user interface for multimodal speech and gesture applications: State-of-the-art systems and research direction
-
[Reprinted in. Chap. 19. Carroll J. (Ed), Addison-Wesley, Reading, MA
-
[Reprinted in. Oviatt S.L., Cohen P.R., Wu L., Vergo J., Duncan L., Suhm B., Bers J., Holzman T., Winograd T., Landay J., Larson J., and Ferro D. Designing the user interface for multimodal speech and gesture applications: State-of-the-art systems and research direction. Chap. 19. In: Carroll J. (Ed). Human-Computer Interaction in the New Millennium (2001), Addison-Wesley, Reading, MA 421-456
-
(2001)
Human-Computer Interaction in the New Millennium
, pp. 421-456
-
-
Oviatt, S.L.1
Cohen, P.R.2
Wu, L.3
Vergo, J.4
Duncan, L.5
Suhm, B.6
Bers, J.7
Holzman, T.8
Winograd, T.9
Landay, J.10
Larson, J.11
Ferro, D.12
-
4
-
-
85009060634
-
Perceptual interfaces for information interaction: Joint processing of audio and visual information for human-computer interaction
-
Neti C., Iyengar G., Potamianos G., Senior A., and Maison B. Perceptual interfaces for information interaction: Joint processing of audio and visual information for human-computer interaction. Proceedings of the International Conference on Spoken Language Processing, Beijing 3 (2000) 11-14
-
(2000)
Proceedings of the International Conference on Spoken Language Processing, Beijing
, vol.3
, pp. 11-14
-
-
Neti, C.1
Iyengar, G.2
Potamianos, G.3
Senior, A.4
Maison, B.5
-
5
-
-
0033879165
-
-
Pankanti S., Bolle R.M., and Jain A. (Eds) 2
-
In: Pankanti S., Bolle R.M., and Jain A. (Eds). Biometrics: The future of identification. Computer 33 (2000) 46-80 2
-
(2000)
Computer
, vol.33
, pp. 46-80
-
-
-
6
-
-
0032178686
-
Audio-visual speech synthesis from French text: Eight years of models, designs and evaluation at the ICP
-
Benoit C., and Le Goff B. Audio-visual speech synthesis from French text: Eight years of models, designs and evaluation at the ICP. Speech Communication 26 (1998) 117-129
-
(1998)
Speech Communication
, vol.26
, pp. 117-129
-
-
Benoit, C.1
Le Goff, B.2
-
7
-
-
0031380441
-
Quickset: Multimodal interaction for distributed applications
-
ACM Press, New York
-
Cohen P.R., Johnston M., McGee D., Oviatt S., Pittman J., Smith I., Chen L., and Clow J. Quickset: Multimodal interaction for distributed applications. Proceedings of the Fifth ACM International Multimedia Conference (1997), ACM Press, New York 31-40
-
(1997)
Proceedings of the Fifth ACM International Multimedia Conference
, pp. 31-40
-
-
Cohen, P.R.1
Johnston, M.2
McGee, D.3
Oviatt, S.4
Pittman, J.5
Smith, I.6
Chen, L.7
Clow, J.8
-
8
-
-
0003544881
-
-
Stork D.G., and Hennecke M.E. (Eds), Springer-Verlag, New York
-
In: Stork D.G., and Hennecke M.E. (Eds). Speechreading by Humans and Machines (1996), Springer-Verlag, New York
-
(1996)
Speechreading by Humans and Machines
-
-
-
9
-
-
0041827542
-
-
Turk M., and Robertson G. (Eds)
-
In: Turk M., and Robertson G. (Eds). Perceptual user interfaces. Communications of the ACM 43 3 (2000) 32-70
-
(2000)
Communications of the ACM
, vol.43
, Issue.3
, pp. 32-70
-
-
-
11
-
-
0019038072
-
Put-that-there: Voice and gesture at the graphics interface
-
Bolt R.A. Put-that-there: Voice and gesture at the graphics interface. Computer Graphics 14 3 (1980) 262-270
-
(1980)
Computer Graphics
, vol.14
, Issue.3
, pp. 262-270
-
-
Bolt, R.A.1
-
12
-
-
84884491122
-
Synergistic use of direct manipulation and natural language
-
[Reprinted in Readings in Intelligent User Interfaces (Maybury and Wahlster, Eds.), pp. 29-37, Morgan Kaufmann, San Francisco.], ACM Press, New York
-
[Reprinted in Readings in Intelligent User Interfaces (Maybury and Wahlster, Eds.), pp. 29-37, Morgan Kaufmann, San Francisco.]. Cohen P.R., Dalrymple M., Moran D.B., Pereira F.C.N., Sullivan J.W., Gargan R.A., Schlossberg J.L., and Tyler S.W. Synergistic use of direct manipulation and natural language. Proceedings of the Conference on Human Factors in Computing Systems (CHI'89) (1989), ACM Press, New York 227-234
-
(1989)
Proceedings of the Conference on Human Factors in Computing Systems (CHI'89)
, pp. 227-234
-
-
Cohen, P.R.1
Dalrymple, M.2
Moran, D.B.3
Pereira, F.C.N.4
Sullivan, J.W.5
Gargan, R.A.6
Schlossberg, J.L.7
Tyler, S.W.8
-
13
-
-
5844422767
-
Combining deictic gestures and natural language for referent identification
-
Bonn, Germany
-
Kobsa A., Allgayer J., Reddig C., Reithinger N., Schmauks D., Harbusch K., and Wahlster W. Combining deictic gestures and natural language for referent identification. Proceedings of the 11th International Conf. on Computational Linguistics. Bonn, Germany (1986) 356-361
-
(1986)
Proceedings of the 11th International Conf. on Computational Linguistics
, pp. 356-361
-
-
Kobsa, A.1
Allgayer, J.2
Reddig, C.3
Reithinger, N.4
Schmauks, D.5
Harbusch, K.6
Wahlster, W.7
-
14
-
-
0002097166
-
Intelligent multimedia interface technology
-
Sullivan J.W., and Tyler S.W. (Eds), ACM Press, New York
-
Neal J.G., and Shapiro S.C. Intelligent multimedia interface technology. In: Sullivan J.W., and Tyler S.W. (Eds). Intelligent User Interfaces (1991), ACM Press, New York 11-43
-
(1991)
Intelligent User Interfaces
, pp. 11-43
-
-
Neal, J.G.1
Shapiro, S.C.2
-
15
-
-
0030355073
-
Multimodal discourse modeling in a multi-user multi-domain environment
-
Bunnell T., and Idsardi W. (Eds), University of Delaware and A. I. duPont Institute
-
Seneff S., Goddeau D., Pao C., and Polifroni J. Multimodal discourse modeling in a multi-user multi-domain environment. In: Bunnell T., and Idsardi W. (Eds). Proceedings of the International Conference on Spoken Language Processing Vol. 1 (1996), University of Delaware and A. I. duPont Institute 192-195
-
(1996)
Proceedings of the International Conference on Spoken Language Processing
, vol.1
, pp. 192-195
-
-
Seneff, S.1
Goddeau, D.2
Pao, C.3
Polifroni, J.4
-
16
-
-
25444516951
-
Modeling and processing of the oral and tactile activities in the Georal tactile system
-
Eindhoven, Netherlands
-
Siroux J., Guyomard M., Multon F., and Remondeau C. Modeling and processing of the oral and tactile activities in the Georal tactile system. Proceedings of the International Conference on Cooperative Multimodal Communication, Theory & Applications. Eindhoven, Netherlands (1995)
-
(1995)
Proceedings of the International Conference on Cooperative Multimodal Communication, Theory & Applications
-
-
Siroux, J.1
Guyomard, M.2
Multon, F.3
Remondeau, C.4
-
17
-
-
0010644416
-
User and discourse models for multimodal communication
-
Chap. 3. Sullivan J.W., and Tyler S.W. (Eds), ACM Press, New York
-
Chap. 3. Wahlster W. User and discourse models for multimodal communication. In: Sullivan J.W., and Tyler S.W. (Eds). Intelligent User Interfaces (1991), ACM Press, New York 45-67
-
(1991)
Intelligent User Interfaces
, pp. 45-67
-
-
Wahlster, W.1
-
18
-
-
0038377045
-
Multimodal systems that process what comes naturally
-
Oviatt S.L., and Cohen P.R. Multimodal systems that process what comes naturally. Communications of the ACM 43 3 (2000) 45-53
-
(2000)
Communications of the ACM
, vol.43
, Issue.3
, pp. 45-53
-
-
Oviatt, S.L.1
Cohen, P.R.2
-
19
-
-
77956777662
-
-
Rubin P., Vatikiotis-Bateson E., and Benoit C. (Eds)
-
In: Rubin P., Vatikiotis-Bateson E., and Benoit C. (Eds). Speech Communication 26 (1998) 1-2
-
(1998)
Speech Communication
, vol.26
, pp. 1-2
-
-
-
20
-
-
0042161151
-
Multimodal Interfaces
-
Jacko J., and Sears A. (Eds), Lawrence Erlbaum, Mahwah, NJ
-
Oviatt S.L. Multimodal Interfaces. In: Jacko J., and Sears A. (Eds). Handbook of Human-Computer Interaction (2002), Lawrence Erlbaum, Mahwah, NJ
-
(2002)
Handbook of Human-Computer Interaction
-
-
Oviatt, S.L.1
-
21
-
-
85135134004
-
A rapid semi-automatic simulation technique for investigating interactive speech and handwriting
-
Oviatt S.L., Cohen P.R., Fong M.W., and Frank M.P. A rapid semi-automatic simulation technique for investigating interactive speech and handwriting. Proceedings of the International Conference on Spoken Language Processing, University of Alberta Vol. 2 (1992) 1351-1354
-
(1992)
Proceedings of the International Conference on Spoken Language Processing, University of Alberta
, vol.2
, pp. 1351-1354
-
-
Oviatt, S.L.1
Cohen, P.R.2
Fong, M.W.3
Frank, M.P.4
-
24
-
-
0030677453
-
Multimodal interfaces for multimedia information agents
-
IEEE Press, Menlo Park, CA
-
Waibel A., Suhm B., Vo M.T., and Yang J. Multimodal interfaces for multimedia information agents. Proceedings of the International Conference on Acoustics, Speech and Signal Processing (IEEE-ICASSP) Vol. 1 (1997), IEEE Press, Menlo Park, CA 167-170
-
(1997)
Proceedings of the International Conference on Acoustics, Speech and Signal Processing (IEEE-ICASSP)
, vol.1
, pp. 167-170
-
-
Waibel, A.1
Suhm, B.2
Vo, M.T.3
Yang, J.4
-
26
-
-
85009133573
-
Integrating multimodal language processing with speech recognition
-
Yuan B., Huang T., and Tang X. (Eds), Chinese Friendship, Beijing
-
Bangalore S., and Johnston M. Integrating multimodal language processing with speech recognition. In: Yuan B., Huang T., and Tang X. (Eds). Proceedings of the International Conference on Spoken Language Processing (ICSLP'2000) Vol. 2 (2000), Chinese Friendship, Beijing 126-129
-
(2000)
Proceedings of the International Conference on Spoken Language Processing (ICSLP'2000)
, vol.2
, pp. 126-129
-
-
Bangalore, S.1
Johnston, M.2
-
27
-
-
84947807484
-
Partial information in multimodal dialogue
-
Yuan B., Huang T., and Tang X. (Eds), Chinese Friendship, Beijing
-
Denecke M., and Yang J. Partial information in multimodal dialogue. In: Yuan B., Huang T., and Tang X. (Eds). Proceedings of the International Conference on Spoken Language Processing (ICSLIP'2000) (2000), Chinese Friendship, Beijing 624-633
-
(2000)
Proceedings of the International Conference on Spoken Language Processing (ICSLIP'2000)
, pp. 624-633
-
-
Denecke, M.1
Yang, J.2
-
29
-
-
0001514782
-
Modeling coarticulation in synthetic visible speech
-
Thalmann N.M., and Thalmann D. (Eds), Springer-Verlag, Berlin
-
Cohen M.M., and Massaro D.W. Modeling coarticulation in synthetic visible speech. In: Thalmann N.M., and Thalmann D. (Eds). Models and Techniques in Computer Animation (1993), Springer-Verlag, Berlin 139-156
-
(1993)
Models and Techniques in Computer Animation
, pp. 139-156
-
-
Cohen, M.M.1
Massaro, D.W.2
-
30
-
-
0032072433
-
Sensory integration and speechreading by humans and machines
-
Massaro D.W., and Stork D.G. Sensory integration and speechreading by humans and machines. American Scientist 86 (1998) 236-244
-
(1998)
American Scientist
, vol.86
, pp. 236-244
-
-
Massaro, D.W.1
Stork, D.G.2
-
31
-
-
0022019614
-
Intermodal timing relations and audiovisual speech recognition by normal-hearing adults
-
McGrath M., and Summerfield Q. Intermodal timing relations and audiovisual speech recognition by normal-hearing adults. Journal of the Acoustical Society of America 77 2 (1985) 678-685
-
(1985)
Journal of the Acoustical Society of America
, vol.77
, Issue.2
, pp. 678-685
-
-
McGrath, M.1
Summerfield, Q.2
-
32
-
-
0017199877
-
Hearing lips and seeing voices
-
McGurk H., and MacDonald J. Hearing lips and seeing voices. Nature 264 (1976) 746-748
-
(1976)
Nature
, vol.264
, pp. 746-748
-
-
McGurk, H.1
MacDonald, J.2
-
33
-
-
0023237267
-
Quantifying the contribution of vision to speech perception in noise
-
McLeod A., and Summerfield Q. Quantifying the contribution of vision to speech perception in noise. British Journal of Audiology 21 (1987) 131-141
-
(1987)
British Journal of Audiology
, vol.21
, pp. 131-141
-
-
McLeod, A.1
Summerfield, Q.2
-
34
-
-
0031747741
-
Complementarity and synergy in bimodal speech: Auditory, visual, and audio-visual identification of French oral vowels in noise
-
Robert-Ribes J., Schwartz J.L., Lallouache T., and Escudier P. Complementarity and synergy in bimodal speech: Auditory, visual, and audio-visual identification of French oral vowels in noise. Journal of the Acoustical Society of America 103 6 (1998) 3677-3689
-
(1998)
Journal of the Acoustical Society of America
, vol.103
, Issue.6
, pp. 3677-3689
-
-
Robert-Ribes, J.1
Schwartz, J.L.2
Lallouache, T.3
Escudier, P.4
-
37
-
-
0010605203
-
The dynamics of audiovisual behavior in speech
-
Speechreading by Humans and Machines: Models, Systems and Applications. Stork D.G., and Hennecke M.E. (Eds), Springer-Verlag, Berlin
-
Vatikiotis-Bateson E., Munhall K.G., Hirayama M., Lee Y.V., and Terzopoulos D. The dynamics of audiovisual behavior in speech. In: Stork D.G., and Hennecke M.E. (Eds). Speechreading by Humans and Machines: Models, Systems and Applications. NATO ASI Series, Series F: Computer and Systems Sciences 150 (1996), Springer-Verlag, Berlin 221-232
-
(1996)
NATO ASI Series, Series F: Computer and Systems Sciences
, vol.150
, pp. 221-232
-
-
Vatikiotis-Bateson, E.1
Munhall, K.G.2
Hirayama, M.3
Lee, Y.V.4
Terzopoulos, D.5
-
38
-
-
0003699540
-
Automatic Lipreading to Enhance Speech Recognition
-
University of Illinois at Urbana-Champaign
-
Petajan E.D. Automatic Lipreading to Enhance Speech Recognition. Ph.D. thesis (1984), University of Illinois at Urbana-Champaign
-
(1984)
Ph.D. thesis
-
-
Petajan, E.D.1
-
42
-
-
78650077027
-
Continuous Automatic Speech Recognition by Lipreading
-
Department of Electrical Engineering and Computer Science, George Washington University
-
Goldschen A.J. Continuous Automatic Speech Recognition by Lipreading. Ph.D. thesis (1993), Department of Electrical Engineering and Computer Science, George Washington University
-
(1993)
Ph.D. thesis
-
-
Goldschen, A.J.1
-
43
-
-
0010070142
-
Audiovisual sensory integration using Hidden Markov Models
-
Speechreading by Humans and Machines: Models, Systems and Applications. Stork D.G., and Hennecke M.E. (Eds), Springer-Verlag, Berlin
-
Silsbee P.L., and Su Q. Audiovisual sensory integration using Hidden Markov Models. In: Stork D.G., and Hennecke M.E. (Eds). Speechreading by Humans and Machines: Models, Systems and Applications. NATO ASI Series, Series F: Computer and Systems Sciences 150 (1996), Springer-Verlag, Berlin 489-504
-
(1996)
NATO ASI Series, Series F: Computer and Systems Sciences
, vol.150
, pp. 489-504
-
-
Silsbee, P.L.1
Su, Q.2
-
45
-
-
0003517572
-
-
Cassell J., Sullivan J., Prevost S., and Churchill E. (Eds), MIT Press, Cambridge, MA
-
In: Cassell J., Sullivan J., Prevost S., and Churchill E. (Eds). Embodied conversational agents (2000), MIT Press, Cambridge, MA
-
(2000)
Embodied conversational agents
-
-
-
46
-
-
0034270644
-
Audio-visual speech modeling for continuous speech recognition
-
Dupont S., and Luettin J. Audio-visual speech modeling for continuous speech recognition. IEEE Transactions on Multimedia 2 3 (2000) 141-151
-
(2000)
IEEE Transactions on Multimedia
, vol.2
, Issue.3
, pp. 141-151
-
-
Dupont, S.1
Luettin, J.2
-
47
-
-
0029725863
-
Adaptive bimodal sensor fusion for automatic speechreading
-
IEEE Press, Menlo Park, CA
-
Meier U., Hürst W., and Duchnowski P. Adaptive bimodal sensor fusion for automatic speechreading. Proceedings of the International Conference on Acoustics, Speech and Signal Processing (IEEE-ICASSP) (1996), IEEE Press, Menlo Park, CA 833-836
-
(1996)
Proceedings of the International Conference on Acoustics, Speech and Signal Processing (IEEE-ICASSP)
, pp. 833-836
-
-
Meier, U.1
Hürst, W.2
Duchnowski, P.3
-
48
-
-
0032180188
-
Adaptive fusion of acoustic and visual sources for automatic speech recognition
-
Rogozan A., and Deglise P. Adaptive fusion of acoustic and visual sources for automatic speech recognition. Speech Communication 26 1-2 (1998) 149-161
-
(1998)
Speech Communication
, vol.26
, Issue.1-2
, pp. 149-161
-
-
Rogozan, A.1
Deglise, P.2
-
49
-
-
0002267306
-
Multimodal person recognition using unconstrained audio and video
-
Washington, DC
-
Choudhury T., Clarkson B., Jebera T., and Pentland S. Multimodal person recognition using unconstrained audio and video. Proceedings of the 2nd International Conference on Audio-and-Video-based Biometric Person Authentication. Washington, DC (1999) 176-181
-
(1999)
Proceedings of the 2nd International Conference on Audio-and-Video-based Biometric Person Authentication
, pp. 176-181
-
-
Choudhury, T.1
Clarkson, B.2
Jebera, T.3
Pentland, S.4
-
50
-
-
77956750399
-
Retooling products so all can use them
-
June 21
-
June 21. Lee J. Retooling products so all can use them. New York Times (2001)
-
(2001)
New York Times
-
-
Lee, J.1
-
53
-
-
0347663785
-
Linguistic adaptation during error resolution with spoken and multimodal systems
-
(special issue on prosody and conversation)
-
(special issue on prosody and conversation). Oviatt S.L., Bernard J., and Levow G. Linguistic adaptation during error resolution with spoken and multimodal systems. Language and Speech 41 3-4 (1998) 419-442
-
(1998)
Language and Speech
, vol.41
, Issue.3-4
, pp. 419-442
-
-
Oviatt, S.L.1
Bernard, J.2
Levow, G.3
-
55
-
-
0005073850
-
Multimodal interactions in speech systems
-
Blattner M., and Dannenberg R. (Eds), ACM Press, New York
-
Rudnicky A., and Hauptman A. Multimodal interactions in speech systems. In: Blattner M., and Dannenberg R. (Eds). Multimedia Interface Design, Frontier Series (1992), ACM Press, New York 147-172
-
(1992)
Multimedia Interface Design, Frontier Series
, pp. 147-172
-
-
Rudnicky, A.1
Hauptman, A.2
-
56
-
-
4243792067
-
Multimodal Interactive Error Recovery for Non-conversational Speech User Interfaces
-
Karlsruhe University, Germany
-
Suhm B. Multimodal Interactive Error Recovery for Non-conversational Speech User Interfaces. Ph.D. thesis (1998), Karlsruhe University, Germany
-
(1998)
Ph.D. thesis
-
-
Suhm, B.1
-
57
-
-
0030687099
-
Multimodal interactive maps: Designing for human performance
-
(special issue on multimodal interfaces)
-
(special issue on multimodal interfaces). Oviatt S.L. Multimodal interactive maps: Designing for human performance. Human-Computer Interaction 12 (1997) 93-129
-
(1997)
Human-Computer Interaction
, vol.12
, pp. 93-129
-
-
Oviatt, S.L.1
-
58
-
-
85128403506
-
Referential features and linguistic indirection in multimodal language
-
Oviatt S.L., and Kuhn K. Referential features and linguistic indirection in multimodal language. Proceedings of the International Conference on Spoken Language Processing, ASSTA Inc., Sydney, Australia Vol. 6 (1998) 2339-2342
-
(1998)
Proceedings of the International Conference on Spoken Language Processing, ASSTA Inc., Sydney, Australia
, vol.6
, pp. 2339-2342
-
-
Oviatt, S.L.1
Kuhn, K.2
-
60
-
-
0002798273
-
Taming recognition errors with a multimodal architecture
-
(special issue on conversational interfaces)
-
(special issue on conversational interfaces). Oviatt S.L. Taming recognition errors with a multimodal architecture. Communications of the ACM 43 (2000) 45-51
-
(2000)
Communications of the ACM
, vol.43
, pp. 45-51
-
-
Oviatt, S.L.1
-
62
-
-
0008571386
-
Towards a robust speechreading dialog system
-
Speechreading by Humans and Machines: Models, Systems and Applications. Stork D.G., and Hennecke M.E. (Eds), Springer-Verlag, Berlin
-
Bregler C., Omohundro S.M., Shi J., and Konig Y. Towards a robust speechreading dialog system. In: Stork D.G., and Hennecke M.E. (Eds). Speechreading by Humans and Machines: Models, Systems and Applications. NATO ASI Series, Series F: Computer and Systems Sciences 150 (1996), Springer-Verlag, Berlin 409-423
-
(1996)
NATO ASI Series, Series F: Computer and Systems Sciences
, vol.150
, pp. 409-423
-
-
Bregler, C.1
Omohundro, S.M.2
Shi, J.3
Konig, Y.4
-
64
-
-
85009154155
-
Stream weight optimization of speech and lip, image sequence for audio-visual speech recognition
-
Yuan B., Huang T., and Tang X. (Eds), Chinese Friendship Publishers, Beijing
-
Nakamura S., Ito H., and Shikano K. Stream weight optimization of speech and lip, image sequence for audio-visual speech recognition. In: Yuan B., Huang T., and Tang X. (Eds). Proceedings of the International Conference on Spoken Language Processing (ICSLP 2000) Vol. 3 (2000), Chinese Friendship Publishers, Beijing 20-24
-
(2000)
Proceedings of the International Conference on Spoken Language Processing (ICSLP 2000)
, vol.3
, pp. 20-24
-
-
Nakamura, S.1
Ito, H.2
Shikano, K.3
-
65
-
-
85009153179
-
Stream confidence estimation for audiovisual speech recognition
-
Yuan B., Huang T., and Tang X. (Eds), Chinese Friendship Publishers, Beijing
-
Potamianos G., and Neti C. Stream confidence estimation for audiovisual speech recognition. In: Yuan B., Huang T., and Tang X. (Eds). Proceedings of the International Conference on Spoken Language Processing (ICSLP 2000) Vol. 3 (2000), Chinese Friendship Publishers, Beijing 746-749
-
(2000)
Proceedings of the International Conference on Spoken Language Processing (ICSLP 2000)
, vol.3
, pp. 746-749
-
-
Potamianos, G.1
Neti, C.2
-
66
-
-
0030247984
-
Computer lipreading for improved accuracy in automatic speech recognition
-
Silsbee P.L., and Bovik A.C. Computer lipreading for improved accuracy in automatic speech recognition. IEEE Transactions on Speech and Audio Processing 4 5 (1996) 337-351
-
(1996)
IEEE Transactions on Speech and Audio Processing
, vol.4
, Issue.5
, pp. 337-351
-
-
Silsbee, P.L.1
Bovik, A.C.2
-
67
-
-
0029757828
-
Biological and cognitive foundations of intelligent sensor fusion
-
Murphy R.R. Biological and cognitive foundations of intelligent sensor fusion. IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans 26 1 (1996) 42-51
-
(1996)
IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans
, vol.26
, Issue.1
, pp. 42-51
-
-
Murphy, R.R.1
-
68
-
-
0013012690
-
The functions of vision
-
Pick H.L., and Saltzman E. (Eds), Wiley, New York
-
Lee D. The functions of vision. In: Pick H.L., and Saltzman E. (Eds). Modes of Perceiving and Processing information (1978), Wiley, New York 159-170
-
(1978)
Modes of Perceiving and Processing information
, pp. 159-170
-
-
Lee, D.1
-
69
-
-
0141461865
-
Modes of perceiving and processing information
-
Pick Jr. H.L., and Saltzman E. (Eds), Wiley, New York
-
Pick H.L., and Saltzman E. Modes of perceiving and processing information. In: Pick Jr. H.L., and Saltzman E. (Eds). Modes of Perceiving and Processing Information (1978), Wiley, New York 1-20
-
(1978)
Modes of Perceiving and Processing Information
, pp. 1-20
-
-
Pick, H.L.1
Saltzman, E.2
-
70
-
-
77956722369
-
Information and effects of early perceptual experience
-
Eisenberg N. (Ed), Wiley, New York
-
Pick H. Information and effects of early perceptual experience. In: Eisenberg N. (Ed). Contemporary Topics in Developmental Psychology (1987), Wiley, New York 59-76
-
(1987)
Contemporary Topics in Developmental Psychology
, pp. 59-76
-
-
Pick, H.1
-
73
-
-
0040111438
-
The evolution of sensory systems
-
MacLeod R.B., and Pick Jr. H.L. (Eds), Cornell University Press, Ithaca, NY
-
Bower T.G.R. The evolution of sensory systems. In: MacLeod R.B., and Pick Jr. H.L. (Eds). Perception: Essays in Honor of James J. Gibson (1974), Cornell University Press, Ithaca, NY 141-153
-
(1974)
Perception: Essays in Honor of James J. Gibson
, pp. 141-153
-
-
Bower, T.G.R.1
-
74
-
-
0348021772
-
The functional integrity of spatial behavior
-
Freedman S.J. (Ed), Dorsey Press, Homewood, IL
-
Freedman S.J., and Rekosh J.H. The functional integrity of spatial behavior. In: Freedman S.J. (Ed). The Neuropsychology of Spatially-Oriented Behavior (1968), Dorsey Press, Homewood, IL 153-162
-
(1968)
The Neuropsychology of Spatially-Oriented Behavior
, pp. 153-162
-
-
Freedman, S.J.1
Rekosh, J.H.2
-
75
-
-
0003225664
-
Some aspects of sensory-motor control and adaptation in man
-
Walk R.D., and Pick H.L. (Eds), Plenum, New York
-
Lackner J.R. Some aspects of sensory-motor control and adaptation in man. In: Walk R.D., and Pick H.L. (Eds). Intersensory Perception and Sensory Integration (1981), Plenum, New York 143-173
-
(1981)
Intersensory Perception and Sensory Integration
, pp. 143-173
-
-
Lackner, J.R.1
-
77
-
-
0003799851
-
Model-based sensor fusion for aviation
-
Pavel M., and Sharma R.K. Model-based sensor fusion for aviation. Proceedings of SPIE 3088 (1997) 169-176
-
(1997)
Proceedings of SPIE
, vol.3088
, pp. 169-176
-
-
Pavel, M.1
Sharma, R.K.2
-
79
-
-
0004990976
-
System descriptions and performance summary
-
Morgan Kaufman, San Mateo, CA
-
Martin A., Fiscus J., Fisher B., Pallet D., and Przybocki M. System descriptions and performance summary. Proceedings of the Conversational Speech Recognition Workshop/DARPA Hub-5E Evaluation (1997), Morgan Kaufman, San Mateo, CA
-
(1997)
Proceedings of the Conversational Speech Recognition Workshop/DARPA Hub-5E Evaluation
-
-
Martin, A.1
Fiscus, J.2
Fisher, B.3
Pallet, D.4
Przybocki, M.5
-
80
-
-
0043086491
-
Effect of speaking style on LVCSR performance
-
Morgan Kaufman, San Mateo, CA
-
Weintraub M., Taussig K., Hunicke K., and Snodgrass A. Effect of speaking style on LVCSR performance. Proceedings of the Conversational Speech Recognition Workshop/DARPA Hub-5E Evaluation (1997), Morgan Kaufman, San Mateo, CA
-
(1997)
Proceedings of the Conversational Speech Recognition Workshop/DARPA Hub-5E Evaluation
-
-
Weintraub, M.1
Taussig, K.2
Hunicke, K.3
Snodgrass, A.4
-
81
-
-
0032075546
-
Predicting hyperarticulate speech during human-computer error resolution
-
Oviatt S.L., MacEachern M., and Levow G. Predicting hyperarticulate speech during human-computer error resolution. Speech Communication 24 (1998) 87-110
-
(1998)
Speech Communication
, vol.24
, pp. 87-110
-
-
Oviatt, S.L.1
MacEachern, M.2
Levow, G.3
-
83
-
-
84946757864
-
How effective is unsupervised data collection for children's speech recognition?
-
Aist G., Chan P., Huang X., Jiang L., Kennedy R., Latimer D., Mostow J., and Yeung C. How effective is unsupervised data collection for children's speech recognition?. Proceedings of the International Conference on Spoken Language Processing, ASSTA Inc., Sydney Vol. 7 (1998) 3171-3174
-
(1998)
Proceedings of the International Conference on Spoken Language Processing, ASSTA Inc., Sydney
, vol.7
, pp. 3171-3174
-
-
Aist, G.1
Chan, P.2
Huang, X.3
Jiang, L.4
Kennedy, R.5
Latimer, D.6
Mostow, J.7
Yeung, C.8
-
84
-
-
0031644298
-
Improvements in children's speech recognition performance
-
IEEE Press, Menlo Park, CA
-
Das S., Nix D., and Picheny M. Improvements in children's speech recognition performance. Proceedings of the International Conference on Acoustics, Speech and Signal Processing Vol. 1 (1998), IEEE Press, Menlo Park, CA 433-436
-
(1998)
Proceedings of the International Conference on Acoustics, Speech and Signal Processing
, vol.1
, pp. 433-436
-
-
Das, S.1
Nix, D.2
Picheny, M.3
-
88
-
-
0002792478
-
-
Yeni-Komshian G., Kavanaugh J., and Ferguson C. (Eds), Academic Press, New York
-
In: Yeni-Komshian G., Kavanaugh J., and Ferguson C. (Eds). Child Phonology, Vol. 1: Production (1980), Academic Press, New York
-
(1980)
Child Phonology, Vol. 1: Production
-
-
-
89
-
-
0027229711
-
Influence of background noise and microphone on the performance of the IBM TANGORA speech recognition system
-
Das S., Bakis R., Nadas A., Nahamoo D., and Picheny M. Influence of background noise and microphone on the performance of the IBM TANGORA speech recognition system. Proceedings of the IEEE International Conference on Acoustic Speech Signal Processing Vol. 2 (1993) 71-74
-
(1993)
Proceedings of the IEEE International Conference on Acoustic Speech Signal Processing
, vol.2
, pp. 71-74
-
-
Das, S.1
Bakis, R.2
Nadas, A.3
Nahamoo, D.4
Picheny, M.5
-
90
-
-
0029288202
-
Speech recognition in noisy environments
-
Gong Y. Speech recognition in noisy environments. Speech Communication 16 (1995) 261-291
-
(1995)
Speech Communication
, vol.16
, pp. 261-291
-
-
Gong, Y.1
-
91
-
-
0026882842
-
Experiments with a non-linear spectral subtractor (NSS), Hidden Markov Models and the projection for robust speech recognition in cars
-
Lockwood P., and Boudy J. Experiments with a non-linear spectral subtractor (NSS), Hidden Markov Models and the projection for robust speech recognition in cars. Speech Communication 11 (1992) 2-3
-
(1992)
Speech Communication
, vol.11
, pp. 2-3
-
-
Lockwood, P.1
Boudy, J.2
-
92
-
-
0026882842
-
Experiments with a non-linear spectral subtractor (NSS), Hidden Markov Models and the projection for robust speech recognition in cars
-
Lockwood P., and Boudy J. Experiments with a non-linear spectral subtractor (NSS), Hidden Markov Models and the projection for robust speech recognition in cars. Speech Communication 11 (1992) 215-228
-
(1992)
Speech Communication
, vol.11
, pp. 215-228
-
-
Lockwood, P.1
Boudy, J.2
-
93
-
-
0027465491
-
The Lombard reflex and its role on human listeners and automatic speech recognizers
-
Junqua J.C. The Lombard reflex and its role on human listeners and automatic speech recognizers. Journal of the Acoustical Society of America 93 1 (1993) 510-524
-
(1993)
Journal of the Acoustical Society of America
, vol.93
, Issue.1
, pp. 510-524
-
-
Junqua, J.C.1
-
95
-
-
0039915438
-
Effect of level of distracting noise upon speaking rate, duration and intensity
-
Hanley T.D., and Steer M.D. Effect of level of distracting noise upon speaking rate, duration and intensity. Journal of Speech and Hearing Disorders 14 (1949) 363-368
-
(1949)
Journal of Speech and Hearing Disorders
, vol.14
, pp. 363-368
-
-
Hanley, T.D.1
Steer, M.D.2
-
97
-
-
0023786666
-
Effects of noise on speech production: Acoustic and perceptual analyses
-
van Summers W.V., Pisoni D.B., Bernacki R.H., Pedlow R.I., and Stokes M.A. Effects of noise on speech production: Acoustic and perceptual analyses. Journal of the Acoustical Society of America 84 (1988) 917-928
-
(1988)
Journal of the Acoustical Society of America
, vol.84
, pp. 917-928
-
-
van Summers, W.V.1
Pisoni, D.B.2
Bernacki, R.H.3
Pedlow, R.I.4
Stokes, M.A.5
-
98
-
-
33646663508
-
A signal detection problem and a possible solution in Japanese quail
-
Potash L.M. A signal detection problem and a possible solution in Japanese quail. Animal Behavior 20 (1972) 192-195
-
(1972)
Animal Behavior
, vol.20
, pp. 192-195
-
-
Potash, L.M.1
-
100
-
-
0009778876
-
Auditory feedback in the regulation of vocal intensity of preschool children
-
Siegel G.M., Pick H.L., Olsen M.G., and Sawin L. Auditory feedback in the regulation of vocal intensity of preschool children. Developmental Psychology 12 (1976) 255-261
-
(1976)
Developmental Psychology
, vol.12
, pp. 255-261
-
-
Siegel, G.M.1
Pick, H.L.2
Olsen, M.G.3
Sawin, L.4
-
101
-
-
0024534402
-
Inhibiting the Lombard effect
-
Pick H.L., Siegel G.M., Fox P.W., Garber S.R., and Kearney J.K. Inhibiting the Lombard effect. Journal of the Acoustical Society of America 85 2 (1989) 894-900
-
(1989)
Journal of the Acoustical Society of America
, vol.85
, Issue.2
, pp. 894-900
-
-
Pick, H.L.1
Siegel, G.M.2
Fox, P.W.3
Garber, S.R.4
Kearney, J.K.5
-
102
-
-
0005454347
-
Perception of conflicting audio-visual speech: An examination across Spanish and German
-
Speechreading by Humans and Machines: Models, Systems and Applications. Stork D.G., and Hennecke M.E. (Eds), Springer-Verlag, Berlin
-
Fuster-Duran A. Perception of conflicting audio-visual speech: An examination across Spanish and German. In: Stork D.G., and Hennecke M.E. (Eds). Speechreading by Humans and Machines: Models, Systems and Applications. NATO ASI Series, Series F: Computer and Systems Sciences 150 (1996), Springer-Verlag, Berlin 135-143
-
(1996)
NATO ASI Series, Series F: Computer and Systems Sciences
, vol.150
, pp. 135-143
-
-
Fuster-Duran, A.1
-
103
-
-
10644227100
-
Bimodal speech perception: A progress report
-
Speechreading by Humans and Machines: Models, Systems and Applications. Stork D.G., and Hennecke M.E. (Eds), Springer-Verlag, Berlin
-
Massaro D.W. Bimodal speech perception: A progress report. In: Stork D.G., and Hennecke M.E. (Eds). Speechreading by Humans and Machines: Models, Systems and Applications. NATO ASI Series, Series F: Computer and Systems Sciences 150 (1996), Springer-Verlag, Berlin 79-101
-
(1996)
NATO ASI Series, Series F: Computer and Systems Sciences
, vol.150
, pp. 79-101
-
-
Massaro, D.W.1
-
104
-
-
0000417467
-
Visionary speech: Looking ahead to practical speechreading systems
-
Speechreading by Humans and Machines: Models, Systems and Applications. Stork D.G., and Hennecke M.E. (Eds), Springer-Verlag, Berlin
-
Hennecke M.E., Stork D.G., and Prasad K.V. Visionary speech: Looking ahead to practical speechreading systems. In: Stork D.G., and Hennecke M.E. (Eds). Speechreading by Humans and Machines: Models, Systems and Applications. NATO ASI Series, Series F: Computer and Systems Sciences 150 (1996), Springer-Verlag, Berlin 331-349
-
(1996)
NATO ASI Series, Series F: Computer and Systems Sciences
, vol.150
, pp. 331-349
-
-
Hennecke, M.E.1
Stork, D.G.2
Prasad, K.V.3
-
107
-
-
85009088524
-
Multimodal signal processing in naturalistic noisy environments
-
Yuan B., Huang T., and Tang X. (Eds), Chinese Friendship Publishers, Beijing
-
Oviatt S.L. Multimodal signal processing in naturalistic noisy environments. In: Yuan B., Huang T., and Tang X. (Eds). Proceedings of the International Conference on Spoken Language Processing (ICSLP'2000) Vol. 2 (2000), Chinese Friendship Publishers, Beijing 696-699
-
(2000)
Proceedings of the International Conference on Spoken Language Processing (ICSLP'2000)
, vol.2
, pp. 696-699
-
-
Oviatt, S.L.1
-
108
-
-
0002028032
-
Some preliminaries to a comprehensive account of audio-visual speech perception
-
Dodd B., and Campbell R. (Eds), Lawrence Erlbaum, London
-
Summerfield Q. Some preliminaries to a comprehensive account of audio-visual speech perception. In: Dodd B., and Campbell R. (Eds). Hearing by Eye: The Psychology of Lip-reading (1987), Lawrence Erlbaum, London 3-51
-
(1987)
Hearing by Eye: The Psychology of Lip-reading
, pp. 3-51
-
-
Summerfield, Q.1
-
109
-
-
4244043499
-
An improved automatic lipreading system to enhance speech recognition
-
AT&T Bell Labs
-
Petajan E.D. An improved automatic lipreading system to enhance speech recognition. Tech. Rep. 11251-871012-111TM (1987), AT&T Bell Labs
-
(1987)
Tech. Rep. 11251-871012-111TM
-
-
Petajan, E.D.1
-
110
-
-
0032179207
-
Modeling the interaction of phonemic intelligibility and lexical structure in audiovisual word recognition
-
Iverson P., Bernstein L., and Auer E. Modeling the interaction of phonemic intelligibility and lexical structure in audiovisual word recognition. Speech Communication 26 (1998) 1-2
-
(1998)
Speech Communication
, vol.26
, pp. 1-2
-
-
Iverson, P.1
Bernstein, L.2
Auer, E.3
-
111
-
-
0032179207
-
Modeling the interaction of phonemic intelligibility and lexical structure in audiovisual word recognition
-
Iverson P., Bernstein L., and Auer E. Modeling the interaction of phonemic intelligibility and lexical structure in audiovisual word recognition. Speech Communication 26 (1998) 45-63
-
(1998)
Speech Communication
, vol.26
, pp. 45-63
-
-
Iverson, P.1
Bernstein, L.2
Auer, E.3
-
112
-
-
0028710004
-
Toward interface design for human language technology: Modality and structure as determinants of linguistic complexity
-
Oviatt S.L., Cohen P.R., and Wang M.Q. Toward interface design for human language technology: Modality and structure as determinants of linguistic complexity. Speech Communication 15 (1994) 3-4
-
(1994)
Speech Communication
, vol.15
, pp. 3-4
-
-
Oviatt, S.L.1
Cohen, P.R.2
Wang, M.Q.3
-
113
-
-
0028710004
-
Toward interface design for human language technology: Modality and structure as determinants of linguistic complexity
-
Oviatt S.L., Cohen P.R., and Wang M.Q. Toward interface design for human language technology: Modality and structure as determinants of linguistic complexity. Speech Communication 15 (1994) 283-300
-
(1994)
Speech Communication
, vol.15
, pp. 283-300
-
-
Oviatt, S.L.1
Cohen, P.R.2
Wang, M.Q.3
|