메뉴 건너뛰기




Volumn 56, Issue C, 2002, Pages 305-341

Breaking the robustness barrier: Recent progress on the design of robust multimodal systems

Author keywords

[No Author keywords available]

Indexed keywords


EID: 77956782689     PISSN: 00652458     EISSN: None     Source Type: Book Series    
DOI: 10.1016/S0065-2458(02)80009-2     Document Type: Chapter
Times cited : (55)

References (114)
  • 5
    • 0033879165 scopus 로고    scopus 로고
    • Pankanti S., Bolle R.M., and Jain A. (Eds) 2
    • In: Pankanti S., Bolle R.M., and Jain A. (Eds). Biometrics: The future of identification. Computer 33 (2000) 46-80 2
    • (2000) Computer , vol.33 , pp. 46-80
  • 6
    • 0032178686 scopus 로고    scopus 로고
    • Audio-visual speech synthesis from French text: Eight years of models, designs and evaluation at the ICP
    • Benoit C., and Le Goff B. Audio-visual speech synthesis from French text: Eight years of models, designs and evaluation at the ICP. Speech Communication 26 (1998) 117-129
    • (1998) Speech Communication , vol.26 , pp. 117-129
    • Benoit, C.1    Le Goff, B.2
  • 8
    • 0003544881 scopus 로고    scopus 로고
    • Stork D.G., and Hennecke M.E. (Eds), Springer-Verlag, New York
    • In: Stork D.G., and Hennecke M.E. (Eds). Speechreading by Humans and Machines (1996), Springer-Verlag, New York
    • (1996) Speechreading by Humans and Machines
  • 9
    • 0041827542 scopus 로고    scopus 로고
    • Turk M., and Robertson G. (Eds)
    • In: Turk M., and Robertson G. (Eds). Perceptual user interfaces. Communications of the ACM 43 3 (2000) 32-70
    • (2000) Communications of the ACM , vol.43 , Issue.3 , pp. 32-70
  • 11
    • 0019038072 scopus 로고
    • Put-that-there: Voice and gesture at the graphics interface
    • Bolt R.A. Put-that-there: Voice and gesture at the graphics interface. Computer Graphics 14 3 (1980) 262-270
    • (1980) Computer Graphics , vol.14 , Issue.3 , pp. 262-270
    • Bolt, R.A.1
  • 14
    • 0002097166 scopus 로고
    • Intelligent multimedia interface technology
    • Sullivan J.W., and Tyler S.W. (Eds), ACM Press, New York
    • Neal J.G., and Shapiro S.C. Intelligent multimedia interface technology. In: Sullivan J.W., and Tyler S.W. (Eds). Intelligent User Interfaces (1991), ACM Press, New York 11-43
    • (1991) Intelligent User Interfaces , pp. 11-43
    • Neal, J.G.1    Shapiro, S.C.2
  • 15
    • 0030355073 scopus 로고    scopus 로고
    • Multimodal discourse modeling in a multi-user multi-domain environment
    • Bunnell T., and Idsardi W. (Eds), University of Delaware and A. I. duPont Institute
    • Seneff S., Goddeau D., Pao C., and Polifroni J. Multimodal discourse modeling in a multi-user multi-domain environment. In: Bunnell T., and Idsardi W. (Eds). Proceedings of the International Conference on Spoken Language Processing Vol. 1 (1996), University of Delaware and A. I. duPont Institute 192-195
    • (1996) Proceedings of the International Conference on Spoken Language Processing , vol.1 , pp. 192-195
    • Seneff, S.1    Goddeau, D.2    Pao, C.3    Polifroni, J.4
  • 17
    • 0010644416 scopus 로고
    • User and discourse models for multimodal communication
    • Chap. 3. Sullivan J.W., and Tyler S.W. (Eds), ACM Press, New York
    • Chap. 3. Wahlster W. User and discourse models for multimodal communication. In: Sullivan J.W., and Tyler S.W. (Eds). Intelligent User Interfaces (1991), ACM Press, New York 45-67
    • (1991) Intelligent User Interfaces , pp. 45-67
    • Wahlster, W.1
  • 18
    • 0038377045 scopus 로고    scopus 로고
    • Multimodal systems that process what comes naturally
    • Oviatt S.L., and Cohen P.R. Multimodal systems that process what comes naturally. Communications of the ACM 43 3 (2000) 45-53
    • (2000) Communications of the ACM , vol.43 , Issue.3 , pp. 45-53
    • Oviatt, S.L.1    Cohen, P.R.2
  • 19
    • 77956777662 scopus 로고    scopus 로고
    • Rubin P., Vatikiotis-Bateson E., and Benoit C. (Eds)
    • In: Rubin P., Vatikiotis-Bateson E., and Benoit C. (Eds). Speech Communication 26 (1998) 1-2
    • (1998) Speech Communication , vol.26 , pp. 1-2
  • 20
    • 0042161151 scopus 로고    scopus 로고
    • Multimodal Interfaces
    • Jacko J., and Sears A. (Eds), Lawrence Erlbaum, Mahwah, NJ
    • Oviatt S.L. Multimodal Interfaces. In: Jacko J., and Sears A. (Eds). Handbook of Human-Computer Interaction (2002), Lawrence Erlbaum, Mahwah, NJ
    • (2002) Handbook of Human-Computer Interaction
    • Oviatt, S.L.1
  • 26
    • 85009133573 scopus 로고    scopus 로고
    • Integrating multimodal language processing with speech recognition
    • Yuan B., Huang T., and Tang X. (Eds), Chinese Friendship, Beijing
    • Bangalore S., and Johnston M. Integrating multimodal language processing with speech recognition. In: Yuan B., Huang T., and Tang X. (Eds). Proceedings of the International Conference on Spoken Language Processing (ICSLP'2000) Vol. 2 (2000), Chinese Friendship, Beijing 126-129
    • (2000) Proceedings of the International Conference on Spoken Language Processing (ICSLP'2000) , vol.2 , pp. 126-129
    • Bangalore, S.1    Johnston, M.2
  • 29
    • 0001514782 scopus 로고
    • Modeling coarticulation in synthetic visible speech
    • Thalmann N.M., and Thalmann D. (Eds), Springer-Verlag, Berlin
    • Cohen M.M., and Massaro D.W. Modeling coarticulation in synthetic visible speech. In: Thalmann N.M., and Thalmann D. (Eds). Models and Techniques in Computer Animation (1993), Springer-Verlag, Berlin 139-156
    • (1993) Models and Techniques in Computer Animation , pp. 139-156
    • Cohen, M.M.1    Massaro, D.W.2
  • 30
    • 0032072433 scopus 로고    scopus 로고
    • Sensory integration and speechreading by humans and machines
    • Massaro D.W., and Stork D.G. Sensory integration and speechreading by humans and machines. American Scientist 86 (1998) 236-244
    • (1998) American Scientist , vol.86 , pp. 236-244
    • Massaro, D.W.1    Stork, D.G.2
  • 31
    • 0022019614 scopus 로고
    • Intermodal timing relations and audiovisual speech recognition by normal-hearing adults
    • McGrath M., and Summerfield Q. Intermodal timing relations and audiovisual speech recognition by normal-hearing adults. Journal of the Acoustical Society of America 77 2 (1985) 678-685
    • (1985) Journal of the Acoustical Society of America , vol.77 , Issue.2 , pp. 678-685
    • McGrath, M.1    Summerfield, Q.2
  • 32
    • 0017199877 scopus 로고
    • Hearing lips and seeing voices
    • McGurk H., and MacDonald J. Hearing lips and seeing voices. Nature 264 (1976) 746-748
    • (1976) Nature , vol.264 , pp. 746-748
    • McGurk, H.1    MacDonald, J.2
  • 33
    • 0023237267 scopus 로고
    • Quantifying the contribution of vision to speech perception in noise
    • McLeod A., and Summerfield Q. Quantifying the contribution of vision to speech perception in noise. British Journal of Audiology 21 (1987) 131-141
    • (1987) British Journal of Audiology , vol.21 , pp. 131-141
    • McLeod, A.1    Summerfield, Q.2
  • 34
    • 0031747741 scopus 로고    scopus 로고
    • Complementarity and synergy in bimodal speech: Auditory, visual, and audio-visual identification of French oral vowels in noise
    • Robert-Ribes J., Schwartz J.L., Lallouache T., and Escudier P. Complementarity and synergy in bimodal speech: Auditory, visual, and audio-visual identification of French oral vowels in noise. Journal of the Acoustical Society of America 103 6 (1998) 3677-3689
    • (1998) Journal of the Acoustical Society of America , vol.103 , Issue.6 , pp. 3677-3689
    • Robert-Ribes, J.1    Schwartz, J.L.2    Lallouache, T.3    Escudier, P.4
  • 37
    • 0010605203 scopus 로고    scopus 로고
    • The dynamics of audiovisual behavior in speech
    • Speechreading by Humans and Machines: Models, Systems and Applications. Stork D.G., and Hennecke M.E. (Eds), Springer-Verlag, Berlin
    • Vatikiotis-Bateson E., Munhall K.G., Hirayama M., Lee Y.V., and Terzopoulos D. The dynamics of audiovisual behavior in speech. In: Stork D.G., and Hennecke M.E. (Eds). Speechreading by Humans and Machines: Models, Systems and Applications. NATO ASI Series, Series F: Computer and Systems Sciences 150 (1996), Springer-Verlag, Berlin 221-232
    • (1996) NATO ASI Series, Series F: Computer and Systems Sciences , vol.150 , pp. 221-232
    • Vatikiotis-Bateson, E.1    Munhall, K.G.2    Hirayama, M.3    Lee, Y.V.4    Terzopoulos, D.5
  • 38
    • 0003699540 scopus 로고
    • Automatic Lipreading to Enhance Speech Recognition
    • University of Illinois at Urbana-Champaign
    • Petajan E.D. Automatic Lipreading to Enhance Speech Recognition. Ph.D. thesis (1984), University of Illinois at Urbana-Champaign
    • (1984) Ph.D. thesis
    • Petajan, E.D.1
  • 42
    • 78650077027 scopus 로고
    • Continuous Automatic Speech Recognition by Lipreading
    • Department of Electrical Engineering and Computer Science, George Washington University
    • Goldschen A.J. Continuous Automatic Speech Recognition by Lipreading. Ph.D. thesis (1993), Department of Electrical Engineering and Computer Science, George Washington University
    • (1993) Ph.D. thesis
    • Goldschen, A.J.1
  • 43
    • 0010070142 scopus 로고    scopus 로고
    • Audiovisual sensory integration using Hidden Markov Models
    • Speechreading by Humans and Machines: Models, Systems and Applications. Stork D.G., and Hennecke M.E. (Eds), Springer-Verlag, Berlin
    • Silsbee P.L., and Su Q. Audiovisual sensory integration using Hidden Markov Models. In: Stork D.G., and Hennecke M.E. (Eds). Speechreading by Humans and Machines: Models, Systems and Applications. NATO ASI Series, Series F: Computer and Systems Sciences 150 (1996), Springer-Verlag, Berlin 489-504
    • (1996) NATO ASI Series, Series F: Computer and Systems Sciences , vol.150 , pp. 489-504
    • Silsbee, P.L.1    Su, Q.2
  • 45
    • 0003517572 scopus 로고    scopus 로고
    • Cassell J., Sullivan J., Prevost S., and Churchill E. (Eds), MIT Press, Cambridge, MA
    • In: Cassell J., Sullivan J., Prevost S., and Churchill E. (Eds). Embodied conversational agents (2000), MIT Press, Cambridge, MA
    • (2000) Embodied conversational agents
  • 46
    • 0034270644 scopus 로고    scopus 로고
    • Audio-visual speech modeling for continuous speech recognition
    • Dupont S., and Luettin J. Audio-visual speech modeling for continuous speech recognition. IEEE Transactions on Multimedia 2 3 (2000) 141-151
    • (2000) IEEE Transactions on Multimedia , vol.2 , Issue.3 , pp. 141-151
    • Dupont, S.1    Luettin, J.2
  • 48
    • 0032180188 scopus 로고    scopus 로고
    • Adaptive fusion of acoustic and visual sources for automatic speech recognition
    • Rogozan A., and Deglise P. Adaptive fusion of acoustic and visual sources for automatic speech recognition. Speech Communication 26 1-2 (1998) 149-161
    • (1998) Speech Communication , vol.26 , Issue.1-2 , pp. 149-161
    • Rogozan, A.1    Deglise, P.2
  • 50
    • 77956750399 scopus 로고    scopus 로고
    • Retooling products so all can use them
    • June 21
    • June 21. Lee J. Retooling products so all can use them. New York Times (2001)
    • (2001) New York Times
    • Lee, J.1
  • 53
    • 0347663785 scopus 로고    scopus 로고
    • Linguistic adaptation during error resolution with spoken and multimodal systems
    • (special issue on prosody and conversation)
    • (special issue on prosody and conversation). Oviatt S.L., Bernard J., and Levow G. Linguistic adaptation during error resolution with spoken and multimodal systems. Language and Speech 41 3-4 (1998) 419-442
    • (1998) Language and Speech , vol.41 , Issue.3-4 , pp. 419-442
    • Oviatt, S.L.1    Bernard, J.2    Levow, G.3
  • 55
    • 0005073850 scopus 로고
    • Multimodal interactions in speech systems
    • Blattner M., and Dannenberg R. (Eds), ACM Press, New York
    • Rudnicky A., and Hauptman A. Multimodal interactions in speech systems. In: Blattner M., and Dannenberg R. (Eds). Multimedia Interface Design, Frontier Series (1992), ACM Press, New York 147-172
    • (1992) Multimedia Interface Design, Frontier Series , pp. 147-172
    • Rudnicky, A.1    Hauptman, A.2
  • 56
    • 4243792067 scopus 로고    scopus 로고
    • Multimodal Interactive Error Recovery for Non-conversational Speech User Interfaces
    • Karlsruhe University, Germany
    • Suhm B. Multimodal Interactive Error Recovery for Non-conversational Speech User Interfaces. Ph.D. thesis (1998), Karlsruhe University, Germany
    • (1998) Ph.D. thesis
    • Suhm, B.1
  • 57
    • 0030687099 scopus 로고    scopus 로고
    • Multimodal interactive maps: Designing for human performance
    • (special issue on multimodal interfaces)
    • (special issue on multimodal interfaces). Oviatt S.L. Multimodal interactive maps: Designing for human performance. Human-Computer Interaction 12 (1997) 93-129
    • (1997) Human-Computer Interaction , vol.12 , pp. 93-129
    • Oviatt, S.L.1
  • 60
    • 0002798273 scopus 로고    scopus 로고
    • Taming recognition errors with a multimodal architecture
    • (special issue on conversational interfaces)
    • (special issue on conversational interfaces). Oviatt S.L. Taming recognition errors with a multimodal architecture. Communications of the ACM 43 (2000) 45-51
    • (2000) Communications of the ACM , vol.43 , pp. 45-51
    • Oviatt, S.L.1
  • 62
    • 0008571386 scopus 로고    scopus 로고
    • Towards a robust speechreading dialog system
    • Speechreading by Humans and Machines: Models, Systems and Applications. Stork D.G., and Hennecke M.E. (Eds), Springer-Verlag, Berlin
    • Bregler C., Omohundro S.M., Shi J., and Konig Y. Towards a robust speechreading dialog system. In: Stork D.G., and Hennecke M.E. (Eds). Speechreading by Humans and Machines: Models, Systems and Applications. NATO ASI Series, Series F: Computer and Systems Sciences 150 (1996), Springer-Verlag, Berlin 409-423
    • (1996) NATO ASI Series, Series F: Computer and Systems Sciences , vol.150 , pp. 409-423
    • Bregler, C.1    Omohundro, S.M.2    Shi, J.3    Konig, Y.4
  • 64
    • 85009154155 scopus 로고    scopus 로고
    • Stream weight optimization of speech and lip, image sequence for audio-visual speech recognition
    • Yuan B., Huang T., and Tang X. (Eds), Chinese Friendship Publishers, Beijing
    • Nakamura S., Ito H., and Shikano K. Stream weight optimization of speech and lip, image sequence for audio-visual speech recognition. In: Yuan B., Huang T., and Tang X. (Eds). Proceedings of the International Conference on Spoken Language Processing (ICSLP 2000) Vol. 3 (2000), Chinese Friendship Publishers, Beijing 20-24
    • (2000) Proceedings of the International Conference on Spoken Language Processing (ICSLP 2000) , vol.3 , pp. 20-24
    • Nakamura, S.1    Ito, H.2    Shikano, K.3
  • 65
    • 85009153179 scopus 로고    scopus 로고
    • Stream confidence estimation for audiovisual speech recognition
    • Yuan B., Huang T., and Tang X. (Eds), Chinese Friendship Publishers, Beijing
    • Potamianos G., and Neti C. Stream confidence estimation for audiovisual speech recognition. In: Yuan B., Huang T., and Tang X. (Eds). Proceedings of the International Conference on Spoken Language Processing (ICSLP 2000) Vol. 3 (2000), Chinese Friendship Publishers, Beijing 746-749
    • (2000) Proceedings of the International Conference on Spoken Language Processing (ICSLP 2000) , vol.3 , pp. 746-749
    • Potamianos, G.1    Neti, C.2
  • 66
    • 0030247984 scopus 로고    scopus 로고
    • Computer lipreading for improved accuracy in automatic speech recognition
    • Silsbee P.L., and Bovik A.C. Computer lipreading for improved accuracy in automatic speech recognition. IEEE Transactions on Speech and Audio Processing 4 5 (1996) 337-351
    • (1996) IEEE Transactions on Speech and Audio Processing , vol.4 , Issue.5 , pp. 337-351
    • Silsbee, P.L.1    Bovik, A.C.2
  • 68
    • 0013012690 scopus 로고
    • The functions of vision
    • Pick H.L., and Saltzman E. (Eds), Wiley, New York
    • Lee D. The functions of vision. In: Pick H.L., and Saltzman E. (Eds). Modes of Perceiving and Processing information (1978), Wiley, New York 159-170
    • (1978) Modes of Perceiving and Processing information , pp. 159-170
    • Lee, D.1
  • 69
    • 0141461865 scopus 로고
    • Modes of perceiving and processing information
    • Pick Jr. H.L., and Saltzman E. (Eds), Wiley, New York
    • Pick H.L., and Saltzman E. Modes of perceiving and processing information. In: Pick Jr. H.L., and Saltzman E. (Eds). Modes of Perceiving and Processing Information (1978), Wiley, New York 1-20
    • (1978) Modes of Perceiving and Processing Information , pp. 1-20
    • Pick, H.L.1    Saltzman, E.2
  • 70
    • 77956722369 scopus 로고
    • Information and effects of early perceptual experience
    • Eisenberg N. (Ed), Wiley, New York
    • Pick H. Information and effects of early perceptual experience. In: Eisenberg N. (Ed). Contemporary Topics in Developmental Psychology (1987), Wiley, New York 59-76
    • (1987) Contemporary Topics in Developmental Psychology , pp. 59-76
    • Pick, H.1
  • 73
    • 0040111438 scopus 로고
    • The evolution of sensory systems
    • MacLeod R.B., and Pick Jr. H.L. (Eds), Cornell University Press, Ithaca, NY
    • Bower T.G.R. The evolution of sensory systems. In: MacLeod R.B., and Pick Jr. H.L. (Eds). Perception: Essays in Honor of James J. Gibson (1974), Cornell University Press, Ithaca, NY 141-153
    • (1974) Perception: Essays in Honor of James J. Gibson , pp. 141-153
    • Bower, T.G.R.1
  • 74
    • 0348021772 scopus 로고
    • The functional integrity of spatial behavior
    • Freedman S.J. (Ed), Dorsey Press, Homewood, IL
    • Freedman S.J., and Rekosh J.H. The functional integrity of spatial behavior. In: Freedman S.J. (Ed). The Neuropsychology of Spatially-Oriented Behavior (1968), Dorsey Press, Homewood, IL 153-162
    • (1968) The Neuropsychology of Spatially-Oriented Behavior , pp. 153-162
    • Freedman, S.J.1    Rekosh, J.H.2
  • 75
    • 0003225664 scopus 로고
    • Some aspects of sensory-motor control and adaptation in man
    • Walk R.D., and Pick H.L. (Eds), Plenum, New York
    • Lackner J.R. Some aspects of sensory-motor control and adaptation in man. In: Walk R.D., and Pick H.L. (Eds). Intersensory Perception and Sensory Integration (1981), Plenum, New York 143-173
    • (1981) Intersensory Perception and Sensory Integration , pp. 143-173
    • Lackner, J.R.1
  • 77
    • 0003799851 scopus 로고    scopus 로고
    • Model-based sensor fusion for aviation
    • Pavel M., and Sharma R.K. Model-based sensor fusion for aviation. Proceedings of SPIE 3088 (1997) 169-176
    • (1997) Proceedings of SPIE , vol.3088 , pp. 169-176
    • Pavel, M.1    Sharma, R.K.2
  • 81
    • 0032075546 scopus 로고    scopus 로고
    • Predicting hyperarticulate speech during human-computer error resolution
    • Oviatt S.L., MacEachern M., and Levow G. Predicting hyperarticulate speech during human-computer error resolution. Speech Communication 24 (1998) 87-110
    • (1998) Speech Communication , vol.24 , pp. 87-110
    • Oviatt, S.L.1    MacEachern, M.2    Levow, G.3
  • 88
    • 0002792478 scopus 로고
    • Yeni-Komshian G., Kavanaugh J., and Ferguson C. (Eds), Academic Press, New York
    • In: Yeni-Komshian G., Kavanaugh J., and Ferguson C. (Eds). Child Phonology, Vol. 1: Production (1980), Academic Press, New York
    • (1980) Child Phonology, Vol. 1: Production
  • 90
    • 0029288202 scopus 로고
    • Speech recognition in noisy environments
    • Gong Y. Speech recognition in noisy environments. Speech Communication 16 (1995) 261-291
    • (1995) Speech Communication , vol.16 , pp. 261-291
    • Gong, Y.1
  • 91
    • 0026882842 scopus 로고
    • Experiments with a non-linear spectral subtractor (NSS), Hidden Markov Models and the projection for robust speech recognition in cars
    • Lockwood P., and Boudy J. Experiments with a non-linear spectral subtractor (NSS), Hidden Markov Models and the projection for robust speech recognition in cars. Speech Communication 11 (1992) 2-3
    • (1992) Speech Communication , vol.11 , pp. 2-3
    • Lockwood, P.1    Boudy, J.2
  • 92
    • 0026882842 scopus 로고
    • Experiments with a non-linear spectral subtractor (NSS), Hidden Markov Models and the projection for robust speech recognition in cars
    • Lockwood P., and Boudy J. Experiments with a non-linear spectral subtractor (NSS), Hidden Markov Models and the projection for robust speech recognition in cars. Speech Communication 11 (1992) 215-228
    • (1992) Speech Communication , vol.11 , pp. 215-228
    • Lockwood, P.1    Boudy, J.2
  • 93
    • 0027465491 scopus 로고
    • The Lombard reflex and its role on human listeners and automatic speech recognizers
    • Junqua J.C. The Lombard reflex and its role on human listeners and automatic speech recognizers. Journal of the Acoustical Society of America 93 1 (1993) 510-524
    • (1993) Journal of the Acoustical Society of America , vol.93 , Issue.1 , pp. 510-524
    • Junqua, J.C.1
  • 95
    • 0039915438 scopus 로고
    • Effect of level of distracting noise upon speaking rate, duration and intensity
    • Hanley T.D., and Steer M.D. Effect of level of distracting noise upon speaking rate, duration and intensity. Journal of Speech and Hearing Disorders 14 (1949) 363-368
    • (1949) Journal of Speech and Hearing Disorders , vol.14 , pp. 363-368
    • Hanley, T.D.1    Steer, M.D.2
  • 98
    • 33646663508 scopus 로고
    • A signal detection problem and a possible solution in Japanese quail
    • Potash L.M. A signal detection problem and a possible solution in Japanese quail. Animal Behavior 20 (1972) 192-195
    • (1972) Animal Behavior , vol.20 , pp. 192-195
    • Potash, L.M.1
  • 100
    • 0009778876 scopus 로고
    • Auditory feedback in the regulation of vocal intensity of preschool children
    • Siegel G.M., Pick H.L., Olsen M.G., and Sawin L. Auditory feedback in the regulation of vocal intensity of preschool children. Developmental Psychology 12 (1976) 255-261
    • (1976) Developmental Psychology , vol.12 , pp. 255-261
    • Siegel, G.M.1    Pick, H.L.2    Olsen, M.G.3    Sawin, L.4
  • 102
    • 0005454347 scopus 로고    scopus 로고
    • Perception of conflicting audio-visual speech: An examination across Spanish and German
    • Speechreading by Humans and Machines: Models, Systems and Applications. Stork D.G., and Hennecke M.E. (Eds), Springer-Verlag, Berlin
    • Fuster-Duran A. Perception of conflicting audio-visual speech: An examination across Spanish and German. In: Stork D.G., and Hennecke M.E. (Eds). Speechreading by Humans and Machines: Models, Systems and Applications. NATO ASI Series, Series F: Computer and Systems Sciences 150 (1996), Springer-Verlag, Berlin 135-143
    • (1996) NATO ASI Series, Series F: Computer and Systems Sciences , vol.150 , pp. 135-143
    • Fuster-Duran, A.1
  • 103
    • 10644227100 scopus 로고    scopus 로고
    • Bimodal speech perception: A progress report
    • Speechreading by Humans and Machines: Models, Systems and Applications. Stork D.G., and Hennecke M.E. (Eds), Springer-Verlag, Berlin
    • Massaro D.W. Bimodal speech perception: A progress report. In: Stork D.G., and Hennecke M.E. (Eds). Speechreading by Humans and Machines: Models, Systems and Applications. NATO ASI Series, Series F: Computer and Systems Sciences 150 (1996), Springer-Verlag, Berlin 79-101
    • (1996) NATO ASI Series, Series F: Computer and Systems Sciences , vol.150 , pp. 79-101
    • Massaro, D.W.1
  • 104
    • 0000417467 scopus 로고    scopus 로고
    • Visionary speech: Looking ahead to practical speechreading systems
    • Speechreading by Humans and Machines: Models, Systems and Applications. Stork D.G., and Hennecke M.E. (Eds), Springer-Verlag, Berlin
    • Hennecke M.E., Stork D.G., and Prasad K.V. Visionary speech: Looking ahead to practical speechreading systems. In: Stork D.G., and Hennecke M.E. (Eds). Speechreading by Humans and Machines: Models, Systems and Applications. NATO ASI Series, Series F: Computer and Systems Sciences 150 (1996), Springer-Verlag, Berlin 331-349
    • (1996) NATO ASI Series, Series F: Computer and Systems Sciences , vol.150 , pp. 331-349
    • Hennecke, M.E.1    Stork, D.G.2    Prasad, K.V.3
  • 107
    • 85009088524 scopus 로고    scopus 로고
    • Multimodal signal processing in naturalistic noisy environments
    • Yuan B., Huang T., and Tang X. (Eds), Chinese Friendship Publishers, Beijing
    • Oviatt S.L. Multimodal signal processing in naturalistic noisy environments. In: Yuan B., Huang T., and Tang X. (Eds). Proceedings of the International Conference on Spoken Language Processing (ICSLP'2000) Vol. 2 (2000), Chinese Friendship Publishers, Beijing 696-699
    • (2000) Proceedings of the International Conference on Spoken Language Processing (ICSLP'2000) , vol.2 , pp. 696-699
    • Oviatt, S.L.1
  • 108
    • 0002028032 scopus 로고
    • Some preliminaries to a comprehensive account of audio-visual speech perception
    • Dodd B., and Campbell R. (Eds), Lawrence Erlbaum, London
    • Summerfield Q. Some preliminaries to a comprehensive account of audio-visual speech perception. In: Dodd B., and Campbell R. (Eds). Hearing by Eye: The Psychology of Lip-reading (1987), Lawrence Erlbaum, London 3-51
    • (1987) Hearing by Eye: The Psychology of Lip-reading , pp. 3-51
    • Summerfield, Q.1
  • 109
    • 4244043499 scopus 로고
    • An improved automatic lipreading system to enhance speech recognition
    • AT&T Bell Labs
    • Petajan E.D. An improved automatic lipreading system to enhance speech recognition. Tech. Rep. 11251-871012-111TM (1987), AT&T Bell Labs
    • (1987) Tech. Rep. 11251-871012-111TM
    • Petajan, E.D.1
  • 110
    • 0032179207 scopus 로고    scopus 로고
    • Modeling the interaction of phonemic intelligibility and lexical structure in audiovisual word recognition
    • Iverson P., Bernstein L., and Auer E. Modeling the interaction of phonemic intelligibility and lexical structure in audiovisual word recognition. Speech Communication 26 (1998) 1-2
    • (1998) Speech Communication , vol.26 , pp. 1-2
    • Iverson, P.1    Bernstein, L.2    Auer, E.3
  • 111
    • 0032179207 scopus 로고    scopus 로고
    • Modeling the interaction of phonemic intelligibility and lexical structure in audiovisual word recognition
    • Iverson P., Bernstein L., and Auer E. Modeling the interaction of phonemic intelligibility and lexical structure in audiovisual word recognition. Speech Communication 26 (1998) 45-63
    • (1998) Speech Communication , vol.26 , pp. 45-63
    • Iverson, P.1    Bernstein, L.2    Auer, E.3
  • 112
    • 0028710004 scopus 로고
    • Toward interface design for human language technology: Modality and structure as determinants of linguistic complexity
    • Oviatt S.L., Cohen P.R., and Wang M.Q. Toward interface design for human language technology: Modality and structure as determinants of linguistic complexity. Speech Communication 15 (1994) 3-4
    • (1994) Speech Communication , vol.15 , pp. 3-4
    • Oviatt, S.L.1    Cohen, P.R.2    Wang, M.Q.3
  • 113
    • 0028710004 scopus 로고
    • Toward interface design for human language technology: Modality and structure as determinants of linguistic complexity
    • Oviatt S.L., Cohen P.R., and Wang M.Q. Toward interface design for human language technology: Modality and structure as determinants of linguistic complexity. Speech Communication 15 (1994) 283-300
    • (1994) Speech Communication , vol.15 , pp. 283-300
    • Oviatt, S.L.1    Cohen, P.R.2    Wang, M.Q.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.