Volume 37, Issue 4, 2011, Pages 1193-1209

Psychophysics of the McGurk and Other Audiovisual Speech Integration Effects

Author keywords

Audiovisual speech perception; Congruent and incongruent; Factor analysis; Quantitative stimulus measures

Indexed keywords

ARTICLE; AUDITORY STIMULATION; HUMAN; LANGUAGE; PERCEPTIVE DISCRIMINATION; PHOTOSTIMULATION; PSYCHOPHYSICS; REFERENCE VALUE; SPEECH PERCEPTION; VISION

EID: 79961150200     PISSN: 00961523     EISSN: None     Source Type: Journal    
DOI: 10.1037/a0023100     Document Type: Article
Times cited: 39

References (66)
  • 1
    • 79961158748
    • Multisensory information integration for communication and speech
    • Bernstein, L. E. (in press-a). Multisensory information integration for communication and speech. In B. E. Stein (Ed.), The new handbook of multisensory integration. Cambridge, MA: MIT Press.
    • Bernstein, L.E.
  • 2
    • 79961153019
    • Visual speech perception
    • Bernstein, L. E. (in press-b). Visual speech perception. In G. Bailly, P. Perrier, & E. Vatikiotis-Bateson (Eds.), Audiovisual speech processing. Cambridge, England: Cambridge University Press.
    • Bernstein, L.E.
  • 3
    • 0000970784
    • Development of a facility for simultaneous recordings of acoustic, optical (3-D motion and video), and physiological speech data
    • Bernstein L.E., Auer E.T., Chaney B., Alwan A., Keating P.A. Development of a facility for simultaneous recordings of acoustic, optical (3-D motion and video), and physiological speech data. Journal of the Acoustical Society of America 2000, 107:2887.
    • (2000) Journal of the Acoustical Society of America , vol.107 , pp. 2887
    • Bernstein, L.E.; Auer, E.T.; Chaney, B.; Alwan, A.; Keating, P.A.
  • 4
    • 10444238251
    • Audiovisual speech binding: Convergence or association?
    • MIT Press, Cambridge, MA, G.A. Calvert, C. Spence, B.E. Stein (Eds.)
    • Bernstein L.E., Auer E.T., Moore J.K. Audiovisual speech binding: Convergence or association? Handbook of multisensory processes 2004, 203-223. MIT Press, Cambridge, MA. G.A. Calvert, C. Spence, B.E. Stein (Eds.).
    • (2004) Handbook of multisensory processes , pp. 203-223
    • Bernstein, L.E.; Auer, E.T.; Moore, J.K.
  • 5
    • 55649097537
    • Quantified acoustic-optical speech signal incongruity identifies cortical sites of audiovisual speech processing
    • Bernstein L.E., Lu Z.-L., Jiang J. Quantified acoustic-optical speech signal incongruity identifies cortical sites of audiovisual speech processing. Brain Research 2008, 1242:172-184.
    • (2008) Brain Research , vol.1242 , pp. 172-184
    • Bernstein, L.E.; Lu, Z.-L.; Jiang, J.
  • 7
    • 24744466072
    • The perception of voice onset time: An fMRI investigation of phonetic category structure
    • Blumstein S.E., Myers E.B., Rissman J. The perception of voice onset time: An fMRI investigation of phonetic category structure. Journal of Cognitive Neuroscience 2005, 17:1353-1366.
    • (2005) Journal of Cognitive Neuroscience , vol.17 , pp. 1353-1366
    • Blumstein, S.E.; Myers, E.B.; Rissman, J.
  • 9
    • 0021892250
    • Speechreading supplemented with formant-frequency information from voiced speech
    • Breeuwer M., Plomp R. Speechreading supplemented with formant-frequency information from voiced speech. Journal of the Acoustical Society of America 1985, 77:314-317.
    • (1985) Journal of the Acoustical Society of America , vol.77 , pp. 314-317
    • Breeuwer, M.; Plomp, R.
  • 10
    • 0033832324
    • Single-sweep EEG analysis of neural processes underlying perception and production of vowels
    • Callan D.E., Callan A.M., Honda K., Masaki S. Single-sweep EEG analysis of neural processes underlying perception and production of vowels. Cognitive Brain Research 2000, 10:173-176.
    • (2000) Cognitive Brain Research , vol.10 , pp. 173-176
    • Callan, D.E.; Callan, A.M.; Honda, K.; Masaki, S.
  • 11
    • 0031296869
    • Perception of visible speech: Influence of spatial quantization
    • Campbell C.S., Massaro D.W. Perception of visible speech: Influence of spatial quantization. Perception 1997, 26:627-644.
    • (1997) Perception , vol.26 , pp. 627-644
    • Campbell, C.S.; Massaro, D.W.
  • 14
    • 56749174163
    • A linear model of acoustic-to-facial mapping: Model parameters, data set size, and generalization across speakers
    • Craig M.S., van Lieshout P., Wong W. A linear model of acoustic-to-facial mapping: Model parameters, data set size, and generalization across speakers. Journal of the Acoustical Society of America 2008, 124:3183-3190.
    • (2008) Journal of the Acoustical Society of America , vol.124 , pp. 3183-3190
    • Craig, M.S.; van Lieshout, P.; Wong, W.
  • 15
    • 33846876359
    • Speech as a supramodal or amodal phenomenon
    • MIT Press, Cambridge, MA, G. Calvert, C. Spence, B.E. Stein (Eds.)
    • Fowler C. Speech as a supramodal or amodal phenomenon. Handbook of multisensory processes 2004, 189-202. MIT Press, Cambridge, MA. G. Calvert, C. Spence, B.E. Stein (Eds.).
    • (2004) Handbook of multisensory processes , pp. 189-202
    • Fowler, C.
  • 17
    • 4444300502
    • Effects of spectro-temporal asynchrony in auditory and auditory-visual speech processing
    • Grant K.W., Greenberg S., Poeppel D., van Wassenhove V. Effects of spectro-temporal asynchrony in auditory and auditory-visual speech processing. Seminars in Hearing 2004, 25:241-255.
    • (2004) Seminars in Hearing , vol.25 , pp. 241-255
    • Grant, K.W.; Greenberg, S.; Poeppel, D.; van Wassenhove, V.
  • 18
    • 0024587814
    • The role of visual information in the processing of place and manner features in speech perception
    • Green K.P., Kuhl P.K. The role of visual information in the processing of place and manner features in speech perception. Perception & Psychophysics 1989, 45:34-42.
    • (1989) Perception & Psychophysics , vol.45 , pp. 34-42
    • Green, K.P.; Kuhl, P.K.
  • 19
    • 0026307114
    • Integrating speech information across talkers, gender, and sensory modality: Female faces and male voices in the McGurk effect
    • Green K.P., Kuhl P.K., Meltzoff A.N., Stevens E.B. Integrating speech information across talkers, gender, and sensory modality: Female faces and male voices in the McGurk effect. Perception & Psychophysics 1991, 50:524-536.
    • (1991) Perception & Psychophysics , vol.50 , pp. 524-536
    • Green, K.P.; Kuhl, P.K.; Meltzoff, A.N.; Stevens, E.B.
  • 20
    • 0030835529
    • Acoustic cues to place of articulation and the McGurk effect: The role of release bursts, aspiration, and formant transitions
    • Green K.P., Norrix L.W. Acoustic cues to place of articulation and the McGurk effect: The role of release bursts, aspiration, and formant transitions. Journal of Speech, Language, & Hearing Research 1997, 40:646-665.
    • (1997) Journal of Speech, Language, & Hearing Research , vol.40 , pp. 646-665
    • Green, K.P.; Norrix, L.W.
  • 21
    • 37049000152
    • Abstract coding of audiovisual speech: Beyond sensory representation
    • Hasson U., Skipper J.I., Nusbaum H.C., Small S.L. Abstract coding of audiovisual speech: Beyond sensory representation. Neuron 2007, 56:1116-1126.
    • (2007) Neuron , vol.56 , pp. 1116-1126
    • Hasson, U.; Skipper, J.I.; Nusbaum, H.C.; Small, S.L.
  • 22
    • 79961158307
    • Topographic change of ERP due to discrimination of CV syllables with various vowel durations
    • Hosokawa M., Okazaki S., Kawakubo Y., Maekawa H., Ozaki H. Topographic change of ERP due to discrimination of CV syllables with various vowel durations. International Congress Series 2002, 1232:53-57.
    • (2002) International Congress Series , vol.1232 , pp. 53-57
    • Hosokawa, M.; Okazaki, S.; Kawakubo, Y.; Maekawa, H.; Ozaki, H.
  • 23
    • 39149092050
    • An event-related fMRI investigation of voice-onset time discrimination
    • Hutchison E.R., Blumstein S.E., Myers E.B. An event-related fMRI investigation of voice-onset time discrimination. Neuroimage 2008, 40:342-352.
    • (2008) Neuroimage , vol.40 , pp. 342-352
    • Hutchison, E.R.; Blumstein, S.E.; Myers, E.B.
  • 29
    • 0018388657
    • Visual influences on speech perception processes
    • MacDonald J., McGurk H. Visual influences on speech perception processes. Perception & Psychophysics 1978, 24:253-257.
    • (1978) Perception & Psychophysics , vol.24 , pp. 253-257
    • MacDonald, J.; McGurk, H.
  • 33
    • 0032789664
    • Speechreading: Illusion or window into pattern recognition
    • Massaro D.W. Speechreading: Illusion or window into pattern recognition. Trends in Cognitive Sciences 1999, 3:310-317.
    • (1999) Trends in Cognitive Sciences , vol.3 , pp. 310-317
    • Massaro, D.W.
  • 35
    • 0022019614
    • Intermodal timing relations and audio-visual speech recognition by normal-hearing adults
    • McGrath M., Summerfield Q. Intermodal timing relations and audio-visual speech recognition by normal-hearing adults. Journal of the Acoustical Society of America 1985, 77:678-685.
    • (1985) Journal of the Acoustical Society of America , vol.77 , pp. 678-685
    • McGrath, M.; Summerfield, Q.
  • 36
    • 0017199877
    • Hearing lips and seeing voices
    • McGurk H., MacDonald J. Hearing lips and seeing voices. Nature 1976, 264(5588):746-748.
    • (1976) Nature , vol.264 , Issue.5588 , pp. 746-748
    • McGurk, H.; MacDonald, J.
  • 37
    • 21344450921
    • Perceptual fusion and stimulus coincidence in the cross-modal integration of speech
    • Miller L.M., d'Esposito M. Perceptual fusion and stimulus coincidence in the cross-modal integration of speech. Journal of Neuroscience 2005, 25:5884-5893.
    • (2005) Journal of Neuroscience , vol.25 , pp. 5884-5893
    • Miller, L.M.; d'Esposito, M.
  • 42
    • 0037700834
    • Assessing face and speech consistency for monologue detection in video
    • Nock, H. J., Iyengar, G., & Neti, C. (2002, December). Assessing face and speech consistency for monologue detection in video. Paper presented at the ACM International Conference on Multimedia, Juan-les-Pins, France.
    • (2002)
    • Nock, H.J.; Iyengar, G.; Neti, C.
  • 44
    • 0037325848
    • Differential brain activation patterns during perception of voice and tone onset time series: A MEG study
    • Papanicolaou A.C., Castillo E., Breier J.I., Davis R.N., Simos P.G., Diehl R.L. Differential brain activation patterns during perception of voice and tone onset time series: A MEG study. Neuroimage 2003, 18:448-459.
    • (2003) Neuroimage , vol.18 , pp. 448-459
    • Papanicolaou, A.C.; Castillo, E.; Breier, J.I.; Davis, R.N.; Simos, P.G.; Diehl, R.L.
  • 46
    • 70349568713
    • Mismatch negativity with visual-only and audiovisual speech
    • Ponton C.W., Bernstein L.E., Auer E.T. Mismatch negativity with visual-only and audiovisual speech. Brain Topography 2009, 21:207-215.
    • (2009) Brain Topography , vol.21 , pp. 207-215
    • Ponton, C.W.; Bernstein, L.E.; Auer, E.T.
  • 47
    • 0024610919
    • A tutorial on Hidden Markov Models and selected applications in speech recognition
    • Rabiner L.R. A tutorial on Hidden Markov Models and selected applications in speech recognition. Proceedings of The IEEE 1989, 77:257-286.
    • (1989) Proceedings of The IEEE , vol.77 , pp. 257-286
    • Rabiner, L.R.
  • 49
    • 34250357232
    • Lip-read me now, hear me better later: Cross-modal transfer of talker-familiarity effects
    • Rosenblum L.D., Miller R.M., Sanchez K. Lip-read me now, hear me better later: Cross-modal transfer of talker-familiarity effects. Psychological Science 2007, 18:392-396.
    • (2007) Psychological Science , vol.18 , pp. 392-396
    • Rosenblum, L.D.; Miller, R.M.; Sanchez, K.
  • 50
    • 37349057822
    • McGurk effects in cochlear-implanted deaf subjects
    • Rouger J., Fraysse B., Deguine O., Barone P. McGurk effects in cochlear-implanted deaf subjects. Brain Research 2008, 1188:87-99.
    • (2008) Brain Research , vol.1188 , pp. 87-99
    • Rouger, J.; Fraysse, B.; Deguine, O.; Barone, P.
  • 52
    • 0030636050
    • Cultural and linguistic factors in audiovisual speech processing: The McGurk effect in Chinese subjects
    • Sekiyama K. Cultural and linguistic factors in audiovisual speech processing: The McGurk effect in Chinese subjects. Perception & Psychophysics 1997, 59:73-80.
    • (1997) Perception & Psychophysics , vol.59 , pp. 73-80
    • Sekiyama, K.
  • 53
    • 0025935481
    • McGurk effect in non-English listeners: Few visual effects for Japanese subjects hearing Japanese syllables of high auditory intelligibility
    • Sekiyama K., Tohkura Y. McGurk effect in non-English listeners: Few visual effects for Japanese subjects hearing Japanese syllables of high auditory intelligibility. Journal of the Acoustical Society of America 1991, 90:1797-1805.
    • (1991) Journal of the Acoustical Society of America , vol.90 , pp. 1797-1805
    • Sekiyama, K.; Tohkura, Y.
  • 55
    • 34249039173
    • Hearing lips and seeing voices: How cortical areas supporting speech production mediate audiovisual speech perception
    • Skipper J.I., van Wassenhove V., Nusbaum H.C., Small S.L. Hearing lips and seeing voices: How cortical areas supporting speech production mediate audiovisual speech perception. Cerebral Cortex 2007, 17:2387-2399.
    • (2007) Cerebral Cortex , vol.17 , pp. 2387-2399
    • Skipper, J.I.; van Wassenhove, V.; Nusbaum, H.C.; Small, S.L.
  • 57
    • 0001390793
    • Speech analysis and synthesis methods developed at ECL in NTT: From LPC to LSP
    • Sugamura N., Itakura F. Speech analysis and synthesis methods developed at ECL in NTT: From LPC to LSP. Speech Communication 1986, 5:199-215.
    • (1986) Speech Communication , vol.5 , pp. 199-215
    • Sugamura, N.; Itakura, F.
  • 59
    • 0002028032
    • Some preliminaries to a comprehensive account of audio-visual speech perception
    • Erlbaum, London, England, B. Dodd, R. Campbell (Eds.)
    • Summerfield Q. Some preliminaries to a comprehensive account of audio-visual speech perception. Hearing by eye: The psychology of lip-reading 1987, 3-52. Erlbaum, London, England. B. Dodd, R. Campbell (Eds.).
    • (1987) Hearing by eye: The psychology of lip-reading , pp. 3-52
    • Summerfield, Q.
  • 62
    • 33845468167
    • Temporal window of integration in auditory-visual speech perception
    • van Wassenhove V., Grant K.W., Poeppel D. Temporal window of integration in auditory-visual speech perception. Neuropsychologia 2007, 45:598-607.
    • (2007) Neuropsychologia , vol.45 , pp. 598-607
    • van Wassenhove, V.; Grant, K.W.; Poeppel, D.
  • 63
    • 0029400335
    • Facial identity and facial speech processing: Familiar faces and voices in the McGurk effect
    • Walker S., Bruce V., O'Malley C. Facial identity and facial speech processing: Familiar faces and voices in the McGurk effect. Perception & Psychophysics 1995, 57:1124-1133.
    • (1995) Perception & Psychophysics , vol.57 , pp. 1124-1133
    • Walker, S.; Bruce, V.; O'Malley, C.
  • 65
    • 0032179320
    • Lip movement synthesis from speech based on Hidden Markov Models
    • Yamamoto E., Nakamura S., Shikano K. Lip movement synthesis from speech based on Hidden Markov Models. Speech Communication 1998, 26:105-115.
    • (1998) Speech Communication , vol.26 , pp. 105-115
    • Yamamoto, E.; Nakamura, S.; Shikano, K.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.