메뉴 건너뛰기




Volumn 44, Issue 1-4 SPEC. ISS., 2004, Pages 31-41

A phonetically neutral model of the low-level audio-visual interaction

Author keywords

Audio visual processing; Modeling; Multimodal interaction; Subband decomposition; Temporal envelope processing

Indexed keywords

AUDIO-VISUAL PROCESSING; MULTIMODAL INTERACTION; SPECTRALLY REDUCED SPEECH (SRS); SUBBAND DECOMPOSITION; TEMPORAL ENVELOPE PROCESSING;

EID: 10344267573     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2004.10.003     Document Type: Article
Times cited : (24)

References (19)
  • 1
    • 85093706714 scopus 로고    scopus 로고
    • Is primitive AV coherence an aid to segment the scene?
    • Terrigal
    • Barker, J.P., Berthommier, F., Schwartz, J.-L., 1998. Is primitive AV coherence an aid to segment the scene? In: Proc. AVSP'98, Terrigal, pp.103-108.
    • (1998) Proc. AVSP'98 , pp. 103-108
    • Barker, J.P.1    Berthommier, F.2    Schwartz, J.-L.3
  • 2
    • 0012725678 scopus 로고    scopus 로고
    • Estimation of speech acoustics from visual speech features: A comparison of linear and non-linear models
    • Santa Cruz
    • Barker, J.P., Berthommier, F., 1999. Estimation of speech acoustics from visual speech features: a comparison of linear and non-linear models. In: Proc. AVSP'99, Santa Cruz, pp. 112-117.
    • (1999) Proc. AVSP'99 , pp. 112-117
    • Barker, J.P.1    Berthommier, F.2
  • 3
    • 10444238251 scopus 로고    scopus 로고
    • Audiovisual speech binding: Convergence or association
    • Calvert, G. et al. (Eds.), MIT Press, Cambridge
    • Bernstein, L.E., Auer, E.T., Moore, J.K., 2004. Audiovisual speech binding: convergence or association. In: Calvert, G. et al. (Eds.), Handbook of Multisensory Processes. MIT Press, Cambridge.
    • (2004) Handbook of Multisensory Processes
    • Bernstein, L.E.1    Auer, E.T.2    Moore, J.K.3
  • 4
    • 84904537057 scopus 로고    scopus 로고
    • Audio-visual recognition of spectrally reduced speech
    • Aalborg
    • Berthommier, F., 2001. Audio-visual recognition of spectrally reduced speech. In: Proc. AVSP'01, Aalborg, pp. 183-188.
    • (2001) Proc. AVSP'01 , pp. 183-188
    • Berthommier, F.1
  • 5
    • 84873571173 scopus 로고    scopus 로고
    • Direct synthesis of video from speech sounds for new telecommunication applications
    • Grenoble
    • Berthommier, F., 2003a. Direct synthesis of video from speech sounds for new telecommunication applications, In: Proc. SOC'03, Grenoble.
    • (2003) Proc. SOC'03
    • Berthommier, F.1
  • 6
    • 84901700762 scopus 로고    scopus 로고
    • Audiovisual speech enhancement based on the association between speech envelope and video features
    • Geneva
    • Berthommier, F., 2003b. Audiovisual speech enhancement based on the association between speech envelope and video features. In: Proc. Eurospeech'03, Geneva.
    • (2003) Proc. Eurospeech'03
    • Berthommier, F.1
  • 8
    • 0015325150 scopus 로고
    • Speech-envelope cues as an acoustical aid to lip-reading for profoundly deaf children
    • Erber, N.P., 1972. Speech-envelope cues as an acoustical aid to lip-reading for profoundly deaf children. JASA 51, 1224-1227.
    • (1972) JASA , vol.51 , pp. 1224-1227
    • Erber, N.P.1
  • 9
    • 0034974093 scopus 로고    scopus 로고
    • Audio-visual enhancement of speech in noise
    • Girin, L., Schwartz, J.L., Feng, G., 2001. Audio-visual enhancement of speech in noise. JASA 109 (6), 3007-3020.
    • (2001) JASA , vol.109 , Issue.6 , pp. 3007-3020
    • Girin, L.1    Schwartz, J.L.2    Feng, G.3
  • 10
    • 0036295990 scopus 로고    scopus 로고
    • Noisy audio feature enhancement using audio-visual speech data
    • Orlando
    • Goecke, R., Potamianos, G., Neti, C., 2002. Noisy audio feature enhancement using audio-visual speech data. In: Proc. ICASSP'02, Orlando.
    • (2002) Proc. ICASSP'02
    • Goecke, R.1    Potamianos, G.2    Neti, C.3
  • 11
    • 0033822769 scopus 로고    scopus 로고
    • The use of visible speech cues for improving auditory detection of spoken sentences
    • Grant, K.W., Seitz, P.-F., 2000. The use of visible speech cues for improving auditory detection of spoken sentences. JASA 108, 1197-1208.
    • (2000) JASA , vol.108 , pp. 1197-1208
    • Grant, K.W.1    Seitz, P.-F.2
  • 12
    • 85009284526 scopus 로고    scopus 로고
    • DCT-Based video features for audio-visual speech recognition
    • Denver
    • Heckmann, M., Kroschel, K., Savariaux, C., Berthommier, F., 2002. DCT-Based video features for audio-visual speech recognition. In: Proc. ICSLP'02, Denver, pp. 1925-1928.
    • (2002) Proc. ICSLP'02 , pp. 1925-1928
    • Heckmann, M.1    Kroschel, K.2    Savariaux, C.3    Berthommier, F.4
  • 13
    • 0036874551 scopus 로고    scopus 로고
    • On the relationship between face movements, tongue movements, and speech acoustics
    • Jiang, J., Alwan, A., Keating, P.A., Auer, E.T., Bernstein, L.E., 2002. On the relationship between face movements, tongue movements, and speech acoustics. EURASIP JASP 11, 1174-1188.
    • (2002) EURASIP JASP , vol.11 , pp. 1174-1188
    • Jiang, J.1    Alwan, A.2    Keating, P.A.3    Auer, E.T.4    Bernstein, L.E.5
  • 14
    • 85071899496 scopus 로고    scopus 로고
    • Visible speech cues and auditory detection of spoken sentences: An effect of degree of correlation between acoustic and visual properties
    • Aalborg
    • Kim, J., Davis, C., 2001. Visible speech cues and auditory detection of spoken sentences: an effect of degree of correlation between acoustic and visual properties. In: Proc. AVSP'01, Aalborg, pp. 127-131.
    • (2001) Proc. AVSP'01 , pp. 127-131
    • Kim, J.1    Davis, C.2
  • 16
    • 85009257811 scopus 로고    scopus 로고
    • Audio-visual scene analysis: Evidence for a "very-early" integration process in audio-visual speech perception
    • Denver
    • Schwartz, J.-L., Berthommier, F., Savariaux, C., 2002. Audio-visual scene analysis: evidence for a "very-early" integration process in audio-visual speech perception. In: Proc. ICSLP'02, Denver, pp. 1937-1940.
    • (2002) Proc. ICSLP'02 , pp. 1937-1940
    • Schwartz, J.-L.1    Berthommier, F.2    Savariaux, C.3
  • 18
    • 0002028032 scopus 로고
    • Some preliminaries to a comprehensive account of audio-visual speech perception
    • Dodd, B., Campbell, R. (Eds.), Lawrence Erlbaum, Hillsdale, NJ
    • Summerfield, Q., 1987. Some preliminaries to a comprehensive account of audio-visual speech perception. In: Dodd, B., Campbell, R. (Eds.), Hearing by Eye: The Psychology of Lip-Reading. Lawrence Erlbaum, Hillsdale, NJ.
    • (1987) Hearing by Eye: The Psychology of Lip-reading
    • Summerfield, Q.1
  • 19
    • 0032178592 scopus 로고    scopus 로고
    • Quantitative association of vocal tract and facial behavior
    • Yehia, H., Rubin, P., Vatikiotis-Bateson, E., 1998. Quantitative association of vocal tract and facial behavior. Speech Commun. 26 (1), 23-43.
    • (1998) Speech Commun. , vol.26 , Issue.1 , pp. 23-43
    • Yehia, H.1    Rubin, P.2    Vatikiotis-Bateson, E.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.