



Volume 5641 LNAI, 2009, Pages 331-343

An investigation into audiovisual speech correlation in reverberant noisy environments

Author keywords

[No Author keywords available]

Indexed keywords

AUDIO-VISUAL CORPORA; AUDIO-VISUAL CORRELATIONS; AUDIO-VISUAL SPEECH; BEAM FORMERS; COMPLEX RELATIONSHIPS; FEATURE EXTRACTION TECHNIQUES; HUMAN COMMUNICATIONS; MULTI-MODAL; MULTIMODAL DATA STREAMS; NOISY ENVIRONMENT; PROCESSING METHOD; SPEECH ENHANCEMENT SYSTEM; VISUAL FEATURE; VISUAL TECHNIQUES;

EID: 70350378930     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-642-03320-9_31     Document Type: Conference Paper
Times cited: 7

References (22)
  • 1
    • 0017199877
    • Hearing lips and seeing voices
    • McGurk, H., MacDonald, J.: Hearing lips and seeing voices. Nature 264, 746-748 (1976)
    • (1976) Nature , vol.264 , pp. 746-748
    • McGurk, H.1    MacDonald, J.2
  • 2
    • 0032178592
    • Quantitative association of vocal tract and facial behavior
    • Yehia, H., Rubin, P., Vatikiotis Bateson, E.: Quantitative association of vocal tract and facial behavior. Speech Communication 26(1), 23-43 (1998)
    • (1998) Speech Communication , vol.26 , Issue.1 , pp. 23-43
    • Yehia, H.1    Rubin, P.2    Vatikiotis Bateson, E.3
  • 3
    • 70350388418
    • Barker, J.P., Berthommier, F.: Evidence of correlation between acoustic and visual features of speech. In: ICPhS 1999, San Francisco (August 1999)
  • 4
    • 84901220357
    • Maximising Audio-Visual Speech Correlation
    • Almajai, I., Milner, B.: Maximising Audio-Visual Speech Correlation. Accepted for AVSP 2007, paper P16 (2007)
    • (2007)
    • Almajai, I.1    Milner, B.2
  • 5
    • 4544290191
    • Recent Advances in the Automatic Recognition of Audiovisual Speech
    • Potamianos, G., Neti, C., Gravier, G., Garg, A., Senior, A.W.: Recent Advances in the Automatic Recognition of Audiovisual Speech. Proceedings - IEEE 91, part 9, 1306-1326 (2003)
    • (2003) Proceedings - IEEE , vol.91 , Issue.PART 9 , pp. 1306-1326
    • Potamianos, G.1    Neti, C.2    Gravier, G.3    Garg, A.4    Senior, A.W.5
  • 6
    • 0000344953
    • Girin, L., Feng, G., Schwartz, J.L.: Fusion of auditory and visual information for noisy speech enhancement: a preliminary study of vowel transition. In: ICASSP 1998, Seattle, WA, USA (1998)
  • 7
    • 34547539737
    • Almajai, I., Milner, B., Darch, J., Vaseghi, S.: Visually-Derived Wiener Filters for Speech Enhancement. In: ICASSP 2007, vol. 4, pp. IV-585-IV-588 (2007)
  • 8
    • 70350414562
    • Young, S.J., Odell, J., Ollason, D., Valtchev, V., Woodland, P.: The HTK Book. Version 2.1 Department of Engineering. Cambridge University, UK (1995)
  • 9
    • 84902748454
    • On image processing and a discrete cosine transform
    • Ahmed, N., Natarajan, T., Rao, K.R.: On image processing and a discrete cosine transform. IEEE Transactions on Computers C-23(1), 90-93 (1974)
    • (1974) IEEE Transactions on Computers , vol.C-23 , Issue.1 , pp. 90-93
    • Ahmed, N.1    Natarajan, T.2    Rao, K.R.3
  • 12
    • 0019928857
    • An alternative approach to linearly constrained adaptive beamforming
    • Griffiths, L.J., Jim, C.W.: An alternative approach to linearly constrained adaptive beamforming. IEEE Trans. Antennas Propagat. AP-30, 27-34 (1982)
    • (1982) IEEE Trans. Antennas Propagat , vol.AP-30 , pp. 27-34
    • Griffiths, L.J.1    Jim, C.W.2
  • 13
    • 0035424281
    • Signal enhancement using beamforming and nonstationarity with applications to speech
    • Gannot, S., Burshtein, D., Weinstein, E.: Signal enhancement using beamforming and nonstationarity with applications to speech. IEEE Trans. Signal Processing 49, 1614-1626 (2001)
    • (2001) IEEE Trans. Signal Processing , vol.49 , pp. 1614-1626
    • Gannot, S.1    Burshtein, D.2    Weinstein, E.3
  • 15
    • 33750368310
    • An audio-visual corpus for speech perception and automatic speech recognition
    • Cooke, M., Barker, J., Cunningham, S., Shao, X.: An audio-visual corpus for speech perception and automatic speech recognition. J. Acoust. Soc. Amer. 120(5), 2421-2424 (2006)
    • (2006) J. Acoust. Soc. Amer , vol.120 , Issue.5 , pp. 2421-2424
    • Cooke, M.1    Barker, J.2    Cunningham, S.3    Shao, X.4
  • 17
    • 84867204408
    • A Vowel Based Approach for Acted Emotion Recognition
    • Ringeval, F., Chetouani, M.: A Vowel Based Approach for Acted Emotion Recognition. In: Interspeech 2008 (2008)
    • (2008) Interspeech
    • Ringeval, F.1    Chetouani, M.2
  • 18
    • 0141814632
    • Speech Segmentation without Speech Recognition
    • Wang, D., Lu, L., Zhang, H.J.: Speech Segmentation without Speech Recognition. In: ICASSP 2003, vol. 1, pp. 468-471 (2003)
    • (2003) ICASSP 2003 , vol.1 , pp. 468-471
    • Wang, D.1    Lu, L.2    Zhang, H.J.3
  • 19
    • 70350414561
    • Audio-Visual Speech Fragment Decoding
    • Barker, J., Shao, X.: Audio-Visual Speech Fragment Decoding. Accepted for AVSP 2007, paper L5-2 (2007)
    • (2007)
    • Barker, J.1    Shao, X.2
  • 20
    • 34447100075
    • Mixing Audiovisual Speech Processing and Blind Source Separation for the Extraction of Speech Signals From Convolutive Mixtures
    • Rivet, B., Girin, L., Jutten, C.: Mixing Audiovisual Speech Processing and Blind Source Separation for the Extraction of Speech Signals From Convolutive Mixtures. IEEE Trans. on Audio, Speech, and Lang. Processing 15(1), 96-108 (2007)
    • (2007) IEEE Trans. on Audio, Speech, and Lang. Processing , vol.15 , Issue.1 , pp. 96-108
    • Rivet, B.1    Girin, L.2    Jutten, C.3
  • 21
    • 14944353581
    • Hazen, J.T., Saenko, K., La, C.H., Glass, J.R.: A Segment Based Audio-Visual Speech Recognizer: Data Collection, Development, and Initial Experiments. In: ICMI 2004: Proceedings of the 6th international conference on Multimodal interfaces, pp. 235-242 (2004)
  • 22
    • 38149111343
    • A Novel Psychoacoustically Motivated Multichannel Speech Enhancement System
    • Hussain, A., Cifani, S., Squartini, S., Piazza, F., Durrani, T.: A Novel Psychoacoustically Motivated Multichannel Speech Enhancement System. In: Esposito, A., Faundez-Zanuy, M., Keller, E., Marinaro, M. (eds.) COST Action 2102. LNCS, vol. 4775, pp. 190-199. Springer, Heidelberg (2007)
    • (2007) LNCS , vol.4775 , pp. 190-199
    • Hussain, A.1    Cifani, S.2    Squartini, S.3    Piazza, F.4    Durrani, T.5


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.