SCOPUS 정보 검색 플랫폼

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volumn 5641 LNAI, Issue , 2009, Pages 331-343

An investigation into audiovisual speech correlation in reverberant noisy environments

(5) Cifani, Simone b Abel, Andrew a Hussain, Amir a Squartini, Stefano b Piazza, Francesco b

a UNIVERSITY OF STIRLING (United Kingdom)

b UNIVERSITÀ POLITECNICA DELLE MARCHE (Italy)

Author keywords

[No Author keywords available]

Indexed keywords

AUDIO-VISUAL CORPORA; AUDIO-VISUAL CORRELATIONS; AUDIO-VISUAL SPEECH; BEAM FORMERS; COMPLEX RELATIONSHIPS; FEATURE EXTRACTION TECHNIQUES; HUMAN COMMUNICATIONS; MULTI-MODAL; MULTIMODAL DATA STREAMS; NOISY ENVIRONMENT; PROCESSING METHOD; SPEECH ENHANCEMENT SYSTEM; VISUAL FEATURE; VISUAL TECHNIQUES;

BEAMFORMING; CORRELATION METHODS; COST BENEFIT ANALYSIS; FEATURE EXTRACTION; MODAL ANALYSIS; PROCESSING; REVERBERATION; SPEECH ENHANCEMENT;

SPEECH COMMUNICATION;

EID: 70350378930 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-642-03320-9_31 Document Type: Conference Paper

Times cited : (7)

References (22)

1
- 0017199877
- Hearing lips and seeing voices
- McGurk, H., MacDonald, J.: Hearing lips and seeing voices. Nature 264, 746-748 (1976)
- (1976) Nature , vol.264 , pp. 746-748
- McGurk, H.¹ MacDonald, J.²

2
- 0032178592
- Quantitative association of vocal tract and facial behavior
- Yehia, H., Rubin, P., Vatikiotis Bateson, E.: Quantitative association of vocal tract and facial behavior. Speech Communication 26(1), 23-43 (1998)
- (1998) Speech Communication , vol.26 , Issue.1 , pp. 23-43
- Yehia, H.¹ Rubin, P.² Vatikiotis Bateson, E.³

3
- 70350388418
- Barker, J.P., Berthommier, F.: Evidence of correlation between acoustic and visual features of speech. In: ICPhS 1999, San Francisco (August 1999)
- Barker, J.P., Berthommier, F.: Evidence of correlation between acoustic and visual features of speech. In: ICPhS 1999, San Francisco (August 1999)

4
- 84901220357
- Maximising Audio-Visual Speech Correlation. Accepted for AVSP 2007
- paper P16
- Almajai, I., Milner, B.: Maximising Audio-Visual Speech Correlation. Accepted for AVSP 2007, paper P16 (2007)
- (2007)
- Almajai, I.¹ Milner, B.²

5
- 4544290191
- Recent Advances in the Automatic Recognition of Audiovisual Speech
- Potamianos, G., Neti, C., Gravier, G., Garg, A., Senior, A.W.: Recent Advances in the Automatic Recognition of Audiovisual Speech. Proceedings - IEEE 91, part 9, 1306-1326 (2003)
- (2003) Proceedings - IEEE , vol.91 , Issue.PART 9 , pp. 1306-1326
- Potamianos, G.¹ Neti, C.² Gravier, G.³ Garg, A.⁴ Senior, A.W.⁵

6
- 0000344953
- Girin, L., Feng, G., Schwartz, J.L.: Fusion of auditory and visual information for noisy speech enhancement: a preliminary study of vowel transition. In: ICASSP 1998, Seattle, WA, USA (1998)
- Girin, L., Feng, G., Schwartz, J.L.: Fusion of auditory and visual information for noisy speech enhancement: a preliminary study of vowel transition. In: ICASSP 1998, Seattle, WA, USA (1998)

7
- 34547539737
- Almajai, I., Milner, B., Darch, J., Vaseghi, S.: Visually-Derived Wiener Filters for Speech Enhancement. In: ICASSP 2007, 4, pp. IV-585-IV-588 (2007)
- Almajai, I., Milner, B., Darch, J., Vaseghi, S.: Visually-Derived Wiener Filters for Speech Enhancement. In: ICASSP 2007, vol. 4, pp. IV-585-IV-588 (2007)

8
- 70350414562
- Young, S.J, Odell, J, Ollason, D, Valtchev, V, Woodland, P, The HTK Book. Version 2.1 Department of Engineering. Cambridge University, UK 1995
- Young, S.J., Odell, J., Ollason, D., Valtchev, V., Woodland, P.: The HTK Book. Version 2.1 Department of Engineering. Cambridge University, UK (1995)

9
- 84902748454
- On image processing and a discrete cosine transform
- Ahmed, N., Natarajan, T., Rao, K.R.: On image processing and a discrete cosine transform. IEEE Transactions on Computers C-23(1), 90-93 (1974)
- (1974) IEEE Transactions on Computers , vol.C-23 , Issue.1 , pp. 90-93
- Ahmed, N.¹ Natarajan, T.² Rao, K.R.³

10
- 0021392032
- Scene adaptive coder
- Chen, W.H., Pratt, W.K.: Scene adaptive coder. IEEE Transactions on Communications 32(3), 225-232 (1984)
- (1984) IEEE Transactions on Communications , vol.32 , Issue.3 , pp. 225-232
- Chen, W.H.¹ Pratt, W.K.²

11
- 0009590598
- Springer, New York
- Brandstein, M.S., Ward, D.B.: Microphone Arrays. Springer, New York (2001)
- (2001) Microphone Arrays
- Brandstein, M.S.¹ Ward, D.B.²

12
- 0019928857
- An alternative approach to linearly constrained adaptive beamforming
- Griffiths, L.J., Jim, C.W.: An alternative approach to linearly constrained adaptive beamforming. IEEE Trans. Antennas Propagat. AP-30, 27-34 (1982)
- (1982) IEEE Trans. Antennas Propagat , vol.AP-30 , pp. 27-34
- Griffiths, L.J.¹ Jim, C.W.²

13
- 0035424281
- Signal enhancement using beamforming and nonstationarity with applications to speech
- Gannot, S., Burshtein, D., Weinstein, E.: Signal enhancement using beamforming and nonstationarity with applications to speech. IEEE Trans. Signal Processing 49, 1614-1626 (2001)
- (2001) IEEE Trans. Signal Processing , vol.49 , pp. 1614-1626
- Gannot, S.¹ Burshtein, D.² Weinstein, E.³

14
- 0003729218
- John Wiley and Sons, Canada
- Chatterjee, S., Hadi, A.S., Price, B.: Regression analysis by example. John Wiley and Sons, Canada (2000)
- (2000) Regression analysis by example
- Chatterjee, S.¹ Hadi, A.S.² Price, B.³

15
- 33750368310
- An audio-visual corpus for speech perception and automatic speech recognition
- Cooke, M., Barker, J., Cunningham, S., Shao, X.: An audio-visual corpus for speech perception and automatic speech recognition. J. Acoust. Soc. Amer. 120(5), 2421-2424 (2006)
- (2006) J. Acoust. Soc. Amer , vol.120 , Issue.5 , pp. 2421-2424
- Cooke, M.¹ Barker, J.² Cunningham, S.³ Shao, X.⁴

16
- 0035363218
- Active Appearance Models
- Cootes, T.F., Edwards, G.J., Taylor, C.J.: Active Appearance Models. IEEE Trans. on Pattern Analysis and Machine Intelligence 23(6), 681-685 (2001)
- (2001) IEEE Trans. on Pattern Analysis and Machine Intelligence , vol.23 , Issue.6 , pp. 681-685
- Cootes, T.F.¹ Edwards, G.J.² Taylor, C.J.³

17
- 84867204408
- A Vowel Based Approach for Acted Emotion Recognition
- Ringeval, F., Chetouani, M.: A Vowel Based Approach for Acted Emotion Recognition. In: Interspeech 2008 (2008)
- (2008) Interspeech
- Ringeval, F.¹ Chetouani, M.²

18
- 0141814632
- Speech Segmentation without Speech Recognition
- Wang, D., Lu, L., Zhang, H.J.: Speech Segmentation without Speech Recognition. In: ICASSP 2003, vol. 1, pp. 468-471 (2003)
- (2003) ICASSP 2003 , vol.1 , pp. 468-471
- Wang, D.¹ Lu, L.² Zhang, H.J.³

19
- 70350414561
- Audio-Visual Speech Fragment Decoding. Accepted for AVSP 2007
- paper L5-2
- Barker, J., Shao, X.: Audio-Visual Speech Fragment Decoding. Accepted for AVSP 2007, paper L5-2 (2007)
- (2007)
- Barker, J.¹ Shao, X.²

20
- 34447100075
- Mixing Audiovisual Speech Processing and Blind Source Separation for the Extraction of Speech Signals From Convolutive Mixtures
- Rivet, B., Girin, L., Jutten, C.: Mixing Audiovisual Speech Processing and Blind Source Separation for the Extraction of Speech Signals From Convolutive Mixtures. IEEE Trans. on Audio, Speech, and Lang. Processing 15(1), 96-108 (2007)
- (2007) IEEE Trans. on Audio, Speech, and Lang. Processing , vol.15 , Issue.1 , pp. 96-108
- Rivet, B.¹ Girin, L.² Jutten, C.³

21
- 14944353581
- Hazen, J.T., Saenko, K., La, C.H., Glass, J.R.: A Segment Based Audio-Visual Speech Recognizer: Data Collection, Development, and Initial Experiments. In: ICMI 2004: Proceedings of the 6th international conference on Multimodal interfaces, pp. 235-242 (2004)
- Hazen, J.T., Saenko, K., La, C.H., Glass, J.R.: A Segment Based Audio-Visual Speech Recognizer: Data Collection, Development, and Initial Experiments. In: ICMI 2004: Proceedings of the 6th international conference on Multimodal interfaces, pp. 235-242 (2004)

22
- 38149111343
- A Novel Psychoa-coustically Motivated Multichannel Speech Enhancement System
- Esposito, A, Faundez-Zanuy, M, Keller, E, Marinaro, M, eds, COST Action 2102, Springer, Heidelberg
- Hussain, A., Cifani, S., Squartini, S., Piazza, F., Durrani, T.: A Novel Psychoa-coustically Motivated Multichannel Speech Enhancement System. In: Esposito, A., Faundez-Zanuy, M., Keller, E., Marinaro, M. (eds.) COST Action 2102. LNCS, vol. 4775, pp. 190-199. Springer, Heidelberg (2007)
- (2007) LNCS , vol.4775 , pp. 190-199
- Hussain, A.¹ Cifani, S.² Squartini, S.³ Piazza, F.⁴ Durrani, T.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.