-
1
-
-
0017199877
-
Hearing lips and seeing voices
-
McGurk, H., MacDonald, J.: Hearing lips and seeing voices. Nature 264, 746-748 (1976)
-
(1976)
Nature
, vol.264
, pp. 746-748
-
-
McGurk, H.1
MacDonald, J.2
-
2
-
-
0032178592
-
Quantitative association of vocal tract and facial behavior
-
Yehia, H., Rubin, P., Vatikiotis Bateson, E.: Quantitative association of vocal tract and facial behavior. Speech Communication 26(1), 23-43 (1998)
-
(1998)
Speech Communication
, vol.26
, Issue.1
, pp. 23-43
-
-
Yehia, H.1
Rubin, P.2
Vatikiotis Bateson, E.3
-
3
-
-
70350388418
-
-
Barker, J.P., Berthommier, F.: Evidence of correlation between acoustic and visual features of speech. In: ICPhS 1999, San Francisco (August 1999)
-
Barker, J.P., Berthommier, F.: Evidence of correlation between acoustic and visual features of speech. In: ICPhS 1999, San Francisco (August 1999)
-
-
-
-
4
-
-
84901220357
-
Maximising Audio-Visual Speech Correlation. Accepted for AVSP 2007
-
paper P16
-
Almajai, I., Milner, B.: Maximising Audio-Visual Speech Correlation. Accepted for AVSP 2007, paper P16 (2007)
-
(2007)
-
-
Almajai, I.1
Milner, B.2
-
5
-
-
4544290191
-
Recent Advances in the Automatic Recognition of Audiovisual Speech
-
Potamianos, G., Neti, C., Gravier, G., Garg, A., Senior, A.W.: Recent Advances in the Automatic Recognition of Audiovisual Speech. Proceedings - IEEE 91, part 9, 1306-1326 (2003)
-
(2003)
Proceedings - IEEE
, vol.91
, Issue.PART 9
, pp. 1306-1326
-
-
Potamianos, G.1
Neti, C.2
Gravier, G.3
Garg, A.4
Senior, A.W.5
-
6
-
-
0000344953
-
-
Girin, L., Feng, G., Schwartz, J.L.: Fusion of auditory and visual information for noisy speech enhancement: a preliminary study of vowel transition. In: ICASSP 1998, Seattle, WA, USA (1998)
-
Girin, L., Feng, G., Schwartz, J.L.: Fusion of auditory and visual information for noisy speech enhancement: a preliminary study of vowel transition. In: ICASSP 1998, Seattle, WA, USA (1998)
-
-
-
-
7
-
-
34547539737
-
-
Almajai, I., Milner, B., Darch, J., Vaseghi, S.: Visually-Derived Wiener Filters for Speech Enhancement. In: ICASSP 2007, 4, pp. IV-585-IV-588 (2007)
-
Almajai, I., Milner, B., Darch, J., Vaseghi, S.: Visually-Derived Wiener Filters for Speech Enhancement. In: ICASSP 2007, vol. 4, pp. IV-585-IV-588 (2007)
-
-
-
-
8
-
-
70350414562
-
-
Young, S.J, Odell, J, Ollason, D, Valtchev, V, Woodland, P, The HTK Book. Version 2.1 Department of Engineering. Cambridge University, UK 1995
-
Young, S.J., Odell, J., Ollason, D., Valtchev, V., Woodland, P.: The HTK Book. Version 2.1 Department of Engineering. Cambridge University, UK (1995)
-
-
-
-
9
-
-
84902748454
-
On image processing and a discrete cosine transform
-
Ahmed, N., Natarajan, T., Rao, K.R.: On image processing and a discrete cosine transform. IEEE Transactions on Computers C-23(1), 90-93 (1974)
-
(1974)
IEEE Transactions on Computers
, vol.C-23
, Issue.1
, pp. 90-93
-
-
Ahmed, N.1
Natarajan, T.2
Rao, K.R.3
-
12
-
-
0019928857
-
An alternative approach to linearly constrained adaptive beamforming
-
Griffiths, L.J., Jim, C.W.: An alternative approach to linearly constrained adaptive beamforming. IEEE Trans. Antennas Propagat. AP-30, 27-34 (1982)
-
(1982)
IEEE Trans. Antennas Propagat
, vol.AP-30
, pp. 27-34
-
-
Griffiths, L.J.1
Jim, C.W.2
-
13
-
-
0035424281
-
Signal enhancement using beamforming and nonstationarity with applications to speech
-
Gannot, S., Burshtein, D., Weinstein, E.: Signal enhancement using beamforming and nonstationarity with applications to speech. IEEE Trans. Signal Processing 49, 1614-1626 (2001)
-
(2001)
IEEE Trans. Signal Processing
, vol.49
, pp. 1614-1626
-
-
Gannot, S.1
Burshtein, D.2
Weinstein, E.3
-
14
-
-
0003729218
-
-
John Wiley and Sons, Canada
-
Chatterjee, S., Hadi, A.S., Price, B.: Regression analysis by example. John Wiley and Sons, Canada (2000)
-
(2000)
Regression analysis by example
-
-
Chatterjee, S.1
Hadi, A.S.2
Price, B.3
-
15
-
-
33750368310
-
An audio-visual corpus for speech perception and automatic speech recognition
-
Cooke, M., Barker, J., Cunningham, S., Shao, X.: An audio-visual corpus for speech perception and automatic speech recognition. J. Acoust. Soc. Amer. 120(5), 2421-2424 (2006)
-
(2006)
J. Acoust. Soc. Amer
, vol.120
, Issue.5
, pp. 2421-2424
-
-
Cooke, M.1
Barker, J.2
Cunningham, S.3
Shao, X.4
-
16
-
-
0035363218
-
Active Appearance Models
-
Cootes, T.F., Edwards, G.J., Taylor, C.J.: Active Appearance Models. IEEE Trans. on Pattern Analysis and Machine Intelligence 23(6), 681-685 (2001)
-
(2001)
IEEE Trans. on Pattern Analysis and Machine Intelligence
, vol.23
, Issue.6
, pp. 681-685
-
-
Cootes, T.F.1
Edwards, G.J.2
Taylor, C.J.3
-
17
-
-
84867204408
-
A Vowel Based Approach for Acted Emotion Recognition
-
Ringeval, F., Chetouani, M.: A Vowel Based Approach for Acted Emotion Recognition. In: Interspeech 2008 (2008)
-
(2008)
Interspeech
-
-
Ringeval, F.1
Chetouani, M.2
-
18
-
-
0141814632
-
Speech Segmentation without Speech Recognition
-
Wang, D., Lu, L., Zhang, H.J.: Speech Segmentation without Speech Recognition. In: ICASSP 2003, vol. 1, pp. 468-471 (2003)
-
(2003)
ICASSP 2003
, vol.1
, pp. 468-471
-
-
Wang, D.1
Lu, L.2
Zhang, H.J.3
-
19
-
-
70350414561
-
Audio-Visual Speech Fragment Decoding. Accepted for AVSP 2007
-
paper L5-2
-
Barker, J., Shao, X.: Audio-Visual Speech Fragment Decoding. Accepted for AVSP 2007, paper L5-2 (2007)
-
(2007)
-
-
Barker, J.1
Shao, X.2
-
20
-
-
34447100075
-
Mixing Audiovisual Speech Processing and Blind Source Separation for the Extraction of Speech Signals From Convolutive Mixtures
-
Rivet, B., Girin, L., Jutten, C.: Mixing Audiovisual Speech Processing and Blind Source Separation for the Extraction of Speech Signals From Convolutive Mixtures. IEEE Trans. on Audio, Speech, and Lang. Processing 15(1), 96-108 (2007)
-
(2007)
IEEE Trans. on Audio, Speech, and Lang. Processing
, vol.15
, Issue.1
, pp. 96-108
-
-
Rivet, B.1
Girin, L.2
Jutten, C.3
-
21
-
-
14944353581
-
-
Hazen, J.T., Saenko, K., La, C.H., Glass, J.R.: A Segment Based Audio-Visual Speech Recognizer: Data Collection, Development, and Initial Experiments. In: ICMI 2004: Proceedings of the 6th international conference on Multimodal interfaces, pp. 235-242 (2004)
-
Hazen, J.T., Saenko, K., La, C.H., Glass, J.R.: A Segment Based Audio-Visual Speech Recognizer: Data Collection, Development, and Initial Experiments. In: ICMI 2004: Proceedings of the 6th international conference on Multimodal interfaces, pp. 235-242 (2004)
-
-
-
-
22
-
-
38149111343
-
A Novel Psychoa-coustically Motivated Multichannel Speech Enhancement System
-
Esposito, A, Faundez-Zanuy, M, Keller, E, Marinaro, M, eds, COST Action 2102, Springer, Heidelberg
-
Hussain, A., Cifani, S., Squartini, S., Piazza, F., Durrani, T.: A Novel Psychoa-coustically Motivated Multichannel Speech Enhancement System. In: Esposito, A., Faundez-Zanuy, M., Keller, E., Marinaro, M. (eds.) COST Action 2102. LNCS, vol. 4775, pp. 190-199. Springer, Heidelberg (2007)
-
(2007)
LNCS
, vol.4775
, pp. 190-199
-
-
Hussain, A.1
Cifani, S.2
Squartini, S.3
Piazza, F.4
Durrani, T.5
|