-
1
-
-
0032134085
-
Eye movement of perceivers during audio-visual speech intelligibility in noise
-
E. Vatikiotis-Bateson, I.M. Eigsti, S. Yano, and K. Munhall, "Eye movement of perceivers during audio-visual speech intelligibility in noise," Perception and Psychophysics, vol.60, no.6, pp.926-940, 1998.
-
(1998)
Perception and Psychophysics
, vol.60
, Issue.6
, pp. 926-940
-
-
Vatikiotis-Bateson, E.1
Eigsti, I.M.2
Yano, S.3
Munhall, K.4
-
2
-
-
0001048664
-
Visual contribution to speech inteligibility in noise
-
March
-
W.H. Sumby and I. Pollack, "Visual contribution to speech inteligibility in noise," J. Acoust. Soc. Am., vol.26, pp.212-215, March 1954.
-
(1954)
J. Acoust. Soc. Am.
, vol.26
, pp. 212-215
-
-
Sumby, W.H.1
Pollack, I.2
-
3
-
-
0001055701
-
Which components of the face do humans and machines best speechread?
-
Speechreading by Humans and Machines: Models, Systems and Applications, Springer-Verlag
-
C. Benoit, T. Guiard Marigny, B. LeGoffand, and A. Adjoudani, "Which components of the face do humans and machines best speechread?," in Speechreading by Humans and Machines: Models, Systems and Applications, NATO ASI Series, pp.315-328, Springer-Verlag, 1996.
-
(1996)
NATO ASI Series
, pp. 315-328
-
-
Benoit, C.1
Guiard Marigny, T.2
Legoffand, B.3
Adjoudani, A.4
-
4
-
-
0027228958
-
Improving connected letter recognition by lipreading
-
April
-
C. Bregler, H. Hild, S. Manke, and A. Waibel, "Improving connected letter recognition by lipreading," Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'93), vol.1, pp.557-560, April 1993.
-
(1993)
Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'93)
, vol.1
, pp. 557-560
-
-
Bregler, C.1
Hild, H.2
Manke, S.3
Waibel, A.4
-
5
-
-
0002517880
-
Audio-visual large vocabulary continuous speech recognition in the broadcast domain
-
Dec.
-
S. Basu, C. Neti, N. Rajput, A. Senior, L. Subramaniam, and A. Verma, "Audio-visual large vocabulary continuous speech recognition in the broadcast domain," Workshop on Multimedia Signal Processing, pp.475-481, Dec. 1998.
-
(1998)
Workshop on Multimedia Signal Processing
, pp. 475-481
-
-
Basu, S.1
Neti, C.2
Rajput, N.3
Senior, A.4
Subramaniam, L.5
Verma, A.6
-
6
-
-
33646906672
-
Improved bimodal speech recognition using tied-mixture HMMs and 5000 word audio-visual Synchronous database
-
S. Nakamura, R. Nagai, and K. Shikano, "Improved bimodal speech recognition using tied-mixture HMMs and 5000 word Audio-Visual Synchronous database," Proc. EUROSPEECH'97, pp.1623-1626, 1997.
-
(1997)
Proc. EUROSPEECH'97
, pp. 1623-1626
-
-
Nakamura, S.1
Nagai, R.2
Shikano, K.3
-
7
-
-
0029747053
-
Integrating audio and visual information to provide highly robust speech recognition
-
May
-
M.J. Tomlinson, M.J. Russell, and N.M. Brooke, "Integrating audio and visual information to provide highly robust speech recognition," Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'96), vol.2, pp.821-824, May 1996.
-
(1996)
Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'96)
, vol.2
, pp. 821-824
-
-
Tomlinson, M.J.1
Russell, M.J.2
Brooke, N.M.3
-
8
-
-
84949458153
-
Using the multi-stream approach for continuous audio-visual speech recognition: Experiments on the M2VTS DATABASE
-
Oct.
-
S. Dupont and J. Luettin, "Using the multi-stream approach for continuous audio-visual speech recognition: experiments on the M2VTS DATABASE," Proc. International Conference on Spoken Language Processing (ICSLP'98), vol.4, pp.1283-1286, Oct. 1998.
-
(1998)
Proc. International Conference on Spoken Language Processing (ICSLP'98)
, vol.4
, pp. 1283-1286
-
-
Dupont, S.1
Luettin, J.2
-
9
-
-
0034270644
-
Audio-visual speech modelling for continuous speech recognition
-
Sept.
-
S. Dupont and J. Luettin, "Audio-visual speech modelling for continuous speech recognition," IEEE Trans. Multimed., vol.2, no.3, pp.141-151, Sept. 2000.
-
(2000)
IEEE Trans. Multimed.
, vol.2
, Issue.3
, pp. 141-151
-
-
Dupont, S.1
Luettin, J.2
-
10
-
-
0002100804
-
Adaptive determination of audio and visual weights for automatic speech recognition
-
Sept.
-
A. Rogozan, P. Deleglise, and M. Alissali, "Adaptive determination of audio and visual weights for Automatic speech recognition," Proc. Europ. Tut. Work. Audio-Visual Speech Process (AVSP), pp.61-64, Sept. 1997.
-
(1997)
Proc. Europ. Tut. Work. Audio-visual Speech Process (AVSP)
, pp. 61-64
-
-
Rogozan, A.1
Deleglise, P.2
Alissali, M.3
-
12
-
-
85009154155
-
Stream weight optimization of speech and lip image sequence for audio-visual speech recognition
-
Oct.
-
S. Nakamura, H. Ito and K. Shikano, "Stream weight optimization of speech and lip image sequence for Audio-Visual speech recognition," Proc. International Conference on Spoken Language Processing (ICSLP'00), vol.3, pp.20-23, Oct. 2000.
-
(2000)
Proc. International Conference on Spoken Language Processing (ICSLP'00)
, vol.3
, pp. 20-23
-
-
Nakamura, S.1
Ito, H.2
Shikano, K.3
-
13
-
-
0031624666
-
Discriminative training of HMM stream exponents for audio-visual speech recognition
-
May
-
G. Potamianos and H.P. Graf, "Discriminative training of HMM stream exponents for Audio-Visual speech recognition," Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'98), vol.6, pp.3733-3736, May 1998.
-
(1998)
Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'98)
, vol.6
, pp. 3733-3736
-
-
Potamianos, G.1
Graf, H.P.2
-
14
-
-
85009091822
-
Audio-visual speech recognition using MCE-based HMMs and model-dependent stream weights
-
Oct.
-
C. Miyajima, K. Tokuda, and Tadashi Kitamura, "Audio-Visual speech recognition using MCE-based HMMs and model-dependent stream weights," Proc. International Conference on Spoken Language Processing (ICSLP'00), vol.2, pp.1023-1026, Oct. 2000.
-
(2000)
Proc. International Conference on Spoken Language Processing (ICSLP'00)
, vol.2
, pp. 1023-1026
-
-
Miyajima, C.1
Tokuda, K.2
Kitamura, T.3
-
15
-
-
84885664026
-
An adaptive integration based on product HMM for audio-visual speech recognition
-
Aug.
-
K. Kumatani, S. Nakamura, and K. Shikano, "An adaptive integration based on product HMM for audio-visual speech recognition," Proc. IEEE International Conference Multimedia and Expo (ICME'01), vol.1, Aug. 2001.
-
(2001)
Proc. IEEE International Conference Multimedia and Expo (ICME'01)
, vol.1
-
-
Kumatani, K.1
Nakamura, S.2
Shikano, K.3
-
17
-
-
0034825241
-
Multistream adaptive evidence combination for noise robust ASR
-
A. Morris, A. Hagen, H. Glotin, and H. Bourlard, "Multistream adaptive evidence combination for noise robust ASR," Speech Commun, vol.34, pp.25-40, 2001.
-
(2001)
Speech Commun
, vol.34
, pp. 25-40
-
-
Morris, A.1
Hagen, A.2
Glotin, H.3
Bourlard, H.4
-
18
-
-
0030676381
-
Maximum likelihood weighting of dynamic speech features for CDHMM speech recognition
-
April
-
J. Hernando, "Maximum likelihood weighting of dynamic speech features for CDHMM speech recognition," Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'97), vol.2, pp.1267-1270, April 1997.
-
(1997)
Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'97)
, vol.2
, pp. 1267-1270
-
-
Hernando, J.1
-
19
-
-
0029765665
-
Visual speech recognition using active shape models and hidden Markov models
-
May
-
J. Luettin, N.A. Thacker, and S.W. Beet, "Visual speech recognition using active shape models and hidden Markov models," Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP-96), vol.2, pp.817-820, May 1996.
-
(1996)
Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP-96)
, vol.2
, pp. 817-820
-
-
Luettin, J.1
Thacker, N.A.2
Beet, S.W.3
-
20
-
-
0037662332
-
Overview on recent activities in multi-modal corpora
-
Oct.
-
S. Nakamura, "Overview on recent activities in multi-modal corpora," COCOSDA Workshop, Oct. 2000.
-
(2000)
COCOSDA Workshop
-
-
Nakamura, S.1
-
21
-
-
0033708747
-
Asynchronous-transition HMM
-
May
-
S. Matsuda, M. Nakai, H. Shimodair, and S. Sagayama, "Asynchronous-transition HMM," Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'00), pp.1005-1008, May 2000.
-
(2000)
Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'00)
, pp. 1005-1008
-
-
Matsuda, S.1
Nakai, M.2
Shimodair, H.3
Sagayama, S.4
-
22
-
-
0040413488
-
Quantitative association of orofacial and vocal-tract shapes
-
Sept.
-
H. Yehia, P. Rubin, and E. Vatikiotis-Bateson, "Quantitative association of orofacial and vocal-tract shapes," Proc. Europ. Tut. Work. Audio-Visual Speech Process (AVSP), pp.41-44, Sept. 1997.
-
(1997)
Proc. Europ. Tut. Work. Audio-visual Speech Process (AVSP)
, pp. 41-44
-
-
Yehia, H.1
Rubin, P.2
Vatikiotis-Bateson, E.3
-
23
-
-
0035251712
-
Speech-to-lip movement synthesis by maximizing audio-visual joint probability based on the EM algorithm
-
S. Nakamura and E. Yamamoto, "Speech-to-lip movement synthesis by maximizing audio-visual joint probability based on the EM algorithm," J. VLSI Signal Processing, vol.27, no.1/2, pp.119-126, 2001.
-
(2001)
J. VLSI Signal Processing
, vol.27
, Issue.1-2
, pp. 119-126
-
-
Nakamura, S.1
Yamamoto, E.2
-
24
-
-
0034501586
-
Speech-to-face movement synthesis based on HMMs
-
K. Kakihara, S. Nakamura, and K. Shikano, "Speech-to-face movement synthesis based on HMMs," Proc. IEEE International Conference Multimedia and Expo (ICME'00), no.MP7.07, 2000.
-
(2000)
Proc. IEEE International Conference Multimedia and Expo (ICME'00)
, Issue.MP7.07
-
-
Kakihara, K.1
Nakamura, S.2
Shikano, K.3
-
25
-
-
0038676522
-
Model based lip synchronization with an automatic translation system
-
Aug.
-
S. Ogata, K. Murai, S. Nakamura, and S. Morisima, "Model based lip synchronization with an automatic translation system," Proc. IEEE International Conference Multimedia and Expo (ICME'01), Aug. 2001.
-
(2001)
Proc. IEEE International Conference Multimedia and Expo (ICME'01)
-
-
Ogata, S.1
Murai, K.2
Nakamura, S.3
Morisima, S.4
-
26
-
-
0003462715
-
Hidden Markov models for speech recognition
-
Edinburgh University Press, Edinburgh
-
X.D. Huang, Y. Ariki, and N.A. Jack, Hidden Markov Models for Speech Recognition, Edinburgh Information Technology Series, Edinburgh University Press, Edinburgh, 1990.
-
(1990)
Edinburgh Information Technology Series
-
-
Huang, X.D.1
Ariki, Y.2
Jack, N.A.3
-
27
-
-
85009263395
-
Segmental GPD training of HMM based speech recognizer
-
May
-
W. Chou, B.-H. Juang, and C.-H. Lee, "Segmental GPD training of HMM based speech recognizer," Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'92), vol.1, pp.473-476, May 2000.
-
(2000)
Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'92)
, vol.1
, pp. 473-476
-
-
Chou, W.1
Juang, B.-H.2
Lee, C.-H.3
-
28
-
-
0006132736
-
A minimum error rate pattern recognition approach to speech recognition
-
Col. VIII
-
W. Chou, B.-H. Juang, C.-H. Lee, and F.K. Soong, "A minimum error rate pattern recognition approach to speech recognition," J. Pattern Recog. Art. Intell., Col. VIII, pp.5-31, 1994.
-
(1994)
J. Pattern Recog. Art. Intell.
, pp. 5-31
-
-
Chou, W.1
Juang, B.-H.2
Lee, C.-H.3
Soong, F.K.4
-
29
-
-
0003483593
-
-
Microsoft Corporation
-
S. Young, D. Kershaw, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, HTK-Hidden Markov Model Toolkit, Version 3.0, Microsoft Corporation, 2000.
-
(2000)
HTK-hidden Markov Model Toolkit, Version 3.0
-
-
Young, S.1
Kershaw, D.2
Odell, J.3
Ollason, D.4
Valtchev, V.5
Woodland, P.6
|