-
1
-
-
0031187171
-
Speech recognition by machines and humans
-
July
-
R. Lippman, "Speech recognition by machines and humans," Speech Communication, vol. 22(1), pp. 1-15, July 1997.
-
(1997)
Speech Communication
, vol.22
, Issue.1
, pp. 1-15
-
-
Lippman, R.1
-
2
-
-
0003544881
-
-
D. G. Stork and M. E. Hennecke, editors, Springer-Verlag New York Inc.
-
D. G. Stork and M. E. Hennecke, editors, Speechreading by Man and Machine, Springer-Verlag New York Inc., 1996.
-
(1996)
Speechreading by Man and Machine
-
-
-
3
-
-
0004052871
-
Audio-visual speech recognition
-
Johns Hopkins University, Baltimore
-
C. Neti et al., "Audio-visual speech recognition," Tech. Rep., Johns Hopkins University, Baltimore, 2000.
-
(2000)
Tech. Rep.
-
-
Neti, C.1
-
4
-
-
4544290191
-
Recent advances in the automatic recognition of audio-visual speech
-
Sep.
-
G. Potamianos, C. Neti, G. Gravier, A. Garg, and A. W. Senior, "Recent advances in the automatic recognition of audio-visual speech," Proc. of the IEEE, vol. 91, no. 9, Sep. 2003.
-
(2003)
Proc. of the IEEE
, vol.91
, Issue.9
-
-
Potamianos, G.1
Neti, C.2
Gravier, G.3
Garg, A.4
Senior, A.W.5
-
5
-
-
0034270644
-
Audio-visual speech modeling for continuous speech recognition
-
S. Dupont and J. Luettin, "Audio-visual speech modeling for continuous speech recognition," IEEE Transactions on Multimedia, vol. 2(3), pp. 141-151, 2000.
-
(2000)
IEEE Transactions on Multimedia
, vol.2
, Issue.3
, pp. 141-151
-
-
Dupont, S.1
Luettin, J.2
-
6
-
-
0036874915
-
Audio-visual speech recognition using MPEG-4 compliant visual features
-
P. S. Aleksic, J. J. Williams, Z. Wu, and A. K. Katsaggelos, "Audio-visual speech recognition using MPEG-4 compliant visual features," EURASIP Journal on Applied Signal Processing, pp. 1213-1227, 2002.
-
(2002)
EURASIP Journal on Applied Signal Processing
, pp. 1213-1227
-
-
Aleksic, P.S.1
Williams, J.J.2
Wu, Z.3
Katsaggelos, A.K.4
-
7
-
-
0036447870
-
Audio-visual continuous speech recognition using MPEG-4 compliant visual feature
-
Rochester, NY, Sep.
-
P. S. Aleksic, J. J. Williams, Z. Wu, and A. K. Katsaggelos, "Audio-visual continuous speech recognition using MPEG-4 compliant visual feature," Proc. of the Int. Conf. on Image Processing (ICIP), pp. 960-963, Rochester, NY, Sep. 2002.
-
(2002)
Proc. of the Int. Conf. on Image Processing (ICIP)
, pp. 960-963
-
-
Aleksic, P.S.1
Williams, J.J.2
Wu, Z.3
Katsaggelos, A.K.4
-
8
-
-
84908265391
-
A Comparison of model and transform-based visual features for audio-visual LVCSR
-
Tokyo
-
I. Matthews, G. Potamianos, C. Neti and J. Luettin, "A Comparison of model and transform-based visual features for audio-visual LVCSR," Proc. of the Int. Conf. on Multimedia and Expo (ICME), Tokyo, 2001.
-
(2001)
Proc. of the Int. Conf. on Multimedia and Expo (ICME)
-
-
Matthews, I.1
Potamianos, G.2
Neti, C.3
Luettin, J.4
-
9
-
-
0032314380
-
An image transform approach for HMM based automatic lipreading
-
G. Potamianos, H.P. Graf, and E. Cosatto, "An image transform approach for HMM based automatic lipreading," Proc. of the Int. Conf. on Image Proc., vol. III, pp. 173-177, 1998.
-
(1998)
Proc. of the Int. Conf. on Image Proc.
, vol.3
, pp. 173-177
-
-
Potamianos, G.1
Graf, H.P.2
Cosatto, E.3
-
10
-
-
4544329810
-
Comparison of low- and high-level visual features for audio-visual continuous automatic speech recognition
-
Montreal, Canada, May
-
P. S. Aleksic and A. K. Katsaggelos, "Comparison of Low- and High-level Visual Features for Audio-Visual Continuous Automatic Speech Recognition," IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, vol. 5, pp. 917-920, Montreal, Canada, May 2004.
-
(2004)
IEEE Int. Conf. on Acoustics, Speech, and Signal Processing
, vol.5
, pp. 917-920
-
-
Aleksic, P.S.1
Katsaggelos, A.K.2
-
11
-
-
33749235580
-
-
Text for ISO/IEC FDIS 14496-2 Visual, ISO/IEC JTC1/SC29/WG11 N2502, Nov. 1998
-
Text for ISO/IEC FDIS 14496-2 Visual, ISO/IEC JTC1/SC29/WG11 N2502, Nov. 1998.
-
-
-
-
14
-
-
4544321778
-
Inner lip feature extraction for MPEG-4 facial animation
-
Montreal, Canada, May
-
Z. Wu, P. S. Aleksic, and A. K. Katsaggelos, "Inner Lip Feature Extraction for MPEG-4 Facial Animation," Proc. of IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, vol. 3, pp. 633-636, Montreal, Canada, May 2004.
-
(2004)
Proc. of IEEE Int. Conf. on Acoustics, Speech, and Signal Processing
, vol.3
, pp. 633-636
-
-
Wu, Z.1
Aleksic, P.S.2
Katsaggelos, A.K.3
-
15
-
-
0024610919
-
A tutorial on hidden Markov models and selected applications in speech recognition
-
February
-
L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. of the IEEE, vol. 77(2), pp. 257-286, February 1989.
-
(1989)
Proc. of the IEEE
, vol.77
, Issue.2
, pp. 257-286
-
-
Rabiner, L.R.1
-
16
-
-
0003822743
-
-
Entropic Ltd., Cambridge
-
S. Young, D. Kershaw, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, "The HTK Book," Entropic Ltd., Cambridge, 2002.
-
(2002)
The HTK Book
-
-
Young, S.1
Kershaw, D.2
Odell, J.3
Ollason, D.4
Valtchev, V.5
Woodland, P.6
-
17
-
-
85009135251
-
AVICAR: An audiovisual speech corpus in a car environment
-
B. Lee, M. Hasegawa-Johnson, C. Goudeseune, S. Kamdar, S. Borys, M. Liu, and T. Huang, "AVICAR: An Audiovisual Speech Corpus in a Car Environment," ICSLP 2004.
-
(2004)
ICSLP
-
-
Lee, B.1
Hasegawa-Johnson, M.2
Goudeseune, C.3
Kamdar, S.4
Borys, S.5
Liu, M.6
Huang, T.7