-
1
-
-
0031187171
-
Speech recognition by machines and humans
-
July
-
R. Lippmann, "Speech recognition by machines and humans," Speech Commun., vol. 22, no. 1, pp. 1-15, July 1997.
-
(1997)
Speech Commun.
, vol.22
, Issue.1
, pp. 1-15
-
-
Lippmann, R.1
-
2
-
-
0029288202
-
Speech recognition in noisy environments: A survey
-
Y. Gong, "Speech recognition in noisy environments: A survey," Speech Commun., vol. 16, pp. 261-291, 1995.
-
(1995)
Speech Commun.
, vol.16
, pp. 261-291
-
-
Gong, Y.1
-
4
-
-
0024610919
-
A tutorial on hidden Markov models and selected applications in speech recognition
-
Feb.
-
L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, pp. 257-286, Feb. 1989.
-
(1989)
Proc. IEEE
, vol.77
, pp. 257-286
-
-
Rabiner, L.R.1
-
5
-
-
0029270677
-
Converting speech into lip movements: A multimedia telephone for hard of hearing people
-
Mar.
-
F. Lavagetto, "Converting speech into lip movements: A multimedia telephone for hard of hearing people," IEEE Trans. Rehab. Eng., vol. 3, pp. 1-14, Mar. 1995.
-
(1995)
IEEE Trans. Rehab. Eng.
, vol.3
, pp. 1-14
-
-
Lavagetto, F.1
-
6
-
-
0000051247
-
Generation of mouth shapes for a synthetic talking head
-
A. Simons and S. Cox, "Generation of mouth shapes for a synthetic talking head," Proc. Inst. Acoust., vol. 12, pp. 475-482, 1990.
-
(1990)
Proc. Inst. Acoust.
, vol.12
, pp. 475-482
-
-
Simons, A.1
Cox, S.2
-
7
-
-
0032074310
-
Audio-visual integration in multimedia communication
-
May
-
T. Chen and R. R. Rao, "Audio-visual integration in multimedia communication," Proc. IEEE, vol. 86, pp. 837-852, May 1998.
-
(1998)
Proc. IEEE
, vol.86
, pp. 837-852
-
-
Chen, T.1
Rao, R.R.2
-
8
-
-
0030677313
-
Video rewrite: Driving visual speech with audio
-
C. Bregler, M. Covell, and M. Slaney, "Video rewrite: Driving visual speech with audio," in Proc. ACM SIGGRAPH, 1997, pp. 353-360.
-
(1997)
Proc. ACM SIGGRAPH
, pp. 353-360
-
-
Bregler, C.1
Covell, M.2
Slaney, M.3
-
9
-
-
0000497160
-
Baum-Welch hidden Markov model inversion for reliable audio-to-video conversion
-
K. Choi and J.-N. Hwang, "Baum-Welch hidden Markov model inversion for reliable audio-to-video conversion," in Proc. IEEE 3rd Workshop Multimedia Signal Processing, 1999, pp. 175-180.
-
(1999)
Proc. IEEE 3rd Workshop Multimedia Signal Processing
, pp. 175-180
-
-
Choi, K.1
Hwang, J.-N.2
-
10
-
-
0031100269
-
Robust speech recognition based on joint model and feature space optimization of hidden Markov models
-
Mar.
-
S. Moon and J.-N. Hwang, "Robust speech recognition based on joint model and feature space optimization of hidden Markov models," IEEE Trans. Neural Networks, vol. 8, pp. 194-204, Mar. 1997.
-
(1997)
IEEE Trans. Neural Networks
, vol.8
, pp. 194-204
-
-
Moon, S.1
Hwang, J.-N.2
-
13
-
-
0034779303
-
Subjective analysis of an HMM-based visual speech synthesizer
-
San Jose, CA, Jan.
-
J. J. Williams, A. K. Katsaggelos, and D. C. Garstecki, "Subjective analysis of an HMM-based visual speech synthesizer," in Proc. SPIE Conf. Human Vision and Electronic Imaging, vol. 4299, San Jose, CA, Jan. 2001, pp. 544-555.
-
(2001)
Proc. SPIE Conf. Human Vision and Electronic Imaging
, vol.4299
, pp. 544-555
-
-
Williams, J.J.1
Katsaggelos, A.K.2
Garstecki, D.C.3
-
17
-
-
0035472468
-
An efficient use of MPEG-4 FAP interpolation for facial animation at 70 bits/frame
-
Oct.
-
F. Lavagetto and R. Pockaj, "An efficient use of MPEG-4 FAP interpolation for facial animation at 70 bits/frame," IEEE Trans. Circuits Syst. Video Technol., vol. 11, pp. 1085-1097, Oct. 2001.
-
(2001)
IEEE Trans. Circuits Syst. Video Technol.
, vol.11
, pp. 1085-1097
-
-
Lavagetto, F.1
Pockaj, R.2
-
18
-
-
0036874915
-
Audio-visual speech recognition using MPEG-4 compliant visual features
-
P. S. Aleksic, J. J. Williams, Z. Wu, and A. K. Katsaggelos, "Audio-visual speech recognition using MPEG-4 compliant visual features," EURASIP J. Appl. Signal Processing, pp. 1213-1227, 2002.
-
(2002)
EURASIP J. Appl. Signal Processing
, pp. 1213-1227
-
-
Aleksic, P.S.1
Williams, J.J.2
Wu, Z.3
Katsaggelos, A.K.4
-
19
-
-
0036447870
-
Audio-visual continuous speech recognition using MPEG-4 compliant visual features
-
Rochester, NY, Sept.
-
P. S. Aleksic, J. J. Williams, Z. Wu, and A. K. Katsaggelos, "Audio-visual continuous speech recognition using MPEG-4 compliant visual features," in Proc. Int. Conf. Image Processing, Rochester, NY, Sept. 2002, pp. 960-963.
-
(2002)
Proc. Int. Conf. Image Processing
, pp. 960-963
-
-
-
25
-
-
2542482407
-
-
Baltimore, MD, Oct.
-
C. Neti, G. Potamianos, J. Luettin, I. Matthews, H. Glotin, D. Vergyri, J. Sison, A. Mashari, and J. Zhou, Workshop Audio-Visual Speech Recognition, Final Report. Baltimore, MD, Oct. 2000.
-
(2000)
Workshop Audio-visual Speech Recognition, Final Report
-
-
Neti, C.1
Potamianos, G.2
Luettin, J.3
Matthews, I.4
Glotin, H.5
Vergyri, D.6
Sison, J.7
Mashari, A.8
Zhou, J.9
-
26
-
-
4544290191
-
Recent advances in the automatic recognition of audio-visual speech
-
Sept.
-
G. Potamianos, C. Neti, G. Gravier, A. Garg, and A. W. Senior, "Recent advances in the automatic recognition of audio-visual speech," Proc. IEEE, vol. 91, pp. 1306-1326, Sept. 2003.
-
(2003)
Proc. IEEE
, vol.91
, pp. 1306-1326
-
-
Potamianos, G.1
Neti, C.2
Gravier, G.3
Garg, A.4
Senior, A.W.5
-
27
-
-
0034853041
-
Hierarchical discriminant features for audio-visual LVCSR
-
G. Potamianos, J. Luettin, and C. Neti, "Hierarchical discriminant features for audio-visual LVCSR," in Proc. IEEE Int. Conf. Acoustics, Speech and Signal Processing, vol. 1, 2001, pp. 165-168.
-
(2001)
Proc. IEEE Int. Conf. Acoustics, Speech and Signal Processing
, vol.1
, pp. 165-168
-
-
Potamianos, G.1
Luettin, J.2
Neti, C.3
-
28
-
-
85013597845
-
'Eigenlips' for robust speech recognition
-
Adelaide, Australia
-
C. Bregler and Y. Konig, "'Eigenlips' for robust speech recognition," in Proc. Int. Conf. Acoustics, Speech and Signal Processing, Adelaide, Australia, 1994, pp. 669-672.
-
(1994)
Proc. Int. Conf. Acoustics, Speech and Signal Processing
, pp. 669-672
-
-
Bregler, C.1
Konig, Y.2
-
29
-
-
0034270644
-
Audio-visual speech modeling for continuous speech recognition
-
Mar.
-
S. Dupont and J. Luettin, "Audio-visual speech modeling for continuous speech recognition," IEEE Trans. Multimedia, vol. 2, pp. 141-151, Mar. 2000.
-
(2000)
IEEE Trans. Multimedia
, vol.2
, pp. 141-151
-
-
Dupont, S.1
Luettin, J.2
-
30
-
-
0034842451
-
Weighting schemes for audio-visual fusion in speech recognition
-
H. Glotin, D. Vergyri, C. Neti, G. Potamianos, and J. Luettin, "Weighting schemes for audio-visual fusion in speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech and Signal Processing, vol. 1, 2001, pp. 165-168.
-
(2001)
Proc. IEEE Int. Conf. Acoustics, Speech and Signal Processing
, vol.1
, pp. 165-168
-
-
Glotin, H.1
Vergyri, D.2
Neti, C.3
Potamianos, G.4
Luettin, J.5
-
31
-
-
0003822743
-
-
Cambridge, U.K.: Entropic Ltd.
-
S. Young, D. Kershaw, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, The HTK Book. Cambridge, U.K.: Entropic Ltd., 2002.
-
(2002)
The HTK Book
-
-
Young, S.1
Kershaw, D.2
Odell, J.3
Ollason, D.4
Valtchev, V.5
Woodland, P.6
-
32
-
-
0345134263
-
Speech-to-video synthesis using facial animation parameters
-
Barcelona, Spain, Sept.
-
P. S. Aleksic and A. K. Katsaggelos, "Speech-to-video synthesis using facial animation parameters," in Proc. Int. Conf. Image Processing, Barcelona, Spain, Sept. 2003, pp. 1-4.
-
(2003)
Proc. Int. Conf. Image Processing
, pp. 1-4
-
-
Aleksic, P.S.1
Katsaggelos, A.K.2