-
1
-
-
4544290191
-
Automatic recognition of audio-visual speech: Recent progress and challenges
-
G. Potamianos, C. Neti, G. Gravier, and A. Garg, "Automatic recognition of audio-visual speech: Recent progress and challenges," Proc. of the IEEE, vol. 91, no. 9, pp. 1306-1326, 2003.
-
(2003)
Proc. of the IEEE
, vol.91
, Issue.9
, pp. 1306-1326
-
-
Potamianos, G.1
Neti, C.2
Gravier, G.3
Garg, A.4
-
2
-
-
0034825241
-
Multi-stream adaptive evidence combination for noise robust ASR
-
A. Morris, A. Hagen, H. Glotin, and H. Bourlard, "Multi-stream adaptive evidence combination for noise robust ASR," Speech Communication, vol. 34, pp. 25-40, 2001.
-
(2001)
Speech Communication
, vol.34
, pp. 25-40
-
-
Morris, A.1
Hagen, A.2
Glotin, H.3
Bourlard, H.4
-
4
-
-
0034842451
-
Weighting schemes for audio-visual fusion in speech recognition
-
H. Glotin, D. Vergyri, C. Neti, G. Potamianos, and J. Luettin, "Weighting schemes for audio-visual fusion in speech recognition," in Proc. ICASSP, 2001.
-
(2001)
Proc. ICASSP
-
-
Glotin, H.1
Vergyri, D.2
Neti, C.3
Potamianos, G.4
Luettin, J.5
-
5
-
-
0027681974
-
ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition
-
V. Digalakis, J.R. Rohlicek, and M. Ostendorf, "ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition," IEEE TSAP, pp. 431-442, 1993.
-
(1993)
IEEE TSAP
, pp. 431-442
-
-
Digalakis, V.1
Rohlicek, J.R.2
Ostendorf, M.3
-
6
-
-
0028420014
-
Integrated models of signal and background with application to speaker identification in noise
-
R. C. Rose, E. M. Hofstetter, and D. A. Reynolds, "Integrated models of signal and background with application to speaker identification in noise," IEEE TSAP, vol. 2, no. 2, pp. 245-257, 1994.
-
(1994)
IEEE TSAP
, vol.2
, Issue.2
, pp. 245-257
-
-
Rose, R.C.1
Hofstetter, E.M.2
Reynolds, D.A.3
-
7
-
-
0036508276
-
Speaker verification in noise using a stochastic version of the weighted Viterbi algorithm
-
N.B. Yoma and M. Villar, "Speaker verification in noise using a stochastic version of the weighted Viterbi algorithm," IEEE TSAP, vol. 10, no. 3, pp. 158-166, 2002.
-
(2002)
IEEE TSAP
, vol.10
, Issue.3
, pp. 158-166
-
-
Yoma, N.B.1
Villar, M.2
-
8
-
-
18744401086
-
Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion
-
L. Deng, J. Dropo, and A. Acero, "Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion," IEEE TSAP, vol. 13, no. 3, pp. 412-421, 2005.
-
(2005)
IEEE TSAP
, vol.13
, Issue.3
, pp. 412-421
-
-
Deng, L.1
Dropo, J.2
Acero, A.3
-
9
-
-
0034270644
-
Audio-visual speech modeling for continuous speech recognition
-
S. Dupont and J. Luettin, "Audio-visual speech modeling for continuous speech recognition," IEEE Trans. on Multimedia, vol. 2, no. 3, pp. 141-151, 2000.
-
(2000)
IEEE Trans. on Multimedia
, vol.2
, Issue.3
, pp. 141-151
-
-
Dupont, S.1
Luettin, J.2
-
10
-
-
0034842342
-
Asynchronous stream modeling for large vocabulary audio-visual speech recognition
-
J. Luettin, G. Potamianos, and C. Neti, "Asynchronous stream modeling for large vocabulary audio-visual speech recognition," in Proc. ICASSP, 2001.
-
(2001)
Proc. ICASSP
-
-
Luettin, J.1
Potamianos, G.2
Neti, C.3
-
11
-
-
0033900150
-
A Bayesian predictive approach to robust speech recognition
-
Q. Huo and C. Lee, "A Bayesian predictive approach to robust speech recognition," IEEE TSAP, pp. 200-204, 2000.
-
(2000)
IEEE TSAP
, pp. 200-204
-
-
Huo, Q.1
Lee, C.2
-
12
-
-
0036874999
-
Dynamic Bayesian networks for audio-visual speech recognition
-
A.V. Nefian, L. Liang, X. Pi, X. Liu, and K. Murphy, "Dynamic Bayesian networks for audio-visual speech recognition," EURASIP Journal on Applied Signal Processing, vol. 11, pp. 1-15, 2002.
-
(2002)
EURASIP Journal on Applied Signal Processing
, vol.11
, pp. 1-15
-
-
Nefian, A.V.1
Liang, L.2
Pi, X.3
Liu, X.4
Murphy, K.5
-
13
-
-
0035363218
-
Active appearance models
-
T.F. Cootes, G.J. Edwards, and C.J. Taylor, "Active appearance models," IEEE PAMI, vol. 23, no. 6, pp. 681-685, 2001.
-
(2001)
IEEE PAMI
, vol.23
, Issue.6
, pp. 681-685
-
-
Cootes, T.F.1
Edwards, G.J.2
Taylor, C.J.3
-
14
-
-
0036472941
-
Extraction of visual features for lipreading
-
I. Matthews, T. F. Cootes, J. A. Bangham, S. Cox, and R. Harvey, "Extraction of visual features for lipreading," IEEE PAMI, vol. 24, no. 2, pp. 198-213, 2002.
-
(2002)
IEEE PAMI
, vol.24
, Issue.2
, pp. 198-213
-
-
Matthews, I.1
Cootes, T.F.2
Bangham, J.A.3
Cox, S.4
Harvey, R.5
-
15
-
-
0003474751
-
-
Cambridge Univ. Press
-
W. Press, S. Teukolsky, W. Vetterling, and B. Flannery, Numerical Recipes, Cambridge Univ. Press, 1992.
-
(1992)
Numerical Recipes
-
-
Press, W.1
Teukolsky, S.2
Vetterling, W.3
Flannery, B.4
-
17
-
-
0036299249
-
CUAVE: A new audio-visual database for multimodal human-computer interface research
-
E. K. Patterson, S. Gurbuz, Z. Tufekci, and J. N. Gowdy, "CUAVE: A new audio-visual database for multimodal human-computer interface research," in Proc. ICASSP, 2002.
-
(2002)
Proc. ICASSP
-
-
Patterson, E.K.1
Gurbuz, S.2
Tufekci, Z.3
Gowdy, J.N.4
-
18
-
-
0027623210
-
Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems
-
A. Varga and H.J.M. Steeneken, "Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems," Speech Communication, vol. 12, no. 3, pp. 247-252, 1993.
-
(1993)
Speech Communication
, vol.12
, Issue.3
, pp. 247-252
-
-
Varga, A.1
Steeneken, H.J.M.2