-
1
-
-
0039228740
-
The intrinsic bimodality of speech communication and the synthesis of talking faces
-
Hungary, September
-
C. Benoit. The intrinsic bimodality of speech communication and the synthesis of talking faces. In Journal on Communications of the Scientific Society for Telecommunications, Hungary, number 43, pages 32-40, September 1992.
-
(1992)
Journal on Communications of the Scientific Society for Telecommunications
, Issue.43
, pp. 32-40
-
-
Benoit, C.1
-
2
-
-
84925639646
-
Real-time lip tracking and bimodal continuous speech recognition
-
Redondo Beach, CA
-
M. T. Chan, Y. Zhang, and T. S. Huang. Real-time lip tracking and bimodal continuous speech recognition. In Proc. of the Workshop on Multimedia Signal Processing, pp. 65-70, Redondo Beach, CA, 1998.
-
(1998)
Proc. of the Workshop on Multimedia Signal Processing
, pp. 65-70
-
-
Chan, M.T.1
Zhang, Y.2
Huang, T.S.3
-
4
-
-
0034270644
-
Audio-visual speech modeling for continuous speech recognition
-
September
-
S. Dupont and J. Luettin. Audio-visual speech modeling for continuous speech recognition. In IEEE Transactions on Multimedia, number 2, pages 141-151, September 2000.
-
(2000)
IEEE Transactions on Multimedia
, Issue.2
, pp. 141-151
-
-
Dupont, S.1
Luettin, J.2
-
5
-
-
0038359548
-
A probabilistic framework for segment-based speech recognition
-
To appear in
-
J. Glass. A probabilistic framework for segment-based speech recognition. To appear in Computer Speech and Language, 2003.
-
(2003)
Computer Speech and Language
-
-
Glass, J.1
-
6
-
-
85128407852
-
Heterogeneous measurements and multiple classifiers for speech recognition
-
Sydney, Australia, November
-
A. Halberstadt and J. Glass. Heterogeneous measurements and multiple classifiers for speech recognition. In Proceedings of ICSLP 98, Sydney, Australia, November 1998.
-
(1998)
Proceedings of ICSLP 98
-
-
Halberstadt, A.1
Glass, J.2
-
7
-
-
84892140515
-
Using aggregation to improve the performance of mixture Gaussian acoustic models
-
Seattle, May
-
T. J. Hazen and A. Halberstadt. Using aggregation to improve the performance of mixture Gaussian acoustic models. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, Seattle, May 1998.
-
(1998)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing
-
-
Hazen, T.J.1
Halberstadt, A.2
-
9
-
-
14944355052
-
-
Intel's AVCSR Toolkit source code can be downloaded from http://sourceforge.net/projects/opencvlibrary/.
-
-
-
-
10
-
-
0024768209
-
Speaker-independent phone recognition using hidden Markov models
-
November
-
K. F. Lee and H. W. Hon. Speaker-independent phone recognition using hidden Markov models. In IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 37, no. 11, pp. 1641-1648, November 1989.
-
(1989)
IEEE Transactions on Acoustics, Speech, and Signal Processing
, vol.37
, Issue.11
, pp. 1641-1648
-
-
Lee, K.F.1
Hon, H.W.2
-
11
-
-
79952493967
-
Speaker independent audio-visual continuous speech recognition
-
L. H. Liang, X. X. Liu, Y. Zhao, X. Pi and A.V. Nefian. Speaker independent audio-visual continuous speech recognition. In Proc. of the IEEE International Conference on Multimedia and Expo, vol.2, pp. 25-28, 2002.
-
(2002)
Proc. of the IEEE International Conference on Multimedia and Expo
, vol.2
, pp. 25-28
-
-
Liang, L.H.1
Liu, X.X.2
Zhao, Y.3
Pi, X.4
Nefian, A.V.5
-
12
-
-
0030355932
-
Audio-visual speech recognition using multiscale nonlinear image decomposition
-
Philadelphia, PA
-
I. Matthews, J. A. Bangham, and S. Cox. Audio-visual speech recognition using multiscale nonlinear image decomposition. In Proc. of the International Conference on Spoken Language Processing, pp. 38-41, Philadelphia, PA, 1996.
-
(1996)
Proc. of the International Conference on Spoken Language Processing
, pp. 38-41
-
-
Matthews, I.1
Bangham, J.A.2
Cox, S.3
-
13
-
-
0034238554
-
Towards unrestricted lip reading
-
August
-
U. Meier, R. Stiefelhagen, J. Yang, and A. Waibel. Towards unrestricted lip reading. In International Journal of Pattern Recognition and Artificial Intelligence, number 14, pages 571-585, August 2000.
-
(2000)
International Journal of Pattern Recognition and Artificial Intelligence
, Issue.14
, pp. 571-585
-
-
Meier, U.1
Stiefelhagen, R.2
Yang, J.3
Waibel, A.4
-
14
-
-
0001935972
-
XM2VTSDB: The extended M2VTS database
-
Washington, D.C., March. 16 IDIAP-RR 99-02
-
K. Messer, J. Matas, J. Kittler, and K. Jonsson. XM2VTSDB: The extended M2VTS database. In Audio- and Video-based Biometric Person Authentication, AVBPA'99, pages 72-77, Washington, D.C., March 1999. 16 IDIAP-RR 99-02.
-
(1999)
Audio- and Video-based Biometric Person Authentication, AVBPA'99
, pp. 72-77
-
-
Messer, K.1
Matas, J.2
Kittler, J.3
Jonsson, K.4
-
17
-
-
85009230873
-
Audio-visual speech recognition in challenging environments
-
Geneva, Switzerland, September
-
G. Potamianos and C. Neti. Audio-visual speech recognition in challenging environments. In Proc. of EUROSPEECH, pp. 1293-1296, Geneva, Switzerland, September 2003.
-
(2003)
Proc. of EUROSPEECH
, pp. 1293-1296
-
-
Potamianos, G.1
Neti, C.2
-
18
-
-
14944351246
-
Articulatory features for robust visual speech recognition
-
In these proceedings, State College, Pennsylvania
-
K. Saenko, T. Darrell, and J. Glass. Articulatory features for robust visual speech recognition. In these proceedings, ICMI'04, State College, Pennsylvania, 2004.
-
(2004)
ICMI'04
-
-
Saenko, K.1
Darrell, T.2
Glass, J.3
-
19
-
-
0041355006
-
The VidTIMIT database
-
Martigny, Switzerland
-
C. Sanderson. The VidTIMIT Database. IDIAP Communication 02-06, Martigny, Switzerland, 2002.
-
(2002)
IDIAP Communication
, vol.2
, Issue.6
-
-
Sanderson, C.1
-
21
-
-
14944356145
-
Acoustic modeling improvements in a segment-based speech recognizer
-
Keystone, CO, December
-
N. Ström, L. Hetherington, T.J. Hazen, E. Sandness, and J. Glass. Acoustic modeling improvements in a segment-based speech recognizer. In Proc. 1999 IEEE ASRU Workshop, Keystone, CO, December 1999.
-
(1999)
Proc. 1999 IEEE ASRU Workshop
-
-
Ström, N.1
Hetherington, L.2
Hazen, T.J.3
Sandness, E.4
Glass, J.5
-
22
-
-
0025477640
-
Speech database development: TIMIT and beyond
-
V. Zue, S. Seneff, and J. Glass. Speech database development: TIMIT and beyond. Speech Communication, vol. 9, no. 4, pp. 351-356, 1990.
-
(1990)
Speech Communication
, vol.9
, Issue.4
, pp. 351-356
-
-
Zue, V.1
Seneff, S.2
Glass, J.3