-
1
-
-
4544290191
-
Recent advances in the automatic recognition of audio-visual speech
-
G. Potamianos et al., "Recent advances in the automatic recognition of audio-visual speech," Proceedings of the IEEE, vol. 91, no. 9, 2003.
-
(2003)
Proceedings of the IEEE
, vol.91
, Issue.9
-
-
Potamianos, G.1
-
2
-
-
78649537390
-
Visual speech recognition using motion features and hidden Markov models
-
S.-V. B. Heidelberg, Ed.
-
W. C. Yau et al., "Visual Speech Recognition Using Motion Features and Hidden Markov Models," in CAIP 2007, S.-V. B. Heidelberg, Ed., 2007.
-
(2007)
CAIP 2007
-
-
Yau, W.C.1
-
3
-
-
74849097139
-
Automatic visual feature extraction for Mandarin audio-visual speech recognition
-
P. Tsang-Long et al., "Automatic visual feature extraction for Mandarin audio-visual speech recognition," in Systems, Man and Cybernetics, 2009. SMC 2009. IEEE International Conference on, 2009, pp. 2936-2940.
-
(2009)
Systems, Man and Cybernetics, 2009. SMC 2009 IEEE International Conference on
, pp. 2936-2940
-
-
Tsang-Long, P.1
-
7
-
-
0036299249
-
CUAVE: A new audio-visual database for multimodal human-computer interface research
-
E. Patterson et al., "CUAVE: a new audio-visual database for multimodal human-computer interface research," in Acoustics, Speech, and Signal Processing, 2002. Proceedings. (ICASSP '02). IEEE International Conference on, vol. 2, 2002, pp. 2017-2020.
-
(2002)
Acoustics, Speech, and Signal Processing, 2002. Proceedings. (ICASSP '02). IEEE International Conference on
, vol.2
, pp. 2017-2020
-
-
Patterson, E.1
-
8
-
-
85032752352
-
Audiovisual speech processing
-
T. Chen, "Audiovisual speech processing," Signal Processing Magazine, IEEE, vol. 18, no. 1, pp. 9-21, 2001.
-
(2001)
Signal Processing Magazine, IEEE
, vol.18
, Issue.1
, pp. 9-21
-
-
Chen, T.1
-
10
-
-
33750368310
-
An audio-visual corpus for speech perception and automatic speech recognition
-
M. Cooke et al., "An audio-visual corpus for speech perception and automatic speech recognition," The Journal of the Acoustical Society of America, vol. 120, no. 5, pp. 2421-2424, 2006.
-
(2006)
The Journal of the Acoustical Society of America
, vol.120
, Issue.5
, pp. 2421-2424
-
-
Cooke, M.1
-
11
-
-
70149086972
-
The realistic multi-modal VALID database and visual speaker identification comparison experiments
-
New York
-
N. Fox, B. O'Mullane, and R. Reilly, "The Realistic Multi-Modal VALID database and Visual Speaker Identification Comparison Experiments," in AVBPA, New York, 2005.
-
(2005)
AVBPA
-
-
Fox, N.1
O'Mullane, B.2
Reilly, R.3
-
14
-
-
14944353581
-
A segment-based audio-visual speech recognizer: Data collection, development, and initial experiments
-
State College, PA, USA: ACM
-
T. J. Hazen et al., "A segment-based audio-visual speech recognizer: data collection, development, and initial experiments," in Proceedings of the 6th international conference on Multimodal interfaces. State College, PA, USA: ACM, 2004, pp. 235-242.
-
(2004)
Proceedings of the 6th International Conference on Multimodal Interfaces
, pp. 235-242
-
-
Hazen, T.J.1
-
16
-
-
0004052871
-
-
Tech. Rep., Oct. 12
-
C. Neti, G. Potamianos, J. Luettin, I. Matthews, H. Glotin, D. Vergyri, S. Sison, A. Mashari, and J. Zhou, "Audio-visual speech recognition," Tech. Rep., Oct. 12 2000.
-
(2000)
Audio-visual Speech Recognition
-
-
Neti, C.1
Potamianos, G.2
Luettin, J.3
Matthews, I.4
Glotin, H.5
Vergyri, D.6
Sison, S.7
Mashari, A.8
Zhou, J.9
-
17
-
-
47949087133
-
Comparison of phoneme and viseme based acoustic units for speech driven realistic lip animation
-
E. Bozkurt, E. Çigdem Eroglu, E. Erzin, T. Erdem, and M. Ozkan, "Comparison of phoneme and viseme based acoustic units for speech driven realistic lip animation," in 3DTV Conference, 2007, 2007, pp. 1-4.
-
(2007)
3DTV Conference, 2007
, pp. 1-4
-
-
Bozkurt, E.1
Çigdem Eroglu, E.2
Erzin, E.3
Erdem, T.4
Ozkan, M.5
-
18
-
-
0344212675
-
-
New York, NY, USA: John Wiley & Sons, Inc.
-
I. S. Pandzic and R. Forchheimer, MPEG-4 Facial Animation: The Standard, Implementation and Applications. New York, NY, USA: John Wiley & Sons, Inc., 2003.
-
(2003)
MPEG-4 Facial Animation: The Standard, Implementation and Applications
-
-
Pandzic, I.S.1
Forchheimer, R.2
-
19
-
-
84875584220
-
-
vol.1, oct-2 nov
-
A. Goldschen, O. Garcia, and E. Petajan, "Continuous optical automatic speech recognition by lipreading," vol. 1, pp. 572-577 vol.1, oct-2 nov 1994.
-
(1994)
Continuous Optical Automatic Speech Recognition by Lipreading
, vol.1
, pp. 572-577
-
-
Goldschen, A.1
Garcia, O.2
Petajan, E.3
-
21
-
-
85009254391
-
Miketalk: A talking facial display based on morphing visemes
-
T. Ezzat and T. Poggio, "Miketalk: a talking facial display based on morphing visemes," in Computer Animation 98. Proceedings, 1998, pp. 96-102.
-
(1998)
Computer Animation 98. Proceedings
, pp. 96-102
-
-
Ezzat, T.1
Poggio, T.2
-
23
-
-
84870243209
-
Audiovisual and visual-only speech and speaker recognition: Issues about theory, system design and implementation
-
D. Shiell, L. Terry, P. Aleksic, and A. K. Katsaggelos, "Audiovisual and visual-only speech and speaker recognition: Issues about theory, system design and implementation," in Visual Speech Recognition: Lip Segmentation and Mapping, 2008, pp. 1-38.
-
(2008)
Visual Speech Recognition: Lip Segmentation and Mapping
, pp. 1-38
-
-
Shiell, D.1
Terry, L.2
Aleksic, P.3
Katsaggelos, A.K.4
-
24
-
-
85013597845
-
Eigenlips for robust speech recognition
-
vol.2
-
C. Bregler and Y. Konig, "Eigenlips for robust speech recognition," in Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on, vol. 2, 1994, pp. II/669-II/672.
-
(1994)
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
, vol.2
-
-
Bregler, C.1
Konig, Y.2