-
1
-
-
84862219678
-
Pyramidal implementation of Lucas Kanade feature tracker
-
Bouguet
-
Bouguet (2002). Pyramidal Implementation of Lucas Kanade Feature Tracker. Description of the algorithm.
-
(2002)
Description of the Algorithm
-
-
-
2
-
-
47949087133
-
Comparison of phoneme and viseme based acoustic units for speech driven realistic lip animation
-
Bozkurt, Eroglu, Q., Erzin, Erdem, and Ozkan (2007). Comparison of phoneme and viseme based acoustic units for speech driven realistic lip animation. In 3DTV Conference, 2007, pages 1-4.
-
(2007)
3DTV Conference, 2007
, pp. 1-4
-
-
Bozkurt
Eroglu, Q.
Erzin
Erdem
Ozkan
-
3
-
-
85013597845
-
'Eigenlips' for robust speech recognition
-
Bregler and Konig (1994). 'Eigenlips' for robust speech recognition. In Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on, volume 2, pages II/669-II/672.
-
(1994)
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
, vol.2
-
-
Bregler
Konig
-
4
-
-
78649613221
-
Nostril detection for robust mouth tracking
-
Cork
-
Cappelletta and Harte (2010). Nostril detection for robust mouth tracking. In Irish Signals and Systems Conference, pages 239-244, Cork.
-
(2010)
Irish Signals and Systems Conference
, pp. 239-244
-
-
Cappelletta
Harte
-
6
-
-
85009254391
-
Miketalk: A talking facial display based on morphing visemes
-
Ezzat and Poggio (1998). Miketalk: a talking facial display based on morphing visemes. In Computer Animation 98. Proceedings, pages 96-102.
-
(1998)
Computer Animation 98. Proceedings
, pp. 96-102
-
-
Ezzat
Poggio
-
7
-
-
84875584220
-
Continuous optical automatic speech recognition by lipreading
-
Goldschen, A. J., Garcia, O. N., and Petajan, E. (1994). Continuous optical automatic speech recognition by lipreading. In Proceedings of the 28th Asilomar Conference on Signals, Systems, and Computers, pages 572-577.
-
(1994)
Proceedings of the 28th Asilomar Conference on Signals, Systems, and Computers
, pp. 572-577
-
-
Goldschen, A.J.
Garcia, O.N.
Petajan, E.
-
8
-
-
34047263009
-
Visual model structures and synchrony constraints for audio-visual speech recognition
-
Hazen (2006). Visual model structures and synchrony constraints for audio-visual speech recognition. Audio, Speech, and Language Processing, IEEE Transactions on, 14(3):1082-1089.
-
(2006)
Audio, Speech, and Language Processing, IEEE Transactions on
, vol.14
, Issue.3
, pp. 1082-1089
-
-
Hazen
-
9
-
-
14944353581
-
A segment-based audio-visual speech recognizer: Data collection, development, and initial experiments
-
State College, PA, USA. ACM
-
Hazen, Saenko, La, and Glass (2004). A segment-based audio-visual speech recognizer: data collection, development, and initial experiments. In Proceedings of the 6th international conference on Multimodal interfaces, pages 235-242, State College, PA, USA. ACM.
-
(2004)
Proceedings of the 6th International Conference on Multimodal Interfaces
, pp. 235-242
-
-
Hazen
Saenko
La
Glass
-
10
-
-
85009284526
-
DCT-Based video features for audio-visual speech recognition
-
Denver, CO, USA
-
Heckmann, Kroschel, Savariaux, and Berthommier (2002). DCT-Based Video Features for Audio-Visual Speech Recognition. In International Conference on Spoken Language Processing, volume 1, pages 1925-1928, Denver, CO, USA.
-
(2002)
International Conference on Spoken Language Processing
, vol.1
, pp. 1925-1928
-
-
Heckmann
Kroschel
Savariaux
Berthommier
-
14
-
-
0019647180
-
An iterative image registration technique with an application to stereo vision
-
Lucas and Kanade (1981). An iterative image registration technique with an application to stereo vision. In Proceedings of Image Understanding Workshop.
-
(1981)
Proceedings of Image Understanding Workshop
-
-
Lucas
Kanade
-
15
-
-
0004052871
-
Audio-visual speech recognition
-
The Johns Hopkins University, Baltimore
-
Neti, Potamianos, Luettin, Matthews, Glotin, Vergyri, Sison, Mashari, and Zhou (2000). Audio-visual speech recognition. Technical report, Center for Language and Speech Processing, The Johns Hopkins University, Baltimore.
-
(2000)
Technical Report, Center for Language and Speech Processing
-
-
Neti
Potamianos
Luettin
Matthews
Glotin
Vergyri
Sison
Mashari
Zhou
-
16
-
-
0344212675
-
-
John Wiley & Sons, Inc. New York, NY, USA
-
Pandzic, I. S. and Forchheimer, R. (2003). MPEG-4 Facial Animation: The Standard, Implementation and Applications. John Wiley & Sons, Inc., New York, NY, USA.
-
(2003)
MPEG-4 Facial Animation: The Standard, Implementation and Applications
-
-
Pandzic, I.S.
Forchheimer, R.
-
17
-
-
4544290191
-
Recent advances in the automatic recognition of audio-visual speech
-
Potamianos, Neti, Gravier, Garg, and Senior (2003). Recent advances in the automatic recognition of audio-visual speech. Proceedings of the IEEE, 91(9):1306-1326.
-
(2003)
Proceedings of the IEEE
, vol.91
, Issue.9
, pp. 1306-1326
-
-
Potamianos
Neti
Gravier
Garg
Senior