-
2
-
-
4244194696
-
Multispectral color modeling
-
University of Pennsylvania, CIS
-
ANGELOPOULOU, E., MOLANA, R., and DANIILIDIS, K. (2001): Multispectral color modeling. Technical Report MS-CIS-01-22, University of Pennsylvania, CIS.
-
(2001)
Technical Report
, vol.MS-CIS-01-22
-
-
Angelopoulou, E.1
Molana, R.2
Daniilidis, K.3
-
4
-
-
84943272400
-
Bimodal sensor integration on the example of "speechreading"
-
BREGLER, C., MANKE, S., HILD, H., and WAIBEL, A. (1993): Bimodal sensor integration on the example of "speechreading". Proceedings of the IEEE International Conference on Neural Networks, 667-671.
-
(1993)
Proceedings of the IEEE International Conference on Neural Networks
, pp. 667-671
-
-
Bregler, C.1
Manke, S.2
Hild, H.3
Waibel, A.4
-
5
-
-
0042453502
-
-
STORK and HENNECKE (1996)
-
BREGLER, C., OMOHUNDRO, S.M., SHI, J., and KONIG, Y. (1996): Towards a robust speechreading dialog system. In STORK and HENNECKE (1996), 410-423.
-
(1996)
Towards a Robust Speechreading Dialog System
, pp. 410-423
-
-
Bregler, C.1
Omohundro, S.M.2
Shi, J.3
Konig, Y.4
-
8
-
-
0029304865
-
Human and machine recognition of faces: A survey
-
CHELAPPA, R., WILSON, C., and SIROHEY, S. (1995): Human and machine recognition of faces: A survey, in Proceedings of the IEEE, 83(5): 705-739.
-
(1995)
Proceedings of the IEEE
, vol.83
, Issue.5
, pp. 705-739
-
-
Chelappa, R.1
Wilson, C.2
Sirohey, S.3
-
9
-
-
0010424566
-
-
STORK and HENNECKE (1996)
-
COHEN, M., WALKER, R., and MASSARO, D. (1996): Perception of synthetic visual speech. In STORK and HENNECKE (1996), 153-168.
-
(1996)
Perception of Synthetic Visual Speech
, pp. 153-168
-
-
Cohen, M.1
Walker, R.2
Massaro, D.3
-
11
-
-
0042453480
-
Comparision of parametric representations for monosyllabic word recognition in continuously spoken sentences
-
WAIBEL, A. and LEE, K., editors, Morgan Kaufmann Publishers Inc., San Mateo, CA
-
DAVIS, S. and MERMELSTEIN, P. (1990): Comparision of parametric representations for monosyllabic word recognition in continuously spoken sentences. In WAIBEL, A. and LEE, K., editors, Readings in Speech Recognition, 64-74. Morgan Kaufmann Publishers Inc., San Mateo, CA.
-
(1990)
Readings in Speech Recognition
, pp. 64-74
-
-
Davis, S.1
Mermelstein, P.2
-
14
-
-
0028996862
-
Toward movement-invariant automatic lip-reading and speech recognition
-
Detriot USA
-
DUCHNOWSKI, P., HUNKE, P., BUSCHING, M., MEIER, U., and WAIBEL, A. (1995): Toward movement-invariant automatic lip-reading and speech recognition. In Proceedings of the International Conference of Acoustics, Speech, and Signal Processing, Detriot USA.
-
(1995)
Proceedings of the International Conference of Acoustics, Speech, and Signal Processing
-
-
Duchnowski, P.1
Hunke, P.2
Busching, M.3
Meier, U.4
Waibel, A.5
-
15
-
-
0034270644
-
Audio-visual speech modeling for continuous speech recognition
-
DUPONT. S. and LEUTTIN, J. (2000): Audio-visual speech modeling for continuous speech recognition. IEEE Transactions on Multimedia, 2(3):141-151.
-
(2000)
IEEE Transactions on Multimedia
, vol.2
, Issue.3
, pp. 141-151
-
-
Dupont, S.1
Leuttin, J.2
-
16
-
-
0003824723
-
-
Hartcort Brace and Company, Sydney, 3rd edition
-
FROMKIN, V., RODMAN, R., COLLINS, P., and BLAIR, D. (1996): An Introduction to Langauge. Hartcort Brace and Company, Sydney, 3rd edition.
-
(1996)
An Introduction to Langauge
-
-
Fromkin, V.1
Rodman, R.2
Collins, P.3
Blair, D.4
-
17
-
-
0034842451
-
Weighting schemes for audio-visual fusion in speech recognition
-
GLOTIN, H., VERGYRI, D., NETI, C., POTAMIANOS, G., and LUETTIN, J. (2001): Weighting schemes for audio-visual fusion in speech recognition. In Proc. Int. Conf. Acoust, Speech Signal Process.
-
(2001)
Proc. Int. Conf. Acoust. Speech Signal Process.
-
-
Glotin, H.1
Vergyri, D.2
Neti, C.3
Potamianos, G.4
Luettin, J.5
-
18
-
-
0012745879
-
-
STORK and HENNECKE (1996)
-
GOLDSCHEN, A., GARCIA, O., and PETAJAN, E. (1996): Rationale for phoneme-viseme mapping and feature selection in visual speech recognition. In STORK and HENNECKE (1996), 505-515.
-
(1996)
Rationale for Phoneme-viseme Mapping and Feature Selection in Visual Speech Recognition
, pp. 505-515
-
-
Goldschen, A.1
Garcia, O.2
Petajan, E.3
-
19
-
-
0041451439
-
The use of visible speech cues (speechreading) for directing auditory attention: Reducing temporal and spectral uncertainty in auditory detection of spoken utterances
-
GRANT, K. and SEITZ, P. (1998): The use of visible speech cues (speechreading) for directing auditory attention: Reducing temporal and spectral uncertainty in auditory detection of spoken utterances. In 16th International Congress on Acoustics.
-
(1998)
16th International Congress on Acoustics
-
-
Grant, K.1
Seitz, P.2
-
20
-
-
0000874921
-
Dynamic features for visual speechreading: A systematic comparision
-
MOZER, JORDAN, and PERSCHE, editors, MIT Press, Cambridge MA
-
GRAY, M., MOVELLAN, J., and SEJNOWSKI, T. (1997): Dynamic features for visual speechreading: A systematic comparision. In MOZER, JORDAN, and PERSCHE, editors, Advances in Neural Information Processing Systems, volume 9. MIT Press, Cambridge MA.
-
(1997)
Advances in Neural Information Processing Systems
, vol.9
-
-
Gray, M.1
Movellan, J.2
Sejnowski, T.3
-
22
-
-
0034848499
-
Optimal weighting of posteriors for audio-visual speech recognition
-
Salt Lake City, Utah
-
HECKMANN, M., BERTHOMMIER, F., and KROSCHEL, K. (2001b): Optimal weighting of posteriors for audio-visual speech recognition. In Proceedings of lCASSP 2001, Salt Lake City, Utah.
-
(2001)
Proceedings of LCASSP 2001
-
-
Heckmann, M.1
Berthommier, F.2
Kroschel, K.3
-
23
-
-
4243462047
-
Automatic speech recognition using acoustic and visual signals
-
Ricoh Californian Research Centre
-
HENNECKE, M., PRASAD, K.V., and STORK, D. (1995): Automatic speech recognition using acoustic and visual signals. Technical Report CRC-TR-95-37, Ricoh Californian Research Centre.
-
(1995)
Technical Report
, vol.CRC-TR-95-37
-
-
Hennecke, M.1
Prasad, K.V.2
Stork, D.3
-
24
-
-
78649238564
-
Using deformable templates to infer visual speech dynamics
-
Pacific Grove, CA. IEEE Computer
-
HENNECKE, M., PRASAD, V., and STORK, D. (1994): Using deformable templates to infer visual speech dynamics. In 28th Annual Asimolar Conference on Signals, Systems, and Computer, Pacific Grove, CA. IEEE Computer. 2:576-582.
-
(1994)
28th Annual Asimolar Conference on Signals, Systems, and Computer
, vol.2
, pp. 576-582
-
-
Hennecke, M.1
Prasad, V.2
Stork, D.3
-
25
-
-
0000417467
-
-
STORK and HENNECKE (1996)
-
HENNECKE, M., STORK, D., and PRASAD, K.V. (1996): Visionary speech: Looking ahead to practical speech reading systems. In STORK and HENNECKE (1996), 331-350.
-
(1996)
Visionary Speech: Looking Ahead to Practical Speech Reading Systems
, pp. 331-350
-
-
Hennecke, M.1
Stork, D.2
Prasad, K.V.3
-
26
-
-
84992590661
-
Face locating and tracking for human-computer interaction
-
IEEE Computer Society, Pacific Grove, CA
-
HUNKE, M. and WAIBEL, A. (1994): Face locating and tracking for human-computer interaction. In 28th Annual Asimolar Conference on Signals, Systems, and Computers, IEEE Computer Society, Pacific Grove, CA. 2: 1277-1281.
-
(1994)
28th Annual Asimolar Conference on Signals, Systems, and Computers
, vol.2
, pp. 1277-1281
-
-
Hunke, M.1
Waibel, A.2
-
29
-
-
1542320375
-
Lip feature extraction using red exclusion
-
EADES, P. and JIN, J., editors
-
Lewis T.W. and POWERS, D. (2001): Lip feature extraction using red exclusion. In EADES, P. and JIN, J., editors, CRPIT: Visualisation, 2000, 2: 61-70.
-
(2000)
CRPIT: Visualisation
, vol.2
, pp. 61-70
-
-
Lewis, T.W.1
Powers, D.2
-
31
-
-
0032072433
-
Speech recognition and sensory integration: A 240-year old theorem helps explain how people and machines can integrate auditory and visual information to understand speech
-
MASSARO, D. and STORK, D. (1998): Speech recognition and sensory integration: a 240-year old theorem helps explain how people and machines can integrate auditory and visual information to understand speech. American Scientist, 86(3): 236-245.
-
(1998)
American Scientist
, vol.86
, Issue.3
, pp. 236-245
-
-
Massaro, D.1
Stork, D.2
-
32
-
-
0017199877
-
Hearing lips and seeing voices
-
MCGURK, H. and MACDONALD, J. (1976): Hearing lips and seeing voices. Nature, 264:746-748.
-
(1976)
Nature
, vol.264
, pp. 746-748
-
-
McGurk, H.1
Macdonald, J.2
-
33
-
-
0029725863
-
Adaptive bimodal sensor fusion for automatic speechreading
-
MEIER, U., HURST, W., and DUCHNOWSKI, P. (1996): Adaptive bimodal sensor fusion for automatic speechreading. In Proceedings of the International Conference of Acoustics, Speech, and Signal Processing, 2: 833-837.
-
(1996)
Proceedings of the International Conference of Acoustics, Speech, and Signal Processing
, vol.2
, pp. 833-837
-
-
Meier, U.1
Hurst, W.2
Duchnowski, P.3
-
34
-
-
2642559942
-
Towards unrestricted lip reading
-
Hong Kong
-
MEIER, U., STEIFELHAGEN, R., YANG, J., and WAIBEL, A. (1999): Towards unrestricted lip reading. In Second International Conference on Multimedia Interfaces, Hong Kong, http://wemer.ir.uks.de/js.
-
(1999)
Second International Conference on Multimedia Interfaces
-
-
Meier, U.1
Steifelhagen, R.2
Yang, J.3
Waibel, A.4
-
35
-
-
85029619676
-
Visual speech recognition with stochastic networks
-
Tesauro, G., Toruetzky, D., and Leen, T., editors, MIT Press, Cambridge
-
MOVELLAN, J. (1995): Visual speech recognition with stochastic networks. In Tesauro, G., Toruetzky, D., and Leen, T., editors, Advances in Neural Information Processing Systems, 7: 851-858. MIT Press, Cambridge.
-
(1995)
Advances in Neural Information Processing Systems
, vol.7
, pp. 851-858
-
-
Movellan, J.1
-
36
-
-
0032138429
-
Robust sensor fusion: Analysis and application to audio visual speech recognition
-
MOVELLAN, J. and MINEIRO, P. (1998): Robust sensor fusion: Analysis and application to audio visual speech recognition. Machine Learning, 32: 85-100.
-
(1998)
Machine Learning
, vol.32
, pp. 85-100
-
-
Movellan, J.1
Mineiro, P.2
-
37
-
-
0035790960
-
Large-vocabulary audio-visual speech recognition: A summary of the Johns Hopkins summer 2000 workshop
-
Cannes
-
NETI, C., POTAMIANOS, G., LEUTTIN, J., MATTHEWS, I., GLOTIN, H., and VERGYRI, D. (2001): Large-vocabulary audio-visual speech recognition: A summary of the Johns Hopkins summer 2000 workshop. In Workshop on Multimedia Signal Processing. Special Session on Joint Audio-Visual Processing, Cannes.
-
(2001)
Workshop on Multimedia Signal Processing. Special Session on Joint Audio-Visual Processing
-
-
Neti, C.1
Potamianos, G.2
Leuttin, J.3
Matthews, I.4
Glotin, H.5
Vergyri, D.6
-
39
-
-
0010127090
-
Speaker adaptation for audio-visual speech recognition
-
Budapest
-
POTAMIAONOS, G. and POTAMIANOS, A. (1999): Speaker adaptation for audio-visual speech recognition. In Proceedings of EUROSPEECH (3), 1291-1294, Budapest.
-
(1999)
Proceedings of EUROSPEECH (3)
, pp. 1291-1294
-
-
Potamiaonos, G.1
Potamianos, A.2
-
40
-
-
0003552976
-
Preprocessing video images for neural learning of lipreading
-
Ricoh California Research Centre
-
PRASAD, K., STORK, D., and WOLFF, G. (1993): Preprocessing video images for neural learning of lipreading. Technical Report CRC-TR-93-26, Ricoh California Research Centre.
-
(1993)
Technical Report
, vol.CRC-TR-93-26
-
-
Prasad, K.1
Stork, D.2
Wolff, G.3
-
42
-
-
85060684689
-
Lip modeling for visual speech recognition
-
IEEE Computer Society, Pacific Grove CA
-
RAO, R. and MERSEREAU, R. (1994): Lip modeling for visual speech recognition. In 28th Annual Asimolar Conference on Signals, Systems, and Computers, volume 2. IEEE Computer Society, Pacific Grove CA.
-
(1994)
28th Annual Asimolar Conference on Signals, Systems, and Computers
, vol.2
-
-
Rao, R.1
Mersereau, R.2
-
43
-
-
0042453473
-
-
STORK and HENNECKE (1996)
-
ROBERT-RIBES, J., PIQUEMAL, M., SCHWARTZ, J., and ESCUDIER, P. (1996): Exploiting sensor fusion and stimuli complementary in av speech recognition. In STORK and HENNECKE (1996), 194-219.
-
(1996)
Exploiting Sensor Fusion and Stimuli Complementary in av Speech Recognition
, pp. 194-219
-
-
Robert-Ribes, J.1
Piquemal, M.2
Schwartz, J.3
Escudier, P.4
-
44
-
-
0002358797
-
Discriminative learning of visual data for audiovisual speech recognition
-
ROGOZAN, A. (1999): Discriminative learning of visual data for audiovisual speech recognition. International Journal of Artificial Intelligence Tools, 8(1):43-52.
-
(1999)
International Journal of Artificial Intelligence Tools
, vol.8
, Issue.1
, pp. 43-52
-
-
Rogozan, A.1
-
45
-
-
0038133938
-
Digital representations of speech signals
-
WAIBEL, A. and LEE, K., editors, Morgan Kaufmann Publishers Inc., San Mateo, CA
-
SCHAFER, R. and RABINER, L. (1990): Digital representations of speech signals. In WAIBEL, A. and LEE, K., editors, Readings in Speech Recognition, 49-64. Morgan Kaufmann Publishers Inc., San Mateo, CA.
-
(1990)
Readings in Speech Recognition
, pp. 49-64
-
-
Schafer, R.1
Rabiner, L.2
-
48
-
-
0003544881
-
-
NATO/Springer-Verlag, New York
-
STORK, D. and HENNECKE, M., editors (1996): Speechreading by Man and Machine: Models, System, and Applications. NATO/Springer-Verlag, New York.
-
(1996)
Speechreading by Man and Machine: Models, System, and Applications
-
-
Stork, D.1
Hennecke, M.2
-
50
-
-
0042954451
-
Late integration in audio-visual continuous speech recognition
-
VERMA, A., FARUQUIE, T., NETI, C., BASU, S., and SENIOR, A. (1999): Late integration in audio-visual continuous speech recognition. In Automatic Speech Recognition and Understanding.
-
(1999)
Automatic Speech Recognition and Understanding
-
-
Verma, A.1
Faruquie, T.2
Neti, C.3
Basu, S.4
Senior, A.5
-
52
-
-
0017357502
-
Effect of training on the visual recognition of consonants
-
WALDEN, B., PROSEK, R., MONTGOMERY, A., SCHERR, C., and JONES, C. (1977): Effect of training on the visual recognition of consonants. Journal of Speech and Hearing Research, 20:130-145.
-
(1977)
Journal of Speech and Hearing Research
, vol.20
, pp. 130-145
-
-
Walden, B.1
Prosek, R.2
Montgomery, A.3
Scherr, C.4
Jones, C.5
-
53
-
-
0004524499
-
An approach to statistical lip modelling for speaker identification via chromatic feature extraction
-
WARK, T., SRIDHARAN, S., and CHANDRAN, V. (1998): An approach to statistical lip modelling for speaker identification via chromatic feature extraction. In Proceedings of the IEEE International Conference on Pattern Recognition, 123-125.
-
(1998)
Proceedings of the IEEE International Conference on Pattern Recognition
, pp. 123-125
-
-
Wark, T.1
Sridharan, S.2
Chandran, V.3
|