-
1
-
-
0031187171
-
Speech recognition by machines and humans
-
Lippmann R.P. Speech recognition by machines and humans. Speech Communication. 22:1997;1-15.
-
(1997)
Speech Communication
, vol.22
, pp. 1-15
-
-
Lippmann, R.P.1
-
3
-
-
0037999967
-
Large-vocabulary audio-visual speech recognition by machines and humans
-
G. Potamianos, C. Neti, G. Iyengar, E. Helmuth, Large-vocabulary audio-visual speech recognition by machines and humans, in: Proceedings on Eurospeech, Aalborg, 2001, pp. 1899-1902.
-
(2001)
Proceedings on Eurospeech, Aalborg
, pp. 1899-1902
-
-
Potamianos, G.1
Neti, C.2
Iyengar, G.3
Helmuth, E.4
-
4
-
-
85013580214
-
Sensory integration in audiovisual automatic speech recognition
-
P.L. Silsbee, Sensory integration in audiovisual automatic speech recognition, in: 28th Annual Asilomar Conference on Signals, Systems, and Computers, vol. 1, 1994, pp. 561-565.
-
(1994)
28th Annual Asilomar Conference on Signals, Systems, and Computers
, vol.1
, pp. 561-565
-
-
Silsbee, P.L.1
-
6
-
-
0034848499
-
Optimal weighting of posteriors for audio-visual speech recognition
-
Salt Lake
-
M. Heckmann, F. Berthommier, K. Kroschel, Optimal weighting of posteriors for audio-visual speech recognition, in: Proceedings on ICASSP 2001, Salt Lake, 2001, pp. 161-164.
-
(2001)
Proceedings on ICASSP 2001
, pp. 161-164
-
-
Heckmann, M.1
Berthommier, F.2
Kroschel, K.3
-
8
-
-
85009083793
-
Comparing audio- and a-posteriori-probability-based stream confidence measures for audio-visual speech recognition
-
Aalborg
-
M. Heckmann, T. Wild, F. Berthommier, K. Kroschel, Comparing audio- and a-posteriori-probability-based stream confidence measures for audio-visual speech recognition, in: Proceedings on Eurospeech 2001, Aalborg, 2001, pp. 1023-1026.
-
(2001)
Proceedings on Eurospeech 2001
, pp. 1023-1026
-
-
Heckmann, M.1
Wild, T.2
Berthommier, F.3
Kroschel, K.4
-
9
-
-
85009153179
-
Stream confidence estimation for audio-visual speech recognition
-
Beijing
-
G. Potamianos, C. Neti, Stream confidence estimation for audio-visual speech recognition, in: Proceedings on ICSLP 2000, Beijing, 2000, pp. 746-749.
-
(2000)
Proceedings on ICSLP 2000
, pp. 746-749
-
-
Potamianos, G.1
Neti, C.2
-
10
-
-
0034842451
-
Weighting schemes for audio-visual fusion in speech recognition
-
Salt Lake
-
H. Glotin, D. Vergyri, C. Neti, G. Potamianos, J. Luettin, Weighting schemes for audio-visual fusion in speech recognition, in: Proceedings on ICASSP 2001, Salt Lake, 2001.
-
(2001)
Proceedings on ICASSP 2001
-
-
Glotin, H.1
Vergyri, D.2
Neti, C.3
Potamianos, G.4
Luettin, J.5
-
11
-
-
0003322357
-
Audio visual speech recognition
-
Center for Language and Speech Processing, The Johns Hopkins University, Baltimore
-
C. Neti, G. Potamianos, J. Luettin, I. Matthews, H. Glotin, D. Vergyri, J. Sison, A. Mashari, J. Zhou, Audio visual speech recognition, Final Workshop 2000 Report, Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, 2000.
-
(2000)
Final Workshop 2000 Report
-
-
Neti, C.1
Potamianos, G.2
Luettin, J.3
Matthews, I.4
Glotin, H.5
Vergyri, D.6
Sison, J.7
Mashari, A.8
Zhou, J.9
-
12
-
-
0020836249
-
Evaluation and integration of visual and auditory information in speech perception
-
Massaro D.W., Cohen M.M. Evaluation and integration of visual and auditory information in speech perception. Journal of Experimental Psychology: HPP. 9:1983;751-753.
-
(1983)
Journal of Experimental Psychology: HPP
, vol.9
, pp. 751-753
-
-
Massaro, D.W.1
Cohen, M.M.2
-
13
-
-
85009080413
-
Auditory visual speech processing
-
Aalborg
-
D.W. Massaro, Auditory visual speech processing, in: Proceedings on Eurospeech 2001, Aalborg, 2001, pp. 1153-1156.
-
(2001)
Proceedings on Eurospeech 2001
, pp. 1153-1156
-
-
Massaro, D.W.1
-
14
-
-
0000789852
-
Channel separability in the audio-visual integration of speech: A Bayesian approach
-
Speechreading by Humans and Machines, Models, Systems and Applications, Berlin: Springer-Verlag
-
Movellan J.R., Chadderdon G. Channel separability in the audio-visual integration of speech: a Bayesian approach. Speechreading by Humans and Machines, Models, Systems and Applications. NATO ASI Series. 1996;473-487 Springer-Verlag, Berlin.
-
(1996)
NATO ASI Series
, pp. 473-487
-
-
Movellan, J.R.1
Chadderdon, G.2
-
15
-
-
0002028032
-
Some preliminaries to a comprehensive account of audio-visual speech perception
-
B. Dodd, & R. Campbell. Hillsdale, NJ: Lawrence Erlbaum Associates
-
Summerfield A.Q. Some preliminaries to a comprehensive account of audio-visual speech perception. Dodd B., Campbell R. Hearing by Eye, the Psychology of Lip-reading. 1987;3-51 Lawrence Erlbaum Associates, Hillsdale, NJ.
-
(1987)
Hearing by Eye, the Psychology of Lip-reading
, pp. 3-51
-
-
Summerfield, A.Q.1
-
16
-
-
0036297183
-
A coupled HMM for audio-visual speech recognition
-
A.V. Nefian, L. Liang, X. Pi, L. Xiaoxiang, C. Mao, K. Murphy, A coupled HMM for audio-visual speech recognition, in: Proceedings on ICASSP 2002, vol. 2, 2002, pp. 2013-2016.
-
(2002)
Proceedings on ICASSP 2002
, vol.2
, pp. 2013-2016
-
-
Nefian, A.V.1
Liang, L.2
Pi, X.3
Xiaoxiang, L.4
Mao, C.5
Murphy, K.6
-
17
-
-
0032314380
-
An image transform approach for HMM based automatic lipreading
-
Chicago
-
G. Potamianos, H.P. Graf, E. Cosatto, An image transform approach for HMM based automatic lipreading, in: Proceedings of the International Conference on Image Processing, Chicago, vol. III, 1998, pp. 173-177.
-
(1998)
Proceedings of the International Conference on Image Processing
, vol.3
, pp. 173-177
-
-
Potamianos, G.1
Graf, H.P.2
Cosatto, E.3
-
18
-
-
0034270644
-
Audio-visual speech modelling for continuous speech recognition
-
Dupont S., Luettin J. Audio-visual speech modelling for continuous speech recognition. IEEE Transactions on Multimedia. 2:2000;141-151.
-
(2000)
IEEE Transactions on Multimedia
, vol.2
, pp. 141-151
-
-
Dupont, S.1
Luettin, J.2
-
19
-
-
84957810405
-
A comparison of active shape model and scale decomposition based features for visual speech recognition
-
Freiburg
-
I. Matthews, J.A. Bangham, R. Harvey, S. Cox, A comparison of active shape model and scale decomposition based features for visual speech recognition, in: Proceedings of the European Conference on Computer Vision, Freiburg, 1998, pp. 514-528.
-
(1998)
Proceedings of the European Conference on Computer Vision
, pp. 514-528
-
-
Matthews, I.1
Bangham, J.A.2
Harvey, R.3
Cox, S.4
-
20
-
-
84987702417
-
The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
-
Beijing
-
D. Pearce, H.-G. Hirsch, The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions, in: Proceedings on ICSLP'00, Beijing, vol. 4, 2000, pp. 29-32.
-
(2000)
Proceedings on ICSLP'00
, vol.4
, pp. 29-32
-
-
Pearce, D.1
Hirsch, H.-G.2
-
21
-
-
0003822743
-
-
S. Young, D. Kershaw, J. Odell, D. Ollason, V. Valtchev, P. Woodland, The HTK Book, Revised version for HTK V 3.0, 2000, http://htk.eng.cam.ac.uk/index.shtml.
-
(2000)
The HTK Book, Revised Version for HTK V 3.0
-
-
Young, S.1
Kershaw, D.2
Odell, J.3
Ollason, D.4
Valtchev, V.5
Woodland, P.6
-
23
-
-
0003552976
-
Preprocessing video images for neural learning of lipreading
-
K.V. Prasad, D.G. Stork, G.J. Wolff, Preprocessing video images for neural learning of lipreading, Ricoh CRC Technical Report 93-26, 1993.
-
(1993)
Ricoh CRC Technical Report
, vol.93
, Issue.26
-
-
Prasad, K.V.1
Stork, D.G.2
Wolff, G.J.3
-
24
-
-
85013597845
-
Eigenlips for robust speech recognition
-
Adelaide
-
C. Bregler, Y. Konig, Eigenlips for robust speech recognition, in: Proceedings on ICASSP'94, Adelaide, 1994, pp. 669-672.
-
(1994)
Proceedings on ICASSP'94
, pp. 669-672
-
-
Bregler, C.1
Konig, Y.2
-
25
-
-
85133465985
-
Lipreading using shape, shade and scale
-
Terrigal
-
I. Matthews, T. Cootes, S. Cox, R. Harvey, J.A. Bangham, Lipreading using shape, shade and scale, in: Proceedings of Workshop on Audio Visual Speech Processing, Terrigal, 1998, pp. 73-78.
-
(1998)
Proceedings of Workshop on Audio Visual Speech Processing
, pp. 73-78
-
-
Matthews, I.1
Cootes, T.2
Cox, S.3
Harvey, R.4
Bangham, J.A.5
-
29
-
-
0008571982
-
PCA image coding schemes and visual speech intelligibility
-
Windermere
-
N.M. Brooke, S.D. Scott, PCA image coding schemes and visual speech intelligibility, in: Proceedings on Institute of Acoustics, Windermere, vol. 16, 1994, pp. 123-129.
-
(1994)
Proceedings on Institute of Acoustics
, vol.16
, pp. 123-129
-
-
Brooke, N.M.1
Scott, S.D.2
-
30
-
-
0034270644
-
Audio-visual speech modeling for continuous speech recognition
-
Dupont S., Luettin J. Audio-visual speech modeling for continuous speech recognition. IEEE Transactions on Multimedia. 2:2000;141-151.
-
(2000)
IEEE Transactions on Multimedia
, vol.2
, pp. 141-151
-
-
Dupont, S.1
Luettin, J.2
-
31
-
-
0028996862
-
Toward movement invariant automatic lip-reading and speech recognition
-
Philadelphia
-
P. Duchnowski, M. Hunke, D. Büsching, U. Meier, A. Waibel, Toward movement invariant automatic lip-reading and speech recognition, in: Proceedings of International Conference on Spoken Language Processing, Philadelphia, vol. 1, 1995, pp. 109-112.
-
(1995)
Proceedings of International Conference on Spoken Language Processing
, vol.1
, pp. 109-112
-
-
Duchnowski, P.1
Hunke, M.2
Büsching, D.3
Meier, U.4
Waibel, A.5
-
32
-
-
0034517163
-
A cascade image transform for speaker independent automatic speech reading
-
G. Potamianos, A. Verma, C. Neti, G. Iyengar, S. Basu, A cascade image transform for speaker independent automatic speech reading, in: Proceedings of International Conference on Multimedia and Expo, vol. 2, 2000, pp. 1097-1100.
-
(2000)
Proceedings of International Conference on Multimedia and Expo
, vol.2
, pp. 1097-1100
-
-
Potamianos, G.1
Verma, A.2
Neti, C.3
Iyengar, G.4
Basu, S.5
-
33
-
-
0000813366
-
Talking heads and speech recognisers that can see: The computer processing of visual speech signals
-
D.G. Stork, & M.E. Hennecke. Berlin: Springer-Verlag
-
Brooke N.M. Talking heads and speech recognisers that can see: the computer processing of visual speech signals. Stork D.G., Hennecke M.E. Speechreading by Humans and Machines. 1996;351-371 Springer-Verlag, Berlin.
-
(1996)
Speechreading by Humans and Machines
, pp. 351-371
-
-
Brooke, N.M.1
-
35
-
-
85009284526
-
DCT-based video features for audio-visual speech recognition
-
Beijing
-
M. Heckmann, K. Kroschel, C. Savariaux, F. Berthommier, DCT-based video features for audio-visual speech recognition, in: Proceedings on ICSLP, Beijing, vol. 3, 2002, pp. 1925-1928.
-
(2002)
Proceedings on ICSLP
, vol.3
, pp. 1925-1928
-
-
Heckmann, M.1
Kroschel, K.2
Savariaux, C.3
Berthommier, F.4
-
36
-
-
0002358797
-
Discriminative learning of visual data for audiovisual speech recognition
-
Rogozan A. Discriminative learning of visual data for audiovisual speech recognition. International Journal on Artificial Intelligence Tools. 8:1999;43-52.
-
(1999)
International Journal on Artificial Intelligence Tools
, vol.8
, pp. 43-52
-
-
Rogozan, A.1