-
1
-
-
0034825241
-
Multistream adaptive evidence combination for noise robust ASR
-
A. Morris, A. Hagen, H. Glotin, and H. Bourlard, "Multistream adaptive evidence combination for noise robust ASR," Speech Communication Journal, vol. 34, no. 1-2, pp. 25-40, 2001.
-
(2001)
Speech Communication Journal
, vol.34
, Issue.1-2
, pp. 25-40
-
-
Morris, A.1
Hagen, A.2
Glotin, H.3
Bourlard, H.4
-
2
-
-
85009135142
-
Beyond the conventional statistical language models: The variable-length sequences approach
-
Beijing, China, October
-
I. Zitouni, K. Smaïli, and J.-P. Haton, "Beyond the conventional statistical language models: the variable-length sequences approach," in Proc. 6th International Conference on Spoken Language Processing (ICSLP), vol. 3, pp. 962-965, Beijing, China, October 2000.
-
(2000)
Proc. 6th International Conference on Spoken Language Processing (ICSLP)
, vol.3
, pp. 962-965
-
-
Zitouni, I.1
Smaïli, K.2
Haton, J.-P.3
-
3
-
-
0017199877
-
Hearing lips and seeing voices
-
December
-
H. McGurk and J. McDonald, "Hearing lips and seeing voices," Nature, vol. 264, pp. 746-748, December 1976.
-
(1976)
Nature
, vol.264
, pp. 746-748
-
-
McGurk, H.1
McDonald, J.2
-
4
-
-
0027228958
-
Improving connected letter recognition by lipreading
-
Minneapolis, Minn, USA, April
-
C. Bregler, H. Hild, S. Manke, and A. Waibel, "Improving connected letter recognition by lipreading," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, vol. 1, pp. 557-560, Minneapolis, Minn, USA, April 1993.
-
(1993)
Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing
, vol.1
, pp. 557-560
-
-
Bregler, C.1
Hild, H.2
Manke, S.3
Waibel, A.4
-
5
-
-
0028996862
-
Toward movement-invariant automatic lip-reading and speech recognition
-
Detroit, Mich, USA, May
-
P. Duchnowski, M. Hunke, D. Büsching, U. Meier, and A. Waibel, "Toward movement-invariant automatic lip-reading and speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, vol. 1, pp. 109-112, Detroit, Mich, USA, May 1995.
-
(1995)
Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing
, vol.1
, pp. 109-112
-
-
Duchnowski, P.1
Hunke, M.2
Büsching, D.3
Meier, U.4
Waibel, A.5
-
6
-
-
84957886748
-
Real-time lip tracking for audio-visual speech recognition applications
-
Cambridge, UK, April
-
R. Kaucic, B. Dalton, and A. Blake, "Real-time lip tracking for audio-visual speech recognition applications," in Proc. 4th European Conference on Computer Vision (ECCV), vol. 2, pp. 376-387, Cambridge, UK, April 1996.
-
(1996)
Proc. 4th European Conference on Computer Vision (ECCV)
, vol.2
, pp. 376-387
-
-
Kaucic, R.1
Dalton, B.2
Blake, A.3
-
7
-
-
0001622390
-
Active shape models for visual speech feature extraction
-
D. G. Stork and M. E. Hennecke, Eds., NATO Advanced Science Institutes, Springer-Verlag, New York, NY, USA
-
J. Luettin, N. A. Thacker, and S. W. Beet, "Active shape models for visual speech feature extraction," in Speechreading by Humans and Machines: Models, Systems, and Applications, D. G. Stork and M. E. Hennecke, Eds., vol. 150 of NATO Advanced Science Institutes, pp. 383-390, Springer-Verlag, New York, NY, USA, 1996.
-
(1996)
Speechreading by Humans and Machines: Models, Systems, and Applications
, vol.150
, pp. 383-390
-
-
Luettin, J.1
Thacker, N.A.2
Beet, S.W.3
-
8
-
-
0012707450
-
-
Tech. Rep. Workshop 2000, Center for Language and Speech Processing (CLSP), Johns Hopkins University, Baltimore, Md, USA, October
-
C. Neti, G. Potamianos, J. Luettin, et al., "Audio-visual speech recognition," Tech. Rep. Workshop 2000, Center for Language and Speech Processing (CLSP), Johns Hopkins University, Baltimore, Md, USA, October 2000.
-
(2000)
Audio-Visual Speech Recognition
-
-
Neti, C.1
Potamianos, G.2
Luettin, J.3
-
9
-
-
0032180188
-
Adaptive fusion of acoustic and visual sources for automatic speech recognition
-
A. Rogozan and P. Deléglise, "Adaptive fusion of acoustic and visual sources for automatic speech recognition," Speech Communication Journal, vol. 26, no. 1-2, pp. 149-161, 1998.
-
(1998)
Speech Communication Journal
, vol.26
, Issue.1-2
, pp. 149-161
-
-
Rogozan, A.1
Deléglise, P.2
-
10
-
-
0002546123
-
Lip signatures for automatic person recognition
-
Washington, DC, USA, March
-
R. Auckenthaler, J. Brand, J. S. D. Mason, F. Deravi, and C. C. Chibelushi, "Lip signatures for automatic person recognition," in Proc. 2nd International Conference on Audio- and Video-Based Biometric Person Authentication (AVBPA), pp. 142-147, Washington, DC, USA, March 1999.
-
(1999)
Proc. 2nd International Conference on Audio- and Video-Based Biometric Person Authentication (AVBPA)
, pp. 142-147
-
-
Auckenthaler, R.1
Brand, J.2
Mason, J.S.D.3
Deravi, F.4
Chibelushi, C.C.5
-
11
-
-
84947907880
-
Acoustic-labial speaker verification
-
J. Bigün, G. Chollet, and G. Borgefors, Eds., Springer-Verlag, Crans-Montana, Switzerland, March
-
P. Jourlin, J. Luettin, D. Genoud, and H. Wassner, "Acoustic-labial speaker verification," in Proc. 1st International Conference on Audio- and Video-Based Biometric Person Authentification (AVBPA), J. Bigün, G. Chollet, and G. Borgefors, Eds., pp. 319-326, Springer-Verlag, Crans-Montana, Switzerland, March 1997.
-
(1997)
Proc. 1st International Conference on Audio- and Video-Based Biometric Person Authentification (AVBPA)
, pp. 319-326
-
-
Jourlin, P.1
Luettin, J.2
Genoud, D.3
Wassner, H.4
-
12
-
-
4243927729
-
A signal processing system for having the sound "pop-out" in noise thanks to the image of the speaker's lips: New advances using multilayer perceptrons
-
Sydney, Australia, December
-
L. Girin, L. Varin, G. Feng, and J.-L. Schwartz, "A signal processing system for having the sound "pop-out" in noise thanks to the image of the speaker's lips: New advances using multilayer perceptrons," in Proc. 5th International Conference on Spoken Language Processing (ICSLP), vol. 4, pp. 1451-1454, Sydney, Australia, December 1998.
-
(1998)
Proc. 5th International Conference on Spoken Language Processing (ICSLP)
, vol.4
, pp. 1451-1454
-
-
Girin, L.1
Varin, L.2
Feng, G.3
Schwartz, J.-L.4
-
13
-
-
2642559942
-
Towards unrestricted lip reading
-
Hong Kong
-
U. Meier, R. Stiefelhagen, J. Yang, and A. Waibel, "Towards unrestricted lip reading," in Proc. 2nd International Conference on Multimodal Interfaces (ICMI), Hong Kong, 1999.
-
(1999)
Proc. 2nd International Conference on Multimodal Interfaces (ICMI)
-
-
Meier, U.1
Stiefelhagen, R.2
Yang, J.3
Waibel, A.4
-
14
-
-
0032678693
-
Unsupervised lip segmentation under natural conditions
-
Phoenix, Ariz, USA, March
-
M. Liévin and F. Luthon, "Unsupervised lip segmentation under natural conditions," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP), vol. 6, pp. 3065-3068, Phoenix, Ariz, USA, March 1999.
-
(1999)
Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP)
, vol.6
, pp. 3065-3068
-
-
Liévin, M.1
Luthon, F.2
-
15
-
-
0001055701
-
Which components of the face do humans and machines best speechread?
-
D. G. Stork and M. E. Hennecke, Eds., NATO Advanced Science Institutes, Springer-Verlag, New York, NY, USA
-
C. Benoît, T. Guiard-Marigny, B. Le Goff, and A. Adjoudani, "Which components of the face do humans and machines best speechread?," in Speechreading by Humans and Machines: Models, Systems, and Applications, D. G. Stork and M. E. Hennecke, Eds., vol. 150 of NATO Advanced Science Institutes, pp. 315-328, Springer-Verlag, New York, NY, USA, 1996.
-
(1996)
Speechreading by Humans and Machines: Models, Systems, and Applications
, vol.150
, pp. 315-328
-
-
Benoît, C.1
Guiard-Marigny, T.2
Le Goff, B.3
Adjoudani, A.4
-
16
-
-
0012725681
-
On the production and perception of audio-visual speech by man and machine
-
Y. Wang, S. Panwar, S.-P. Kim, and H. L. Bertoni, Eds., Plenum, New York, NY, USA, October
-
C. Benoît, "On the production and perception of audio-visual speech by man and machine," in Multimedia Communications and Video Coding, Y. Wang, S. Panwar, S.-P. Kim, and H. L. Bertoni, Eds., Plenum, New York, NY, USA, October 1995.
-
(1995)
Multimedia Communications and Video Coding
-
-
Benoît, C.1
-
17
-
-
0032314380
-
An image transform approach for HMM based automatic lipreading
-
Chicago, Ill, USA, October
-
G. Potamianos, H. P. Graf, and E. Cosatto, "An image transform approach for HMM based automatic lipreading," in Proc. IEEE International Conference on Image Processing (ICIP), vol. 3, pp. 173-177, Chicago, Ill, USA, October 1998.
-
(1998)
Proc. IEEE International Conference on Image Processing (ICIP)
, vol.3
, pp. 173-177
-
-
Potamianos, G.1
Graf, H.P.2
Cosatto, E.3
-
18
-
-
78649238564
-
Using deformable templates to infer visual speech dynamics
-
Pacific Grove, Calif, USA, November
-
M. E. Hennecke, K. V. Prasad, and D. G. Stork, "Using deformable templates to infer visual speech dynamics," in 28th Annual Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, Calif, USA, November 1994.
-
(1994)
28th Annual Asilomar Conference on Signals, Systems, and Computers
-
-
Hennecke, M.E.1
Prasad, K.V.2
Stork, D.G.3
-
19
-
-
0003231941
-
Active contours for lipreading: Combining snakes with templates
-
Juan-les-Pins, France, September
-
S. Horbelt and J.-L. Dugelay, "Active contours for lipreading: combining snakes with templates," in 15th GRETSI Symposium Signal and Image Processing, pp. 717-720, Juan-les-Pins, France, September 1995.
-
(1995)
15th GRETSI Symposium Signal and Image Processing
, pp. 717-720
-
-
Horbelt, S.1
Dugelay, J.-L.2
-
20
-
-
84997531258
-
Model-based versus knowledge-guided representation of non-rigid objects: A case study
-
IEEE Computer Society Press, Los Alamitos, Calif, USA
-
R. Kober, J. Schiffers, and K. Schmidt, "Model-based versus knowledge-guided representation of non-rigid objects: A case study," in Proc. IEEE International Conference on Image Processing, vol. 1, pp. 973-977, IEEE Computer Society Press, Los Alamitos, Calif, USA, 1994.
-
(1994)
Proc. IEEE International Conference on Image Processing
, vol.1
, pp. 973-977
-
-
Kober, R.1
Schiffers, J.2
Schmidt, K.3
-
21
-
-
0012755890
-
Face identification by deformation measure
-
Vienna, Austria, August
-
B. Leroy, I. L. Herlin, and L. D. Cohen, "Face identification by deformation measure," in Proc. IEEE International Conference on Pattern Recognition (ICPR), vol. 3, pp. 633-637, Vienna, Austria, August 1996.
-
(1996)
Proc. IEEE International Conference on Pattern Recognition (ICPR)
, vol.3
, pp. 633-637
-
-
Leroy, B.1
Herlin, I.L.2
Cohen, L.D.3
-
22
-
-
0012704142
-
Tracking of deformable contours by synthesis and match
-
Vienna, Austria, August
-
K. F. Lai, C. W. Ngo, and S. Chan, "Tracking of deformable contours by synthesis and match," in Proc. IEEE International Conference on Pattern Recognition (ICPR), vol. 1, pp. 657-661, Vienna, Austria, August 1996.
-
(1996)
Proc. IEEE International Conference on Pattern Recognition (ICPR)
, vol.1
, pp. 657-661
-
-
Lai, K.F.1
Ngo, C.W.2
Chan, S.3
-
23
-
-
78649293030
-
A new 3D lip model for analysis and synthesis of lip motion in speech production
-
Terrigal, Australia, December
-
L. Revéret and C. Benoît, "A new 3D lip model for analysis and synthesis of lip motion in speech production," in Proc. Auditory-Visual Speech Processing (AVSP), pp. 207-212, Terrigal, Australia, December 1998.
-
(1998)
Proc. Auditory-Visual Speech Processing (AVSP)
, pp. 207-212
-
-
Revéret, L.1
Benoît, C.2
-
24
-
-
0032309170
-
3D modeling and tracking of human lip motion
-
Bombay, India, January
-
S. Basu, N. Oliver, and A. Pentland, "3D modeling and tracking of human lip motion," in Proc. IEEE International Conference on Computer Vision (ICCV), pp. 337-343, Bombay, India, January 1998.
-
(1998)
Proc. IEEE International Conference on Computer Vision (ICCV)
, pp. 337-343
-
-
Basu, S.1
Oliver, N.2
Pentland, A.3
-
25
-
-
0034270644
-
Audio-visual speech modeling for continuous speech recognition
-
S. Dupont and J. Luettin, "Audio-visual speech modeling for continuous speech recognition," IEEE Trans. Multimedia, vol. 2, no. 3, pp. 141-151, 2000.
-
(2000)
IEEE Trans. Multimedia
, vol.2
, Issue.3
, pp. 141-151
-
-
Dupont, S.1
Luettin, J.2
-
26
-
-
85133465985
-
Lipreading using shape, shading and scale
-
Terrigal, Australia, December
-
I. Matthews, T. Cootes, S. Cox, R. Harvey, and J. A. Bangham, "Lipreading using shape, shading and scale," in Proc. Auditory-Visual Speech Processing (AVSP), pp. 73-78, Terrigal, Australia, December 1998.
-
(1998)
Proc. Auditory-Visual Speech Processing (AVSP)
, pp. 73-78
-
-
Matthews, I.1
Cootes, T.2
Cox, S.3
Harvey, R.4
Bangham, J.A.5
-
27
-
-
0000134331
-
2D deformable models for visual speech analysis
-
D. G. Stork and M. E. Hennecke, Eds., NATO Advanced Science Institutes, Springer-Verlag, New York, NY, USA
-
T. Coianiz, L. Torresani, and B. Caprile, "2D deformable models for visual speech analysis," in Speechreading by Humans and Machines: Models, Systems, and Applications, D. G. Stork and M. E. Hennecke, Eds., vol. 150 of NATO Advanced Science Institutes, pp. 391-398, Springer-Verlag, New York, NY, USA, 1996.
-
(1996)
Speechreading by Humans and Machines: Models, Systems, and Applications
, vol.150
, pp. 391-398
-
-
Coianiz, T.1
Torresani, L.2
Caprile, B.3
-
28
-
-
84941187690
-
Using aerial and geometric features in automatic lip-reading
-
Aalborg, Denmark, September
-
J. C. Wojdel and L. J. M. Rothkrantz, "Using aerial and geometric features in automatic lip-reading," in Proc. 7th European Conference on Speech Communication and Technology (Eurospeech), vol. 4, pp. 2463-2466, Aalborg, Denmark, September 2001.
-
(2001)
Proc. 7th European Conference on Speech Communication and Technology (Eurospeech)
, vol.4
, pp. 2463-2466
-
-
Wojdel, J.C.1
Rothkrantz, L.J.M.2
-
29
-
-
0036875048
-
Automatic speechreading with applications to human-computer interfaces
-
X. Zhang, C. C. Broun, R. M. Mersereau, and M. Clements, "Automatic speechreading with applications to human-computer interfaces," EURASIP Journal on Applied Signal Processing, vol. 2002, no. 11, pp. 1228-1247, 2002.
-
(2002)
EURASIP Journal on Applied Signal Processing
, vol.2002
, Issue.11
, pp. 1228-1247
-
-
Zhang, X.1
Broun, C.C.2
Mersereau, R.M.3
Clements, M.4
-
30
-
-
0032310760
-
Accurate, real-time, unadorned lip tracking
-
Bombay, India, January
-
R. Kaucic and A. Blake, "Accurate, real-time, unadorned lip tracking," in Proc. IEEE International Conference on Computer Vision (ICCV), pp. 370-375, Bombay, India, January 1998.
-
(1998)
Proc. IEEE International Conference on Computer Vision (ICCV)
, pp. 370-375
-
-
Kaucic, R.1
Blake, A.2
-
32
-
-
0000238336
-
A simplex method for function minimization
-
J. A. Nelder and R. Mead, "A simplex method for function minimization," Computing Journal, vol. 7, no. 4, pp. 308-313, 1965.
-
(1965)
Computing Journal
, vol.7
, Issue.4
, pp. 308-313
-
-
Nelder, J.A.1
Mead, R.2
-
33
-
-
0017930815
-
Dynamic programming algorithm optimization for spoken word recognition
-
H. Sakoe and S. Chiba, "Dynamic programming algorithm optimization for spoken word recognition," IEEE Trans. Acoustics, Speech, and Signal Processing, vol. 26, no. 1, pp. 43-49, 1978.
-
(1978)
IEEE Trans. Acoustics, Speech, and Signal Processing
, vol.26
, Issue.1
, pp. 43-49
-
-
Sakoe, H.1
Chiba, S.2
-
34
-
-
0012706763
-
Utilisation de l'information acoustique pour aligner deux séquences de parole audiovisuelle
-
Mons, Belgium, September
-
P. Daubias, "Utilisation de l'information acoustique pour aligner deux séquences de parole audiovisuelle," in Proc. 4th Rencontres Jeunes Chercheurs en Parole, pp. 74-77, Mons, Belgium, September 2001.
-
(2001)
Proc. 4th Rencontres Jeunes Chercheurs en Parole
, pp. 74-77
-
-
Daubias, P.1
-
35
-
-
84858968331
-
BD-SONS: Une base de donnés des sons du français
-
Toronto, Canada
-
R. Descout, J.-F. Sérignat, O. Cervantes, and R. Carré, "BD-SONS: Une base de donnés des sons du français," in Proc. 12th International Congress on Acoustics (ICA), Toronto, Canada, 1986.
-
(1986)
Proc. 12th International Congress on Acoustics (ICA)
-
-
Descout, R.1
Sérignat, J.-F.2
Cervantes, O.3
Carré, R.4
-
36
-
-
85009187138
-
Lip-reading based on a fully automatic statistical model
-
Denver, Col, USA, September
-
P. Daubias and P. Deléglise, "Lip-reading based on a fully automatic statistical model," in Proc. 7th International Conference on Spoken Language Processing (ICSLP), vol. 1, pp. 209-212, Denver, Col, USA, September 2002.
-
(2002)
Proc. 7th International Conference on Spoken Language Processing (ICSLP)
, vol.1
, pp. 209-212
-
-
Daubias, P.1
Deléglise, P.2
|