-
1
-
-
0032638088
-
Robust speaker verification via fusion of speech and lip modalities
-
T. Wark, S. Sridharan, and V. Chandran, "Robust speaker verification via fusion of speech and lip modalities," in Proc. Int. Conf. Acoustics, Speech and Signal Processing (ICASSP'99), vol. 6, 1999, pp. 3061-3064.
-
(1999)
Proc. Int. Conf. Acoustics, Speech and Signal Processing (ICASSP'99)
, vol.6
, pp. 3061-3064
-
-
Wark, T.1
Sridharan, S.2
Chandran, V.3
-
2
-
-
0032074310
-
Audio-visual integration in multimodal communication
-
May
-
T. Chen and R. Rao, "Audio-visual integration in multimodal communication," Proc. IEEE, vol. 86, no. 5, pp. 837-852, May 1998.
-
(1998)
Proc. IEEE
, vol.86
, Issue.5
, pp. 837-852
-
-
Chen, T.1
Rao, R.2
-
3
-
-
0029270677
-
Converting speech into lip movements: A multimedia telephone for hard hearing people
-
Mar.
-
F. Lavagetto, "Converting speech into lip movements: A multimedia telephone for hard hearing people," IEEE Trans. Rehab. Eng., vol. 3, no. 1, pp. 90-102, Mar. 1995.
-
(1995)
IEEE Trans. Rehab. Eng.
, vol.3
, Issue.1
, pp. 90-102
-
-
Lavagetto, F.1
-
4
-
-
0017199877
-
Hearing lips and seeing voices
-
Dec.
-
H. McGurk and J. MacDonald, "Hearing lips and seeing voices," Nature, pp. 746-748, Dec. 1976.
-
(1976)
Nature
, pp. 746-748
-
-
McGurk, H.1
MacDonald, J.2
-
5
-
-
0003544881
-
-
NATO ASI Series F: Computer and Systems Sciences, Eds., Springer-Verlag, New York
-
Speechreading by Humans and Machines, vol. 150, NATO ASI Series F: Computer and Systems Sciences, D. G. Stork and M. E. Hennecke, Eds., Springer-Verlag, New York, 1996.
-
(1996)
Speechreading by Humans and Machines
, vol.150
-
-
Stork, D.G.1
Hennecke, M.E.2
-
6
-
-
0036502797
-
A review of speech-based bimodal recognition
-
Mar.
-
C. C. Chibelushi, F. Deravi, and J. S. D. Mason, "A review of speech-based bimodal recognition," IEEE Trans. Multimedia, vol. 4, no. 1, pp. 23-37, Mar. 2002.
-
(2002)
IEEE Trans. Multimedia
, vol.4
, Issue.1
, pp. 23-37
-
-
Chibelushi, C.C.1
Deravi, F.2
Mason, J.S.D.3
-
7
-
-
0034270644
-
Audio-visual speech modeling for continuous speech recognition
-
Sep.
-
S. Dupont and J. Luettin, "Audio-visual speech modeling for continuous speech recognition," IEEE Trans. Multimedia, vol. 2, no. 3, pp. 141-151, Sep. 2000.
-
(2000)
IEEE Trans. Multimedia
, vol.2
, Issue.3
, pp. 141-151
-
-
Dupont, S.1
Luettin, J.2
-
8
-
-
0031624666
-
Discriminative training of HMM stream exponents for audio-visual speech recognition
-
G. Potamianos and H. P. Graf, "Discriminative training of HMM stream exponents for audio-visual speech recognition," in Proc. Int. Conf. Acoustics, Speech and Signal Processing (ICASSP'98), vol. 6, 1998, pp. 3733-3736.
-
(1998)
Proc. Int. Conf. Acoustics, Speech and Signal Processing (ICASSP'98)
, vol.6
, pp. 3733-3736
-
-
Potamianos, G.1
Graf, H.P.2
-
9
-
-
0025681008
-
Hidden Markov model decomposition of speech and noise
-
A. P. Varga and R. K. Moore, "Hidden Markov model decomposition of speech and noise," in Proc. Int. Conf. Acoustics, Speech and Signal Processing (ICASSP'90), vol. 2, 1990, pp. 845-848.
-
(1990)
Proc. Int. Conf. Acoustics, Speech and Signal Processing (ICASSP'90)
, vol.2
, pp. 845-848
-
-
Varga, A.P.1
Moore, R.K.2
-
10
-
-
0032021555
-
On combining classifiers
-
Mar.
-
J. Kittler, M. Hatef, R. Duin, and J. Matas, "On combining classifiers," IEEE Trans. Pattern Anal. Machine Intell., vol. 20, no. 3, pp. 226-239, Mar. 1998.
-
(1998)
IEEE Trans. Pattern Anal. Machine Intell.
, vol.20
, Issue.3
, pp. 226-239
-
-
Kittler, J.1
Hatef, M.2
Duin, R.3
Matas, J.4
-
12
-
-
22444454265
-
Combining classifiers: A theoretical framework
-
J. Kittler, "Combining classifiers: A theoretical framework," Pattern Anal. and Applicat., vol. 1, no. 1, pp. 18-27, 1998.
-
(1998)
Pattern Anal. and Applicat.
, vol.1
, Issue.1
, pp. 18-27
-
-
Kittler, J.1
-
13
-
-
0004473740
-
Modularity and catastrophic fusion: A Bayesian approach with applications to audio-visual speech recognition
-
USCD, Dept. Cognitive Sci., San Diego, CA
-
J. R. Movellan and P. Mineiro, "Modularity and Catastrophic Fusion: A Bayesian Approach with Applications to Audio-Visual Speech Recognition," USCD, Dept. Cognitive Sci., San Diego, CA, Tech. Rep. 97.01, 1997.
-
(1997)
Tech. Rep. 97.01
-
-
Movellan, J.R.1
Mineiro, P.2
-
14
-
-
0024766457
-
A family of distortion measures based upon projection operation for robust speech recognition
-
Nov.
-
D. Mansour and B. H. Juang, "A family of distortion measures based upon projection operation for robust speech recognition," IEEE Trans. Acoust., Speech, Signal Process., vol. 37, no. 11, pp. 1659-1671, Nov. 1989.
-
(1989)
IEEE Trans. Acoust., Speech, Signal Process.
, vol.37
, Issue.11
, pp. 1659-1671
-
-
Mansour, D.1
Juang, B.H.2
-
15
-
-
35248829639
-
Data dependence in combining classifiers
-
T. Windeatt and F. Roli, Eds.
-
M. S. Kamel and N. M. Wanas, "Data dependence in combining classifiers," in Multiple Classifier Systems, T. Windeatt and F. Roli, Eds., 2003, pp. 1-14.
-
(2003)
Multiple Classifier Systems
, pp. 1-14
-
-
Kamel, M.S.1
Wanas, N.M.2
-
17
-
-
84925595128
-
Combining noise compensation with visual information in speech recognition
-
Rhodes, Greece
-
S. Cox, I. Matthews, and J. A. Bangham, "Combining noise compensation with visual information in speech recognition," in Auditory-Visual Speech Processing (AVSP'97), Rhodes, Greece, 1997.
-
(1997)
Auditory-visual Speech Processing (AVSP'97)
-
-
Cox, S.1
Matthews, I.2
Bangham, J.A.3
-
18
-
-
85135374344
-
Integration of acoustic and visual speech for speaker recognition
-
C. C. Chibelushi, J. S. Mason, and F. Deravi, "Integration of acoustic and visual speech for speaker recognition," in Proc. European Conf. Speech Communication and Technology (Eurospeech'93), 1993, pp. 157-160.
-
(1993)
Proc. European Conf. Speech Communication and Technology (Eurospeech'93)
, pp. 157-160
-
-
Chibelushi, C.C.1
Mason, J.S.2
Deravi, F.3
-
19
-
-
0022019614
-
Intermodal timing relations and audio-visual speech recognition
-
Feb.
-
M. McGrath and Q. Summerfield, "Intermodal timing relations and audio-visual speech recognition," J. Acoust. Soc. Amer., vol. 77, no. 2, pp. 678-685, Feb. 1985.
-
(1985)
J. Acoust. Soc. Amer.
, vol.77
, Issue.2
, pp. 678-685
-
-
McGrath, M.1
Summerfield, Q.2
-
20
-
-
0034842342
-
Asynchronous stream modeling for large vocabulary audio-visual speech recognition
-
J. Luettin, G. Potamianos, and C. Neti, "Asynchronous stream modeling for large vocabulary audio-visual speech recognition," in Proc. Int. Conf. Acoustics, Speech and Signal Processing (ICASSP'01), vol. 1, 2001, pp. 169-172.
-
(2001)
Proc. Int. Conf. Acoustics, Speech and Signal Processing (ICASSP'01)
, vol.1
, pp. 169-172
-
-
Luettin, J.1
Potamianos, G.2
Neti, C.3
-
21
-
-
85046873967
-
The DET curve in assessment of detection task performance
-
A. Martin, G. Doddington, T. Kamm, M. Ordowski, and P. Przybocki, "The DET curve in assessment of detection task performance," in Proc. European Conf. Speech Communication and Technology (Eurospeech'97), vol. 4, 1997, pp. 1895-1898.
-
(1997)
Proc. European Conf. Speech Communication and Technology (Eurospeech'97)
, vol.4
, pp. 1895-1898
-
-
Martin, A.1
Doddington, G.2
Kamm, T.3
Ordowski, M.4
Przybocki, P.5
-
22
-
-
0003922190
-
-
New York: Wiley
-
R. O. Duda, P. E. Hart, and D. G. Stork, Pattern Classification, 2nd ed, New York: Wiley, 2001.
-
(2001)
Pattern Classification, 2nd Ed
-
-
Duda, R.O.1
Hart, P.E.2
Stork, D.G.3
-
24
-
-
0031220766
-
Acoustic-labial speaker verification
-
P. Jourlin, J. Luettin, D. Genoud, and H. Wassner, "Acoustic-labial speaker verification," Pattern Recognit. Lett., vol. 18:9, pp. 853-858, 1997.
-
(1997)
Pattern Recognit. Lett.
, vol.18
, Issue.9
, pp. 853-858
-
-
Jourlin, P.1
Luettin, J.2
Genoud, D.3
Wassner, H.4
-
25
-
-
0037360227
-
Improved facial-feature detection for AVSP via unsupervised clustering and discriminant analysis
-
S. Lucey, V. Chandran, and S. Sridharan, "Improved facial-feature detection for AVSP via unsupervised clustering and discriminant analysis," EURASIP J. Appl. Signal Process., no. 3, pp. 264-275, 2003.
-
(2003)
EURASIP J. Appl. Signal Process.
, Issue.3
, pp. 264-275
-
-
Lucey, S.1
Chandran, V.2
Sridharan, S.3
-
26
-
-
0003571976
-
-
Cambridge, U.K.: Entropic Ltd.
-
S. Young, D. Kershaw, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, The HTK Book (for HTK Version 2.2). Cambridge, U.K.: Entropic Ltd., 1999.
-
(1999)
The HTK Book (For HTK Version 2.2)
-
-
Young, S.1
Kershaw, D.2
Odell, J.3
Ollason, D.4
Valtchev, V.5
Woodland, P.6
-
27
-
-
0024610919
-
A tutorial on hidden Markov models and selected applications in speech recognition
-
Feb.
-
L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
-
(1989)
Proc. IEEE
, vol.77
, Issue.2
, pp. 257-286
-
-
Rabiner, L.R.1
-
28
-
-
0029747053
-
Integrating audio and visual information to provide highly robust speech recognition
-
M. J. Tomlinson, M. J. Russell, and N. M. Brooke, "Integrating audio and visual information to provide highly robust speech recognition," in Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP '96), 1996, pp. 821-824.
-
(1996)
Proc. Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP '96)
, pp. 821-824
-
-
Tomlinson, M.J.1
Russell, M.J.2
Brooke, N.M.3
-
29
-
-
85009268624
-
A link between cepstral shrinking and the weighted product rule in audio-visual speech recognition
-
S. Lucey, V. Chandran, and S. Sridharan, "A link between cepstral shrinking and the weighted product rule in audio-visual speech recognition," in Proc. Int. Conf. Spoken Language Processing (ICSLP'02), 2002, pp. 1961-1964.
-
(2002)
Proc. Int. Conf. Spoken Language Processing (ICSLP'02)
, pp. 1961-1964
-
-
Lucey, S.1
Chandran, V.2
Sridharan, S.3
-
30
-
-
85009126374
-
An investigation of HMM classifier combination strategies for improved audio-visual speech recognition
-
S. Lucey, S. Sridharan, and V. Chandran, "An investigation of HMM classifier combination strategies for improved audio-visual speech recognition," in Proc. European Conf. Speech Communication and Technology (Eurospeech'01), 2001, pp. 1185-1188.
-
(2001)
Proc. European Conf. Speech Communication and Technology (Eurospeech'01)
, pp. 1185-1188
-
-
Lucey, S.1
Sridharan, S.2
Chandran, V.3
-
31
-
-
0001935972
-
XM2VTSDB: The extended M2VTS database
-
K. Messer, J. Matas, J. Kittler, J. Luettin, and G. Maitre, "XM2VTSDB: The extended M2VTS database," in Proc. Int. Conf. Audio and Video-Based Biometric Person Authentication (AVBPA'99), 1999, pp. 72-77.
-
(1999)
Proc. Int. Conf. Audio and Video-based Biometric Person Authentication (AVBPA'99)
, pp. 72-77
-
-
Messer, K.1
Matas, J.2
Kittler, J.3
Luettin, J.4
Maitre, G.5
|