-
2
-
-
0037860595
-
Look Who's Talking: Speaker Detection using Video and Audio Correlation
-
Ross Cutler and Larry Davis, "Look Who's Talking: Speaker Detection using Video and Audio Correlation," in Proc. ICME, 2000.
-
(2000)
Proc. ICME
-
-
Cutler, R.1
Davis, L.2
-
3
-
-
0037700834
-
Assessing face and speech consistency for monologue detection in video
-
Harriet J. Nock, Giridharan Iyengar, and Chalapathy Neti, "Assessing face and speech consistency for monologue detection in video," in Proc. ACM Multimedia, 2002.
-
(2002)
Proc. ACM Multimedia
-
-
Nock, H.J.1
Iyengar, G.2
Neti, C.3
-
4
-
-
0141826698
-
Audio-visual speaker recognition for video broadcast news: Some fusion techniques
-
Denmark, September
-
Benoit Maison, Chalapathy Neti, and Andrew Senior, "Audio-visual speaker recognition for video broadcast news: some fusion techniques," in IEEE Multimedia Signal Processing (MMSP99), Denmark, September 1999.
-
(1999)
IEEE Multimedia Signal Processing (MMSP99)
-
-
Maison, B.1
Neti, C.2
Senior, A.3
-
5
-
-
0009622482
-
Using audio-visual synchrony to locate sounds
-
John Hershey and Javier Movellan, "Using audio-visual synchrony to locate sounds," in Proc. NIPS, 1999.
-
(1999)
Proc. NIPS
-
-
Hershey, J.1
Movellan, J.2
-
7
-
-
0036293478
-
Informative sub-spaces for audiovisual processing: High-level function from low-level fusion
-
John W Fisher III and Trevor Darrell, "Informative sub-spaces for audiovisual processing: High-level function from low-level fusion," in Proc. ICASSP, 2002.
-
(2002)
Proc. ICASSP
-
-
Fisher J.W. III1
Darrell, T.2
-
8
-
-
84898931254
-
Facesync: A linear operator for measuring synchronization of video facial images and audio tracks
-
Malcolm Slaney and Michele Covell, "Facesync: a linear operator for measuring synchronization of video facial images and audio tracks," in Proc. NIPS, 2001.
-
(2001)
Proc. NIPS
-
-
Slaney, M.1
Covell, M.2
-
9
-
-
0004052871
-
Audio-visual speech recognition
-
Johns Hopkins University, Baltimore, MD
-
C. Neti, G. Potamianos, J. Luettin, I. Matthews, H. Glotin, D. Vergyri, J. Sisson, A. Mashari, and J. Zhou, "Audio-visual speech recognition," CLSP Summer Workshop Tech. Rep. WS00AVSR, Johns Hopkins University, Baltimore, MD, 2000.
-
(2000)
CLSP Summer Workshop Tech. Rep. WS00AVSR
-
-
Neti, C.1
Potamianos, G.2
Luettin, J.3
Matthews, I.4
Glotin, H.5
Vergyri, D.6
Sisson, J.7
Mashari, A.8
Zhou, J.9
-
10
-
-
17344389852
-
Robust Speech Recognition in Noisy Environments: The IBM Spine-2 Evaluation System
-
Brian Kingsbury, George Saon, Lidia Mangu, Mukund Padmanabhan, and Ruhi Sarikaya, "Robust Speech Recognition in Noisy Environments: The IBM Spine-2 Evaluation System," in Proc. ICASSP, 2002.
-
(2002)
Proc. ICASSP
-
-
Kingsbury, B.1
Saon, G.2
Mangu, L.3
Padmanabhan, M.4
Sarikaya, R.5
-
11
-
-
0002595416
-
Speaker, environment and channel change detection and clustering via the Bayesian information criterion
-
Scott S. Chen and P. S. Gopalakrishnan, "Speaker, environment and channel change detection and clustering via the Bayesian information criterion," Intl. Conf. On Acoust., Sp., and Sig. Proc., 1998.
-
(1998)
Intl. Conf. On Acoust., Sp., and Sig. Proc.
-
-
Chen, S.S.1
Gopalakrishnan, P.S.2