-
2
-
-
0037860595
-
Look who's talking: Speaker detection using video and audio correlation
-
Ross Cutler and Larry Davis, "Look Who's Talking: Speaker Detection using Video and Audio Correlation, " in Proc. ICME, 2000.
-
(2000)
Proc. ICME
-
-
Cutler, R.1
Davis, L.2
-
3
-
-
0037700834
-
Assessing face and speech consistency for monologue de- Tectionin video
-
Harriet J. Nock, Giridharan Iyengar, and Chalapathy Neti, "Assessing face and speech consistency for monologue de- Tectionin video, " in Proc. ACM Multimedia, 2002.
-
(2002)
Proc. ACM Multimedia
-
-
Nock, H.J.1
Iyengar, G.2
Neti, C.3
-
4
-
-
0141826698
-
Audio-visual speaker recognition for video broadcast news: Some fusion techniques
-
Denmark, September
-
Benoit Maison, Chalapathy Neti, and Andrew Senior, "Audio-visual speaker recognition for video broadcast news: Some fusion techniques, " in IEEE Multimedia Signal Processing (MMSP99), Denmark, September 1999.
-
(1999)
IEEE Multimedia Signal Processing (MMSP99)
-
-
Maison, B.1
Neti, C.2
Andrew, S.3
-
5
-
-
0009622482
-
Using audio-visual synchrony to locate sounds
-
John Hershey and Javier Movellan, "Using audio-visual synchrony to locate sounds, " in Proc. NIPS, 1999.
-
(1999)
Proc. NIPS
-
-
Hershey, J.1
Movellan, J.2
-
6
-
-
84898954418
-
Learning joint statistical models for audio-visual fusion and segregation
-
JW Fisher III, T Darrell, WT Freeman, and P Viola, "Learning Joint Statistical Models for Audio-Visual Fusion and Segregation, " in Proc. NIPS, 2001.
-
(2001)
Proc. NIPS
-
-
Fisher, J.W.1
Darrell, T.2
Freeman, W.T.3
Viola, P.4
-
7
-
-
0036293478
-
Informative sub- spaces for audiovisual processing: High-level function from low-level fusion
-
John W Fisher III and Trevor Darrell, "Informative sub- spaces for audiovisual processing: High-level function from low-level fusion, " in Proc. ICASSP, 2002.
-
(2002)
Proc. ICASSP
-
-
Fisher, J.W.1
Darrell, T.2
-
8
-
-
84898931254
-
Facesync: Alinearop- erator for measuring synchronization of video facial images and audio tracks
-
Malcolm Slaney and Michele Covell, "Facesync: Alinearop- erator for measuring synchronization of video facial images and audio tracks, " in Proc. NIPS, 2001.
-
(2001)
Proc. NIPS
-
-
Slaney, M.1
Covell, M.2
-
9
-
-
0004052871
-
Audio-visual speech recognition
-
Johns-Hopkins University, Baltimore, MD
-
C. Neti, G. Potamianos, J. Leuttin, I. Matthews, H. Glotin, D. Vergyri, J. Sisson, A. Mashari, and J. Zhou, "Audio-visual speech recognition, " CLSP Summer Workshop Tech. Rep. WSOOAVSR, Johns-Hopkins University, Baltimore, MD, 2000.
-
(2000)
CLSP Summer Workshop Tech. Rep. WSOOAVSR
-
-
Neti, C.1
Potamianos, G.2
Leuttin, J.3
Matthews, I.4
Glotin, H.5
Vergyri, D.6
Sisson, J.7
Mashari, A.8
Zhou, J.9
-
10
-
-
85088715355
-
Robust speech recognition in noisy environments: The IBM spine-2 evaluation system
-
Brian Kingsbury, George Saon, Lidia Mangu, Mukund Pad- manabhan, and Ruhi Sarikaya, "Robust Speech Recognition in Noisy Environments: The IBM Spine-2 Evaluation System, " in Proc. ICASSP, 2002.
-
(2002)
Proc. ICASSP
-
-
Kingsbury, B.1
Saon, G.2
Mangu, L.3
Manabhan, M.P.-4
Sarikaya, R.5
-
11
-
-
0002595416
-
Speaker, environment and channel change detection and clustering via the bayesian information criterion
-
Scott S. Chen and P. S. Gopalakrishnan, "Speaker, environment and channel change detection and clustering via the bayesian information criterion, " Intl. Conf. On Acoust., Sp., andSig. Proc., 1998.
-
(1998)
Intl. Conf. on Acoust., Sp., AndSig. Proc
-
-
Chen, S.S.1
Gopalakrishnan, P.S.2
|