-
1
-
-
0000125550
-
Synthesis of visible speech
-
April
-
M. M. Cohen and D. Massaro, "Synthesis of visible speech," Behaviour Research Methods, Instruments and Computers, Vol. 22, No. 2, pp. 260-263, April 1990.
-
(1990)
Behaviour Research Methods, Instruments and Computers
, vol.22
, Issue.2
, pp. 260-263
-
-
Cohen, M.M.1
Massaro, D.2
-
2
-
-
0017199877
-
Hearing lips and seeing voices
-
December
-
Harry McGurk and John MacDonald, "Hearing lips and seeing voices," Nature, 264:746-748, December 1976.
-
(1976)
Nature
, vol.264
, pp. 746-748
-
-
McGurk, H.1
MacDonald, J.2
-
3
-
-
0030419195
-
Eigenpoints
-
Lausanne, Switzerland
-
Michele Covell, Christoph Bregler. "Eigenpoints." Proc. Int. Conf. Image Processing, Lausanne, Switzerland, Vol. 3, pp. 471-474, 1996.
-
(1996)
Proc. Int. Conf. Image Processing
, vol.3
, pp. 471-474
-
-
Covell, M.1
Bregler, C.2
-
4
-
-
2342473246
-
Audio-visual talking face detection
-
Baltimore, MD, July
-
Mingkun Li, Dongge Li, Nevenka Dimitrova, and Ishwar K. Sethi, "Audio-visual talking face detection," Proc. International Conference on Multimedia and Expo (ICME), pp. 473-476, Baltimore, MD, July 2003.
-
(2003)
Proc. International Conference on Multimedia and Expo (ICME)
, pp. 473-476
-
-
Li, M.1
Li, D.2
Dimitrova, N.3
Sethi, I.K.4
-
5
-
-
2642557514
-
FaceSync: A linear operator for measuring synchronization of video facial images and audio tracks
-
November
-
Malcolm Slaney and Michele Covell, "FaceSync: A linear operator for measuring synchronization of video facial images and audio tracks," Proc. Advances in Neural Information Processing Systems (NIPS), pp. 814-820, November 2000.
-
(2000)
Proc. Advances in Neural Information Processing Systems (NIPS)
, pp. 814-820
-
-
Slaney, M.1
Covell, M.2
-
7
-
-
0032178592
-
Quantitative association of vocal-tract and facial behavior
-
Hani C. Yehia, Philip E. Rubin, Eric Vatikiotis-Bateson, "Quantitative association of vocal-tract and facial behavior," Speech Communication, Vol. 26, pp. 23-43, 1998.
-
(1998)
Speech Communication
, vol.26
, pp. 23-43
-
-
Yehia, H.C.1
Rubin, P.E.2
Vatikiotis-Bateson, E.3
-
8
-
-
0035492608
-
Person identification in TV programs
-
October
-
Dongge Li, Gang Wei, Ishwar K. Sethi, N. Dimitrova, "Person Identification in TV programs," Journal on Electronic Imaging, Vol. 10, Issue. 4, pp. 930-938, October 2001.
-
(2001)
Journal on Electronic Imaging
, vol.10
, Issue.4
, pp. 930-938
-
-
Li, D.1
Wei, G.2
Sethi, I.K.3
Dimitrova, N.4
-
9
-
-
0009622481
-
Learning joint statistical models for audio-visual fusion and segregation
-
November
-
John W. Fisher III, Trevor Darrell, William T. Freeman, Paul Viola, "Learning joint statistical models for audio-visual fusion and segregation," Advances in Neural Information Processing Systems (NIPS), pp. 772-778, November 2000.
-
(2000)
Advances in Neural Information Processing Systems (NIPS)
, pp. 772-778
-
-
Fisher III, J.W.1
Darrell, T.2
Freeman, W.T.3
Viola, P.4
-
10
-
-
0141631499
-
Audio-visual synchrony for detection of monologues in video archives
-
April
-
G. Iyengar, H. Nock, C. Neti, "Audio-visual synchrony for detection of monologues in video archives" Proc. ICASSP, April 2003.
-
(2003)
Proc. ICASSP
-
-
Iyengar, G.1
Nock, H.2
Neti, C.3
-
11
-
-
0032223839
-
Color-WISE: A system for image similarity retrieval using color
-
San Jose, CA, January
-
Ishwar K. Sethi, Ioana Coman, Brian Day, Feng Jiang, Dongge Li, Jose Segovia-Juarez, Gang Wei, and Bemon You, "Color-WISE: A system for image similarity retrieval using color," SPIE Proc. on Storage and Retrieval for Image and Video Database VI, vol. 3312, pp. 140-149, San Jose, CA, January 1998.
-
(1998)
SPIE Proc. on Storage and Retrieval for Image and Video Database VI
, vol.3312
, pp. 140-149
-
-
Sethi, I.K.1
Coman, I.2
Day, B.3
Jiang, F.4
Li, D.5
Segovia-Juarez, J.6
Wei, G.7
You, B.8
-
12
-
-
0030394830
-
Open-vocabulary speech indexing for voice and video mail retrieval
-
Boston, MA
-
M. G. Brown, J. T. Foote, G. J. Jones, K. S. Jones, and S. J. Young, "Open-vocabulary speech indexing for voice and video mail retrieval," Proc. of ACM Multimedia 96, pp. 307-316, Boston, MA, 1996.
-
(1996)
Proc. of ACM Multimedia 96
, pp. 307-316
-
-
Brown, M.G.1
Foote, J.T.2
Jones, G.J.3
Jones, K.S.4
Young, S.J.5
-
13
-
-
0032374191
-
Cross-modal retrieval of scripted speech audio
-
San Jose, CA, January
-
Fillia Makedon and Charles Owen, "Cross-modal retrieval of scripted speech audio," SPIE Proc. On Multimedia Computing and Networking, vol. 3310, pp. 226-235, San Jose, CA, January 1998.
-
(1998)
SPIE Proc. On Multimedia Computing and Networking
, vol.3310
, pp. 226-235
-
-
Makedon, F.1
Owen, C.2
-
14
-
-
0344644312
-
Omni-face detection for video/image content description
-
November
-
Gang Wei and Ishwar K. Sethi "Omni-face detection for video/image content description", Proc. International Workshop on Multimedia Information Retrieval, in conjunction with ACM Multimedia Conference 2000, (MIR2000), pp. 185-189, November 2000.
-
(2000)
Proc. International Workshop on Multimedia Information Retrieval, in Conjunction with ACM Multimedia Conference 2000, (MIR2000)
, pp. 185-189
-
-
Wei, G.1
Sethi, I.K.2
-
18
-
-
0035308233
-
Classification of general audio data for content-based retrieval
-
April
-
Dongge Li, Ishwar K. Sethi, Nevenka Dimitrova, Tom McGee, "Classification of general audio data for content-based retrieval", Pattern Recognition Letters, Vol. 22, No. 5, pp. 533-544, April 2001.
-
(2001)
Pattern Recognition Letters
, vol.22
, Issue.5
, pp. 533-544
-
-
Li, D.1
Sethi, I.K.2
Dimitrova, N.3
McGee, T.4
|