SCOPUS Information Retrieval Platform

Proceedings of the ACM International Multimedia Conference and Exhibition

Volume , Issue , 2003, Pages 604-611

Multimedia content processing through cross-modal association

(4) Li, Dongge a; Dimitrova, Nevenka b; Li, Mingkun c; Sethi, Ishwar K. c

a MOTOROLA INC (United States)

b PHILIPS RESEARCH LABORATORIES (Netherlands)

c OAKLAND UNIVERSITY (United States)

Author keywords

Cross modal association; Cross modal factor analysis (CFA); Cross modal information retrieval; Talking head analysis

Indexed keywords

ALGORITHMS; COMMUNICATION CHANNELS (INFORMATION THEORY); IMAGE COMPRESSION; INFORMATION RETRIEVAL; MULTIMEDIA SYSTEMS; SEMANTICS; SPEECH RECOGNITION; SYNCHRONIZATION;

CROSS-MODAL ASSOCIATION; LATENT SEMANTIC INDEXING (LSI); TALKING HEAD ANALYSIS; VIDEO ANALYSIS;

DATA PROCESSING;

EID: 2342451199 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/957013.957143 Document Type: Conference Paper

Times cited : (286)

References (18)

1
- 0000125550
- Synthesis of visible speech
- April
- M. M. Cohen and D. Massaro, "Synthesis of visible speech," Behaviour Research Methods, Instruments and Computers, Vol. 22, No. 2, pp. 260-263, April 1990.
- (1990) Behaviour Research Methods, Instruments and Computers , vol.22 , Issue.2 , pp. 260-263
- Cohen, M.M.¹ Massaro, D.²

2
- 0017199877
- Hearing lips and seeing voices
- December
- Harry McGurk and John MacDonald, "Hearing lips and seeing voices," Nature, 264:746-748, December 1976.
- (1976) Nature , vol.264 , pp. 746-748
- McGurk, H.¹ MacDonald, J.²

3
- 0030419195
- Eigenpoints
- Lausanne, Switzerland
- Michele Covell, Christoph Bregler. "Eigenpoints." Proc. Int. Conf. Image Processing, Lausanne, Switzerland, Vol. 3, pp. 471-474, 1996.
- (1996) Proc. Int. Conf. Image Processing , vol.3 , pp. 471-474
- Covell, M.¹ Bregler, C.²

4
- 2342473246
- Audio-visual talking face detection
- Baltimore, MD, July
- Mingkun Li, Dongge Li, Nevenka Dimitrova, and Ishwar K. Sethi, "Audio-visual talking face detection," Proc. International Conference on Multimedia and Expo (ICME), pp. 473-476, Baltimore, MD, July 2003.
- (2003) Proc. International Conference on Multimedia and Expo (ICME) , pp. 473-476
- Li, M.¹ Li, D.² Dimitrova, N.³ Sethi, I.K.⁴

5
- 2642557514
- FaceSync: A linear operator for measuring synchronization of video facial images and audio tracks
- November
- Malcolm Slaney and Michele Covell, "FaceSync: A linear operator for measuring synchronization of video facial images and audio tracks," Proc. Advances in Neural Information Processing Systems (NIPS), pp. 814-820, November 2000.
- (2000) Proc. Advances in Neural Information Processing Systems (NIPS) , pp. 814-820
- Slaney, M.¹ Covell, M.²

6
- 84899028297
- Using audio-visual synchrony to locate sounds
- December
- John Hershey and Javier Movellan. "Using audio-visual synchrony to locate sounds," Proc. Advances in Neural Information Processing Systems (NIPS), pp. 813-819, December 1999.
- (1999) Proc. Advances in Neural Information Processing Systems (NIPS) , pp. 813-819
- Hershey, J.¹ Movellan, J.²

7
- 0032178592
- Quantitative association of vocal-tract and facial behavior
- Hani C. Yehia, Philip E. Rubin, Eric Vatikiotis-Bateson, "Quantitative association of vocal-tract and facial behavior," Speech Communication, Vol. 26, pp. 23-43, 1998.
- (1998) Speech Communication , vol.26 , pp. 23-43
- Yehia, H.C.¹ Rubin, P.E.² Vatikiotis-Bateson, E.³

8
- 0035492608
- Person identification in TV programs
- October
- Dongge Li, Gang Wei, Ishwar K. Sethi, N. Dimitrova, "Person Identification in TV programs," Journal on Electronic Imaging, Vol. 10, Issue. 4, pp. 930-938, October 2001.
- (2001) Journal on Electronic Imaging , vol.10 , Issue.4 , pp. 930-938
- Li, D.¹ Wei, G.² Sethi, I.K.³ Dimitrova, N.⁴

9
- 0009622481
- Learning joint statistical models for audio-visual fusion and segregation
- November
- John W. Fisher III, Trevor Darrell, William T. Freeman, Paul Viola, "Learning joint statistical models for audio-visual fusion and segregation," Advances in Neural Information Processing Systems (NIPS), pp. 772-778, November 2000.
- (2000) Advances in Neural Information Processing Systems (NIPS) , pp. 772-778
- Fisher III, J.W.¹ Darrell, T.² Freeman, W.T.³ Viola, P.⁴

10
- 0141631499
- Audio-visual synchrony for detection of monologues in video archives
- April
- G. Iyengar, H. Nock, C. Neti, "Audio-visual synchrony for detection of monologues in video archives," Proc. ICASSP, April 2003.
- (2003) Proc. ICASSP
- Iyengar, G.¹ Nock, H.² Neti, C.³

11
- 0032223839
- Color-WISE: A system for image similarity retrieval using color
- San Jose, CA, January
- Ishwar K. Sethi, Ioana Coman, Brian Day, Feng Jiang, Dongge Li, Jose Segovia-Juarez, Gang Wei, and Bemon You, "Color-WISE: A system for image similarity retrieval using color," SPIE Proc. on Storage and Retrieval for Image and Video Database VI, vol. 3312, pp. 140-149, San Jose, CA, January 1998.
- (1998) SPIE Proc. on Storage and Retrieval for Image and Video Database VI , vol.3312 , pp. 140-149
- Sethi, I.K.¹ Coman, I.² Day, B.³ Jiang, F.⁴ Li, D.⁵ Segovia-Juarez, J.⁶ Wei, G.⁷ You, B.⁸

12
- 0030394830
- Open-vocabulary speech indexing for voice and video mail retrieval
- Boston, MA
- M. G. Brown, J. T. Foote, G. J. Jones, K. S. Jones, and S. J. Young, "Open-vocabulary speech indexing for voice and video mail retrieval," Proc. of ACM Multimedia 96, pp. 307-316, Boston, MA, 1996.
- (1996) Proc. of ACM Multimedia 96 , pp. 307-316
- Brown, M.G.¹ Foote, J.T.² Jones, G.J.³ Jones, K.S.⁴ Young, S.J.⁵

13
- 0032374191
- Cross-modal retrieval of scripted speech audio
- San Jose, CA, January
- Fillia Makedon and Charles Owen, "Cross-modal retrieval of scripted speech audio," SPIE Proc. On Multimedia Computing and Networking, vol. 3310, pp. 226-235, San Jose, CA, January 1998.
- (1998) SPIE Proc. On Multimedia Computing and Networking , vol.3310 , pp. 226-235
- Makedon, F.¹ Owen, C.²

14
- 0344644312
- Omni-face detection for video/image content description
- November
- Gang Wei and Ishwar K. Sethi "Omni-face detection for video/image content description", Proc. International Workshop on Multimedia Information Retrieval, in conjunction with ACM Multimedia Conference 2000, (MIR2000), pp. 185-189, November 2000.
- (2000) Proc. International Workshop on Multimedia Information Retrieval, in Conjunction with ACM Multimedia Conference 2000, (MIR2000) , pp. 185-189
- Wei, G.¹ Sethi, I.K.²

15
- 0003900252
- Oxford University Press, Oxford
- Wojtek Krzanowski, Principles of multivariate analysis, Oxford University Press, Oxford, 1988.
- (1988) Principles of Multivariate Analysis
- Krzanowski, W.¹

16
- 0006625272
- Canonical correlation analysis using artificial neural networks
- Pei L. Lai and Colin Fyfe, "Canonical correlation analysis using artificial neural networks," Proc. European Symposium on Artificial Neural Networks (ESANN), 1998.
- (1998) Proc. European Symposium on Artificial Neural Networks (ESANN)
- Lai, P.L.¹ Fyfe, C.²

17
- 0003976359
- Allyn and Bacon Press
- Barbara G. Tabachnick and Linda S. Fidell, Using multivariate statistics, Allyn and Bacon Press, 1996.
- (1996) Using Multivariate Statistics
- Tabachnick, B.G.¹ Fidell, L.S.²

18
- 0035308233
- Classification of general audio data for content-based retrieval
- April
- Dongge Li, Ishwar K. Sethi, Nevenka Dimitrova, Tom McGee, "Classification of general audio data for content-based retrieval", Pattern Recognition Letters, Vol. 22, No. 5, pp. 533-544, April 2001.
- (2001) Pattern Recognition Letters , vol.22 , Issue.5 , pp. 533-544
- Li, D.¹ Sethi, I.K.² Dimitrova, N.³ McGee, T.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.