2003, Pages 604-611

Multimedia content processing through cross-modal association

Author keywords

Cross modal association; Cross modal factor analysis (CFA); Cross modal information retrieval; Talking head analysis

Indexed keywords

ALGORITHMS; COMMUNICATION CHANNELS (INFORMATION THEORY); IMAGE COMPRESSION; INFORMATION RETRIEVAL; MULTIMEDIA SYSTEMS; SEMANTICS; SPEECH RECOGNITION; SYNCHRONIZATION

EID: 2342451199     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/957013.957143     Document Type: Conference Paper
Times cited: 286

References (18)
  • 2
    • 0017199877
    • Hearing lips and seeing voices
    • December
    • Harry McGurk and John MacDonald, "Hearing lips and seeing voices," Nature, 264:746-748, December 1976.
    • (1976) Nature , vol.264 , pp. 746-748
    • McGurk, H.1    MacDonald, J.2
  • 5
    • 2642557514
    • FaceSync: A linear operator for measuring synchronization of video facial images and audio tracks
    • November
    • Malcolm Slaney and Michele Covell, "FaceSync: A linear operator for measuring synchronization of video facial images and audio tracks," Proc. Advances in Neural Information Processing Systems (NIPS), pp. 814-820, November 2000.
    • (2000) Proc. Advances in Neural Information Processing Systems (NIPS) , pp. 814-820
    • Slaney, M.1    Covell, M.2
  • 7
    • 0032178592
    • Quantitative association of vocal-tract and facial behavior
    • Hani C. Yehia, Philip E. Rubin, Eric Vatikiotis-Bateson, "Quantitative association of vocal-tract and facial behavior," Speech Communication, Vol. 26, pp. 23-43, 1998.
    • (1998) Speech Communication , vol.26 , pp. 23-43
    • Yehia, H.C.1    Rubin, P.E.2    Vatikiotis-Bateson, E.3
  • 8
    • 0035492608
    • Person identification in TV programs
    • October
    • Dongge Li, Gang Wei, Ishwar K. Sethi, N. Dimitrova, "Person identification in TV programs," Journal of Electronic Imaging, Vol. 10, Issue 4, pp. 930-938, October 2001.
    • (2001) Journal of Electronic Imaging , vol.10 , Issue.4 , pp. 930-938
    • Li, D.1    Wei, G.2    Sethi, I.K.3    Dimitrova, N.4
  • 10
    • 0141631499
    • Audio-visual synchrony for detection of monologues in video archives
    • April
    • G. Iyengar, H. Nock, C. Neti, "Audio-visual synchrony for detection of monologues in video archives," Proc. ICASSP, April 2003.
    • (2003) Proc. ICASSP
    • Iyengar, G.1    Nock, H.2    Neti, C.3
  • 13
    • 0032374191
    • Cross-modal retrieval of scripted speech audio
    • San Jose, CA, January
    • Fillia Makedon and Charles Owen, "Cross-modal retrieval of scripted speech audio," SPIE Proc. On Multimedia Computing and Networking, vol. 3310, pp. 226-235, San Jose, CA, January 1998.
    • (1998) SPIE Proc. On Multimedia Computing and Networking , vol.3310 , pp. 226-235
    • Makedon, F.1    Owen, C.2
  • 18
    • 0035308233
    • Classification of general audio data for content-based retrieval
    • April
    • Dongge Li, Ishwar K. Sethi, Nevenka Dimitrova, Tom McGee, "Classification of general audio data for content-based retrieval," Pattern Recognition Letters, Vol. 22, No. 5, pp. 533-544, April 2001.
    • (2001) Pattern Recognition Letters , vol.22 , Issue.5 , pp. 533-544
    • Li, D.1    Sethi, I.K.2    Dimitrova, N.3    McGee, T.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.