메뉴 건너뛰기




Volumn 86, Issue 12, 2006, Pages 3534-3548

Analysis of multimodal sequences using geometric video representations

Author keywords

Audiovisual association; Cross modal localization; Geometric video representation; Multimodal data processing; Sparse redundant decomposition

Indexed keywords

AUDIOVISUAL ASSOCIATION; CROSS MODAL LOCALIZATION; GEOMETRIC VIDEO REPRESENTATION; MULTIMODAL DATA PROCESSING; SPARSE REDUNDANT DECOMPOSITION;

EID: 33749427593     PISSN: 01651684     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.sigpro.2006.02.044     Document Type: Article
Times cited : (18)

References (25)
  • 1
    • 33749432376 scopus 로고    scopus 로고
    • J. Hershey, J. Movellan, Audio-vision: Using audio-visual synchrony to locate sounds, in: Proceedings of NIPS, vol. 12, 1999.
  • 2
    • 33749439245 scopus 로고    scopus 로고
    • M. Slaney, M. Covell, FaceSync: a linear operator for measuring synchronization of video facial images and audio tracks, in: Proceedings of NIPS, vol. 13, 2000.
  • 3
    • 33749436181 scopus 로고    scopus 로고
    • H.J. Nock, G. Iyengar, C. Neti, Speaker localisation using audio-visual synchrony: an empirical study, in: Proceedings of the 10th ACM International Conference on Multimedia, 2002.
  • 4
    • 33749425185 scopus 로고    scopus 로고
    • J.W. Fisher III, T. Darrell, W.T. Freeman, P. Viola, Learning joint statistical models for audio-visual fusion and segregation, in: Proceedings of NIPS, vol. 13, 2000.
  • 5
    • 2642562769 scopus 로고    scopus 로고
    • Speaker association with signal-level audiovisual fusion
    • Fisher III J.W., and Darrell T. Speaker association with signal-level audiovisual fusion. IEEE Trans. Multimedia 6 3 (2004) 406-413
    • (2004) IEEE Trans. Multimedia , vol.6 , Issue.3 , pp. 406-413
    • Fisher III, J.W.1    Darrell, T.2
  • 6
    • 14844344462 scopus 로고    scopus 로고
    • From error probability to information theoretic (multi-modal) signal processing
    • Butz T., and Thiran J.-P. From error probability to information theoretic (multi-modal) signal processing. Signal Processing 85 5 (2005) 875-902
    • (2005) Signal Processing , vol.85 , Issue.5 , pp. 875-902
    • Butz, T.1    Thiran, J.-P.2
  • 7
    • 84863714265 scopus 로고    scopus 로고
    • P. Besson, M. Kunt, T. Butz, J.-P. Thiran, A multimodal approach to extract optimized audio features for speaker detection, in: Proceedings of EUSIPCO, 2005.
  • 8
    • 33749427114 scopus 로고    scopus 로고
    • P. Smaragdis, M. Casey, Audio/visual independent components, in: Proceedings of ICA, 2003, pp. 709-714.
  • 10
    • 0024035735 scopus 로고
    • Time-frequency localization operators: a geometric phase space approach
    • Daubechies I. Time-frequency localization operators: a geometric phase space approach. IEEE Trans. Inform. Theory 34 4 (1988) 605-612
    • (1988) IEEE Trans. Inform. Theory , vol.34 , Issue.4 , pp. 605-612
    • Daubechies, I.1
  • 11
    • 0027842081 scopus 로고
    • Matching pursuits with time-frequency dictionaries
    • Mallat S., and Zhang Z. Matching pursuits with time-frequency dictionaries. IEEE Trans. Signal Process. 41 12 (1993) 3397-3415
    • (1993) IEEE Trans. Signal Process. , vol.41 , Issue.12 , pp. 3397-3415
    • Mallat, S.1    Zhang, Z.2
  • 13
    • 0020737631 scopus 로고
    • The Laplacian pyramid as a compact image code
    • Burt P.J., and Adelson E.H. The Laplacian pyramid as a compact image code. IEEE Trans. Comm. 31 4 (1983) 532-540
    • (1983) IEEE Trans. Comm. , vol.31 , Issue.4 , pp. 532-540
    • Burt, P.J.1    Adelson, E.H.2
  • 14
    • 1242286060 scopus 로고    scopus 로고
    • L. Peotta, L. Granai, P. Vandergheynst, Very low bit rate image coding using redundant dictionaries, in: Proceedings of the SPIE, Wavelets: Applications in Signal and Image Processing X, vol. 5207, 2003, pp. 228-239.
  • 15
    • 33749437951 scopus 로고    scopus 로고
    • O. Divorra Escoda, Toward sparse and geometry adapted video approximations, Ph.D. Thesis, EPFL, Lausanne. Available: 〈http://lts2www.epfl.ch/〉, June 2005 (online).
  • 16
    • 13344261335 scopus 로고    scopus 로고
    • O. Divorra Escoda, P. Vandergheynst, A Bayesian approach to video expansions on parametric over-complete 2-D dictionaries, in: Proceedings of IEEE MMSP, 2004, pp. 490-493.
  • 18
    • 0029701799 scopus 로고    scopus 로고
    • R. Gribonval, E. Bacry, S. Mallat, P. Depalle, X. Rodet, Analysis of sound signals with high resolution matching pursuit, in: Proceedings of IEEE TFTS, 1996, pp. 125-128.
  • 20
    • 33749239260 scopus 로고    scopus 로고
    • G. Monaci, O. Divorra Escoda, P. Vandergheynst, Analysis of multimodal signals using redundant representations, in: Proceedings of IEEE ICIP, 2005.
  • 22
    • 0036874756 scopus 로고    scopus 로고
    • Moving-talker, speaker-independent feature study, and baseline results using the CUAVE multimodal speech corpus
    • Patterson E.K., Gurbuz S., Tufekci Z., and Gowdy J.N. Moving-talker, speaker-independent feature study, and baseline results using the CUAVE multimodal speech corpus. J. Appl. Signal Process. 11 (2002) 1189-1201
    • (2002) J. Appl. Signal Process. , vol.11 , pp. 1189-1201
    • Patterson, E.K.1    Gurbuz, S.2    Tufekci, Z.3    Gowdy, J.N.4
  • 23
    • 33749450587 scopus 로고    scopus 로고
    • R. Gribonval, E. Bacry, J. Abadia, Matching pursuit software and documentation, 〈http://www.cmap.polytechnique.fr/bacry/LastWave/packages/mp/mp.html〉.
  • 24
    • 33749451269 scopus 로고    scopus 로고
    • G. Monaci, Multimodal web page, 〈http://lts2www.epfl.ch/monaci/multimodal.html〉.
  • 25
    • 33749438467 scopus 로고    scopus 로고
    • P. Jost, P. Vandergheynst, P. Frossard, Tree-based pursuit: algorithm and properties, EPFL-ITS Technical Report 2005.13, Lausanne. Available: 〈http://lts2www.epfl.ch/〉, May 2005 (online).


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.