Volume 16, Issue 9, 2007, Pages 2272-2283

Learning multimodal dictionaries

Author keywords

Audiovisual source localization; Dictionary learning; Multimodal data processing; Sparse representation

Indexed keywords

COMPUTER VISION; DATA STRUCTURES; EIGENVALUES AND EIGENFUNCTIONS; ITERATIVE METHODS; LEARNING ALGORITHMS; SENSORY PERCEPTION;

EID: 34548231808     PISSN: 10577149     EISSN: None     Source Type: Journal    
DOI: 10.1109/TIP.2007.901813     Document Type: Article
Times cited : (42)

References (36)
  • 1
    • 0031106962
    • Multimodality image registration by maximization of mutual information
    • Feb
    • F. Maes, A. Collignon, D. Vandermeulen, G. Marchal, and P. Suetens, "Multimodality image registration by maximization of mutual information," IEEE Trans. Med. Imag., vol. 16, no. 2, pp. 187-198, Feb. 1997.
    • (1997) IEEE Trans. Med. Imag , vol.16 , Issue.2 , pp. 187-198
    • Maes, F.1    Collignon, A.2    Vandermeulen, D.3    Marchal, G.4    Suetens, P.5
  • 2
    • 14844344462
    • From error probability to information theoretic (multi-modal) signal processing
    • T. Butz and J.-P. Thiran, "From error probability to information theoretic (multi-modal) signal processing," Signal Process., vol. 85, no. 5, pp. 875-902, 2005.
    • (2005) Signal Process , vol.85 , Issue.5 , pp. 875-902
    • Butz, T.1    Thiran, J.-P.2
  • 3
    • 0242456951
    • Multispectral satellite image analysis based on the method of blind separation and fusion of sources
    • I. R. Farah, M. B. Ahmed, and M. R. Boussema, "Multispectral satellite image analysis based on the method of blind separation and fusion of sources," in Proc. Int. Geoscience and Remote Sensing Symp., 2003, vol. 6, pp. 3638-3640.
    • (2003) Proc. Int. Geoscience and Remote Sensing Symp , vol.6 , pp. 3638-3640
    • Farah, I.R.1    Ahmed, M.B.2    Boussema, M.R.3
  • 4
    • 0034229412
    • A data fusion algorithm for mapping sea-ice concentrations from special sensor microwave/imager data
    • Apr
    • K. C. Partington, "A data fusion algorithm for mapping sea-ice concentrations from special sensor microwave/imager data," IEEE Trans. Geosci. Remote Sens., vol. 38, no. 4, pp. 1947-1958, Apr. 2000.
    • (2000) IEEE Trans. Geosci. Remote Sens , vol.38 , Issue.4 , pp. 1947-1958
    • Partington, K.C.1
  • 7
    • 4544290191
    • Recent advances in the automatic recognition of audiovisual speech
    • Sep
    • G. Potamianos, C. Neti, G. Gravier, A. Garg, and A. W. Senior, "Recent advances in the automatic recognition of audiovisual speech," Proc. IEEE, vol. 91, no. 9, pp. 1306-1326, Sep. 2003.
    • (2003) Proc. IEEE , vol.91 , Issue.9 , pp. 1306-1326
    • Potamianos, G.1    Neti, C.2    Gravier, G.3    Garg, A.4    Senior, A.W.5
  • 8
    • 20444375102
    • Integration strategies for audio-visual speech processing: Applied to text-dependent speaker recognition
    • Sep
    • S. Lucey, T. Chen, S. Sridharan, and V. Chandran, "Integration strategies for audio-visual speech processing: Applied to text-dependent speaker recognition," IEEE Trans. Multimedia, vol. 7, no. 3, pp. 495-506, Sep. 2005.
    • (2005) IEEE Trans. Multimedia , vol.7 , Issue.3 , pp. 495-506
    • Lucey, S.1    Chen, T.2    Sridharan, S.3    Chandran, V.4
  • 9
    • 10044281988
    • Lifelike talking faces for interactive services
    • Sep
    • E. Cosatto, J. Ostermann, H. Graf, and J. Schroeter, "Lifelike talking faces for interactive services," Proc. IEEE, vol. 91, no. 9, pp. 1406-1429, Sep. 2003.
    • (2003) Proc. IEEE , vol.91 , Issue.9 , pp. 1406-1429
    • Cosatto, E.1    Ostermann, J.2    Graf, H.3    Schroeter, J.4
  • 10
    • 84899028297
    • Audio-vision: Using audio-visual synchrony to locate sounds
    • J. Hershey and J. Movellan, "Audio-vision: Using audio-visual synchrony to locate sounds," Adv. Neural Inf. Process. Syst., vol. 12, pp. 813-819, 1999.
    • (1999) Adv. Neural Inf. Process. Syst , vol.12 , pp. 813-819
    • Hershey, J.1    Movellan, J.2
  • 11
    • 2642557514
    • FaceSync: A linear operator for measuring synchronization of video facial images and audio tracks
    • M. Slaney and M. Covell, "FaceSync: A linear operator for measuring synchronization of video facial images and audio tracks," Adv. Neural Inf. Process. Syst., vol. 13, pp. 814-820, 2000.
    • (2000) Adv. Neural Inf. Process. Syst , vol.13 , pp. 814-820
    • Slaney, M.1    Covell, M.2
  • 12
    • 13444275916
    • Audio/visual independent components
    • Apr
    • P. Smaragdis and M. Casey, "Audio/visual independent components," in Proc. ICA, Apr. 2003, pp. 709-714.
    • (2003) Proc. ICA , pp. 709-714
    • Smaragdis, P.1    Casey, M.2
  • 13
    • 2642562769
    • Speaker association with signal-level audiovisual fusion
    • Jun
    • J. W. Fisher, III and T. Darrell, "Speaker association with signal-level audiovisual fusion," IEEE Trans. Multimedia, vol. 6, no. 3, pp. 406-413, Jun. 2004.
    • (2004) IEEE Trans. Multimedia , vol.6 , Issue.3 , pp. 406-413
    • Fisher III, J.W.1    Darrell, T.2
  • 14
    • 34147167538
    • Cross-modal localization via sparsity
    • Apr
    • E. Kidron, Y. Schechner, and M. Elad, "Cross-modal localization via sparsity," IEEE Trans. Signal Process., vol. 55, no. 4, pp. 1390-1404, Apr. 2007.
    • (2007) IEEE Trans. Signal Process , vol.55 , Issue.4 , pp. 1390-1404
    • Kidron, E.1    Schechner, Y.2    Elad, M.3
  • 17
    • 33749427593
    • Analysis of multimodal sequences using geometric video representations
    • G. Monaci, O. D. Escoda, and P. Vandergheynst, "Analysis of multimodal sequences using geometric video representations," Signal Process., vol. 86, no. 12, pp. 3534-3548, 2006.
    • (2006) Signal Process , vol.86 , Issue.12 , pp. 3534-3548
    • Monaci, G.1    Escoda, O.D.2    Vandergheynst, P.3
  • 18
    • 0029935458
    • Enhancement of selective listening by illusory mislocation of speech sounds due to lip-reading
    • J. Driver, "Enhancement of selective listening by illusory mislocation of speech sounds due to lip-reading," Nature, vol. 381, pp. 66-68, 1996.
    • (1996) Nature , vol.381 , pp. 66-68
    • Driver, J.1
  • 20
    • 33745016748
    • Sound alters activity in human V1 in association with illusory visual perception
    • S. Watkins, L. Shams, S. Tanaka, J.-D. Haynes, and G. Rees, "Sound alters activity in human V1 in association with illusory visual perception," NeuroImage, vol. 31, no. 3, pp. 1247-1256, 2006.
    • (2006) NeuroImage , vol.31 , Issue.3 , pp. 1247-1256
    • Watkins, S.1    Shams, L.2    Tanaka, S.3    Haynes, J.-D.4    Rees, G.5
  • 21
    • 22544487240
    • Touch-induced visual illusion
    • A. Violentyev, S. Shimojo, and L. Shams, "Touch-induced visual illusion," Neuroreport, vol. 16, no. 10, pp. 1107-1110, 2005.
    • (2005) Neuroreport , vol.16 , Issue.10 , pp. 1107-1110
    • Violentyev, A.1    Shimojo, S.2    Shams, L.3
  • 22
    • 33646707471
    • Vision and touch are automatically integrated for the perception of sequences of events
    • J.-P. Bresciani, F. Dammeier, and M. Ernst, "Vision and touch are automatically integrated for the perception of sequences of events," J. Vis., vol. 6, no. 5, pp. 554-564, 2006.
    • (2006) J. Vis , vol.6 , Issue.5 , pp. 554-564
    • Bresciani, J.-P.1    Dammeier, F.2    Ernst, M.3
  • 23
    • 0036874756
    • Moving-talker, speaker-independent feature study, and baseline results using the CUAVE multimodal speech corpus
    • E. K. Patterson, S. Gurbuz, Z. Tufekci, and J. N. Gowdy, "Moving-talker, speaker-independent feature study, and baseline results using the CUAVE multimodal speech corpus," EURASIP J. Appl. Signal Process., vol. 2002, no. 11, pp. 1189-1201, 2002.
    • (2002) EURASIP J. Appl. Signal Process , vol.2002 , Issue.11 , pp. 1189-1201
    • Patterson, E.K.1    Gurbuz, S.2    Tufekci, Z.3    Gowdy, J.N.4
  • 25
    • 0036293554
    • Sparse decomposition of stereo signals with matching pursuit and application to blind separation of more than two sources from a stereo mixture
    • R. Gribonval, "Sparse decomposition of stereo signals with matching pursuit and application to blind separation of more than two sources from a stereo mixture," in Proc. IEEE Int. Conf. Image Processing, 2002, vol. 3, pp. 3057-3060.
    • (2002) Proc. IEEE Int. Conf. Image Processing , vol.3 , pp. 3057-3060
    • Gribonval, R.1
  • 26
    • 33646777471
    • Simultaneous sparse approximation via greedy pursuit
    • J. Tropp, A. Gilbert, and M. J. Strauss, "Simultaneous sparse approximation via greedy pursuit," in Proc. IEEE ICASSP, 2005, vol. 5, pp. 721-724.
    • (2005) Proc. IEEE ICASSP , vol.5 , pp. 721-724
    • Tropp, J.1    Gilbert, A.2    Strauss, M.J.3
  • 27
    • 0034133184
    • Learning overcomplete representations
    • M. Lewicki and T. Sejnowski, "Learning overcomplete representations," Neural Comput., vol. 12, no. 2, pp. 337-365, 2000.
    • (2000) Neural Comput , vol.12 , Issue.2 , pp. 337-365
    • Lewicki, M.1    Sejnowski, T.2
  • 28
    • 0038705107
    • If edges are the independent components of natural images, what are the independent components of natural sounds?
    • S. Abdallah and M. Plumbley, "If edges are the independent components of natural images, what are the independent components of natural sounds?," in Proc. ICA, 2001, pp. 534-539.
    • (2001) Proc. ICA , pp. 534-539
    • Abdallah, S.1    Plumbley, M.2
  • 29
    • 33947685468
    • MoTIF: An efficient algorithm for learning translation invariant dictionaries
    • P. Jost, P. Vandergheynst, S. Lesage, and R. Gribonval, "MoTIF: An efficient algorithm for learning translation invariant dictionaries," in Proc. IEEE ICASSP, 2006, vol. 5, pp. 857-860.
    • (2006) Proc. IEEE ICASSP , vol.5 , pp. 857-860
    • Jost, P.1    Vandergheynst, P.2    Lesage, S.3    Gribonval, R.4
  • 30
    • 0030832881
    • The "independent components" of natural scenes are edge filters
    • A. Bell and T. Sejnowski, "The 'independent components' of natural scenes are edge filters," Vis. Res., vol. 37, no. 23, pp. 3327-3338, 1997.
    • (1997) Vis. Res , vol.37 , Issue.23 , pp. 3327-3338
    • Bell, A.1    Sejnowski, T.2
  • 31
    • 0030779611
    • Sparse coding with an overcomplete basis set: A strategy employed by V1?
    • B. A. Olshausen and D. J. Field, "Sparse coding with an overcomplete basis set: A strategy employed by V1?," Vis. Res., vol. 37, pp. 3311-3327, 1997.
    • (1997) Vis. Res , vol.37 , pp. 3311-3327
    • Olshausen, B.A.1    Field, D.J.2
  • 32
    • 0032606945
    • A probabilistic framework for the adaptation and comparison of image codes
    • M. Lewicki and B. A. Olshausen, "A probabilistic framework for the adaptation and comparison of image codes," J. Opt. Soc. Amer. A, vol. 16, no. 7, pp. 1587-1601, 1999.
    • (1999) J. Opt. Soc. Amer. A , vol.16 , Issue.7 , pp. 1587-1601
    • Lewicki, M.1    Olshausen, B.A.2
  • 34
    • 0345529041
    • Learning sparse, overcomplete representations of time-varying natural images
    • B. A. Olshausen, "Learning sparse, overcomplete representations of time-varying natural images," in Proc. IEEE Int. Conf. Image Processing, 2003, vol. 1, pp. 41-44.
    • (2003) Proc. IEEE Int. Conf. Image Processing , vol.1 , pp. 41-44
    • Olshausen, B.A.1
  • 35
    • 0001732587
    • Temporal decorrelation: A theory of lagged and nonlagged responses in the lateral geniculate nucleus
    • D. Dong and J. Atick, "Temporal decorrelation: A theory of lagged and nonlagged responses in the lateral geniculate nucleus," Network: Comput. Neural Syst., vol. 6, pp. 159-178, 1995.
    • (1995) Network: Comput. Neural Syst , vol.6 , pp. 159-178
    • Dong, D.1    Atick, J.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.