SCOPUS 정보 검색 플랫폼

IEEE Transactions on Image Processing

Volumn 16, Issue 9, 2007, Pages 2272-2283

Learning multimodal dictionaries

(6) Monaci, Gianluca a Jost, Philippe a Vandergheynst, Pierre a Mailhé, Boris b Lesage, Sylvain b Gribonval, Rémi b

a EPFL (Switzerland)

b CAMPUS DE BEAULIEU (France)

Author keywords

Audiovisual source localization; Dictionary learning; Multimodal data processing; Sparse representation

Indexed keywords

COMPUTER VISION; DATA STRUCTURES; EIGENVALUES AND EIGENFUNCTIONS; ITERATIVE METHODS; LEARNING ALGORITHMS; SENSORY PERCEPTION;

AUDIOVISUAL SOURCE LOCALIZATION; DICTIONARY LEARNING; MULTIMODAL DATA PROCESSING; SPARSE REPRESENTATION;

DATA PROCESSING;

ALGORITHM; ARTICLE; ARTIFICIAL INTELLIGENCE; AUTOMATED PATTERN RECOGNITION; BOOK; COMPUTER ASSISTED DIAGNOSIS; IMAGE ENHANCEMENT; IMAGE SUBTRACTION; INFORMATION RETRIEVAL; METHODOLOGY;

ALGORITHMS; ARTIFICIAL INTELLIGENCE; DICTIONARIES AS TOPIC; IMAGE ENHANCEMENT; IMAGE INTERPRETATION, COMPUTER-ASSISTED; INFORMATION STORAGE AND RETRIEVAL; PATTERN RECOGNITION, AUTOMATED; SUBTRACTION TECHNIQUE;

EID: 34548231808 PISSN: 10577149 EISSN: None Source Type: Journal
DOI: 10.1109/TIP.2007.901813 Document Type: Article

Times cited : (42)

References (36)

1
- 0031106962
- Multimodality image registration by maximization of mutual information
- Feb
- F. Maes, A. Collignon, D. Vandermeulen, G. Marchal, and P. Suetens, "Multimodality image registration by maximization of mutual information," IEEE Trans. Med. Imag., vol. 16, no. 2, pp. 187-198, Feb. 1997.
- (1997) IEEE Trans. Med. Imag , vol.16 , Issue.2 , pp. 187-198
- Maes, F.¹ Collignon, A.² Vandermeulen, D.³ Marchal, G.⁴ Suetens, P.⁵

2
- 14844344462
- From error probability to information theoretic (multi-modal) signal processing
- T. Butz and J.-P. Thiran, "From error probability to information theoretic (multi-modal) signal processing," Signal Process., vol. 85, no. 5, pp. 875-902, 2005.
- (2005) Signal Process , vol.85 , Issue.5 , pp. 875-902
- Butz, T.¹ Thiran, J.-P.²

3
- 0242456951
- Multispectral satellite image analysis based on the method of blind separation and fusion of sources
- I. R. Farah, M. B. Ahmed, and M. R. Boussema, "Multispectral satellite image analysis based on the method of blind separation and fusion of sources," in Proc. Int. Geoscience and Remote Sensing Symp., 2003, vol. 6, pp. 3638-3640.
- (2003) Proc. Int. Geoscience and Remote Sensing Symp , vol.6 , pp. 3638-3640
- Farah, I.R.¹ Ahmed, M.B.² Boussema, M.R.³

4
- 0034229412
- A data fusion algorithm for mapping sea-ice concentrations from special sensor microwave/imager data
- Apr
- K. C. Partington, "A data fusion algorithm for mapping sea-ice concentrations from special sensor microwave/imager data," IEEE Trans. Geosci. Remote Sens., vol. 38, no. 4, pp. 1947-1958, Apr. 2000.
- (2000) IEEE Trans. Geosci. Remote Sens , vol.38 , Issue.4 , pp. 1947-1958
- Partington, K.C.¹

5
- 3042592851
- Concurrent EEG/fMR1 analysis by multiway partial least squares
- E. Martínez-Montes, P. A. Valdés-Sosa, F. Miwakeichi, R. I. Goldman, and M. S. Cohen, "Concurrent EEG/fMR1 analysis by multiway partial least squares," NeuroImage, vol. 22, pp. 1023-1034, 2004.
- (2004) NeuroImage , vol.22 , pp. 1023-1034
- Martínez-Montes, E.¹ Valdés-Sosa, P.A.² Miwakeichi, F.³ Goldman, R.I.⁴ Cohen, M.S.⁵

6
- 33745131045
- Characterizing inter-annual variations in global fire calendar using data from earth observing satellites
- C. Carmona-Moreno, A. Belward, J. Malingreau, M. Garcia-Alegre, A. Hartley, M. Antonovskiy, V. Buchshiaber, and V. Pivovarov, "Characterizing inter-annual variations in global fire calendar using data from earth observing satellites," Global Change Biol., vol. 11, no. 9, pp. 1537-1555, 2005.
- (2005) Global Change Biol , vol.11 , Issue.9 , pp. 1537-1555
- Carmona-Moreno, C.¹ Belward, A.² Malingreau, J.³ Garcia-Alegre, M.⁴ Hartley, A.⁵ Antonovskiy, M.⁶ Buchshiaber, V.⁷ Pivovarov, V.⁸

7
- 4544290191
- Recent advances in the automatic recognition of audiovisual speech
- Sep
- G. Potamianos, C. Neti, G. Gravier, A. Garg, and A. W. Senior, "Recent advances in the automatic recognition of audiovisual speech," Proc. IEEE, vol. 91, no. 9, pp. 1306-1326, Sep. 2003.
- (2003) Proc. IEEE , vol.91 , Issue.9 , pp. 1306-1326
- Potamianos, G.¹ Neti, C.² Gravier, G.³ Garg, A.⁴ Senior, A.W.⁵

8
- 20444375102
- Integration strategies for audio-visual speech processing: Applied to text-dependent speaker recognition
- Sep
- S. Lucey, T. Chen, S. Sridharan, and V. Chandran, "Integration strategies for audio-visual speech processing: Applied to text-dependent speaker recognition," IEEE Trans. Multimedia, vol. 7, no. 3, pp. 495-506, Sep. 2005.
- (2005) IEEE Trans. Multimedia , vol.7 , Issue.3 , pp. 495-506
- Lucey, S.¹ Chen, T.² Sridharan, S.³ Chandran, V.⁴

9
- 10044281988
- Lifelike talking faces for interactive services
- Sep
- E. Cosatto, J. Ostermann, H. Graf, and J. Schroeter, "Lifelike talking faces for interactive services," Proc. IEEE, vol. 91, no. 9, pp. 1406-1429, Sep. 2003.
- (2003) Proc. IEEE , vol.91 , Issue.9 , pp. 1406-1429
- Cosatto, E.¹ Ostermann, J.² Graf, H.³ Schroeter, J.⁴

10
- 84899028297
- Audio-vision: Using audio-visual synchrony to locate sounds
- J. Hershey and J. Movellan, "Audio-vision: Using audio-visual synchrony to locate sounds," Adv. Neural Inf. Process. Syst., vol. 12, pp. 813-819, 1999.
- (1999) Adv. Neural Inf. Process. Syst , vol.12 , pp. 813-819
- Hershey, J.¹ Movellan, J.²

11
- 2642557514
- FaceSync: A linear operaior for measuring synchronization of video facial images and audio tracks
- M. Slaney and M. Covell, "FaceSync: A linear operaior for measuring synchronization of video facial images and audio tracks," Adv. Neural Inf. Process. Syst., vol. 13, pp. 814-820, 2000.
- (2000) Adv. Neural Inf. Process. Syst , vol.13 , pp. 814-820
- Slaney, M.¹ Covell, M.²

12
- 13444275916
- Audio/visual independent components
- Apr
- P. Smaragdis and M. Casey, "Audio/visual independent components," in Proc. ICA, Apr. 2003, pp. 709-714.
- (2003) Proc. ICA , pp. 709-714
- Smaragdis, P.¹ Casey, M.²

13
- 2642562769
- Speaker association with signal-level audiovisual fusion
- Jun
- J. W. Fisher, III and T. Darrell, "Speaker association with signal-level audiovisual fusion," IEEE Trans. Multimedia, vol. 6, no. 3, pp. 406-413, Jun. 2004.
- (2004) IEEE Trans. Multimedia , vol.6 , Issue.3 , pp. 406-413
- Fisher III, J.W.¹ Darrell, T.²

14
- 34147167538
- Cross-modal localization via sparsity
- Apr
- E. Kidron, Y. Schechner, and M. Elad, "Cross-modal localization via sparsity," IEEE Trans. Signal Process., vol. 55, no. 4, pp. 1390-1404, Apr. 2007.
- (2007) IEEE Trans. Signal Process , vol.55 , Issue.4 , pp. 1390-1404
- Kidron, E.¹ Schechner, Y.² Elad, M.³

15
- 35248827017
- Speaker localisation using audio-visual synchrony: An empirical study
- H. J. Nock, G. Iyengar, and C. Neti, "Speaker localisation using audio-visual synchrony: An empirical study," in Proc. Int. Conf. Image and Video Retrieval, 2003, pp. 488-199.
- (2003) Proc. Int. Conf. Image and Video Retrieval , pp. 488-199
- Nock, H.J.¹ Iyengar, G.² Neti, C.³

16
- 33845523077
- Audiovisual Gestalts
- G. Monaci and P. Vandergheynst, "Audiovisual Gestalts," in Proc. IEEE CVPR Workshop, 2006, p. 200.
- (2006) Proc. IEEE CVPR Workshop , pp. 200
- Monaci, G.¹ Vandergheynst, P.²

17
- 33749427593
- Analysis of mullimodal sequences using geometric video representations
- G. Monaci, O. D. Escoda, and P. Vandergheynst, "Analysis of mullimodal sequences using geometric video representations," Signal Process. vol. 86, no. 12, pp. 3534-3548, 2006.
- (2006) Signal Process , vol.86 , Issue.12 , pp. 3534-3548
- Monaci, G.¹ Escoda, O.D.² Vandergheynst, P.³

18
- 0029935458
- Enhancement of selective listening by illusory mislocation of speech sounds due to lip-reading
- J. Driver, "Enhancement of selective listening by illusory mislocation of speech sounds due to lip-reading," Nature, vol. 381, pp. 66-68, 1996.
- (1996) Nature , vol.381 , pp. 66-68
- Driver, J.¹

19
- 4644276897
- Unifying multisensory signals across time and space
- M. T. Wallace, G. E. Roberson, W. D. Hairston, B. E. Stein, J. W. Vaughan, and J. A. Schirillo, "Unifying multisensory signals across time and space," Exp. Brain Res., vol. 158, pp. 252-258, 2004.
- (2004) Exp. Brain Res , vol.158 , pp. 252-258
- Wallace, M.T.¹ Roberson, G.E.² Hairston, W.D.³ Stein, B.E.⁴ Vaughan, J.W.⁵ Schirillo, J.A.⁶

20
- 33745016748
- Sound alters activity in human V1 in association with illusory visual perception
- S. Watkins, L. Shams, S. Tanaka, J.-D. Haynes, and G. Rees, "Sound alters activity in human V1 in association with illusory visual perception," NeuroImage, vol. 31, no. 3, pp. 1247-1256, 2006.
- (2006) NeuroImage , vol.31 , Issue.3 , pp. 1247-1256
- Watkins, S.¹ Shams, L.² Tanaka, S.³ Haynes, J.-D.⁴ Rees, G.⁵

21
- 22544487240
- Touch-induced visual illusion
- A. Violentyev, S. Shimojo, and L. Shams, "Touch-induced visual illusion," Neuroreport, vol. 10, no. 16, pp. 1107-1110, 2005.
- (2005) Neuroreport , vol.10 , Issue.16 , pp. 1107-1110
- Violentyev, A.¹ Shimojo, S.² Shams, L.³

22
- 33646707471
- Vision and touch are automatically integrated for the perception of sequences of events
- J.-P. Bresciani, F. Dammeier, and M. Emst, "Vision and touch are automatically integrated for the perception of sequences of events," J. Vis., vol. 6, no. 5, pp. 554-564, 2006.
- (2006) J. Vis , vol.6 , Issue.5 , pp. 554-564
- Bresciani, J.-P.¹ Dammeier, F.² Emst, M.³

23
- 0036874756
- Moving-talker, speaker-independent feature study, and baseline results using the CUAVE multimodal speech corpus
- E. K. Patterson, S. Gurbuz, Z. Tufekci, and J. N. Gowdy, "Moving-talker, speaker-independent feature study, and baseline results using the CUAVE multimodal speech corpus," EURASIP J. Appl. Signal Process., vol. 2002, no. 11, pp. 1189-1201, 2002.
- (2002) EURASIP J. Appl. Signal Process , vol.2002 , Issue.11 , pp. 1189-1201
- Patterson, E.K.¹ Gurbuz, S.² Tufekci, Z.³ Gowdy, J.N.⁴

24
- 33749239260
- Analysis of multimodal signals using redundant representations
- G. Monaci, O. D. Escoda, and P. Vandergheynst, "Analysis of multimodal signals using redundant representations," in Proc. IEEE Int. Conf. Image Processing, 2005, vol. 3, pp. 46-49.
- (2005) Proc. IEEE Int. Conf. Image Processing , vol.3 , pp. 46-49
- Monaci, G.¹ Escoda, O.D.² Vandergheynst, P.³

25
- 0036293554
- Sparse decomposition of stereo signals with matching pursuit and application to blind separation of more than two sources from a stereo mixture
- R. Gribonval, "Sparse decomposition of stereo signals with matching pursuit and application to blind separation of more than two sources from a stereo mixture," in Proc. IEEE Int. Conf. Image Processing 2002, vol. 3, pp. 3057-3060.
- (2002) Proc. IEEE Int. Conf. Image Processing , vol.3 , pp. 3057-3060
- Gribonval, R.¹

26
- 33646777471
- Simultaneous sparse approximation via greedy pursuit
- J. Tropp, A. Gilbert, and M. J. Strauss, "Simultaneous sparse approximation via greedy pursuit," in Proc. IEEE ICASSP, 2005, vol. 5, pp. 721-724.
- (2005) Proc. IEEE ICASSP , vol.5 , pp. 721-724
- Tropp, J.¹ Gilbert, A.² Strauss, M.J.³

27
- 0034133184
- Learning overcomplete representations
- M. Lewicki and T. Sejnowski, "Learning overcomplete representations," Neural Comput., vol. 12, no. 2, pp. 337-365, 2000.
- (2000) Neural Comput , vol.12 , Issue.2 , pp. 337-365
- Lewicki, M.¹ Sejnowski, T.²

28
- 0038705107
- If edges are the independent components of natural images, what are the independent components of natural sounds?
- S. Abdallah and M. Plumbley, "If edges are the independent components of natural images, what are the independent components of natural sounds?," in Proc. ICA, 2001, pp. 534-539.
- (2001) Proc. ICA , pp. 534-539
- Abdallah, S.¹ Plumbley, M.²

29
- 33947685468
- MoTIF: An efficient algorithm for learning translation invariant dictionaries
- P. Jost, P. Vandergheynst, S. Lesage, and R. Gribonval, "MoTIF: An efficient algorithm for learning translation invariant dictionaries," in Proc. IEEE ICASSP, 2006, vol. 5, pp. 857-860.
- (2006) Proc. IEEE ICASSP , vol.5 , pp. 857-860
- Jost, P.¹ Vandergheynst, P.² Lesage, S.³ Gribonval, R.⁴

30
- 0030832881
- The "independent components" of natural scenes are edge filters
- A. Bell and T. Sejnowski, "The "independent components" of natural scenes are edge filters," Vis. Res., vol. 37, no. 23, pp. 3327-3338, 1997.
- (1997) Vis. Res , vol.37 , Issue.23 , pp. 3327-3338
- Bell, A.¹ Sejnowski, T.²

31
- 0030779611
- Sparse coding with an overcomplete basis set: A strategy employed by V1?
- B. A. Olshausen and D. J. Field, "Sparse coding with an overcomplete basis set: A strategy employed by V1?," Vis. Res., vol. 37, pp. 3311-3327, 1997.
- (1997) Vis. Res , vol.37 , pp. 3311-3327
- Olshausen, B.A.¹ Field, D.J.²

32
- 0032606945
- A probabilistic framework for the adaptation and comparison of image codes
- M. Lewicki and B. A. Olshausen, "A probabilistic framework for the adaptation and comparison of image codes," J. Opt. Soc. Amer. A, vol. 16, no. 7, pp. 1587-1601, 1999.
- (1999) J. Opt. Soc. Amer. A , vol.16 , Issue.7 , pp. 1587-1601
- Lewicki, M.¹ Olshausen, B.A.²

33
- 0037313218
- Dictionary learning algorithms for sparse representation
- K. Kreutz-Delgado, J. Murray, B. Rao, K. Engan, T. Lee, and T. Sejnowski, "Dictionary learning algorithms for sparse representation," Neural Comput., vol. 15, pp. 349-396, 2003.
- (2003) Neural Comput , vol.15 , pp. 349-396
- Kreutz-Delgado, K.¹ Murray, J.² Rao, B.³ Engan, K.⁴ Lee, T.⁵ Sejnowski, T.⁶

34
- 0345529041
- Learning sparse, overcomplete representations of time-varying natural images
- B. A. Olshausen, "Learning sparse, overcomplete representations of time-varying natural images," in Proc. IEEE Int. Conf. Image Processing, 1, 2003, pp. 41-44.
- (2003) Proc. IEEE Int. Conf. Image Processing , vol.1 , pp. 41-44
- Olshausen, B.A.¹

35
- 0001732587
- Temporal decorrelation: A theory of lagged and nonlagged responses in the lateral geniculate nucleus
- D. Dong and J. Atick, "Temporal decorrelation: A theory of lagged and nonlagged responses in the lateral geniculate nucleus," Network: Comput. Neural Syst., vol. 6, pp. 159-178, 1995.
- (1995) Network: Comput. Neural Syst , vol.6 , pp. 159-178
- Dong, D.¹ Atick, J.²

36
- 34548253912
- Lausanne, Switzerland, Jun. 2005 [Online, Available
- O. Divorra Escoda, "Toward sparse and geometry adapted video approximations" Ph.D. dissertation, Swiss Fed. Inst. Technol., Lausanne, Switzerland, Jun. 2005 [Online]. Available: http://lts2www.epfl.ch/.
- Toward sparse and geometry adapted video approximations Ph.D. dissertation, Swiss Fed. Inst. Technol
- Divorra Escoda, O.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.