SCOPUS 정보 검색 플랫폼

Proceedings - 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005

Volumn I, Issue , 2005, Pages 88-95

Pixels that sound

(3) Kidron, Einat a Schechner, Yoav Y a Elad, Michael a

a TECHNION ISRAEL INSTITUTE OF TECHNOLOGY (Israel)

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; AUDITION; CORRELATION THEORY; LINEAR PROGRAMMING; MICROPHONES; MODAL ANALYSIS; PROBLEM SOLVING; VISION;

AUDIO-VISUAL EVENTS; CANONICAL CORRELATION ANALYSIS (CCA); SPATIO-TEMPORAL RESOLUTIONS; VISUAL DISTRACTIONS;

COMPUTER VISION;

EID: 24644451644 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/CVPR.2005.274 Document Type: Conference Paper

Times cited : (199)

References (29)

1
- 0011812771
- Kernel independent component analysis
- F. Bach and M. Jordan. 2002, "Kernel independent component analysis," J. of Mach. Learning Res. 3, pp. 1-48.
- (2002) J. of Mach. Learning Res. , vol.3 , pp. 1-48
- Bach, F.¹ Jordan, M.²

2
- 0042349407
- A graphical model for audiovisual object tracking
- M. J. Beal, N. Jojic, and H. Attias, 2003, "A graphical model for audiovisual object tracking," IEEE Tran. on PAMI, 25, pp. 828-836.
- (2003) IEEE Tran. on PAMI , vol.25 , pp. 828-836
- Beal, M.J.¹ Jojic, N.² Attias, H.³

3
- 85013597845
- Eigenlips for robust speech recognition
- C. Bregler, and Y. Konig, 1994, "Eigenlips for robust speech recognition," In Proc. IEEE ICASSP, vol. 2, pp. 667-672.
- (1994) Proc. IEEE ICASSP , vol.2 , pp. 667-672
- Bregler, C.¹ Konig, Y.²

4
- 0034507915
- Look who's talking: Speaker detection using video and audio correlation
- R. Cutler, and L. Davis, 2000, "Look who's talking: speaker detection using video and audio correlation," Proc. IEEE ICME, vol. 3, pp. 1589-1592.
- (2000) Proc. IEEE ICME , vol.3 , pp. 1589-1592
- Cutler, R.¹ Davis, L.²

5
- 24644433110
- On the regularization of canonical correlation analysis
- T. De Bie, and B. De Moor, 2003, "On the regularization of canonical correlation analysis," Int. Sympos. ICA and BSS, pp. 785-790.
- (2003) Int. Sympos. ICA and BSS , pp. 785-790
- De Bie, T.¹ De Moor, B.²

6
- 24644433991
- Audio-visual speech enhancement with AVCDCN (audio-visual codebock dependent cepstral normalization)
- S. Deligne, G. Potamianos, and C. Neti, 2002, 'Audio-visual speech enhancement with AVCDCN (audio-visual codebock dependent cepstral normalization)," IEEE Work-shop on Sensor Array and Multichannel Signal Processing., pp. 68-71.
- (2002) IEEE Work-shop on Sensor Array and Multichannel Signal Processing , pp. 68-71
- Deligne, S.¹ Potamianos, G.² Neti, C.³

7
- 0037418225
- 1 minimization
- 1 minimization," Proc. Nat. Aca. Sci. 100, pp. 2197-2202.
- (2003) Proc. Nat. Aca. Sci. , vol.100 , pp. 2197-2202
- Donoho, D.L.¹ Elad, M.²

8
- 0037745171
- Can recent innovations in harmonic analysis explain key findings in natural image statistics?
- D. L. Donoho, and A. G. Flesia, 2001, "Can recent innovations in harmonic analysis explain key findings in natural image statistics?," Network: Comput. Neural. Syst., 12, pp. 371-393.
- (2001) Network: Comput. Neural. Syst. , vol.12 , pp. 371-393
- Donoho, D.L.¹ Flesia, A.G.²

9
- 0029935458
- Enhancement of selective listening by illusory mislocation of speech sounds due to lip-reading
- J. Driver, 1996, "Enhancement of selective listening by illusory mislocation of speech sounds due to lip-reading," Nature 381, pp. 66-68.
- (1996) Nature , vol.381 , pp. 66-68
- Driver, J.¹

10
- 24644451695
- A probabilistic study of the average performance of the basis pursuit
- submitted to the
- M. Elad, and M. Zibulevsky, 2004, "A probabilistic study of the average performance of the basis pursuit", submitted to the IEEE Trans. on IT.
- (2004) IEEE Trans. on IT
- Elad, M.¹ Zibulevsky, M.²

11
- 24644498666
- A unified framework for bases, frames, subspace bases, and subspace frames
- G. Farnebäck, 1999, "A unified framework for bases, frames, subspace bases, and subspace frames", Proc. Scand. Conf. Image Analysis pp. 341-349.
- (1999) Proc. Scand. Conf. Image Analysis , pp. 341-349
- Farnebäck, G.¹

12
- 0030879469
- An anatomical basis for visual calibration of the auditory space map in the barn owl's midbrain
- D. E. Feldman, and E. I. Knudsen, 1996, "An anatomical basis for visual calibration of the auditory space map in the barn owl's midbrain," The J. Neuroscience 17 pp. 6820-6837.
- (1996) The J. Neuroscience , vol.17 , pp. 6820-6837
- Feldman, D.E.¹ Knudsen, E.I.²

13
- 2642562769
- Speaker association with signal-level audiovisual fusion
- J. W. Fisher III, and T. Darrell, 2004, "Speaker association with signal-level audiovisual fusion," IEEE Trans. Multimedia 6, pp. 406-413.
- (2004) IEEE Trans. Multimedia , vol.6 , pp. 406-413
- Fisher III, J.W.¹ Darrell, T.²

14
- 84898954418
- Learning joint statistical models for audio-visual fusion and Segregation
- J. W. Fisher III, T. Darrell, W. Freeman, and P. Viola, 2001, "Learning joint statistical models for audio-visual fusion and Segregation," Advanced in Neural Inf. Process. Syst. 13, pp. 772-778.
- (2001) Advanced in Neural Inf. Process. Syst. , vol.13 , pp. 772-778
- Fisher III, J.W.¹ Darrell, T.² Freeman, W.³ Viola, P.⁴

15
- 0347968052
- Sparse representations in unions of bases
- R. Gribonval, and M. Nielsen, 2003, "Sparse representations in unions of bases," IEEE Trans. IT 49, pp. 3320-3325.
- (2003) IEEE Trans. IT , vol.49 , pp. 3320-3325
- Gribonval, R.¹ Nielsen, M.²

16
- 0037199954
- Gated visual input to the central auditory system
- Y. Gutfreund, W. Zheng, and E. I. Knudsen, 2002, "Gated visual input to the central auditory system," Science 297, pp. 1556-1559.
- (2002) Science , vol.297 , pp. 1556-1559
- Gutfreund, Y.¹ Zheng, W.² Knudsen, E.I.³

17
- 84899028297
- Audio-vision: Using audio-visual synchrony to locate sound
- J. Hershey, and J. Movellan, 1999, "Audio-vision: using audio-visual synchrony to locate sound," Advances in Neural Inf. Process. Syst. 12, pp. 813-819.
- (1999) Advances in Neural Inf. Process. Syst. , vol.12 , pp. 813-819
- Hershey, J.¹ Movellan, J.²

18
- 24644517212
- Pixels that sound
- Dep. of Electrical Engineering, Technion
- E. Kidron, Y. Y. Schechner, and M. Elad, 2005, "Pixels that sound," Tech. Rep. CCIT TR-524, Dep. of Electrical Engineering, Technion.
- (2005) Tech. Rep. , vol.CCIT TR-524
- Kidron, E.¹ Schechner, Y.Y.² Elad, M.³

19
- 34147133605
- Learning canonical correlations
- Computer Vision Laboratory, S-581 83 Linköping Univ., Sweden
- H. Knutsson, M. Borga, and T. Landelius, 1995, "Learning canonical correlations," Tech. Rep. LiTH-ISY-R-1761, Computer Vision Laboratory, S-581 83 Linköping Univ., Sweden.
- (1995) Tech. Rep. , vol.LITH-ISY-R-1761
- Knutsson, H.¹ Borga, M.² Landelius, T.³

20
- 2342451199
- Multimedia content processing through cross-modal association
- D. Li, N. Dimitrova, M. Li, and I. K. Sethi, 2003, "Multimedia content processing through cross-modal association," Proc. ACM Int. Conf. Multimedia, pp. 604-611.
- (2003) Proc. ACM Int. Conf. Multimedia , pp. 604-611
- Li, D.¹ Dimitrova, N.² Li, M.³ Sethi, I.K.⁴

21
- 0038648412
- Appearance models based on kernel canonical correlation analysis
- T. Melzer, M. Reiter, and H. Bischof, 2003, "Appearance models based on kernel canonical correlation analysis," Patt. Rec. 36, pp. 1961-1971.
- (2003) Patt. Rec. , vol.36 , pp. 1961-1971
- Melzer, T.¹ Reiter, M.² Bischof, H.³

22
- 7444264756
- Conducting audio files via computer vision
- D. Murphy, T. H. Andersen, and K. Jensen, 2004, "Conducting audio files via computer vision," Lecture Notes in Computer Science, 2915, pp. 529-540
- (2004) Lecture Notes in Computer Science , vol.2915 , pp. 529-540
- Murphy, D.¹ Andersen, T.H.² Jensen, K.³

23
- 0037700834
- Assessing face and speech consistency for monologue detection in video
- H. J. Nock, G. Iyengar, and C. Neti, 2002, "Assessing face and speech consistency for monologue detection in video," Proc. ACM Int. Conf. Multimedia, pp. 303-306.
- (2002) Proc. ACM Int. Conf. Multimedia , pp. 303-306
- Nock, H.J.¹ Iyengar, G.² Neti, C.³

24
- 24644501841
- A computational model of early auditory-visual integration
- Proc. Patt. Rec. Sympos.
- C. Schauer, and H. M. Gross, 2003, "A computational model of early auditory-visual integration," Proc. Patt. Rec. Sympos., Lecture Notes in Computer Science 2781 pp. 362-369.
- (2003) Lecture Notes in Computer Science , vol.2781 , pp. 362-369
- Schauer, C.¹ Gross, H.M.²

25
- 2642557514
- FaceSync: A linear operator for measuring synchronization of video facial images and audio tracks
- M. Slaney, and M. Covell, 2000, "FaceSync: a linear operator for measuring synchronization of video facial images and audio tracks," Advanc. in Neural Inf. Process. Syst. 13, pp. 814-820.
- (2000) Advanc. in Neural Inf. Process. Syst. , vol.13 , pp. 814-820
- Slaney, M.¹ Covell, M.²

26
- 5044226917
- Audio-visual based emotion recognition-a new approach
- M. Song, J. Bu, C. Chen, and N. Li, 2004, "Audio-visual based emotion recognition-a new approach," Proc. IEEE CVPR, vol. 2, pp. 1020-1025.
- (2004) Proc. IEEE CVPR , vol.2 , pp. 1020-1025
- Song, M.¹ Bu, J.² Chen, C.³ Li, N.⁴

27
- 13444275916
- Audio/Visual independent components
- P. Smaragdis, and M. Casey, 2003, "Audio/Visual independent components," Int. Sympos. ICA and BSS, pp. 709-714.
- (2003) Int. Sympos. ICA and BSS , pp. 709-714
- Smaragdis, P.¹ Casey, M.²

28
- 0034844366
- Sequential Monte Carlo fusion of sound and vision for speaker tracking
- J. Vermaak, M. Gangnet, A. Blake, and P. Perez, 2001, "Sequential Monte Carlo fusion of sound and vision for speaker tracking," Proc. IEEE ICCV, vol. 1, pp. 741-746.
- (2001) Proc. IEEE ICCV , vol.1 , pp. 741-746
- Vermaak, J.¹ Gangnet, M.² Blake, A.³ Perez, P.⁴

29
- 4644322072
- Learning over Sets using Kernel Principal Angles
- E. Wolf, A. Shashua, 2003, "Learning over Sets using Kernel Principal Angles," J. of Mach. Learning Res. 4, pp. 913-931.
- (2003) J. of Mach. Learning Res. , vol.4 , pp. 913-931
- Wolf, E.¹ Shashua, A.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.