SCOPUS 정보 검색 플랫폼

Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

Volumn , Issue , 2007, Pages

Harmony in motion

(2) Barzelay, Zohar a Schechner, Yoav Y a

a TECHNION ISRAEL INSTITUTE OF TECHNOLOGY (Israel)

Author keywords

[No Author keywords available]

Indexed keywords

MICROPHONES; MOTION ESTIMATION; PROBABILISTIC LOGICS; PROBLEM SOLVING;

AUDIO-ASSOCIATED VISUAL OBJECTS; AUDIO-VISUAL ANALYSIS; VISUAL LOCALIZATION;

MODAL ANALYSIS;

EID: 34948829598 PISSN: 10636919 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/CVPR.2007.383344 Document Type: Conference Paper

Times cited : (98)

References (32)

1
- 33749051687
- Blind one-microphone speech separation: A spectral learning approach
- R R. Bach and M. I. Jordan. Blind one-microphone speech separation: A spectral learning approach. Proc. NIPS (2004).
- (2004) Proc. NIPS
- Bach, R.R.¹ Jordan, M.I.²

2
- 34948856380
- Harmony in Motion
- Tech. Rep. CCIT #620, Dep. of Electrical Engineering, Technion
- Z. Barzelay and Y. Y. Schechner. Harmony in Motion. Tech. Rep. CCIT #620, Dep. of Electrical Engineering, Technion (2007).
- (2007)
- Barzelay, Z.¹ Schechner, Y.Y.²

3
- 27644583688
- A tutorial on onset detection in music signals
- J. Bello, L. Daudet, S. Abdallah, C. Duxbury, M. Davies, and M. Sandler. A tutorial on onset detection in music signals. In IEEE Trans. Speech and Audio Process., 5:1035-1047 (2005).
- (2005) IEEE Trans. Speech and Audio Process , vol.5 , pp. 1035-1047
- Bello, J.¹ Daudet, L.² Abdallah, S.³ Duxbury, C.⁴ Davies, M.⁵ Sandler, M.⁶

4
- 0004331630
- tracker. Available at
- S. Birchfield. An implementation of the Kanade-Lucas-Tomasi feature tracker. Available at http://www.ces.clemson.edu/~stb/klt/.
- An implementation of the Kanade-Lucas-Tomasi feature
- Birchfield, S.¹

5
- 0003684441
- Cambridge, USA: MIT Press
- A. Bregman. Auditory Scene Analysis. Cambridge, USA: MIT Press (1990).
- (1990) Auditory Scene Analysis
- Bregman, A.¹

6
- 0026967479
- Survey of image registration techniques
- L. S. Brown. Survey of image registration techniques. ACM Comput. Surv., 24:325-376 (1992).
- (1992) ACM Comput. Surv , vol.24 , pp. 325-376
- Brown, L.S.¹

7
- 75649112752
- Relating audio-visual events caused by multiple movements: In the case of entire object movement
- J. Chen, T. Mukai, Y. Takeuchi, T. Matsumoto, H. Kudo, T. Yamamura, and N. Ohnishi. Relating audio-visual events caused by multiple movements: in the case of entire object movement. Proc. Inf. Fusion, pp. 213-219 (2002).
- (2002) Proc. Inf. Fusion , pp. 213-219
- Chen, J.¹ Mukai, T.² Takeuchi, Y.³ Matsumoto, T.⁴ Kudo, H.⁵ Yamamura, T.⁶ Ohnishi, N.⁷

8
- 4544386970
- Boosting and structure learning in dynamic bayesian networks for audio-visual speaker detection
- T. Choudhury, J. Rehg, V. Pavlovic, and A. Pentland. Boosting and structure learning in dynamic bayesian networks for audio-visual speaker detection. In Proc. ICPR., vol. 3, pp. 789-794 (2002).
- (2002) Proc. ICPR , vol.3 , pp. 789-794
- Choudhury, T.¹ Rehg, J.² Pavlovic, V.³ Pentland, A.⁴

9
- 21544435145
- Efficient pitch detection techniques for interactive music using harmonic model
- P. Cuadra, A. Master, and C. Sapp Efficient pitch detection techniques for interactive music using harmonic model. Proc. ICMI, (2001).
- (2001) Proc. ICMI
- Cuadra, P.¹ Master, A.² Sapp, C.³

10
- 1842830672
- Audio-visual segmentation and the cocktail party effect
- T. Darrell, J. W. Fisher, , P. A. Viola, and W. T. Freeman. Audio-visual segmentation and the cocktail party effect. In Proc. ICMI2000, pp. 1611-3349 (2000).
- (2000) Proc. ICMI2000 , pp. 1611-3349
- Darrell, T.¹ Fisher, J.W.² Viola, P.A.³ Freeman, W.T.⁴

11
- 27144543541
- Temporal frequency characteristics of synchrony-asynchrony discrimination of audio-visual signals
- W. Fujisaki and S. Nishida. Temporal frequency characteristics of synchrony-asynchrony discrimination of audio-visual signals. J. Exp. Brain Res., 166:455-464 (2005).
- (2005) J. Exp. Brain Res , vol.166 , pp. 455-464
- Fujisaki, W.¹ Nishida, S.²

12
- 0037199954
- Gated visual input to the central auditory system
- Y. Gutfreund, W. Zheng, and E. I. Knudsen. Gated visual input to the central auditory system. Science 297:1556-1559 (2002).
- (2002) Science , vol.297 , pp. 1556-1559
- Gutfreund, Y.¹ Zheng, W.² Knudsen, E.I.³

13
- 10044285992
- Canonical correlation analysis: An overview with application to learning methods
- D. Hardoon, S. Szedmak, and J. Shawe-Taylor. Canonical correlation analysis: An overview with application to learning methods. Neural Computation, 16:2639-2664 (2004).
- (2004) Neural Computation , vol.16 , pp. 2639-2664
- Hardoon, D.¹ Szedmak, S.² Shawe-Taylor, J.³

14
- 34948876301
- Audio-visual sound separation via hidden markov models
- J. Hershey and M. Casey. Audio-visual sound separation via hidden markov models. Proc. NIPS, pp. 1173-1180 (2001).
- (2001) Proc. NIPS , pp. 1173-1180
- Hershey, J.¹ Casey, M.²

15
- 84899028297
- Audio vision: Using audio-visual synchrony to locate sounds
- J. Hershey and J. R. Movellan. Audio vision: Using audio-visual synchrony to locate sounds. Proc. NIPS, pp. 813-819 (1999).
- (1999) Proc. NIPS , pp. 813-819
- Hershey, J.¹ Movellan, J.R.²

16
- 0032315531
- Robust multi-sensor image alignment
- M. Irani and P. Anandan. Robust multi-sensor image alignment. Proc. IEEE ICCV,pp. 959-966 (1998).
- (1998) Proc. IEEE ICCV , pp. 959-966
- Irani, M.¹ Anandan, P.²

17
- 33745127045
- Computer vision for music identification
- Y. Ke, D. Hoiem, and R. Sukthankar. Computer vision for music identification. Proc. IEEE CVPR, vol. 1, pp. 597-604 (2005).
- (2005) Proc. IEEE CVPR , vol.1 , pp. 597-604
- Ke, Y.¹ Hoiem, D.² Sukthankar, R.³

18
- 24644451644
- Pixels that sound
- E. Kidron, Y. Y. Schechner, and M. Elad. Pixels that sound. Proc. IEEE CVPR, vol. 1, pp. 88-95 (2005).
- (2005) Proc. IEEE CVPR , vol.1 , pp. 88-95
- Kidron, E.¹ Schechner, Y.Y.² Elad, M.³

19
- 34147167538
- Cross-modal localization via sparsity
- E. Kidron, Y. Y. Schechner, and M. Elad. Cross-modal localization via sparsity. IEEE Trans. Signal Processing, 55:1390-1404 (2007)
- (2007) IEEE Trans. Signal Processing , vol.55 , pp. 1390-1404
- Kidron, E.¹ Schechner, Y.Y.² Elad, M.³

20
- 0032649117
- Sound onset detection by applying psychoacoustic knowledge
- A. Klapuri. Sound onset detection by applying psychoacoustic knowledge. Proc. IEEE ICASSP, vol. 6, pp. 3089-3092 (1999).
- (1999) Proc. IEEE ICASSP , vol.6 , pp. 3089-3092
- Klapuri, A.¹

21
- 33749059402
- A perceptually motivated multiple-f0 estimation method
- A. Klapuri. A perceptually motivated multiple-f0 estimation method. Proc. IEEE Worksh. App. Sig. Proc. to Audio & Acoustics, pp. 291-294, (2005).
- (2005) Proc. IEEE Worksh. App. Sig. Proc. to Audio & Acoustics , pp. 291-294
- Klapuri, A.¹

22
- 0027842081
- Matching pursuits with time-frequency dictionaries
- S. Mallat and Z. Zhang. Matching pursuits with time-frequency dictionaries. Proc. IEEE Trans. Sig. Process., 41:3397-3415 (1993).
- (1993) Proc. IEEE Trans. Sig. Process , vol.41 , pp. 3397-3415
- Mallat, S.¹ Zhang, Z.²

23
- 34948857415
- in Comp. Vis
- G. Monaci and P. Vandergheynst. Audiovisual gestalts. Proc. IEEE Worksh. Percept. Org. in Comp. Vis. (2006).
- (2006) Audiovisual gestalts
- Monaci, G.¹ Vandergheynst, P.²

24
- 0036058193
- Real-time speaker localization and speech separation by audio-visual integration
- K. Nakadai, K. Hidai, H. Okuno, and H. Kitano. Real-time speaker localization and speech separation by audio-visual integration. IEEE Conf. Robotics & Auto., vol. 1, pp. 1043-1049 (2002).
- (2002) IEEE Conf. Robotics & Auto , vol.1 , pp. 1043-1049
- Nakadai, K.¹ Hidai, K.² Okuno, H.³ Kitano, H.⁴

25
- 13344250690
- Data fusion for visual tracking with particles
- P. Perez, J. Vermaak, and A. Blake. Data fusion for visual tracking with particles. Proc. IEEE, 92:495-513 (2004).
- (2004) Proc. IEEE , vol.92 , pp. 495-513
- Perez, P.¹ Vermaak, J.² Blake, A.³

26
- 4544247264
- Bayesian separation of audio-visual speech sources
- S. Rajaram, A. Nefian, and T. Huang. Bayesian separation of audio-visual speech sources. Proc. IEEE ICAASP, vol. 5, pp. 657-660 (2004).
- (2004) Proc. IEEE ICAASP , vol.5 , pp. 657-660
- Rajaram, S.¹ Nefian, A.² Huang, T.³

27
- 84898946024
- One microphone source separation
- S. T. Roweis. One microphone source separation. Proc. NIPS, pp. 793-799 (2001).
- (2001) Proc. NIPS , pp. 793-799
- Roweis, S.T.¹

28
- 33745936208
- Separating transparent layers of repetitive dynamic behaviors
- B. Sarel and M. Irani. Separating transparent layers of repetitive dynamic behaviors. Proc. IEEE ICCV, vol. 1, pp. 26-32 (2005).
- (2005) Proc. IEEE ICCV , vol.1 , pp. 26-32
- Sarel, B.¹ Irani, M.²

29
- 0028112849
- Good features to track
- J. Shi and C. Tomasi. Good features to track. Proc. IEEE CVPR, pp. 593-600 (1994).
- (1994) Proc. IEEE CVPR , pp. 593-600
- Shi, J.¹ Tomasi, C.²

30
- 13444275916
- Audio/visual independent components
- P. Smaragdis and M. Casey. Audio/visual independent components. Proc. ICA, pp. 709-714 (2003).
- (2003) Proc. ICA , pp. 709-714
- Smaragdis, P.¹ Casey, M.²

31
- 0036296012
- A multi-pitch tracking algorithm for noisy speech
- M. Wu, D. Wang, and G. Brown. A multi-pitch tracking algorithm for noisy speech. Proc. IEEE ICAASP, vol. 2, pp. 229-241 (2002).
- (2002) Proc. IEEE ICAASP , vol.2 , pp. 229-241
- Wu, M.¹ Wang, D.² Brown, G.³

32
- 3142694930
- Blind separation of speech mixtures via time-frequency masking
- O. Yilmaz and S. Rickard. Blind separation of speech mixtures via time-frequency masking. IEEE Trans. Sig. Process., 52:1830-1847 (2004).
- (2004) IEEE Trans. Sig. Process , vol.52 , pp. 1830-1847
- Yilmaz, O.¹ Rickard, S.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.