메뉴 건너뛰기




Volumn I, Issue , 2005, Pages 88-95

Pixels that sound

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; AUDITION; CORRELATION THEORY; LINEAR PROGRAMMING; MICROPHONES; MODAL ANALYSIS; PROBLEM SOLVING; VISION;

EID: 24644451644     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/CVPR.2005.274     Document Type: Conference Paper
Times cited : (199)

References (29)
  • 1
    • 0011812771 scopus 로고    scopus 로고
    • Kernel independent component analysis
    • F. Bach and M. Jordan. 2002, "Kernel independent component analysis," J. of Mach. Learning Res. 3, pp. 1-48.
    • (2002) J. of Mach. Learning Res. , vol.3 , pp. 1-48
    • Bach, F.1    Jordan, M.2
  • 2
    • 0042349407 scopus 로고    scopus 로고
    • A graphical model for audiovisual object tracking
    • M. J. Beal, N. Jojic, and H. Attias, 2003, "A graphical model for audiovisual object tracking," IEEE Tran. on PAMI, 25, pp. 828-836.
    • (2003) IEEE Tran. on PAMI , vol.25 , pp. 828-836
    • Beal, M.J.1    Jojic, N.2    Attias, H.3
  • 3
    • 85013597845 scopus 로고
    • Eigenlips for robust speech recognition
    • C. Bregler, and Y. Konig, 1994, "Eigenlips for robust speech recognition," In Proc. IEEE ICASSP, vol. 2, pp. 667-672.
    • (1994) Proc. IEEE ICASSP , vol.2 , pp. 667-672
    • Bregler, C.1    Konig, Y.2
  • 4
    • 0034507915 scopus 로고    scopus 로고
    • Look who's talking: Speaker detection using video and audio correlation
    • R. Cutler, and L. Davis, 2000, "Look who's talking: speaker detection using video and audio correlation," Proc. IEEE ICME, vol. 3, pp. 1589-1592.
    • (2000) Proc. IEEE ICME , vol.3 , pp. 1589-1592
    • Cutler, R.1    Davis, L.2
  • 5
    • 24644433110 scopus 로고    scopus 로고
    • On the regularization of canonical correlation analysis
    • T. De Bie, and B. De Moor, 2003, "On the regularization of canonical correlation analysis," Int. Sympos. ICA and BSS, pp. 785-790.
    • (2003) Int. Sympos. ICA and BSS , pp. 785-790
    • De Bie, T.1    De Moor, B.2
  • 8
    • 0037745171 scopus 로고    scopus 로고
    • Can recent innovations in harmonic analysis explain key findings in natural image statistics?
    • D. L. Donoho, and A. G. Flesia, 2001, "Can recent innovations in harmonic analysis explain key findings in natural image statistics?," Network: Comput. Neural. Syst., 12, pp. 371-393.
    • (2001) Network: Comput. Neural. Syst. , vol.12 , pp. 371-393
    • Donoho, D.L.1    Flesia, A.G.2
  • 9
    • 0029935458 scopus 로고    scopus 로고
    • Enhancement of selective listening by illusory mislocation of speech sounds due to lip-reading
    • J. Driver, 1996, "Enhancement of selective listening by illusory mislocation of speech sounds due to lip-reading," Nature 381, pp. 66-68.
    • (1996) Nature , vol.381 , pp. 66-68
    • Driver, J.1
  • 10
    • 24644451695 scopus 로고    scopus 로고
    • A probabilistic study of the average performance of the basis pursuit
    • submitted to the
    • M. Elad, and M. Zibulevsky, 2004, "A probabilistic study of the average performance of the basis pursuit", submitted to the IEEE Trans. on IT.
    • (2004) IEEE Trans. on IT
    • Elad, M.1    Zibulevsky, M.2
  • 11
    • 24644498666 scopus 로고    scopus 로고
    • A unified framework for bases, frames, subspace bases, and subspace frames
    • G. Farnebäck, 1999, "A unified framework for bases, frames, subspace bases, and subspace frames", Proc. Scand. Conf. Image Analysis pp. 341-349.
    • (1999) Proc. Scand. Conf. Image Analysis , pp. 341-349
    • Farnebäck, G.1
  • 12
    • 0030879469 scopus 로고    scopus 로고
    • An anatomical basis for visual calibration of the auditory space map in the barn owl's midbrain
    • D. E. Feldman, and E. I. Knudsen, 1996, "An anatomical basis for visual calibration of the auditory space map in the barn owl's midbrain," The J. Neuroscience 17 pp. 6820-6837.
    • (1996) The J. Neuroscience , vol.17 , pp. 6820-6837
    • Feldman, D.E.1    Knudsen, E.I.2
  • 13
    • 2642562769 scopus 로고    scopus 로고
    • Speaker association with signal-level audiovisual fusion
    • J. W. Fisher III, and T. Darrell, 2004, "Speaker association with signal-level audiovisual fusion," IEEE Trans. Multimedia 6, pp. 406-413.
    • (2004) IEEE Trans. Multimedia , vol.6 , pp. 406-413
    • Fisher III, J.W.1    Darrell, T.2
  • 15
    • 0347968052 scopus 로고    scopus 로고
    • Sparse representations in unions of bases
    • R. Gribonval, and M. Nielsen, 2003, "Sparse representations in unions of bases," IEEE Trans. IT 49, pp. 3320-3325.
    • (2003) IEEE Trans. IT , vol.49 , pp. 3320-3325
    • Gribonval, R.1    Nielsen, M.2
  • 16
    • 0037199954 scopus 로고    scopus 로고
    • Gated visual input to the central auditory system
    • Y. Gutfreund, W. Zheng, and E. I. Knudsen, 2002, "Gated visual input to the central auditory system," Science 297, pp. 1556-1559.
    • (2002) Science , vol.297 , pp. 1556-1559
    • Gutfreund, Y.1    Zheng, W.2    Knudsen, E.I.3
  • 17
    • 84899028297 scopus 로고    scopus 로고
    • Audio-vision: Using audio-visual synchrony to locate sound
    • J. Hershey, and J. Movellan, 1999, "Audio-vision: using audio-visual synchrony to locate sound," Advances in Neural Inf. Process. Syst. 12, pp. 813-819.
    • (1999) Advances in Neural Inf. Process. Syst. , vol.12 , pp. 813-819
    • Hershey, J.1    Movellan, J.2
  • 18
    • 24644517212 scopus 로고    scopus 로고
    • Pixels that sound
    • Dep. of Electrical Engineering, Technion
    • E. Kidron, Y. Y. Schechner, and M. Elad, 2005, "Pixels that sound," Tech. Rep. CCIT TR-524, Dep. of Electrical Engineering, Technion.
    • (2005) Tech. Rep. , vol.CCIT TR-524
    • Kidron, E.1    Schechner, Y.Y.2    Elad, M.3
  • 19
    • 34147133605 scopus 로고
    • Learning canonical correlations
    • Computer Vision Laboratory, S-581 83 Linköping Univ., Sweden
    • H. Knutsson, M. Borga, and T. Landelius, 1995, "Learning canonical correlations," Tech. Rep. LiTH-ISY-R-1761, Computer Vision Laboratory, S-581 83 Linköping Univ., Sweden.
    • (1995) Tech. Rep. , vol.LITH-ISY-R-1761
    • Knutsson, H.1    Borga, M.2    Landelius, T.3
  • 21
    • 0038648412 scopus 로고    scopus 로고
    • Appearance models based on kernel canonical correlation analysis
    • T. Melzer, M. Reiter, and H. Bischof, 2003, "Appearance models based on kernel canonical correlation analysis," Patt. Rec. 36, pp. 1961-1971.
    • (2003) Patt. Rec. , vol.36 , pp. 1961-1971
    • Melzer, T.1    Reiter, M.2    Bischof, H.3
  • 23
    • 0037700834 scopus 로고    scopus 로고
    • Assessing face and speech consistency for monologue detection in video
    • H. J. Nock, G. Iyengar, and C. Neti, 2002, "Assessing face and speech consistency for monologue detection in video," Proc. ACM Int. Conf. Multimedia, pp. 303-306.
    • (2002) Proc. ACM Int. Conf. Multimedia , pp. 303-306
    • Nock, H.J.1    Iyengar, G.2    Neti, C.3
  • 24
    • 24644501841 scopus 로고    scopus 로고
    • A computational model of early auditory-visual integration
    • Proc. Patt. Rec. Sympos.
    • C. Schauer, and H. M. Gross, 2003, "A computational model of early auditory-visual integration," Proc. Patt. Rec. Sympos., Lecture Notes in Computer Science 2781 pp. 362-369.
    • (2003) Lecture Notes in Computer Science , vol.2781 , pp. 362-369
    • Schauer, C.1    Gross, H.M.2
  • 25
    • 2642557514 scopus 로고    scopus 로고
    • FaceSync: A linear operator for measuring synchronization of video facial images and audio tracks
    • M. Slaney, and M. Covell, 2000, "FaceSync: a linear operator for measuring synchronization of video facial images and audio tracks," Advanc. in Neural Inf. Process. Syst. 13, pp. 814-820.
    • (2000) Advanc. in Neural Inf. Process. Syst. , vol.13 , pp. 814-820
    • Slaney, M.1    Covell, M.2
  • 26
    • 5044226917 scopus 로고    scopus 로고
    • Audio-visual based emotion recognition-a new approach
    • M. Song, J. Bu, C. Chen, and N. Li, 2004, "Audio-visual based emotion recognition-a new approach," Proc. IEEE CVPR, vol. 2, pp. 1020-1025.
    • (2004) Proc. IEEE CVPR , vol.2 , pp. 1020-1025
    • Song, M.1    Bu, J.2    Chen, C.3    Li, N.4
  • 28
    • 0034844366 scopus 로고    scopus 로고
    • Sequential Monte Carlo fusion of sound and vision for speaker tracking
    • J. Vermaak, M. Gangnet, A. Blake, and P. Perez, 2001, "Sequential Monte Carlo fusion of sound and vision for speaker tracking," Proc. IEEE ICCV, vol. 1, pp. 741-746.
    • (2001) Proc. IEEE ICCV , vol.1 , pp. 741-746
    • Vermaak, J.1    Gangnet, M.2    Blake, A.3    Perez, P.4
  • 29
    • 4644322072 scopus 로고    scopus 로고
    • Learning over Sets using Kernel Principal Angles
    • E. Wolf, A. Shashua, 2003, "Learning over Sets using Kernel Principal Angles," J. of Mach. Learning Res. 4, pp. 913-931.
    • (2003) J. of Mach. Learning Res. , vol.4 , pp. 913-931
    • Wolf, E.1    Shashua, A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.