메뉴 건너뛰기




Volumn 22, Issue 8, 2011, Pages 704-711

Efficient video coding based on audio-visual focus of attention

Author keywords

Audio visual focus of attention; Audio visual source localization; Canonical correlation analysis; Flexible macroblock ordering (FMO); H.264 AVC; Quality of experience; Subjective quality assessment; Video coding

Indexed keywords

AUDIO-VISUAL; CANONICAL CORRELATION ANALYSIS; FLEXIBLE MACROBLOCK ORDERING; FOCUS OF ATTENTION; H.264/AVC; QUALITY OF EXPERIENCE; SUBJECTIVE QUALITY ASSESSMENTS;

EID: 80053958796     PISSN: 10473203     EISSN: 10959076     Source Type: Journal    
DOI: 10.1016/j.jvcir.2010.11.002     Document Type: Article
Times cited : (35)

References (31)
  • 1
    • 4544260101 scopus 로고    scopus 로고
    • Automatic foveation for video compression using a neurobiological model of visual attention
    • L. Itti Automatic foveation for video compression using a neurobiological model of visual attention IEEE Trans. Image Process. 13 2004 1304 1318
    • (2004) IEEE Trans. Image Process. , vol.13 , pp. 1304-1318
    • Itti, L.1
  • 3
    • 0038343438 scopus 로고    scopus 로고
    • Foveation scalable video coding with automatic fixation selection
    • Z. Wang, L. Lu, and A.C. Bovic Foveation scalable video coding with automatic fixation selection IEEE Trans. Image Process. 12 2003 243 254
    • (2003) IEEE Trans. Image Process. , vol.12 , pp. 243-254
    • Wang, Z.1    Lu, L.2    Bovic, A.C.3
  • 4
  • 5
    • 33846650999 scopus 로고    scopus 로고
    • Spatiotemporal visual considerations for video coding
    • DOI 10.1109/TMM.2006.886328
    • C.-W. Tang Spatiotemporal visual considerations for video coding IEEE Trans. Multimedia 9 2007 231 238 (Pubitemid 46188196)
    • (2007) IEEE Transactions on Multimedia , vol.9 , Issue.2 , pp. 231-238
    • Tang, C.-W.1
  • 7
    • 0031857262 scopus 로고    scopus 로고
    • Attention and the crossmodal construction of space
    • DOI 10.1016/S1364-6613(98)01188-7, PII S1364661398011887
    • J. Driver, and C. Spence Attention and the crossmodal construction of space Trends Cogn. Sci. 2 1998 254 262 (Pubitemid 28394243)
    • (1998) Trends in Cognitive Sciences , vol.2 , Issue.7 , pp. 254-262
    • Driver, J.1    Spence, C.2
  • 9
    • 0030636048 scopus 로고    scopus 로고
    • Audiovisual links in exogenous covert spatial orienting
    • C. Spence, and J. Driver Audiovisual links in exogenous covert spatial orienting Percept. Psychophys. 59 1997 1 22 (Pubitemid 127456363)
    • (1997) Perception and Psychophysics , vol.59 , Issue.1 , pp. 1-22
    • Spence, C.1    Driver, J.2
  • 11
    • 0141541726 scopus 로고    scopus 로고
    • The inability to ignore auditory distractors as a function of visual task perceptual load
    • D.J. Tellinghuisen, and E.J. Nowak The inability to ignore auditory distractors as a function of visual task perceptual load Percept. Psychophys. 65 2003 817 828 (Pubitemid 37544856)
    • (2003) Perception and Psychophysics , vol.65 , Issue.5 , pp. 817-828
    • Tellinghuisen, D.J.1    Nowak, E.J.2
  • 12
    • 33845403620 scopus 로고    scopus 로고
    • How automatic are audiovisual links in exogenous spatial attention?
    • DOI 10.1016/j.neuropsychologia.2006.02.010, PII S0028393206000583, Advances in Multisensory Processes
    • V. Mazza, M. Turatto, M. Rossi, and C. Umilta How automatic are audiovisual links in exogenous spatial attention? Neuropsychologia 45 2007 514 522 (Pubitemid 44894459)
    • (2007) Neuropsychologia , vol.45 , Issue.3 , pp. 514-522
    • Mazza, V.1    Turatto, M.2    Rossi, M.3    Umilta, C.4
  • 13
    • 9244221425 scopus 로고    scopus 로고
    • Perceptual effects of cross-modal stimulation: Ventriloquism and the freezing phenomenon
    • MIT Press
    • J. Vroomen, and B. de Gelder Perceptual effects of cross-modal stimulation: ventriloquism and the freezing phenomenon Handbook of Multisensory Processes 2004 MIT Press 141 150
    • (2004) Handbook of Multisensory Processes , pp. 141-150
    • Vroomen, J.1    De Gelder, B.2
  • 14
    • 0034303199 scopus 로고    scopus 로고
    • Sound enhances visual perception: Cross-modal effects of auditory organisation on vision
    • J. Vroomen, and B. de Gelder Sound enhances visual perception: cross-modal effects of auditory organisation on vision J. Exp. Psychol.: Human Percept. Perform. 26 2000 1583 1590
    • (2000) J. Exp. Psychol.: Human Percept. Perform. , vol.26 , pp. 1583-1590
    • Vroomen, J.1    De Gelder, B.2
  • 15
    • 34247172408 scopus 로고    scopus 로고
    • Do you see what I am saying? Exploring visual enhancement of speech comprehension in noisy environments
    • DOI 10.1093/cercor/bhl024
    • L.A. Ross, D. Saint-Amour, V.M. Leavitt, D.C. Javitt, and J.J. Foxe Do you see what I am saying? Exploring visual enhancement of speech comprehension in noisy environments Cereb. Cortex 17 2007 1147 1153 (Pubitemid 46598855)
    • (2007) Cerebral Cortex , vol.17 , Issue.5 , pp. 1147-1153
    • Ross, L.A.1    Saint-Amour, D.2    Leavitt, V.M.3    Javitt, D.C.4    Foxe, J.J.5
  • 16
    • 0017199877 scopus 로고
    • Hearing lips and seeing voices
    • H. McGurk, and J. MacDonald Hearing lips and seeing voices Nature 264 1976 746 748
    • (1976) Nature , vol.264 , pp. 746-748
    • McGurk, H.1    MacDonald, J.2
  • 17
    • 0032075723 scopus 로고    scopus 로고
    • Toward Multimodal Human-Computer Interface
    • PII S0018921998032812
    • R. Sharma, V.I. Pavlović, and T.S. Huang Toward multimodal human-computer interface Proc. IEEE 86 1998 853 869 (Pubitemid 128720228)
    • (1998) Proceedings of the IEEE , vol.86 , Issue.5 , pp. 853-869
    • Sharma, R.1    Pavlovic, V.I.2    Huang, T.S.3
  • 18
    • 21744450745 scopus 로고    scopus 로고
    • Multisensory spatial interactions: A window onto functional integration in the human brain
    • DOI 10.1016/j.tins.2005.03.008
    • E. Macaluso, and J. Driver Multisensory spatial interactions: a window onto functional integration in the human brain Trends Neurosci. 28 2005 264 271 (Pubitemid 41556911)
    • (2005) Trends in Neurosciences , vol.28 , Issue.5 , pp. 264-271
    • Macaluso, E.1    Driver, J.2
  • 19
    • 74349099570 scopus 로고    scopus 로고
    • Efficient video coding in H.264/AVC by using audio-visual information
    • Rio de Janeiro, Brazil
    • J.-S. Lee, T. Ebrahimi, Efficient video coding in H.264/AVC by using audio-visual information, in: Proc. Int. Conf. Multimedia Signal Processing, Rio de Janeiro, Brazil, 2009, pp. 1-6.
    • (2009) Proc. Int. Conf. Multimedia Signal Processing , pp. 1-6
    • Lee, J.-S.1    Ebrahimi, T.2
  • 20
    • 54049156157 scopus 로고    scopus 로고
    • Target detection and tracking with heterogeneous sensors
    • H. Zhou, M. Taj, and A. Cavallaro Target detection and tracking with heterogeneous sensors IEEE J. Sel. Top. Sign. Proc. 2 2008 503 513
    • (2008) IEEE J. Sel. Top. Sign. Proc. , vol.2 , pp. 503-513
    • Zhou, H.1    Taj, M.2    Cavallaro, A.3
  • 22
    • 13344250690 scopus 로고    scopus 로고
    • Data fusion for visual tracking with particles
    • DOI 10.1109/JPROC.2003.823147, Sequential State Estimation: From Kalman Filters to Particles Filters
    • P. Perez, J. Vermaak, and A. Blake Data fusion for visual tracking with particles Proc. IEEE 92 2004 495 513 (Pubitemid 40890756)
    • (2004) Proceedings of the IEEE , vol.92 , Issue.3 , pp. 495-513
    • Perez, P.1    Vermaak, J.2    Blake, A.3
  • 23
    • 0033683662 scopus 로고    scopus 로고
    • Multimodal speaker detection using error feedback dynamic Bayesian networks
    • Hilton Head Island, SC, USA
    • V. Pavlović, A. Garg, J.M. Rehg, T.S. Huang, Multimodal speaker detection using error feedback dynamic Bayesian networks, in: Proc. Int. Conf. Computer Vision and Pattern Recognition, Hilton Head Island, SC, USA, 2000, pp. 34-41.
    • (2000) Proc. Int. Conf. Computer Vision and Pattern Recognition , pp. 34-41
    • V. Pavlović1
  • 24
    • 0034507915 scopus 로고    scopus 로고
    • Look who's talking: Speaker detection using video and audio correlation
    • R. Cutler, L. Davis, Look who's talking: speaker detection using video and audio correlation, in: Proc. Int. Conf. Multimedia and Expo, New York, NY, USA, 2000, pp. 1589-1592. (Pubitemid 33059248)
    • (2000) IEEE International Conference on Multi-Media and Expo , pp. 1589-1592
    • Cutler, R.1    Davis, L.2
  • 25
    • 85008046156 scopus 로고    scopus 로고
    • Extraction of audio features specific to speech production for multimodal speaker detection
    • P. Besson, V. Popovici, J.-M. Vesin, J.-P. Thiran, and M. Kunt Extraction of audio features specific to speech production for multimodal speaker detection IEEE Trans. Multimedia 10 2008 63 73
    • (2008) IEEE Trans. Multimedia , vol.10 , pp. 63-73
    • Besson, P.1    Popovici, V.2    Vesin, J.-M.3    Thiran, J.-P.4    Kunt, M.5
  • 26
    • 10044285992 scopus 로고    scopus 로고
    • Canonical correlation analysis: An overview with application to learning methods
    • DOI 10.1162/0899766042321814
    • D.R. Hardoon, S. Szedmak, and J. Shawe-Taylor Canonical correlation analysis: an overview with application to learning methods Neural Comput. 16 2004 2639 2664 (Pubitemid 39604012)
    • (2004) Neural Computation , vol.16 , Issue.12 , pp. 2639-2664
    • Hardoon, D.R.1    Szedmak, S.2    Shawe-Taylor, J.3
  • 31
    • 70350759682 scopus 로고    scopus 로고
    • Influence of audio-visual attention on perceived quality of standard definition multimedia content
    • San Diego, CA, USA
    • J.-S. Lee, F. De Simone, T. Ebrahimi, Influence of audio-visual attention on perceived quality of standard definition multimedia content, in: Proc. Int. Workshop on Quality of Multimedia Experience, San Diego, CA, USA, 2009, pp. 13-19.
    • (2009) Proc. Int. Workshop on Quality of Multimedia Experience , pp. 13-19
    • Lee, J.-S.1    De Simone, F.2    Ebrahimi, T.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.