Volume 5, Issue 7, 2011, Pages 1322-1331

Subjective quality evaluation of foveated video coding using audio-visual focus of attention

Author keywords

Audio visual focus of attention; content dependence; foveated coding; H.264 AVC; memory effect; quality of experience; subjective quality assessment

Indexed keywords

CONTENT DEPENDENCE; FOCUS OF ATTENTION; FOVEATED CODING; H.264/AVC; MEMORY EFFECTS; QUALITY OF EXPERIENCE; SUBJECTIVE QUALITY ASSESSMENTS;

EID: 80054802141     PISSN: 19324553     EISSN: None     Source Type: Journal    
DOI: 10.1109/JSTSP.2011.2165199     Document Type: Article
Times cited : (25)

References (43)
  • 1
    • 0038343438
    • Foveation scalable video coding with automatic fixation selection
    • Feb
    • Z. Wang, L. Lu, and A. C. Bovik, "Foveation scalable video coding with automatic fixation selection," IEEE Trans. Image Process., vol. 12, no. 2, pp. 243-254, Feb. 2003.
    • (2003) IEEE Trans. Image Process. , vol.12 , Issue.2 , pp. 243-254
    • Wang, Z.1    Lu, L.2    Bovik, A.C.3
  • 2
    • 4544260101
    • Automatic foveation for video compression using a neurobiological model of visual attention
    • Oct
    • L. Itti, "Automatic foveation for video compression using a neurobiological model of visual attention," IEEE Trans. Image Process., vol. 13, no. 10, pp. 1304-1318, Oct. 2004.
    • (2004) IEEE Trans. Image Process. , vol.13 , Issue.10 , pp. 1304-1318
    • Itti, L.1
  • 3
    • 27844469032
    • Semantic video analysis for adaptive content delivery and automatic description
    • DOI 10.1109/TCSVT.2005.854240
    • A. Cavallaro, O. Steiger, and T. Ebrahimi, "Semantic video analysis for adaptive content delivery and automatic description," IEEE Trans. Circuits Syst. Video Technol., vol. 15, no. 10, pp. 1200-1209, Oct. 2005. (Pubitemid 41640558)
    • (2005) IEEE Transactions on Circuits and Systems for Video Technology , vol.15 , Issue.10 , pp. 1200-1209
    • Cavallaro, A.1    Steiger, O.2    Ebrahimi, T.3
  • 4
    • 33846650999
    • Spatiotemporal visual considerations for video coding
    • DOI 10.1109/TMM.2006.886328
    • C.-W. Tang, "Spatiotemporal visual considerations for video coding," IEEE Trans. Multimedia, vol. 9, no. 2, pp. 231-238, Feb. 2007. (Pubitemid 46188196)
    • (2007) IEEE Transactions on Multimedia , vol.9 , Issue.2 , pp. 231-238
    • Tang, C.-W.1
  • 6
    • 77953091169
    • Perceptually-friendly H.264/AVC video coding based on foveated just-noticeable distortion model
    • Jun
    • Z. Chen and C. Guillemot, "Perceptually-friendly H.264/AVC video coding based on foveated just-noticeable distortion model," IEEE Trans. Circuits Syst. Video Technol., vol. 20, no. 6, pp. 806-819, Jun. 2010.
    • (2010) IEEE Trans. Circuits Syst. Video Technol. , vol.20 , Issue.6 , pp. 806-819
    • Chen, Z.1    Guillemot, C.2
  • 7
    • 51749113583
    • Application of scalable visual sensitivity profile in image and video coding
    • Seattle, WA, May
    • Q. Chen, G. Zhai, X. Yang, and W. Zhang, "Application of scalable visual sensitivity profile in image and video coding," in Proc. IEEE Int. Symp. Circuits Syst., Seattle, WA, May 2008, pp. 268-271.
    • (2008) Proc. IEEE Int. Symp. Circuits Syst. , pp. 268-271
    • Chen, Q.1    Zhai, G.2    Yang, X.3    Zhang, W.4
  • 8
    • 68249112310
    • Robust region-of-interest determination based on user attention model through visual rhythm analysis
    • Jul
    • M.-C. Chi, C.-H. Yeh, and M.-J. Chen, "Robust region-of-interest determination based on user attention model through visual rhythm analysis," IEEE Trans. Circuits Syst. Video Technol., vol. 19, no. 7, pp. 1025-1038, Jul. 2010.
    • (2010) IEEE Trans. Circuits Syst. Video Technol. , vol.19 , Issue.7 , pp. 1025-1038
    • Chi, M.-C.1    Yeh, C.-H.2    Chen, M.-J.3
  • 9
    • 74349099570
    • Efficient video coding in H.264/AVC by using audio-visual information
    • Rio de Janeiro, Brazil, Oct
    • J.-S. Lee and T. Ebrahimi, "Efficient video coding in H.264/AVC by using audio-visual information," in Proc. Int. Conf. Multimedia Signal Process., Rio de Janeiro, Brazil, Oct. 2009, pp. 1-6.
    • (2009) Proc. Int. Conf. Multimedia Signal Process. , pp. 1-6
    • Lee, J.-S.1    Ebrahimi, T.2
  • 10
    • 0031857262
    • Attention and the crossmodal construction of space
    • DOI 10.1016/S1364-6613(98)01188-7, PII S1364661398011887
    • J. Driver and C. Spence, "Attention and the crossmodal construction of space," Trends in Cognitive Sci., vol. 2, no. 7, pp. 254-262, Jul. 1998. (Pubitemid 28394243)
    • (1998) Trends in Cognitive Sciences , vol.2 , Issue.7 , pp. 254-262
    • Driver, J.1    Spence, C.2
  • 11
    • 77950249727
    • Crossmodal spatial attention
    • Mar
    • C. Spence, "Crossmodal spatial attention," Ann. New York Acad. Sci., vol. 1191, pp. 182-200, Mar. 2010.
    • (2010) Ann. New York Acad. Sci. , vol.1191 , pp. 182-200
    • Spence, C.1
  • 13
    • 0030636048
    • Audiovisual links in exogenous covert spatial orienting
    • C. Spence and J. Driver, "Audiovisual links in exogenous covert spatial orienting," Percept. Psychophys., vol. 59, no. 1, pp. 1-22, Jan. 1997. (Pubitemid 127456363)
    • (1997) Perception and Psychophysics , vol.59 , Issue.1 , pp. 1-22
    • Spence, C.1    Driver, J.2
  • 15
    • 0141541726
    • The inability to ignore auditory distractors as a function of visual task perceptual load
    • D. J. Tellinghuisen and E. J. Nowak, "The inability to ignore auditory distractors as a function of visual task perceptual load," Percept. Psychophys., vol. 65, no. 5, pp. 817-828, 2003. (Pubitemid 37544856)
    • (2003) Perception and Psychophysics , vol.65 , Issue.5 , pp. 817-828
    • Tellinghuisen, D.J.1    Nowak, E.J.2
  • 16
    • 33845403620
    • How automatic are audiovisual links in exogenous spatial attention?
    • DOI 10.1016/j.neuropsychologia.2006.02.010, PII S0028393206000583, Advances in Multisensory Processes
    • V. Mazza, M. Turatto, M. Rossi, and C. Umilta, "How automatic are audiovisual links in exogenous spatial attention?," Neuropsychologia, vol. 45, no. 3, pp. 514-522, 2007. (Pubitemid 44894459)
    • (2007) Neuropsychologia , vol.45 , Issue.3 , pp. 514-522
    • Mazza, V.1    Turatto, M.2    Rossi, M.3    Umilta, C.4
  • 17
    • 77956831585
    • Flexible macroblock ordering for context-aware ultrasound video transmission over mobile WiMAX
    • M. G. Martini and C. T. E. R. Hewage, "Flexible macroblock ordering for context-aware ultrasound video transmission over mobile WiMAX," Int. J. Telemed. Appl., vol. 2010, pp. 1-14, 2010.
    • (2010) Int. J. Telemed. Appl. , vol.2010 , pp. 1-14
    • Martini, M.G.1    Hewage, C.T.E.R.2
  • 18
    • 85026149763
    • A real-time foveated multiresolution system for low-bandwidth video communication
    • SPIE, San Jose, CA, Jan
    • W. S. Geisler and J. S. Perry, "A real-time foveated multiresolution system for low-bandwidth video communication," in Proc. SPIE, San Jose, CA, Jan. 1998, vol. 3299, pp. 294-305.
    • (1998) Proc. SPIE , vol.3299 , pp. 294-305
    • Geisler, W.S.1    Perry, J.S.2
  • 19
    • 72949100573
    • A novel multiresolution spatiotemporal saliency detection model and its applications in image and video compression
    • Jan
    • C. Guo and L. Zhang, "A novel multiresolution spatiotemporal saliency detection model and its applications in image and video compression," IEEE Trans. Image Process., vol. 19, no. 1, pp. 185-198, Jan. 2010.
    • (2010) IEEE Trans. Image Process. , vol.19 , Issue.1 , pp. 185-198
    • Guo, C.1    Zhang, L.2
  • 20
    • 60149104029
    • Region-of-interest based resource allocation for conversational video communication of H.264/AVC
    • Jan
    • Y. Liu, Z. G. Li, and Y. C. Soh, "Region-of-interest based resource allocation for conversational video communication of H.264/AVC," IEEE Trans. Circuits Syst. Video Technol., vol. 18, no. 1, pp. 134-139, Jan. 2008.
    • (2008) IEEE Trans. Circuits Syst. Video Technol. , vol.18 , Issue.1 , pp. 134-139
    • Liu, Y.1    Li, Z.G.2    Soh, Y.C.3
  • 21
    • 39149102828
    • Region-of-interest video coding based on rate and distortion variations for H.263
    • M.-C. Chi, M.-J. Chen, C.-H. Yeh, and J.-A. Jhu, "Region-of-interest video coding based on rate and distortion variations for H.263," Signal Process.: Image Commun., vol. 23, pp. 127-142, 2008.
    • (2008) Signal Process.: Image Commun. , vol.23 , pp. 127-142
    • Chi, M.-C.1    Chen, M.-J.2    Yeh, C.-H.3    Jhu, J.-A.4
  • 22
    • 0032651860
    • Face segmentation using skin-color map in videophone applications
    • Jun
    • D. Chai and K. N. Ngan, "Face segmentation using skin-color map in videophone applications," IEEE Trans. Circuits Syst. Video Technol., vol. 9, no. 4, pp. 551-564, Jun. 1999.
    • (1999) IEEE Trans. Circuits Syst. Video Technol. , vol.9 , Issue.4 , pp. 551-564
    • Chai, D.1    Ngan, K.N.2
  • 24
    • 31344476802
    • Region-based rate control and bit allocation for wireless video transmission
    • Feb
    • Y. Sun, I. Ahmad, D. Li, and Y.-Q. Zhang, "Region-based rate control and bit allocation for wireless video transmission," IEEE Trans. Multimedia, vol. 8, no. 1, pp. 1-10, Feb. 2006.
    • (2006) IEEE Trans. Multimedia , vol.8 , Issue.1 , pp. 1-10
    • Sun, Y.1    Ahmad, I.2    Li, D.3    Zhang, Y.-Q.4
  • 25
    • 0037296332
    • Real-time foveation techniques for low bit rate video coding
    • H. R. Sheikh, B. L. Evans, and A. C. Bovik, "Real-time foveation techniques for low bit rate video coding," Real-Time Imaging, vol. 9, pp. 27-40, 2003.
    • (2003) Real-Time Imaging , vol.9 , pp. 27-40
    • Sheikh, H.R.1    Evans, B.L.2    Bovik, A.C.3
  • 26
    • 77957702893
    • Visual attention guided bit allocation in video compression
    • Jan
    • Z. Li, S. Qin, and L. Itti, "Visual attention guided bit allocation in video compression," Image Vis. Comput., vol. 29, no. 1, pp. 1-14, Jan. 2011.
    • (2011) Image Vis. Comput. , vol.29 , Issue.1 , pp. 1-14
    • Li, Z.1    Qin, S.2    Itti, L.3
  • 27
    • 50549092673
    • The evolution of video quality measurement: From PSNR to hybrid metrics
    • Sep
    • S. Winkler and P. Mohandas, "The evolution of video quality measurement: From PSNR to hybrid metrics," IEEE Trans. Broadcast., vol. 54, no. 3, pp. 660-668, Sep. 2008.
    • (2008) IEEE Trans. Broadcast. , vol.54 , Issue.3 , pp. 660-668
    • Winkler, S.1    Mohandas, P.2
  • 29
    • 80054823575
    • Methodology for the Subjective Assessment of the Quality of Television Pictures, Rec. ITU-R BT.500-11, 2002, Geneva, Switzerland
    • Methodology for the Subjective Assessment of the Quality of Television Pictures, Rec. ITU-R BT.500-11, 2002, Geneva, Switzerland.
  • 31
    • 13344250690
    • Data fusion for visual tracking with particles
    • DOI 10.1109/JPROC.2003.823147, Sequential State Estimation: From Kalman Filters to Particles Filters
    • P. Perez, J. Vermaak, and A. Blake, "Data fusion for visual tracking with particles," Proc. IEEE, vol. 92, no. 3, pp. 495-513, Mar. 2004. (Pubitemid 40890756)
    • (2004) Proceedings of the IEEE , vol.92 , Issue.3 , pp. 495-513
    • Perez, P.1    Vermaak, J.2    Blake, A.3
  • 32
    • 0033683662
    • Multimodal speaker detection using error feedback dynamic Bayesian networks
    • Hilton Head Island, SC
    • V. Pavlovic, A. Garg, J. M. Rehg, and T. S. Huang, "Multimodal speaker detection using error feedback dynamic Bayesian networks," in Proc. Int. Conf. Comput. Vis. Pattern Recognit., Hilton Head Island, SC, 2000, pp. 34-41.
    • (2000) Proc. Int. Conf. Comput. Vis. Pattern Recognit. , pp. 34-41
    • Pavlovic, V.1    Garg, A.2    Rehg, J.M.3    Huang, T.S.4
  • 33
    • 85008046156
    • Extraction of audio features specific to speech production for multimodal speaker detection
    • Jan
    • P. Besson, V. Popovici, J.-M. Vesin, J.-P. Thiran, and M. Kunt, "Extraction of audio features specific to speech production for multimodal speaker detection," IEEE Trans. Multimedia, vol. 10, no. 1, pp. 63-73, Jan. 2008.
    • (2008) IEEE Trans. Multimedia , vol.10 , Issue.1 , pp. 63-73
    • Besson, P.1    Popovici, V.2    Vesin, J.-M.3    Thiran, J.-P.4    Kunt, M.5
  • 34
    • 10044285992
    • Canonical correlation analysis: An overview with application to learning methods
    • DOI 10.1162/0899766042321814
    • D. R. Hardoon, S. Szedmak, and J. Shawe-Taylor, "Canonical correlation analysis: An overview with application to learning methods," Neural Comput., vol. 16, no. 12, pp. 2639-2664, 2004. (Pubitemid 39604012)
    • (2004) Neural Computation , vol.16 , Issue.12 , pp. 2639-2664
    • Hardoon, D.R.1    Szedmak, S.2    Shawe-Taylor, J.3
  • 35
    • 70449553985
    • Video coding based on audio-visual attention
    • Multimedia Expo, New York, Jun
    • J.-S. Lee, F. D. Simone, and T. Ebrahimi, "Video coding based on audio-visual attention," in Proc. Int. Conf. Multimedia Expo, New York, Jun. 2009, pp. 57-60.
    • (2009) Proc. Int. Conf. Multimedia Expo , pp. 57-60
    • Lee, J.-S.1    Simone, F.D.2    Ebrahimi, T.3
  • 36
    • 34147167538
    • Cross-modal localization via sparsity
    • DOI 10.1109/TSP.2006.888095
    • E. Kidron, Y. Y. Schechner, and M. Elad, "Cross-modal localization via sparsity," IEEE Trans. Signal Process., vol. 55, no. 4, pp. 1390-1404, Apr. 2007. (Pubitemid 46563162)
    • (2007) IEEE Transactions on Signal Processing , vol.55 , Issue.4 , pp. 1390-1404
    • Kidron, E.1    Schechner, Y.Y.2    Elad, M.3
  • 37
    • 78149334236
    • Effect of compressed offline foveated video on viewing behavior and subjective quality
    • Feb
    • M. Nyström and K. Holmqvist, "Effect of compressed offline foveated video on viewing behavior and subjective quality," ACM Trans. Multimedia Comput., Commun., Applicat., vol. 6, no. 1, pp. 1-14, Feb. 2010.
    • (2010) ACM Trans. Multimedia Comput., Commun., Applicat. , vol.6 , Issue.1 , pp. 1-14
    • Nyström, M.1    Holmqvist, K.2
  • 38
    • 80054796130
    • H.264/AVC JM Reference Software, 2008 [Online]. Available
    • H.264/AVC JM Reference Software, 2008 [Online]. Available: http://iphome.hhi.de/suehring/tml/
  • 39
    • 34547521060
    • Integrating audiovisual information for the control of overt attention
    • Jul
    • S. Onat, K. Libertus, and P. König, "Integrating audiovisual information for the control of overt attention," J. Vis., vol. 7, no. 10, pp. 1-16, Jul. 2007.
    • (2007) J. Vis. , vol.7 , Issue.10 , pp. 1-16
    • Onat, S.1    Libertus, K.2    König, P.3
  • 41
    • 39749119941
    • Task-demands can immediately reverse the effects of sensory-driven saliency in complex visual stimuli
    • Feb
    • W. Einhäuser, U. Rutishauser, and C. Koch, "Task-demands can immediately reverse the effects of sensory-driven saliency in complex visual stimuli," J. Vis., vol. 8, no. 2, pp. 1-19, Feb. 2008.
    • (2008) J. Vis. , vol.8 , Issue.2 , pp. 1-19
    • Einhäuser, W.1    Rutishauser, U.2    Koch, C.3
  • 42
    • 34948850787
    • The role of memory in guiding attention during natural vision
    • Aug
    • R. Carmi and L. Itti, "The role of memory in guiding attention during natural vision," J. Vis., vol. 6, no. 9, pp. 898-914, Aug. 2006.
    • (2006) J. Vis. , vol.6 , Issue.9 , pp. 898-914
    • Carmi, R.1    Itti, L.2
  • 43
    • 77955708844
    • Do video coding impairments disturb the visual attention deployment?
    • Sep
    • O. L. Meur, A. Ninassi, P. L. Callet, and D. Barba, "Do video coding impairments disturb the visual attention deployment?," Signal Process.: Image Commun., vol. 25, no. 8, pp. 597-609, Sep. 2010.
    • (2010) Signal Process.: Image Commun. , vol.25 , Issue.8 , pp. 597-609
    • Meur, O.L.1    Ninassi, A.2    Callet, P.L.3    Barba, D.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.