



Volume 113, Issue 1, 2015, Pages 67-79

A Neural Autoregressive Approach to Attention-based Recognition

Author keywords

Attention based recognition; Deep learning; Neural autoregressive distribution estimator; Neural networks

Indexed keywords

ARTIFICIAL INTELLIGENCE; COMPUTER VISION; GESTURE RECOGNITION; LEARNING SYSTEMS; NEURAL NETWORKS

EID: 84939873522     PISSN: 09205691     EISSN: 15731405     Source Type: Journal    
DOI: 10.1007/s11263-014-0765-x     Document Type: Article
Times cited: 27

References (31)
  • 1
    • EID: 80053442030
    • Bazzani, L., de Freitas, N., Larochelle, H., Murino, V., & Ting, J.-A. (2011). Learning attentional policies for tracking and recognition in video with deep networks. In Proceedings of the 28th international conference on machine learning (ICML 2011) (pp. 937–944). ACM.
  • 3
    • EID: 80052948224
    • Cheng, M.-M., Zhang, G.-X., Mitra, N. J., Huang, X., & Hu, S.-M. (2011). Global contrast based salient region detection. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2011 (pp. 409–416). IEEE.
  • 4
    • EID: 33645146449
    • Dalal, N., & Triggs, B. (2005). Histograms of oriented gradients for human detection. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005) (Vol. 1, pp. 886–893). IEEE.
  • 5
    • EID: 3042535216
    • Lowe, D. G. (2004). Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 60(2), 91–110.
  • 6
    • EID: 84867478719
    • Denil, M., Bazzani, L., Larochelle, H., & de Freitas, N. (2012). Learning where to attend with deep architectures for image tracking. Neural Computation, 24(8), 2151–2184.
  • 7
    • EID: 84939919582
    • Erez, T., Tramper, J. J., Smart, W. D., & Gielen, S. C. A. M. (2011). A POMDP model of eye-hand coordination. In AAAI.
  • 8
    • EID: 55549147144
    • Fazl, A., Grossberg, S., & Mingolla, E. (2009). View-invariant object category learning, recognition, and search: How spatial and object attention are coordinated using surface-based attentional shrouds. Cognitive Psychology, 58(1), 1–48.
  • 10
    • EID: 0013344078
    • Hinton, G. E. (2002). Training products of experts by minimizing contrastive divergence. Neural Computation, 14(8), 1771–1800.
  • 12
    • EID: 77953205576
    • Judd, T., Ehinger, K., Durand, F., & Torralba, A. (2009). Learning to predict where humans look. In IEEE International Conference on Computer Vision (ICCV).
  • 13
    • EID: 77956006319
    • Kanan, C., & Cottrell, G. (2010). Robust classification of objects, faces, and flowers using natural image statistics. In CVPR.
  • 14
    • EID: 85162453120
    • Krause, A., & Ong, C. S. (2011). Contextual Gaussian process bandit optimization. In NIPS (pp. 2447–2455).
  • 16
    • EID: 56449110012
    • Larochelle, H., & Bengio, Y. (2008). Classification using discriminative restricted Boltzmann machines. In Proceedings of the 25th international conference on machine learning (pp. 536–543). ACM.
  • 20
    • EID: 84939919586
    • Lazebnik, S., Schmid, C., & Ponce, J. (2006). Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In CVPR.
  • 21
    • EID: 84899020371
    • Mathe, S., & Sminchisescu, C. (2013). Action from still image dataset and inverse optimal control to learn task specific visual scanpaths. In Advances in neural information processing systems (pp. 1923–1931).
  • 22
    • EID: 77956509090
    • Nair, V., & Hinton, G. E. (2010). Rectified linear units improve restricted Boltzmann machines. In ICML.
  • 23
    • EID: 15244352522
    • Najemnik, J., & Geisler, W. S. (2005). Optimal eye movement strategies in visual search. Nature, 434(7031), 387–391.
  • 24
    • EID: 84866667038
    • Perazzi, F., Krähenbühl, P., Pritch, Y., & Hornung, A. (2012). Saliency filters: Contrast based filtering for salient region detection. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2012 (pp. 733–740). IEEE.
  • 26
    • EID: 33745620071
    • Schmidhuber, J., & Huber, R. (1991). Learning to generate artificial fovea trajectories for target detection. International Journal of Neural Systems, 2(01n02), 125–134.
  • 27
    • EID: 84977775981
    • Southall, J. P. C. (1962). Helmholtz's treatise on physiological optics. Vol. 2: The sensation of vision (J. P. C. Southall, Trans.; translated from the third German edition).
  • 28
    • EID: 84866707259
    • Susskind, J. M., Anderson, A. K., & Hinton, G. E. (2010). The Toronto face database. Department of Computer Science, University of Toronto, Toronto, ON, Canada, Tech. Rep.
  • 30
    • EID: 56449089103
    • Vincent, P., Larochelle, H., Bengio, Y., & Manzagol, P.-A. (2008). Extracting and composing robust features with denoising autoencoders. In Proceedings of the 25th international conference on machine learning (ICML 2008) (pp. 1096–1103). ACM.
  • 31
    • EID: 70450209196
    • Yang, J., Yu, K., & Gong, Y. (2009). Linear spatial pyramid matching using sparse coding for image classification. In CVPR.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.