SCOPUS 정보 검색 플랫폼

International Journal of Computer Vision

Volumn 113, Issue 1, 2015, Pages 67-79

A Neural Autoregressive Approach to Attention-based Recognition

(4) Zheng, Yin a Zemel, Richard S b Zhang, Yu Jin a Larochelle, Hugo c

a TSINGHUA UNIVERSITY (China)

b UNIVERSITY OF TORONTO (Canada)

c UNIVERSITÉ DE SHERBROOKE (Canada)

Author keywords

Attention based recognition; Deep learning; Neural autoregressive distribution estimator; Neural networks

Indexed keywords

ARTIFICIAL INTELLIGENCE; COMPUTER VISION; GESTURE RECOGNITION; LEARNING SYSTEMS; NEURAL NETWORKS;

ATTENTION-BASED RECOGNITION; AUTO-REGRESSIVE; DEEP LEARNING; EXACT CALCULATIONS; FACIAL EXPRESSION RECOGNITION; PERCEPTION AND ACTIONS; RESTRICTED BOLTZMANN MACHINE; VISUAL RECOGNITION;

FACE RECOGNITION;

EID: 84939873522 PISSN: 09205691 EISSN: 15731405 Source Type: Journal
DOI: 10.1007/s11263-014-0765-x Document Type: Article

Times cited : (27)

References (31)

1
- 80053442030
- Bazzani, L., Freitas, N., Larochelle, H., Murino, V., & Ting, J.-A. (2011). Learning attentional policies for tracking and recognition in video with deep networks. In Proceedings of the 28th international conference on machine learning (ICML 2011) (pp. 937–944). ACM
- Bazzani, L., Freitas, N., Larochelle, H., Murino, V., & Ting, J.-A. (2011). Learning attentional policies for tracking and recognition in video with deep networks. In Proceedings of the 28th international conference on machine learning (ICML 2011) (pp. 937–944). ACM.

2
- 77957863822
- Infomax control of eye movements
- Butko, N. J., & Movellan, J. R. (2010). Infomax control of eye movements. IEEE Transactions on Autonomous Mental Development, 2(2), 91–107.
- (2010) IEEE Transactions on Autonomous Mental Development , vol.2 , Issue.2 , pp. 91-107
- Butko, N.J.¹ Movellan, J.R.²

3
- 80052948224
- Cheng, M.-M., Zhang, G.-X., Mitra, N. J., Huang, X., & Hu, S.-M. (2011). Global contrast based salient region detection. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
- Cheng, M.-M., Zhang, G.-X., Mitra, N. J., Huang, X., & Hu, S.-M. (2011). Global contrast based salient region detection. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2011 (pp. 409–416). IEEE.

4
- 33645146449
- Dalal, N., & Triggs, B. (2005). Histograms of oriented gradients for human detection. In IEEE computer society conference on computer vision and pattern recognition. CVPR 2005
- Dalal, N., & Triggs, B. (2005). Histograms of oriented gradients for human detection. In IEEE computer society conference on computer vision and pattern recognition. CVPR 2005 (Vol. 1, pp. 886–893). IEEE.

5
- 3042535216
- Lowe. Distinctive image features from scale-invariant keypoints
- David, G. (2004). Lowe. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 60(2), 91–110.
- (2004) International Journal of Computer Vision , vol.60 , Issue.2 , pp. 91-110
- David, G.¹

6
- 84867478719
- Learning where to attend with deep architectures for image tracking
- Denil, M., Bazzani, L., Larochelle, H., & de Freitas, N. (2012). Learning where to attend with deep architectures for image tracking. Neural Computation, 24(8), 2151–2184.
- (2012) Neural Computation , vol.24 , Issue.8 , pp. 2151-2184
- Denil, M.¹ Bazzani, L.² Larochelle, H.³ de Freitas, N.⁴

7
- 84939919582
- Erez, T., Tramper, J. J., Smart, W. D., & Stan CAM Gielen. (2011). A pomdp model of eye-hand coordination. In AAAI
- Erez, T., Tramper, J. J., Smart, W. D., & Stan CAM Gielen. (2011). A pomdp model of eye-hand coordination. In AAAI.

8
- 55549147144
- View-invariant object category learning, recognition, and search: How spatial and object attention are coordinated using surface-based attentional shrouds
- Fazl, A., Grossberg, S., & Mingolla, E. (2009). View-invariant object category learning, recognition, and search: How spatial and object attention are coordinated using surface-based attentional shrouds. Cognitive psychology, 58(1), 1–48.
- (2009) Cognitive psychology , vol.58 , Issue.1 , pp. 1-48
- Fazl, A.¹ Grossberg, S.² Mingolla, E.³

9
- 84904598276
- Hinton, G. E., Srivastava, N., Krizhevsky, A., Sutskever, I., & Salakhutdinov, R. R. (2012). Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv:1207.0580.
- (2012) Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv , vol.1207 , pp. 0580
- Hinton, G.E.¹ Srivastava, N.² Krizhevsky, A.³ Sutskever, I.⁴ Salakhutdinov, R.R.⁵

10
- 0013344078
- Training products of experts by minimizing contrastive divergence
- Hinton, G. E. (2002). Training products of experts by minimizing contrastive divergence. Neural Computation, 14(8), 1771–1800.
- (2002) Neural Computation , vol.14 , Issue.8 , pp. 1771-1800
- Hinton, G.E.¹

11
- 0032204063
- A model of saliency-based visual attention for rapid scene analysis
- Itti, L., Koch, C., & Niebur, E. (1998). A model of saliency-based visual attention for rapid scene analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 20(11), 1254–1259.
- (1998) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.20 , Issue.11 , pp. 1254-1259
- Itti, L.¹ Koch, C.² Niebur, E.³

12
- 77953205576
- Judd, T., Ehinger, K., Durand, F., & Torralba, A. (2009). Learning to predict where humans look. In IEEE International Conference on Computer Vision (ICCV)
- Judd, T., Ehinger, K., Durand, F., & Torralba, A. (2009). Learning to predict where humans look. In IEEE International Conference on Computer Vision (ICCV).

13
- 77956006319
- Kanan, C., & Cottrell, G. (2010) Robust classification of objects, faces, and flowers using natural image statistics. In CVPR
- Kanan, C., & Cottrell, G. (2010) Robust classification of objects, faces, and flowers using natural image statistics. In CVPR.

14
- 85162453120
- Contextual gaussian process bandit optimization
- Krause, A., & Ong, C. S. (2011). Contextual gaussian process bandit optimization. In NIPS (pp. 2447–2455).
- (2011) In NIPS , pp. 2447-2455
- Krause, A.¹ Ong, C.S.²

15
- 84878919540
- Imagenet classification with deep convolutional neural networks
- Krizhevsky, A., Sutskever, I., & Hinton, G. (2012). Imagenet classification with deep convolutional neural networks. Advances in Neural Information Processing Systems, 25, 1106–1114.
- (2012) Advances in Neural Information Processing Systems , vol.25 , pp. 1106-1114
- Krizhevsky, A.¹ Sutskever, I.² Hinton, G.³

16
- 56449110012
- Larochelle, H., & Bengio, Y. (2008). Classification using discriminative restricted boltzmann machines. In Proceedings of the 25th international conference on machine learning
- Larochelle, H., & Bengio, Y. (2008). Classification using discriminative restricted boltzmann machines. In Proceedings of the 25th international conference on machine learning (pp. 536–543). ACM.

17
- 85162061663
- Learning to combine foveal glimpses with a third-order Boltzmann machine
- Larochelle, H., & Hinton, G. E. (2010). Learning to combine foveal glimpses with a third-order Boltzmann machine. In Advances in neural information processing systems (pp. 1243–1251).
- (2010) In Advances in neural information processing systems , pp. 1243-1251
- Larochelle, H.¹ Hinton, G.E.²

18
- 84861999538
- The neural autoregressive distribution estimator
- Larochelle, H., & Murray, I. (2011). The neural autoregressive distribution estimator. Artificial Intelligence and Statistics (AISTATS), 15, 29–37.
- (2011) Artificial Intelligence and Statistics (AISTATS) , vol.15 , pp. 29-37
- Larochelle, H.¹ Murray, I.²

19
- 84877761544
- A neural autoregressive topic model
- Larochelle, H., & Lauly, S. (2012). A neural autoregressive topic model. Advances in Neural Information Processing Systems, 25, 2717–2725.
- (2012) Advances in Neural Information Processing Systems , vol.25 , pp. 2717-2725
- Larochelle, H.¹ Lauly, S.²

20
- 84939919586
- Lazebnik, S. (2006). Cordelia, and Jean Ponce. Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In CVPR
- Lazebnik, S. (2006). Cordelia, and Jean Ponce. Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In CVPR.

21
- 84899020371
- (2013). Action from still image dataset and inverse optimal control to learn task specific visual scanpaths
- Mathe, S., & Sminchisescu, C. (2013). Action from still image dataset and inverse optimal control to learn task specific visual scanpaths. In Advances in neural information processing systems (pp. 1923–1931, 2013).
- (2013) In Advances in neural information processing systems , pp. 1923-1931
- Mathe, S.¹ Sminchisescu, C.²

22
- 77956509090
- Nair, V., & Hinton, G. E. (2010) Rectified linear units improve restricted boltzmann machines. In ICML
- Nair, V., & Hinton, G. E. (2010) Rectified linear units improve restricted boltzmann machines. In ICML.

23
- 15244352522
- Optimal eye movement strategies in visual search
- Najemnik, J., & Geisler, W. S. (2005). Optimal eye movement strategies in visual search. Nature, 434(7031), 387–391.
- (2005) Nature , vol.434 , Issue.7031 , pp. 387-391
- Najemnik, J.¹ Geisler, W.S.²

24
- 84866667038
- Perazzi, F., Krahenbuhl, P., Pritch, Y., & Hornung, A. (2012). Saliency filters: Contrast based filtering for salient region detection. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2012
- Perazzi, F., Krahenbuhl, P., Pritch, Y., & Hornung, A. (2012). Saliency filters: Contrast based filtering for salient region detection. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2012 (pp. 733–740). IEEE.

25
- 80053460450
- Contractive auto-encoders: Explicit invariance during feature extraction
- Rifai, S., Vincent, P., Muller, X., Glorot, X., & Bengio, Y. (2011). Contractive auto-encoders: Explicit invariance during feature extraction. In Proceedings of the 28th international conference on machine learning (ICML 2011).
- (2011) In Proceedings of the 28th international conference on machine learning (ICML , pp. 2011
- Rifai, S.¹ Vincent, P.² Muller, X.³ Glorot, X.⁴ Bengio, Y.⁵

26
- 33745620071
- Learning to generate artificial fovea trajectories for target detection
- Schmidhuber, J., & Huber, R. (1991). Learning to generate artificial fovea trajectories for target detection. International Journal of Neural Systems, 2(01n02), 125–134.
- (1991) International Journal of Neural Systems , vol.2 , Issue.01n02 , pp. 125-134
- Schmidhuber, J.¹ Huber, R.²

27
- 84977775981
- Helmholtzs treatise on physiological optics. vol. 2: The sensation of vision, trans. J. P. C. Southall
- Southall, J. P. C. (1962). Helmholtzs treatise on physiological optics. vol. 2: The sensation of vision, trans. J. P. C. Southall. (translated from the third german edition).
- (1962) (translated from the third german edition)
- Southall, J.P.C.¹

28
- 84866707259
- The toronto face database. Department of Computer Science, University of Toronto, Toronto
- Canada: Tech. Rep
- Susskind, J. M., Anderson, A. K., & Hinton, G. E. (2010). The toronto face database. Department of Computer Science, University of Toronto, Toronto, ON, Canada, Tech. Rep.
- (2010) ON
- Susskind, J.M.¹ Anderson, A.K.² Hinton, G.E.³

29
- 84898933061
- Rnade: The real-valued neural autoregressive density-estimator
- Uria, B., Murray, I., & Larochelle, H. (2013). Rnade: The real-valued neural autoregressive density-estimator. Advances in Neural Information Processing Systems, 26, 2175–2183.
- (2013) Advances in Neural Information Processing Systems , vol.26 , pp. 2175-2183
- Uria, B.¹ Murray, I.² Larochelle, H.³

30
- 56449089103
- Vincent, P., Larochelle, H., Bengio, Y., & Manzagol, P.-A. (2008). Extracting and composing robust features with denoising autoencoders. In Proceedings of the 25th international conference on machine learning (ICML 2008) ACM
- Vincent, P., Larochelle, H., Bengio, Y., & Manzagol, P.-A. (2008). Extracting and composing robust features with denoising autoencoders. In Proceedings of the 25th international conference on machine learning (ICML 2008) (pp. 1096–1103). ACM.

31
- 70450209196
- Yang, J., Yu., K., & Gong, Y. (2009). Linear spatial pyramid matching using sparse coding for image classification. In CVPR
- Yang, J., Yu., K., & Gong, Y. (2009). Linear spatial pyramid matching using sparse coding for image classification. In CVPR.

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.