SCOPUS 정보 검색 플랫폼

Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017

Volumn 2017-January, Issue , 2017, Pages 4428-4437

Counting everyday objects in everyday scenes

(5) Chattopadhyay, Prithvijit a Vedantam, Ramakrishna a Selvaraju, Ramprasaath R a Batra, Dhruv b Parikh, Devi b

a VIRGINIA POLYTECHNIC INSTITUTE AND STATE UNIVERSITY (United States)

b GEORGIA INSTITUTE OF TECHNOLOGY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER VISION; OBJECT DETECTION; OBJECT RECOGNITION; SECURITY SYSTEMS;

DIVIDE AND CONQUER; NATURAL SCENES; OBJECT CLASS; PROOF OF CONCEPT; QUESTION ANSWERING; RESTRICTED-DOMAIN; SURVEILLANCE VIDEO;

PATTERN RECOGNITION;

EID: 85043790150 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/CVPR.2017.471 Document Type: Conference Paper

Times cited : (137)

References (46)

1
- 85044277971
- Analyzing the behavior of visual question answering models
- A. Agrawal, D. Batra, and D. Parikh. Analyzing the behavior of visual question answering models. CoRR, abs/1606.07356, 2016.
- (2016) CoRR, abs/1606.07356
- Agrawal, A.¹ Batra, D.² Parikh, D.³

2
- 84973890960
- VQA: Visual question answering
- S. Antol, A. Agrawal, J. Lu, M. Mitchell, D. Batra, C. L. Zitnick, and D. Parikh. VQA: visual question answering. In 2015 IEEE International Conference on Computer Vision, ICCV 2015, Santiago, Chile, December 7-13, 2015, pages 2425-2433, 2015.
- (2015) 2015 IEEE International Conference on Computer Vision, ICCV 2015, Santiago, Chile, December 7-13, 2015 , pp. 2425-2433
- Antol, S.¹ Agrawal, A.² Lu, J.³ Mitchell, M.⁴ Batra, D.⁵ Zitnick, C.L.⁶ Parikh, D.⁷

3
- 85041804800
- Counting in the wild
- C. Arteta, V. Lempitsky, and A. Zisserman. Counting in the wild. In European Conference on Computer Vision, 2016.
- (2016) European Conference on Computer Vision
- Arteta, C.¹ Lempitsky, V.² Zisserman, A.³

4
- 84867872703
- Semantic segmentation with second-order pooling
- J. Carreira, R. Caseiro, J. Batista, and C. Sminchisescu. Semantic segmentation with second-order pooling. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), volume 7578 LNCS, pages 430-443, 2012.
- (2012) Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 7578 LNCS , pp. 430-443
- Carreira, J.¹ Caseiro, R.² Batista, J.³ Sminchisescu, C.⁴

5
- 51949104316
- Privacy preserving crowd monitoring: Counting people without people models or tracking
- IEEE
- A. B. Chan and N. Vasconcelos. Privacy preserving crowd monitoring: Counting people without people models or tracking. In 2008 IEEE Conference on Computer Vision and Pattern Recognition, pages 1-7. IEEE, 6 2008.
- (2008) 2008 IEEE Conference on Computer Vision and Pattern Recognition , vol.6 , pp. 1-7
- Chan, A.B.¹ Vasconcelos, N.²

6
- 77953177412
- Bayesian poisson regression for crowd counting
- IEEE
- A. B. Chan and N. Vasconcelos. Bayesian poisson regression for crowd counting. In 2009 IEEE 12th International Conference on Computer Vision, pages 545-551. IEEE, 9 2009.
- (2009) 2009 IEEE 12th International Conference on Computer Vision , vol.9 , pp. 545-551
- Chan, A.B.¹ Vasconcelos, N.²

7
- 85029078673
- Counting everyday objects in everyday scenes
- P. Chattopadhyay, R. Vedantam, R. S. Ramprasaath, D. Batra, and D. Parikh. Counting everyday objects in everyday scenes. CoRR, abs/1604.03505, 2016.
- (2016) CoRR, abs/1604.03505
- Chattopadhyay, P.¹ Vedantam, R.² Ramprasaath, R.S.³ Batra, D.⁴ Parikh, D.⁵

8
- 0012036878
- Subitizing: What is it? Why teach it?
- D. H. Clements. Subitizing: What is it? why teach it? Teaching children mathematics, 5(7):400, 1999.
- (1999) Teaching Children Mathematics , vol.5 , Issue.7 , pp. 400
- Clements, D.H.¹

9
- 84888340666
- Torch7: A matlab-like environment for machine learning
- R. Collobert, K. Kavukcuoglu, and C. Farabet. Torch7: A matlab-like environment for machine learning. In BigLearn, NIPS Workshop, 2011.
- (2011) BigLearn, NIPS Workshop
- Collobert, R.¹ Kavukcuoglu, K.² Farabet, C.³

10
- 84870917824
- Subitizing and visual shortterm memory in human and non-human species: A common shared system?
- S. Cutini and M. Bonato. Subitizing and visual shortterm memory in human and non-human species: a common shared system? Frontiers in Psychology, 3, 2012.
- (2012) Frontiers in Psychology , vol.3
- Cutini, S.¹ Bonato, M.²

11
- 45849104230
- Log or linear? Distinct intuitions of the number scale in western and amazonian indigene cultures
- S. Dehaene, V. Izard, E. Spelke, and P. Pica. Log or linear? distinct intuitions of the number scale in western and amazonian indigene cultures. Science, 320(5880):1217-1220, 2008.
- (2008) Science , vol.320 , Issue.5880 , pp. 1217-1220
- Dehaene, S.¹ Izard, V.² Spelke, E.³ Pica, P.⁴

12
- 85044305973
- J. Donahue, Y. Jia, O. Vinyals, J. Hoffman, N. Zhang, E. Tzeng, and T. Darrell. Decaf: A deep convolutional activation feature for generic visual recognition. 10 2013.
- (2013) Decaf: A Deep Convolutional Activation Feature for Generic Visual Recognition , vol.10
- Donahue, J.¹ Jia, Y.² Vinyals, O.³ Hoffman, J.⁴ Zhang, N.⁵ Tzeng, E.⁶ Darrell, T.⁷

13
- 84921069139
- The pascal visual object classes challenge: A retrospective
- Jan
- M. Everingham, S. M. A. Eslami, L. Van Gool, C. K. I. Williams, J. Winn, and A. Zisserman. The pascal visual object classes challenge: A retrospective. International Journal of Computer Vision, 111(1):98-136, Jan. 2015.
- (2015) International Journal of Computer Vision , vol.111 , Issue.1 , pp. 98-136
- Everingham, M.¹ Eslami, S.M.A.² Van Gool, L.³ Williams, C.K.I.⁴ Winn, J.⁵ Zisserman, A.⁶

14
- 51949101231
- A discriminatively trained, multiscale, deformable part model
- P. Felzenszwalb, D. McAllester, and D. Ramanan. A discriminatively trained, multiscale, deformable part model. In 26th IEEE Conference on Computer Vision and Pattern Recognition, CVPR, 2008.
- (2008) 26th IEEE Conference on Computer Vision and Pattern Recognition, CVPR
- Felzenszwalb, P.¹ McAllester, D.² Ramanan, D.³

15
- 85044506279
- Multimodal compact bilinear pooling for visual question answering and visual grounding
- A. Fukui, D. H. Park, D. Yang, A. Rohrbach, T. Darrell, and M. Rohrbach. Multimodal compact bilinear pooling for visual question answering and visual grounding. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, EMNLP 2016, Austin, Texas, USA, November 1-4, 2016, pages 457-468, 2016.
- (2016) Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, EMNLP 2016, Austin, Texas, USA, November 1-4 2016 , pp. 457-468
- Fukui, A.¹ Park, D.H.² Yang, D.³ Rohrbach, A.⁴ Darrell, T.⁵ Rohrbach, M.⁶

16
- 85046386564
- Feb
- F. Galton. One Vote, One Value. 75:414, Feb. 1907.
- (1907) One Vote One Value , vol.75 , pp. 414
- Galton, F.¹

17
- 84973864191
- Object detection via a multiregion and semantic segmentation-aware cnn model
- S. Gidaris and N. Komodakis. Object detection via a multiregion and semantic segmentation-aware cnn model. In Proceedings of the IEEE International Conference on Computer Vision, pages 1134-1142, 2015.
- (2015) Proceedings of the IEEE International Conference on Computer Vision , pp. 1134-1142
- Gidaris, S.¹ Komodakis, N.²

18
- 84986248789
- Fast r-cnn
- R. Girshick. Fast r-cnn. In International Conference on Computer Vision (ICCV), 2015.
- (2015) International Conference on Computer Vision ICCV
- Girshick, R.¹

19
- 84887356947
- Multi-source multi-scale counting in extremely dense crowd images
- Washington, DC, USA, IEEE Computer Society
- H. Idrees, I. Saleemi, C. Seibert, and M. Shah. Multi-source multi-scale counting in extremely dense crowd images. In Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, CVPR '13, pages 2547-2554, Washington, DC, USA, 2013. IEEE Computer Society.
- (2013) Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, CVPR '13 , pp. 2547-2554
- Idrees, H.¹ Saleemi, I.² Seibert, C.³ Shah, M.⁴

20
- 84969584486
- Batch normalization: Accelerating deep network training by reducing internal covariate shift
- S. Ioffe and C. Szegedy. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In Proceedings of The 32nd International Conference on Machine Learning, pages 448-456, 2015.
- (2015) Proceedings of the 32nd International Conference on Machine Learning , pp. 448-456
- Ioffe, S.¹ Szegedy, C.²

21
- 85044330046
- D. B. Jiasen Lu, Xiao Lin and D. Parikh. Deeper lstm and normalized cnn visual question answering model. https://github.com/VT-vision-lab/VQA-LSTM-CNN, 2015.
- (2015) Deeper Lstm and Normalized Cnn Visual Question Answering Model
- Jiasen Lu, D.B.¹ Lin, X.² Parikh, D.³

22
- 85041904911
- CLEVR: A diagnostic dataset for compositional language and elementary visual reasoning
- J. Johnson, B. Hariharan, L. van der Maaten, L. Fei-Fei, C. L. Zitnick, and R. Girshick. CLEVR: A diagnostic dataset for compositional language and elementary visual reasoning. In CVPR, 2017.
- (2017) CVPR
- Johnson, J.¹ Hariharan, B.² Maaten Der LVan³ Fei-Fei, L.⁴ Zitnick, C.L.⁵ Girshick, R.⁶

23
- 84943540775
- Referit game: Referring to objects in photographs of natural scenes
- S. Kazemzadeh, V. Ordonez, M. Matten, and T. L. Berg. Referit game: Referring to objects in photographs of natural scenes. In EMNLP, 2014.
- (2014) EMNLP
- Kazemzadeh, S.¹ Ordonez, V.² Matten, M.³ Berg, T.L.⁴

24
- 85083951076
- Adam: A method for stochastic optimization
- D. P. Kingma and J. Ba. Adam: A method for stochastic optimization. CoRR, abs/1412.6980, 2014.
- (2014) CoRR, abs/1412.6980
- Kingma, D.P.¹ Ba, J.²

25
- 84989405747
- Universals in the development of early arithmetic cognition
- A. Klein and P. Starkey. Universals in the development of early arithmetic cognition. New Directions for Child and Adolescent Development, 1988(41):5-26, 1988.
- (1988) New Directions for Child and Adolescent Development 1988 , vol.41 , pp. 5-26
- Klein, A.¹ Starkey, P.²

26
- 84876231242
- Imagenet classification with deep convolutional neural networks
- A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems, pages 1097-1105, 2012.
- (2012) Advances in Neural Information Processing Systems , pp. 1097-1105
- Krizhevsky, A.¹ Sutskever, I.² Hinton, G.E.³

27
- 85162384490
- Learning to count objects in images
- V. Lempitsky and A. Zisserman. Learning To Count Objects in Images. In Advances in Neural Information Processing Systems, pages 1324-1332, 2010.
- (2010) Advances in Neural Information Processing Systems , pp. 1324-1332
- Lempitsky, V.¹ Zisserman, A.²

28
- 84937834115
- Microsoft COCO: Common objects in context
- T. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollár, and C. L. Zitnick. Microsoft COCO: Common objects in context. In ECCV, 2014.
- (2014) ECCV
- Lin, T.¹ Maire, M.² Belongie, S.³ Hays, J.⁴ Perona, P.⁵ Ramanan, D.⁶ Dollár, P.⁷ Zitnick, C.L.⁸

29
- 85007570504
- SSD: Single shot multibox detector
- W. Liu, D. Anguelov, D. Erhan, C. Szegedy, and S. E. Reed. SSD: single shot multibox detector. CoRR, abs/1512.02325, 2015.
- (2015) CoRR, abs/1512.02325
- Liu, W.¹ Anguelov, D.² Erhan, D.³ Szegedy, C.⁴ Reed, S.E.⁵

30
- 84959205572
- Fully convolutional networks for semantic segmentation
- J. Long, E. Shelhamer, and T. Darrell. Fully convolutional networks for semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 3431-3440, 2015.
- (2015) Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pp. 3431-3440
- Long, J.¹ Shelhamer, E.² Darrell, T.³

31
- 84973896625
- Ask your neurons: A neural-based approach to answering questions about images
- M. Malinowski, M. Rohrbach, and M. Fritz. Ask your neurons: A neural-based approach to answering questions about images. In Proceedings of the IEEE International Conference on Computer Vision, pages 1-9, 2015.
- (2015) Proceedings of the IEEE International Conference on Computer Vision , pp. 1-9
- Malinowski, M.¹ Rohrbach, M.² Fritz, M.³

32
- 84898956512
- Distributed representations of words and phrases and their compositionality
- T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean. Distributed Representations of Words and Phrases and their Compositionality. In Advances in Neural Information Processing Systems, pages 3111-3119, 2013.
- (2013) Advances in Neural Information Processing Systems , pp. 3111-3119
- Mikolov, T.¹ Sutskever, I.² Chen, K.³ Corrado, G.S.⁴ Dean, J.⁵

33
- 85021624882
- Towards perspective-free object counting with deep learning
- D. Oñoro Rubio and R. J. López-Sastre. Towards perspective-free object counting with deep learning. In ECCV, 2016.
- (2016) ECCV
- Oñoro Rubio, D.¹ López-Sastre, R.J.²

34
- 84986308404
- You only look once: Unified, real-time object detection
- June
- J. Redmon, S. Divvala, R. Girshick, and A. Farhadi. You only look once: Unified, real-time object detection. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2016.
- (2016) The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
- Redmon, J.¹ Divvala, S.² Girshick, R.³ Farhadi, A.⁴

35
- 84965170394
- Exploring models and data for image question answering
- M. Ren, R. Kiros, and R. Zemel. Exploring models and data for image question answering. In Advances in Neural Information Processing Systems, pages 2953-2961, 2015.
- (2015) Advances in Neural Information Processing Systems , pp. 2953-2961
- Ren, M.¹ Kiros, R.² Zemel, R.³

36
- 84990028830
- End-to-end instance segmentation and counting with recurrent attention
- M. Ren and R. S. Zemel. End-to-end instance segmentation and counting with recurrent attention. CoRR, abs/1605.09410, 2016.
- (2016) CoRR, abs/1605.09410
- Ren, M.¹ Zemel, R.S.²

37
- 84960980241
- Faster r-cnn: Towards real-time object detection with region proposal networks
- C. Cortes, N. D. Lawrence, D. D. Lee, M. Sugiyama, and R. Garnett, editors, Curran Associates, Inc
- S. Ren, K. He, R. Girshick, and J. Sun. Faster r-cnn: Towards real-time object detection with region proposal networks. In C. Cortes, N. D. Lawrence, D. D. Lee, M. Sugiyama, and R. Garnett, editors, Advances in Neural Information Processing Systems 28, pages 91-99. Curran Associates, Inc., 2015.
- (2015) Advances in Neural Information Processing Systems , vol.28 , pp. 91-99
- Ren, S.¹ He, K.² Girshick, R.³ Sun, J.⁴

38
- 84947041871
- Imagenet large scale visual recognition challenge
- O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. Bernstein, A. C. Berg, and L. Fei-Fei. ImageNet Large Scale Visual Recognition Challenge. International Journal of Computer Vision (IJCV), 115(3):211-252, 2015.
- (2015) International Journal of Computer Vision (IJCV) , vol.115 , Issue.3 , pp. 211-252
- Russakovsky, O.¹ Deng, J.² Su, H.³ Krause, J.⁴ Satheesh, S.⁵ Ma, S.⁶ Huang, Z.⁷ Karpathy, A.⁸ Khosla, A.⁹ Bernstein, M.¹⁰ Berg, A.C.¹¹ Fei-Fei, L.¹²

39
- 84887384357
- It's not polite to point: Describing people with uncertain attributes
- IEEE
- A. Sadovnik, A. C. Gallagher, and T. Chen. It's not polite to point: Describing people with uncertain attributes. In CVPR, pages 3089-3096. IEEE, 2013.
- (2013) CVPR , pp. 3089-3096
- Sadovnik, A.¹ Gallagher, A.C.² Chen, T.³

40
- 0031268931
- Bidirectional recurrent neural networks
- M. Schuster and K. K. Paliwal. Bidirectional recurrent neural networks. IEEE Trans. Signal Processing, 45:2673-2681, 1997.
- (1997) IEEE Trans. Signal Processing , vol.45 , pp. 2673-2681
- Schuster, M.¹ Paliwal, K.K.²

41
- 85044288065
- may
- S. Seguí, O. Pujol, and J. Vitrià. Learning to count with deep object features. may 2015.
- (2015) Learning to Count with Deep Object Features
- Seguí, S.¹ Pujol, O.² Vitrià, J.³

42
- 85044267754
- K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. 9 2014.
- (2014) Very Deep Convolutional Networks for Large-scale Image Recognition , vol.9
- Simonyan, K.¹ Zisserman, A.²

43
- 0035680116
- Rapid object detection using a boosted cascade of simple features
- P. Viola and M. Jones. Rapid object detection using a boosted cascade of simple features.computer Vision and Pattern Recognition (CVPR), 1:I-511-I-518, 2001.
- (2001) Computer Vision and Pattern Recognition (CVPR) , vol.1 , pp. 1511-1518
- Viola, P.¹ Jones, M.²

44
- 84959203164
- End-to-end integration of a convolution network, deformable parts model and nonmaximum suppression
- L. Wan, D. Eigen, and R. Fergus. End-to-end integration of a convolution network, deformable parts model and nonmaximum suppression. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 851-859, 2015.
- (2015) Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pp. 851-859
- Wan, L.¹ Eigen, D.² Fergus, R.³

45
- 84959214343
- Cross-scene crowd counting via deep convolutional neural networks
- C. Zhang, H. Li, X.Wang, and X. Yang. Cross-Scene Crowd Counting via Deep Convolutional Neural Networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 833-841, 2015.
- (2015) Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pp. 833-841
- Zhang, C.¹ Li, H.² Wang, X.³ Yang, X.⁴

46
- 84959205754
- Salient object subitizing
- J. Zhang, S. Ma, M. Sameki, S. Sclaroff, M. Betke, Z. Lin, X. Shen, B. Price, and R. M?ech. Salient object subitizing. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015.
- (2015) IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
- Zhang, J.¹ Ma, S.² Sameki, M.³ Sclaroff, S.⁴ Betke, M.⁵ Lin, Z.⁶ Shen, X.⁷ Price, B.⁸ Mech, R.⁹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.