-
1
-
-
84976441698
-
-
-
Ahonen, T., Matas, J., He, C., & Pietikäinen, M., et al. (2009). Rotation invariant image description with local binary pattern histogram fourier features. In Scandinavian Conference on Image Analysis.
-
-
-
-
2
-
-
84976371137
-
-
The Berkeley segmentation dataset and benchmark, Retrieved from
-
Arbelaez, P., Fowlkes, C., & Martin, D. (2007). The Berkeley segmentation dataset and benchmark. Retrieved from http://www.eecs.berkeley.edu/Research/Projects/CS/vision/bsds.
-
(2007)
& Martin, D
-
-
Arbelaez, P.1
Fowlkes, C.2
-
3
-
-
79953048649
-
Contour detection and hierarchical image segmentation. Pattern Analysis and Machine Intelligence
-
Arbelaez, P., Maire, M., Fowlkes, C., & Malik, J. (2011). Contour detection and hierarchical image segmentation. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 33(5), 898–916.
-
(2011)
IEEE Transactions on
, vol.33
, Issue.5
, pp. 898-916
-
-
Arbelaez, P.1
Maire, M.2
Fowlkes, C.3
Malik, J.4
-
4
-
-
0041876117
-
Matching words and pictures
-
Barnard, K., Duygulu, P., Forsyth, D., De Freitas, N., Blei, D. M., & Jordan, M. I. (2003). Matching words and pictures. The Journal of Machine Learning Research, 3, 1107–1135.
-
(2003)
The Journal of Machine Learning Research
, vol.3
, pp. 1107-1135
-
-
Barnard, K.1
Duygulu, P.2
Forsyth, D.3
De Freitas, N.4
Blei, D.M.5
Jordan, M.I.6
-
6
-
-
0023322501
-
Recognition-by-components: a theory of human image understanding
-
Biederman, I. (1987). Recognition-by-components: a theory of human image understanding. Psychological Review, 94(2), 115.
-
(1987)
Psychological Review
, vol.94
, Issue.2
, pp. 115
-
-
Biederman, I.1
-
7
-
-
3042597440
-
Learning multi-label scene classification
-
Boutell, M. R., Luo, J., Shen, X., & Brown, C. M. (2004). Learning multi-label scene classification. Pattern recognition, 37(9), 1757–1771.
-
(2004)
Pattern recognition
, vol.37
, Issue.9
, pp. 1757-1771
-
-
Boutell, M.R.1
Luo, J.2
Shen, X.3
Brown, C.M.4
-
8
-
-
21144477583
-
Estimating the number of species: A review
-
Bunge, J., & Fitzpatrick, M. (1993). Estimating the number of species: A review. Journal of the American Statistical Association, 88(421), 364–373.
-
(1993)
Journal of the American Statistical Association
, vol.88
, Issue.421
, pp. 364-373
-
-
Bunge, J.1
Fitzpatrick, M.2
-
9
-
-
33645146449
-
-
-
Dalal, N., & Triggs, B. (2005, June). Histograms of oriented gradients for human detection. In Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on (Vol. 1, pp. 886-893). IEEE.
-
-
-
-
10
-
-
84976364014
-
-
-
Deng, J., Dong, W., Socher, R., Li, L. J., Li, K., & Fei-Fei, L. (2009, June). ImageNet: A large-scale hierarchical image database. In Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on (pp. 248-255). IEEE.
-
-
-
-
11
-
-
84906504048
-
DeCAF: A deep convolutional activation feature for generic visual recognition
-
Donahue, J., Jia, Y., Vinyals, O., Hoffman, J., Zhang, N., Tzeng, E., & Darrell, T. (2013). DeCAF: A deep convolutional activation feature for generic visual recognition. Retrieved from arXiv:1310.1531.
-
(2013)
Retrieved from arXiv
, vol.1310
, pp. 1531
-
-
Donahue, J.1
Jia, Y.2
Vinyals, O.3
Hoffman, J.4
Zhang, N.5
Tzeng, E.6
Darrell, T.7
-
12
-
-
84864968893
-
-
Massachusetts: Cognitive science
-
Ehinger, K. A., Xiao, J., Torralba, A., & Oliva, A. (2011). Estimating scene typicality from human ratings and image features. Massachusetts: Cognitive science.
-
(2011)
Estimating scene typicality from human ratings and image features
-
-
Ehinger, K.A.1
Xiao, J.2
Torralba, A.3
Oliva, A.4
-
13
-
-
0032499196
-
A cortical representation of the local visual environment
-
Epstein, R., & Kanwisher, N. (1998). A cortical representation of the local visual environment. Nature, 392(6676), 598–601.
-
(1998)
Nature
, vol.392
, Issue.6676
, pp. 598-601
-
-
Epstein, R.1
Kanwisher, N.2
-
14
-
-
77951298115
-
The PASCAL Visual Object Classes (VOC) challenge
-
Everingham, M., Van Gool, L., Williams, C. K., Winn, J., & Zisserman, A. (2010). The PASCAL Visual Object Classes (VOC) challenge. International Journal of Computer Vision, 88(2), 303–338.
-
(2010)
International Journal of Computer Vision
, vol.88
, Issue.2
, pp. 303-338
-
-
Everingham, M.1
Van Gool, L.2
Williams, C.K.3
Winn, J.4
Zisserman, A.5
-
15
-
-
85199257282
-
-
-
Fei-Fei, L., Fergus, R., & Perona, P., et al. (2004). Learning generative visual models from few training examples. In Computer Vision and Pattern Recognition Workshop on Generative-Model Based Vision.
-
-
-
-
16
-
-
33745155436
-
-
-
Fei-Fei, L., & Perona, P. (2005). A Bayesian hierarchical model for learning natural scene categories. In Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on (Vol. 2, pp. 524–531). IEEE.
-
-
-
-
18
-
-
77955422240
-
Object detection with discriminatively trained part-based models. Pattern Analysis and Machine Intelligence
-
Felzenszwalb, P. F., Girshick, R. B., McAllester, D., & Ramanan, D. (2010). Object detection with discriminatively trained part-based models. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 32(9), 1627–1645.
-
(2010)
IEEE Transactions on
, vol.32
, Issue.9
, pp. 1627-1645
-
-
Felzenszwalb, P.F.1
Girshick, R.B.2
McAllester, D.3
Ramanan, D.4
-
20
-
-
51949088643
-
-
-
Hays, J., & Efros, A. A. (2008). IM2GPS: estimating geographic information from a single image. In Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on (pp. 1–8). IEEE.
-
-
-
-
21
-
-
34547216923
-
Recovering surface layout from an image
-
Hoiem, D., Efros, A. A., & Hebert, M. (2007). Recovering surface layout from an image. International Journal of Computer Vision, 75(1), 151–172.
-
(2007)
International Journal of Computer Vision
, vol.75
, Issue.1
, pp. 151-172
-
-
Hoiem, D.1
Efros, A.A.2
Hebert, M.3
-
22
-
-
0021418363
-
Pictures and names: Making the connection
-
Jolicoeur, P., Gluck, M. A., & Kosslyn, S. M. (1984). Pictures and names: Making the connection. Cognitive Psychology, 16(2), 243–275.
-
(1984)
Cognitive Psychology
, vol.16
, Issue.2
, pp. 243-275
-
-
Jolicoeur, P.1
Gluck, M.A.2
Kosslyn, S.M.3
-
23
-
-
84937580138
-
-
-
Kosecka, J., & Zhang, W. (2002). Video compass. In Computer Vision-ECCV 2002 (pp. 476–490). Berlin: Springer.
-
-
-
-
24
-
-
84976420971
-
-
Photo clip art, SIGGRAPH
-
Lalonde, J.F., Hoiem, D., Efros, A.A., Rother, C., Winn, J., & Criminisi, A., et al. (2007). Photo clip art. SIGGRAPH.
-
(2007)
et al
-
-
Lalonde, J.F.1
Hoiem, D.2
Efros, A.A.3
Rother, C.4
Winn, J.5
Criminisi, A.6
-
25
-
-
33845572523
-
-
-
Lazebnik, S., Schmid, C., & Ponce, J. (2006). Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In Computer Vision and Pattern Recognition, 2006 IEEE Computer Society Conference on (Vol. 2, pp. 2169–2178). IEEE.
-
-
-
-
26
-
-
0034850577
-
-
-
Martin, D., Fowlkes, C., Tal, D., & Malik, J. (2001). A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics. In Computer Vision, 2001. ICCV 2001. Proceedings. Eighth IEEE International Conference on (Vol. 2, pp. 416-423). IEEE.
-
-
-
-
27
-
-
3142736062
-
Robust wide-baseline stereo from maximally stable extremal regions
-
Matas, J., Chum, O., Urban, M., & Pajdla, T. (2004). Robust wide-baseline stereo from maximally stable extremal regions. Image and Vision Computing, 22(10), 761–767.
-
(2004)
Image and Vision Computing
, vol.22
, Issue.10
, pp. 761-767
-
-
Matas, J.1
Chum, O.2
Urban, M.3
Pajdla, T.4
-
28
-
-
0036647193
-
Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. Pattern Analysis and Machine Intelligence
-
Ojala, T., Pietikainen, M., & Maenpaa, T. (2002). Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 24(7), 971–987.
-
(2002)
IEEE Transactions on
, vol.24
, Issue.7
, pp. 971-987
-
-
Ojala, T.1
Pietikainen, M.2
Maenpaa, T.3
-
29
-
-
0035328421
-
Modeling the shape of the scene: A holistic representation of the spatial envelope
-
Oliva, A., & Torralba, A. (2001). Modeling the shape of the scene: A holistic representation of the spatial envelope. International Journal of Computer Vision, 42(3), 145–175.
-
(2001)
International Journal of Computer Vision
, vol.42
, Issue.3
, pp. 145-175
-
-
Oliva, A.1
Torralba, A.2
-
30
-
-
51949105132
-
-
-
Philbin, J., Chum, O., Isard, M., Sivic, J., & Zisserman, A. (2008, June). Lost in quantization: Improving particular object retrieval in large scale image databases. In Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on (pp. 1–8). IEEE.
-
-
-
-
31
-
-
84906341074
-
CNN Features off-the-shelf: An Astounding Baseline for Recognition
-
Sharif Razavian, A., Azizpour, H., Sullivan, J., & Carlsson, S. (2014). CNN Features off-the-shelf: An Astounding Baseline for Recognition. Retrieved from arXiv:1403.6382.
-
(2014)
Retrieved from arXiv
, vol.1403
, pp. 6382
-
-
Sharif Razavian, A.1
Azizpour, H.2
Sullivan, J.3
Carlsson, S.4
-
32
-
-
2942687330
-
When is scene identification just texture recognition?
-
Renninger, L. W., & Malik, J. (2004). When is scene identification just texture recognition? Vision Research, 44(19), 2301–2311.
-
(2004)
Vision Research
, vol.44
, Issue.19
, pp. 2301-2311
-
-
Renninger, L.W.1
Malik, J.2
-
33
-
-
3042591803
-
Natural categories
-
Rosch, E. H. (1973). Natural categories. Cognitive Psychology, 4(3), 328–350.
-
(1973)
Cognitive Psychology
, vol.4
, Issue.3
, pp. 328-350
-
-
Rosch, E.H.1
-
34
-
-
34248936100
-
Basic objects in natural categories
-
Rosch, E., Mervis, C. B., Gray, W. D., Johnson, D. M., & Boyes-Braem, P. (1976). Basic objects in natural categories. Cognitive Psychology, 8(3), 382–439.
-
(1976)
Cognitive Psychology
, vol.8
, Issue.3
, pp. 382-439
-
-
Rosch, E.1
Mervis, C.B.2
Gray, W.D.3
Johnson, D.M.4
Boyes-Braem, P.5
-
35
-
-
84898805253
-
-
-
Russakovsky, O., Deng, J., Huang, Z., Berg, A. C., & Fei-Fei, L. (2013, December). Detecting avocados to zucchinis: what have we done, and where are we going? In Computer Vision (ICCV), 2013 IEEE International Conference on (pp. 2064–2071). IEEE.
-
-
-
-
36
-
-
39749186006
-
LabelMe: a database and web-based tool for image annotation
-
Russell, B. C., Torralba, A., Murphy, K. P., & Freeman, W. T. (2008). LabelMe: a database and web-based tool for image annotation. International Journal of Computer Vision, 77(1–3), 157–173.
-
(2008)
International Journal of Computer Vision
, vol.77
, Issue.1-3
, pp. 157-173
-
-
Russell, B.C.1
Torralba, A.2
Murphy, K.P.3
Freeman, W.T.4
-
37
-
-
80052889458
-
-
-
Sadeghi, M. A., & Farhadi, A. (2011, June). Recognition using visual phrases. In Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on (pp. 1745–1752). IEEE.
-
-
-
-
38
-
-
84883487458
-
Image classification with the Fisher vector: Theory and practice
-
Sanchez, J., Perronnin, F., Mensink, T., & Verbeek, J. (2013). Image classification with the Fisher vector: Theory and practice. International Journal of Computer Vision, 105(3), 222–245.
-
(2013)
International Journal of Computer Vision
, vol.105
, Issue.3
, pp. 222-245
-
-
Sanchez, J.1
Perronnin, F.2
Mensink, T.3
Verbeek, J.4
-
39
-
-
84906486689
-
OverFeat: Integrated recognition, localization and detection using convolutional networks
-
Sermanet, P., Eigen, D., Zhang, X., Mathieu, M., Fergus, R., & LeCun, Y. (2013). OverFeat: Integrated recognition, localization and detection using convolutional networks. Retrieved from arXiv:1312.6229.
-
(2013)
Retrieved from arXiv
, vol.1312
, pp. 6229
-
-
Sermanet, P.1
Eigen, D.2
Zhang, X.3
Mathieu, M.4
Fergus, R.5
LeCun, Y.6
-
40
-
-
34948845616
-
-
-
Shechtman, E., & Irani, M. (2007, June). Matching local self-similarities across images and videos. In Computer Vision and Pattern Recognition, 2007. CVPR’07. IEEE Conference on (pp. 1–8). IEEE.
-
-
-
-
41
-
-
33745824267
-
-
-
Shotton, J., Winn, J., Rother, C., & Criminisi, A. (2006). TextonBoost: Joint appearance, shape and context modeling for multi-class object recognition and segmentation. In ECCV (pp. 1–15). Berlin: Springer.
-
-
-
-
42
-
-
5044234908
-
-
-
Sivic, J., & Zisserman, A. (2004, June). Video data mining using configurations of viewpoint invariant regions. In Computer Vision and Pattern Recognition, 2004. CVPR 2004. Proceedings of the 2004 IEEE Computer Society Conference on (Vol. 1, pp. I–488). IEEE.
-
-
-
-
43
-
-
84976417518
-
-
-
Song, S., & Xiao, J. (2014). Sliding Shapes for 3D object detection in RGB-D images. In European Conference on Computer Vision.
-
-
-
-
44
-
-
77955998327
-
Some objects are more equal than others: measuring and predicting importance
-
Spain, M., & Perona, P. (2008). Some objects are more equal than others: measuring and predicting importance. In European Conference on Computer Vision.
-
(2008)
In: European Conference on Computer Vision
-
-
Spain, M.1
Perona, P.2
-
45
-
-
54749092170
-
-
-
Torralba, A., Fergus, R., & Freeman, W. T. (2008). 80 million tiny images: A large data set for nonparametric object and scene recognition. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 30(11), 1958–1970.
-
-
-
-
46
-
-
0344120278
-
-
-
Torralba, A., Murphy, K. P., Freeman, W. T., & Rubin, M. A. (2003, October). Context-based vision system for place and object recognition. In Computer Vision, 2003. Proceedings. Ninth IEEE International Conference on (pp. 273–280). IEEE.
-
-
-
-
47
-
-
0001422823
-
Categories of environmental scenes
-
Tversky, B., & Hemenway, K. (1983). Categories of environmental scenes. Cognitive Psychology, 15(1), 121–149.
-
(1983)
Cognitive Psychology
, vol.15
, Issue.1
, pp. 121-149
-
-
Tversky, B.1
Hemenway, K.2
-
48
-
-
78650994992
-
-
-
Vedaldi, A., & Fulkerson, B. (2010, October). An open and portable library of computer vision algorithms: VLFeat. In Proceedings of the international conference on Multimedia (pp. 1469–1472). ACM.
-
-
-
-
49
-
-
35048837175
-
-
-
Vogel, J., & Schiele, B. (2004). A semantic typicality measure for natural scene categorization. In Pattern Recognition (pp. 195–203). Berlin: Springer.
-
-
-
-
50
-
-
33846249578
-
Semantic modeling of natural scenes for content-based image retrieval
-
Vogel, J., & Schiele, B. (2007). Semantic modeling of natural scenes for content-based image retrieval. International Journal of Computer Vision, 72(2), 133–157.
-
(2007)
International Journal of Computer Vision
, vol.72
, Issue.2
, pp. 133-157
-
-
Vogel, J.1
Schiele, B.2
-
51
-
-
84866725899
-
-
-
Xiao, J., Ehinger, K. A., Oliva, A., & Torralba, A. (2012, June). Recognizing scene viewpoint using panoramic place representation. In Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on (pp. 2695–2702). IEEE.
-
-
-
-
52
-
-
77955988947
-
-
-
Xiao, J., Hays, J., Ehinger, K. A., Oliva, A., & Torralba, A. (2010, June). SUN database: Large-scale scene recognition from abbey to zoo. In Computer vision and pattern recognition (CVPR), 2010 IEEE conference on (pp. 3485–3492). IEEE.
-
-
-
-
53
-
-
84898798081
-
-
-
Xiao, J., Owens, A., & Torralba, A. (2013, December). SUN3D: A database of big spaces reconstructed using SfM and object labels. In Computer Vision (ICCV), 2013 IEEE International Conference on (pp. 1625–1632). IEEE.
-
-
-
-
54
-
-
84976382753
-
-
-
Zhang, Y., Song, S., Tan, P., & Xiao, J., et al. (2014). PanoContext: A whole-room 3D context model for panoramic scene understanding. In European Conference on Computer Vision.
-
-
-