-
3
-
-
84866726859
-
Understanding and predicting importance in images
-
A. C. Berg, T. L. Berg, H. Daume, J. Dodge, A. Goyal, X. Han, A. Mensch, M. Mitchell, A. Sood, K. Stratos, et al. Understanding and predicting importance in images. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 3 5 62-3569, 2012.
-
(2012)
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 3562-3569
-
-
Berg, A.C.1
Berg, T.L.2
Daume, H.3
Dodge, J.4
Goyal, A.5
Han, X.6
Mensch, A.7
Mitchell, M.8
Sood, A.9
Stratos, K.10
-
8
-
-
84944046597
-
-
arXiv preprint arXiv:1411. 4389
-
J. Donahue, L. A. Hendricks, S. Guadarrama, M. Rohrbach, S. Venugopalan, K. Saenko, and T. Darrell. Long-term recurrent convolutional networks for visual recognition and description. arXiv preprint arXiv:1411. 4389, 20 14
-
(2014)
Long-term Recurrent Convolutional Networks for Visual Recognition and Description
-
-
Donahue, J.1
Hendricks, L.A.2
Guadarrama, S.3
Rohrbach, M.4
Venugopalan, S.5
Saenko, K.6
Darrell, T.7
-
9
-
-
84904482223
-
-
arXiv preprint arXiv:1310. 1531
-
J. Donahue, Y. Jia, O. Vinyals, J. Hoffman, N. Zhang, E. Tzeng, and T. Darrell. Decaf: A deep convolutional activation feature for generic visual recognition. arXiv preprint arXiv:1310. 1531, 2013
-
(2013)
Decaf: A Deep Convolutional Activation Feature for Generic Visual Recognition
-
-
Donahue, J.1
Jia, Y.2
Vinyals, O.3
Hoffman, J.4
Zhang, N.5
Tzeng, E.6
Darrell, T.7
-
12
-
-
84880644383
-
-
M. Everingham, S. A. Eslami, L. Van Gool, e. K. Williams, J. Winn, and A. Zisserman. The Pascal Visual Object Classes Challenge-a Retrospective
-
The Pascal Visual Object Classes Challenge-a Retrospective
-
-
Everingham, M.1
Eslami, S.A.2
Van Gool, L.3
Williams, E.K.4
Winn, J.5
Zisserman, A.6
-
13
-
-
84959250180
-
From captions to visual concepts and back
-
H. Fang, S. Gupta, F. landola, R. Srivastava, L. Deng, P. Dollar, J. Gao, X. He, M. Mitchell, J. Platt, et al. From captions to visual concepts and back. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
-
(2015)
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
-
-
Fang, H.1
Gupta, S.2
Landola, F.3
Srivastava, R.4
Deng, L.5
Dollar, P.6
Gao, J.7
He, X.8
Mitchell, M.9
Platt, J.10
-
14
-
-
78149311145
-
Every picture tells a story: Generating sentences from images
-
Springer
-
A. Farhadi, M. Hejrati, M. A. Sadeghi, P. Young, e. Rashtchian, J. Hockenmaier, and D. Forsyth. Every picture tells a story: Generating sentences from images. In Computer Vision-ECCV 2010, pages 1 5-29. Springer, 2010.
-
(2010)
Computer Vision-ECCV 2010
, pp. 15-29
-
-
Farhadi, A.1
Hejrati, M.2
Sadeghi, M.A.3
Young, P.4
Rashtchian, E.5
Hockenmaier, J.6
Forsyth, D.7
-
15
-
-
80052264770
-
Metamers of the ventral stream
-
J. Freeman and E. P. Simoncelli. Metamers of the ventral stream. Nature Neuroscience, 1 4(9): 1 1 95-1 201, 2011
-
(2011)
Nature Neuroscience
, vol.14
, Issue.9
, pp. 1195-1201
-
-
Freeman, J.1
Simoncelli, E.P.2
-
17
-
-
84898827031
-
The interestingness of images
-
M. Gygli, H. Grabner, H. Riemenschneider, F. Nater, and L. V. Gool. The interestingness of images. In 1EEE International Conference on Computer Vision (ICCV), pages 1 63 3-1 640, 2013
-
(2013)
1EEE International Conference on Computer Vision (ICCV)
, pp. 1633-1640
-
-
Gygli, M.1
Grabner, H.2
Riemenschneider, H.3
Nater, F.4
Gool, L.V.5
-
19
-
-
80052870103
-
What makes an image memorable
-
P. Isola, J. Xiao, A. Torralba, and A. Oliva. What makes an image memorable? In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 1 45-1 52, 2011
-
(2011)
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 145-152
-
-
Isola, P.1
Xiao, J.2
Torralba, A.3
Oliva, A.4
-
20
-
-
0032204063
-
A model of saliency-based visual attention for rapid scene analysis
-
L. Itti, e. Koch, and E. Niebur. A model of saliency-based visual attention for rapid scene analysis. IEEE Transactions on pattern analysis and machine intelligence, 20( 1 1): 1 254-1 259, 1 99 8
-
(1998)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.20
, Issue.11
, pp. 1254-1259
-
-
Itti, L.1
Koch, E.2
Niebur, E.3
-
21
-
-
84959195482
-
-
arXiv preprint arXiv:1502. 04569
-
M. Jas and D. Parikh. Image Specificity. arXiv preprint arXiv:1502. 04569, 2015.
-
(2015)
Image Specificity
, pp. 3-7
-
-
Jas, M.1
Parikh, D.2
-
28
-
-
80052901011
-
Baby talk: Understanding and generating simple image descriptions
-
G. Kulkarni, V. Premraj, S. Dhar, S. Li, Y. Choi, A. e. Berg, and T. L. Berg. Baby talk: Understanding and generating simple image descriptions. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 1 60 1-1 608, 2011.
-
(2011)
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 1601-1608
-
-
Kulkarni, G.1
Premraj, V.2
Dhar, S.3
Li, S.4
Choi, Y.5
Berg, E.A.6
Berg, T.L.7
-
29
-
-
0034324105
-
Next-generation web searches for visual content
-
M. S. Lew. Next-generation web searches for visual content. Computer, 3 3 ( 1 1): 46-5 3, 2000
-
(2000)
Computer
, vol.33
, Issue.11
, pp. 46-53
-
-
Lew, M.S.1
-
30
-
-
49249115835
-
Datadriven enhancement of facial attractiveness
-
T. Leyvand, D. Cohen-Or, G. Dror, and D. Lischinski. Datadriven enhancement of facial attractiveness. ACM Transactions on Graphics (TOG), 27(3): 3 8, 2008
-
(2008)
ACM Transactions on Graphics (TOG)
, vol.27
, Issue.3
, pp. 38
-
-
Leyvand, T.1
Cohen-Or, D.2
Dror, G.3
Lischinski, D.4
-
31
-
-
84862279067
-
Composing simple image descriptions using web-scale n-grams
-
Association for Computational Linguistics
-
S. Li, G. Kulkarni, T. L. Berg, A. C. Berg, and Y. Choi. Composing simple image descriptions using web-scale n-grams. In P roceedings of the Fifteenth Conference on Computational Natural Language Learning, pages 220-228. Association for Computational Linguistics, 201 1
-
(2011)
P Roceedings of the Fifteenth Conference on Computational Natural Language Learning
, pp. 220-228
-
-
Li, S.1
Kulkarni, G.2
Berg, T.L.3
Berg, A.C.4
Choi, Y.5
-
34
-
-
84911442106
-
Visual semantic search: Retrieving videos via complex textual queries
-
D. Lin, S. Fidler, C. Kong, and R. Urtasun. Visual semantic search: Retrieving videos via complex textual queries. In Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on, pages 2657-2664, 20 14
-
(2014)
Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
, pp. 2657-2664
-
-
Lin, D.1
Fidler, S.2
Kong, C.3
Urtasun, R.4
-
35
-
-
84951072975
-
-
arXiv preprint arXiv: 1410. 1090
-
J. Mao, W Xu, Y. Yang, J. Wang, and A. L. Yuille. Explain images with multi modal recurrent neural networks. arXiv preprint arXiv: 1410. 1090, 2014
-
(2014)
Explain Images with Multi Modal Recurrent Neural Networks
-
-
Mao, J.1
Xu, W.2
Yang, Y.3
Wang, J.4
Yuille, A.L.5
-
36
-
-
84976702763
-
WordNet: A lexical database for English
-
3, 6
-
G. A. Miller. WordNet: a lexical database for English. Communications of the ACM, 3 8 ( 1 1): 3 9-4 1, 1 995. 3, 6
-
(1995)
Communications of the ACM
, vol.38
, Issue.11
, pp. 39-41
-
-
Miller, G.A.1
-
37
-
-
85034832841
-
Midge: Generating image descriptions from computer vision detections
-
Association for Computational Linguistics.
-
M. Mitchell, X. Han, J. Dodge, A. Mensch, A. Goyal, A. Berg, K. Yamaguchi, T. Berg, K. Stratos, and H. Daume III. Midge: Generating image descriptions from computer vision detections. In P roceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, pages 747-756. Association for Computational Linguistics, 2012.
-
(2012)
P Roceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
, pp. 747-756
-
-
Mitchell, M.1
Han, X.2
Dodge, J.3
Mensch, A.4
Goyal, A.5
Berg, A.6
Yamaguchi, K.7
Berg, T.8
Stratos, K.9
Daume, H.10
-
39
-
-
85133336275
-
BLEU: A method for automatic evaluation of machine translation
-
Association for Computational Linguistics
-
K. Papineni, S. Roukos, T. Ward, and W-J. Zhu. BLEU: a method for automatic evaluation of machine translation. In Proceedings of the 40th annual meeting on association for computational linguistics, pages 3 1 1-3 1 8. Association for Computational Linguistics, 2002.
-
(2002)
Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
, pp. 311-318
-
-
Papineni, K.1
Roukos, S.2
Ward, T.3
Zhu, W.-J.4
-
40
-
-
80555140075
-
Sci kit-learn: Machine learning in Python
-
F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, et al. Sci kit-learn: Machine learning in Python. The Journal of Machine Learning Research, 1 2: 2825-2830, 2011
-
(2011)
The Journal of Machine Learning Research
, vol.12
, pp. 2825-2830
-
-
Pedregosa, F.1
Varoquaux, G.2
Gramfort, A.3
Michel, V.4
Thirion, B.5
Grisel, O.6
Blondel, M.7
Prettenhofer, P.8
Weiss, R.9
Dubourg, V.10
-
41
-
-
85090348677
-
Collecting image annotations using Amazon' s Mechanical Turk
-
Association for Computational Linguistics
-
c. Rashtchian, P. Young, M. Hodosh, and J. Hockenmaier. Collecting image annotations using Amazon' s Mechanical Turk. In P roceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon 's Mechanical Turk, pages 1 3 9-1 47. Association for Computational Linguistics, 2010
-
(2010)
P Roceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon 'S Mechanical Turk
, pp. 139-147
-
-
Rashtchian, C.1
Young, P.2
Hodosh, M.3
Hockenmaier, J.4
-
42
-
-
34548133551
-
Measuring visual clutter
-
R. Rosenholtz, Y. Li, and L. Nakano. Measuring visual clutter. Journal of vision, 7(2): 1 7, 2007
-
(2007)
Journal of Vision
, vol.7
, Issue.2
, pp. 17
-
-
Rosenholtz, R.1
Li, Y.2
Nakano, L.3
-
50
-
-
77955988947
-
Sun database: Large-scale scene recognition from abbey to zoo
-
J. Xiao, J. Hays, K. A. Ehinger, A. Oliva, and A. Torralba. Sun database: Large-scale scene recognition from abbey to zoo. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 3485-3492, 2010
-
(2010)
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 3485-3492
-
-
Xiao, J.1
Hays, J.2
Ehinger, K.A.3
Oliva, A.4
Torralba, A.5
-
51
-
-
77954862144
-
I2t: Image parsing to text description
-
B. Z. Yao, X. Yang, L. Lin, M. W Lee, and S.-c. Zhu. I2t: Image parsing to text description. P roceedings of the IEEE, 98(8): 1 48 5-1 5 08, 2010
-
(2010)
P Roceedings of the IEEE
, vol.98
, Issue.8
, pp. 1485-1508
-
-
Yao, B.Z.1
Yang, X.2
Lin, L.3
Lee, M.W.4
Zhu, S.-C.5
-
52
-
-
84937964578
-
Learning deep features for scene recognition using places database
-
B. Zhou, A. Lapedriza, J. Xiao, A. Torralba, and A. Oliva. Learning Deep Features for Scene Recognition using Places Database. NIPS, 2014
-
(2014)
NIPS
-
-
Zhou, B.1
Lapedriza, A.2
Xiao, J.3
Torralba, A.4
Oliva, A.5
|