-
1
-
-
77951298115
-
The pascal visual object classes (Voc) challenge
-
Everingham, M., Gool, L., Williams, C.K., Winn, J., Zisserman, A.: The pascal visual object classes (voc) challenge. Int. J. Comput. Vis. 88(2), 303–338 (2010)
-
(2010)
Int. J. Comput. Vis
, vol.88
, Issue.2
, pp. 303-338
-
-
Everingham, M.1
Gool, L.2
Williams, C.K.3
Winn, J.4
Zisserman, A.5
-
2
-
-
80053360686
-
Tree kernel-based relation extraction with context-sensitive structured parse tree information
-
Zhou, G., Zhang, M., Ji, D.H., Zhu, Q.: Tree kernel-based relation extraction with context-sensitive structured parse tree information. EMNLP-CoNLL 2007, 728 (2007)
-
(2007)
Emnlp-Conll
, vol.2007
, pp. 728
-
-
Zhou, G.1
Zhang, M.2
Ji, D.H.3
Zhu, Q.4
-
3
-
-
84859889184
-
Exploring various knowledge in relation extraction
-
GuoDong, Z., Jian, S., Jie, Z., Min, Z.: Exploring various knowledge in relation extraction. In: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, Association for Computational Linguistics, pp. 427–434 (2005)
-
(2005)
Proceedings of the 43Rd Annual Meeting on Association for Computational Linguistics, Association for Computational Linguistics
, pp. 427-434
-
-
Guodong, Z.1
Jian, S.2
Jie, Z.3
Min, Z.4
-
5
-
-
84870715081
-
Semantic compositionality through recursive matrix-vector spaces
-
Socher, R., Huval, B., Manning, C.D., Ng, A.Y.: Semantic compositionality through recursive matrix-vector spaces. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Association for Computational Linguistics, pp. 1201–1211 (2012)
-
(2012)
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Association for Computational Linguistics
, pp. 1201-1211
-
-
Socher, R.1
Huval, B.2
Manning, C.D.3
Ng, A.Y.4
-
7
-
-
85083951332
-
-
arXiv preprint arXiv:1301.3781
-
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)
-
(2013)
Efficient Estimation of Word Representations in Vector Space
-
-
Mikolov, T.1
Chen, K.2
Corrado, G.3
Dean, J.4
-
8
-
-
84959233256
-
Image retrieval using scene graphs
-
Johnson, J., Krishna, R., Stark, M., Li, L.J., Shamma, D.A., Bernstein, M., Fei-Fei, L.: Image retrieval using scene graphs. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2015)
-
(2015)
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
-
-
Johnson, J.1
Krishna, R.2
Stark, M.3
Li, L.J.4
Shamma, D.A.5
Bernstein, M.6
Fei-Fei, L.7
-
9
-
-
84911410734
-
Costa: Co-occurrence statistics for zeroshot classification
-
IEEE
-
Mensink, T., Gavves, E., Snoek, C.G.: Costa: Co-occurrence statistics for zeroshot classification. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2441–2448. IEEE (2014)
-
(2014)
2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 2441-2448
-
-
Mensink, T.1
Gavves, E.2
Snoek, C.G.3
-
10
-
-
80052905403
-
Learning to share visual appearance for multiclass object detection
-
IEEE
-
Salakhutdinov, R., Torralba, A., Tenenbaum, J.: Learning to share visual appearance for multiclass object detection. In: 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1481–1488. IEEE (2011)
-
(2011)
2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 1481-1488
-
-
Salakhutdinov, R.1
Torralba, A.2
Tenenbaum, J.3
-
11
-
-
78149343534
-
Graph cut based inference with co-occurrence statistics
-
Daniilidis, K., Maragos, P., Paragios, N. (eds.), Springer, Heidelberg
-
Ladicky, L., Russell, C., Kohli, P., Torr, P.H.S.: Graph cut based inference with co-occurrence statistics. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part V. LNCS, vol. 6315, pp. 239–253. Springer, Heidelberg (2010)
-
(2010)
ECCV 2010, Part V. LNCS
, vol.6315
, pp. 239-253
-
-
Ladicky, L.1
Russell, C.2
Kohli, P.3
Torr, P.H.S.4
-
12
-
-
50649096757
-
Objects in context
-
IEEE
-
Rabinovich, A., Vedaldi, A., Galleguillos, C., Wiewiora, E., Belongie, S.: Objects in context. In: IEEE 11th International Conference on Computer vision, ICCV 2007, pp. 1–8. IEEE (2007)
-
(2007)
IEEE 11Th International Conference on Computer Vision, ICCV 2007
, pp. 1-8
-
-
Rabinovich, A.1
Vedaldi, A.2
Galleguillos, C.3
Wiewiora, E.4
Belongie, S.5
-
13
-
-
51949110976
-
Object categorization using cooccurrence, location and appearance
-
IEEE
-
Galleguillos, C., Rabinovich, A., Belongie, S.: Object categorization using cooccurrence, location and appearance. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2008, pp. 1–8. IEEE (2008)
-
(2008)
IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2008
, pp. 1-8
-
-
Galleguillos, C.1
Rabinovich, A.2
Belongie, S.3
-
14
-
-
78651403274
-
Context based object categorization: A critical survey
-
Galleguillos, C., Belongie, S.: Context based object categorization: a critical survey. Comput. Vis. Image Underst. 114(6), 712–722 (2010)
-
(2010)
Comput. Vis. Image Underst
, vol.114
, Issue.6
, pp. 712-722
-
-
Galleguillos, C.1
Belongie, S.2
-
15
-
-
77956006912
-
Exploiting hierarchical context on a large database of object categories
-
IEEE
-
Choi, M.J., Lim, J.J., Torralba, A., Willsky, A.S.: Exploiting hierarchical context on a large database of object categories. In: 2010 IEEE Conference on Computer vision and Pattern Recognition (CVPR), pp. 129–136. IEEE (2010)
-
(2010)
2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 129-136
-
-
Choi, M.J.1
Lim, J.J.2
Torralba, A.3
Willsky, A.S.4
-
16
-
-
84911457822
-
Incorporating scene context and object layout into appearance modeling
-
IEEE
-
Izadinia, H., Sadeghi, F., Farhadi, A.: Incorporating scene context and object layout into appearance modeling. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 232–239. IEEE (2014)
-
(2014)
2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 232-239
-
-
Izadinia, H.1
Sadeghi, F.2
Farhadi, A.3
-
18
-
-
33745938597
-
Discovering objects and their location in images
-
IEEE
-
Sivic, J., Russell, B.C., Efros, A., Zisserman, A., Freeman, W.T., et al.: Discovering objects and their location in images. In: Tenth IEEE International Conference on Computer Vision, ICCV 2005, vol. 1, pp. 370–377. IEEE (2005)
-
(2005)
Tenth IEEE International Conference on Computer Vision, ICCV 2005
, vol.1
, pp. 370-377
-
-
Sivic, J.1
Russell, B.C.2
Efros, A.3
Zisserman, A.4
Freeman, W.T.5
-
19
-
-
52449123642
-
Multi-class segmentation with relative location prior
-
Gould, S., Rodgers, J., Cohen, D., Elidan, G., Koller, D.: Multi-class segmentation with relative location prior. Int. J. Comput. Vis. 80(3), 300–316 (2008)
-
(2008)
Int. J. Comput. Vis
, vol.80
, Issue.3
, pp. 300-316
-
-
Gould, S.1
Rodgers, J.2
Cohen, D.3
Elidan, G.4
Koller, D.5
-
20
-
-
84898775239
-
Translating video content to natural language descriptions
-
IEEE
-
Rohrbach, M., Qiu, W., Titov, I., Thater, S., Pinkal, M., Schiele, B.: Translating video content to natural language descriptions. In: 2013 IEEE International Conference on Computer Vision (ICCV), pp. 433–440. IEEE (2013)
-
(2013)
2013 IEEE International Conference on Computer Vision (ICCV)
, pp. 433-440
-
-
Rohrbach, M.1
Qiu, W.2
Titov, I.3
Thater, S.4
Pinkal, M.5
Schiele, B.6
-
22
-
-
80052880806
-
Action recognition from a distributed representation of pose and appearance
-
IEEE
-
Maji, S., Bourdev, L., Malik, J.: Action recognition from a distributed representation of pose and appearance. In: 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3177–3184. IEEE (2011)
-
(2011)
2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 3177-3184
-
-
Maji, S.1
Bourdev, L.2
Malik, J.3
-
23
-
-
69549121743
-
Observing human-object interactions: Using spatial and functional compatibility for recognition
-
Gupta, A., Kembhavi, A., Davis, L.S.: Observing human-object interactions: using spatial and functional compatibility for recognition. IEEE Trans. Pattern Anal. Mach. Intell. 31(10), 1775–1789 (2009)
-
(2009)
IEEE Trans. Pattern Anal. Mach. Intell
, vol.31
, Issue.10
, pp. 1775-1789
-
-
Gupta, A.1
Kembhavi, A.2
Davis, L.S.3
-
25
-
-
84959233994
-
Learning semantic relationships for better action retrieval in images
-
Ramanathan, V., Li, C., Deng, J., Han, W., Li, Z., Gu, K., Song, Y., Bengio, S., Rossenberg, C., Fei-Fei, L.: Learning semantic relationships for better action retrieval in images. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1100–1109 (2015)
-
(2015)
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
, pp. 1100-1109
-
-
Ramanathan, V.1
Li, C.2
Deng, J.3
Han, W.4
Li, Z.5
Gu, K.6
Song, Y.7
Bengio, S.8
Rossenberg, C.9
Fei-Fei, L.10
-
26
-
-
84898773262
-
Youtube2text: Recognizing and describing arbitrary activities using semantic hierarchies and zero-shot recognition
-
IEEE
-
Guadarrama, S., Krishnamoorthy, N., Malkarnenkar, G., Venugopalan, S., Mooney, R., Darrell, T., Saenko, K.: Youtube2text: recognizing and describing arbitrary activities using semantic hierarchies and zero-shot recognition. In: 2013 IEEE International Conference on Computer Vision (ICCV), pp. 2712–2719. IEEE (2013)
-
(2013)
2013 IEEE International Conference on Computer Vision (ICCV)
, pp. 2712-2719
-
-
Guadarrama, S.1
Krishnamoorthy, N.2
Malkarnenkar, G.3
Venugopalan, S.4
Mooney, R.5
Darrell, T.6
Saenko, K.7
-
27
-
-
84898785648
-
Grounding action descriptions in videos
-
Regneri, M., Rohrbach, M., Wetzel, D., Thater, S., Schiele, B., Pinkal, M.: Grounding action descriptions in videos. Trans. Assoc. Comput. Linguist. 1, 25–36 (2013)
-
(2013)
Trans. Assoc. Comput. Linguist
, vol.1
, pp. 25-36
-
-
Regneri, M.1
Rohrbach, M.2
Wetzel, D.3
Thater, S.4
Schiele, B.5
Pinkal, M.6
-
28
-
-
84959932469
-
Integrating language and vision to generate natural language descriptions of videos in the wild
-
August
-
Thomason, J., Venugopalan, S., Guadarrama, S., Saenko, K., Mooney, R.: Integrating language and vision to generate natural language descriptions of videos in the wild. In: Proceedings of the 25th International Conference on Computational Linguistics (COLING), August 2014
-
(2014)
Proceedings of the 25Th International Conference on Computational Linguistics (COLING)
-
-
Thomason, J.1
Venugopalan, S.2
Guadarrama, S.3
Saenko, K.4
Mooney, R.5
-
29
-
-
84866687133
-
Describing the scene as a whole: Joint object detection, scene classification and semantic segmentation
-
IEEE
-
Yao, J., Fidler, S., Urtasun, R.: Describing the scene as a whole: joint object detection, scene classification and semantic segmentation. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 702–709. IEEE (2012)
-
(2012)
2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 702-709
-
-
Yao, J.1
Fidler, S.2
Urtasun, R.3
-
30
-
-
84990043972
-
Baby talk: Understanding and generating image descriptions
-
Kulkarni, G., Premraj, V., Dhar, S., Li, S., Choi, Y., Berg, A.C., Berg, T.L.: Baby talk: understanding and generating image descriptions. In: Proceedings of the 24th CVPR. Citeseer (2011)
-
(2011)
Proceedings of the 24Th CVPR. Citeseer
-
-
Kulkarni, G.1
Premraj, V.2
Dhar, S.3
Li, S.4
Choi, Y.5
Berg, A.C.6
Berg, T.L.7
-
31
-
-
84898772194
-
Learning the visual interpretation of sentences
-
IEEE
-
Zitnick, C.L., Parikh, D., Vanderwende, L.: Learning the visual interpretation of sentences. In: 2013 IEEE International Conference on Computer Vision (ICCV), pp. 1681–1688. IEEE (2013)
-
(2013)
2013 IEEE International Conference on Computer Vision (ICCV)
, pp. 1681-1688
-
-
Zitnick, C.L.1
Parikh, D.2
Vanderwende, L.3
-
32
-
-
57149125139
-
Beyond nouns: Exploiting prepositions and comparative adjectives for learning visual classifiers
-
Forsyth, D., Torr, P., Zisserman, A. (eds.), Springer, Heidelberg
-
Gupta, A., Davis, L.S.: Beyond nouns: exploiting prepositions and comparative adjectives for learning visual classifiers. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part I. LNCS, vol. 5302, pp. 16–29. Springer, Heidelberg (2008)
-
(2008)
ECCV 2008, Part I. LNCS
, vol.5302
, pp. 16-29
-
-
Gupta, A.1
Davis, L.S.2
-
34
-
-
33845596932
-
Using multiple segmentations to discover objects and their extent in image collections
-
IEEE
-
Russell, B.C., Freeman, W.T., Efros, A., Sivic, J., Zisserman, A., et al.: Using multiple segmentations to discover objects and their extent in image collections. In: 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 1605–1614. IEEE (2006)
-
(2006)
2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition
, vol.2
, pp. 1605-1614
-
-
Russell, B.C.1
Freeman, W.T.2
Efros, A.3
Sivic, J.4
Zisserman, A.5
-
35
-
-
78149311145
-
Every picture tells a story: Generating sentences from images
-
Daniilidis, K., Maragos, P., Paragios, N. (eds.), Springer, Heidelberg
-
Farhadi, A., Hejrati, M., Sadeghi, M.A., Young, P., Rashtchian, C., Hockenmaier, J., Forsyth, D.: Every picture tells a story: generating sentences from images. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part IV. LNCS, vol. 6314, pp. 15–29. Springer, Heidelberg (2010)
-
(2010)
ECCV 2010, Part IV. LNCS
, vol.6314
, pp. 15-29
-
-
Farhadi, A.1
Hejrati, M.2
Sadeghi, M.A.3
Young, P.4
Rashtchian, C.5
Hockenmaier, J.6
Forsyth, D.7
-
36
-
-
84866726859
-
Understanding and predicting importance in images
-
IEEE
-
Berg, A.C., Berg, T.L., Daume H., III, Dodge, J., Goyal, A., Han, X., Mensch, A., Mitchell, M., Sood, A., Stratos, K., et al.: Understanding and predicting importance in images. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3562–3569. IEEE (2012)
-
(2012)
2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 3562-3569
-
-
Berg, A.C.1
Berg, T.L.2
Daume, H.3
Dodge, J.4
Goyal, A.5
Han, X.6
Mensch, A.7
Mitchell, M.8
Sood, A.9
Stratos, K.10
-
37
-
-
51349086291
-
Putting objects in perspective
-
Hoiem, D., Efros, A.A., Hebert, M.: Putting objects in perspective. Int. J. Comput. Vis. 80(1), 3–15 (2008)
-
(2008)
Int. J. Comput. Vis
, vol.80
, Issue.1
, pp. 3-15
-
-
Hoiem, D.1
Efros, A.A.2
Hebert, M.3
-
38
-
-
84944115860
-
-
arXiv preprint arXiv:1411.4952
-
Fang, H., Gupta, S., Iandola, F., Srivastava, R., Deng, L., Dollár, P., Gao, J., He, X., Mitchell, M., Platt, J., et al.: From captions to visual concepts and back. arXiv preprint arXiv:1411.4952 (2014)
-
(2014)
From Captions to Visual Concepts and Back
-
-
Fang, H.1
Gupta, S.2
Iandola, F.3
Srivastava, R.4
Deng, L.5
Dollár, P.6
Gao, J.7
He, X.8
Mitchell, M.9
Platt, J.10
-
39
-
-
84990046210
-
Semantic parsing for text to 3d scene generation
-
Chang, A.X., Savva, M., Manning, C.D.: Semantic parsing for text to 3d scene generation. ACL 2014, 17 (2014)
-
(2014)
ACL
, vol.2014
, pp. 17
-
-
Chang, A.X.1
Savva, M.2
Manning, C.D.3
-
40
-
-
85123605149
-
Generating semantically precise scene graphs from textual descriptions for improved image retrieval
-
Schuster, S., Krishna, R., Chang, A., Fei-Fei, L., Manning, C.D.: Generating semantically precise scene graphs from textual descriptions for improved image retrieval. In: Proceedings of the Fourth Workshop on Vision and Language (VL 2015) (2015)
-
(2015)
Proceedings of the Fourth Workshop on Vision and Language (VL 2015)
-
-
Schuster, S.1
Krishna, R.2
Chang, A.3
Fei-Fei, L.4
Manning, C.D.5
-
41
-
-
84887394346
-
Understanding indoor scenes using 3d geometric phrases
-
IEEE
-
Choi, W., Chao, Y.W., Pantofaru, C., Savarese, S.: Understanding indoor scenes using 3d geometric phrases. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 33–40. IEEE (2013)
-
(2013)
2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 33-40
-
-
Choi, W.1
Chao, Y.W.2
Pantofaru, C.3
Savarese, S.4
-
42
-
-
84990070438
-
Visual genome: Connecting language and vision using crowdsourced dense image annotations
-
Krishna, R., Zhu, Y., Groth, O., Johnson, J., Hata, K., Kravitz, J., Chen, S., Kalantidis, Y., Li, L.J., Shamma, D.A., Bernstein, M., Fei-Fei, L.: Visual genome: connecting language and vision using crowdsourced dense image annotations. Int. J. Comput. Vis. (2016)
-
(2016)
Int. J. Comput. Vis
-
-
Krishna, R.1
Zhu, Y.2
Groth, O.3
Johnson, J.4
Hata, K.5
Kravitz, J.6
Chen, S.7
Kalantidis, Y.8
Li, L.J.9
Shamma, D.A.10
Bernstein, M.11
Fei-Fei, L.12
-
43
-
-
84911400494
-
Rich feature hierarchies for accurate object detection and semantic segmentation
-
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: Computer Vision and Pattern Recognition (2014)
-
(2014)
Computer Vision and Pattern Recognition
-
-
Girshick, R.1
Donahue, J.2
Darrell, T.3
Malik, J.4
-
45
-
-
84866688216
-
Measuring the objectness of image windows
-
Alexe, B., Deselaers, T., Ferrari, V.: Measuring the objectness of image windows. IEEE Trans. Pattern Anal. Mach. Intell. 34(11), 2189–2202 (2012)
-
(2012)
IEEE Trans. Pattern Anal. Mach. Intell
, vol.34
, Issue.11
, pp. 2189-2202
-
-
Alexe, B.1
Deselaers, T.2
Ferrari, V.3
-
46
-
-
0035328421
-
Modeling the shape of the scene: A holistic representation of the spatial envelope
-
Oliva, A., Torralba, A.: Modeling the shape of the scene: a holistic representation of the spatial envelope. Int. J. Comput. Vis. 42(3), 145–175 (2001)
-
(2001)
Int. J. Comput. Vis
, vol.42
, Issue.3
, pp. 145-175
-
-
Oliva, A.1
Torralba, A.2
-
47
-
-
3042535216
-
Distinctive image features from scale-invariant keypoints
-
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)
-
(2004)
Int. J. Comput. Vis
, vol.60
, Issue.2
, pp. 91-110
-
-
Lowe, D.G.1
|