-
1
-
-
84943794851
-
Leveraging linguistic structure for open domain information extraction
-
G. Angeli, M. J. Premkumar, and C. D. Manning. Leveraging linguistic structure for open domain information extraction. In ACL, 2015
-
(2015)
ACL
-
-
Angeli, G.1
Premkumar, M.J.2
Manning, C.D.3
-
2
-
-
85041922388
-
Learning to generalize to new compositions in image understanding
-
Y. Atzmon, J. Berant, V. Kazemi, A. Globerson, and G. Chechik. Learning to generalize to new compositions in image understanding. In EMNLP, 2016
-
(2016)
EMNLP
-
-
Atzmon, Y.1
Berant, J.2
Kazemi, V.3
Globerson, A.4
Chechik, G.5
-
3
-
-
84986269551
-
Weakly supervised deep detection networks
-
H. Bilen and A. Vedaldi. Weakly supervised deep detection networks. In CVPR, 2016
-
(2016)
CVPR
-
-
Bilen, H.1
Vedaldi, A.2
-
4
-
-
84973868179
-
HICO: A benchmark for recognizing human-object interactions in images
-
Y.-W. Chao, Z. Wang, Y. He, J. Wang, and J. Deng. HICO: A benchmark for recognizing human-object interactions in images. In ICCV, 2015
-
(2015)
ICCV
-
-
Chao, Y.-W.1
Wang, Z.2
He, Y.3
Wang, J.4
Deng, J.5
-
5
-
-
85029348551
-
SCA-CNN: Spatial and channel-wise attention in convolutional networks for image captioning
-
L. Chen, H. Zhang, J. Xiao, L. Nie, J. Shao, W. Liu, and T.-S. Chua. SCA-CNN: Spatial and channel-wise attention in convolutional networks for image captioning. In CVPR, 2017
-
(2017)
CVPR
-
-
Chen, L.1
Zhang, H.2
Xiao, J.3
Nie, L.4
Shao, J.5
Liu, W.6
Chua, T.-S.7
-
6
-
-
85003782026
-
Weakly supervised object localization with multi-fold multiple instance learning
-
R. G. Cinbis, J. Verbeek, and C. Schmid. Weakly supervised object localization with multi-fold multiple instance learning. TPAMI, 2017
-
(2017)
TPAMI
-
-
Cinbis, R.G.1
Verbeek, J.2
Schmid, C.3
-
7
-
-
85041892861
-
Detecting visual relationships with deep relational networks
-
B. Dai, Y. Zhang, and D. Lin. Detecting visual relationships with deep relational networks. In CVPR, 2017
-
(2017)
CVPR
-
-
Dai, B.1
Zhang, Y.2
Lin, D.3
-
8
-
-
84877748784
-
Detecting actions, poses, and objects with relational phraselets
-
C. Desai and D. Ramanan. Detecting actions, poses, and objects with relational phraselets. In ECCV, 2012
-
(2012)
ECCV
-
-
Desai, C.1
Ramanan, D.2
-
9
-
-
84898798806
-
Restoring an image taken through a window covered with dirt or rain
-
D. Eigen, D. Krishnan, and R. Fergus. Restoring an image taken through a window covered with dirt or rain. In ICCV, 2013
-
(2013)
ICCV
-
-
Eigen, D.1
Krishnan, D.2
Fergus, R.3
-
10
-
-
85029359197
-
Fast R-CNN
-
R. Girshick. Fast R-CNN. In ICCV, 2015
-
(2015)
ICCV
-
-
Girshick, R.1
-
11
-
-
70450155469
-
Beyond nouns: Exploiting prepositions and comparative adjectives for learning visual classifiers
-
A. Gupta and L. S. Davis. Beyond nouns: Exploiting prepositions and comparative adjectives for learning visual classifiers. In ECCV, 2008
-
(2008)
ECCV
-
-
Gupta, A.1
Davis, L.S.2
-
12
-
-
69549121743
-
Observing human-object interactions: Using spatial and functional compatibility for recognition
-
A. Gupta, A. Kembhavi, and L. S. Davis. Observing human-object interactions: Using spatial and functional compatibility for recognition. TPAMI, 2009
-
(2009)
TPAMI
-
-
Gupta, A.1
Kembhavi, A.2
Davis, L.S.3
-
14
-
-
85041917330
-
-
Modeling relationships in referential expressions with compositional modular networks
-
R. Hu, M. Rohrbach, J. Andreas, T. Darrell, and K. Saenko. Modeling relationships in referential expressions with compositional modular networks. arXiv preprint arXiv:1611.09978, 2016
-
(2016)
arXiv preprint arXiv:1611.09978
-
-
Hu, R.1
Rohrbach, M.2
Andreas, J.3
Darrell, T.4
Saenko, K.5
-
15
-
-
85041929043
-
Modeling relationships in referential expressions with compositional modular networks
-
R. Hu, M. Rohrbach, J. Andreas, T. Darrell, and K. Saenko. Modeling relationships in referential expressions with compositional modular networks. In CVPR, 2017
-
(2017)
CVPR
-
-
Hu, R.1
Rohrbach, M.2
Andreas, J.3
Darrell, T.4
Saenko, K.5
-
16
-
-
85040949959
-
Deep self-taught learning for weakly supervised object localization
-
Z. Jie, Y. Wei, X. Jin, J. Feng, and W. Liu. Deep self-taught learning for weakly supervised object localization. In CVPR, 2017
-
(2017)
CVPR
-
-
Jie, Z.1
Wei, Y.2
Jin, X.3
Feng, J.4
Liu, W.5
-
17
-
-
84959233256
-
Image retrieval using scene graphs
-
J. Johnson, R. Krishna, M. Stark, L.-J. Li, D. A. Shamma, M. S. Bernstein, and L. Fei-Fei. Image retrieval using scene graphs. In CVPR, 2015
-
(2015)
CVPR
-
-
Johnson, J.1
Krishna, R.2
Stark, M.3
Li, L.-J.4
Shamma, D.A.5
Bernstein, M.S.6
Fei-Fei, L.7
-
18
-
-
85021823117
-
ContextLocNet: Context-aware deep network models for weakly supervised localization
-
V. Kantorov, M. Oquab, M. Cho, and I. Laptev. ContextLocNet: Context-aware deep network models for weakly supervised localization. In ECCV, 2016
-
(2016)
ECCV
-
-
Kantorov, V.1
Oquab, M.2
Cho, M.3
Laptev, I.4
-
20
-
-
84990070438
-
Visual genome: Connecting language and vision using crowdsourced dense image annotations
-
R. Krishna, Y. Zhu, O. Groth, J. Johnson, K. Hata, J. Kravitz, S. Chen, Y. Kalantidis, L.-J. Li, D. A. Shamma, et al. Visual genome: Connecting language and vision using crowdsourced dense image annotations. IJCV, 2016
-
(2016)
IJCV
-
-
Krishna, R.1
Zhu, Y.2
Groth, O.3
Johnson, J.4
Hata, K.5
Kravitz, J.6
Chen, S.7
Kalantidis, Y.8
Li, L.-J.9
Shamma, D.A.10
-
21
-
-
85161967298
-
Self-paced learning for latent variable models
-
M. P. Kumar, B. Packer, and D. Koller. Self-paced learning for latent variable models. In NIPS, 2010
-
(2010)
NIPS
-
-
Kumar, M.P.1
Packer, B.2
Koller, D.3
-
22
-
-
84986317248
-
Weakly supervised object localization with progressive domain adaptation
-
D. Li, J.-B. Huang, Y. Li, S. Wang, and M.-H. Yang. Weakly supervised object localization with progressive domain adaptation. In CVPR, 2016
-
(2016)
CVPR
-
-
Li, D.1
Huang, J.-B.2
Li, Y.3
Wang, S.4
Yang, M.-H.5
-
23
-
-
85018938177
-
R-FCN: Object detection via region-based fully convolutional networks
-
Y. Li, K. He, J. Sun, et al. R-FCN: Object detection via region-based fully convolutional networks. In NIPS, 2016
-
(2016)
NIPS
-
-
Li, Y.1
He, K.2
Sun, J.3
-
24
-
-
85041906062
-
ViP-CNN: Visual phrase guided convolutional neural network
-
Y. Li, W. Ouyang, and X. Wang. ViP-CNN: Visual phrase guided convolutional neural network. In CVPR, 2017
-
(2017)
CVPR
-
-
Li, Y.1
Ouyang, W.2
Wang, X.3
-
25
-
-
85041915815
-
Scene graph generation from objects, phrases and region captions
-
Y. Li, W. Ouyang, B. Zhou, K. Wang, and X. Wang. Scene graph generation from objects, phrases and region captions. In ICCV, 2017
-
(2017)
ICCV
-
-
Li, Y.1
Ouyang, W.2
Zhou, B.3
Wang, K.4
Wang, X.5
-
26
-
-
84937834115
-
Microsoft COCO: Common objects in context
-
T.-Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollar, and C. L. Zitnick. Microsoft COCO: Common objects in context. In ECCV, 2014
-
(2014)
ECCV
-
-
Lin, T.-Y.1
Maire, M.2
Belongie, S.3
Hays, J.4
Perona, P.5
Ramanan, D.6
Dollar, P.7
Zitnick, C.L.8
-
27
-
-
85035233030
-
Surveillance video parsing with single frame supervision
-
S. Liu, C. Wang, R. Qian, H. Yu, R. Bao, and Y. Sun. Surveillance video parsing with single frame supervision. In CVPR, 2017
-
(2017)
CVPR
-
-
Liu, S.1
Wang, C.2
Qian, R.3
Yu, H.4
Bao, R.5
Sun, Y.6
-
28
-
-
84959205572
-
Fully convolutional networks for semantic segmentation
-
J. Long, E. Shelhamer, and T. Darrell. Fully convolutional networks for semantic segmentation. In CVPR, 2015
-
(2015)
CVPR
-
-
Long, J.1
Shelhamer, E.2
Darrell, T.3
-
30
-
-
84898935332
-
A framework for multiple-instance learning
-
O. Maron and T. Lozano-Perez. A framework for multiple-instance learning. In NIPS, 1998
-
(1998)
NIPS
-
-
Maron, O.1
Lozano-Perez, T.2
-
31
-
-
85021826252
-
Modeling context between objects for referring expression understanding
-
V. K. Nagaraja, V. I. Morariu, and L. S. Davis. Modeling context between objects for referring expression understanding. In ECCV, 2016
-
(2016)
ECCV
-
-
Nagaraja, V.K.1
Morariu, V.I.2
Davis, L.S.3
-
32
-
-
84856142160
-
Weakly supervised learning of interactions between humans and objects
-
A. Prest, C. Schmid, and V. Ferrari. Weakly supervised learning of interactions between humans and objects. TPAMI, 2012
-
(2012)
TPAMI
-
-
Prest, A.1
Schmid, C.2
Ferrari, V.3
-
33
-
-
84959233994
-
Learning semantic relationships for better action retrieval in images
-
V. Ramanathan, C. Li, J. Deng, W. Han, Z. Li, K. Gu, Y. Song, S. Bengio, C. Rosenberg, and L. Fei-Fei. Learning semantic relationships for better action retrieval in images. In CVPR, 2015
-
(2015)
CVPR
-
-
Ramanathan, V.1
Li, C.2
Deng, J.3
Han, W.4
Li, Z.5
Gu, K.6
Song, Y.7
Bengio, S.8
Rosenberg, C.9
Fei-Fei, L.10
-
34
-
-
84960980241
-
Faster R-CNN: Towards real-time object detection with region proposal networks
-
S. Ren, K. He, R. Girshick, and J. Sun. Faster R-CNN: Towards real-time object detection with region proposal networks. In NIPS, 2015
-
(2015)
NIPS
-
-
Ren, S.1
He, K.2
Girshick, R.3
Sun, J.4
-
35
-
-
84990024294
-
Grounding of textual phrases in images by reconstruction
-
A. Rohrbach, M. Rohrbach, R. Hu, T. Darrell, and B. Schiele. Grounding of textual phrases in images by reconstruction. In ECCV, 2016
-
(2016)
ECCV
-
-
Rohrbach, A.1
Rohrbach, M.2
Hu, R.3
Darrell, T.4
Schiele, B.5
-
36
-
-
80052889458
-
Recognition using visual phrases
-
M. A. Sadeghi and A. Farhadi. Recognition using visual phrases. In CVPR, 2011
-
(2011)
CVPR
-
-
Sadeghi, M.A.1
Farhadi, A.2
-
37
-
-
85123605149
-
Generating semantically precise scene graphs from textual descriptions for improved image retrieval
-
S. Schuster, R. Krishna, A. Chang, L. Fei-Fei, and C. D. Manning. Generating semantically precise scene graphs from textual descriptions for improved image retrieval. In Workshop on Vision and Language, 2015
-
(2015)
Workshop on Vision and Language
-
-
Schuster, S.1
Krishna, R.2
Chang, A.3
Fei-Fei, L.4
Manning, C.D.5
-
38
-
-
84919792468
-
On learning to localize objects with minimal supervision
-
H. O. Song, R. B. Girshick, S. Jegelka, J. Mairal, Z. Harchaoui, T. Darrell, et al. On learning to localize objects with minimal supervision. In ICML, pages 1611-1619, 2014
-
(2014)
ICML
, pp. 1611-1619
-
-
Song, H.O.1
Girshick, R.B.2
Jegelka, S.3
Mairal, J.4
Harchaoui, Z.5
Darrell, T.6
-
39
-
-
84937522268
-
Going deeper with convolutions
-
C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich. Going deeper with convolutions. In CVPR, 2015
-
(2015)
CVPR
-
-
Szegedy, C.1
Liu, W.2
Jia, Y.3
Sermanet, P.4
Reed, S.5
Anguelov, D.6
Erhan, D.7
Vanhoucke, V.8
Rabinovich, A.9
-
40
-
-
84957922397
-
YFCC100M: The new data in multimedia research
-
B. Thomee, D. A. Shamma, G. Friedland, B. Elizalde, K. Ni, D. Poland, D. Borth, and L.-J. Li. YFCC100M: The new data in multimedia research. Communications of the ACM, 2016
-
(2016)
Communications of the ACM
-
-
Thomee, B.1
Shamma, D.A.2
Friedland, G.3
Elizalde, B.4
Ni, K.5
Poland, D.6
Borth, D.7
Li, L.-J.8
-
43
-
-
84956604127
-
Weakly supervised object localization with latent category learning
-
C. Wang, W. Ren, K. Huang, and T. Tan. Weakly supervised object localization with latent category learning. In ECCV, 2014
-
(2014)
ECCV
-
-
Wang, C.1
Ren, W.2
Huang, K.3
Tan, T.4
-
44
-
-
84986320870
-
Ask me anything: Free-form visual question answering based on knowledge from external sources
-
Q. Wu, P. Wang, C. Shen, A. Dick, and A. van den Hengel. Ask me anything: Free-form visual question answering based on knowledge from external sources. In CVPR, 2016
-
(2016)
CVPR
-
-
Wu, Q.1
Wang, P.2
Shen, C.3
Dick, A.4
van den Hengel, A.5
-
45
-
-
77955988492
-
Modeling mutual context of object and human pose in human-object interaction activities
-
B. Yao and L. Fei-Fei. Modeling mutual context of object and human pose in human-object interaction activities. In CVPR, 2010
-
(2010)
CVPR
-
-
Yao, B.1
Fei-Fei, L.2
-
46
-
-
84986247420
-
Situation recognition: Visual semantic role labeling for image understanding
-
M. Yatskar, L. Zettlemoyer, and A. Farhadi. Situation recognition: Visual semantic role labeling for image understanding. In CVPR, 2016
-
(2016)
CVPR
-
-
Yatskar, M.1
Zettlemoyer, L.2
Farhadi, A.3
-
47
-
-
84990061297
-
Modeling context in referring expressions
-
L. Yu, P. Poirson, S. Yang, A. C. Berg, and T. L. Berg. Modeling context in referring expressions. In ECCV, 2016
-
(2016)
ECCV
-
-
Yu, L.1
Poirson, P.2
Yang, S.3
Berg, A.C.4
Berg, T.L.5
-
48
-
-
85029388674
-
Visual translation embedding network for visual relation detection
-
H. Zhang, Z. Kyaw, S.-F. Chang, and T.-S. Chua. Visual translation embedding network for visual relation detection. In CVPR, 2017
-
(2017)
CVPR
-
-
Zhang, H.1
Kyaw, Z.2
Chang, S.-F.3
Chua, T.-S.4
-
49
-
-
85041918005
-
Relationship proposal networks
-
J. Zhang, M. Elhoseiny, S. Cohen, W. Chang, and A. Elgammal. Relationship proposal networks. In CVPR, 2017
-
(2017)
CVPR
-
-
Zhang, J.1
Elhoseiny, M.2
Cohen, S.3
Chang, W.4
Elgammal, A.5
-
50
-
-
84952018709
-
Edge boxes: Locating object proposals from edges
-
C. L. Zitnick and P. Dollar. Edge boxes: Locating object proposals from edges. In ECCV, 2014
-
(2014)
ECCV
-
-
Zitnick, C.L.1
Dollar, P.2