-
1
-
-
84973890960
-
In International conference on computer vision (ICCV)
-
Antol, S., Agrawal, A., Lu, J., Mitchell, M., Batra, D., Zitnick, C. L., et al
-
Antol, S., Agrawal, A., Lu, J., Mitchell, M., Batra, D., Zitnick, C. L., et al. (2015). VQA: Visual question answering. In International conference on computer vision (ICCV).
-
(2015)
VQA: Visual question answering
-
-
-
2
-
-
84906498422
-
In European conference on computer vision (pp. 401–416). Springer
-
Antol, S., Zitnick, C. L., & Parikh, D
-
Antol, S., Zitnick, C. L., & Parikh, D. (2014). Zero-shot learning via visual abstraction. In European conference on computer vision (pp. 401–416). Springer.
-
(2014)
Zero-shot learning via visual abstraction
-
-
-
3
-
-
85018060315
-
-
Baker, C. F., Fillmore, C. J., & Lowe, J. B. (1998). The Berkeley framenet project. In Proceedings of the 36th annual meeting of the association for computational linguistics and 17th international conference on computational linguistics—Volume 1, ACL’9PA: Association for Computational Linguistics
-
Baker, C. F., Fillmore, C. J., & Lowe, J. B. (1998). The Berkeley framenet project. In Proceedings of the 36th annual meeting of the association for computational linguistics and 17th international conference on computational linguistics—Volume 1, ACL’98 (pp. 86–90). Stroudsburg, PA: Association for Computational Linguistics.
-
(1998)
Stroudsburg
-
-
-
4
-
-
70350536521
-
Toward never ending language learning
-
Betteridge, J., Carlson, A., Hong, S. A., Hruschka, E. R, Jr., Law, E. L., Mitchell, T. M., et al. (2009). Toward never ending language learning. In AAAI spring symposium: Learning by reading and learning to read (pp. 1–2).
-
(2009)
In AAAI spring symposium: Learning by reading and learning to read
, pp. 1-2
-
-
Betteridge, J.1
Carlson, A.2
Hong, S.A.3
Hruschka, E.R.4
Law, E.L.5
Mitchell, T.M.6
-
5
-
-
85018037997
-
In Proceedings of the COLING/ACL on interactive presentation sessions (pp. 69–72). Association for Computational Linguistics
-
Bird, S
-
Bird, S. (2006). NLTK: The natural language toolkit. In Proceedings of the COLING/ACL on interactive presentation sessions (pp. 69–72). Association for Computational Linguistics.
-
(2006)
NLTK: The natural language toolkit
-
-
-
6
-
-
0000794042
-
Culture and human development: A new look
-
Bruner, J. (1990). Culture and human development: A new look. Human Development, 33(6), 344–355.
-
(1990)
Human Development
, vol.33
, Issue.6
, pp. 344-355
-
-
Bruner, J.1
-
7
-
-
80053262847
-
In Proceedings of the conference on human language technology and empirical methods in natural language processing (pp. 724–731). Association for Computational Linguistics
-
Bunescu, R. C., & Mooney, R. J
-
Bunescu, R. C., & Mooney, R. J. (2005). A shortest path dependency kernel for relation extraction. In Proceedings of the conference on human language technology and empirical methods in natural language processing (pp. 724–731). Association for Computational Linguistics.
-
(2005)
A shortest path dependency kernel for relation extraction
-
-
-
8
-
-
84990046210
-
Semantic parsing for text to 3D scene generation
-
Chang, A. X., Savva, M., & Manning, C. D. (2014). Semantic parsing for text to 3D scene generation. In ACL 2014 (p. 17).
-
(2014)
In ACL
, vol.2014
, pp. 17
-
-
Chang, A.X.1
Savva, M.2
Manning, C.D.3
-
9
-
-
84952349295
-
-
Chen, X., Fang, H., Lin, T.-Y., Vedantam, R., Gupta, S., Dollar, P., et al. (2015). Microsoft COCO captions: Data collection and evaluation server. arXiv:1504.00325.
-
(2015)
Microsoft COCO captions: Data collection and evaluation server. arXiv
, vol.1504
, pp. 00325
-
-
Chen, X.1
Fang, H.2
Lin, T.-Y.3
Vedantam, R.4
Gupta, S.5
Dollar, P.6
-
11
-
-
84926006932
-
In EMNLP (pp. 1025–1035). Citeseer
-
Chen, X., Liu, Z., & Sun, M
-
Chen, X., Liu, Z., & Sun, M. (2014). A unified model for word sense representation and disambiguation. In EMNLP (pp. 1025–1035). Citeseer.
-
(2014)
A unified model for word sense representation and disambiguation
-
-
-
12
-
-
84898803720
-
In 2013 IEEE international conference on computer vision (ICCV) (pp. 1409–1416). IEEE
-
Chen, X., Shrivastava, A., & Gupta, A
-
Chen, X., Shrivastava, A., & Gupta, A. (2013). Neil: Extracting visual knowledge from web data. In 2013 IEEE international conference on computer vision (ICCV) (pp. 1409–1416). IEEE.
-
(2013)
Neil: Extracting visual knowledge from web data
-
-
-
13
-
-
84887394346
-
In 2013 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 33–40). IEEE
-
Choi, W., Chao, Y.-W., Pantofaru, C., & Savarese, S
-
Choi, W., Chao, Y.-W., Pantofaru, C., & Savarese, S. (2013). Understanding indoor scenes using 3D geometric phrases. In 2013 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 33–40). IEEE.
-
(2013)
Understanding indoor scenes using 3D geometric phrases
-
-
-
14
-
-
85018049956
-
In Proceedings of the 42nd annual meeting on association for computational linguistics (p. 423). Association for Computational Linguistics
-
Culotta, A., & Sorensen, J
-
Culotta, A., & Sorensen, J. (2004). Dependency tree kernels for relation extraction. In Proceedings of the 42nd annual meeting on association for computational linguistics (p. 423). Association for Computational Linguistics.
-
(2004)
Dependency tree kernels for relation extraction
-
-
-
16
-
-
85018059159
-
-
Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., & Fei-Fei, L2009 (CVPR 2009) (pp. 248–255). IEEE
-
Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., & Fei-Fei, L. (2009). Imagenet: A large-scale hierarchical image database. In IEEE conference on computer vision and pattern recognition, 2009 (CVPR 2009) (pp. 248–255). IEEE.
-
(2009)
Imagenet: A large-scale hierarchical image database. In IEEE conference on computer vision and pattern recognition
-
-
-
17
-
-
85018068497
-
In Proceedings of the ninth workshop on statistical machine translation. Citeseer
-
Denkowski, M., & Lavie, A
-
Denkowski, M., & Lavie, A. (2014). Meteor universal: Language specific translation evaluation for any target language. In Proceedings of the ninth workshop on statistical machine translation. Citeseer.
-
(2014)
Meteor universal: Language specific translation evaluation for any target language
-
-
-
18
-
-
84857435937
-
Pedestrian detection: An evaluation of the state of the art
-
Dollar, P., Wojek, C., Schiele, B., & Perona, P. (2012). Pedestrian detection: An evaluation of the state of the art. IEEE Transactions on Pattern Analysis and Machine Intelligence, 34(4), 743–761.
-
(2012)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.34
, Issue.4
, pp. 743-761
-
-
Dollar, P.1
Wojek, C.2
Schiele, B.3
Perona, P.4
-
19
-
-
84959236502
-
Long-term recurrent convolutional networks for visual recognition and description
-
Donahue, J., Anne Hendricks, L., Guadarrama, S., Rohrbach, M., Venugopalan, S., Saenko, K., et al. (2015). Long-term recurrent convolutional networks for visual recognition and description. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2625–2634).
-
(2015)
In Proceedings of the IEEE conference on computer vision and pattern recognition
, pp. 2625-2634
-
-
Donahue, J.1
Anne Hendricks, L.2
Guadarrama, S.3
Rohrbach, M.4
Venugopalan, S.5
Saenko, K.6
-
20
-
-
77951298115
-
The pascal visual object classes (VOC) challenge
-
Everingham, M., Van Gool, L., Williams, C. K., Winn, J., & Zisserman, A. (2010). The pascal visual object classes (VOC) challenge. International Journal of Computer Vision, 88(2), 303–338.
-
(2010)
International Journal of Computer Vision
, vol.88
, Issue.2
, pp. 303-338
-
-
Everingham, M.1
Van Gool, L.2
Williams, C.K.3
Winn, J.4
Zisserman, A.5
-
21
-
-
84959250180
-
-
Fang, H., Gupta, S., Iandola, F., Srivastava, R. K., Deng, L., Dollár, P., et al. (2015). From captions to visual concepts and back. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1473–1482).
-
-
-
-
22
-
-
70450207704
-
-
Farhadi, A., Endres, I., Hoiem, D., & Forsyth, D2009 (CVPR 2009) (pp. 1778–1785). IEEE
-
Farhadi, A., Endres, I., Hoiem, D., & Forsyth, D. (2009). Describing objects by their attributes. In IEEE conference on computer vision and pattern recognition, 2009 (CVPR 2009) (pp. 1778–1785). IEEE.
-
(2009)
Describing objects by their attributes. In IEEE conference on computer vision and pattern recognition
-
-
-
23
-
-
78149311145
-
In Computer vision–ECCV 2010 (pp. 15–29). Springer
-
Farhadi, A., Hejrati, M., Sadeghi, M. A., Young, P., Rashtchian, C., Hockenmaier, J., et al
-
Farhadi, A., Hejrati, M., Sadeghi, M. A., Young, P., Rashtchian, C., Hockenmaier, J., et al. (2010). Every picture tells a story: Generating sentences from images. In Computer vision–ECCV 2010 (pp. 15–29). Springer.
-
(2010)
Every picture tells a story: Generating sentences from images
-
-
-
24
-
-
34047174674
-
Learning generative visual models from few training examples: An incremental bayesian approach tested on 101 object categories
-
Fei-Fei, L., Fergus, R., & Perona, P. (2007). Learning generative visual models from few training examples: An incremental bayesian approach tested on 101 object categories. Computer Vision and Image Understanding, 106(1), 59–70.
-
(2007)
Computer Vision and Image Understanding
, vol.106
, Issue.1
, pp. 59-70
-
-
Fei-Fei, L.1
Fergus, R.2
Perona, P.3
-
26
-
-
79953685181
-
Building watson: An overview of the deepqa project
-
Ferrucci, D., Brown, E., Chu-Carroll, J., Fan, J., Gondek, D., Kalyanpur, A. A., et al. (2010). Building watson: An overview of the deepqa project. AI Magazine, 31(3), 59–79.
-
(2010)
AI Magazine
, vol.31
, Issue.3
, pp. 59-79
-
-
Ferrucci, D.1
Brown, E.2
Chu-Carroll, J.3
Fan, J.4
Gondek, D.5
Kalyanpur, A.A.6
Lally, A.7
Murdock, J.W.8
Nyberg, E.9
Prager, J.10
-
27
-
-
84937604790
-
Cognition does not affect perception: Evaluating the evidence for top-down effects
-
Firestone, C., & Scholl, B. J. (2015). Cognition does not affect perception: Evaluating the evidence for top-down effects. Behavioral and brain sciences (pp. 1–72).
-
(2015)
Behavioral and brain sciences
, pp. 1-72
-
-
Firestone, C.1
Scholl, B.J.2
-
28
-
-
0021613150
-
Qualitative process theory
-
Forbus, K. D. (1984). Qualitative process theory. Artificial Intelligence, 24(1), 85–168.
-
(1984)
Artificial Intelligence
, vol.24
, Issue.1
, pp. 85-168
-
-
Forbus, K.D.1
-
29
-
-
84965148420
-
Are you talking to a machine? Dataset and methods for multilingual image question
-
Gao, H., Mao, J., Zhou, J., Huang, Z., Wang, L., & Xu, W. (2015). Are you talking to a machine? Dataset and methods for multilingual image question. In Advances in neural information processing systems (pp. 2296–2304).
-
(2015)
In Advances in neural information processing systems
, pp. 2296-2304
-
-
Gao, H.1
Mao, J.2
Zhou, J.3
Huang, Z.4
Wang, L.5
Xu, W.6
-
30
-
-
84925422907
-
Visual turing test for computer vision systems
-
Geman, D., Geman, S., Hallonquist, N., & Younes, L. (2015). Visual turing test for computer vision systems. Proceedings of the National Academy of Sciences, 112(12), 3618–3623.
-
(2015)
Proceedings of the National Academy of Sciences
, vol.112
, Issue.12
, pp. 3618-3623
-
-
Geman, D.1
Geman, S.2
Hallonquist, N.3
Younes, L.4
-
32
-
-
85119023807
-
In 2014 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 580–587). IEEE
-
Girshick, R., Donahue, J., Darrell, T., & Malik, J
-
Girshick, R., Donahue, J., Darrell, T., & Malik, J. (2014). Rich feature hierarchies for accurate object detection and semantic segmentation. In 2014 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 580–587). IEEE.
-
(2014)
Rich feature hierarchies for accurate object detection and semantic segmentation
-
-
-
33
-
-
84911449570
-
In 2014 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 2489–2496). IEEE
-
Goering, C., Rodner, E., Freytag, A., & Denzler, J
-
Goering, C., Rodner, E., Freytag, A., & Denzler, J. (2014). Nonparametric part transfer for fine-grained recognition. In 2014 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 2489–2496). IEEE.
-
(2014)
Nonparametric part transfer for fine-grained recognition
-
-
-
35
-
-
84859889184
-
In Proceedings of the 43rd annual meeting on association for computational linguistics (pp. 427–434). Association for Computational Linguistics
-
GuoDong, Z., Jian, S., Jie, Z., & Min, Z
-
GuoDong, Z., Jian, S., Jie, Z., & Min, Z. (2005). Exploring various knowledge in relation extraction. In Proceedings of the 43rd annual meeting on association for computational linguistics (pp. 427–434). Association for Computational Linguistics.
-
(2005)
Exploring various knowledge in relation extraction
-
-
-
37
-
-
69549121743
-
Observing human–object interactions: Using spatial and functional compatibility for recognition
-
Gupta, A., Kembhavi, A., & Davis, L. S. (2009). Observing human–object interactions: Using spatial and functional compatibility for recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 31(10), 1775–1789.
-
(2009)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.31
, Issue.10
, pp. 1775-1789
-
-
Gupta, A.1
Kembhavi, A.2
Davis, L.S.3
-
38
-
-
0346759983
-
-
Geneva: Institut pour les études sémantiques et cognitives/Université de Genève
-
Hayes, P. J. (1978). The naive physics manifesto. Geneva: Institut pour les études sémantiques et cognitives/Université de Genève.
-
(1978)
The naive physics manifesto
-
-
Hayes, P.J.1
-
40
-
-
0003074296
-
Support vector machines
-
Hearst, M. A., Dumais, S. T., Osman, E., Platt, J., & Scholkopf, B. (1998). Support vector machines. IEEE Intelligent Systems and their Applications, 13(4), 18–28.
-
(1998)
IEEE Intelligent Systems and their Applications
, vol.13
, Issue.4
, pp. 18-28
-
-
Hearst, M.A.1
Dumais, S.T.2
Osman, E.3
Platt, J.4
Scholkopf, B.5
-
41
-
-
0031573117
-
Long short-term memory
-
Hochreiter, S., & Schmidhuber, J. (1997). Long short-term memory. Neural Computation, 9(8), 1735–1780.
-
(1997)
Neural Computation
, vol.9
, Issue.8
, pp. 1735-1780
-
-
Hochreiter, S.1
Schmidhuber, J.2
-
42
-
-
84883394520
-
Framing image description as a ranking task: Data, models and evaluation metrics
-
Hodosh, M., Young, P., & Hockenmaier, J. (2013). Framing image description as a ranking task: Data, models and evaluation metrics. Journal of Artificial Intelligence Research, 47(1), 853–899.
-
(2013)
Journal of Artificial Intelligence Research
, vol.47
, Issue.1
, pp. 853-899
-
-
Hodosh, M.1
Young, P.2
Hockenmaier, J.3
-
43
-
-
85018080332
-
In Intelligent information processing (pp. 77–89). Springer
-
Hou, C.-S. J., Noy, N. F., & Musen, M. A
-
Hou, C.-S. J., Noy, N. F., & Musen, M. A. (2002). A template-based approach toward acquisition of logical sentences. In Intelligent information processing (pp. 77–89). Springer.
-
(2002)
A template-based approach toward acquisition of logical sentences
-
-
-
44
-
-
85018081584
-
-
Huang, G. B., Mattar, M., Berg, T., & Learned-Miller, Eand recognition
-
Huang, G. B., Mattar, M., Berg, T., & Learned-Miller, E. (2008). Labeled faces in the wild: A database forstudying face recognition in unconstrained environments. In Workshop on faces in ’real-life’ images: Detection, alignment, and recognition.
-
(2008)
Labeled faces in the wild: A database forstudying face recognition in unconstrained environments. In Workshop on faces in ’real-life’ images: Detection, alignment
-
-
-
46
-
-
85119024224
-
In 2014 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 232–239). IEEE
-
Izadinia, H., Sadeghi, F., & Farhadi, A
-
Izadinia, H., Sadeghi, F., & Farhadi, A. (2014). Incorporating scene context and object layout into appearance modeling. In 2014 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 232–239). IEEE.
-
(2014)
Incorporating scene context and object layout into appearance modeling
-
-
-
47
-
-
84959233256
-
In IEEE conference on computer vision and pattern recognition (CVPR)
-
Johnson, J., Krishna, R., Stark, M., Li, L.-J., Shamma, D. A., Bernstein, M., et al
-
Johnson, J., Krishna, R., Stark, M., Li, L.-J., Shamma, D. A., Bernstein, M., et al. (2015). Image retrieval using scene graphs. In IEEE conference on computer vision and pattern recognition (CVPR).
-
(2015)
Image retrieval using scene graphs
-
-
-
50
-
-
85014738021
-
In CHI’16-SIGCHI conference on human factors in computing system
-
Krishna, R., Hata, K., Chen, S., Kravitz, J., Shamma, D. A., Fei-Fei, L., et al
-
Krishna, R., Hata, K., Chen, S., Kravitz, J., Shamma, D. A., Fei-Fei, L., et al. (2016). Embracing error to enable rapid crowdsourcing. In CHI’16-SIGCHI conference on human factors in computing system.
-
(2016)
Embracing error to enable rapid crowdsourcing
-
-
-
52
-
-
70450172710
-
-
Lampert, C. H., Nickisch, H., & Harmeling, S2009 (CVPR 2009) (pp. 951–958). IEEE
-
Lampert, C. H., Nickisch, H., & Harmeling, S. (2009). Learning to detect unseen object classes by between-class attribute transfer. In IEEE conference on computer vision and pattern recognition, 2009 (CVPR 2009) (pp. 951–958). IEEE.
-
(2009)
Learning to detect unseen object classes by between-class attribute transfer. In IEEE conference on computer vision and pattern recognition
-
-
-
53
-
-
0346336042
-
Using corpus statistics and wordnet relations for sense identification
-
Leacock, C., Miller, G. A., & Chodorow, M. (1998). Using corpus statistics and wordnet relations for sense identification. Computational Linguistics, 24(1), 147–165.
-
(1998)
Computational Linguistics
, vol.24
, Issue.1
, pp. 147-165
-
-
Leacock, C.1
Miller, G.A.2
Chodorow, M.3
-
54
-
-
84970028761
-
-
Lebret, R., Pinheiro, P. O., & Collobert, R. (2015). Phrase-based image captioning. arXiv:1502.03671.
-
(2015)
Phrase-based image captioning. arXiv
, vol.1502
, pp. 03671
-
-
Lebret, R.1
Pinheiro, P.O.2
Collobert, R.3
-
55
-
-
84906493406
-
In Computer vision–ECCV 2014 (pp. 740–755). Springer
-
Lin, T.-Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., et al
-
Lin, T.-Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., et al. (2014). Microsoft COCO: Common objects in context. In Computer vision–ECCV 2014 (pp. 740–755). Springer.
-
(2014)
Microsoft COCO: Common objects in context
-
-
-
56
-
-
85018078208
-
In European conference on computer vision (ECCV). IEEE
-
Lu, C., Krishna, R., Bernstein, M., & Fei-Fei, L
-
Lu, C., Krishna, R., Bernstein, M., & Fei-Fei, L. (2016). Visual relationship detection using language priors. In European conference on computer vision (ECCV). IEEE.
-
(2016)
Visual relationship detection using language priors
-
-
-
57
-
-
85018078792
-
-
Ma, L., Lu, Z., & Li, H. (2015). Learning to answer questions from image using convolutional neural network. arXiv:1506.00333.
-
(2015)
Learning to answer questions from image using convolutional neural network. arXiv
, vol.1506
, pp. 00333
-
-
Ma, L.1
Lu, Z.2
Li, H.3
-
58
-
-
84937822746
-
A multi-world approach to question answering about real-world scenes based on uncertain input
-
Malinowski, M., & Fritz, M. (2014). A multi-world approach to question answering about real-world scenes based on uncertain input. In Advances in neural information processing systems (pp. 1682–1690).
-
(2014)
In Advances in neural information processing systems
, pp. 1682-1690
-
-
Malinowski, M.1
Fritz, M.2
-
61
-
-
85117622017
-
The Stanford CoreNLP natural language processing toolkit
-
Manning, C. D., Surdeanu, M., Bauer, J., Finkel, J., Bethard, S. J., & McClosky, D. (2014). The Stanford CoreNLP natural language processing toolkit. In Proceedings of 52nd annual meeting of the association for computational linguistics: system demonstrations (pp. 55–60).
-
(2014)
In Proceedings of 52nd annual meeting of the association for computational linguistics: system demonstrations
, pp. 55-60
-
-
Manning, C.D.1
Surdeanu, M.2
Bauer, J.3
Finkel, J.4
Bethard, S.J.5
McClosky, D.6
-
62
-
-
84951072975
-
-
Mao, J., Xu, W., Yang, Y., Wang, J., & Yuille, A. L. (2014). Explain images with multimodal recurrent neural networks. arXiv:1410.1090.
-
(2014)
Explain images with multimodal recurrent neural networks. arXiv
, vol.1410
, pp. 1090
-
-
Mao, J.1
Xu, W.2
Yang, Y.3
Wang, J.4
Yuille, A.L.5
-
63
-
-
85018069107
-
The senseval-3 English lexical sample task. Association for Computational Linguistics
-
Mihalcea, R., Chklovski, T. A., & Kilgarriff, A. (2004). The senseval-3 English lexical sample task. Association for Computational Linguistics, UNT Digital Library.
-
(2004)
UNT Digital Library
-
-
Mihalcea, R.1
Chklovski, T.A.2
Kilgarriff, A.3
-
64
-
-
85083951332
-
-
Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013). Efficient estimation of word representations in vector space. arXiv:1301.3781.
-
(2013)
Efficient estimation of word representations in vector space. arXiv
, vol.1301
, pp. 3781
-
-
Mikolov, T.1
Chen, K.2
Corrado, G.3
Dean, J.4
-
65
-
-
84976702763
-
Wordnet: a lexical database for english
-
Miller, G. A. (1995). Wordnet: a lexical database for english. Communications of the ACM, 38(11), 39–41.
-
(1995)
Communications of the ACM
, vol.38
, Issue.11
, pp. 39-41
-
-
Miller, G.A.1
-
66
-
-
84877901872
-
Elementary: Large-scale knowledge-base construction via machine learning and statistical inference
-
Niu, F., Zhang, C., Ré, C., & Shavlik, J. (2012). Elementary: Large-scale knowledge-base construction via machine learning and statistical inference. International Journal on Semantic Web and Information Systems (IJSWIS), 8(3), 42–73.
-
(2012)
International Journal on Semantic Web and Information Systems (IJSWIS)
, vol.8
, Issue.3
, pp. 42-73
-
-
Niu, F.1
Zhang, C.2
Ré, C.3
Shavlik, J.4
-
67
-
-
85162522202
-
Im2text: Describing images using 1 million captioned photographs
-
Red Hook, Curran Associates, Inc
-
Ordonez, V., Kulkarni, G., & Berg, T. L. (2011). Im2text: Describing images using 1 million captioned photographs. In J. Shawe-Taylor, R. Zemel, P. Bartlett, F. Pereira, & K. Weinberger (Eds.), Advances in neural information processing systems (Vol. 24, pp. 1143–1151). Red Hook: Curran Associates, Inc.
-
(2011)
Advances in neural information processing systems
, pp. 1143-1151
-
-
Ordonez, V.1
Kulkarni, G.2
Berg, T.L.3
Shawe-Taylor, J.4
Zemel, R.5
Bartlett, P.6
Pereira, F.7
Weinberger, K.8
-
69
-
-
85018067251
-
In Proceedings of the 40th annual meeting on association for computational linguistics (pp. 311–318). Association for Computational Linguistics
-
Papineni, K., Roukos, S., Ward, T., & Zhu, W.-J
-
Papineni, K., Roukos, S., Ward, T., & Zhu, W.-J. (2002). BLEU: A method for automatic evaluation of machine translation. In Proceedings of the 40th annual meeting on association for computational linguistics (pp. 311–318). Association for Computational Linguistics.
-
(2002)
BLEU: A method for automatic evaluation of machine translation
-
-
-
70
-
-
84900870389
-
The sun attribute database: Beyond categories for deeper scene understanding
-
Patterson, G., Xu, C., Su, H., & Hays, J. (2014). The sun attribute database: Beyond categories for deeper scene understanding. International Journal of Computer Vision, 108(1–2), 59–81.
-
(2014)
International Journal of Computer Vision
, vol.108
, Issue.1-2
, pp. 59-81
-
-
Patterson, G.1
Xu, C.2
Su, H.3
Hays, J.4
-
71
-
-
78149348137
-
In Computer vision–ECCV 2010 (pp. 143–156). Springer
-
Perronnin, F., Sánchez, J., & Mensink, T
-
Perronnin, F., Sánchez, J., & Mensink, T. (2010). Improving the fisher kernel for large-scale image classification. In Computer vision–ECCV 2010 (pp. 143–156). Springer.
-
(2010)
Improving the fisher kernel for large-scale image classification
-
-
-
72
-
-
84856142160
-
Weakly supervised learning of interactions between humans and objects
-
Prest, A., Schmid, C., & Ferrari, V. (2012). Weakly supervised learning of interactions between humans and objects. IEEE Transactions on Pattern Analysis and Machine Intelligence, 34(3), 601–614.
-
(2012)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.34
, Issue.3
, pp. 601-614
-
-
Prest, A.1
Schmid, C.2
Ferrari, V.3
-
73
-
-
84959233994
-
Learning semantic relationships for better action retrieval in images
-
Ramanathan, V., Li, C., Deng, J., Han, W., Li, Z., Gu, K., et al. (2015). Learning semantic relationships for better action retrieval in images. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1100–1109).
-
(2015)
In Proceedings of the IEEE conference on computer vision and pattern recognition
, pp. 1100-1109
-
-
Ramanathan, V.1
Li, C.2
Deng, J.3
Han, W.4
Li, Z.5
Gu, K.6
-
74
-
-
85018083692
-
-
Ren, M., Kiros, R., & Zemel, R. (2015a). Image question answering: A visual semantic embedding model and a new dataset. arXiv:1505.02074.
-
(2015)
Image question answering: A visual semantic embedding model and a new dataset. arXiv
, vol.1505
, pp. 02074
-
-
Ren, M.1
Kiros, R.2
Zemel, R.3
-
75
-
-
84960980241
-
Faster r-cnn: Towards real-time object detection with region proposal networks
-
Ren, S., He, K., Girshick, R., & Sun, J. (2015b). Faster r-cnn: Towards real-time object detection with region proposal networks. In Advances in neural information processing systems (pp. 91–99).
-
(2015)
In Advances in neural information processing systems
, pp. 91-99
-
-
Ren, S.1
He, K.2
Girshick, R.3
Sun, J.4
-
76
-
-
85018058894
-
Describing common human visual actions in images. In X. Xie, M. W. Jones, & G. K. L. Tam (Eds.), Proceedings of the British machine vision conference (BMVC 2015) (pp. 52.1–52.12)
-
Ronchi, M. R., & Perona, P. (2015). Describing common human visual actions in images. In X. Xie, M. W. Jones, & G. K. L. Tam (Eds.), Proceedings of the British machine vision conference (BMVC 2015) (pp. 52.1–52.12). BMVA Press.
-
(2015)
BMVA Press
-
-
Ronchi, M.R.1
Perona, P.2
-
78
-
-
84947041871
-
ImageNet large scale visual recognition challenge
-
Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., et al. (2015). ImageNet large scale visual recognition challenge. International journal of computer vision (IJCV) (pp. 1–42).
-
(2015)
International journal of computer vision (IJCV)
, pp. 1-42
-
-
Russakovsky, O.1
Deng, J.2
Su, H.3
Krause, J.4
Satheesh, S.5
Ma, S.6
-
79
-
-
39749186006
-
Labelme: A database and web-based tool for image annotation
-
Russell, B. C., Torralba, A., Murphy, K. P., & Freeman, W. T. (2008). Labelme: A database and web-based tool for image annotation. International Journal of Computer Vision, 77(1–3), 157–173.
-
(2008)
International Journal of Computer Vision
, vol.77
, Issue.1-3
, pp. 157-173
-
-
Russell, B.C.1
Torralba, A.2
Murphy, K.P.3
Freeman, W.T.4
-
81
-
-
80052889458
-
In 2011 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 1745–1752). IEEE
-
Sadeghi, M. A., & Farhadi, A
-
Sadeghi, M. A., & Farhadi, A. (2011). Recognition using visual phrases. In 2011 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 1745–1752). IEEE.
-
(2011)
Recognition using visual phrases
-
-
-
82
-
-
84951013363
-
In Proceedings of the 33rd annual ACM conference on human factors in computing systems (pp. 1621–1630). ACM
-
Salehi, N., Irani, L. C., & Bernstein, M. S
-
Salehi, N., Irani, L. C., & Bernstein, M. S. (2015). We are dynamo: Overcoming stalling and friction in collective action for crowd workers. In Proceedings of the 33rd annual ACM conference on human factors in computing systems (pp. 1621–1630). ACM.
-
(2015)
We are dynamo: Overcoming stalling and friction in collective action for crowd workers
-
-
-
83
-
-
0004016411
-
Scripts, plans, goals, and understanding: An inquiry into human knowledge structures
-
Schank, R. C., & Abelson, R. P. (2013). Scripts, plans, goals, and understanding: An inquiry into human knowledge structures. Hove: Psychology Press.
-
(2013)
Hove: Psychology Press
-
-
Schank, R.C.1
Abelson, R.P.2
-
84
-
-
79952079965
-
VerbNet: A broad-coverage, comprehensive verb lexicon. Ph.D. thesis, University of Pennsylvania, Philadelphia, PA
-
Schuler, K. K. (2005). VerbNet: A broad-coverage, comprehensive verb lexicon. Ph.D. thesis, University of Pennsylvania, Philadelphia, PA, USA (AAI3179808).
-
(2005)
USA (AAI3179808)
-
-
Schuler, K.K.1
-
85
-
-
85018069747
-
In Proceedings of the fourth workshop on vision and language (pp. 70–80). Citeseer
-
Schuster, S., Krishna, R., Chang, A., Fei-Fei, L., & Manning, C. D
-
Schuster, S., Krishna, R., Chang, A., Fei-Fei, L., & Manning, C. D. (2015). Generating semantically precise scene graphs from textual descriptions for improved image retrieval. In Proceedings of the fourth workshop on vision and language (pp. 70–80). Citeseer.
-
(2015)
Generating semantically precise scene graphs from textual descriptions for improved image retrieval
-
-
-
86
-
-
84925321058
-
-
Sermanet, P., Eigen, D., Zhang, X., Mathieu, M., Fergus, R., & LeCun, Y. (2013). Overfeat: Integrated recognition, localization and detection using convolutional networks. arXiv:1312.6229.
-
(2013)
Overfeat: Integrated recognition, localization and detection using convolutional networks. arXiv
, vol.1312
, pp. 6229
-
-
Sermanet, P.1
Eigen, D.2
Zhang, X.3
Mathieu, M.4
Fergus, R.5
LeCun, Y.6
-
87
-
-
85018060178
-
In ECCV
-
Silberman, N., Hoiem, D., Kohli, P., & Fergus, R
-
Silberman, N., Hoiem, D., Kohli, P., & Fergus, R. (2012). Indoor segmentation and support inference from RGBD images. In ECCV.
-
(2012)
Indoor segmentation and support inference from RGBD images
-
-
-
89
-
-
80053360508
-
In Proceedings of the conference on empirical methods in natural language processing (pp. 254–263). Association for Computational Linguistics
-
Snow, R., O’Connor, B., Jurafsky, D., & Ng, A. Y
-
Snow, R., O’Connor, B., Jurafsky, D., & Ng, A. Y. (2008). Cheap and fast—But is it good?: Evaluating non-expert annotations for natural language tasks. In Proceedings of the conference on empirical methods in natural language processing (pp. 254–263). Association for Computational Linguistics.
-
(2008)
Cheap and fast—But is it good?: Evaluating non-expert annotations for natural language tasks
-
-
-
90
-
-
84870715081
-
In Proceedings of the 2012 joint conference on empirical methods in natural language processing and computational natural language learning (pp. 1201–1211). Association for Computational Linguistics
-
Socher, R., Huval, B., Manning, C. D., & Ng, A. Y
-
Socher, R., Huval, B., Manning, C. D., & Ng, A. Y. (2012). Semantic compositionality through recursive matrix-vector spaces. In Proceedings of the 2012 joint conference on empirical methods in natural language processing and computational natural language learning (pp. 1201–1211). Association for Computational Linguistics.
-
(2012)
Semantic compositionality through recursive matrix-vector spaces
-
-
-
91
-
-
85018038390
-
-
Steinbach, M., Karypis, G., Kumar, V., et al. (2000). A comparison of document clustering techniques. In KDD workshop on text mining, Boston (Vol. 400, pp. 525–526).
-
-
-
-
92
-
-
84937522268
-
Going deeper with convolutions
-
Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., et al. (2015). Going deeper with convolutions. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1–9).
-
(2015)
In Proceedings of the IEEE conference on computer vision and pattern recognition
, pp. 1-9
-
-
Szegedy, C.1
Liu, W.2
Jia, Y.3
Sermanet, P.4
Reed, S.5
Anguelov, D.6
-
93
-
-
84957922397
-
YFCC100M: The new data in multimedia research
-
Thomee, B., Shamma, D. A., Friedland, G., Elizalde, B., Ni, K., Poland, D., et al. (2016). YFCC100M: The new data in multimedia research. Communications of the ACM, 59(2), 64–73.
-
(2016)
Communications of the ACM
, vol.59
, Issue.2
, pp. 64-73
-
-
Thomee, B.1
Shamma, D.A.2
Friedland, G.3
Elizalde, B.4
Ni, K.5
Poland, D.6
Borth, D.7
Li, L.-J.8
-
94
-
-
54749092170
-
Million tiny images: A large data set for nonparametric object and scene recognition
-
Torralba, A., Fergus, R., & Freeman, W. T. (2008). 80 million tiny images: A large data set for nonparametric object and scene recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(11), 1958–1970.
-
(2008)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.30
, Issue.11
, pp. 1958-1970
-
-
Torralba, A.1
Fergus, R.2
Freeman, W.T.3
-
95
-
-
8644258401
-
A statistical approach to texture classification from single images
-
Varma, M., & Zisserman, A. (2005). A statistical approach to texture classification from single images. International Journal of Computer Vision, 62(1–2), 61–81.
-
(2005)
International Journal of Computer Vision
, vol.62
, Issue.1-2
, pp. 61-81
-
-
Varma, M.1
Zisserman, A.2
-
97
-
-
84973926486
-
Learning common sense through visual abstraction
-
Vedantam, R., Lin, X., Batra, T., Lawrence Zitnick, C., & Parikh, D. (2015b). Learning common sense through visual abstraction. In Proceedings of the IEEE international conference on computer vision (pp. 2542–2550).
-
(2015)
In Proceedings of the IEEE international conference on computer vision
, pp. 2542-2550
-
-
Vedantam, R.1
Lin, X.2
Batra, T.3
Lawrence Zitnick, C.4
Parikh, D.5
-
98
-
-
84946747440
-
Show and tell: A neural image caption generator
-
Vinyals, O., Toshev, A., Bengio, S., & Erhan, D. (2015). Show and tell: A neural image caption generator. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 3156–3164).
-
(2015)
In Proceedings of the IEEE conference on computer vision and pattern recognition
, pp. 3156-3164
-
-
Vinyals, O.1
Toshev, A.2
Bengio, S.3
Erhan, D.4
-
99
-
-
84878084353
-
The Caltech-UCSD birds-200-2011 dataset. Technical Report CNS-TR-2011-001
-
Wah, C., Branson, S., Welinder, P., Perona, P., & Belongie, S. (2011). The Caltech-UCSD birds-200-2011 dataset. Technical Report CNS-TR-2011-001, California Institute of Technology.
-
(2011)
California Institute of Technology
-
-
Wah, C.1
Branson, S.2
Welinder, P.3
Perona, P.4
Belongie, S.5
-
100
-
-
77955988947
-
In 2010 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 3485–3492). IEEE
-
Xiao, J., Hays, J., Ehinger, K., Oliva, A., Torralba, A., et al
-
Xiao, J., Hays, J., Ehinger, K., Oliva, A., Torralba, A., et al. (2010). Sun database: Large-scale scene recognition from abbey to zoo. In 2010 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 3485–3492). IEEE.
-
(2010)
Sun database: Large-scale scene recognition from abbey to zoo
-
-
-
101
-
-
84939821074
-
Show, attend and tell: Neural image caption generation with visual attention
-
Xu, K., Ba, J., Kiros, R., Cho, K., Courville, A. C., Salakhutdinov, R., Zemel, R. S., and Bengio, Y. (2015). Show, attend and tell: Neural image caption generation with visual attention. CoRR. arXiv:1502.03044.
-
(2015)
CoRR. arXiv
, vol.1502
, pp. 03044
-
-
Xu, K.1
Ba, J.2
Kiros, R.3
Cho, K.4
Courville, A.C.5
Salakhutdinov, R.6
Zemel, R.S.7
Bengio, Y.8
-
102
-
-
84866704901
-
In 2012 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 3522–3529). IEEE
-
Yang, Y., Baker, S., Kannan, A., & Ramanan, D
-
Yang, Y., Baker, S., Kannan, A., & Ramanan, D. (2012). Recognizing proxemics in personal photos. In 2012 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 3522–3529). IEEE.
-
(2012)
Recognizing proxemics in personal photos
-
-
-
104
-
-
38349066535
-
-
Yao, B., Yang, X., & Zhu, S.-Cannotation tool and benchmarks. In Energy minimization methods in computer vision and pattern recognition (pp. 169–183). Springer
-
Yao, B., Yang, X., & Zhu, S.-C. (2007). Introduction to a large-scale general purpose ground truth database: methodology, annotation tool and benchmarks. In Energy minimization methods in computer vision and pattern recognition (pp. 169–183). Springer.
-
(2007)
Introduction to a large-scale general purpose ground truth database: methodology
-
-
-
105
-
-
84906494296
-
From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions
-
Young, P., Lai, A., Hodosh, M., & Hockenmaier, J. (2014). From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions. Transactions of the Association for Computational Linguistics, 2, 67–78.
-
(2014)
Transactions of the Association for Computational Linguistics
, vol.2
, pp. 67-78
-
-
Young, P.1
Lai, A.2
Hodosh, M.3
Hockenmaier, J.4
-
106
-
-
85018070789
-
-
Yu, L., Park, E., Berg, A. C., & Berg, T. L. (2015). Visual madlibs: Fill in the blank image generation and question answering. arXiv:1506.00278.
-
(2015)
Visual madlibs: Fill in the blank image generation and question answering. arXiv
, vol.1506
, pp. 00278
-
-
Yu, L.1
Park, E.2
Berg, A.C.3
Berg, T.L.4
-
107
-
-
84959862537
-
Relation classification via convolutional deep neural network
-
Zeng, D., Liu, K., Lai, S., Zhou, G., & Zhao, J. (2014). Relation classification via convolutional deep neural network. In Proceedings of COLING (pp. 2335–2344).
-
(2014)
In Proceedings of COLING
, pp. 2335-2344
-
-
Zeng, D.1
Liu, K.2
Lai, S.3
Zhou, G.4
Zhao, J.5
-
108
-
-
80053360686
-
Tree kernel-based relation extraction with context-sensitive structured parse tree information
-
Zhou, G., Zhang, M., Ji, D. H., & Zhu, Q. (2007). Tree kernel-based relation extraction with context-sensitive structured parse tree information. In EMNLP-CoNLL 2007 (p. 728).
-
(2007)
In EMNLP-CoNLL
, vol.2007
, pp. 728
-
-
Zhou, G.1
Zhang, M.2
Ji, D.H.3
Zhu, Q.4
-
109
-
-
84865644621
-
In Proceedings of the 18th international conference on world wide web (pp. 101–110). ACM
-
Zhu, J., Nie, Z., Liu, X., Zhang, B., & Wen, J.-R
-
Zhu, J., Nie, Z., Liu, X., Zhang, B., & Wen, J.-R. (2009). Statsnowball: A statistical approach to extracting entity relationships. In Proceedings of the 18th international conference on world wide web (pp. 101–110). ACM.
-
(2009)
Statsnowball: A statistical approach to extracting entity relationships
-
-
-
110
-
-
85018036485
-
In European conference on computer vision
-
Zhu, Y., Fathi, A., & Fei-Fei, L
-
Zhu, Y., Fathi, A., & Fei-Fei, L. (2014). Reasoning about object affordances in a knowledge base representation. In European conference on computer vision.
-
(2014)
Reasoning about object affordances in a knowledge base representation
-
-
-
111
-
-
85009429007
-
-
Zhu, Y., Zhang, C., Ré, C., & Fei-Fei, L. (2015). Building a large-scale multimodal knowledge base system for answering visual queries. arXiv:1507.05670.
-
(2015)
Building a large-scale multimodal knowledge base system for answering visual queries. arXiv
, vol.1507
, pp. 05670
-
-
Zhu, Y.1
Zhang, C.2
Ré, C.3
Fei-Fei, L.4
-
112
-
-
84887338442
-
In 2013 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 3009–3016). IEEE
-
Zitnick, C. L., & Parikh, D
-
Zitnick, C. L., & Parikh, D. (2013). Bringing semantics into focus using visual abstraction. In 2013 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 3009–3016). IEEE.
-
(2013)
Bringing semantics into focus using visual abstraction
-
-
|