SCOPUS 정보 검색 플랫폼

International Journal of Computer Vision

Volumn 123, Issue 1, 2017, Pages 32-73

Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations

(12) Krishna, Ranjay a Zhu, Yuke a Groth, Oliver b Johnson, Justin a Hata, Kenji a Kravitz, Joshua a Chen, Stephanie a Kalantidis, Yannis c Li, Li Jia d Shamma, David A e Bernstein, Michael S a Fei Fei, Li a

a STANFORD UNIVERSITY (United States)

b DRESDEN UNIVERSITY OF TECHNOLOGY (Germany)

c YAHOO INC (United States)

d SNAP INC (United States)

e CWI (Netherlands)

Author keywords

Attributes; Computer vision; Crowdsourcing; Dataset; Image; Knowledge; Language; Objects; Question answering; Relationships; Scene graph

Indexed keywords

COGNITIVE SYSTEMS; COMPUTER VISION; CROWDSOURCING; GENES; VEHICLES;

ATTRIBUTES; DATASET; IMAGE; KNOWLEDGE; LANGUAGE; OBJECTS; QUESTION ANSWERING; RELATIONSHIPS; SCENE GRAPH;

VISUAL LANGUAGES;

EID: 85011596790 PISSN: 09205691 EISSN: 15731405 Source Type: Journal
DOI: 10.1007/s11263-016-0981-7 Document Type: Article

Times cited : (5114)

References (112)

1
- 84973890960
- In International conference on computer vision (ICCV)
- Antol, S., Agrawal, A., Lu, J., Mitchell, M., Batra, D., Zitnick, C. L., et al
- Antol, S., Agrawal, A., Lu, J., Mitchell, M., Batra, D., Zitnick, C. L., et al. (2015). VQA: Visual question answering. In International conference on computer vision (ICCV).
- (2015) VQA: Visual question answering

2
- 84906498422
- In European conference on computer vision (pp. 401–416). Springer
- Antol, S., Zitnick, C. L., & Parikh, D
- Antol, S., Zitnick, C. L., & Parikh, D. (2014). Zero-shot learning via visual abstraction. In European conference on computer vision (pp. 401–416). Springer.
- (2014) Zero-shot learning via visual abstraction

3
- 85018060315
- Baker, C. F., Fillmore, C. J., & Lowe, J. B. (1998). The Berkeley framenet project. In Proceedings of the 36th annual meeting of the association for computational linguistics and 17th international conference on computational linguistics—Volume 1, ACL’9PA: Association for Computational Linguistics
- Baker, C. F., Fillmore, C. J., & Lowe, J. B. (1998). The Berkeley framenet project. In Proceedings of the 36th annual meeting of the association for computational linguistics and 17th international conference on computational linguistics—Volume 1, ACL’98 (pp. 86–90). Stroudsburg, PA: Association for Computational Linguistics.
- (1998) Stroudsburg

4
- 70350536521
- Toward never ending language learning
- Betteridge, J., Carlson, A., Hong, S. A., Hruschka, E. R, Jr., Law, E. L., Mitchell, T. M., et al. (2009). Toward never ending language learning. In AAAI spring symposium: Learning by reading and learning to read (pp. 1–2).
- (2009) In AAAI spring symposium: Learning by reading and learning to read , pp. 1-2
- Betteridge, J.¹ Carlson, A.² Hong, S.A.³ Hruschka, E.R.⁴ Law, E.L.⁵ Mitchell, T.M.⁶

5
- 85018037997
- In Proceedings of the COLING/ACL on interactive presentation sessions (pp. 69–72). Association for Computational Linguistics
- Bird, S
- Bird, S. (2006). NLTK: The natural language toolkit. In Proceedings of the COLING/ACL on interactive presentation sessions (pp. 69–72). Association for Computational Linguistics.
- (2006) NLTK: The natural language toolkit

6
- 0000794042
- Culture and human development: A new look
- Bruner, J. (1990). Culture and human development: A new look. Human Development, 33(6), 344–355.
- (1990) Human Development , vol.33 , Issue.6 , pp. 344-355
- Bruner, J.¹

7
- 80053262847
- In Proceedings of the conference on human language technology and empirical methods in natural language processing (pp. 724–731). Association for Computational Linguistics
- Bunescu, R. C., & Mooney, R. J
- Bunescu, R. C., & Mooney, R. J. (2005). A shortest path dependency kernel for relation extraction. In Proceedings of the conference on human language technology and empirical methods in natural language processing (pp. 724–731). Association for Computational Linguistics.
- (2005) A shortest path dependency kernel for relation extraction

8
- 84990046210
- Semantic parsing for text to 3D scene generation
- Chang, A. X., Savva, M., & Manning, C. D. (2014). Semantic parsing for text to 3D scene generation. In ACL 2014 (p. 17).
- (2014) In ACL , vol.2014 , pp. 17
- Chang, A.X.¹ Savva, M.² Manning, C.D.³

9
- 84952349295
- Chen, X., Fang, H., Lin, T.-Y., Vedantam, R., Gupta, S., Dollar, P., et al. (2015). Microsoft COCO captions: Data collection and evaluation server. arXiv:1504.00325.
- (2015) Microsoft COCO captions: Data collection and evaluation server. arXiv , vol.1504 , pp. 00325
- Chen, X.¹ Fang, H.² Lin, T.-Y.³ Vedantam, R.⁴ Gupta, S.⁵ Dollar, P.⁶

10
- 84957029470
- Mind’s eye: A recurrent visual representation for image caption generation
- Chen, X., & Lawrence Zitnick, C. (2015). Mind’s eye: A recurrent visual representation for image caption generation. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2422–2431).
- (2015) In Proceedings of the IEEE conference on computer vision and pattern recognition , pp. 2422-2431
- Chen, X.¹ Lawrence Zitnick, C.²

11
- 84926006932
- In EMNLP (pp. 1025–1035). Citeseer
- Chen, X., Liu, Z., & Sun, M
- Chen, X., Liu, Z., & Sun, M. (2014). A unified model for word sense representation and disambiguation. In EMNLP (pp. 1025–1035). Citeseer.
- (2014) A unified model for word sense representation and disambiguation

12
- 84898803720
- In 2013 IEEE international conference on computer vision (ICCV) (pp. 1409–1416). IEEE
- Chen, X., Shrivastava, A., & Gupta, A
- Chen, X., Shrivastava, A., & Gupta, A. (2013). Neil: Extracting visual knowledge from web data. In 2013 IEEE international conference on computer vision (ICCV) (pp. 1409–1416). IEEE.
- (2013) Neil: Extracting visual knowledge from web data

13
- 84887394346
- In 2013 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 33–40). IEEE
- Choi, W., Chao, Y.-W., Pantofaru, C., & Savarese, S
- Choi, W., Chao, Y.-W., Pantofaru, C., & Savarese, S. (2013). Understanding indoor scenes using 3D geometric phrases. In 2013 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 33–40). IEEE.
- (2013) Understanding indoor scenes using 3D geometric phrases

14
- 85018049956
- In Proceedings of the 42nd annual meeting on association for computational linguistics (p. 423). Association for Computational Linguistics
- Culotta, A., & Sorensen, J
- Culotta, A., & Sorensen, J. (2004). Dependency tree kernels for relation extraction. In Proceedings of the 42nd annual meeting on association for computational linguistics (p. 423). Association for Computational Linguistics.
- (2004) Dependency tree kernels for relation extraction

15
- 84965117097
- Equilibrated adaptive learning rates for non-convex optimization
- Dauphin, Y., de Vries, H., & Bengio, Y. (2015). Equilibrated adaptive learning rates for non-convex optimization. In Advances in neural information processing systems (pp. 1504–1512).
- (2015) In Advances in neural information processing systems , pp. 1504-1512
- Dauphin, Y.¹ de Vries, H.² Bengio, Y.³

16
- 85018059159
- Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., & Fei-Fei, L2009 (CVPR 2009) (pp. 248–255). IEEE
- Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., & Fei-Fei, L. (2009). Imagenet: A large-scale hierarchical image database. In IEEE conference on computer vision and pattern recognition, 2009 (CVPR 2009) (pp. 248–255). IEEE.
- (2009) Imagenet: A large-scale hierarchical image database. In IEEE conference on computer vision and pattern recognition

17
- 85018068497
- In Proceedings of the ninth workshop on statistical machine translation. Citeseer
- Denkowski, M., & Lavie, A
- Denkowski, M., & Lavie, A. (2014). Meteor universal: Language specific translation evaluation for any target language. In Proceedings of the ninth workshop on statistical machine translation. Citeseer.
- (2014) Meteor universal: Language specific translation evaluation for any target language

18
- 84857435937
- Pedestrian detection: An evaluation of the state of the art
- Dollar, P., Wojek, C., Schiele, B., & Perona, P. (2012). Pedestrian detection: An evaluation of the state of the art. IEEE Transactions on Pattern Analysis and Machine Intelligence, 34(4), 743–761.
- (2012) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.34 , Issue.4 , pp. 743-761
- Dollar, P.¹ Wojek, C.² Schiele, B.³ Perona, P.⁴

19
- 84959236502
- Long-term recurrent convolutional networks for visual recognition and description
- Donahue, J., Anne Hendricks, L., Guadarrama, S., Rohrbach, M., Venugopalan, S., Saenko, K., et al. (2015). Long-term recurrent convolutional networks for visual recognition and description. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2625–2634).
- (2015) In Proceedings of the IEEE conference on computer vision and pattern recognition , pp. 2625-2634
- Donahue, J.¹ Anne Hendricks, L.² Guadarrama, S.³ Rohrbach, M.⁴ Venugopalan, S.⁵ Saenko, K.⁶

20
- 77951298115
- The pascal visual object classes (VOC) challenge
- Everingham, M., Van Gool, L., Williams, C. K., Winn, J., & Zisserman, A. (2010). The pascal visual object classes (VOC) challenge. International Journal of Computer Vision, 88(2), 303–338.
- (2010) International Journal of Computer Vision , vol.88 , Issue.2 , pp. 303-338
- Everingham, M.¹ Van Gool, L.² Williams, C.K.³ Winn, J.⁴ Zisserman, A.⁵

21
- 84959250180
- Fang, H., Gupta, S., Iandola, F., Srivastava, R. K., Deng, L., Dollár, P., et al. (2015). From captions to visual concepts and back. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1473–1482).

22
- 70450207704
- Farhadi, A., Endres, I., Hoiem, D., & Forsyth, D2009 (CVPR 2009) (pp. 1778–1785). IEEE
- Farhadi, A., Endres, I., Hoiem, D., & Forsyth, D. (2009). Describing objects by their attributes. In IEEE conference on computer vision and pattern recognition, 2009 (CVPR 2009) (pp. 1778–1785). IEEE.
- (2009) Describing objects by their attributes. In IEEE conference on computer vision and pattern recognition

23
- 78149311145
- In Computer vision–ECCV 2010 (pp. 15–29). Springer
- Farhadi, A., Hejrati, M., Sadeghi, M. A., Young, P., Rashtchian, C., Hockenmaier, J., et al
- Farhadi, A., Hejrati, M., Sadeghi, M. A., Young, P., Rashtchian, C., Hockenmaier, J., et al. (2010). Every picture tells a story: Generating sentences from images. In Computer vision–ECCV 2010 (pp. 15–29). Springer.
- (2010) Every picture tells a story: Generating sentences from images

24
- 34047174674
- Learning generative visual models from few training examples: An incremental bayesian approach tested on 101 object categories
- Fei-Fei, L., Fergus, R., & Perona, P. (2007). Learning generative visual models from few training examples: An incremental bayesian approach tested on 101 object categories. Computer Vision and Image Understanding, 106(1), 59–70.
- (2007) Computer Vision and Image Understanding , vol.106 , Issue.1 , pp. 59-70
- Fei-Fei, L.¹ Fergus, R.² Perona, P.³

25
- 70450219358
- Learning visual attributes
- Ferrari, V., & Zisserman, A. (2007). Learning visual attributes. In Advances in neural information processing systems (pp. 433–440).
- (2007) In Advances in neural information processing systems , pp. 433-440
- Ferrari, V.¹ Zisserman, A.²

26
- 79953685181
- Building watson: An overview of the deepqa project
- Ferrucci, D., Brown, E., Chu-Carroll, J., Fan, J., Gondek, D., Kalyanpur, A. A., et al. (2010). Building watson: An overview of the deepqa project. AI Magazine, 31(3), 59–79.
- (2010) AI Magazine , vol.31 , Issue.3 , pp. 59-79
- Ferrucci, D.¹ Brown, E.² Chu-Carroll, J.³ Fan, J.⁴ Gondek, D.⁵ Kalyanpur, A.A.⁶ Lally, A.⁷ Murdock, J.W.⁸ Nyberg, E.⁹ Prager, J.¹⁰

27
- 84937604790
- Cognition does not affect perception: Evaluating the evidence for top-down effects
- Firestone, C., & Scholl, B. J. (2015). Cognition does not affect perception: Evaluating the evidence for top-down effects. Behavioral and brain sciences (pp. 1–72).
- (2015) Behavioral and brain sciences , pp. 1-72
- Firestone, C.¹ Scholl, B.J.²

28
- 0021613150
- Qualitative process theory
- Forbus, K. D. (1984). Qualitative process theory. Artificial Intelligence, 24(1), 85–168.
- (1984) Artificial Intelligence , vol.24 , Issue.1 , pp. 85-168
- Forbus, K.D.¹

29
- 84965148420
- Are you talking to a machine? Dataset and methods for multilingual image question
- Gao, H., Mao, J., Zhou, J., Huang, Z., Wang, L., & Xu, W. (2015). Are you talking to a machine? Dataset and methods for multilingual image question. In Advances in neural information processing systems (pp. 2296–2304).
- (2015) In Advances in neural information processing systems , pp. 2296-2304
- Gao, H.¹ Mao, J.² Zhou, J.³ Huang, Z.⁴ Wang, L.⁵ Xu, W.⁶

30
- 84925422907
- Visual turing test for computer vision systems
- Geman, D., Geman, S., Hallonquist, N., & Younes, L. (2015). Visual turing test for computer vision systems. Proceedings of the National Academy of Sciences, 112(12), 3618–3623.
- (2015) Proceedings of the National Academy of Sciences , vol.112 , Issue.12 , pp. 3618-3623
- Geman, D.¹ Geman, S.² Hallonquist, N.³ Younes, L.⁴

31
- 84964588182
- Girshick, R. (2015). Fast R-CNN. In Proceedings of the IEEE international conference on computer vision (pp. 1440–1448).
- (2015) Fast R-CNN. In Proceedings of the IEEE international conference on computer vision , pp. 1440-1448
- Girshick, R.¹

32
- 85119023807
- In 2014 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 580–587). IEEE
- Girshick, R., Donahue, J., Darrell, T., & Malik, J
- Girshick, R., Donahue, J., Darrell, T., & Malik, J. (2014). Rich feature hierarchies for accurate object detection and semantic segmentation. In 2014 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 580–587). IEEE.
- (2014) Rich feature hierarchies for accurate object detection and semantic segmentation

33
- 84911449570
- In 2014 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 2489–2496). IEEE
- Goering, C., Rodner, E., Freytag, A., & Denzler, J
- Goering, C., Rodner, E., Freytag, A., & Denzler, J. (2014). Nonparametric part transfer for fine-grained recognition. In 2014 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 2489–2496). IEEE.
- (2014) Nonparametric part transfer for fine-grained recognition

34
- 34948904828
- Caltech-256 object category dataset
- Griffin, G., Holub, A., & Perona, P. (2007). Caltech-256 object category dataset. Technical Report 7694.
- (2007) Technical Report , pp. 7694
- Griffin, G.¹ Holub, A.² Perona, P.³

35
- 84859889184
- In Proceedings of the 43rd annual meeting on association for computational linguistics (pp. 427–434). Association for Computational Linguistics
- GuoDong, Z., Jian, S., Jie, Z., & Min, Z
- GuoDong, Z., Jian, S., Jie, Z., & Min, Z. (2005). Exploring various knowledge in relation extraction. In Proceedings of the 43rd annual meeting on association for computational linguistics (pp. 427–434). Association for Computational Linguistics.
- (2005) Exploring various knowledge in relation extraction

36
- 57149125139
- In Computer vision–ECCV 2008 (pp. 16–29). Springer
- Gupta, A., & Davis, L. S
- Gupta, A., & Davis, L. S. (2008). Beyond nouns: Exploiting prepositions and comparative adjectives for learning visual classifiers. In Computer vision–ECCV 2008 (pp. 16–29). Springer.
- (2008) Beyond nouns: Exploiting prepositions and comparative adjectives for learning visual classifiers

37
- 69549121743
- Observing human–object interactions: Using spatial and functional compatibility for recognition
- Gupta, A., Kembhavi, A., & Davis, L. S. (2009). Observing human–object interactions: Using spatial and functional compatibility for recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 31(10), 1775–1789.
- (2009) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.31 , Issue.10 , pp. 1775-1789
- Gupta, A.¹ Kembhavi, A.² Davis, L.S.³

38
- 0346759983
- Geneva: Institut pour les études sémantiques et cognitives/Université de Genève
- Hayes, P. J. (1978). The naive physics manifesto. Geneva: Institut pour les études sémantiques et cognitives/Université de Genève.
- (1978) The naive physics manifesto
- Hayes, P.J.¹

39
- 0001782174
- The second naive physics manifesto
- Hayes, P. J. (1985). The second naive physics manifesto. Theories of the commonsense world (pp. 1–36).
- (1985) Theories of the commonsense world , pp. 1-36
- Hayes, P.J.¹

40
- 0003074296
- Support vector machines
- Hearst, M. A., Dumais, S. T., Osman, E., Platt, J., & Scholkopf, B. (1998). Support vector machines. IEEE Intelligent Systems and their Applications, 13(4), 18–28.
- (1998) IEEE Intelligent Systems and their Applications , vol.13 , Issue.4 , pp. 18-28
- Hearst, M.A.¹ Dumais, S.T.² Osman, E.³ Platt, J.⁴ Scholkopf, B.⁵

41
- 0031573117
- Long short-term memory
- Hochreiter, S., & Schmidhuber, J. (1997). Long short-term memory. Neural Computation, 9(8), 1735–1780.
- (1997) Neural Computation , vol.9 , Issue.8 , pp. 1735-1780
- Hochreiter, S.¹ Schmidhuber, J.²

42
- 84883394520
- Framing image description as a ranking task: Data, models and evaluation metrics
- Hodosh, M., Young, P., & Hockenmaier, J. (2013). Framing image description as a ranking task: Data, models and evaluation metrics. Journal of Artificial Intelligence Research, 47(1), 853–899.
- (2013) Journal of Artificial Intelligence Research , vol.47 , Issue.1 , pp. 853-899
- Hodosh, M.¹ Young, P.² Hockenmaier, J.³

43
- 85018080332
- In Intelligent information processing (pp. 77–89). Springer
- Hou, C.-S. J., Noy, N. F., & Musen, M. A
- Hou, C.-S. J., Noy, N. F., & Musen, M. A. (2002). A template-based approach toward acquisition of logical sentences. In Intelligent information processing (pp. 77–89). Springer.
- (2002) A template-based approach toward acquisition of logical sentences

44
- 85018081584
- Huang, G. B., Mattar, M., Berg, T., & Learned-Miller, Eand recognition
- Huang, G. B., Mattar, M., Berg, T., & Learned-Miller, E. (2008). Labeled faces in the wild: A database forstudying face recognition in unconstrained environments. In Workshop on faces in ’real-life’ images: Detection, alignment, and recognition.
- (2008) Labeled faces in the wild: A database forstudying face recognition in unconstrained environments. In Workshop on faces in ’real-life’ images: Detection, alignment

45
- 84959192212
- Discovering states and transformations in image collections
- Isola, P., Lim, J. J., & Adelson, E. H. (2015). Discovering states and transformations in image collections. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1383–1391).
- (2015) In Proceedings of the IEEE conference on computer vision and pattern recognition , pp. 1383-1391
- Isola, P.¹ Lim, J.J.² Adelson, E.H.³

46
- 85119024224
- In 2014 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 232–239). IEEE
- Izadinia, H., Sadeghi, F., & Farhadi, A
- Izadinia, H., Sadeghi, F., & Farhadi, A. (2014). Incorporating scene context and object layout into appearance modeling. In 2014 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 232–239). IEEE.
- (2014) Incorporating scene context and object layout into appearance modeling

47
- 84959233256
- In IEEE conference on computer vision and pattern recognition (CVPR)
- Johnson, J., Krishna, R., Stark, M., Li, L.-J., Shamma, D. A., Bernstein, M., et al
- Johnson, J., Krishna, R., Stark, M., Li, L.-J., Shamma, D. A., Bernstein, M., et al. (2015). Image retrieval using scene graphs. In IEEE conference on computer vision and pattern recognition (CVPR).
- (2015) Image retrieval using scene graphs

48
- 84946734827
- Deep visual-semantic alignments for generating image descriptions
- Karpathy, A., & Fei-Fei, L. (2015). Deep visual-semantic alignments for generating image descriptions. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 3128–3137).
- (2015) In Proceedings of the IEEE conference on computer vision and pattern recognition , pp. 3128-3137
- Karpathy, A.¹ Fei-Fei, L.²

49
- 84929363334
- Multimodal neural language models
- Kiros, R., Salakhutdinov, R., & Zemel, R. (2014). Multimodal neural language models. In Proceedings of the 31st international conference on machine learning (ICML-14) (pp. 595–603).
- (2014) In Proceedings of the 31st international conference on machine learning (ICML-14) , pp. 595-603
- Kiros, R.¹ Salakhutdinov, R.² Zemel, R.³

50
- 85014738021
- In CHI’16-SIGCHI conference on human factors in computing system
- Krishna, R., Hata, K., Chen, S., Kravitz, J., Shamma, D. A., Fei-Fei, L., et al
- Krishna, R., Hata, K., Chen, S., Kravitz, J., Shamma, D. A., Fei-Fei, L., et al. (2016). Embracing error to enable rapid crowdsourcing. In CHI’16-SIGCHI conference on human factors in computing system.
- (2016) Embracing error to enable rapid crowdsourcing

51
- 84876231242
- Imagenet classification with deep convolutional neural networks
- Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems (pp. 1097–1105).
- (2012) In Advances in neural information processing systems , pp. 1097-1105
- Krizhevsky, A.¹ Sutskever, I.² Hinton, G.E.³

52
- 70450172710
- Lampert, C. H., Nickisch, H., & Harmeling, S2009 (CVPR 2009) (pp. 951–958). IEEE
- Lampert, C. H., Nickisch, H., & Harmeling, S. (2009). Learning to detect unseen object classes by between-class attribute transfer. In IEEE conference on computer vision and pattern recognition, 2009 (CVPR 2009) (pp. 951–958). IEEE.
- (2009) Learning to detect unseen object classes by between-class attribute transfer. In IEEE conference on computer vision and pattern recognition

53
- 0346336042
- Using corpus statistics and wordnet relations for sense identification
- Leacock, C., Miller, G. A., & Chodorow, M. (1998). Using corpus statistics and wordnet relations for sense identification. Computational Linguistics, 24(1), 147–165.
- (1998) Computational Linguistics , vol.24 , Issue.1 , pp. 147-165
- Leacock, C.¹ Miller, G.A.² Chodorow, M.³

54
- 84970028761
- Lebret, R., Pinheiro, P. O., & Collobert, R. (2015). Phrase-based image captioning. arXiv:1502.03671.
- (2015) Phrase-based image captioning. arXiv , vol.1502 , pp. 03671
- Lebret, R.¹ Pinheiro, P.O.² Collobert, R.³

55
- 84906493406
- In Computer vision–ECCV 2014 (pp. 740–755). Springer
- Lin, T.-Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., et al
- Lin, T.-Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., et al. (2014). Microsoft COCO: Common objects in context. In Computer vision–ECCV 2014 (pp. 740–755). Springer.
- (2014) Microsoft COCO: Common objects in context

56
- 85018078208
- In European conference on computer vision (ECCV). IEEE
- Lu, C., Krishna, R., Bernstein, M., & Fei-Fei, L
- Lu, C., Krishna, R., Bernstein, M., & Fei-Fei, L. (2016). Visual relationship detection using language priors. In European conference on computer vision (ECCV). IEEE.
- (2016) Visual relationship detection using language priors

57
- 85018078792
- Ma, L., Lu, Z., & Li, H. (2015). Learning to answer questions from image using convolutional neural network. arXiv:1506.00333.
- (2015) Learning to answer questions from image using convolutional neural network. arXiv , vol.1506 , pp. 00333
- Ma, L.¹ Lu, Z.² Li, H.³

58
- 84937822746
- A multi-world approach to question answering about real-world scenes based on uncertain input
- Malinowski, M., & Fritz, M. (2014). A multi-world approach to question answering about real-world scenes based on uncertain input. In Advances in neural information processing systems (pp. 1682–1690).
- (2014) In Advances in neural information processing systems , pp. 1682-1690
- Malinowski, M.¹ Fritz, M.²

59
- 84973896625
- Ask your neurons: A neural-based approach to answering questions about images
- Malinowski, M., Rohrbach, M., & Fritz, M. (2015). Ask your neurons: A neural-based approach to answering questions about images. In Proceedings of the IEEE international conference on computer vision (pp. 1–9).
- (2015) In Proceedings of the IEEE international conference on computer vision , pp. 1-9
- Malinowski, M.¹ Rohrbach, M.² Fritz, M.³

60
- 51949096556
- Malisiewicz, T., Efros, A., et al2008 (CVPR 2008) (pp. 1–8). IEEE
- Malisiewicz, T., Efros, A., et al. (2008). Recognition by association via learning per-exemplar distances. In IEEE conference on computer vision and pattern recognition, 2008 (CVPR 2008) (pp. 1–8). IEEE.
- (2008) Recognition by association via learning per-exemplar distances. In IEEE conference on computer vision and pattern recognition

61
- 85117622017
- The Stanford CoreNLP natural language processing toolkit
- Manning, C. D., Surdeanu, M., Bauer, J., Finkel, J., Bethard, S. J., & McClosky, D. (2014). The Stanford CoreNLP natural language processing toolkit. In Proceedings of 52nd annual meeting of the association for computational linguistics: system demonstrations (pp. 55–60).
- (2014) In Proceedings of 52nd annual meeting of the association for computational linguistics: system demonstrations , pp. 55-60
- Manning, C.D.¹ Surdeanu, M.² Bauer, J.³ Finkel, J.⁴ Bethard, S.J.⁵ McClosky, D.⁶

62
- 84951072975
- Mao, J., Xu, W., Yang, Y., Wang, J., & Yuille, A. L. (2014). Explain images with multimodal recurrent neural networks. arXiv:1410.1090.
- (2014) Explain images with multimodal recurrent neural networks. arXiv , vol.1410 , pp. 1090
- Mao, J.¹ Xu, W.² Yang, Y.³ Wang, J.⁴ Yuille, A.L.⁵

63
- 85018069107
- The senseval-3 English lexical sample task. Association for Computational Linguistics
- Mihalcea, R., Chklovski, T. A., & Kilgarriff, A. (2004). The senseval-3 English lexical sample task. Association for Computational Linguistics, UNT Digital Library.
- (2004) UNT Digital Library
- Mihalcea, R.¹ Chklovski, T.A.² Kilgarriff, A.³

64
- 85083951332
- Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013). Efficient estimation of word representations in vector space. arXiv:1301.3781.
- (2013) Efficient estimation of word representations in vector space. arXiv , vol.1301 , pp. 3781
- Mikolov, T.¹ Chen, K.² Corrado, G.³ Dean, J.⁴

65
- 84976702763
- Wordnet: a lexical database for english
- Miller, G. A. (1995). Wordnet: a lexical database for english. Communications of the ACM, 38(11), 39–41.
- (1995) Communications of the ACM , vol.38 , Issue.11 , pp. 39-41
- Miller, G.A.¹

66
- 84877901872
- Elementary: Large-scale knowledge-base construction via machine learning and statistical inference
- Niu, F., Zhang, C., Ré, C., & Shavlik, J. (2012). Elementary: Large-scale knowledge-base construction via machine learning and statistical inference. International Journal on Semantic Web and Information Systems (IJSWIS), 8(3), 42–73.
- (2012) International Journal on Semantic Web and Information Systems (IJSWIS) , vol.8 , Issue.3 , pp. 42-73
- Niu, F.¹ Zhang, C.² Ré, C.³ Shavlik, J.⁴

67
- 85162522202
- Im2text: Describing images using 1 million captioned photographs
- Red Hook, Curran Associates, Inc
- Ordonez, V., Kulkarni, G., & Berg, T. L. (2011). Im2text: Describing images using 1 million captioned photographs. In J. Shawe-Taylor, R. Zemel, P. Bartlett, F. Pereira, & K. Weinberger (Eds.), Advances in neural information processing systems (Vol. 24, pp. 1143–1151). Red Hook: Curran Associates, Inc.
- (2011) Advances in neural information processing systems , pp. 1143-1151
- Ordonez, V.¹ Kulkarni, G.² Berg, T.L.³ Shawe-Taylor, J.⁴ Zemel, R.⁵ Bartlett, P.⁶ Pereira, F.⁷ Weinberger, K.⁸

68
- 85011959542
- Pal, A. R., & Saha, D. (2015). Word sense disambiguation: A survey. arXiv:1508.01346.
- (2015) Word sense disambiguation: A survey. arXiv , vol.1508 , pp. 01346
- Pal, A.R.¹ Saha, D.²

69
- 85018067251
- In Proceedings of the 40th annual meeting on association for computational linguistics (pp. 311–318). Association for Computational Linguistics
- Papineni, K., Roukos, S., Ward, T., & Zhu, W.-J
- Papineni, K., Roukos, S., Ward, T., & Zhu, W.-J. (2002). BLEU: A method for automatic evaluation of machine translation. In Proceedings of the 40th annual meeting on association for computational linguistics (pp. 311–318). Association for Computational Linguistics.
- (2002) BLEU: A method for automatic evaluation of machine translation

70
- 84900870389
- The sun attribute database: Beyond categories for deeper scene understanding
- Patterson, G., Xu, C., Su, H., & Hays, J. (2014). The sun attribute database: Beyond categories for deeper scene understanding. International Journal of Computer Vision, 108(1–2), 59–81.
- (2014) International Journal of Computer Vision , vol.108 , Issue.1-2 , pp. 59-81
- Patterson, G.¹ Xu, C.² Su, H.³ Hays, J.⁴

71
- 78149348137
- In Computer vision–ECCV 2010 (pp. 143–156). Springer
- Perronnin, F., Sánchez, J., & Mensink, T
- Perronnin, F., Sánchez, J., & Mensink, T. (2010). Improving the fisher kernel for large-scale image classification. In Computer vision–ECCV 2010 (pp. 143–156). Springer.
- (2010) Improving the fisher kernel for large-scale image classification

72
- 84856142160
- Weakly supervised learning of interactions between humans and objects
- Prest, A., Schmid, C., & Ferrari, V. (2012). Weakly supervised learning of interactions between humans and objects. IEEE Transactions on Pattern Analysis and Machine Intelligence, 34(3), 601–614.
- (2012) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.34 , Issue.3 , pp. 601-614
- Prest, A.¹ Schmid, C.² Ferrari, V.³

73
- 84959233994
- Learning semantic relationships for better action retrieval in images
- Ramanathan, V., Li, C., Deng, J., Han, W., Li, Z., Gu, K., et al. (2015). Learning semantic relationships for better action retrieval in images. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1100–1109).
- (2015) In Proceedings of the IEEE conference on computer vision and pattern recognition , pp. 1100-1109
- Ramanathan, V.¹ Li, C.² Deng, J.³ Han, W.⁴ Li, Z.⁵ Gu, K.⁶

74
- 85018083692
- Ren, M., Kiros, R., & Zemel, R. (2015a). Image question answering: A visual semantic embedding model and a new dataset. arXiv:1505.02074.
- (2015) Image question answering: A visual semantic embedding model and a new dataset. arXiv , vol.1505 , pp. 02074
- Ren, M.¹ Kiros, R.² Zemel, R.³

75
- 84960980241
- Faster r-cnn: Towards real-time object detection with region proposal networks
- Ren, S., He, K., Girshick, R., & Sun, J. (2015b). Faster r-cnn: Towards real-time object detection with region proposal networks. In Advances in neural information processing systems (pp. 91–99).
- (2015) In Advances in neural information processing systems , pp. 91-99
- Ren, S.¹ He, K.² Girshick, R.³ Sun, J.⁴

76
- 85018058894
- Describing common human visual actions in images. In X. Xie, M. W. Jones, & G. K. L. Tam (Eds.), Proceedings of the British machine vision conference (BMVC 2015) (pp. 52.1–52.12)
- Ronchi, M. R., & Perona, P. (2015). Describing common human visual actions in images. In X. Xie, M. W. Jones, & G. K. L. Tam (Eds.), Proceedings of the British machine vision conference (BMVC 2015) (pp. 52.1–52.12). BMVA Press.
- (2015) BMVA Press
- Ronchi, M.R.¹ Perona, P.²

77
- 85018373400
- Rothe, S., & Schütze, H. (2015). Autoextend: Extending word embeddings to embeddings for synsets and lexemes. arXiv:1507.01127.
- (2015) Autoextend: Extending word embeddings to embeddings for synsets and lexemes. arXiv , vol.1507 , pp. 01127
- Rothe, S.¹ Schütze, H.²

78
- 84947041871
- ImageNet large scale visual recognition challenge
- Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., et al. (2015). ImageNet large scale visual recognition challenge. International journal of computer vision (IJCV) (pp. 1–42).
- (2015) International journal of computer vision (IJCV) , pp. 1-42
- Russakovsky, O.¹ Deng, J.² Su, H.³ Krause, J.⁴ Satheesh, S.⁵ Ma, S.⁶

79
- 39749186006
- Labelme: A database and web-based tool for image annotation
- Russell, B. C., Torralba, A., Murphy, K. P., & Freeman, W. T. (2008). Labelme: A database and web-based tool for image annotation. International Journal of Computer Vision, 77(1–3), 157–173.
- (2008) International Journal of Computer Vision , vol.77 , Issue.1-3 , pp. 157-173
- Russell, B.C.¹ Torralba, A.² Murphy, K.P.³ Freeman, W.T.⁴

80
- 84959184467
- Viske: Visual knowledge extraction and question answering by visual verification of relation phrases
- Sadeghi, F., Divvala, S. K., & Farhadi, A. (2015). Viske: Visual knowledge extraction and question answering by visual verification of relation phrases. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1456–1464).
- (2015) In Proceedings of the IEEE conference on computer vision and pattern recognition , pp. 1456-1464
- Sadeghi, F.¹ Divvala, S.K.² Farhadi, A.³

81
- 80052889458
- In 2011 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 1745–1752). IEEE
- Sadeghi, M. A., & Farhadi, A
- Sadeghi, M. A., & Farhadi, A. (2011). Recognition using visual phrases. In 2011 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 1745–1752). IEEE.
- (2011) Recognition using visual phrases

82
- 84951013363
- In Proceedings of the 33rd annual ACM conference on human factors in computing systems (pp. 1621–1630). ACM
- Salehi, N., Irani, L. C., & Bernstein, M. S
- Salehi, N., Irani, L. C., & Bernstein, M. S. (2015). We are dynamo: Overcoming stalling and friction in collective action for crowd workers. In Proceedings of the 33rd annual ACM conference on human factors in computing systems (pp. 1621–1630). ACM.
- (2015) We are dynamo: Overcoming stalling and friction in collective action for crowd workers

83
- 0004016411
- Scripts, plans, goals, and understanding: An inquiry into human knowledge structures
- Schank, R. C., & Abelson, R. P. (2013). Scripts, plans, goals, and understanding: An inquiry into human knowledge structures. Hove: Psychology Press.
- (2013) Hove: Psychology Press
- Schank, R.C.¹ Abelson, R.P.²

84
- 79952079965
- VerbNet: A broad-coverage, comprehensive verb lexicon. Ph.D. thesis, University of Pennsylvania, Philadelphia, PA
- Schuler, K. K. (2005). VerbNet: A broad-coverage, comprehensive verb lexicon. Ph.D. thesis, University of Pennsylvania, Philadelphia, PA, USA (AAI3179808).
- (2005) USA (AAI3179808)
- Schuler, K.K.¹

85
- 85018069747
- In Proceedings of the fourth workshop on vision and language (pp. 70–80). Citeseer
- Schuster, S., Krishna, R., Chang, A., Fei-Fei, L., & Manning, C. D
- Schuster, S., Krishna, R., Chang, A., Fei-Fei, L., & Manning, C. D. (2015). Generating semantically precise scene graphs from textual descriptions for improved image retrieval. In Proceedings of the fourth workshop on vision and language (pp. 70–80). Citeseer.
- (2015) Generating semantically precise scene graphs from textual descriptions for improved image retrieval

86
- 84925321058
- Sermanet, P., Eigen, D., Zhang, X., Mathieu, M., Fergus, R., & LeCun, Y. (2013). Overfeat: Integrated recognition, localization and detection using convolutional networks. arXiv:1312.6229.
- (2013) Overfeat: Integrated recognition, localization and detection using convolutional networks. arXiv , vol.1312 , pp. 6229
- Sermanet, P.¹ Eigen, D.² Zhang, X.³ Mathieu, M.⁴ Fergus, R.⁵ LeCun, Y.⁶

87
- 85018060178
- In ECCV
- Silberman, N., Hoiem, D., Kohli, P., & Fergus, R
- Silberman, N., Hoiem, D., Kohli, P., & Fergus, R. (2012). Indoor segmentation and support inference from RGBD images. In ECCV.
- (2012) Indoor segmentation and support inference from RGBD images

88
- 84924803046
- Simonyan, K., & Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556.
- (2014) Very deep convolutional networks for large-scale image recognition. arXiv , vol.1409 , pp. 1556
- Simonyan, K.¹ Zisserman, A.²

89
- 80053360508
- In Proceedings of the conference on empirical methods in natural language processing (pp. 254–263). Association for Computational Linguistics
- Snow, R., O’Connor, B., Jurafsky, D., & Ng, A. Y
- Snow, R., O’Connor, B., Jurafsky, D., & Ng, A. Y. (2008). Cheap and fast—But is it good?: Evaluating non-expert annotations for natural language tasks. In Proceedings of the conference on empirical methods in natural language processing (pp. 254–263). Association for Computational Linguistics.
- (2008) Cheap and fast—But is it good?: Evaluating non-expert annotations for natural language tasks

90
- 84870715081
- In Proceedings of the 2012 joint conference on empirical methods in natural language processing and computational natural language learning (pp. 1201–1211). Association for Computational Linguistics
- Socher, R., Huval, B., Manning, C. D., & Ng, A. Y
- Socher, R., Huval, B., Manning, C. D., & Ng, A. Y. (2012). Semantic compositionality through recursive matrix-vector spaces. In Proceedings of the 2012 joint conference on empirical methods in natural language processing and computational natural language learning (pp. 1201–1211). Association for Computational Linguistics.
- (2012) Semantic compositionality through recursive matrix-vector spaces

91
- 85018038390
- Steinbach, M., Karypis, G., Kumar, V., et al. (2000). A comparison of document clustering techniques. In KDD workshop on text mining, Boston (Vol. 400, pp. 525–526).

92
- 84937522268
- Going deeper with convolutions
- Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., et al. (2015). Going deeper with convolutions. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1–9).
- (2015) In Proceedings of the IEEE conference on computer vision and pattern recognition , pp. 1-9
- Szegedy, C.¹ Liu, W.² Jia, Y.³ Sermanet, P.⁴ Reed, S.⁵ Anguelov, D.⁶

93
- 84957922397
- YFCC100M: The new data in multimedia research
- Thomee, B., Shamma, D. A., Friedland, G., Elizalde, B., Ni, K., Poland, D., et al. (2016). YFCC100M: The new data in multimedia research. Communications of the ACM, 59(2), 64–73.
- (2016) Communications of the ACM , vol.59 , Issue.2 , pp. 64-73
- Thomee, B.¹ Shamma, D.A.² Friedland, G.³ Elizalde, B.⁴ Ni, K.⁵ Poland, D.⁶ Borth, D.⁷ Li, L.-J.⁸

94
- 54749092170
- Million tiny images: A large data set for nonparametric object and scene recognition
- Torralba, A., Fergus, R., & Freeman, W. T. (2008). 80 million tiny images: A large data set for nonparametric object and scene recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(11), 1958–1970.
- (2008) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.30 , Issue.11 , pp. 1958-1970
- Torralba, A.¹ Fergus, R.² Freeman, W.T.³

95
- 8644258401
- A statistical approach to texture classification from single images
- Varma, M., & Zisserman, A. (2005). A statistical approach to texture classification from single images. International Journal of Computer Vision, 62(1–2), 61–81.
- (2005) International Journal of Computer Vision , vol.62 , Issue.1-2 , pp. 61-81
- Varma, M.¹ Zisserman, A.²

96
- 84956980995
- Cider: Consensus-based image description evaluation
- Vedantam, R., Lawrence Zitnick, C., & Parikh, D. (2015a). Cider: Consensus-based image description evaluation. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 4566–4575).
- (2015) In Proceedings of the IEEE conference on computer vision and pattern recognition , pp. 4566-4575
- Vedantam, R.¹ Lawrence Zitnick, C.² Parikh, D.³

97
- 84973926486
- Learning common sense through visual abstraction
- Vedantam, R., Lin, X., Batra, T., Lawrence Zitnick, C., & Parikh, D. (2015b). Learning common sense through visual abstraction. In Proceedings of the IEEE international conference on computer vision (pp. 2542–2550).
- (2015) In Proceedings of the IEEE international conference on computer vision , pp. 2542-2550
- Vedantam, R.¹ Lin, X.² Batra, T.³ Lawrence Zitnick, C.⁴ Parikh, D.⁵

98
- 84946747440
- Show and tell: A neural image caption generator
- Vinyals, O., Toshev, A., Bengio, S., & Erhan, D. (2015). Show and tell: A neural image caption generator. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 3156–3164).
- (2015) In Proceedings of the IEEE conference on computer vision and pattern recognition , pp. 3156-3164
- Vinyals, O.¹ Toshev, A.² Bengio, S.³ Erhan, D.⁴

99
- 84878084353
- The Caltech-UCSD birds-200-2011 dataset. Technical Report CNS-TR-2011-001
- Wah, C., Branson, S., Welinder, P., Perona, P., & Belongie, S. (2011). The Caltech-UCSD birds-200-2011 dataset. Technical Report CNS-TR-2011-001, California Institute of Technology.
- (2011) California Institute of Technology
- Wah, C.¹ Branson, S.² Welinder, P.³ Perona, P.⁴ Belongie, S.⁵

100
- 77955988947
- In 2010 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 3485–3492). IEEE
- Xiao, J., Hays, J., Ehinger, K., Oliva, A., Torralba, A., et al
- Xiao, J., Hays, J., Ehinger, K., Oliva, A., Torralba, A., et al. (2010). Sun database: Large-scale scene recognition from abbey to zoo. In 2010 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 3485–3492). IEEE.
- (2010) Sun database: Large-scale scene recognition from abbey to zoo

101
- 84939821074
- Show, attend and tell: Neural image caption generation with visual attention
- Xu, K., Ba, J., Kiros, R., Cho, K., Courville, A. C., Salakhutdinov, R., Zemel, R. S., and Bengio, Y. (2015). Show, attend and tell: Neural image caption generation with visual attention. CoRR. arXiv:1502.03044.
- (2015) CoRR. arXiv , vol.1502 , pp. 03044
- Xu, K.¹ Ba, J.² Kiros, R.³ Cho, K.⁴ Courville, A.C.⁵ Salakhutdinov, R.⁶ Zemel, R.S.⁷ Bengio, Y.⁸

102
- 84866704901
- In 2012 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 3522–3529). IEEE
- Yang, Y., Baker, S., Kannan, A., & Ramanan, D
- Yang, Y., Baker, S., Kannan, A., & Ramanan, D. (2012). Recognizing proxemics in personal photos. In 2012 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 3522–3529). IEEE.
- (2012) Recognizing proxemics in personal photos

103
- 77955988492
- In 2010 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 17–24). IEEE
- Yao, B., & Fei-Fei, L
- Yao, B., & Fei-Fei, L. (2010). Modeling mutual context of object and human pose in human–object interaction activities. In 2010 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 17–24). IEEE.
- (2010) Modeling mutual context of object and human pose in human–object interaction activities

104
- 38349066535
- Yao, B., Yang, X., & Zhu, S.-Cannotation tool and benchmarks. In Energy minimization methods in computer vision and pattern recognition (pp. 169–183). Springer
- Yao, B., Yang, X., & Zhu, S.-C. (2007). Introduction to a large-scale general purpose ground truth database: methodology, annotation tool and benchmarks. In Energy minimization methods in computer vision and pattern recognition (pp. 169–183). Springer.
- (2007) Introduction to a large-scale general purpose ground truth database: methodology

105
- 84906494296
- From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions
- Young, P., Lai, A., Hodosh, M., & Hockenmaier, J. (2014). From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions. Transactions of the Association for Computational Linguistics, 2, 67–78.
- (2014) Transactions of the Association for Computational Linguistics , vol.2 , pp. 67-78
- Young, P.¹ Lai, A.² Hodosh, M.³ Hockenmaier, J.⁴

106
- 85018070789
- Yu, L., Park, E., Berg, A. C., & Berg, T. L. (2015). Visual madlibs: Fill in the blank image generation and question answering. arXiv:1506.00278.
- (2015) Visual madlibs: Fill in the blank image generation and question answering. arXiv , vol.1506 , pp. 00278
- Yu, L.¹ Park, E.² Berg, A.C.³ Berg, T.L.⁴

107
- 84959862537
- Relation classification via convolutional deep neural network
- Zeng, D., Liu, K., Lai, S., Zhou, G., & Zhao, J. (2014). Relation classification via convolutional deep neural network. In Proceedings of COLING (pp. 2335–2344).
- (2014) In Proceedings of COLING , pp. 2335-2344
- Zeng, D.¹ Liu, K.² Lai, S.³ Zhou, G.⁴ Zhao, J.⁵

108
- 80053360686
- Tree kernel-based relation extraction with context-sensitive structured parse tree information
- Zhou, G., Zhang, M., Ji, D. H., & Zhu, Q. (2007). Tree kernel-based relation extraction with context-sensitive structured parse tree information. In EMNLP-CoNLL 2007 (p. 728).
- (2007) In EMNLP-CoNLL , vol.2007 , pp. 728
- Zhou, G.¹ Zhang, M.² Ji, D.H.³ Zhu, Q.⁴

109
- 84865644621
- In Proceedings of the 18th international conference on world wide web (pp. 101–110). ACM
- Zhu, J., Nie, Z., Liu, X., Zhang, B., & Wen, J.-R
- Zhu, J., Nie, Z., Liu, X., Zhang, B., & Wen, J.-R. (2009). Statsnowball: A statistical approach to extracting entity relationships. In Proceedings of the 18th international conference on world wide web (pp. 101–110). ACM.
- (2009) Statsnowball: A statistical approach to extracting entity relationships

110
- 85018036485
- In European conference on computer vision
- Zhu, Y., Fathi, A., & Fei-Fei, L
- Zhu, Y., Fathi, A., & Fei-Fei, L. (2014). Reasoning about object affordances in a knowledge base representation. In European conference on computer vision.
- (2014) Reasoning about object affordances in a knowledge base representation

111
- 85009429007
- Zhu, Y., Zhang, C., Ré, C., & Fei-Fei, L. (2015). Building a large-scale multimodal knowledge base system for answering visual queries. arXiv:1507.05670.
- (2015) Building a large-scale multimodal knowledge base system for answering visual queries. arXiv , vol.1507 , pp. 05670
- Zhu, Y.¹ Zhang, C.² Ré, C.³ Fei-Fei, L.⁴

112
- 84887338442
- In 2013 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 3009–3016). IEEE
- Zitnick, C. L., & Parikh, D
- Zitnick, C. L., & Parikh, D. (2013). Bringing semantics into focus using visual abstraction. In 2013 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 3009–3016). IEEE.
- (2013) Bringing semantics into focus using visual abstraction

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.