SCOPUS Information Retrieval Platform

Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

Volume 07-12-June-2015, 2015, Pages 2727-2736

Image specificity

(2) Jas, Mainak a Parikh, Devi b

a AALTO UNIVERSITY (Finland)

b VIRGINIA POLYTECHNIC INSTITUTE AND STATE UNIVERSITY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER VISION; INFORMATION RETRIEVAL; PATTERN RECOGNITION;

IMAGE CONTENT; IMAGE FEATURES; MODEL IMAGES; MULTIPLE DESCRIPTIONS; MULTIPLE PEOPLE; REFERENCE DESCRIPTIONS; TEXT-BASED IMAGE RETRIEVALS; TEXTUAL DESCRIPTION;

IMAGE RETRIEVAL;

EID: 84959214146 PISSN: 10636919 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/CVPR.2015.7298889 Document Type: Conference Paper

Times cited : (43)

References (53)

1
- 33847226906
- S. Banerjee and A. Lavie. METEOR: An automatic metric for MT evaluation with improved correlation with human judgments. 2005
- (2005) METEOR: An Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments.
- Banerjee, S.¹ Lavie, A.²

2
- 0034857154
- Learning the semantics of words and pictures
- K. Barnard and D. Forsyth. Learning the semantics of words and pictures. In IEEE International Conference on Computer Vision (ICCV), volume 2, pages 408-415, 2001
- (2001) IEEE International Conference on Computer Vision (ICCV) , vol.2 , pp. 408-415
- Barnard, K.¹ Forsyth, D.²

3
- 84866726859
- Understanding and predicting importance in images
- A. C. Berg, T. L. Berg, H. Daume, J. Dodge, A. Goyal, X. Han, A. Mensch, M. Mitchell, A. Sood, K. Stratos, et al. Understanding and predicting importance in images. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 3562-3569, 2012.
- (2012) IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , pp. 3562-3569
- Berg, A.C.¹ Berg, T.L.² Daume, H.³ Dodge, J.⁴ Goyal, A.⁵ Han, X.⁶ Mensch, A.⁷ Mitchell, M.⁸ Sood, A.⁹ Stratos, K.¹⁰

4
- 85107362379
- NLTK: The natural language toolkit
- Association for Computational Linguistics
- S. Bird. NLTK: the natural language toolkit. In Proceedings of the COLING/ACL on Interactive presentation sessions, pages 69-72. Association for Computational Linguistics, 2006
- (2006) Proceedings of the COLING/ACL on Interactive Presentation Sessions , pp. 69-72
- Bird, S.¹

5
- 84957029470
- Learning a recurrent visual representation for image caption generation
- X. Chen and C. L. Zitnick. Learning a Recurrent Visual Representation for Image Caption Generation. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
- (2015) IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
- Chen, X.¹ Zitnick, C.L.²

6
- 70350637509
- Real time google and live image search re-ranking
- ACM
- J. Cui, F. Wen, and X. Tang. Real time google and live image search re-ranking. In Proceedings of the 16th ACM international conference on Multimedia, pages 729-732. ACM, 2008
- (2008) Proceedings of the 16th ACM International Conference on Multimedia , pp. 729-732
- Cui, J.¹ Wen, F.² Tang, X.³

7
- 80052879599
- High level describable attributes for predicting aesthetics and interestingness
- S. Dhar, V. Ordonez, and T. L. Berg. High level describable attributes for predicting aesthetics and interestingness. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 1657-1664, 2011
- (2011) IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , pp. 1657-1664
- Dhar, S.¹ Ordonez, V.² Berg, T.L.³

8
- 84944046597
- arXiv preprint arXiv:1411.4389
- J. Donahue, L. A. Hendricks, S. Guadarrama, M. Rohrbach, S. Venugopalan, K. Saenko, and T. Darrell. Long-term recurrent convolutional networks for visual recognition and description. arXiv preprint arXiv:1411.4389, 2014
- (2014) Long-term Recurrent Convolutional Networks for Visual Recognition and Description
- Donahue, J.¹ Hendricks, L.A.² Guadarrama, S.³ Rohrbach, M.⁴ Venugopalan, S.⁵ Saenko, K.⁶ Darrell, T.⁷

9
- 84904482223
- arXiv preprint arXiv:1310.1531
- J. Donahue, Y. Jia, O. Vinyals, J. Hoffman, N. Zhang, E. Tzeng, and T. Darrell. Decaf: A deep convolutional activation feature for generic visual recognition. arXiv preprint arXiv:1310.1531, 2013
- (2013) Decaf: A Deep Convolutional Activation Feature for Generic Visual Recognition
- Donahue, J.¹ Jia, Y.² Vinyals, O.³ Hoffman, J.⁴ Zhang, N.⁵ Tzeng, E.⁶ Darrell, T.⁷

10
- 80052883815
- Combining attributes and fisher vectors for efficient image retrieval
- M. Douze, A. Ramisa, and C. Schmid. Combining attributes and fisher vectors for efficient image retrieval. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 745-752, 2011
- (2011) IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , pp. 745-752
- Douze, M.¹ Ramisa, A.² Schmid, C.³

11
- 85026929617
- D. Elliott and F. Keller. Comparing Automatic Evaluation Measures for Image Description
- Comparing Automatic Evaluation Measures for Image Description
- Elliott, D.¹ Keller, F.²

12
- 84880644383
- M. Everingham, S. A. Eslami, L. Van Gool, C. K. Williams, J. Winn, and A. Zisserman. The Pascal Visual Object Classes Challenge-a Retrospective
- The Pascal Visual Object Classes Challenge-a Retrospective
- Everingham, M.¹ Eslami, S.A.² Van Gool, L.³ Williams, C.K.⁴ Winn, J.⁵ Zisserman, A.⁶

13
- 84959250180
- From captions to visual concepts and back
- H. Fang, S. Gupta, F. Iandola, R. Srivastava, L. Deng, P. Dollar, J. Gao, X. He, M. Mitchell, J. Platt, et al. From captions to visual concepts and back. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
- (2015) IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
- Fang, H.¹ Gupta, S.² Iandola, F.³ Srivastava, R.⁴ Deng, L.⁵ Dollar, P.⁶ Gao, J.⁷ He, X.⁸ Mitchell, M.⁹ Platt, J.¹⁰

14
- 78149311145
- Every picture tells a story: Generating sentences from images
- Springer
- A. Farhadi, M. Hejrati, M. A. Sadeghi, P. Young, C. Rashtchian, J. Hockenmaier, and D. Forsyth. Every picture tells a story: Generating sentences from images. In Computer Vision-ECCV 2010, pages 15-29. Springer, 2010.
- (2010) Computer Vision-ECCV 2010 , pp. 15-29
- Farhadi, A.¹ Hejrati, M.² Sadeghi, M.A.³ Young, P.⁴ Rashtchian, C.⁵ Hockenmaier, J.⁶ Forsyth, D.⁷

15
- 80052264770
- Metamers of the ventral stream
- J. Freeman and E. P. Simoncelli. Metamers of the ventral stream. Nature Neuroscience, 14(9):1195-1201, 2011
- (2011) Nature Neuroscience , vol.14 , Issue.9 , pp. 1195-1201
- Freeman, J.¹ Simoncelli, E.P.²

16
- 84906343066
- arXiv preprint arXiv:1311.2524
- R. Girshick, J. Donahue, T. Darrell, and J. Malik. Rich feature hierarchies for accurate object detection and semantic segmentation. arXiv preprint arXiv:1311.2524, 2013
- (2013) Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation
- Girshick, R.¹ Donahue, J.² Darrell, T.³ Malik, J.⁴

17
- 84898827031
- The interestingness of images
- M. Gygli, H. Grabner, H. Riemenschneider, F. Nater, and L. V. Gool. The interestingness of images. In IEEE International Conference on Computer Vision (ICCV), pages 1633-1640, 2013
- (2013) IEEE International Conference on Computer Vision (ICCV) , pp. 1633-1640
- Gygli, M.¹ Grabner, H.² Riemenschneider, H.³ Nater, F.⁴ Gool, L.V.⁵

18
- 85162530463
- Understanding the intrinsic memorability of images
- P. Isola, D. Parikh, A. Torralba, and A. Oliva. Understanding the intrinsic memorability of images. In Advances in Neural Information Processing Systems, 2011
- (2011) Advances in Neural Information Processing Systems
- Isola, P.¹ Parikh, D.² Torralba, A.³ Oliva, A.⁴

19
- 80052870103
- What makes an image memorable
- P. Isola, J. Xiao, A. Torralba, and A. Oliva. What makes an image memorable? In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 145-152, 2011
- (2011) IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , pp. 145-152
- Isola, P.¹ Xiao, J.² Torralba, A.³ Oliva, A.⁴

20
- 0032204063
- A model of saliency-based visual attention for rapid scene analysis
- L. Itti, C. Koch, and E. Niebur. A model of saliency-based visual attention for rapid scene analysis. IEEE Transactions on pattern analysis and machine intelligence, 20(11):1254-1259, 1998
- (1998) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.20 , Issue.11 , pp. 1254-1259
- Itti, L.¹ Koch, C.² Niebur, E.³

21
- 84959195482
- arXiv preprint arXiv:1502.04569
- M. Jas and D. Parikh. Image Specificity. arXiv preprint arXiv:1502.04569, 2015.
- (2015) Image Specificity , pp. 3-7
- Jas, M.¹ Parikh, D.²

22
- 77953205576
- Learning to predict where humans look
- T. Judd, K. Ehinger, F. Durand, and A. Torralba. Learning to Predict Where Humans Look. In IEEE International Conference on Computer Vision (ICCV), 2009
- (2009) IEEE International Conference on Computer Vision (ICCV)
- Judd, T.¹ Ehinger, K.² Durand, F.³ Torralba, A.⁴

23
- 84946734827
- Deep visual-semantic alignments for generating image descriptions
- A. Karpathy and L. Fei-Fei. Deep visual-semantic alignments for generating image descriptions. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
- (2015) IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
- Karpathy, A.¹ Fei-Fei, L.²

24
- 84945572616
- What makes an image popular
- International World Wide Web Conferences Steering Committee
- A. Khosla, A. Das Sarma, and R. Hamid. What makes an image popular? In Proceedings of the 23rd international conference on World wide web, pages 867-876. International World Wide Web Conferences Steering Committee, 2014
- (2014) Proceedings of the 23rd International Conference on World Wide Web , pp. 867-876
- Khosla, A.¹ Das Sarma, A.² Hamid, R.³

25
- 84952349298
- Unifying visual-semantic embeddings with multimodal neural language models
- R. Kiros, R. Salakhutdinov, and R. S. Zemel. Unifying visual-semantic embeddings with multimodal neural language models. In Transactions of the Association for Computational Linguistics (TACL), 2015
- (2015) Transactions of the Association for Computational Linguistics (TACL)
- Kiros, R.¹ Salakhutdinov, R.² Zemel, R.S.³

26
- 84876231242
- Imagenet classification with deep convolutional neural networks
- A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems, pages 1097-1105, 2012
- (2012) Advances in Neural Information Processing Systems , pp. 1097-1105
- Krizhevsky, A.¹ Sutskever, I.² Hinton, G.E.³

27
- 0026843302
- Lexical ambiguity and information retrieval
- R. Krovetz and W. B. Croft. Lexical ambiguity and information retrieval. ACM Transactions on Information Systems (TOIS), 10(2):115-141, 1992
- (1992) ACM Transactions on Information Systems (TOIS) , vol.10 , Issue.2 , pp. 115-141
- Krovetz, R.¹ Croft, W.B.²

28
- 80052901011
- Baby talk: Understanding and generating simple image descriptions
- G. Kulkarni, V. Premraj, S. Dhar, S. Li, Y. Choi, A. C. Berg, and T. L. Berg. Baby talk: Understanding and generating simple image descriptions. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 1601-1608, 2011.
- (2011) IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , pp. 1601-1608
- Kulkarni, G.¹ Premraj, V.² Dhar, S.³ Li, S.⁴ Choi, Y.⁵ Berg, A.C.⁶ Berg, T.L.⁷

29
- 0034324105
- Next-generation web searches for visual content
- M. S. Lew. Next-generation web searches for visual content. Computer, 33(11):46-53, 2000
- (2000) Computer , vol.33 , Issue.11 , pp. 46-53
- Lew, M.S.¹

30
- 49249115835
- Data-driven enhancement of facial attractiveness
- T. Leyvand, D. Cohen-Or, G. Dror, and D. Lischinski. Data-driven enhancement of facial attractiveness. ACM Transactions on Graphics (TOG), 27(3):38, 2008
- (2008) ACM Transactions on Graphics (TOG) , vol.27 , Issue.3 , pp. 38
- Leyvand, T.¹ Cohen-Or, D.² Dror, G.³ Lischinski, D.⁴

31
- 84862279067
- Composing simple image descriptions using web-scale n-grams
- Association for Computational Linguistics
- S. Li, G. Kulkarni, T. L. Berg, A. C. Berg, and Y. Choi. Composing simple image descriptions using web-scale n-grams. In Proceedings of the Fifteenth Conference on Computational Natural Language Learning, pages 220-228. Association for Computational Linguistics, 2011
- (2011) Proceedings of the Fifteenth Conference on Computational Natural Language Learning , pp. 220-228
- Li, S.¹ Kulkarni, G.² Berg, T.L.³ Berg, A.C.⁴ Choi, Y.⁵

32
- 26944501715
- Rouge: A package for automatic evaluation of summaries
- C.-Y. Lin. Rouge: A package for automatic evaluation of summaries. In Text Summarization Branches Out: Proceedings of the ACL-04 Workshop, pages 74-81, 2004.
- (2004) Text Summarization Branches Out: Proceedings of the ACL-04 Workshop , pp. 74-81
- Lin, C.-Y.¹

33
- 33748856594
- D. Lin. An information-theoretic definition of similarity
- An Information-theoretic Definition of Similarity
- Lin, D.¹

34
- 84911442106
- Visual semantic search: Retrieving videos via complex textual queries
- D. Lin, S. Fidler, C. Kong, and R. Urtasun. Visual semantic search: Retrieving videos via complex textual queries. In Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on, pages 2657-2664, 2014
- (2014) Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on , pp. 2657-2664
- Lin, D.¹ Fidler, S.² Kong, C.³ Urtasun, R.⁴

35
- 84951072975
- arXiv preprint arXiv:1410.1090
- J. Mao, W. Xu, Y. Yang, J. Wang, and A. L. Yuille. Explain images with multimodal recurrent neural networks. arXiv preprint arXiv:1410.1090, 2014
- (2014) Explain Images with Multimodal Recurrent Neural Networks
- Mao, J.¹ Xu, W.² Yang, Y.³ Wang, J.⁴ Yuille, A.L.⁵

36
- 84976702763
- WordNet: A lexical database for English
- G. A. Miller. WordNet: a lexical database for English. Communications of the ACM, 38(11):39-41, 1995.
- (1995) Communications of the ACM , vol.38 , Issue.11 , pp. 39-41
- Miller, G.A.¹

37
- 85034832841
- Midge: Generating image descriptions from computer vision detections
- Association for Computational Linguistics
- M. Mitchell, X. Han, J. Dodge, A. Mensch, A. Goyal, A. Berg, K. Yamaguchi, T. Berg, K. Stratos, and H. Daume III. Midge: Generating image descriptions from computer vision detections. In Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, pages 747-756. Association for Computational Linguistics, 2012.
- (2012) Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics , pp. 747-756
- Mitchell, M.¹ Han, X.² Dodge, J.³ Mensch, A.⁴ Goyal, A.⁵ Berg, A.⁶ Yamaguchi, K.⁷ Berg, T.⁸ Stratos, K.⁹ Daume, H.¹⁰

38
- 85162522202
- Im2text: Describing images using 1 million captioned photographs
- V. Ordonez, G. Kulkarni, and T. L. Berg. Im2text: Describing images using 1 million captioned photographs. In Advances in Neural Information Processing Systems, pages 1143-1151, 2011.
- (2011) Advances in Neural Information Processing Systems , pp. 1143-1151
- Ordonez, V.¹ Kulkarni, G.² Berg, T.L.³

39
- 85133336275
- BLEU: A method for automatic evaluation of machine translation
- Association for Computational Linguistics
- K. Papineni, S. Roukos, T. Ward, and W.-J. Zhu. BLEU: a method for automatic evaluation of machine translation. In Proceedings of the 40th annual meeting on association for computational linguistics, pages 311-318. Association for Computational Linguistics, 2002.
- (2002) Proceedings of the 40th Annual Meeting on Association for Computational Linguistics , pp. 311-318
- Papineni, K.¹ Roukos, S.² Ward, T.³ Zhu, W.-J.⁴

40
- 80555140075
- Scikit-learn: Machine learning in Python
- F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, et al. Scikit-learn: Machine learning in Python. The Journal of Machine Learning Research, 12:2825-2830, 2011
- (2011) The Journal of Machine Learning Research , vol.12 , pp. 2825-2830
- Pedregosa, F.¹ Varoquaux, G.² Gramfort, A.³ Michel, V.⁴ Thirion, B.⁵ Grisel, O.⁶ Blondel, M.⁷ Prettenhofer, P.⁸ Weiss, R.⁹ Dubourg, V.¹⁰

41
- 85090348677
- Collecting image annotations using Amazon's Mechanical Turk
- Association for Computational Linguistics
- C. Rashtchian, P. Young, M. Hodosh, and J. Hockenmaier. Collecting image annotations using Amazon's Mechanical Turk. In Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, pages 139-147. Association for Computational Linguistics, 2010
- (2010) Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk , pp. 139-147
- Rashtchian, C.¹ Young, P.² Hodosh, M.³ Hockenmaier, J.⁴

42
- 34548133551
- Measuring visual clutter
- R. Rosenholtz, Y. Li, and L. Nakano. Measuring visual clutter. Journal of vision, 7(2):17, 2007
- (2007) Journal of Vision , vol.7 , Issue.2 , pp. 17
- Rosenholtz, R.¹ Li, Y.² Nakano, L.³

43
- 80052894348
- Image ranking and retrieval based on multi-attribute queries
- B. Siddiquie, R. S. Feris, and L. S. Davis. Image ranking and retrieval based on multi-attribute queries. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 801-808, 2011
- (2011) IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , pp. 801-808
- Siddiquie, B.¹ Feris, R.S.² Davis, L.S.³

44
- 54749098734
- M. Spain and P. Perona. Measuring and predicting importance of objects in our visual world. 2007
- (2007) Measuring and Predicting Importance of Objects in Our Visual World.
- Spain, M.¹ Perona, P.²

45
- 84898832240
- Attribute dominance: What pops out
- N. Turakhia and D. Parikh. Attribute dominance: What pops out? In IEEE International Conference on Computer Vision (ICCV), pages 1225-1232, 2013
- (2013) IEEE International Conference on Computer Vision (ICCV) , pp. 1225-1232
- Turakhia, N.¹ Parikh, D.²

46
- 84956980995
- CIDEr: Consensus-based image description evaluation
- R. Vedantam, C. L. Zitnick, and D. Parikh. CIDEr: Consensus-based Image Description Evaluation. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
- (2015) IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
- Vedantam, R.¹ Zitnick, C.L.² Parikh, D.³

47
- 84946747440
- Show and tell: A neural image caption generator
- O. Vinyals, A. Toshev, S. Bengio, and D. Erhan. Show and tell: A neural image caption generator. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
- (2015) IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
- Vinyals, O.¹ Toshev, A.² Bengio, S.³ Erhan, D.⁴

48
- 77953177673
- Joint learning of visual attributes, object classes and visual saliency
- G. Wang and D. Forsyth. Joint learning of visual attributes, object classes and visual saliency. In Computer Vision, 2009 IEEE 12th International Conference on, pages 537-544, 2009
- (2009) Computer Vision, 2009 IEEE 12th International Conference on , pp. 537-544
- Wang, G.¹ Forsyth, D.²

49
- 80052882164
- Query-specific visual semantic spaces for web image re-ranking
- X. Wang, K. Liu, and X. Tang. Query-specific visual semantic spaces for web image re-ranking. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 857-864, 2011
- (2011) IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , pp. 857-864
- Wang, X.¹ Liu, K.² Tang, X.³

50
- 77955988947
- Sun database: Large-scale scene recognition from abbey to zoo
- J. Xiao, J. Hays, K. A. Ehinger, A. Oliva, and A. Torralba. Sun database: Large-scale scene recognition from abbey to zoo. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 3485-3492, 2010
- (2010) IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , pp. 3485-3492
- Xiao, J.¹ Hays, J.² Ehinger, K.A.³ Oliva, A.⁴ Torralba, A.⁵

51
- 77954862144
- I2t: Image parsing to text description
- B. Z. Yao, X. Yang, L. Lin, M. W. Lee, and S.-C. Zhu. I2t: Image parsing to text description. Proceedings of the IEEE, 98(8):1485-1508, 2010
- (2010) Proceedings of the IEEE , vol.98 , Issue.8 , pp. 1485-1508
- Yao, B.Z.¹ Yang, X.² Lin, L.³ Lee, M.W.⁴ Zhu, S.-C.⁵

52
- 84937964578
- Learning deep features for scene recognition using places database
- B. Zhou, A. Lapedriza, J. Xiao, A. Torralba, and A. Oliva. Learning Deep Features for Scene Recognition using Places Database. NIPS, 2014
- (2014) NIPS
- Zhou, B.¹ Lapedriza, A.² Xiao, J.³ Torralba, A.⁴ Oliva, A.⁵

53
- 84887338442
- Bringing semantics into focus using visual abstraction
- C. L. Zitnick and D. Parikh. Bringing semantics into focus using visual abstraction. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 3009-3016, 2013.
- (2013) IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , pp. 3009-3016
- Zitnick, C.L.¹ Parikh, D.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.