SCOPUS 정보 검색 플랫폼

Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017

Volumn 2017-January, Issue , 2017, Pages 3298-3308

Detecting visual relationships with deep relational networks

(3) Dai, Bo a Zhang, Yuqi a Lin, Dahua a

a CHINESE UNIVERSITY OF HONG KONG (Hong Kong)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER VISION; DEEP LEARNING;

INDIVIDUAL OBJECTS; INTEGRATED FRAMEWORKS; LARGE DATASETS; LEARNING TECHNIQUES; RELATIONAL NETWORK; STATE OF THE ART; STATISTICAL DEPENDENCIES; VISUAL APPEARANCE;

PATTERN RECOGNITION;

EID: 85041892861 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/CVPR.2017.352 Document Type: Conference Paper

Times cited : (503)

References (64)

1
- 85035196624
- Cewu Lu, Ranjay Krishna, Michael Bernstein, and Li Fei-Fei. Visual relationship detection with language priors. arXiv preprint arXiv: 1608.00187, 2016.
- (2016) Visual Relationship Detection With Language Priors
- Lu, C.¹ Krishna, R.² Bernstein, M.³ Fei-Fei, L.⁴

2
- 84960980241
- Faster r-cnn: Towards real-time object detection with region proposal networks
- Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. Faster R-CNN: Towards real-time object detection with region proposal networks. In Advances in Neural Information Processing Systems (NIPS), 2015.
- (2015) Advances in Neural Information Processing Systems (NIPS)
- Ren, S.¹ He, K.² Girshick, R.³ Sun, J.⁴

3
- 85006390452
- Bolei Zhou, Aditya Khosla, Agata Lapedriza, Antonio Torralba, and Aude Oliva. Places: An image database for deep scene understanding. arXiv preprint arXiv: 1610.02055, 2016.
- (2016) Places: An image database for deep scene understanding
- Zhou, B.¹ Khosla, A.² Lapedriza, A.³ Torralba, A.⁴ Oliva, A.⁵

4
- 84911443783
- Panda: Pose aligned networks for deep attribute modeling
- Ning Zhang, Manohar Paluri, Marc'Aurelio Ranzato, Trevor Darrell, and Lubomir Bourdev. Panda: Pose aligned networks for deep attribute modeling. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 1637-1644, 2014.
- (2014) Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pp. 1637-1644
- Zhang, N.¹ Paluri, M.² Ranzato, M.³ Darrell, T.⁴ Bourdev, L.⁵

5
- 84978730111
- Ranjay Krishna, Yuke Zhu, Oliver Groth, Justin Johnson, Kenji Hata, Joshua Kravitz, Stephanie Chen, Yannis Kalantidis, Li-Jia Li, David A Shamma, Michael Bernstein, and Li Fei-Fei. Visual genome: Connecting language and vision using crowdsourced dense image annotations. 2016.
- (2016) Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations
- Krishna, R.¹ Zhu, Y.² Groth, O.³ Johnson, J.⁴ Hata, K.⁵ Kravitz, J.⁶ Chen, S.⁷ Kalantidis, Y.⁸ Li, L.-J.⁹ Shamma, D.A.¹⁰ Bernstein, M.¹¹ Fei-Fei, L.¹²

6
- 80052889458
- Recognition using visual phrases
- IEEE
- Mohammad Amin Sadeghi and Ali Farhadi. Recognition using visual phrases. In Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on, pages 1745-1752. IEEE, 2011.
- (2011) Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on , pp. 1745-1752
- Sadeghi, M.A.¹ Farhadi, A.²

7
- 25444533246
- John Lafferty, Andrew McCallum, and Fernando CN Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. 2001.
- (2001) Conditional random fields: Probabilistic models for segmenting and labeling sequence data
- Lafferty, J.¹ McCallum, A.² Pereira, F.C.N.³

8
- 57149125139
- Beyond nouns: Exploiting prepositions and comparative adjectives for learning visual classifiers
- Springer
- Abhinav Gupta and Larry S Davis. Beyond nouns: Exploiting prepositions and comparative adjectives for learning visual classifiers. In European conference on computer vision, pages 16-29. Springer, 2008.
- (2008) European conference on computer vision , pp. 16-29
- Gupta, A.¹ Davis, L.S.²

9
- 84959233256
- Image retrieval using scene graphs
- IEEE
- Justin Johnson, Ranjay Krishna, Michael Stark, Li-Jia Li, David A Shamma, Michael S Bernstein, and Li Fei-Fei. Image retrieval using scene graphs. In 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 3668-3678. IEEE, 2015.
- (2015) 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , pp. 3668-3678
- Johnson, J.¹ Krishna, R.² Stark, M.³ Li, L.-J.⁴ Shamma, D.A.⁵ Bernstein, M.S.⁶ Fei-Fei, L.⁷

10
- 51949110976
- Object categorization using co-occurrence, location and appearance
- IEEE
- Carolina Galleguillos, Andrew Rabinovich, and Serge Belongie. Object categorization using co-occurrence, location and appearance. In Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on, pages 1-8. IEEE, 2008.
- (2008) Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on , pp. 1-8
- Galleguillos, C.¹ Rabinovich, A.² Belongie, S.³

11
- 84887394346
- Understanding indoor scenes using 3d geometric phrases
- Wongun Choi, Yu-Wei Chao, Caroline Pantofaru, and Silvio Savarese. Understanding indoor scenes using 3d geometric phrases. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 33-40, 2013.
- (2013) Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pp. 33-40
- Choi, W.¹ Chao, Y.-W.² Pantofaru, C.³ Savarese, S.⁴

12
- 80052901011
- Baby talk: Understanding and generating image descriptions
- Girish Kulkarni, Visruth Premraj, Sagnik Dhar, Siming Li, Yejin Choi, Alexander C Berg, and Tamara L Berg. Baby talk: Understanding and generating image descriptions. In Proceedings of the 24th CVPR. Citeseer, 2011.
- (2011) Proceedings of the 24th CVPR. Citeseer
- Kulkarni, G.¹ Premraj, V.² Dhar, S.³ Li, S.⁴ Choi, Y.⁵ Berg, A.C.⁶ Berg, T.L.⁷

13
- 77955987964
- Grouplet: A structured image representation for recognizing human and object interactions
- IEEE
- Bangpeng Yao and Li Fei-Fei. Grouplet: A structured image representation for recognizing human and object interactions. In Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on, page 916. IEEE, 2010.
- (2010) Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on , pp. 916
- Yao, B.¹ Fei-Fei, L.²

14
- 84973872492
- Contextual action recognition with r cnn
- Georgia Gkioxari, Ross Girshick, and Jitendra Malik. Contextual action recognition with r cnn. In Proceedings of the IEEE International Conference on Computer Vision, pages 1080-1088, 2015.
- (2015) Proceedings of the IEEE International Conference on Computer Vision , pp. 1080-1088
- Gkioxari, G.¹ Girshick, R.² Malik, J.³

15
- 84898785648
- Grounding action descriptions in videos
- Michaela Regneri, Marcus Rohrbach, Dominikus Wetzel, Stefan Thater, Bernt Schiele, and Manfred Pinkal. Grounding action descriptions in videos. Transactions of the Association for Computational Linguistics, 1: 25-36, 2013.
- (2013) Transactions of the Association for Computational Linguistics , vol.1 , pp. 25-36
- Regneri, M.¹ Rohrbach, M.² Wetzel, D.³ Thater, S.⁴ Schiele, B.⁵ Pinkal, M.⁶

16
- 84959932469
- Integrating language and vision to generate natural language descriptions of videos in the wild
- Jesse Thomason, Subhashini Venugopalan, Sergio Guadarrama, Kate Saenko, and Raymond J Mooney. Integrating language and vision to generate natural language descriptions of videos in the wild. In COLING, volume 2, page 9, 2014.
- (2014) COLING , vol.2 , pp. 9
- Thomason, J.¹ Venugopalan, S.² Guadarrama, S.³ Saenko, K.⁴ Mooney, R.J.⁵

17
- 84959233994
- Learning semantic relationships for better action retrieval in images
- IEEE
- Vignesh Ramanathan, Congcong Li, Jia Deng, Wei Han, Zhen Li, Kunlong Gu, Yang Song, Samy Bengio, Chuck Rossenberg, and Li Fei-Fei. Learning semantic relationships for better action retrieval in images. In 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 1100-1109. IEEE, 2015.
- (2015) 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , pp. 1100-1109
- Ramanathan, V.¹ Li, C.² Deng, J.³ Han, W.⁴ Li, Z.⁵ Gu, K.⁶ Song, Y.⁷ Bengio, S.⁸ Rossenberg, C.⁹ Fei-Fei, L.¹⁰

18
- 84898775239
- Translating video content to natural language descriptions
- Marcus Rohrbach, Wei Qiu, Ivan Titov, Stefan Thater, Manfred Pinkal, and Bernt Schiele. Translating video content to natural language descriptions. In Proceedings of the IEEE International Conference on Computer Vision, pages 433-440, 2013.
- (2013) Proceedings of the IEEE International Conference on Computer Vision , pp. 433-440
- Rohrbach, M.¹ Qiu, W.² Titov, I.³ Thater, S.⁴ Pinkal, M.⁵ Schiele, B.⁶

19
- 84898773262
- Youtube2text: Recognizing and describing arbitrary activities using semantic hierarchies and zero-shot recognition
- Sergio Guadarrama, Niveda Krishnamoorthy, Girish Malkarnenkar, Subhashini Venugopalan, Raymond Mooney, Trevor Darrell, and Kate Saenko. Youtube2text: Recognizing and describing arbitrary activities using semantic hierarchies and zero-shot recognition. In Proceedings of the IEEE International Conference on Computer Vision, pages 2712-2719, 2013.
- (2013) Proceedings of the IEEE International Conference on Computer Vision , pp. 2712-2719
- Guadarrama, S.¹ Krishnamoorthy, N.² Malkarnenkar, G.³ Venugopalan, S.⁴ Mooney, R.⁵ Darrell, T.⁶ Saenko, K.⁷

20
- 84906498422
- Zero-shot learning via visual abstraction
- Springer
- Stanislaw Antol, C Lawrence Zitnick, and Devi Parikh. Zero-shot learning via visual abstraction. In European Conference on Computer Vision, pages 401-416. Springer, 2014.
- (2014) European Conference on Computer Vision , pp. 401-416
- Antol, S.¹ Zitnick, C.L.² Parikh, D.³

21
- 85044252753
- Mohamed Elhoseiny, Scott Cohen, Walter Chang, Brian Price, and Ahmed Elgammal. Sherlock: Scalable fact learning in images. arXiv preprint arXiv: 1511.04891, 2015.
- (2015) Sherlock: Scalable Fact Learning in Images
- Elhoseiny, M.¹ Cohen, S.² Chang, W.³ Price, B.⁴ Elgammal, A.⁵

22
- 78149311145
- Every picture tells a story: Generating sentences from images
- Springer
- Ali Farhadi, Mohsen Hejrati, Mohammad Amin Sadeghi, Peter Young, Cyrus Rashtchian, Julia Hockenmaier, and David Forsyth. Every picture tells a story: Generating sentences from images. In European Conference on Computer Vision, pages 15-29. Springer, 2010.
- (2010) European Conference on Computer Vision , pp. 15-29
- Farhadi, A.¹ Hejrati, M.² Sadeghi, M.A.³ Young, P.⁴ Rashtchian, C.⁵ Hockenmaier, J.⁶ Forsyth, D.⁷

23
- 84959226544
- Recognize complex events from static images by fusing deep channels
- Yuanjun Xiong, Kai Zhu, Dahua Lin, and Xiaoou Tang. Recognize complex events from static images by fusing deep channels. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 1600-1609, 2015.
- (2015) Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pp. 1600-1609
- Xiong, Y.¹ Zhu, K.² Lin, D.³ Tang, X.⁴

24
- 78651403274
- Context based object categorization: A critical survey
- Carolina Galleguillos and Serge Belongie. Context based object categorization: A critical survey. Computer Vision and Image Understanding, 114(6): 712-722, 2010.
- (2010) Computer Vision and Image Understanding , vol.114 , Issue.6 , pp. 712-722
- Galleguillos, C.¹ Belongie, S.²

25
- 33745938597
- Discovering objects and their location in images
- IEEE
- Josef Sivic, Bryan C Russell, Alexei A Efros, Andrew Zisserman, and William T Freeman. Discovering objects and their location in images. In Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1, volume 1, pages 370-377. IEEE, 2005.
- (2005) Tenth IEEE International Conference on Computer Vision (ICCV'05) , vol.1 , Issue.1 , pp. 370-377
- Sivic, J.¹ Russell, B.C.² Efros, A.A.³ Zisserman, A.⁴ Freeman, W.T.⁵

26
- 77955997860
- Efficiently selecting regions for scene understanding
- IEEE
- MPawan Kumar and Daphne Koller. Efficiently selecting regions for scene understanding. In Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on, pages 3217-3224. IEEE, 2010.
- (2010) Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on , pp. 3217-3224
- Kumar, M.P.¹ Koller, D.²

27
- 77956006912
- Exploiting hierarchical context on a large database of object categories
- IEEE
- Myung Jin Choi, Joseph J Lim, Antonio Torralba, and Alan S Willsky. Exploiting hierarchical context on a large database of object categories. In Computer vision and pattern recognition (CVPR), 2010 IEEE conference on, pages 129-136. IEEE, 2010.
- (2010) Computer vision and pattern recognition (CVPR), 2010 IEEE conference on , pp. 129-136
- Choi, M.J.¹ Lim, J.J.² Torralba, A.³ Willsky, A.S.⁴

28
- 78149343534
- Graph cut based inference with cooccurrence statistics
- Springer
- Lubor Ladicky, Chris Russell, Pushmeet Kohli, and Philip HS Torr. Graph cut based inference with cooccurrence statistics. In European Conference on Computer Vision, pages 239-253. Springer, 2010.
- (2010) European Conference on Computer Vision , pp. 239-253
- Ladicky, L.¹ Russell, C.² Kohli, P.³ Torr, P.H.S.⁴

29
- 80052905403
- Learning to share visual appearance for multiclass object detection
- IEEE
- Ruslan Salakhutdinov, Antonio Torralba, and Josh Tenenbaum. Learning to share visual appearance for multiclass object detection. In Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on, pages 1481-1488. IEEE, 2011.
- (2011) Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on , pp. 1481-1488
- Salakhutdinov, R.¹ Torralba, A.² Tenenbaum, J.³

30
- 50649096757
- Objects in context
- IEEE
- Andrew Rabinovich, Andrea Vedaldi, Carolina Galleguillos, Eric Wiewiora, and Serge Belongie. Objects in context. In 2007 IEEE 11th International Conference on Computer Vision, pages 1-8. IEEE, 2007.
- (2007) 2007 IEEE 11th International Conference on Computer Vision , pp. 1-8
- Rabinovich, A.¹ Vedaldi, A.² Galleguillos, C.³ Wiewiora, E.⁴ Belongie, S.⁵

31
- 35148867545
- Towards scalable representations of object categories: Learning a hierarchy of parts
- IEEE
- Sanja Fidler and Ales Leonardis. Towards scalable representations of object categories: Learning a hierarchy of parts. In 2007 IEEE Conference on Computer Vision and Pattern Recognition, pages 1-8. IEEE, 2007.
- (2007) 2007 IEEE Conference on Computer Vision and Pattern Recognition , pp. 1-8
- Fidler, S.¹ Leonardis, A.²

32
- 33845596932
- Using multiple segmentations to discover objects and their extent in image collections
- IEEE
- Bryan C Russell, William T Freeman, Alexei A Efros, Josef Sivic, and Andrew Zisserman. Using multiple segmentations to discover objects and their extent in image collections. In 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06), volume 2, pages 1605-1614. IEEE, 2006.
- (2006) 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06) , vol.2 , pp. 1605-1614
- Russell, B.C.¹ Freeman, W.T.² Efros, A.A.³ Sivic, J.⁴ Zisserman, A.⁵

33
- 84911410734
- Costa: Co-occurrence statistics for zero-shot classification
- Thomas Mensink, Efstratios Gavves, and Cees GM Snoek. Costa: Co-occurrence statistics for zero-shot classification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 2441-2448, 2014.
- (2014) Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pp. 2441-2448
- Mensink, T.¹ Gavves, E.² Snoek, C.G.M.³

34
- 84894905366
- A multi-view embedding space for modeling internet images, tags, and their semantics
- Yunchao Gong, Qifa Ke, Michael Isard, and Svetlana Lazebnik. A multi-view embedding space for modeling internet images, tags, and their semantics. International journal of computer vision, 106(2): 210233, 2014.
- (2014) International journal of computer vision , vol.106 , Issue.2 , pp. 210233
- Gong, Y.¹ Ke, Q.² Isard, M.³ Lazebnik, S.⁴

35
- 84898772194
- Learning the visual interpretation of sentences
- C Lawrence Zitnick, Devi Parikh, and Lucy Vanderwende. Learning the visual interpretation of sentences. In Proceedings of the IEEE International Conference on Computer Vision, pages 1681-1688, 2013.
- (2013) Proceedings of the IEEE International Conference on Computer Vision , pp. 1681-1688
- Zitnick, C.L.¹ Parikh, D.² Vanderwende, L.³

36
- 51349086291
- Putting objects in perspective
- Derek Hoiem, Alexei A Efros, and Martial Hebert. Putting objects in perspective. International Journal of Computer Vision, 80(1): 3-15, 2008.
- (2008) International Journal of Computer Vision , vol.80 , Issue.1 , pp. 3-15
- Hoiem, D.¹ Efros, A.A.² Hebert, M.³

37
- 84990046210
- Semantic parsing for text to 3d scene generation
- Angel X Chang, Manolis Savva, and Christopher D Manning. Semantic parsing for text to 3d scene generation. ACL 2014, page 17, 2014.
- (2014) ACL 2014 , pp. 17
- Chang, A.X.¹ Savva, M.² Manning, C.D.³

38
- 84866687133
- Describing the scene as a whole: Joint object detection, scene classification and semantic segmentation
- IEEE
- Jian Yao, Sanja Fidler, and Raquel Urtasun. Describing the scene as a whole: Joint object detection, scene classification and semantic segmentation. In Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on, pages 702-709. IEEE, 2012.
- (2012) Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on , pp. 702-709
- Yao, J.¹ Fidler, S.² Urtasun, R.³

39
- 84911457822
- Incorporating scene context and object layout into appearance modeling
- IEEE
- Hamid Izadinia, Fereshteh Sadeghi, and Ali Farhadi. Incorporating scene context and object layout into appearance modeling. In 2014 IEEE Conference on Computer Vision and Pattern Recognition, pages 232-239. IEEE, 2014.
- (2014) 2014 IEEE Conference on Computer Vision and Pattern Recognition , pp. 232-239
- Izadinia, H.¹ Sadeghi, F.² Farhadi, A.³

40
- 52449123642
- Multi-class segmentation with relative location prior
- Stephen Gould, Jim Rodgers, David Cohen, Gal Elidan, and Daphne Koller. Multi-class segmentation with relative location prior. International Journal of Computer Vision, 80(3): 300-316, 2008.
- (2008) International Journal of Computer Vision , vol.80 , Issue.3 , pp. 300-316
- Gould, S.¹ Rodgers, J.² Cohen, D.³ Elidan, G.⁴ Koller, D.⁵

41
- 84866726859
- Understanding and predicting importance in images
- IEEE
- Alexander C Berg, Tamara L Berg, Hal Daume, Jesse Dodge, Amit Goyal, Xufeng Han, Alyssa Mensch, Margaret Mitchell, Aneesh Sood, Karl Stratos, et al. Understanding and predicting importance in images. In Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on, pages 3562-3569. IEEE, 2012.
- (2012) Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on , pp. 3562-3569
- Berg, A.C.¹ Berg, T.L.² Daume, H.³ Dodge, J.⁴ Goyal, A.⁵ Han, X.⁶ Mensch, A.⁷ Mitchell, M.⁸ Sood, A.⁹ Stratos, K.¹⁰

42
- 84973856017
- Flickr30k entities: Collecting region-to-phrase correspondences for richer image-to-sentence models
- Bryan A Plummer, Liwei Wang, Chris M Cervantes, Juan C Caicedo, Julia Hockenmaier, and Svetlana Lazebnik. Flickr30k entities: Collecting region-to-phrase correspondences for richer image-to-sentence models. In Proceedings of the IEEE International Conference on Computer Vision, pages 2641-2649, 2015.
- (2015) Proceedings of the IEEE International Conference on Computer Vision , pp. 2641-2649
- Plummer, B.A.¹ Wang, L.² Cervantes, C.M.³ Caicedo, J.C.⁴ Hockenmaier, J.⁵ Lazebnik, S.⁶

43
- 84937843643
- Deep fragment embeddings for bidirectional image sentence mapping
- Andrej Karpathy, Armand Joulin, and Fei Fei F Li. Deep fragment embeddings for bidirectional image sentence mapping. In Advances in neural information processing systems, pages 1889-1897, 2014.
- (2014) Advances in neural information processing systems , pp. 1889-1897
- Karpathy, A.¹ Joulin, A.² Fei-Fei, L.³

44
- 84986327251
- Anna Rohrbach, Marcus Rohrbach, Ronghang Hu, Trevor Darrell, and Bernt Schiele. Grounding of textual phrases in images by reconstruction. arXiv preprint arXiv: 1511.03745, 2015.
- (2015) Grounding of Textual Phrases in Images By Reconstruction
- Rohrbach, A.¹ Rohrbach, M.² Hu, R.³ Darrell, T.⁴ Schiele, B.⁵

45
- 84887345951
- A thousand frames in just a few words: Lingual description of videos through latent topics and sparse object stitching
- Pradipto Das, Chenliang Xu, Richard F Doell, and Jason J Corso. A thousand frames in just a few words: Lingual description of videos through latent topics and sparse object stitching. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 2634-2641, 2013.
- (2013) Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pp. 2634-2641
- Das, P.¹ Xu, C.² Doell, R.F.³ Corso, J.J.⁴

46
- 84911368326
- Learning everything about anything: Webly-supervised visual concept learning
- Santosh K Divvala, Ali Farhadi, and Carlos Guestrin. Learning everything about anything: Webly-supervised visual concept learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 3270-3277, 2014.
- (2014) Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pp. 3270-3277
- Divvala, S.K.¹ Farhadi, A.² Guestrin, C.³

47
- 84973926486
- Learning common sense through visual abstraction
- Ramakrishna Vedantam, Xiao Lin, Tanmay Batra, C Lawrence Zitnick, and Devi Parikh. Learning common sense through visual abstraction. In Proceedings of the IEEE International Conference on Computer Vision, pages 2542-2550, 2015.
- (2015) Proceedings of the IEEE International Conference on Computer Vision , pp. 2542-2550
- Vedantam, R.¹ Lin, X.² Batra, T.³ Zitnick, C.L.⁴ Parikh, D.⁵

48
- 84959250180
- From captions to visual concepts and back
- Hao Fang, Saurabh Gupta, Forrest Iandola, Rupesh K Srivastava, Li Deng, Piotr Dollár, Jianfeng Gao, Xiaodong He, Margaret Mitchell, John C Platt, et al. From captions to visual concepts and back. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 1473-1482, 2015.
- (2015) Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pp. 1473-1482
- Fang, H.¹ Gupta, S.² Iandola, F.³ Srivastava, R.K.⁴ Deng, L.⁵ Dollár, P.⁶ Gao, J.⁷ He, X.⁸ Mitchell, M.⁹ Platt, J.C.¹⁰

49
- 84933585162
- Very deep convolutional networks for large-scale image recognition
- abs/1409.1556
- K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. CoRR, abs/1409.1556, 2014.
- (2014) CoRR
- Simonyan, K.¹ Zisserman, A.²

50
- 84958589374
- Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep residual learning for image recognition. arXiv preprint arXiv: 1512.03385, 2015.
- (2015) Deep Residual Learning For Image Recognition
- He, K.¹ Zhang, X.² Ren, S.³ Sun, J.⁴

51
- 84973861983
- Conditional random fields as recurrent neural networks
- Shuai Zheng, Sadeep Jayasumana, Bernardino Romera-Paredes, Vibhav Vineet, Zhizhong Su, Dalong Du, Chang Huang, and Philip HS Torr. Conditional random fields as recurrent neural networks. In Proceedings of the IEEE International Conference on Computer Vision, pages 1529-1537, 2015.
- (2015) Proceedings of the IEEE International Conference on Computer Vision , pp. 1529-1537
- Zheng, S.¹ Jayasumana, S.² Romera-Paredes, B.³ Vineet, V.⁴ Su, Z.⁵ Du, D.⁶ Huang, C.⁷ Torr, P.H.S.⁸

52
- 33745824894
- Conditional random fields for object recognition
- Ariadna Quattoni, Michael Collins, and Trevor Darrell. Conditional random fields for object recognition. In Advances in neural information processing systems, pages 1097-1104, 2004.
- (2004) Advances in neural information processing systems , pp. 1097-1104
- Quattoni, A.¹ Collins, M.² Darrell, T.³

53
- 85162351107
- Efficient inference in fully connected crfs with Gaussian edge potentials
- Vladlen Koltun. Efficient inference in fully connected crfs with gaussian edge potentials. Adv. Neural Inf. Process. Syst, 2011.
- (2011) Adv. Neural Inf. Process. Syst
- Koltun, V.¹

54
- 0003391330
- Judea Pearl. Probabilistic reasoning in intelligent systems: Networks of plausible reasoning, 1988.
- (1988) Probabilistic Reasoning in Intelligent Systems: Networks Of Plausible Reasoning
- Pearl, J.¹

55
- 84969930631
- Learning deep structured models
- Liang-Chieh Chen, Alexander G Schwing, Alan L Yuille, and Raquel Urtasun. Learning deep structured models. In Proc. ICML, 2015.
- (2015) Proc. ICML
- Chen, L.-C.¹ Schwing, A.G.² Yuille, A.L.³ Urtasun, R.⁴

56
- 84959180722
- Alexander G Schwing and Raquel Urtasun. Fully connected deep structured networks. arXiv preprint arXiv: 1503.02351, 2015.
- (2015) Fully Connected Deep Structured Networks , pp. 02351
- Schwing, A.G.¹ Urtasun, R.²

57
- 85056505876
- David Belanger and Andrew McCallum. Structured prediction energy networks. arXiv preprint arXiv: 1511.06350, 2015.
- (2015) Structured Prediction Energy Networks
- Belanger, D.¹ McCallum, A.²

58
- 84990066623
- Deep markov random field for image modeling
- Springer
- Zhirong Wu, Dahua Lin, and Xiaoou Tang. Deep markov random field for image modeling. In European Conference on Computer Vision, pages 295-312. Springer, 2016.
- (2016) European Conference on Computer Vision , pp. 295-312
- Wu, Z.¹ Lin, D.² Tang, X.³

59
- 84913555165
- Yangqing Jia, Evan Shelhamer, Jeff Donahue, Sergey Karayev, Jonathan Long, Ross Girshick, Sergio Guadarrama, and Trevor Darrell. Caffe: Convolutional architecture for fast feature embedding. arXiv preprint arXiv: 1408.5093, 2014.
- (2014) Caffe: Convolutional Architecture for Fast Feature Embedding
- Jia, Y.¹ Shelhamer, E.² Donahue, J.³ Karayev, S.⁴ Long, J.⁵ Girshick, R.⁶ Guadarrama, S.⁷ Darrell, T.⁸

60
- 77955422240
- Object detection with discriminatively trained part based models
- P. F. Felzenszwalb, R. B. Girshick, D. McAllester, and D. Ramanan. Object detection with discriminatively trained part based models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32(9): 1627-1645, 2010.
- (2010) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.32 , Issue.9 , pp. 1627-1645
- Felzenszwalb, P.F.¹ Girshick, R.B.² McAllester, D.³ Ramanan, D.⁴

61
- 85021678581
- Peter Anderson, Basura Fernando, Mark Johnson, and Stephen Gould. Spice: Semantic propositional image caption evaluation. arXiv preprint arXiv: 1607.08822, 2016.
- (2016) Spice: Semantic Propositional Image Caption Evaluation
- Anderson, P.¹ Fernando, B.² Johnson, M.³ Gould, S.⁴

62
- 84990053150
- Somak Aditya, Yezhou Yang, Chitta Baral, Cornelia Fermuller, and Yiannis Aloimonos. From images to sentences through scene description graphs using commonsense reasoning and knowledge. arXiv preprint arXiv: 1511.03292, 2015.
- (2015) From Images to Sentences Through Scene Description Graphs Using Commonsense Reasoning and Knowledge
- Aditya, S.¹ Yang, Y.² Baral, C.³ Fermuller, C.⁴ Aloimonos, Y.⁵

63
- 85044286206
- Qi Wu, Damien Teney, Peng Wang, Chunhua Shen, Anthony Dick, and Anton van den Hengel. Visual question answering: A survey of methods and datasets. arXiv preprint arXiv: 1607.05910, 2016.
- (2016) Visual Question Answering: A Survey of Methods and Datasets
- Qi, Wu.¹ Teney, D.² Wang, P.³ Shen, C.⁴ Dick, A.⁵ Van Den Hengel, A.⁶

64
- 7044240786
- Measuring the similarity of labeled graphs
- Springer
- Pierre-Antoine Champin and Christine Solnon. Measuring the similarity of labeled graphs. In International Conference on Case-Based Reasoning, pages 80-95. Springer, 2003.
- (2003) International Conference on Case-Based Reasoning , pp. 80-95
- Champin, P.-A.¹ Solnon, C.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.