SCOPUS Information Retrieval Platform

1
- 84973890960
- Vqa: Visual question answering
- Antol, S., Agrawal, A., Lu, J., Mitchell, M., Batra, D., Zitnick, C. L., & Parikh, D. (2015). Vqa: Visual question answering. In International Conference on Computer Vision.
- (2015) International Conference on Computer Vision
- Antol, S.¹ Agrawal, A.² Lu, J.³ Mitchell, M.⁴ Batra, D.⁵ Zitnick, C.L.⁶ Parikh, D.⁷

2
- 85116156579
- METEOR: An automatic metric for MT evaluation with improved correlation with human judgments
- Banerjee, S., & Lavie, A. (2005). METEOR: An Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments. In Annual Meeting of the Association for Computational Linguistics Workshop on Intrinsic and Extrinsic Evaluation Measures for MT and/or Summarization.
- (2005) Annual Meeting of the Association for Computational Linguistics Workshop on Intrinsic and Extrinsic Evaluation Measures for MT And/or Summarization
- Banerjee, S.¹ Lavie, A.²

3
- 80052896727
- Automatic attribute discovery and characterization from noisy web data
- Berg, T. L., Berg, A. C., & Shih, J. (2010). Automatic attribute discovery and characterization from noisy web data. In European Conference on Computer Vision.
- (2010) European Conference on Computer Vision
- Berg, T.L.¹ Berg, A.C.² Shih, J.³

4
- 85072028231
- Return of the devil in the details: Delving deep into convolutional nets
- Chatfield, K., Simonyan, K., Vedaldi, A., & Zisserman, A. (2014). Return of the devil in the details: Delving deep into convolutional nets. In British Machine Vision Conference.
- (2014) British Machine Vision Conference
- Chatfield, K.¹ Simonyan, K.² Vedaldi, A.³ Zisserman, A.⁴

5
- 84959908834
- Déjà image-captions: A corpus of expressive descriptions in repetition
- Chen, J., Kuznetsova, P., Warren, D., & Choi, Y. (2015). Déjà image-captions: A corpus of expressive descriptions in repetition. In North American Chapter of the Association for Computational Linguistics.
- (2015) North American Chapter of the Association for Computational Linguistics
- Chen, J.¹ Kuznetsova, P.² Warren, D.³ Choi, Y.⁴

6
- 84957029470
- Mind's eye: A recurrent visual representation for image caption generation
- Chen, X., & Zitnick, C. L. (2015). Mind's eye: A recurrent visual representation for image caption generation. In IEEE Conference on Computer Vision and Pattern Recognition.
- (2015) IEEE Conference on Computer Vision and Pattern Recognition
- Chen, X.¹ Zitnick, C.L.²

7
- 85026937778
- Dale, R., & White, M. E. (Eds.). (2007). Workshop on Shared Tasks and Comparative Evaluation in Natural Language Generation: Position Papers.
- (2007) Workshop on Shared Tasks and Comparative Evaluation in Natural Language Generation: Position Papers
- Dale, R.¹ White, M.E.²

8
- 85107661995
- Meteor universal: Language specific translation evaluation for any target language
- Denkowski, M., & Lavie, A. (2014). Meteor Universal: Language Specific Translation Evaluation for Any Target Language. In Conference of the European Chapter of the Association for Computational Linguistics Workshop on Statistical Machine Translation.
- (2014) Conference of the European Chapter of the Association for Computational Linguistics Workshop on Statistical Machine Translation
- Denkowski, M.¹ Lavie, A.²

9
- 84944096380
- Language models for image captioning: The quirks and what works
- Devlin, J., Cheng, H., Fang, H., Gupta, S., Deng, L., He, X., Zweig, G., & Mitchell, M. (2015). Language Models for Image Captioning: The Quirks and What Works. In Annual Meeting of the Association for Computational Linguistics.
- (2015) Annual Meeting of the Association for Computational Linguistics
- Devlin, J.¹ Cheng, H.² Fang, H.³ Gupta, S.⁴ Deng, L.⁵ He, X.⁶ Zweig, G.⁷ Mitchell, M.⁸

10
- 84959236502
- Long-term recurrent convolutional networks for visual recognition and description
- Donahue, J., Hendricks, L. A., Guadarrama, S., Rohrbach, M., Venugopalan, S., Saenko, K., & Darrell, T. (2015). Long-term recurrent convolutional networks for visual recognition and description. In IEEE Conference on Computer Vision and Pattern Recognition.
- (2015) IEEE Conference on Computer Vision and Pattern Recognition
- Donahue, J.¹ Hendricks, L.A.² Guadarrama, S.³ Rohrbach, M.⁴ Venugopalan, S.⁵ Saenko, K.⁶ Darrell, T.⁷

11
- 84943812736
- Describing images using inferred visual dependency representations
- Elliott, D., & de Vries, A. P. (2015). Describing images using inferred visual dependency representations. In Annual Meeting of the Association for Computational Linguistics.
- (2015) Annual Meeting of the Association for Computational Linguistics
- Elliott, D.¹ De Vries, A.P.²

12
- 84906929591
- Image description using visual dependency representations
- Elliott, D., & Keller, F. (2013). Image Description using Visual Dependency Representations. In Conference on Empirical Methods in Natural Language Processing.
- (2013) Conference on Empirical Methods in Natural Language Processing
- Elliott, D.¹ Keller, F.²

13
- 84906928552
- Comparing automatic evaluation measures for image description
- Elliott, D., & Keller, F. (2014). Comparing Automatic Evaluation Measures for Image Description. In Annual Meeting of the Association for Computational Linguistics.
- (2014) Annual Meeting of the Association for Computational Linguistics
- Elliott, D.¹ Keller, F.²

14
- 84943810574
- Query-by-Example Image Retrieval using Visual Dependency Representations
- Elliott, D., Lavrenko, V., & Keller, F. (2014). Query-by-Example Image Retrieval using Visual Dependency Representations. In International Conference on Computational Linguistics.
- (2014) International Conference on Computational Linguistics
- Elliott, D.¹ Lavrenko, V.² Keller, F.³

15
- 77951298115
- The PASCAL Visual Object Classes (VOC) Challenge
- Everingham, M., Van Gool, L., Williams, C. K. I., Winn, J., & Zisserman, A. (2010). The PASCAL Visual Object Classes (VOC) Challenge. International Journal of Computer Vision, 88 (2), 303-338.
- (2010) International Journal of Computer Vision , vol.88 , Issue.2 , pp. 303-338
- Everingham, M.¹ Van Gool, L.² Williams, C.K.I.³ Winn, J.⁴ Zisserman, A.⁵

16
- 84906932025
- Paraphrase-driven learning for open question answering
- Fader, A., Zettlemoyer, L., & Etzioni, O. (2013). Paraphrase-driven learning for open question answering. In Annual Meeting of the Association for Computational Linguistics.
- (2013) Annual Meeting of the Association for Computational Linguistics
- Fader, A.¹ Zettlemoyer, L.² Etzioni, O.³

17
- 84907031424
- Open question answering over curated and extracted knowledge bases
- Fader, A., Zettlemoyer, L., & Etzioni, O. (2014). Open question answering over curated and extracted knowledge bases. In ACM SIGKDD Conference on Knowledge Discovery and Data Mining.
- (2014) ACM SIGKDD Conference on Knowledge Discovery and Data Mining
- Fader, A.¹ Zettlemoyer, L.² Etzioni, O.³

18
- 84959250180
- From captions to visual concepts and back
- Fang, H., Gupta, S., Iandola, F., Srivastava, R., Deng, L., Dollár, P., Gao, J., He, X., Mitchell, M., Platt, J., Zitnick, C. L., & Zweig, G. (2015). From captions to visual concepts and back. In IEEE Conference on Computer Vision and Pattern Recognition.
- (2015) IEEE Conference on Computer Vision and Pattern Recognition
- Fang, H.¹ Gupta, S.² Iandola, F.³ Srivastava, R.⁴ Deng, L.⁵ Dollár, P.⁶ Gao, J.⁷ He, X.⁸ Mitchell, M.⁹ Platt, J.¹⁰ Zitnick, C.L.¹¹ Zweig, G.¹²

19
- 80052017343
- Every picture tells a story: Generating sentences from images
- Farhadi, A., Hejrati, M., Sadeghi, M. A., Young, P., Rashtchian, C., Hockenmaier, J., & Forsyth, D. (2010). Every picture tells a story: Generating sentences from images. In European Conference on Computer Vision.
- (2010) European Conference on Computer Vision
- Farhadi, A.¹ Hejrati, M.² Sadeghi, M.A.³ Young, P.⁴ Rashtchian, C.⁵ Hockenmaier, J.⁶ Forsyth, D.⁷

20
- 77955422240
- Object detection with discriminatively trained part-based models
- Felzenszwalb, P. F., Girshick, R. B., McAllester, D., & Ramanan, D. (2010). Object detection with discriminatively trained part-based models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32 (9), 1627-1645.
- (2010) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.32 , Issue.9 , pp. 1627-1645
- Felzenszwalb, P.F.¹ Girshick, R.B.² McAllester, D.³ Ramanan, D.⁴

21
- 84859924694
- Automatic image annotation using auxiliary text information
- Feng, Y., & Lapata, M. (2008). Automatic Image Annotation Using Auxiliary Text Information. In Annual Meeting of the Association for Computational Linguistics.
- (2008) Annual Meeting of the Association for Computational Linguistics
- Feng, Y.¹ Lapata, M.²

22
- 84874541449
- Automatic caption generation for news images
- Feng, Y., & Lapata, M. (2013). Automatic caption generation for news images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35 (4), 797-812.
- (2013) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.35 , Issue.4 , pp. 797-812
- Feng, Y.¹ Lapata, M.²

23
- 84959904882
- A survey of current datasets for vision and language research
- Ferraro, F., Mostafazadeh, N., Huang, T., Vanderwende, L., Devlin, J., Galley, M., & Mitchell, M. (2015). A survey of current datasets for vision and language research. In Conference on Empirical Methods in Natural Language Processing.
- (2015) Conference on Empirical Methods in Natural Language Processing
- Ferraro, F.¹ Mostafazadeh, N.² Huang, T.³ Vanderwende, L.⁴ Devlin, J.⁵ Galley, M.⁶ Mitchell, M.⁷

24
- 84965148420
- Are you talking to a machine? Dataset and methods for multilingual image question answering
- Gao, H., Mao, J., Zhou, J., Huang, Z., & Yuille, A. (2015). Are you talking to a machine? Dataset and methods for multilingual image question answering. In International Conference on Learning Representations.
- (2015) International Conference on Learning Representations
- Gao, H.¹ Mao, J.² Zhou, J.³ Huang, Z.⁴ Yuille, A.⁵

25
- 84925422907
- Visual turing test for computer vision systems
- Geman, D., Geman, S., Hallonquist, N., & Younes, L. (2015). Visual turing test for computer vision systems. Proceedings of the National Academy of Sciences, 112 (12), 3618-3623.
- (2015) Proceedings of the National Academy of Sciences , vol.112 , Issue.12 , pp. 3618-3623
- Geman, D.¹ Geman, S.² Hallonquist, N.³ Younes, L.⁴

26
- 84911400494
- Rich feature hierarchies for accurate object detection and semantic segmentation
- Girshick, R., Donahue, J., Darrell, T., & Malik, J. (2014). Rich feature hierarchies for accurate object detection and semantic segmentation. In IEEE Conference on Computer Vision and Pattern Recognition.
- (2014) IEEE Conference on Computer Vision and Pattern Recognition
- Girshick, R.¹ Donahue, J.² Darrell, T.³ Malik, J.⁴

27
- 84959243872
- Improving image-sentence embeddings using large weakly annotated photo collections
- Gong, Y., Wang, L., Hodosh, M., Hockenmaier, J., & Lazebnik, S. (2014). Improving Image-Sentence Embeddings Using Large Weakly Annotated Photo Collections. In European Conference on Computer Vision.
- (2014) European Conference on Computer Vision
- Gong, Y.¹ Wang, L.² Hodosh, M.³ Hockenmaier, J.⁴ Lazebnik, S.⁵

28
- 38049183286
- The IAPR TC-12 benchmark: A new evaluation resource for visual information systems
- Grubinger, M., Clough, P., Müller, H., & Deselaers, T. (2006). The IAPR TC-12 benchmark: A new evaluation resource for visual information systems. In International Conference on Language Resources and Evaluation.
- (2006) International Conference on Language Resources and Evaluation
- Grubinger, M.¹ Clough, P.² Müller, H.³ Deselaers, T.⁴

29
- 84898773262
- Youtube2text: Recognizing and describing arbitrary activities using semantic hierarchies and zero-shot recognition
- Guadarrama, S., Krishnamoorthy, N., Malkarnenkar, G., Venugopalan, S., Mooney, R., Darrell, T., & Saenko, K. (2013). Youtube2text: Recognizing and describing arbitrary activities using semantic hierarchies and zero-shot recognition. In International Conference on Computer Vision.
- (2013) International Conference on Computer Vision
- Guadarrama, S.¹ Krishnamoorthy, N.² Malkarnenkar, G.³ Venugopalan, S.⁴ Mooney, R.⁵ Darrell, T.⁶ Saenko, K.⁷

30
- 84868289993
- Choosing linguistics over vision to describe images
- Gupta, A., Verma, Y., & Jawahar, C. V. (2012). Choosing linguistics over vision to describe images. In AAAI Conference on Artificial Intelligence.
- (2012) AAAI Conference on Artificial Intelligence
- Gupta, A.¹ Verma, Y.² Jawahar, C.V.³

31
- 10044285992
- Canonical correlation analysis: An overview with application to learning methods
- Hardoon, D. R., Szedmak, S., & Shawe-Taylor, J. (2004). Canonical correlation analysis: An overview with application to learning methods. Neural Computation, 16 (12), 2639-2664.
- (2004) Neural Computation , vol.16 , Issue.12 , pp. 2639-2664
- Hardoon, D.R.¹ Szedmak, S.² Shawe-Taylor, J.³

32
- 84884918162
- Sentence-based image description with scalable, explicit models
- Hodosh, M., & Hockenmaier, J. (2013). Sentence-based image description with scalable, explicit models. In IEEE Conference on Computer Vision and Pattern Recognition Workshops.
- (2013) IEEE Conference on Computer Vision and Pattern Recognition Workshops
- Hodosh, M.¹ Hockenmaier, J.²

33
- 84883394520
- Framing image description as a ranking task: Data, models and evaluation metrics
- Hodosh, M., Young, P., & Hockenmaier, J. (2013). Framing Image Description as a Ranking Task: Data, Models and Evaluation Metrics. Journal of Artificial Intelligence Research, 47, 853-899.
- (2013) Journal of Artificial Intelligence Research , vol.47 , pp. 853-899
- Hodosh, M.¹ Young, P.² Hockenmaier, J.³

34
- 0000107975
- Relations between two sets of variates
- Hotelling, H. (1936). Relations between two sets of variates. Biometrika, 28 (3-4), 321-377.
- (1936) Biometrika , vol.28 , Issue.3-4 , pp. 321-377
- Hotelling, H.¹

35
- 0033909136
- A conceptual framework for indexing visual information at multiple levels
- Jaimes, A., & Chang, S.-F. (2000). A conceptual framework for indexing visual information at multiple levels. In IS&T/SPIE Internet Imaging.
- (2000) IS&T/SPIE Internet Imaging
- Jaimes, A.¹ Chang, S.-F.²

36
- 84959214146
- Image specificity
- Jas, M., & Parikh, D. (2015). Image specificity. In IEEE Conference on Computer Vision and Pattern Recognition.
- (2015) IEEE Conference on Computer Vision and Pattern Recognition
- Jas, M.¹ Parikh, D.²

37
- 84973917813
- Guiding the long-short term memory model for image caption generation
- Jia, X., Gavves, E., Fernando, B., & Tuytelaars, T. (2015). Guiding the long-short term memory model for image caption generation. In International Conference on Computer Vision.
- (2015) International Conference on Computer Vision
- Jia, X.¹ Gavves, E.² Fernando, B.³ Tuytelaars, T.⁴

38
- 84959233256
- Image retrieval using scene graphs
- Johnson, J., Krishna, R., Stark, M., Li, L.-J., Shamma, D. A., Bernstein, M., & Fei-Fei, L. (2015). Image retrieval using scene graphs. In IEEE Conference on Computer Vision and Pattern Recognition.
- (2015) IEEE Conference on Computer Vision and Pattern Recognition
- Johnson, J.¹ Krishna, R.² Stark, M.³ Li, L.-J.⁴ Shamma, D.A.⁵ Bernstein, M.⁶ Fei-Fei, L.⁷

39
- 84946734827
- Deep visual-semantic alignments for generating image descriptions
- Karpathy, A., & Fei-Fei, L. (2015). Deep visual-semantic alignments for generating image descriptions. In IEEE Conference on Computer Vision and Pattern Recognition.
- (2015) IEEE Conference on Computer Vision and Pattern Recognition
- Karpathy, A.¹ Fei-Fei, L.²

40
- 84937843643
- Deep fragment embeddings for bidirectional image sentence mapping
- Karpathy, A., Joulin, A., & Fei-Fei, L. (2014). Deep Fragment Embeddings for Bidirectional Image Sentence Mapping. In Advances in Neural Information Processing Systems.
- (2014) Advances in Neural Information Processing Systems
- Karpathy, A.¹ Joulin, A.² Fei-Fei, L.³

41
- 84863075153
- Towards coherent natural language description of video streams
- Khan, M. U. G., Zhang, L., & Gotoh, Y. (2011). Towards coherent natural language description of video streams. In International Conference on Computer Vision Workshops.
- (2011) International Conference on Computer Vision Workshops
- Khan, M.U.G.¹ Zhang, L.² Gotoh, Y.³

42
- 84944113729
- Unifying visual-semantic embeddings with multimodal neural language models
- Kiros, R., Salakhutdinov, R., & Zemel, R. S. (2015). Unifying visual-semantic embeddings with multimodal neural language models. In Advances in Neural Information Processing Systems Deep Learning Workshop.
- (2015) Advances in Neural Information Processing Systems Deep Learning Workshop
- Kiros, R.¹ Salakhutdinov, R.² Zemel, R.S.³

43
- 84893398951
- Generating natural-language video descriptions using text-mined knowledge
- Krishnamoorthy, N., Malkarnenkar, G., Mooney, R., Saenko, K., & Guadarrama, S. (2013). Generating Natural-Language Video Descriptions Using Text-Mined Knowledge. In Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies.
- (2013) Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
- Krishnamoorthy, N.¹ Malkarnenkar, G.² Mooney, R.³ Saenko, K.⁴ Guadarrama, S.⁵

44
- 80052901011
- Baby talk: Understanding and generating simple image descriptions
- Kulkarni, G., Premraj, V., Dhar, S., Li, S., Choi, Y., Berg, A. C., & Berg, T. L. (2011). Baby talk: Understanding and generating simple image descriptions. In IEEE Conference on Computer Vision and Pattern Recognition.
- (2011) IEEE Conference on Computer Vision and Pattern Recognition
- Kulkarni, G.¹ Premraj, V.² Dhar, S.³ Li, S.⁴ Choi, Y.⁵ Berg, A.C.⁶ Berg, T.L.⁷

45
- 84878189119
- Collective generation of natural image descriptions
- Kuznetsova, P., Ordonez, V., Berg, A. C., Berg, T. L., & Choi, Y. (2012). Collective Generation of Natural Image Descriptions. In Annual Meeting of the Association for Computational Linguistics.
- (2012) Annual Meeting of the Association for Computational Linguistics
- Kuznetsova, P.¹ Ordonez, V.² Berg, A.C.³ Berg, T.L.⁴ Choi, Y.⁵

46
- 84934873221
- TREETALK: Composition and compression of trees for image descriptions
- Kuznetsova, P., Ordonez, V., Berg, T. L., & Choi, Y. (2014). TREETALK: Composition and compression of trees for image descriptions. In Conference on Empirical Methods in Natural Language Processing.
- (2014) Conference on Empirical Methods in Natural Language Processing
- Kuznetsova, P.¹ Ordonez, V.² Berg, T.L.³ Choi, Y.⁴

47
- 70450172710
- Learning to detect unseen object classes by between-class attribute transfer
- Lampert, C. H., Nickisch, H., & Harmeling, S. (2009). Learning to detect unseen object classes by between-class attribute transfer. In IEEE Conference on Computer Vision and Pattern Recognition.
- (2009) IEEE Conference on Computer Vision and Pattern Recognition
- Lampert, C.H.¹ Nickisch, H.² Harmeling, S.³

48
- 33845572523
- Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories
- Lazebnik, S., Schmid, C., & Ponce, J. (2006). Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In IEEE Conference on Computer Vision and Pattern Recognition.
- (2006) IEEE Conference on Computer Vision and Pattern Recognition
- Lazebnik, S.¹ Schmid, C.² Ponce, J.³

49
- 84970028761
- Phrase-based image captioning
- Lebret, R., Pinheiro, P. O., & Collobert, R. (2015). Phrase-based image captioning. In International Conference on Machine Learning.
- (2015) International Conference on Machine Learning
- Lebret, R.¹ Pinheiro, P.O.² Collobert, R.³

50
- 84862279067
- Composing simple image descriptions using web-scale n-grams
- Li, S., Kulkarni, G., Berg, T. L., Berg, A. C., & Choi, Y. (2011). Composing simple image descriptions using web-scale n-grams. In The SIGNLL Conference on Computational Natural Language Learning.
- (2011) The SIGNLL Conference on Computational Natural Language Learning
- Li, S.¹ Kulkarni, G.² Berg, T.L.³ Berg, A.C.⁴ Choi, Y.⁵

51
- 84877085938
- Learning dependency-based compositional semantics
- Liang, P., Jordan, M. I., & Klein, D. (2012). Learning dependency-based compositional semantics. Computational Linguistics, 39 (2), 389-446.
- (2012) Computational Linguistics , vol.39 , Issue.2 , pp. 389-446
- Liang, P.¹ Jordan, M.I.² Klein, D.³

52
- 29344465396
- Automatic evaluation of summaries using n-gram cooccurrence statistics
- Lin, C.-Y., & Hovy, E. (2008). Automatic evaluation of summaries using n-gram cooccurrence statistics. In Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies.
- (2008) Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
- Lin, C.-Y.¹ Hovy, E.²

53
- 84960173401
- Generating multi-sentence natural language descriptions of indoor scenes
- Lin, D., Fidler, S., Kong, C., & Urtasun, R. (2015). Generating multi-sentence natural language descriptions of indoor scenes. In British Machine Vision Conference.
- (2015) British Machine Vision Conference
- Lin, D.¹ Fidler, S.² Kong, C.³ Urtasun, R.⁴

54
- 84937834115
- Microsoft COCO: Common objects in context
- Lin, T.-Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Dollár, P., & Zitnick, C. L. (2014). Microsoft COCO: Common objects in context. In European Conference on Computer Vision.
- (2014) European Conference on Computer Vision
- Lin, T.-Y.¹ Maire, M.² Belongie, S.³ Hays, J.⁴ Perona, P.⁵ Ramanan, D.⁶ Dollár, P.⁷ Zitnick, C.L.⁸

55
- 3042535216
- Distinctive image features from scale-invariant keypoints
- Lowe, D. (2004). Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 60 (2), 91-110.
- (2004) International Journal of Computer Vision , vol.60 , Issue.2 , pp. 91-110
- Lowe, D.¹

56
- 85007153677
- Learning to answer questions from image using convolutional neural network
- Ma, L., Lu, Z., & Li, H. (2016). Learning to answer questions from image using convolutional neural network. In AAAI Conference on Artificial Intelligence.
- (2016) AAAI Conference on Artificial Intelligence
- Ma, L.¹ Lu, Z.² Li, H.³

57
- 84937822746
- A multi-world approach to question answering about real-world scenes based on uncertain input
- Malinowski, M., & Fritz, M. (2014a). A multi-world approach to question answering about real-world scenes based on uncertain input. In Advances in Neural Information Processing Systems.
- (2014) Advances in Neural Information Processing Systems
- Malinowski, M.¹ Fritz, M.²

58
- 84951975735
- Towards a visual turing challenge
- Malinowski, M., & Fritz, M. (2014b). Towards a visual turing challenge. In Advances in Neural Information Processing Systems Workshop on Learning Semantics.
- (2014) Advances in Neural Information Processing Systems Workshop on Learning Semantics
- Malinowski, M.¹ Fritz, M.²

59
- 84973896625
- Ask your neurons: A neural-based approach to answering questions about images
- Malinowski, M., Rohrbach, M., & Fritz, M. (2015). Ask your neurons: A neural-based approach to answering questions about images. In International Conference on Computer Vision.
- (2015) International Conference on Computer Vision
- Malinowski, M.¹ Rohrbach, M.² Fritz, M.³

60
- 85083950512
- Deep captioning with multimodal recurrent neural networks (m-RNN)
- Mao, J., Xu, W., Yang, Y., Wang, J., & Yuille, A. L. (2015a). Deep captioning with multimodal recurrent neural networks (m-RNN). In International Conference on Learning Representations.
- (2015) International Conference on Learning Representations
- Mao, J.¹ Xu, W.² Yang, Y.³ Wang, J.⁴ Yuille, A.L.⁵

61
- 84973863256
- Learning like a child: Fast novel visual concept learning from sentence descriptions of images
- Mao, J., Wei, X., Yang, Y., Wang, J., Huang, Z., & Yuille, A. L. (2015b). Learning like a child: Fast novel visual concept learning from sentence descriptions of images. In International Conference on Computer Vision.
- (2015) International Conference on Computer Vision
- Mao, J.¹ Wei, X.² Yang, Y.³ Wang, J.⁴ Huang, Z.⁵ Yuille, A.L.⁶

62
- 84906925144
- Nonparametric method for data-driven image captioning
- Mason, R., & Charniak, E. (2014). Nonparametric Method for Data-driven Image Captioning. In Annual Meeting of the Association for Computational Linguistics.
- (2014) Annual Meeting of the Association for Computational Linguistics
- Mason, R.¹ Charniak, E.²

63
- 70449657723
- Financial incentives and the "performance of crowds"
- Mason, W. A., & Watts, D. J. (2009). Financial incentives and the "performance of crowds". In ACM SIGKDD Workshop on Human Computation.
- (2009) ACM SIGKDD Workshop on Human Computation
- Mason, W.A.¹ Watts, D.J.²

64
- 85034832841
- Midge: Generating image descriptions from computer vision detections
- Mitchell, M., Han, X., Dodge, J., Mensch, A., Goyal, A., Berg, A. C., Yamaguchi, K., Berg, T. L., Stratos, K., & Daumé III, H. (2012). Midge: Generating image descriptions from computer vision detections. In Conference of the European Chapter of the Association for Computational Linguistics.
- (2012) Conference of the European Chapter of the Association for Computational Linguistics
- Mitchell, M.¹ Han, X.² Dodge, J.³ Mensch, A.⁴ Goyal, A.⁵ Berg, A.C.⁶ Yamaguchi, K.⁷ Berg, T.L.⁸ Stratos, K.⁹ Daumé III, H.¹⁰

65
- 34547462915
- The impact of frequency on summarization
- Nenkova, A., & Vanderwende, L. (2005). The impact of frequency on summarization. Tech. rep., Microsoft Research.
- (2005) The Impact of Frequency on Summarization
- Nenkova, A.¹ Vanderwende, L.²

66
- 0035328421
- Modeling the shape of the scene: A holistic representation of the spatial envelope
- Oliva, A., & Torralba, A. (2001). Modeling the shape of the scene: A holistic representation of the spatial envelope. International Journal of Computer Vision, 42 (3), 145-175.
- (2001) International Journal of Computer Vision , vol.42 , Issue.3 , pp. 145-175
- Oliva, A.¹ Torralba, A.²

67
- 85162522202
- Im2text: Describing images using 1 million captioned photographs
- Ordonez, V., Kulkarni, G., & Berg, T. L. (2011). Im2text: Describing images using 1 million captioned photographs. In Advances in Neural Information Processing Systems.
- (2011) Advances in Neural Information Processing Systems
- Ordonez, V.¹ Kulkarni, G.² Berg, T.L.³

68
- 84960194772
- Learning to interpret and describe abstract scenes
- Ortiz, L. M. G., Wolff, C., & Lapata, M. (2015). Learning to Interpret and Describe Abstract Scenes. In Conference of the North American Chapter of the Association for Computational Linguistics.
- (2015) Conference of the North American Chapter of the Association for Computational Linguistics
- Ortiz, L.M.G.¹ Wolff, C.² Lapata, M.³

69
- 0003591791
- Studies in Iconology
- Panofsky, E. (1939). Studies in Iconology. Oxford University Press.
- (1939) Studies in Iconology
- Panofsky, E.¹

70
- 85133336275
- BLEU: A method for automatic evaluation of machine translation
- Papineni, K., Roukos, S., Ward, T., & Zhu, W.-J. (2002). BLEU: A method for automatic evaluation of machine translation. In Annual Meeting of the Association for Computational Linguistics.
- (2002) Annual Meeting of the Association for Computational Linguistics
- Papineni, K.¹ Roukos, S.² Ward, T.³ Zhu, W.-J.⁴

71
- 84856670612
- Relative attributes
- Parikh, D., & Grauman, K. (2011). Relative attributes. In International Conference on Computer Vision.
- (2011) International Conference on Computer Vision
- Parikh, D.¹ Grauman, K.²

72
- 84965149840
- Expressing an image stream with a sequence of natural sentences
- Park, C., & Kim, G. (2015). Expressing an image stream with a sequence of natural sentences. In Advances in Neural Information Processing Systems.
- (2015) Advances in Neural Information Processing Systems
- Park, C.¹ Kim, G.²

73
- 84900870389
- The SUN attribute database: Beyond categories for deeper scene understanding
- Patterson, G., Xu, C., Su, H., & Hays, J. (2014). The SUN Attribute Database: Beyond Categories for Deeper Scene Understanding. International Journal of Computer Vision, 108 (1-2), 59-81.
- (2014) International Journal of Computer Vision , vol.108 , Issue.1-2 , pp. 59-81
- Patterson, G.¹ Xu, C.² Su, H.³ Hays, J.⁴

74
- 85083952381
- Simple image description generator via a linear phrase-based model
- Pinheiro, P., Lebret, R., & Collobert, R. (2015). Simple image description generator via a linear phrase-based model. In International Conference on Learning Representations Workshop.
- (2015) International Conference on Learning Representations Workshop
- Pinheiro, P.¹ Lebret, R.² Collobert, R.³

75
- 84856142160
- Weakly supervised learning of interactions between humans and objects
- Prest, A., Schmid, C., & Ferrari, V. (2012). Weakly supervised learning of interactions between humans and objects. IEEE Transactions on Pattern Analysis and Machine Intelligence, 34 (3), 601-614.
- (2012) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.34 , Issue.3 , pp. 601-614
- Prest, A.¹ Schmid, C.² Ferrari, V.³

76
- 85090348677
- Collecting image annotations using amazon's mechanical turk
- Rashtchian, C., Young, P., Hodosh, M., & Hockenmaier, J. (2010). Collecting image annotations using amazon's mechanical turk. In North American Chapter of the Association for Computational Linguistics: Human Language Technologies Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk.
- (2010) North American Chapter of the Association for Computational Linguistics: Human Language Technologies Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk
- Rashtchian, C.¹ Young, P.² Hodosh, M.³ Hockenmaier, J.⁴

77
- 71749094730
- An investigation into the validity of some metrics for automatically evaluating natural language generation systems
- Reiter, E., & Belz, A. (2009). An investigation into the validity of some metrics for automatically evaluating natural language generation systems. Computational Linguistics, 35 (4), 529-558.
- (2009) Computational Linguistics , vol.35 , Issue.4 , pp. 529-558
- Reiter, E.¹ Belz, A.²

78
- 0003612827
- Building Natural Language Generation Systems
- Reiter, E., & Dale, R. (2006). Building Natural Language Generation Systems. Cambridge University Press.
- (2006) Building Natural Language Generation Systems
- Reiter, E.¹ Dale, R.²

79
- 84962816362
- Image question answering: A visual semantic embedding model and a new dataset
- Ren, M., Kiros, R., & Zemel, R. (2015). Image question answering: A visual semantic embedding model and a new dataset. In International Conference on Machine Learning Deep Learning Workshop.
- (2015) International Conference on Machine Learning Deep Learning Workshop
- Ren, M.¹ Kiros, R.² Zemel, R.³

80
- 84926345282
- MCTest: A challenge dataset for the open-domain machine comprehension of text
- Richardson, M., Burges, C. J., & Renshaw, E. (2013). MCTest: A challenge dataset for the open-domain machine comprehension of text. In Conference on Empirical Methods in Natural Language Processing.
- (2013) Conference on Empirical Methods in Natural Language Processing
- Richardson, M.¹ Burges, C.J.² Renshaw, E.³

81
- 84959211977
- A dataset for movie description
- Rohrbach, A., Rohrbach, M., Tandon, N., & Schiele, B. (2015). A dataset for movie description. In International Conference on Computer Vision.
- (2015) International Conference on Computer Vision
- Rohrbach, A.¹ Rohrbach, M.² Tandon, N.³ Schiele, B.⁴

82
- 84898775239
- Translating video content to natural language descriptions
- Rohrbach, M., Qiu, W., Titov, I., Thater, S., Pinkal, M., & Schiele, B. (2013). Translating Video Content to Natural Language Descriptions. In International Conference on Computer Vision.
- (2013) International Conference on Computer Vision
- Rohrbach, M.¹ Qiu, W.² Titov, I.³ Thater, S.⁴ Pinkal, M.⁵ Schiele, B.⁶

83
- 85123605149
- Generating semantically precise scene graphs from textual descriptions for improved image retrieval
- Schuster, S., Krishna, R., Chang, A., Fei-Fei, L., & Manning, C. D. (2015). Generating semantically precise scene graphs from textual descriptions for improved image retrieval. In Conference on Empirical Methods in Natural Language Processing Vision and Language Workshop.
- (2015) Conference on Empirical Methods in Natural Language Processing Vision and Language Workshop
- Schuster, S.¹ Krishna, R.² Chang, A.³ Fei-Fei, L.⁴ Manning, C.D.⁵

84
- 84952235015
- Analyzing the subject of a picture: A theoretical approach
- Shatford, S. (1986). Analyzing the subject of a picture: A theoretical approach. Cataloging & Classification Quarterly, 6, 39-62.
- (1986) Cataloging & Classification Quarterly , vol.6 , pp. 39-62
- Shatford, S.¹

85
- 84881536861
- Indoor segmentation and support inference from RGBD images
- Silberman, N., Kohli, P., Hoiem, D., & Fergus, R. (2012). Indoor segmentation and support inference from RGBD images. In European Conference on Computer Vision.
- (2012) European Conference on Computer Vision
- Silberman, N.¹ Kohli, P.² Hoiem, D.³ Fergus, R.⁴

86
- 77955998009
- Connecting modalities: Semi-supervised segmentation and annotation of images using unaligned text corpora
- Socher, R., & Fei-Fei, L. (2010). Connecting modalities: Semi-supervised segmentation and annotation of images using unaligned text corpora. In IEEE Conference on Computer Vision and Pattern Recognition.
- (2010) IEEE Conference on Computer Vision and Pattern Recognition
- Socher, R.¹ Fei-Fei, L.²

87
- 84906925854
- Grounded compositional semantics for finding and describing images with sentences
- Socher, R., Karpathy, A., Le, Q. V., Manning, C. D., & Ng, A. (2014). Grounded Compositional Semantics for Finding and Describing Images with Sentences. Transactions of the Association for Computational Linguistics, 2, 207-218.
- (2014) Transactions of the Association for Computational Linguistics , vol.2 , pp. 207-218
- Socher, R.¹ Karpathy, A.² Le, Q.V.³ Manning, C.D.⁴ Ng, A.⁵

88
- 84973888835
- Automatic concept discovery from parallel text and visual corpora
- Sun, C., Gan, C., & Nevatia, R. (2015). Automatic concept discovery from parallel text and visual corpora. In International Conference on Computer Vision.
- (2015) International Conference on Computer Vision
- Sun, C.¹ Gan, C.² Nevatia, R.³

89
- 84959932469
- Integrating language and vision to generate natural language descriptions of videos in the wild
- Thomason, J., Venugopalan, S., Guadarrama, S., Saenko, K., & Mooney, R. (2014). Integrating Language and Vision to Generate Natural Language Descriptions of Videos in the Wild. In International Conference on Computational Linguistics.
- (2014) International Conference on Computational Linguistics
- Thomason, J.¹ Venugopalan, S.² Guadarrama, S.³ Saenko, K.⁴ Mooney, R.⁵

90
- 54749092170
- 80 million tiny images: A large data set for nonparametric object and scene recognition
- Torralba, A., Fergus, R., & Freeman, W. T. (2008). 80 million tiny images: A large data set for nonparametric object and scene recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 30 (11), 1958-1970.
- (2008) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.30 , Issue.11 , pp. 1958-1970
- Torralba, A.¹ Fergus, R.² Freeman, W.T.³

91
- 84973861187
- Common subspace for model and similarity: Phrase learning for caption generation from images
- Ushiku, Y., Yamaguchi, M., Mukuta, Y., & Harada, T. (2015). Common subspace for model and similarity: Phrase learning for caption generation from images. In International Conference on Computer Vision.
- (2015) International Conference on Computer Vision
- Ushiku, Y.¹ Yamaguchi, M.² Mukuta, Y.³ Harada, T.⁴

92
- 84956980995
- Cider: Consensus-based image description evaluation
- Vedantam, R., Zitnick, C. L., & Parikh, D. (2015). Cider: Consensus-based image description evaluation. In IEEE Conference on Computer Vision and Pattern Recognition.
- (2015) IEEE Conference on Computer Vision and Pattern Recognition
- Vedantam, R.¹ Zitnick, C.L.² Parikh, D.³

93
- 85088059797
- Im2Text and Text2Im: Associating images and texts for cross-modal retrieval
- Verma, Y., & Jawahar, C. V. (2014). Im2Text and Text2Im: Associating Images and Texts for Cross-Modal Retrieval. In British Machine Vision Conference.
- (2014) British Machine Vision Conference
- Verma, Y.¹ Jawahar, C.V.²

94
- 84946747440
- Show and tell: A neural image caption generator
- Vinyals, O., Toshev, A., Bengio, S., & Erhan, D. (2015). Show and tell: A neural image caption generator. In IEEE Conference on Computer Vision and Pattern Recognition.
- (2015) IEEE Conference on Computer Vision and Pattern Recognition
- Vinyals, O.¹ Toshev, A.² Bengio, S.³ Erhan, D.⁴

95
- 84970002232
- Show, attend and tell: Neural image caption generation with visual attention
- Xu, K., Ba, J., Kiros, R., Cho, K., Courville, A., Salakhutdinov, R., Zemel, R., & Bengio, Y. (2015). Show, attend and tell: Neural image caption generation with visual attention. In International Conference on Machine Learning.
- (2015) International Conference on Machine Learning
- Xu, K.¹ Ba, J.² Kiros, R.³ Cho, K.⁴ Courville, A.⁵ Salakhutdinov, R.⁶ Zemel, R.⁷ Bengio, Y.⁸

96
- 84944068309
- A distributed representation based query expansion approach for image captioning
- Yagcioglu, S., Erdem, E., Erdem, A., & Cakici, R. (2015). A Distributed Representation Based Query Expansion Approach for Image Captioning. In Annual Meeting of the Association for Computational Linguistics.
- (2015) Annual Meeting of the Association for Computational Linguistics
- Yagcioglu, S.¹ Erdem, E.² Erdem, A.³ Cakici, R.⁴

97
- 80053258778
- Corpus-guided sentence generation of natural images
- Yang, Y., Teo, C. L., Daume, III, H., & Aloimonos, Y. (2011). Corpus-guided sentence generation of natural images. In Conference on Empirical Methods in Natural Language Processing.
- (2011) Conference on Empirical Methods in Natural Language Processing
- Yang, Y.¹ Teo, C.L.² Daume, H.³ Aloimonos, Y.⁴

98
- 77955987964
- Grouplet: A structured image representation for recognizing human and object interactions
- Yao, B., & Fei-Fei, L. (2010). Grouplet: A structured image representation for recognizing human and object interactions. In IEEE Conference on Computer Vision and Pattern Recognition.
- (2010) IEEE Conference on Computer Vision and Pattern Recognition
- Yao, B.¹ Fei-Fei, L.²

99
- 84973884896
- Describing videos by exploiting temporal structure
- Yao, L., Torabi, A., Cho, K., Ballas, N., Pal, C., Larochelle, H., & Courville, A. (2015). Describing videos by exploiting temporal structure. In International Conference on Computer Vision.
- (2015) International Conference on Computer Vision
- Yao, L.¹ Torabi, A.² Cho, K.³ Ballas, N.⁴ Pal, C.⁵ Larochelle, H.⁶ Courville, A.⁷

100
- 85026937926
- See no evil, say no evil: Description generation from densely labeled images
- Yatskar, M., Galley, M., Vanderwende, L., & Zettlemoyer, L. (2014). See No Evil, Say No Evil: Description Generation from Densely Labeled Images. In Joint Conference on Lexical and Computational Semantics.
- (2014) Joint Conference on Lexical and Computational Semantics
- Yatskar, M.¹ Galley, M.² Vanderwende, L.³ Zettlemoyer, L.⁴

101
- 84906494296
- From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions
- Young, P., Lai, A., Hodosh, M., & Hockenmaier, J. (2014). From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions. Transactions of the Association for Computational Linguistics, 2, 67-78.
- (2014) Transactions of the Association for Computational Linguistics , vol.2 , pp. 67-78
- Young, P.¹ Lai, A.² Hodosh, M.³ Hockenmaier, J.⁴

102
- 84973892583
- Visual madlibs: Fill in the blank description generation and question answering
- Yu, L., Park, E., Berg, A. C., & Berg, T. L. (2015). Visual madlibs: Fill in the blank description generation and question answering. In International Conference on Computer Vision.
- (2015) International Conference on Computer Vision
- Yu, L.¹ Park, E.² Berg, A.C.³ Berg, T.L.⁴

103
- 84973911532
- Aligning books and movies: Towards story-like visual explanations by watching movies and reading books
- Zhu, Y., Kiros, R., Zemel, R., Salakhutdinov, R., Urtasun, R., Torralba, A., & Fidler, S. (2015). Aligning books and movies: Towards story-like visual explanations by watching movies and reading books. In International Conference on Computer Vision.
- (2015) International Conference on Computer Vision
- Zhu, Y.¹ Kiros, R.² Zemel, R.³ Salakhutdinov, R.⁴ Urtasun, R.⁵ Torralba, A.⁶ Fidler, S.⁷

104
- 84898772194
- Learning the visual interpretation of sentences
- Zitnick, C. L., Parikh, D., & Vanderwende, L. (2013). Learning the visual interpretation of sentences. In International Conference on Computer Vision.
- (2013) International Conference on Computer Vision
- Zitnick, C.L.¹ Parikh, D.² Vanderwende, L.³

105
- 84887338442
- Bringing semantics into focus using visual abstraction
- Zitnick, C. L., & Parikh, D. (2013). Bringing semantics into focus using visual abstraction. In IEEE Conference on Computer Vision and Pattern Recognition.
- (2013) IEEE Conference on Computer Vision and Pattern Recognition
- Zitnick, C.L.¹ Parikh, D.²

* This information was extracted by KISTI through analysis of Elsevier's SCOPUS DB.