SCOPUS 정보 검색 플랫폼

Proceedings of the 3rd Joint Conference on Lexical and Computational Semantics, *SEM 2014

Volumn , Issue , 2014, Pages 110-120

See no evil, say no evil: Description generation from densely labeled images

(4) Yatskar, Mark a Galley, Michel b Vanderwende, Lucy b Zettlemoyer, Luke a

a UNIVERSITY OF WASHINGTON (United States)

b MICROSOFT RESEARCH (United States)

Author keywords

[No Author keywords available]

Indexed keywords

SEMANTICS;

COMPREHENSIVE MODEL; HIGH QUALITY; HUMAN ANNOTATIONS; HUMAN LIKE; LABELED IMAGES; LANGUAGE GENERATION; SPATIAL LOCATION; VISUAL INFORMATION;

VISUAL LANGUAGES;

EID: 85026937926 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.3115/v1/s14-1015 Document Type: Conference Paper

Times cited : (44)

References (37)

1
- 80053251990
- A simple domain-independent probabilistic approach to generation
- Gabor Angeli, Percy Liang, and Dan Klein. 2010. A simple domain-independent probabilistic approach to generation. In Empirical Methods in Natural Language Processing (EMNLP).
- (2010) Empirical Methods in Natural Language Processing (EMNLP
- Angeli, G.¹ Liang, P.² Klein, D.³

2
- 80053402398
- Fast, cheap, and creative: Evaluating translation quality using Amazon's Mechanical Turk
- August
- Chris Callison-Burch. 2009. Fast, cheap, and creative: Evaluating translation quality using Amazon's Mechanical Turk. In EMNLP, pages 286-295, August.
- (2009) EMNLP , pp. 286-295
- Callison-Burch, C.¹

3
- 77952710493
- Training a multilingual sportscaster: Using perceptual context to learn language
- David L. Chen, Joohyun Kim, and Raymond J. Mooney. 2010. Training a multilingual sportscaster: Using perceptual context to learn language. JAIR, 37:397-435.
- (2010) JAIR , vol.37 , pp. 397-435
- Chen, D.L.¹ Kim, J.² Mooney, R.J.³

4
- 85037338954
- Generating typed dependency parses from phrase structure parses
- Marie-Catherine de Marneffe, Bill MacCartney, Christopher D Manning, et al. 2006. Generating typed dependency parses from phrase structure parses. In LREC, volume 6, pages 449-454.
- (2006) LREC , vol.6 , pp. 449-454
- De Marneffe, M.-C.¹ MacCartney, B.² Manning, C.D.³

5
- 84906929591
- Image Description using Visual Dependency Representations
- Desmond Elliott and Frank Keller. 2013. Image Description using Visual Dependency Representations. In EMNLP.
- (2013) EMNLP
- Elliott, D.¹ Keller, F.²

6
- 70450207704
- Describing objects by their attributes
- IEEE
- Ali Farhadi, Ian Endres, Derek Hoiem, and David Forsyth. 2009. Describing objects by their attributes. In Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on, pages 1778-1785. IEEE.
- (2009) Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference On , pp. 1778-1785
- Farhadi, A.¹ Endres, I.² Hoiem, D.³ Forsyth, D.⁴

7
- 78149311145
- Every picture tells a story: Generating sentences from images
- Ali Farhadi, Mohsen Hejrati, Mohammad Amin Sadeghi, Peter Young, Cyrus Rashtchian, Julia Hockenmaier, and David Forsyth. 2010. Every picture tells a story: Generating sentences from images. In Proceedings of the 11th European conference on Computer Vision, ECCV'10, pages 15-29.
- (2010) Proceedings of the 11th European Conference On Computer Vision, ECCV'10 , pp. 15-29
- Farhadi, A.¹ Hejrati, M.² Sadeghi, M.A.³ Young, P.⁴ Rashtchian, C.⁵ Hockenmaier, J.⁶ Forsyth, D.⁷

8
- 77955422240
- Object detection with discriminatively trained part-based models
- Pedro F Felzenszwalb, Ross B Girshick, David McAllester, and Deva Ramanan. 2010. Object detection with discriminatively trained part-based models. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 32(9):1627-1645.
- (2010) Pattern Analysis and Machine Intelligence, IEEE Transactions On , vol.32 , Issue.9 , pp. 1627-1645
- Felzenszwalb, P.F.¹ Girshick, R.B.² McAllester, D.³ Ramanan, D.⁴

9
- 80052878949
- How many words is a picture worth? Automatic caption generation for news images
- Yansong Feng and Mirella Lapata. 2010. How many words is a picture worth? Automatic caption generation for news images. In ACL, pages 1239-1249.
- (2010) ACL , pp. 1239-1249
- Feng, Y.¹ Lapata, M.²

10
- 84908171707
- Learning distributions over logical forms for referring expression generation
- Nicholas FitzGerald, Yoav Artzi, and Luke Zettlemoyer. 2013. Learning distributions over logical forms for referring expression generation. In EMNLP.
- (2013) EMNLP
- Gerald, N.F.¹ Artzi, Y.² Zettlemoyer, L.³

11
- 84869018122
- From image annotation to image description
- Ankush Gupta and Prashanth Mannem. 2012. From image annotation to image description. In NIPS, volume 7667, pages 196-204.
- (2012) NIPS , vol.7667 , pp. 196-204
- Gupta, A.¹ Mannem, P.²

12
- 84878188505
- Conceptto-text generation via discriminative reranking
- Ioannis Konstas and Mirella Lapata. 2012. Conceptto-text generation via discriminative reranking. In ACL, pages 369-378.
- (2012) ACL , pp. 369-378
- Konstas, I.¹ Lapata, M.²

13
- 84856184938
- Computational generation of referring expressions: A survey
- Emiel Krahmer and Kees Van Deemter. 2012. Computational generation of referring expressions: A survey. Computational Linguistics, 38(1):173-218.
- (2012) Computational Linguistics , vol.38 , Issue.1 , pp. 173-218
- Krahmer, E.¹ Van Deemter, K.²

14
- 84893398951
- Generating natural-language video descriptions using text-mined knowledge
- Niveda Krishnamoorthy, Girish Malkarnenkar, Raymond Mooney, Kate Saenko, and Sergio Guadarrama. 2013. Generating natural-language video descriptions using text-mined knowledge. Procedings of AAAI, 2013(2):3.
- (2013) Procedings of AAAI , vol.2013 , Issue.2 , pp. 3
- Krishnamoorthy, N.¹ Malkarnenkar, G.² Mooney, R.³ Saenko, K.⁴ Guadarrama, S.⁵

15
- 80052901011
- Baby talk: Understanding and generating simple image descriptions
- G. Kulkarni, V. Premraj, S. Dhar, S. Li, Y. Choi, A.C. Berg, and T.L. Berg. 2011. Baby talk: Understanding and generating simple image descriptions. In Computer Vision and Pattern Recognition (CVPR), pages 1601-1608.
- (2011) Computer Vision and Pattern Recognition (CVPR) , pp. 1601-1608
- Kulkarni, G.¹ Premraj, V.² Dhar, S.³ Li, S.⁴ Choi, Y.⁵ Berg, A.C.⁶ Berg, T.L.⁷

16
- 84878189119
- Collective generation of natural image descriptions
- Polina Kuznetsova, Vicente Ordonez, Alexander C Berg, Tamara L Berg, and Yejin Choi. 2012. Collective generation of natural image descriptions. In ACL, pages 359-368.
- (2012) ACL , pp. 359-368
- Kuznetsova, P.¹ Ordonez, V.² Berg, A.C.³ Berg, T.L.⁴ Choi, Y.⁵

17
- 50649103674
- What, where and who? Classifying events by scene and object recognition
- IEEE
- Li-Jia Li and Li Fei-Fei. 2007. What, where and who? Classifying events by scene and object recognition. In ICCV, pages 1-8. IEEE.
- (2007) ICCV , pp. 1-8
- Li, L.-J.¹ Fei-Fei, L.²

18
- 33646393800
- Semantic feature production norms for a large set of living and nonliving things
- Ken McRae, George S. Cree, Mark S. Seidenberg, and Chris Mcnorgan. 2005. Semantic feature production norms for a large set of living and nonliving things. Behavior Research Methods, 37(4):547-559.
- (2005) Behavior Research Methods , vol.37 , Issue.4 , pp. 547-559
- McRae, K.¹ Cree, G.S.² Seidenberg, M.S.³ Mcnorgan, C.⁴

19
- 84976702763
- WordNet: A lexical database for english
- George A Miller. 1995. WordNet: a lexical database for english. Communications of the ACM, 38(11):39-41.
- (1995) Communications of the ACM , vol.38 , Issue.11 , pp. 39-41
- Miller, G.A.¹

20
- 85034832841
- Midge: Generating image descriptions from computer vision detections
- Margaret Mitchell, Xufeng Han, Jesse Dodge, Alyssa Mensch, Amit Goyal, Alex Berg, Kota Yamaguchi, Tamara Berg, Karl Stratos, and Hal Daumé, III. 2012. Midge: Generating image descriptions from computer vision detections. In EACL, pages 747-756.
- (2012) EACL , pp. 747-756
- Mitchell, M.¹ Han, X.² Dodge, J.³ Mensch, A.⁴ Goyal, A.⁵ Berg, A.⁶ Yamaguchi, K.⁷ Berg, T.⁸ Stratos, K.⁹ Daumé, H.¹⁰

21
- 84908171705
- Generating expressions that refer to visible objects
- Margaret Mitchell, Kees van Deemter, and Ehud Reiter. 2013. Generating expressions that refer to visible objects. In Proceedings of NAACL-HLT, pages 1174-1184.
- (2013) Proceedings of NAACL-HLT , pp. 1174-1184
- Mitchell, M.¹ Van Deemter, K.² Reiter, E.³

22
- 84882967809
- Improved alignment models for statistical machine translation
- F. Och, C. Tillmann, and H. Ney. 1999. Improved alignment models for statistical machine translation. In Proc. of the Joint Conf. of Empirical Methods in Natural Language Processing and Very Large Corpora, pages 20-28.
- (1999) Proc. of the Joint Conf. of Empirical Methods in Natural Language Processing and Very Large Corpora , pp. 20-28
- Och, F.¹ Tillmann, C.² Ney, H.³

23
- 84944098666
- Minimum error rate training in statistical machine translation
- Franz Josef Och. 2003. Minimum error rate training in statistical machine translation. In ACL, pages 160-167.
- (2003) ACL , pp. 160-167
- Och, F.J.¹

24
- 85162522202
- Im2Text: Describing images using 1 million captioned photographs
- Vicente Ordonez, Girish Kulkarni, and Tamara L. Berg. 2011. Im2Text: Describing images using 1 million captioned photographs. In NIPS, pages 1143-1151.
- (2011) NIPS , pp. 1143-1151
- Ordonez, V.¹ Kulkarni, G.² Berg, T.L.³

25
- 0013363097
- BLEU: A method for automatic evaluation of machine translation
- Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2001. BLEU: a method for automatic evaluation of machine translation. In Proc. of ACL.
- (2001) Proc. of ACL
- Papineni, K.¹ Roukos, S.² Ward, T.³ Zhu, W.-J.⁴

26
- 85090348677
- Collecting image annotations using amazon's mechanical turk
- C. Rashtchian, P. Young, M. Hodosh, and J. Hockenmaier. 2010. Collecting image annotations using Amazon's Mechanical Turk. In Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, pages 139-147.
- (2010) Proceedings of the NAACL HLT 2010 Workshop On Creating Speech and Language Data with Amazon's Mechanical Turk , pp. 139-147
- Rashtchian, C.¹ Young, P.² Hodosh, M.³ Hockenmaier, J.⁴

27
- 84887383068
- Expanded parts model for human attribute and action recognition in still images
- Gaurav Sharma, Frédéric Jurie, Cordelia Schmid, et al. 2013. Expanded parts model for human attribute and action recognition in still images. In CVPR.
- (2013) CVPR
- Sharma, G.¹ Jurie, F.² Schmid, C.³

28
- 84883376937
- Grounded models of semantic representation
- July
- Carina Silberer and Mirella Lapata. 2012. Grounded models of semantic representation. In EMNLP, July.
- (2012) EMNLP
- Silberer, C.¹ Lapata, M.²

29
- 84906927181
- Models of semantic representation with visual attributes
- Carina Silberer, Vittorio Ferrari, and Mirella Lapata. 2013. Models of semantic representation with visual attributes. In ACL, pages 572-582.
- (2013) ACL , pp. 572-582
- Silberer, C.¹ Ferrari, V.² Lapata, M.³

30
- 77954862606
- LabelMe: Online image annotation and applications
- Antonio Torralba, Bryan C Russell, and Jenny Yuen. 2010. LabelMe: Online image annotation and applications. Proceedings of the IEEE, 98(8):1467-1484.
- (2010) Proceedings of the IEEE , vol.98 , Issue.8 , pp. 1467-1484
- Torralba, A.¹ Russell, B.C.² Yuen, J.³

31
- 78751648503
- A survey of vision-based methods for action representation, segmentation and recognition
- Daniel Weinland, Remi Ronfard, and Edmond Boyer. 2011. A survey of vision-based methods for action representation, segmentation and recognition. Computer Vision and Image Understanding, 115(2):224-241.
- (2011) Computer Vision and Image Understanding , vol.115 , Issue.2 , pp. 224-241
- Weinland, D.¹ Ronfard, R.² Boyer, E.³

32
- 80053258778
- Corpus-guided sentence generation of natural images
- Yezhou Yang, Ching Lik Teo, Hal Daumé III, and Yiannis Aloimonos. 2011. Corpus-guided sentence generation of natural images. In Empirical Methods in Natural Language Processing.
- (2011) Empirical Methods in Natural Language Processing
- Yang, Y.¹ Teo, C.L.² Daumé, H.³ Aloimonos, Y.⁴

33
- 84856672971
- Action recognition by learning bases of action attributes and parts
- Barcelona, Spain, November
- Bangpeng Yao, Xiaoye Jiang, Aditya Khosla, Andy Lai Lin, Leonidas J. Guibas, and Li Fei-Fei. 2011. Action recognition by learning bases of action attributes and parts. In ICCV, Barcelona, Spain, November.
- (2011) ICCV
- Yao, B.¹ Jiang, X.² Khosla, A.³ Lin, A.L.⁴ Guibas, L.J.⁵ Fei-Fei, L.⁶

34
- 84897743886
- Grounded language learning from video described with sentences
- Haonan Yu and Jeffrey Mark Siskind. 2013. Grounded language learning from video described with sentences. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, volume 1, pages 53-63.
- (2013) Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics , vol.1 , pp. 53-63
- Yu, H.¹ Siskind, J.M.²

35
- 61349199906
- Animacy encoding in English: Why and how
- Annie Zaenen, Jean Carletta, Gregory Garretson, Joan Bresnan, Andrew Koontz-Garboden, Tatiana Nikitina, M Catherine O'Connor, and Tom Wasow. 2004. Animacy encoding in English: why and how. In ACL Workshop on Discourse Annotation, pages 118-125.
- (2004) ACL Workshop On Discourse Annotation , pp. 118-125
- Zaenen, A.¹ Carletta, J.² Garretson, G.³ Bresnan, J.⁴ Koontz-Garboden, A.⁵ Nikitina, T.⁶ O'Connor, M.C.⁷ Wasow, T.⁸

36
- 85050418554
- Z-MERT: A fully configurable open source tool for minimum error rate training of machine translation systems
- Omar F. Zaidan. 2009. Z-MERT: A fully configurable open source tool for minimum error rate training of machine translation systems. The Prague Bulletin of Mathematical Linguistics, 91:79-88.
- (2009) The Prague Bulletin of Mathematical Linguistics , vol.91 , pp. 79-88
- Zaidan, O.F.¹

37
- 84887338442
- Bringing semantics into focus using visual abstraction
- C. Lawrence Zitnick and Devi Parikh. 2013. Bringing semantics into focus using visual abstraction. In CVPR.
- (2013) CVPR
- Zitnick, C.L.¹ Parikh, D.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.