-
2
-
-
80053402398
-
Fast, cheap, and creative: Evaluating translation quality using Amazon's Mechanical Turk
-
August
-
Chris Callison-Burch. 2009. Fast, cheap, and creative: Evaluating translation quality using Amazon's Mechanical Turk. In EMNLP, pages 286-295, August.
-
(2009)
EMNLP
, pp. 286-295
-
-
Callison-Burch, C.1
-
3
-
-
77952710493
-
Training a multilingual sportscaster: Using perceptual context to learn language
-
David L. Chen, Joohyun Kim, and Raymond J. Mooney. 2010. Training a multilingual sportscaster: Using perceptual context to learn language. JAIR, 37:397-435.
-
(2010)
JAIR
, vol.37
, pp. 397-435
-
-
Chen, D.L.1
Kim, J.2
Mooney, R.J.3
-
4
-
-
85037338954
-
Generating typed dependency parses from phrase structure parses
-
Marie-Catherine de Marneffe, Bill MacCartney, Christopher D Manning, et al. 2006. Generating typed dependency parses from phrase structure parses. In LREC, volume 6, pages 449-454.
-
(2006)
LREC
, vol.6
, pp. 449-454
-
-
De Marneffe, M.-C.1
MacCartney, B.2
Manning, C.D.3
-
5
-
-
84906929591
-
Image Description using Visual Dependency Representations
-
Desmond Elliott and Frank Keller. 2013. Image Description using Visual Dependency Representations. In EMNLP.
-
(2013)
EMNLP
-
-
Elliott, D.1
Keller, F.2
-
6
-
-
70450207704
-
Describing objects by their attributes
-
IEEE
-
Ali Farhadi, Ian Endres, Derek Hoiem, and David Forsyth. 2009. Describing objects by their attributes. In Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on, pages 1778-1785. IEEE.
-
(2009)
Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference On
, pp. 1778-1785
-
-
Farhadi, A.1
Endres, I.2
Hoiem, D.3
Forsyth, D.4
-
7
-
-
78149311145
-
Every picture tells a story: Generating sentences from images
-
Ali Farhadi, Mohsen Hejrati, Mohammad Amin Sadeghi, Peter Young, Cyrus Rashtchian, Julia Hockenmaier, and David Forsyth. 2010. Every picture tells a story: Generating sentences from images. In Proceedings of the 11th European conference on Computer Vision, ECCV'10, pages 15-29.
-
(2010)
Proceedings of the 11th European Conference On Computer Vision, ECCV'10
, pp. 15-29
-
-
Farhadi, A.1
Hejrati, M.2
Sadeghi, M.A.3
Young, P.4
Rashtchian, C.5
Hockenmaier, J.6
Forsyth, D.7
-
8
-
-
77955422240
-
Object detection with discriminatively trained part-based models
-
Pedro F Felzenszwalb, Ross B Girshick, David McAllester, and Deva Ramanan. 2010. Object detection with discriminatively trained part-based models. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 32(9):1627-1645.
-
(2010)
Pattern Analysis and Machine Intelligence, IEEE Transactions On
, vol.32
, Issue.9
, pp. 1627-1645
-
-
Felzenszwalb, P.F.1
Girshick, R.B.2
McAllester, D.3
Ramanan, D.4
-
9
-
-
80052878949
-
How many words is a picture worth? Automatic caption generation for news images
-
Yansong Feng and Mirella Lapata. 2010. How many words is a picture worth? Automatic caption generation for news images. In ACL, pages 1239-1249.
-
(2010)
ACL
, pp. 1239-1249
-
-
Feng, Y.1
Lapata, M.2
-
10
-
-
84908171707
-
Learning distributions over logical forms for referring expression generation
-
Nicholas FitzGerald, Yoav Artzi, and Luke Zettlemoyer. 2013. Learning distributions over logical forms for referring expression generation. In EMNLP.
-
(2013)
EMNLP
-
-
Gerald, N.F.1
Artzi, Y.2
Zettlemoyer, L.3
-
11
-
-
84869018122
-
From image annotation to image description
-
Ankush Gupta and Prashanth Mannem. 2012. From image annotation to image description. In NIPS, volume 7667, pages 196-204.
-
(2012)
NIPS
, vol.7667
, pp. 196-204
-
-
Gupta, A.1
Mannem, P.2
-
12
-
-
84878188505
-
Conceptto-text generation via discriminative reranking
-
Ioannis Konstas and Mirella Lapata. 2012. Conceptto-text generation via discriminative reranking. In ACL, pages 369-378.
-
(2012)
ACL
, pp. 369-378
-
-
Konstas, I.1
Lapata, M.2
-
13
-
-
84856184938
-
Computational generation of referring expressions: A survey
-
Emiel Krahmer and Kees Van Deemter. 2012. Computational generation of referring expressions: A survey. Computational Linguistics, 38(1):173-218.
-
(2012)
Computational Linguistics
, vol.38
, Issue.1
, pp. 173-218
-
-
Krahmer, E.1
Van Deemter, K.2
-
14
-
-
84893398951
-
Generating natural-language video descriptions using text-mined knowledge
-
Niveda Krishnamoorthy, Girish Malkarnenkar, Raymond Mooney, Kate Saenko, and Sergio Guadarrama. 2013. Generating natural-language video descriptions using text-mined knowledge. Procedings of AAAI, 2013(2):3.
-
(2013)
Procedings of AAAI
, vol.2013
, Issue.2
, pp. 3
-
-
Krishnamoorthy, N.1
Malkarnenkar, G.2
Mooney, R.3
Saenko, K.4
Guadarrama, S.5
-
15
-
-
80052901011
-
Baby talk: Understanding and generating simple image descriptions
-
G. Kulkarni, V. Premraj, S. Dhar, S. Li, Y. Choi, A.C. Berg, and T.L. Berg. 2011. Baby talk: Understanding and generating simple image descriptions. In Computer Vision and Pattern Recognition (CVPR), pages 1601-1608.
-
(2011)
Computer Vision and Pattern Recognition (CVPR)
, pp. 1601-1608
-
-
Kulkarni, G.1
Premraj, V.2
Dhar, S.3
Li, S.4
Choi, Y.5
Berg, A.C.6
Berg, T.L.7
-
16
-
-
84878189119
-
Collective generation of natural image descriptions
-
Polina Kuznetsova, Vicente Ordonez, Alexander C Berg, Tamara L Berg, and Yejin Choi. 2012. Collective generation of natural image descriptions. In ACL, pages 359-368.
-
(2012)
ACL
, pp. 359-368
-
-
Kuznetsova, P.1
Ordonez, V.2
Berg, A.C.3
Berg, T.L.4
Choi, Y.5
-
17
-
-
50649103674
-
What, where and who? Classifying events by scene and object recognition
-
IEEE
-
Li-Jia Li and Li Fei-Fei. 2007. What, where and who? Classifying events by scene and object recognition. In ICCV, pages 1-8. IEEE.
-
(2007)
ICCV
, pp. 1-8
-
-
Li, L.-J.1
Fei-Fei, L.2
-
18
-
-
33646393800
-
Semantic feature production norms for a large set of living and nonliving things
-
Ken McRae, George S. Cree, Mark S. Seidenberg, and Chris Mcnorgan. 2005. Semantic feature production norms for a large set of living and nonliving things. Behavior Research Methods, 37(4):547-559.
-
(2005)
Behavior Research Methods
, vol.37
, Issue.4
, pp. 547-559
-
-
McRae, K.1
Cree, G.S.2
Seidenberg, M.S.3
Mcnorgan, C.4
-
19
-
-
84976702763
-
WordNet: A lexical database for english
-
George A Miller. 1995. WordNet: a lexical database for english. Communications of the ACM, 38(11):39-41.
-
(1995)
Communications of the ACM
, vol.38
, Issue.11
, pp. 39-41
-
-
Miller, G.A.1
-
20
-
-
85034832841
-
Midge: Generating image descriptions from computer vision detections
-
Margaret Mitchell, Xufeng Han, Jesse Dodge, Alyssa Mensch, Amit Goyal, Alex Berg, Kota Yamaguchi, Tamara Berg, Karl Stratos, and Hal Daumé, III. 2012. Midge: Generating image descriptions from computer vision detections. In EACL, pages 747-756.
-
(2012)
EACL
, pp. 747-756
-
-
Mitchell, M.1
Han, X.2
Dodge, J.3
Mensch, A.4
Goyal, A.5
Berg, A.6
Yamaguchi, K.7
Berg, T.8
Stratos, K.9
Daumé, H.10
-
23
-
-
84944098666
-
Minimum error rate training in statistical machine translation
-
Franz Josef Och. 2003. Minimum error rate training in statistical machine translation. In ACL, pages 160-167.
-
(2003)
ACL
, pp. 160-167
-
-
Och, F.J.1
-
24
-
-
85162522202
-
Im2Text: Describing images using 1 million captioned photographs
-
Vicente Ordonez, Girish Kulkarni, and Tamara L. Berg. 2011. Im2Text: Describing images using 1 million captioned photographs. In NIPS, pages 1143-1151.
-
(2011)
NIPS
, pp. 1143-1151
-
-
Ordonez, V.1
Kulkarni, G.2
Berg, T.L.3
-
25
-
-
0013363097
-
BLEU: A method for automatic evaluation of machine translation
-
Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2001. BLEU: a method for automatic evaluation of machine translation. In Proc. of ACL.
-
(2001)
Proc. of ACL
-
-
Papineni, K.1
Roukos, S.2
Ward, T.3
Zhu, W.-J.4
-
27
-
-
84887383068
-
Expanded parts model for human attribute and action recognition in still images
-
Gaurav Sharma, Frédéric Jurie, Cordelia Schmid, et al. 2013. Expanded parts model for human attribute and action recognition in still images. In CVPR.
-
(2013)
CVPR
-
-
Sharma, G.1
Jurie, F.2
Schmid, C.3
-
28
-
-
84883376937
-
Grounded models of semantic representation
-
July
-
Carina Silberer and Mirella Lapata. 2012. Grounded models of semantic representation. In EMNLP, July.
-
(2012)
EMNLP
-
-
Silberer, C.1
Lapata, M.2
-
29
-
-
84906927181
-
Models of semantic representation with visual attributes
-
Carina Silberer, Vittorio Ferrari, and Mirella Lapata. 2013. Models of semantic representation with visual attributes. In ACL, pages 572-582.
-
(2013)
ACL
, pp. 572-582
-
-
Silberer, C.1
Ferrari, V.2
Lapata, M.3
-
30
-
-
77954862606
-
LabelMe: Online image annotation and applications
-
Antonio Torralba, Bryan C Russell, and Jenny Yuen. 2010. LabelMe: Online image annotation and applications. Proceedings of the IEEE, 98(8):1467-1484.
-
(2010)
Proceedings of the IEEE
, vol.98
, Issue.8
, pp. 1467-1484
-
-
Torralba, A.1
Russell, B.C.2
Yuen, J.3
-
31
-
-
78751648503
-
A survey of vision-based methods for action representation, segmentation and recognition
-
Daniel Weinland, Remi Ronfard, and Edmond Boyer. 2011. A survey of vision-based methods for action representation, segmentation and recognition. Computer Vision and Image Understanding, 115(2):224-241.
-
(2011)
Computer Vision and Image Understanding
, vol.115
, Issue.2
, pp. 224-241
-
-
Weinland, D.1
Ronfard, R.2
Boyer, E.3
-
33
-
-
84856672971
-
Action recognition by learning bases of action attributes and parts
-
Barcelona, Spain, November
-
Bangpeng Yao, Xiaoye Jiang, Aditya Khosla, Andy Lai Lin, Leonidas J. Guibas, and Li Fei-Fei. 2011. Action recognition by learning bases of action attributes and parts. In ICCV, Barcelona, Spain, November.
-
(2011)
ICCV
-
-
Yao, B.1
Jiang, X.2
Khosla, A.3
Lin, A.L.4
Guibas, L.J.5
Fei-Fei, L.6
-
35
-
-
61349199906
-
Animacy encoding in English: Why and how
-
Annie Zaenen, Jean Carletta, Gregory Garretson, Joan Bresnan, Andrew Koontz-Garboden, Tatiana Nikitina, M Catherine O'Connor, and Tom Wasow. 2004. Animacy encoding in English: why and how. In ACL Workshop on Discourse Annotation, pages 118-125.
-
(2004)
ACL Workshop On Discourse Annotation
, pp. 118-125
-
-
Zaenen, A.1
Carletta, J.2
Garretson, G.3
Bresnan, J.4
Koontz-Garboden, A.5
Nikitina, T.6
O'Connor, M.C.7
Wasow, T.8
-
36
-
-
85050418554
-
Z-MERT: A fully configurable open source tool for minimum error rate training of machine translation systems
-
Omar F. Zaidan. 2009. Z-MERT: A fully configurable open source tool for minimum error rate training of machine translation systems. The Prague Bulletin of Mathematical Linguistics, 91:79-88.
-
(2009)
The Prague Bulletin of Mathematical Linguistics
, vol.91
, pp. 79-88
-
-
Zaidan, O.F.1
-
37
-
-
84887338442
-
Bringing semantics into focus using visual abstraction
-
C. Lawrence Zitnick and Devi Parikh. 2013. Bringing semantics into focus using visual abstraction. In CVPR.
-
(2013)
CVPR
-
-
Zitnick, C.L.1
Parikh, D.2
|