-
1
-
-
78650686637
-
Dis-tributional memory: A general framework for corpus-based semantics
-
2
-
Marco Baroni and Alessandro Lenci. 2010. Dis-tributional Memory: A General Framework for Corpus-Based Semantics. Computational Linguis-tics, 36(4):673-721. 2
-
(2010)
Computational Linguis-tics
, vol.36
, Issue.4
, pp. 673-721
-
-
Baroni, M.1
Lenci, A.2
-
2
-
-
84906930943
-
Don't count, predict! A systematic comparison of context-counting vs. Context-predicting semantic vectors
-
Marco Baroni, Georgiana Dinu, and German Kruszewski. 2014. Don't count, predict! A systematic comparison of context-counting vs. context-predicting semantic vectors. In Proc. of ACL. 3
-
(2014)
Proc. of ACL.
, vol.3
-
-
Baroni, M.1
Dinu, G.2
Kruszewski, G.3
-
3
-
-
84866726859
-
Understanding and predicting importance in images
-
Alexander C Berg, Tamara L Berg, Hal Daume, Jesse Dodge, Amit Goyal, Xufeng Han, Alyssa Mensch, Margaret Mitchell, Aneesh Sood, Karl Stratos, et al. 2012. Understanding and Predicting Importance in Images. In Proc. of CVPR. 1
-
(2012)
Proc. of CVPR.
, vol.1
-
-
Berg, A.C.1
Berg, T.L.2
Daume, H.3
Dodge, J.4
Goyal, A.5
Han, X.6
Mensch, A.7
Mitchell, M.8
Sood, A.9
Stratos, K.10
-
4
-
-
84870673011
-
A compar-ison of vector-based representations for semantic composition
-
William Blacoe and Mirella Lapata. 2012. A Compar-ison of Vector-based Representations for Semantic Composition. In Proc. of EMNLP-CoNLL. 3
-
(2012)
Proc. of EMNLP-CoNLL.
, vol.3
-
-
Blacoe, W.1
Lapata, M.2
-
5
-
-
84944130694
-
Mind's eye: A recurrent visual representation for image caption generation
-
Xinlei Chen and C. Lawrence Zitnick. 2015. Mind's Eye: A Recurrent Visual Representation for Image Caption Generation. In Proc. of CVPR. 1
-
(2015)
Proc. of CVPR.
, vol.1
-
-
Chen, X.1
Lawrence Zitnick, C.2
-
6
-
-
84859020282
-
Better hypothesis testing for statisti-cal machine translation: Controlling for optimizer instability
-
Jonathan H Clark, Chris Dyer, Alon Lavie, and Noah A Smith. 2011. Better hypothesis testing for statisti-cal machine translation: Controlling for optimizer instability. In Proc. of ACL. 4
-
(2011)
Proc. of ACL.
, vol.4
-
-
Clark, J.H.1
Dyer, C.2
Lavie, A.3
Smith, N.A.4
-
8
-
-
84906928552
-
Comparing automatic evaluation measures for image descrip-tion
-
Desmond Elliott and Frank Keller. 2014. Comparing Automatic Evaluation Measures for Image Descrip-tion. In Proc. of ACL. 4
-
(2014)
Proc. of ACL.
, vol.4
-
-
Elliott, D.1
Keller, F.2
-
9
-
-
80052017343
-
Every picture tells a story: Gen-erating sentences from images
-
Ali Farhadi, M Hejrati, Mohammad Amin Sadeghi, P Young, C Rashtchian, J Hockenmaier, and David Forsyth. 2010. Every Picture Tells a Story: Gen-erating Sentences from Images. In Proc. ofECCV. 1
-
(2010)
Proc. OfECCV.
, vol.1
-
-
Farhadi, A.1
Hejrati, M.2
Amin Sadeghi, M.3
Young, P.4
Rashtchian, C.5
Hockenmaier, J.6
Forsyth, D.7
-
10
-
-
84944028434
-
Framing image description as a ranking task: Data, models and evaluation metrics
-
Micah Hodosh, Peter Young, and Julia Hockenmaier. 2013. Framing Image Description as a Ranking Task: Data, Models and Evaluation Metrics. Jour-nal of Artificial Intelligence Research. 1, 3
-
(2013)
Jour-nal of Artificial Intelligence Research.
, vol.1
, pp. 3
-
-
Hodosh, M.1
Young, P.2
Hockenmaier, J.3
-
11
-
-
84913580146
-
Caffe: Con-volutional architecture for fast feature embedding
-
ACM MM. 2
-
Yangqing Jia, Evan Shelhamer, Jeff Donahue, Sergey Karayev, Jonathan Long, Ross Girshick, Sergio Guadarrama, and Trevor Darrell. 2014. Caffe: Con-volutional Architecture for Fast Feature Embedding. In Proc. of ACM MM. 2
-
(2014)
Proc. of
-
-
Jia, Y.1
Shelhamer, E.2
Donahue, J.3
Karayev, S.4
Long, J.5
Girshick, R.6
Guadarrama, S.7
Darrell, T.8
-
12
-
-
84952902559
-
Deep visual-semantic alignments for generating image descrip-tions
-
Andrej Karpathy and Li Fei-Fei. 2015. Deep Visual-semantic Alignments for Generating Image Descrip-tions. In Proc. of CVPR. 1
-
(2015)
Proc. of CVPR.
, vol.1
-
-
Karpathy, A.1
Fei-Fei, L.2
-
13
-
-
84959252592
-
Deep fragment embeddings for bidirectional image sentence mapping
-
Andrej Karpathy, Armand Joulin, and Li Fei-Fei. 2014. Deep Fragment Embeddings for Bidirectional Image Sentence Mapping. In Proc. of NIPS. 1, 3
-
(2014)
Proc. of NIPS.
, vol.1
, pp. 3
-
-
Karpathy, A.1
Joulin, A.2
Fei-Fei, L.3
-
14
-
-
80052901011
-
Baby talk: Understanding and gener-ating simple image descriptions
-
Girish Kulkarni, Visruth Premraj, Sagnik Dhar, Sim-Ing Li, Yejin Choi, Alexander C Berg, and Tamara L Berg. 2011. Baby Talk: Understanding and Gener-ating Simple Image Descriptions. In Proc. of CVPR. 1
-
(2011)
Proc. of CVPR.
, vol.1
-
-
Kulkarni, G.1
Premraj, V.2
Dhar, S.3
Li, S.4
Choi, Y.5
Berg, A.C.6
Berg, T.L.7
-
15
-
-
84878189119
-
Collec-tive generation of natural image descriptions
-
Polina Kuznetsova, Vicente Ordonez, Alexander C Berg, Tamara L Berg, and Yejin Choi. 2012. Collec-tive Generation of Natural Image Descriptions. In Proc. of ACL. 1,2,4
-
(2012)
Proc. of ACL. 1,2,4
-
-
Kuznetsova, P.1
Ordonez, V.2
Berg, A.C.3
Berg, T.L.4
Choi, Y.5
-
16
-
-
84937834115
-
Microsoft COCO: Common objects in context
-
Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollar, and C Lawrence Zitnick. 2014. Microsoft COCO: Common Objects in Context. In Proc. ofECCV. 3
-
(2014)
Proc. OfECCV.
, vol.3
-
-
Lin, T.1
Maire, M.2
Belongie, S.3
Hays, J.4
Perona, P.5
Ramanan, D.6
Dollar, P.7
Lawrence Zitnick, C.8
-
17
-
-
84906925144
-
Non-parametric method for data-driven image caption-ing
-
Rebecca Mason and Eugene Charniak. 2014. Non-parametric Method for Data-driven Image Caption-ing. In Proc. of ACL. 1, 2,4
-
(2014)
Proc. of ACL.
, vol.1
, Issue.2
, pp. 4
-
-
Mason, R.1
Charniak, E.2
-
18
-
-
84898956512
-
Distributed representa-tions of words and phrases and their composition-ality
-
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Cor-rado, and Jeff Dean. 2013. Distributed Representa-tions of Words and Phrases and their Composition-ality. In Proc. of NIPS. 2, 3
-
(2013)
Proc. of NIPS
, vol.2
, pp. 3
-
-
Mikolov, T.1
Sutskever, I.2
Chen, K.3
Cor-Rado, G.S.4
Dean, J.5
-
19
-
-
85034832841
-
Midge: Generating image descriptions from computer vision detections
-
Margaret Mitchell, Xufeng Han, Jesse Dodge, Alyssa Mensch, Amit Goyal, Alex Berg, Kota Yamaguchi, Tamara Berg, Karl Stratos, and Hal Daume III. 2012. Midge: Generating Image Descriptions from Computer Vision Detections. In Proc. of EACL. 1
-
(2012)
Proc. of EACL.
, vol.1
-
-
Mitchell, M.1
Han, X.2
Dodge, J.3
Mensch, A.4
Goyal, A.5
Berg, A.6
Yamaguchi, K.7
Berg, T.8
Stratos, K.9
Daume, H.10
-
20
-
-
84944096380
-
Language models for image captioning: The quirks and what works
-
Margaret Mitchell, Hao Fang, Hao Cheng, Saurabh Gupta, Jacob Devlin, and Geoffrey Zweig. 2015. Language Models for Image Captioning: The Quirks and What Works. In Proc. of ACL. 2
-
(2015)
Proc. of ACL
, vol.2
-
-
Mitchell, M.1
Fang, H.2
Cheng, H.3
Gupta, S.4
Devlin, J.5
Zweig, G.6
-
21
-
-
85162522202
-
Im2text: Describing images using 1 million captioned photographs
-
Vicente Ordonez, Girish Kulkarni, and Tamara L Berg. 2011. Im2text: Describing Images using 1 Million Captioned Photographs. In Proc. of NIPS. 1, 2, 3
-
(2011)
Proc. of NIPS.
, vol.1
, Issue.2
, pp. 3
-
-
Ordonez, V.1
Kulkarni, G.2
Berg, T.L.3
-
22
-
-
85133336275
-
BLEU: A Method for automatic evaluation of machine translation
-
Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. BLEU: a Method for Automatic Evaluation of Machine Translation. In Proc. of ACL. 4
-
(2002)
Proc. of ACL.
, vol.4
-
-
Papineni, K.1
Roukos, S.2
Ward, T.3
Zhu, W.4
-
23
-
-
84900870389
-
The SUN attribute database: Beyond categories for deeper scene understanding
-
1, 2
-
Genevieve Patterson, Chen Xu, Hang Su, and James Hays. 2014. The SUN Attribute Database: Beyond Categories for Deeper Scene Understanding. Inter-national Journal of Computer Vision, 108(l-2):59-81. 1,2
-
(2014)
Inter-national Journal of Computer Vision
, vol.108
, Issue.1-2
, pp. 59-81
-
-
Patterson, G.1
Xu, C.2
Su, H.3
Hays, J.4
-
24
-
-
84942666203
-
GloVe: Global vectors for word representation
-
Jeffrey Pennington, Richard Socher, and Christopher D Manning. 2014. GloVe: Global Vectors for Word Representation. Proc. ofEMNLP. 2, 3
-
(2014)
Proc. OfEMNLP
, vol.2
, pp. 3
-
-
Pennington, J.1
Socher, R.2
Manning, C.D.3
-
25
-
-
84906925854
-
Grounded compositional semantics for finding and describing images with sentences
-
Richard Socher, Andrej Karpathy, Quoc V. Le, Christopher D. Manning, and Andrew Y. Ng. 2014. Grounded Compositional Semantics for Finding and Describing Images with Sentences. Transactions of the Association for Computational Linguistics. 1
-
(2014)
Transactions of the Association for Computational Linguistics
, vol.1
-
-
Socher, R.1
Karpathy, A.2
Le, Q.V.3
Manning, C.D.4
Ng, A.Y.5
-
29
-
-
84970002232
-
Show, attend and tell: Neural image caption generation with VI-sual attention
-
Kelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhutdinov, Richard Zemel, and Yoshua Bengio. 2015. Show, Attend and Tell: Neural Image Caption Generation with Vi-sual attention. In Proc. oflCML. 1
-
(2015)
Proc. OflCML.
, vol.1
-
-
Xu, K.1
Ba, J.2
Kiros, R.3
Cho, K.4
Courville, A.5
Salakhutdinov, R.6
Zemel, R.7
Bengio, Y.8
|