-
1
-
-
41549147844
-
Modeling local coherence: An entity-based approach
-
R. Barzilay and M. Lapata. Modeling Local Coherence: An Entity-Based Approach. In ACL, 2008.
-
(2008)
ACL
-
-
Barzilay, R.1
Lapata, M.2
-
3
-
-
84957029470
-
Mind's eye: A recurrent visual representation for image caption generation
-
X. Chen and C. L. Zitnick. Mind's Eye: A Recurrent Visual Representation for Image Caption Generation. In CVPR, 2015.
-
(2015)
CVPR
-
-
Chen, X.1
Zitnick, C.L.2
-
5
-
-
84959236502
-
Long-term recurrent convolutional networks for visual recognition and description
-
J. Donahue, L. A. Hendricks, S. Guadarrama, M. Rohrbach, S. Venugopalan, K. Saenko, and T. Darrell. Long-term Recurrent Convolutional Networks for Visual Recognition and Description. In CVPR, 2015.
-
(2015)
CVPR
-
-
Donahue, J.1
Hendricks, L.A.2
Guadarrama, S.3
Rohrbach, M.4
Venugopalan, S.5
Saenko, K.6
Darrell, T.7
-
6
-
-
84959243872
-
Improving image-sentence embeddings using large weakly annotated photo collections
-
Y. Gong, L. Wang, M. Hodosh, J. Hockenmaier, and S. Lazebnik. Improving Image-Sentence Embeddings Using Large Weakly Annotated Photo Collections. In ECCV, 2014.
-
(2014)
ECCV
-
-
Gong, Y.1
Wang, L.2
Hodosh, M.3
Hockenmaier, J.4
Lazebnik, S.5
-
8
-
-
84883394520
-
Framing image description as a ranking task: Data, models and evaluation metrics
-
M. Hodosh, P. Young, and J. Hockenmaier. Framing Image Description as a Ranking Task: Data, Models and Evaluation Metrics. JAIR, 47:853-899, 2013.
-
(2013)
JAIR
, vol.47
, pp. 853-899
-
-
Hodosh, M.1
Young, P.2
Hockenmaier, J.3
-
9
-
-
84946734827
-
Deep visual-semantic alignments for generating image descriptions
-
A. Karpathy and L. Fei-Fei. Deep Visual-Semantic Alignments for Generating Image Descriptions. In CVPR, 2015.
-
(2015)
CVPR
-
-
Karpathy, A.1
Fei-Fei, L.2
-
10
-
-
84959191227
-
Joint photo stream and blog post summarization and exploration
-
G. Kim, S. Moon, and L. Sigal. Joint Photo Stream and Blog Post Summarization and Exploration. In CVPR, 2015.
-
(2015)
CVPR
-
-
Kim, G.1
Moon, S.2
Sigal, L.3
-
11
-
-
84959189488
-
Ranking and retrieval of image sequences from multiple paragraph queries
-
G. Kim, S. Moon, and L. Sigal. Ranking and Retrieval of Image Sequences from Multiple Paragraph Queries. In CVPR, 2015.
-
(2015)
CVPR
-
-
Kim, G.1
Moon, S.2
Sigal, L.3
-
13
-
-
84876231242
-
Imagenet classification with deep convolutional neural networks
-
A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet Classification with Deep Convolutional Neural Networks. In NIPS, 2012.
-
(2012)
NIPS
-
-
Krizhevsky, A.1
Sutskever, I.2
Hinton, G.E.3
-
14
-
-
80052901011
-
Baby talk: Understanding and generating image descriptions
-
G. Kulkarni, V. Premraj, S. Dhar, S. Li, Y. Choi, A. C. Berg, and T. L. Berg. Baby Talk: Understanding and Generating Image Descriptions. In CVPR, 2011.
-
(2011)
CVPR
-
-
Kulkarni, G.1
Premraj, V.2
Dhar, S.3
Li, S.4
Choi, Y.5
Berg, A.C.6
Berg, T.L.7
-
15
-
-
84934873221
-
TreeTalk: Composition and compression of trees for image descriptions
-
P. Kuznetsova, V. Ordonez, T. L. Berg, and Y. Choi. TreeTalk: Composition and Compression of Trees for Image Descriptions. In TACL, 2014.
-
(2014)
TACL
-
-
Kuznetsova, P.1
Ordonez, V.2
Berg, T.L.3
Choi, Y.4
-
16
-
-
33847226906
-
METEOR: An automatic metric for MT evaluation with improved correlation with human judgments
-
S. B. A. Lavie. METEOR: An Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments. In ACL, 2005.
-
(2005)
ACL
-
-
Lavie, S.B.A.1
-
17
-
-
84919829999
-
Distributed representations of sentences and documents
-
Q. Le and T. Mikolov. Distributed Representations of Sentences and Documents. In ICML, 2014.
-
(2014)
ICML
-
-
Le, Q.1
Mikolov, T.2
-
18
-
-
85117622017
-
The stanford CoreNLP natural language processing toolkit
-
C. D. Manning, M. Surdeanu, J. Bauer, J. Finkel, S. J. Bethard, and D. McClosky. The Stanford CoreNLP Natural Language Processing Toolkit. In ACL, 2014.
-
(2014)
ACL
-
-
Manning, C.D.1
Surdeanu, M.2
Bauer, J.3
Finkel, J.4
Bethard, S.J.5
McClosky, D.6
-
19
-
-
85083950512
-
Deep captioning with multimodal recurrent neural networks (m-RNN)
-
J. Mao, W. Xu, Y. Yang, J. Wang, Z. Huang, and A. L. Yuille. Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN). In ICLR, 2015.
-
(2015)
ICLR
-
-
Mao, J.1
Xu, W.2
Yang, Y.3
Wang, J.4
Huang, Z.5
Yuille, A.L.6
-
21
-
-
85162522202
-
Im2Text: Describing images using 1 million captioned photographs
-
V. Ordonez, G. Kulkarni, and T. L. Berg. Im2Text: Describing Images Using 1 Million Captioned Photographs. In NIPS, 2011.
-
(2011)
NIPS
-
-
Ordonez, V.1
Kulkarni, G.2
Berg, T.L.3
-
22
-
-
85133336275
-
BLEU: A method for automatic evaluation of machine translation
-
K. Papineni, S. Roukos, T. Ward, and W.-J. Zhu. BLEU: A Method for Automatic Evaluation of Machine Translation. In ACL, 2002.
-
(2002)
ACL
-
-
Papineni, K.1
Roukos, S.2
Ward, T.3
Zhu, W.-J.4
-
23
-
-
84898775239
-
Translating video content to natural language descriptions
-
M. Rohrbach, W. Qiu, I. Titov, S. Thater, M. Pinkal, and B. Schiele. Translating Video Content to Natural Language Descriptions. In ICCV, 2013.
-
(2013)
ICCV
-
-
Rohrbach, M.1
Qiu, W.2
Titov, I.3
Thater, S.4
Pinkal, M.5
Schiele, B.6
-
24
-
-
84965144884
-
Bidirectional recurrent neural networks
-
M. Schuster and K. K. Paliwal. Bidirectional Recurrent Neural Networks. In IEEE TSP, 1997.
-
(1997)
IEEE TSP
-
-
Schuster, M.1
Paliwal, K.K.2
-
25
-
-
85083953063
-
Very deep convolutional networks for large-scale image recognition
-
K. Simonyan and A. Zisserman. Very Deep Convolutional Networks for Large-Scale Image Recognition. In ICLR, 2015.
-
(2015)
ICLR
-
-
Simonyan, K.1
Zisserman, A.2
-
26
-
-
84906925854
-
Grounded compositional semantics for finding and describing images with sentences
-
R. Socher, A. Karpathy, Q. V. Le, C. D. Manning, and A. Y. Ng. Grounded Compositional Semantics for Finding and Describing Images with Sentences. In TACL, 2013.
-
(2013)
TACL
-
-
Socher, R.1
Karpathy, A.2
Le, Q.V.3
Manning, C.D.4
Ng, A.Y.5
-
27
-
-
84877724347
-
Multimodal learning with deep boltzmann machines
-
N. Srivastava and R. Salakhutdinov. Multimodal Learning with Deep Boltzmann Machines. In NIPS, 2012.
-
(2012)
NIPS
-
-
Srivastava, N.1
Salakhutdinov, R.2
-
31
-
-
0000903748
-
Generalization of backpropagation with application to a recurrent gas market model
-
P. J. Werbos. Generalization of Backpropagation with Application to a Recurrent Gas Market Model. Neural Networks, 1:339-356, 1988.
-
(1988)
Neural Networks
, vol.1
, pp. 339-356
-
-
Werbos, P.J.1
-
32
-
-
84952349307
-
Jointly modeling deep video and compositional text to bridge vision and language in a unified framework
-
R. Xu, C. Xiong, W. Chen, and J. J. Corso. Jointly Modeling Deep Video and Compositional Text to Bridge Vision and Language in a Unified Framework. In AAAI, 2015.
-
(2015)
AAAI
-
-
Xu, R.1
Xiong, C.2
Chen, W.3
Corso, J.J.4
|