-
2
-
-
84944046597
-
-
arXiv preprint arXiv: 1411.4389
-
Donahue, I., Hendricks, L. A., Guadarrama, S., Rohrbach, M., Venugopalan, S., Saenko, K., and Darrell, T. Long-term Recurrent Convolutional Networks for Visual Recognition and Description. arXiv preprint arXiv: 1411.4389, 2014.
-
(2014)
Long-term Recurrent Convolutional Networks for Visual Recognition and Description
-
-
Donahue, I.1
Hendricks, L.A.2
Guadarrama, S.3
Rohrbach, M.4
Venugopalan, S.5
Saenko, K.6
Darrell, T.7
-
3
-
-
84959250180
-
From captions to visual concepts and back
-
Fang, H., Gupta, S., Iandola, F., Srivastava, R., Deng, L., Dollár, P., Gao, J., He, X., Mitchell, M., Piatt, J., Zitnick, C. L., and Zweig, G. From captions to visual concepts and back. In IEEE International Concference on Computer Vision and Patter Recognition (CVPR), 2015.
-
(2015)
IEEE International Concference on Computer Vision and Patter Recognition (CVPR)
-
-
Fang, H.1
Gupta, S.2
Iandola, F.3
Srivastava, R.4
Deng, L.5
Dollár, P.6
Gao, J.7
He, X.8
Mitchell, M.9
Piatt, J.10
Zitnick, C.L.11
Zweig, G.12
-
4
-
-
84883394520
-
Framing image description as a ranking task: Data, models and evaluation metrics
-
Hodosh, M., Young, P., and Hockenmaier, J. Framing image description as a ranking task: data, models and evaluation metrics. Journal of Artificial Intelligence Research, 2013.
-
(2013)
Journal of Artificial Intelligence Research
-
-
Hodosh, M.1
Young, P.2
Hockenmaier, J.3
-
7
-
-
84887601544
-
Baby talk: Understanding and generating simple image descriptions
-
Kulkarni, G., Premraj, V., Dhar, S., Li, Siming, Choi, Yejin, Berg, A. C., and Berg, T. L. Baby Talk: Understanding and Generating Simple Image Descriptions. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35 (12):2891-2903, 2013.
-
(2013)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.35
, Issue.12
, pp. 2891-2903
-
-
Kulkarni, G.1
Premraj, V.2
Dhar, S.3
Li, S.4
Choi, Y.5
Berg, A.C.6
Berg, T.L.7
-
8
-
-
84878189119
-
Collective generation of natural image descriptions
-
Association for Computational Linguistics, luly
-
Kuznetsova, P., Ordonez, V., Berg, A. C., Berg, T. L., and Choi, Y. Collective Generation of Natural Image Descriptions. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 359-368. Association for Computational Linguistics, luly 2012.
-
(2012)
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
, pp. 359-368
-
-
Kuznetsova, P.1
Ordonez, V.2
Berg, A.C.3
Berg, T.L.4
Choi, Y.5
-
9
-
-
84942673026
-
Rehabilitation of count-based models for word vector representations
-
Gelbukh, Alexander (ed.), of Lecture Notes in Computer Science, Springer International Publishing
-
Lebret, R. and Collobert, R. Rehabilitation of count-based models for word vector representations. In Gelbukh, Alexander (ed.), Computational Linguistics and Intelligent Text Processing, volume 9041 of Lecture Notes in Computer Science, pp. 417-429. Springer International Publishing, 2015.
-
(2015)
Computational Linguistics and Intelligent Text Processing
, vol.9041
, pp. 417-429
-
-
Lebret, R.1
Collobert, R.2
-
10
-
-
0032203257
-
Gradient-based learning applied to document recognition
-
Le Cun, Y., Bottou, L., Bengio, Y., and Haffner, P. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 1998.
-
(1998)
Proceedings of the IEEE
-
-
Le Cun, Y.1
Bottou, L.2
Bengio, Y.3
Haffner, P.4
-
11
-
-
84906493406
-
Microsoft COCO: Common objects in context
-
Springer
-
Lin, T.-Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Dollár, P., and Zitnick, C. L. Microsoft COCO: Common Objects in Context. In Computer Vision-ECCV2014, pp. 740-755. Springer, 2014.
-
(2014)
Computer Vision-ECCV2014
, pp. 740-755
-
-
Lin, T.-Y.1
Maire, M.2
Belongie, S.3
Hays, J.4
Perona, P.5
Ramanan, D.6
Dollár, P.7
Zitnick, C.L.8
-
12
-
-
85083950512
-
Deep captioning with multimodal recurrent neural networks (m-RNN)
-
Mao, I., Xu, W., Yang, Y., Wang, J., Huang, Z., and Yuille, A. L. Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN). In International Conference on Learning Representations (ICLR), 2015.
-
(2015)
International Conference on Learning Representations (ICLR)
-
-
Mao, I.1
Xu, W.2
Yang, Y.3
Wang, J.4
Huang, Z.5
Yuille, A.L.6
-
13
-
-
85083951332
-
-
arXiv preprint arXiv:1301.3781
-
Mikolov, T., Chen, K., Corrado, G., and Dean, I. Efficient Estimation of Word Representations in Vector Space. arXiv preprint arXiv:1301.3781, 2013a.
-
(2013)
Efficient Estimation of Word Representations in Vector Space
-
-
Mikolov, T.1
Chen, K.2
Corrado, G.3
Dean, I.4
-
14
-
-
84898956512
-
Distributed representations of words and phrases and their compositionality
-
Mikolov, T., Sutskever, I., Chen, K., Corrado, G., and Dean, J. Distributed Representations of Words and Phrases and their Compositionality. In Advances in Neural Information Processing Systems, pp. 3111-3119. 2013b.
-
(2013)
Advances in Neural Information Processing Systems
, pp. 3111-3119
-
-
Mikolov, T.1
Sutskever, I.2
Chen, K.3
Corrado, G.4
Dean, J.5
-
15
-
-
85034832841
-
Midge: Generating image descriptions from computer vision detections
-
Association for Computational Linguistics
-
Mitchell, M., Han, X., Dodge, J., Mensch, A., Goyal, A., Berg, A., Yamaguchi, K., Berg, T., Stratos, K., and Daume, III, H. Midge: Generating Image Descriptions from Computer Vision Detections. In Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, pp. 747-756. Association for Computational Linguistics, 2012.
-
(2012)
Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
, pp. 747-756
-
-
Mitchell, M.1
Han, X.2
Dodge, J.3
Mensch, A.4
Goyal, A.5
Berg, A.6
Yamaguchi, K.7
Berg, T.8
Stratos, K.9
Daume, H.10
-
17
-
-
85133336275
-
BLEU: A method for automatic evaluation of machine translation
-
Association for Computational Linguistics
-
Papineni, K., Roukos, S., Ward, T., and Zhu, W.-J. BLEU: A Method for Automatic Evaluation of Machine Translation. In Proceedings of the 40th annual meeting on association for computational linguistics, pp. 311-318. Association for Computational Linguistics, 2002.
-
(2002)
Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
, pp. 311-318
-
-
Papineni, K.1
Roukos, S.2
Ward, T.3
Zhu, W.-J.4
-
18
-
-
84961289992
-
GloVe: Global vectors for word representation
-
Pennington, J., Socher, R., and Manning, C. D. GloVe: Global Vectors for Word Representation. In Proceedings of the Empiricial Methods in Natural Language Processing (EMNLP 2014), volume 12, 2014.
-
(2014)
Proceedings of the Empiricial Methods in Natural Language Processing (EMNLP 2014)
, vol.12
-
-
Pennington, J.1
Socher, R.2
Manning, C.D.3
-
19
-
-
84933585162
-
Very deep convolutional networks for large-scale image recognition
-
Simonyan, K. and Zisserman, A. Very deep convolutional networks for large-scale image recognition. CoRR, 2014.
-
(2014)
CoRR
-
-
Simonyan, K.1
Zisserman, A.2
-
20
-
-
84964474107
-
Grounded compositional semantics for finding and describing images with sentences
-
Socher, R., Karpathy, A., Le, Q. V., Manning, C. D., and Ng, A. Y. Grounded Compositional Semantics for Finding and Describing Images with Sentences. Transactions of the Association for Computational Linguistics, 2:207-218, 2014.
-
(2014)
Transactions of the Association for Computational Linguistics
, vol.2
, pp. 207-218
-
-
Socher, R.1
Karpathy, A.2
Le, Q.V.3
Manning, C.D.4
Ng, A.Y.5
-
22
-
-
84944069490
-
-
arXiv preprint arXiv:1412.4729
-
Venugopalan, S., Xu, H., Donahue, J., Rohrbach, M., Mooney, R. J., and Saenko, K. Translating Videos to Natural Language Using Deep Recurrent Neural Networks. arXiv preprint arXiv:1412.4729, 2014.
-
(2014)
Translating Videos to Natural Language Using Deep Recurrent Neural Networks
-
-
Venugopalan, S.1
Xu, H.2
Donahue, J.3
Rohrbach, M.4
Mooney, R.J.5
Saenko, K.6
-
23
-
-
84939821075
-
-
arXiv preprint arXiv: 1411.4555
-
Vinyals, O., Toshev, A., Bengio, S., and Erhan, D. Show and tell: A neural image caption generator. arXiv preprint arXiv: 1411.4555, 2014.
-
(2014)
Show and Tell: A Neural Image Caption Generator
-
-
Vinyals, O.1
Toshev, A.2
Bengio, S.3
Erhan, D.4
-
24
-
-
77954862144
-
I2T: Image parsing to text description
-
Yao, B. Z., Yang, X., Lin, L., Lee, M. W., and Zhu, S. C. I2T: Image Parsing to Text Description. Proceedings of the IEEE, 98(8): 1485-1508, 2010.
-
(2010)
Proceedings of the IEEE
, vol.98
, Issue.8
, pp. 1485-1508
-
-
Yao, B.Z.1
Yang, X.2
Lin, L.3
Lee, M.W.4
Zhu, S.C.5
|