-
1
-
-
84973890960
-
Vqa: Visual question answering
-
Stanislaw Antol, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv Batra, C. Lawrence Zitnick, and Devi Parikh. 2015. Vqa: Visual question answering. In International Conference on Computer Vision (ICCV).
-
(2015)
International Conference on Computer Vision (ICCV)
-
-
Antol, S.1
Agrawal, A.2
Lu, J.3
Mitchell, M.4
Batra, D.5
Zitnick, C.L.6
Parikh, D.7
-
2
-
-
84959908834
-
Déjà image-captions: A corpus of expressive descriptions in repetition
-
Denver, Colorado, May-June. Association for Computational Linguistics
-
Jianfu Chen, Polina Kuznetsova, David Warren, and Yejin Choi. 2015. Déjà image-captions: A corpus of expressive descriptions in repetition. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 504-514, Denver, Colorado, May-June. Association for Computational Linguistics.
-
(2015)
Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
, pp. 504-514
-
-
Chen, J.1
Kuznetsova, P.2
Warren, D.3
Choi, Y.4
-
3
-
-
84921940378
-
Learning phrase representations using RNN encoder-decoder for statistical machine translation
-
Kyunghyun Cho, Bart van Merrienboer, Caglar Gulcehre, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. 2014. Learning phrase representations using RNN encoder-decoder for statistical machine translation. CoRR.
-
(2014)
CoRR
-
-
Cho, K.1
Van Merrienboer, B.2
Gulcehre, C.3
Bougares, F.4
Schwenk, H.5
Bengio, Y.6
-
4
-
-
84944096380
-
Language models for image captioning: The quirks and what works
-
Short Papers Beijing, China, July. Association for Computational Linguistics
-
Jacob Devlin, Hao Cheng, Hao Fang, Saurabh Gupta, Li Deng, Xiaodong He, Geoffrey Zweig, and Margaret Mitchell. 2015. Language models for image captioning: The quirks and what works. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), pages 100-105, Beijing, China, July. Association for Computational Linguistics.
-
(2015)
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing
, vol.2
, pp. 100-105
-
-
Devlin, J.1
Cheng, H.2
Fang, H.3
Gupta, S.4
Deng, L.5
He, X.6
Zweig, G.7
Mitchell, M.8
-
5
-
-
84906929591
-
Image description using visual dependency representations
-
Seattle, Washington, USA, October. Association for Computational Linguistics
-
Desmond Elliott and Frank Keller. 2013. Image description using visual dependency representations. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pages 1292-1302, Seattle, Washington, USA, October. Association for Computational Linguistics.
-
(2013)
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing
, pp. 1292-1302
-
-
Elliott, D.1
Keller, F.2
-
6
-
-
84959250180
-
From captions to visual concepts and back
-
Hao Fang, Saurabh Gupta, Forrest N. Iandola, Rupesh Srivastava, Li Deng, Piotr Dollár, Jianfeng Gao, Xiaodong He, Margaret Mitchell, John C. Platt, C. Lawrence Zitnick, and Geoffrey Zweig. 2015. From captions to visual concepts and back. In Computer Vision and Pattern Recognition (CVPR).
-
(2015)
Computer Vision and Pattern Recognition (CVPR)
-
-
Fang, H.1
Gupta, S.2
Iandola, F.N.3
Srivastava, R.4
Deng, L.5
Dollár, P.6
Gao, J.7
He, X.8
Mitchell, M.9
Platt, J.C.10
Zitnick, C.L.11
Zweig, G.12
-
7
-
-
84959904882
-
A survey of current datasets for vision and language research
-
Lisbon, Portugal, September. Association for Computational Linguistics
-
Francis Ferraro, Nasrin Mostafazadeh, Ting-Hao K. Huang, Lucy Vanderwende, Jacob Devlin, Michel Galley, and Margaret Mitchell. 2015. A survey of current datasets for vision and language research. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pages 207-213, Lisbon, Portugal, September. Association for Computational Linguistics.
-
(2015)
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing
, pp. 207-213
-
-
Ferraro, F.1
Mostafazadeh, N.2
Huang, T.-H.K.3
Vanderwende, L.4
Devlin, J.5
Galley, M.6
Mitchell, M.7
-
8
-
-
84965148420
-
Are you talking to a machine? Dataset and methods for multilingual image question
-
C. Cortes, N.D. Lawrence, D.D. Lee, M. Sugiyama, R. Garnett, and R. Garnett, editors, Curran Associates, Inc
-
Haoyuan Gao, Junhua Mao, Jie Zhou, Zhiheng Huang, Lei Wang, and Wei Xu. 2015. Are you talking to a machine? dataset and methods for multilingual image question. In C. Cortes, N.D. Lawrence, D.D. Lee, M. Sugiyama, R. Garnett, and R. Garnett, editors, Advances in Neural Information Processing Systems 28, pages 2287-2295. Curran Associates, Inc.
-
(2015)
Advances in Neural Information Processing Systems
, vol.28
, pp. 2287-2295
-
-
Gao, H.1
Mao, J.2
Zhou, J.3
Huang, Z.4
Wang, L.5
Xu, W.6
-
10
-
-
84965153327
-
Skip-thought vectors
-
C. Cortes, N.D. Lawrence, D.D. Lee, M. Sugiyama, R. Garnett, and R. Garnett, editors, Curran Associates, Inc
-
Ryan Kiros, Yukun Zhu, Ruslan R Salakhutdinov, Richard Zemel, Raquel Urtasun, Antonio Torralba, and Sanja Fidler. 2015. Skip-thought vectors. In C. Cortes, N.D. Lawrence, D.D. Lee, M. Sugiyama, R. Garnett, and R. Garnett, editors, Advances in Neural Information Processing Systems 28, pages 3276-3284. Curran Associates, Inc.
-
(2015)
Advances in Neural Information Processing Systems
, vol.28
, pp. 3276-3284
-
-
Kiros, R.1
Zhu, Y.2
Salakhutdinov, R.R.3
Zemel, R.4
Urtasun, R.5
Torralba, A.6
Fidler, S.7
-
11
-
-
84978730111
-
-
Ranjay Krishna, Yuke Zhu, Oliver Groth, Justin Johnson, Kenji Hata, Joshua Kravitz, Stephanie Chen, Yannis Kalanditis, Li-Jia Li, David A Shamma, Michael Bernstein, and Li Fei-Fei. 2016. Visual genome: Connecting language and vision using crowdsourced dense image annotations.
-
(2016)
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations
-
-
Krishna, R.1
Zhu, Y.2
Groth, O.3
Johnson, J.4
Hata, K.5
Kravitz, J.6
Chen, S.7
Kalanditis, Y.8
Li, L.-J.9
Shamma, D.A.10
Bernstein, M.11
Fei-Fei, L.12
-
12
-
-
84994184277
-
A diversity-promoting objective function for neural conversation models
-
Jiwei Li, Michel Galley, Chris Brockett, Jianfeng Gao, and Bill Dolan. 2016. A diversity-promoting objective function for neural conversation models. NAACL HLT 2016.
-
(2016)
NAACL HLT 2016
-
-
Li, J.1
Galley, M.2
Brockett, C.3
Gao, J.4
Dolan, B.5
-
13
-
-
85149140250
-
Automatic evaluation of machine translation quality using longest common subsequence and skip-bigram statistics
-
Stroudsburg, PA, USA. Association for Computational Linguistics
-
Chin-Yew Lin and Franz Josef Och. 2004. Automatic evaluation of machine translation quality using longest common subsequence and skip-bigram statistics. In Proceedings of the 42Nd Annual Meeting on Association for Computational Linguistics, ACL '04, Stroudsburg, PA, USA. Association for Computational Linguistics.
-
(2004)
Proceedings of the 42Nd Annual Meeting on Association for Computational Linguistics, ACL '04
-
-
Lin, C.-Y.1
Och, F.J.2
-
14
-
-
84906493406
-
Microsoft coco: Common objects in context
-
Springer
-
Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C Lawrence Zitnick. 2014. Microsoft coco: Common objects in context. In Computer Vision-ECCV 2014, pages 740-755. Springer.
-
(2014)
Computer Vision-ECCV 2014
, pp. 740-755
-
-
Lin, T.-Y.1
Maire, M.2
Belongie, S.3
Hays, J.4
Perona, P.5
Ramanan, D.6
Dollár, P.7
Zitnick, C.L.8
-
15
-
-
84937822746
-
A multiworld approach to question answering about realworld scenes based on uncertain input
-
Z. Ghahramani, M. Welling, C. Cortes, N.D. Lawrence, and K.Q. Weinberger, editors, Curran Associates, Inc
-
Mateusz Malinowski and Mario Fritz. 2014. A multiworld approach to question answering about realworld scenes based on uncertain input. In Z. Ghahramani, M. Welling, C. Cortes, N.D. Lawrence, and K.Q. Weinberger, editors, Advances in Neural Information Processing Systems 27, pages 1682-1690. Curran Associates, Inc.
-
(2014)
Advances in Neural Information Processing Systems
, vol.27
, pp. 1682-1690
-
-
Malinowski, M.1
Fritz, M.2
-
16
-
-
85117622017
-
The Stanford CoreNLP natural language processing toolkit
-
Christopher D. Manning, Mihai Surdeanu, John Bauer, Jenny Finkel, Steven J. Bethard, and David McClosky. 2014. The Stanford CoreNLP natural language processing toolkit. In Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pages 55-60.
-
(2014)
Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations
, pp. 55-60
-
-
Manning, C.D.1
Surdeanu, M.2
Bauer, J.3
Finkel, J.4
Bethard, S.J.5
McClosky, D.6
-
19
-
-
84965170394
-
Exploring models and data for image question answering
-
C. Cortes, N.D. Lawrence, D.D. Lee, M. Sugiyama, R. Garnett, and R. Garnett, editors, Curran Associates, Inc
-
Mengye Ren, Ryan Kiros, and Richard Zemel. 2015. Exploring models and data for image question answering. In C. Cortes, N.D. Lawrence, D.D. Lee, M. Sugiyama, R. Garnett, and R. Garnett, editors, Advances in Neural Information Processing Systems 28, pages 2935-2943. Curran Associates, Inc.
-
(2015)
Advances in Neural Information Processing Systems
, vol.28
, pp. 2935-2943
-
-
Ren, M.1
Kiros, R.2
Zemel, R.3
-
21
-
-
84949572890
-
-
arXiv preprint arXiv:1503.01817
-
Bart Thomee, David A Shamma, Gerald Friedland, Benjamin Elizalde, Karl Ni, Douglas Poland, Damian Borth, and Li-Jia Li. 2015. The new data and new challenges in multimedia research. arXiv preprint arXiv:1503.01817.
-
(2015)
The New Data and New Challenges in Multimedia Research
-
-
Thomee, B.1
Shamma, D.A.2
Friedland, G.3
Elizalde, B.4
Ni, K.5
Poland, D.6
Borth, D.7
Li, L.-J.8
-
22
-
-
85044451662
-
Show and tell: A neural image caption generator
-
Oriol Vinyals, Alexander Toshev, Samy Bengio, and Dumitru Erhan. 2014. Show and tell: a neural image caption generator. In CVPR.
-
(2014)
CVPR
-
-
Vinyals, O.1
Toshev, A.2
Bengio, S.3
Erhan, D.4
-
24
-
-
84904646639
-
Embers of society: Firelight talk among the ju/hoansi bushmen
-
Polly W Wiessner. 2014. Embers of society: Firelight talk among the ju/hoansi bushmen. Proceedings of the National Academy of Sciences, 111(39):14027-14035.
-
(2014)
Proceedings of the National Academy of Sciences
, vol.111
, Issue.39
, pp. 14027-14035
-
-
Wiessner, P.W.1
-
25
-
-
84939821074
-
-
arXiv preprint arXiv:1502.03044
-
Kelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhutdinov, Richard Zemel, and Yoshua Bengio. 2015. Show, attend and tell: Neural image caption generation with visual attention. arXiv preprint arXiv:1502.03044.
-
(2015)
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
-
-
Xu, K.1
Ba, J.2
Kiros, R.3
Cho, K.4
Courville, A.5
Salakhutdinov, R.6
Zemel, R.7
Bengio, Y.8
-
26
-
-
84906494296
-
From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions
-
Peter Young, Alice Lai, Micah Hodosh, and Julia Hockenmaier. 2014. From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions. Transactions of the Association for Computational Linguistics, 2:67-78.
-
(2014)
Transactions of the Association for Computational Linguistics
, vol.2
, pp. 67-78
-
-
Young, P.1
Lai, A.2
Hodosh, M.3
Hockenmaier, J.4
|