-
1
-
-
84985013144
-
Deep compositional question answering with neural module networks
-
Jacob Andreas, Marcus Rohrbach, Trevor Darrell, and Dan Klein. Deep compositional question answering with neural module networks. In CVPR, 2016.
-
(2016)
CVPR
-
-
Andreas, J.1
Rohrbach, M.2
Darrell, T.3
Klein, D.4
-
2
-
-
84973890960
-
Vqa: Visual question answering
-
Stanislaw Antol, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv Batra, C Lawrence Zitnick, and Devi Parikh. Vqa: Visual question answering. In ICCV, 2015.
-
(2015)
ICCV
-
-
Antol, S.1
Agrawal, A.2
Lu, J.3
Mitchell, M.4
Batra, D.5
Lawrence Zitnick, C.6
Parikh, D.7
-
3
-
-
85083953689
-
Neural Machine translation by jointly learning to align and translate
-
Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. Neural machine translation by jointly learning to align and translate. In ICLR, 2015.
-
(2015)
ICLR
-
-
Bahdanau, D.1
Cho, K.2
Bengio, Y.3
-
5
-
-
84990044140
-
-
arXiv preprint arXiv:1606.03556
-
Abhishek Das, Harsh Agrawal, C Lawrence Zitnick, Devi Parikh, and Dhruv Batra. Human attention in visual question answering: Do humans and deep networks look at the same regions? arXiv preprint arXiv:1606.03556, 2016.
-
(2016)
Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions?
-
-
Das, A.1
Agrawal, H.2
Lawrence Zitnick, C.3
Parikh, D.4
Batra, D.5
-
6
-
-
84965148420
-
Are you talking to a Machine? Dataset and methods for multilingual image question answering
-
Haoyuan Gao, Junhua Mao, Jie Zhou, Zhiheng Huang, Lei Wang, and Wei Xu. Are you talking to a machine? dataset and methods for multilingual image question answering. In NIPS, 2015.
-
(2015)
NIPS
-
-
Gao, H.1
Mao, J.2
Zhou, J.3
Huang, Z.4
Wang, L.5
Xu, W.6
-
7
-
-
84986274465
-
Deep residual learning for image recognition
-
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep residual learning for image recognition. In CVPR, 2016.
-
(2016)
CVPR
-
-
He, K.1
Zhang, X.2
Ren, S.3
Sun, J.4
-
8
-
-
84965139942
-
Teaching Machines to read and comprehend
-
Karl Moritz Hermann, Tomas Kocisky, Edward Grefenstette, Lasse Espeholt, Will Kay, Mustafa Suleyman, and Phil Blunsom. Teaching machines to read and comprehend. In NIPS, 2015.
-
(2015)
NIPS
-
-
Hermann, K.M.1
Kocisky, T.2
Grefenstette, E.3
Espeholt, L.4
Kay, W.5
Suleyman, M.6
Blunsom, P.7
-
9
-
-
84937936034
-
Convolutional neural network architectures for matching natural language sentences
-
Baotian Hu, Zhengdong Lu, Hang Li, and Qingcai Chen. Convolutional neural network architectures for matching natural language sentences. In NIPS, 2014.
-
(2014)
NIPS
-
-
Hu, B.1
Lu, Z.2
Li, H.3
Chen, Q.4
-
11
-
-
84978730111
-
-
arXiv preprint arXiv:1602.07332
-
Ranjay Krishna, Yuke Zhu, Oliver Groth, Justin Johnson, Kenji Hata, Joshua Kravitz, Stephanie Chen, Yannis Kalantidis, Li-Jia Li, David A Shamma, et al. Visual genome: Connecting language and vision using crowdsourced dense image annotations. arXiv preprint arXiv:1602.07332, 2016.
-
(2016)
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations
-
-
Krishna, R.1
Zhu, Y.2
Groth, O.3
Johnson, J.4
Hata, K.5
Kravitz, J.6
Chen, S.7
Kalantidis, Y.8
Li, J.-L.9
Shamma, D.A.10
-
12
-
-
84937834115
-
Microsoft coco: Common objects in context
-
Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C Lawrence Zitnick. Microsoft coco: Common objects in context. In ECCV, 2014.
-
(2014)
ECCV
-
-
Tsung, Y.-L.1
Maire, M.2
Belongie, S.3
Hays, J.4
Perona, P.5
Ramanan, D.6
Dollár, P.7
Lawrence Zitnick, C.8
-
13
-
-
85007153677
-
Learning to answer questions from image using convolutional neural network
-
Lin Ma, Zhengdong Lu, and Hang Li. Learning to answer questions from image using convolutional neural network. In AAAI, 2016.
-
(2016)
AAAI
-
-
Ma, L.1
Lu, Z.2
Li, H.3
-
14
-
-
84973896625
-
Ask your neurons: A neural-based approach to answering questions about images
-
Mateusz Malinowski, Marcus Rohrbach, and Mario Fritz. Ask your neurons: A neural-based approach to answering questions about images. In ICCV, 2015.
-
(2015)
ICCV
-
-
Malinowski, M.1
Rohrbach, M.2
Fritz, M.3
-
15
-
-
84965170394
-
Exploring models and data for image question answering
-
Mengye Ren, Ryan Kiros, and Richard Zemel. Exploring models and data for image question answering. In NIPS, 2015.
-
(2015)
NIPS
-
-
Ren, M.1
Kiros, R.2
Zemel, R.3
-
16
-
-
85083950860
-
Reasoning about entailment with neural attention
-
Tim Rocktäschel, Edward Grefenstette, Karl Moritz Hermann, Tomáš Kočiskỳ, and Phil Blunsom. Reasoning about entailment with neural attention. In ICLR, 2016.
-
(2016)
ICLR
-
-
Rocktäschel, T.1
Grefenstette, E.2
Hermann, K.M.3
Kočiskỳ, T.4
Blunsom, P.5
-
18
-
-
84986327457
-
Where to look: Focus regions for visual question answering
-
Kevin J Shih, Saurabh Singh, and Derek Hoiem. Where to look: Focus regions for visual question answering. In CVPR, 2016.
-
(2016)
CVPR
-
-
Shih, K.J.1
Singh, S.2
Hoiem, D.3
-
19
-
-
84933585162
-
Very deep convolutional networks for large-scale image recognition
-
Karen Simonyan and Andrew Zisserman. Very deep convolutional networks for large-scale image recognition. CoRR, abs/1409.1556, 2014.
-
(2014)
CoRR
-
-
Simonyan, K.1
Zisserman, A.2
-
20
-
-
84937522268
-
Going deeper with convolutions
-
Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. Going deeper with convolutions. In CVPR, 2015.
-
(2015)
CVPR
-
-
Szegedy, C.1
Liu, W.2
Jia, Y.3
Sermanet, P.4
Reed, S.5
Anguelov, D.6
Erhan, D.7
Vanhoucke, V.8
Rabinovich, A.9
-
21
-
-
84999008900
-
Dynamic memory networks for visual and textual question answering
-
Caiming Xiong, Stephen Merity, and Richard Socher. Dynamic memory networks for visual and textual question answering. In ICML, 2016.
-
(2016)
ICML
-
-
Xiong, C.1
Merity, S.2
Socher, R.3
-
23
-
-
84986334021
-
Stacked attention networks for image question answering
-
Zichao Yang, Xiaodong He, Jianfeng Gao, Li Deng, and Alex Smola. Stacked attention networks for image question answering. In CVPR, 2016.
-
(2016)
CVPR
-
-
Yang, Z.1
He, X.2
Gao, J.3
Deng, L.4
Smola, A.5
-
24
-
-
85015342918
-
Abcnn: Attention-based convolutional neural network for modeling sentence pairs
-
Wenpeng Yin, Hinrich Schütze, Bing Xiang, and Bowen Zhou. Abcnn: Attention-based convolutional neural network for modeling sentence pairs. In ACL, 2016.
-
(2016)
ACL
-
-
Yin, W.1
Schütze, H.2
Xiang, B.3
Zhou, B.4
-
25
-
-
84990069011
-
-
arXiv preprint arXiv:1511.05099
-
Peng Zhang, Yash Goyal, Douglas Summers-Stay, Dhruv Batra, and Devi Parikh. Yin and yang: Balancing and answering binary visual questions. arXiv preprint arXiv:1511.05099, 2015.
-
(2015)
Yin and Yang: Balancing and Answering Binary Visual Questions
-
-
Zhang, P.1
Goyal, Y.2
Summers-Stay, D.3
Batra, D.4
Parikh, D.5
-
26
-
-
84986275767
-
Visual7w: Grounded question answering in images
-
Yuke Zhu, Oliver Groth, Michael Bernstein, and Li Fei-Fei. Visual7w: Grounded question answering in images. In CVPR, 2016.
-
(2016)
CVPR
-
-
Zhu, Y.1
Groth, O.2
Bernstein, M.3
Fei-Fei, L.4
-
27
-
-
85018934522
-
Measuring Machine intelligence through visual question answering
-
C Lawrence Zitnick, Aishwarya Agrawal, Stanislaw Antol, Margaret Mitchell, Dhruv Batra, and Devi Parikh. Measuring machine intelligence through visual question answering. AI Magazine, 37(1), 2016.
-
(2016)
AI Magazine
, vol.37
, Issue.1
-
-
Lawrence Zitnick, C.1
Agrawal, A.2
Antol, S.3
Mitchell, M.4
Batra, D.5
Parikh, D.6
|