-
1
-
-
84973890960
-
Vqa: Visual question answering
-
S. Antol, A. Agrawal, J. Lu, M. Mitchell, D. Batra, Z. Lawrence, and D. Parikh. Vqa: Visual question answering. In Proc. of ICCV, 2015.
-
(2015)
Proc. of ICCV
-
-
Antol, S.1
Agrawal, A.2
Lu, J.3
Mitchell, M.4
Batra, D.5
Lawrence, Z.6
Parikh, D.7
-
3
-
-
84933576022
-
Words jump-start vision: A label advantage in object recognition
-
B. Boutonnet and G. Lupyan. Words jump-start vision: A label advantage in object recognition. Journal of Neuroscience, 35(25): 9329-9335, 2015.
-
(2015)
Journal of Neuroscience
, vol.35
, Issue.25
, pp. 9329-9335
-
-
Boutonnet, B.1
Lupyan, G.2
-
4
-
-
84961291190
-
Learning phrase representations using RNN encoder-decoder for statistical machine translation
-
K. Cho, B. Van Merriënboer, C. Gulcehre, D. Bahdanau, F. Bougares, H. Schwenk, and Y. Bengio. Learning phrase representations using RNN encoder-decoder for statistical machine translation. In Proc. of EMNLP, 2014.
-
(2014)
Proc. of EMNLP
-
-
Cho, K.1
Van Merriënboer, B.2
Gulcehre, C.3
Bahdanau, D.4
Bougares, F.5
Schwenk, H.6
Bengio, Y.7
-
5
-
-
85041927710
-
Visual dialog
-
A. Das, S. Kottur, K. Gupta, A. Singh, D. Yadav, J. Moura, D. Parikh, and D. Batra. Visual Dialog. In Proc. of CVPR, 2017.
-
(2017)
Proc. of CVPR
-
-
Das, A.1
Kottur, S.2
Gupta, K.3
Singh, A.4
Yadav, D.5
Moura, J.6
Parikh, D.7
Batra, D.8
-
6
-
-
85041919303
-
GuessWhat?! Visual object discovery through multi-modal dialogue
-
H. de Vries, F. Strub, S. Chandar, O. Pietquin, H. Larochelle, and A. Courville. GuessWhat?! Visual object discovery through multi-modal dialogue. In Proc. of CVPR, 2017.
-
(2017)
Proc. of CVPR
-
-
De Vries, H.1
Strub, F.2
Chandar, S.3
Pietquin, O.4
Larochelle, H.5
Courville, A.6
-
8
-
-
34848886912
-
Introduction to the special issue on language-vision interactions
-
F. Ferreira and M. Tanenhaus. Introduction to the special issue on language-vision interactions. Journal of Memory and Language, 57(4): 455-459, 2007.
-
(2007)
Journal of Memory and Language
, vol.57
, Issue.4
, pp. 455-459
-
-
Ferreira, F.1
Tanenhaus, M.2
-
9
-
-
85044506279
-
Multimodal compact bilinear pooling for visual question answering and visual grounding
-
A. Fukui, D. Huk Park, D. Yang, A. Rohrbach, T. Darrell, and M. Rohrbach. Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding. In Proc. of EMNLP, 2016.
-
(2016)
Proc. of EMNLP
-
-
Fukui, A.1
Huk Park, D.2
Yang, D.3
Rohrbach, A.4
Darrell, T.5
Rohrbach, M.6
-
10
-
-
0031573117
-
Long short-term memory
-
MIT Press
-
S. Hochreiter and J. Schmidhuber. Long short-term memory. In Neural computation, Volume 9, pages 1735-1780. MIT Press, 1997.
-
(1997)
Neural Computation
, vol.9
, pp. 1735-1780
-
-
Hochreiter, S.1
Schmidhuber, J.2
-
11
-
-
85018917850
-
Hierarchical question-image co-attention for visual question answering
-
J. Jiasen, J. Yang, D. Batra, and D. Parikh. Hierarchical question-image co-attention for visual question answering. In Proc. of NIPS, 2016.
-
(2016)
Proc. of NIPS
-
-
Jiasen, J.1
Yang, J.2
Batra, D.3
Parikh, D.4
-
13
-
-
85018868398
-
Multimodal residual learning for visual qa
-
J. Kim, S. Lee, D. Kwak, M. Heo, J. Kim, J. Ha, and B. Zhang. Multimodal residual learning for visual qa. In Proc. of NIPS, 2016.
-
(2016)
Proc. of NIPS
-
-
Kim, J.1
Lee, S.2
Kwak, D.3
Heo, M.4
Kim, J.5
Ha, J.6
Zhang, B.7
-
14
-
-
85087529518
-
Hadamard product for low-rank bilinear pooling
-
J. Kim, K. On, J. Kim, J. Ha, and B. Zhang. Hadamard product for low-rank bilinear pooling. In Proc. of ICLR, 2017.
-
(2017)
Proc. of ICLR
-
-
Kim, J.1
On, K.2
Kim, J.3
Ha, J.4
Zhang, B.5
-
15
-
-
84901593868
-
Prior expectations evoke stimulus templates in the primary visual cortex
-
P. Kok, M. Failing, and F. de Lange. Prior expectations evoke stimulus templates in the primary visual cortex. Journal of Cognitive Neuroscience, 26(7): 1546-1554, 2014.
-
(2014)
Journal of Cognitive Neuroscience
, vol.26
, Issue.7
, pp. 1546-1554
-
-
Kok, P.1
Failing, M.2
De Lange, F.3
-
16
-
-
84937834115
-
Microsoft coco: Common objects in context
-
T. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollár, and L. Zitnick. Microsoft coco: Common objects in context. In Proc of ECCV, 2014.
-
(2014)
Proc of ECCV
-
-
Lin, T.1
Maire, M.2
Belongie, S.3
Hays, J.4
Perona, P.5
Ramanan, D.6
Dollár, P.7
Zitnick, L.8
-
17
-
-
84973896625
-
Ask your neurons: A neural-based approach to answering questions about images
-
M. Malinowski, M. Rohrbach, and M. Fritz. Ask your neurons: A neural-based approach to answering questions about images. In Proc. of ICCV, 2015.
-
(2015)
Proc. of ICCV
-
-
Malinowski, M.1
Rohrbach, M.2
Fritz, M.3
-
20
-
-
84965170394
-
Exploring models and data for image question answering
-
M. Ren, R. Kiros, and R. Zemel. Exploring models and data for image question answering. In Proc. of NIPS, 2015.
-
(2015)
Proc. of NIPS
-
-
Ren, M.1
Kiros, R.2
Zemel, R.3
-
21
-
-
84964923476
-
Batch normalization: Accelerating deep network training by reducing internal covariate shift
-
I. Sergey and S. Christian. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. In Proc. of ICML, 2015.
-
(2015)
Proc. of ICML
-
-
Sergey, I.1
Christian, S.2
-
23
-
-
85041900002
-
Making the V in VQA matter: Elevating the role of image understanding in visual question answering
-
G. Yashand K. Tejas, S. Douglas, Dhruv B, and P. Devi. Making the V in VQA matter: Elevating the role of image understanding in Visual Question Answering. In Proc. of CVPR, 2017.
-
(2017)
Proc. of CVPR
-
-
Yashand, G.1
Tejas, K.2
Douglas, S.3
Dhruv, B.4
Devi, P.5
-
24
-
-
63149129198
-
Unconscious effects of language-specific terminology on preattentive color perception
-
G. Thierry, P. Athanasopoulos, A. Wiggett, B. Dering, and JR. Kuipers. Unconscious effects of language-specific terminology on preattentive color perception. PNAS, 106(11): 4567-4570, 2009.
-
(2009)
PNAS
, vol.106
, Issue.11
, pp. 4567-4570
-
-
Thierry, G.1
Athanasopoulos, P.2
Wiggett, A.3
Dering, B.4
Kuipers, J.R.5
-
25
-
-
57249084011
-
Visualizing data using t-sne
-
L. Maaten van and G. der Hinton. Visualizing data using t-sne. JMLR, 9(Nov): 2579-2605, 2008.
-
(2008)
JMLR
, vol.9
, Issue.NOV
, pp. 2579-2605
-
-
Maaten Van, L.1
Der Hinton, G.2
-
26
-
-
84990044633
-
Ask, attend and answer: Exploring question-guided spatial attention for visual question answering
-
H. Xu and K. Saenko. Ask, attend and answer: Exploring question-guided spatial attention for visual question answering. In Proc. of ECCV, 2015.
-
(2015)
Proc. of ECCV
-
-
Xu, H.1
Saenko, K.2
-
27
-
-
84970002232
-
Show, attend and tell: Neural image caption generation with visual attention
-
K. Xu, J. Ba, R. Kiros, K. Cho, A. Courville, R. Salakhutdinov, R. Zemel, and Y. Bengio. Show, attend and tell: Neural image caption generation with visual attention. In Proc. of ICML, 2015.
-
(2015)
Proc. of ICML
-
-
Xu, K.1
Ba, J.2
Kiros, R.3
Cho, K.4
Courville, A.5
Salakhutdinov, R.6
Zemel, R.7
Bengio, Y.8
|