-
1
-
-
84973890960
-
VQA: Visual question answering
-
1, 2, 5, 6, 7, 8
-
S. Antol, A. Agrawal, J. Lu, M. Mitchell, D. Batra, C. L. Zitnick, and D. Parikh. VQA: visual question answering. In ICCV, 2015.
-
(2015)
ICCV
-
-
Antol, S.1
Agrawal, A.2
Lu, J.3
Mitchell, M.4
Batra, D.5
Zitnick, C.L.6
Parikh, D.7
-
2
-
-
84973882857
-
Predicting deep zero-shot convolutional neural networks using textual descriptions
-
2
-
J. Ba, K. Swersky, S. Fidler, and R. Salakhutdinov. Predicting deep zero-shot convolutional neural networks using textual descriptions. In ICCV, 2015.
-
(2015)
ICCV
-
-
Ba, J.1
Swersky, K.2
Fidler, S.3
Salakhutdinov, R.4
-
3
-
-
84969930652
-
Compressing neural networks with the hashing trick
-
2, 4, 5
-
W. Chen, J. T. Wilson, S. Tyree, K. Q. Weinberger, and Y. Chen. Compressing neural networks with the hashing trick. In ICML, 2015.
-
(2015)
ICML
-
-
Chen, W.1
Wilson, J.T.2
Tyree, S.3
Weinberger, K.Q.4
Chen, Y.5
-
4
-
-
84939821078
-
Empirical evaluation of gated recurrent neural networks on sequence modeling
-
4, 5, 7
-
J. Chung, C. Gulcehre, K. Cho, and Y. Bengio. Empirical evaluation of gated recurrent neural networks on sequence modeling. In NIPS Deep Learning Workshop, 2014.
-
(2014)
NIPS Deep Learning Workshop
-
-
Chung, J.1
Gulcehre, C.2
Cho, K.3
Bengio, Y.4
-
5
-
-
84911453074
-
Describing textures in the wild
-
1
-
M. Cimpoi, S. Maji, I. Kokkinos, S. Mohamed, and A. Vedaldi. Describing textures in the wild. In CVPR, 2014.
-
(2014)
CVPR
-
-
Cimpoi, M.1
Maji, S.2
Kokkinos, I.3
Mohamed, S.4
Vedaldi, A.5
-
6
-
-
85198028989
-
Imagenet: A large-scale hierarchical image database
-
3
-
J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei. Imagenet: A large-scale hierarchical image database. In CVPR, 2009.
-
(2009)
CVPR
-
-
Deng, J.1
Dong, W.2
Socher, R.3
Li, L.-J.4
Li, K.5
Fei-Fei, L.6
-
8
-
-
84919881041
-
DeCAF: A deep convolutional activation feature for generic visual recognition
-
1
-
J. Donahue, Y. Jia, O. Vinyals, J. Hoffman, N. Zhang, E. Tzeng, and T. Darrell. DeCAF: A deep convolutional activation feature for generic visual recognition. In ICML, 2014.
-
(2014)
ICML
-
-
Donahue, J.1
Jia, Y.2
Vinyals, O.3
Hoffman, J.4
Zhang, N.5
Tzeng, E.6
Darrell, T.7
-
10
-
-
84965148420
-
Are you talking to a machine dataset and methods for multilingual image question answering
-
1, 2
-
H. Gao, J. Mao, J. Zhou, Z. Huang, L. Wang, and W. Xu. Are you talking to a machine dataset and methods for multilingual image question answering. In NIPS, 2015.
-
(2015)
NIPS
-
-
Gao, H.1
Mao, J.2
Zhou, J.3
Huang, Z.4
Wang, L.5
Xu, W.6
-
12
-
-
84969584486
-
Batch normalization: Accelerating deep network training by reducing internal covariate shift
-
6
-
S. Ioffe and C. Szegedy. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In ICML, 2015.
-
(2015)
ICML
-
-
Ioffe, S.1
Szegedy, C.2
-
13
-
-
85083951076
-
Adam: A method for stochastic optimization
-
6
-
D. Kingma and J. Ba. Adam: A method for stochastic optimization. In ICLR, 2015.
-
(2015)
ICLR
-
-
Kingma, D.1
Ba, J.2
-
14
-
-
84965153327
-
Skip-thought vectors
-
2, 4, 5
-
R. Kiros, Y. Zhu, R. Salakhutdinov, R. S. Zemel, A. Torralba, R. Urtasun, and S. Fidler. Skip-thought vectors. In NIPS, 2015.
-
(2015)
NIPS
-
-
Kiros, R.1
Zhu, Y.2
Salakhutdinov, R.3
Zemel, R.S.4
Torralba, A.5
Urtasun, R.6
Fidler, S.7
-
15
-
-
85009931853
-
Microsoft COCO: Common objects in context
-
6
-
T.-Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollár, and C. L. Zitnick. Microsoft COCO: common objects in context. In ECCV, 2014.
-
(2014)
ECCV
-
-
Lin, T.-Y.1
Maire, M.2
Belongie, S.3
Hays, J.4
Perona, P.5
Ramanan, D.6
Dollár, P.7
Zitnick, C.L.8
-
16
-
-
85007153677
-
Learning to answer questions from image using convolutional neural network
-
1, 2, 3, 7
-
L. Ma, Z. Lu, and H. Li. Learning to answer questions from image using convolutional neural network. In AAAI, 2016.
-
(2016)
AAAI
-
-
Ma, L.1
Lu, Z.2
Li, H.3
-
17
-
-
84937822746
-
A multi-world approach to question answering about real-world scenes based on uncertain input
-
1, 2, 6, 7
-
M. Malinowski and M. Fritz. A multi-world approach to question answering about real-world scenes based on uncertain input. In NIPS, 2014.
-
(2014)
NIPS
-
-
Malinowski, M.1
Fritz, M.2
-
18
-
-
84973896625
-
Ask your neurons: A neural-based approach to answering questions about images
-
1, 2, 7
-
M. Malinowski, M. Rohrbach, and M. Fritz. Ask your neurons: A neural-based approach to answering questions about images. In ICCV, 2015.
-
(2015)
ICCV
-
-
Malinowski, M.1
Rohrbach, M.2
Fritz, M.3
-
19
-
-
79959829092
-
Recurrent neural network based language model
-
5
-
T. Mikolov, M. Karafiát, L. Burget, J. Cernockỳ, and S. Khudanpur. Recurrent neural network based language model. In INTERSPEECH, 2010.
-
(2010)
INTERSPEECH
-
-
Mikolov, T.1
Karafiát, M.2
Burget, L.3
Cernockỳ, J.4
Khudanpur, S.5
-
20
-
-
84886073305
-
Indoor segmentation and support inference from rgbd images
-
6
-
P. K. Nathan Silberman, Derek Hoiem and R. Fergus. Indoor segmentation and support inference from rgbd images. In ECCV, 2012.
-
(2012)
ECCV
-
-
Nathan Silberman, P.K.1
Hoiem, D.2
Fergus, R.3
-
21
-
-
84911449395
-
Learning and transferring mid-level image representations using convolutional neural networks
-
1
-
M. Oquab, L. Bottou, I. Laptev, and J. Sivic. Learning and transferring mid-level image representations using convolutional neural networks. In CVPR, 2014.
-
(2014)
CVPR
-
-
Oquab, M.1
Bottou, L.2
Laptev, I.3
Sivic, J.4
-
22
-
-
84897497795
-
On the difficulty of training recurrent neural networks
-
6
-
R. Pascanu, T. Mikolov, and Y. Bengio. On the difficulty of training recurrent neural networks. In ICML, 2013.
-
(2013)
ICML
-
-
Pascanu, R.1
Mikolov, T.2
Bengio, Y.3
-
23
-
-
84965170394
-
Exploring models and data for image question answering
-
1, 2, 3, 5, 6, 7
-
M. Ren, R. Kiros, and R. S. Zemel. Exploring models and data for image question answering. In NIPS, 2015.
-
(2015)
NIPS
-
-
Ren, M.1
Kiros, R.2
Zemel, R.S.3
-
24
-
-
85083953063
-
Very deep convolutional networks for large-scale image recognition
-
1, 3
-
K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. In ICLR, 2015.
-
(2015)
ICLR
-
-
Simonyan, K.1
Zisserman, A.2
-
25
-
-
84928547704
-
Sequence to sequence learning with neural networks
-
5
-
I. Sutskever, O. Vinyals, and Q. V. Le. Sequence to sequence learning with neural networks. In NIPS, 2014.
-
(2014)
NIPS
-
-
Sutskever, I.1
Vinyals, O.2
Le, Q.V.3
-
26
-
-
84937522268
-
Going deeper with convolutions
-
1
-
C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich. Going deeper with convolutions. In CVPR, 2015.
-
(2015)
CVPR
-
-
Szegedy, C.1
Liu, W.2
Jia, Y.3
Sermanet, P.4
Reed, S.5
Anguelov, D.6
Erhan, D.7
Vanhoucke, V.8
Rabinovich, A.9
-
27
-
-
84911198048
-
Deepface: Closing the gap to human-level performance in face verification
-
1
-
L. Wolf. Deepface: Closing the gap to human-level performance in face verification. In CVPR, 2014.
-
(2014)
CVPR
-
-
Wolf, L.1
-
28
-
-
85146676791
-
Verbs semantics and lexical selection
-
6
-
Z. Wu and M. Palmer. Verbs semantics and lexical selection. In ACL, 1994.
-
(1994)
ACL
-
-
Wu, Z.1
Palmer, M.2
-
29
-
-
84970002232
-
Show, attend and tell: Neural image caption generation with visual attention
-
6, 8
-
K. Xu, J. Ba, R. Kiros, A. Courville, R. Salakhutdinov, R. Zemel, and Y. Bengio. Show, attend and tell: Neural image caption generation with visual attention. In ICML, 2015.
-
(2015)
ICML
-
-
Xu, K.1
Ba, J.2
Kiros, R.3
Courville, A.4
Salakhutdinov, R.5
Zemel, R.6
Bengio, Y.7
-
30
-
-
84866687133
-
Describing the scene as a whole: Joint object detection, scene classification and semantic segmentation
-
1
-
J. Yao, S. Fidler, and R. Urtasun. Describing the scene as a whole: Joint object detection, scene classification and semantic segmentation. In CVPR, 2012.
-
(2012)
CVPR
-
-
Yao, J.1
Fidler, S.2
Urtasun, R.3
-
31
-
-
84937964578
-
Learning deep features for scene recognition using places database
-
1
-
B. Zhou, A. Lapedriza, J. Xiao, A. Torralba, and A. Oliva. Learning deep features for scene recognition using places database. In NIPS, 2014.
-
(2014)
NIPS
-
-
Zhou, B.1
Lapedriza, A.2
Xiao, J.3
Torralba, A.4
Oliva, A.5
|