-
1
-
-
84937522268
-
Going deeper with convolutions
-
Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2015)
-
(2015)
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
-
-
Szegedy, C.1
Liu, W.2
Jia, Y.3
Sermanet, P.4
Reed, S.5
Anguelov, D.6
Erhan, D.7
Vanhoucke, V.8
Rabinovich, A.9
-
2
-
-
84986274465
-
Deep residual learning for image recognition
-
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2016)
-
(2016)
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
-
-
He, K.1
Zhang, X.2
Ren, S.3
Sun, J.4
-
3
-
-
84978730111
-
-
arXiv:1602.07332
-
Krishna, R., Zhu, Y., Groth, O., Johnson, J., Hata, K., Kravitz, J., Chen, S., Kalanditis, Y., Li, L.J., Shamma, D., Bernstein, M., Fei-Fei, L.: Visual genome: connecting language and vision using crowdsourced dense image annotations. arXiv:1602.07332 (2016)
-
(2016)
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations
-
-
Krishna, R.1
Zhu, Y.2
Groth, O.3
Johnson, J.4
Hata, K.5
Kravitz, J.6
Chen, S.7
Kalanditis, Y.8
Li, L.J.9
Shamma, D.10
Bernstein, M.11
Fei-Fei, L.12
-
4
-
-
84925422907
-
Visual turing test for computer vision systems
-
Geman, D., Geman, S., Hallonquist, N., Younes, L.: Visual turing test for computer vision systems. Proc. Natl. Acad. Sci. 112(12), 3618-3623 (2015)
-
(2015)
Proc. Natl. Acad. Sci
, vol.112
, Issue.12
, pp. 3618-3623
-
-
Geman, D.1
Geman, S.2
Hallonquist, N.3
Younes, L.4
-
5
-
-
84973890960
-
VQA: Visual question answering
-
Antol, S., Agrawal, A., Lu, J., Mitchell, M., Batra, D., Zitnick, C., Parikh, D.: VQA: visual question answering. In: Proceedings of the International Conference on Computer Vision (2015)
-
(2015)
Proceedings of the International Conference on Computer Vision
-
-
Antol, S.1
Agrawal, A.2
Lu, J.3
Mitchell, M.4
Batra, D.5
Zitnick, C.6
Parikh, D.7
-
7
-
-
84959862697
-
-
arXiv:1506.00278
-
Yu, L., Park, E., Berg, A., Berg, T.: Visual madlibs: fill in the blank image generation and question answering. arXiv:1506.00278 (2015)
-
(2015)
Visual Madlibs: Fill in the Blank Image Generation and Question Answering
-
-
Yu, L.1
Park, E.2
Berg, A.3
Berg, T.4
-
8
-
-
84990038229
-
-
arXiv:1511.03416
-
Zhu, Y., Groth, O., Bernstein, M., Fei-Fei, L.: Visual7W: grounded question answering in images. arXiv:1511.03416 (2015)
-
(2015)
Visual7w: Grounded Question Answering in Images
-
-
Zhu, Y.1
Groth, O.2
Bernstein, M.3
Fei-Fei, L.4
-
9
-
-
84986301525
-
-
arXiv:1512.02167
-
Zhou, B., Tian, Y., Sukhbataar, S., Szlam, A., Fergus, R.: Simple baseline for visual question answering. arXiv:1512.02167 (2015)
-
(2015)
Simple Baseline for Visual Question Answering
-
-
Zhou, B.1
Tian, Y.2
Sukhbataar, S.3
Szlam, A.4
Fergus, R.5
-
10
-
-
84990044140
-
-
arXiv:1606.03556
-
Das, A., Agrawal, H., Zitnick, C.L., Parikh, D., Batra, D.: Human attention in visual question answering: do humans and deep networks look at the same regions? arXiv:1606.03556 (2016)
-
(2016)
Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions?
-
-
Das, A.1
Agrawal, H.2
Zitnick, C.L.3
Parikh, D.4
Batra, D.5
-
11
-
-
85011809824
-
Statistical significance tests for machine translation evaluation
-
Koehn, P.: Statistical significance tests for machine translation evaluation. In: EMNLP, pp. 388-395 (2004)
-
(2004)
EMNLP
, pp. 388-395
-
-
Koehn, P.1
-
12
-
-
84893361786
-
Re-evaluation the role of BLEU in machine translation research
-
Callison-Burch, C., Osborne, M., Koehn, P.: Re-evaluation the role of BLEU in machine translation research. In: EACL, vol. 6, pp. 249-256 (2006)
-
(2006)
EACL
, vol.6
, pp. 249-256
-
-
Callison-Burch, C.1
Osborne, M.2
Koehn, P.3
-
14
-
-
84937834115
-
Microsoft COCO: Common objects in context
-
Lin, T.Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Dollar, P., Zitnick, C.: Microsoft COCO: common objects in context. In: Proceedings of the European Conference on Computer Vision (2014)
-
(2014)
Proceedings of the European Conference on Computer Vision
-
-
Lin, T.Y.1
Maire, M.2
Belongie, S.3
Hays, J.4
Perona, P.5
Ramanan, D.6
Dollar, P.7
Zitnick, C.8
-
16
-
-
84990062072
-
-
arXiv:1603.02814
-
Wu, Q., Shen, C., van den Hengel, A., Wang, P., Dick, A.: Image captioning and visual question answering based on attributes and their related external knowledge. arXiv:1603.02814 (2016)
-
(2016)
Image Captioning and Visual Question Answering Based on Attributes and Their Related External Knowledge
-
-
Wu, Q.1
Shen, C.2
van Den Hengel, A.3
Wang, P.4
Dick, A.5
-
17
-
-
49949092526
-
DBpedia: A nucleus for a web of open data
-
Aberer, K., Choi, K.-S., Noy, N., Allemang, D., Lee, K.-I., Nixon, L., Golbeck, J., Mika, P., Maynard, D., Mizoguchi, R., Schreiber, G., Cudré-Mauroux, P. (eds.), Springer, Heidelberg
-
Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.: DBpedia: a nucleus for a web of open data. In: Aberer, K., Choi, K.-S., Noy, N., Allemang, D., Lee, K.-I., Nixon, L., Golbeck, J., Mika, P., Maynard, D., Mizoguchi, R., Schreiber, G., Cudré-Mauroux, P. (eds.) ASWC/ISWC-2007. LNCS, vol. 4825, pp. 722-735. Springer, Heidelberg (2007). doi:10.1007/978-3-540-76298-0_52
-
(2007)
ASWC/ISWC-2007. LNCS
, vol.4825
, pp. 722-735
-
-
Auer, S.1
Bizer, C.2
Kobilarov, G.3
Lehmann, J.4
Cyganiak, R.5
Ives, Z.6
-
18
-
-
84965148420
-
Are you talking to a machine? Dataset and methods for multilingual image question answering
-
Gao, H., Mao, J., Zhou, J., Huang, Z., Wang, L., Xu, W.: Are you talking to a machine? Dataset and methods for multilingual image question answering. In: Advances in Neural Information Processing Systems (2015)
-
(2015)
Advances in Neural Information Processing Systems
-
-
Gao, H.1
Mao, J.2
Zhou, J.3
Huang, Z.4
Wang, L.5
Xu, W.6
-
20
-
-
84990021264
-
-
arXiv:1511.02799
-
Andreas, J., Rohrbach, M., Darrell, T., Klein, D.: Deep compositional question answering with neural module networks. arXiv:1511.02799 (2015)
-
(2015)
Deep Compositional Question Answering with Neural Module Networks
-
-
Andreas, J.1
Rohrbach, M.2
Darrell, T.3
Klein, D.4
-
21
-
-
84990060711
-
-
Fukui, A., Huk Park, D., Yang, D., Rohrbach, A., Darrell, T., Rohrbach, M.: Multimodal compact bilinear pooling for visual question answering and visual grounding (2016)
-
(2016)
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
-
-
Fukui, A.1
Huk Park, D.2
Yang, D.3
Rohrbach, A.4
Darrell, T.5
Rohrbach, M.6
-
22
-
-
84990020800
-
-
Lu, J., Yang, J., Batra, D., Parikh, D.: Hierarchical question-image co-attention for visual question answering (2016)
-
(2016)
Hierarchical Question-Image Co-Attention for Visual Question Answering
-
-
Lu, J.1
Yang, J.2
Batra, D.3
Parikh, D.4
-
24
-
-
84904163933
-
Dropout: A simple way to prevent neural networks from overfitting
-
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15(1), 1929-1958 (2014)
-
(2014)
J. Mach. Learn. Res
, vol.15
, Issue.1
, pp. 1929-1958
-
-
Srivastava, N.1
Hinton, G.2
Krizhevsky, A.3
Sutskever, I.4
Salakhutdinov, R.5
-
26
-
-
85083951332
-
-
arXiv:1301.3781
-
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. arXiv:1301.3781 (2013)
-
(2013)
Efficient Estimation of Word Representations in Vector Space
-
-
Mikolov, T.1
Chen, K.2
Corrado, G.3
Dean, J.4
-
29
-
-
84979046490
-
-
arXiv:1511.0225
-
Joulin, A., van der Maaten, L., Jabri, A., Vasilache, N.: Learning visual features from large weakly supervised data. arXiv:1511.0225 (2015)
-
(2015)
Learning Visual Features from Large Weakly Supervised Data
-
-
Joulin, A.1
van Der Maaten, L.2
Jabri, A.3
Vasilache, N.4
-
30
-
-
84906506420
-
-
arXiv:1403.6382
-
Razavian, A.S., Azizpour, H., Sullivan, J., Carlsson, S.: CNN features off-the-shelf: an astounding baseline for recognition. arXiv:1403.6382 (2014)
-
(2014)
CNN Features Off-The-Shelf: An Astounding Baseline for Recognition
-
-
Razavian, A.S.1
Azizpour, H.2
Sullivan, J.3
Carlsson, S.4
|