SCOPUS Information Retrieval Platform

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volume 9912 LNCS, 2016, Pages 727-739

Revisiting visual question answering baselines

Authors (3): Jabri, Allan (a); Joulin, Armand (a); van der Maaten, Laurens (a)

(a) Facebook AI Research (United States)

Author keywords

Dataset bias; Visual question answering

Indexed keywords

ELECTRIC GROUNDING;

BINARY CLASSIFICATION; DATASET BIAS; LEARNING SETTINGS; MEMORY MECHANISM; MODEL-BASED OPC; MULTI-CLASS CLASSIFIER; QUESTION ANSWERING; STATE-OF-THE-ART PERFORMANCE;

CLASSIFICATION (OF INFORMATION);

EID: 84990032802 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-319-46484-8_44 Document Type: Conference Paper

Times cited: 211

References (30)

1
- 84937522268
- Going deeper with convolutions
- Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2015)
- (2015) Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
- Szegedy, C.¹ Liu, W.² Jia, Y.³ Sermanet, P.⁴ Reed, S.⁵ Anguelov, D.⁶ Erhan, D.⁷ Vanhoucke, V.⁸ Rabinovich, A.⁹

2
- 84986274465
- Deep residual learning for image recognition
- He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2016)
- (2016) Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
- He, K.¹ Zhang, X.² Ren, S.³ Sun, J.⁴

3
- 84978730111
- arXiv:1602.07332
- Krishna, R., Zhu, Y., Groth, O., Johnson, J., Hata, K., Kravitz, J., Chen, S., Kalantidis, Y., Li, L.J., Shamma, D., Bernstein, M., Fei-Fei, L.: Visual genome: connecting language and vision using crowdsourced dense image annotations. arXiv:1602.07332 (2016)
- (2016) Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations
- Krishna, R.¹ Zhu, Y.² Groth, O.³ Johnson, J.⁴ Hata, K.⁵ Kravitz, J.⁶ Chen, S.⁷ Kalantidis, Y.⁸ Li, L.J.⁹ Shamma, D.¹⁰ Bernstein, M.¹¹ Fei-Fei, L.¹²

4
- 84925422907
- Visual Turing test for computer vision systems
- Geman, D., Geman, S., Hallonquist, N., Younes, L.: Visual Turing test for computer vision systems. Proc. Natl. Acad. Sci. 112(12), 3618-3623 (2015)
- (2015) Proc. Natl. Acad. Sci , vol.112 , Issue.12 , pp. 3618-3623
- Geman, D.¹ Geman, S.² Hallonquist, N.³ Younes, L.⁴

5
- 84973890960
- VQA: Visual question answering
- Antol, S., Agrawal, A., Lu, J., Mitchell, M., Batra, D., Zitnick, C., Parikh, D.: VQA: visual question answering. In: Proceedings of the International Conference on Computer Vision (2015)
- (2015) Proceedings of the International Conference on Computer Vision
- Antol, S.¹ Agrawal, A.² Lu, J.³ Mitchell, M.⁴ Batra, D.⁵ Zitnick, C.⁶ Parikh, D.⁷

6
- 84965170394
- Exploring models and data for image question answering
- Ren, M., Kiros, R., Zemel, R.: Exploring models and data for image question answering. In: Advances in Neural Information Processing Systems (2015)
- (2015) Advances in Neural Information Processing Systems
- Ren, M.¹ Kiros, R.² Zemel, R.³

7
- 84959862697
- arXiv:1506.00278
- Yu, L., Park, E., Berg, A., Berg, T.: Visual madlibs: fill in the blank image generation and question answering. arXiv:1506.00278 (2015)
- (2015) Visual Madlibs: Fill in the Blank Image Generation and Question Answering
- Yu, L.¹ Park, E.² Berg, A.³ Berg, T.⁴

8
- 84990038229
- arXiv:1511.03416
- Zhu, Y., Groth, O., Bernstein, M., Fei-Fei, L.: Visual7W: grounded question answering in images. arXiv:1511.03416 (2015)
- (2015) Visual7w: Grounded Question Answering in Images
- Zhu, Y.¹ Groth, O.² Bernstein, M.³ Fei-Fei, L.⁴

9
- 84986301525
- arXiv:1512.02167
- Zhou, B., Tian, Y., Sukhbaatar, S., Szlam, A., Fergus, R.: Simple baseline for visual question answering. arXiv:1512.02167 (2015)
- (2015) Simple Baseline for Visual Question Answering
- Zhou, B.¹ Tian, Y.² Sukhbaatar, S.³ Szlam, A.⁴ Fergus, R.⁵

10
- 84990044140
- arXiv:1606.03556
- Das, A., Agrawal, H., Zitnick, C.L., Parikh, D., Batra, D.: Human attention in visual question answering: do humans and deep networks look at the same regions? arXiv:1606.03556 (2016)
- (2016) Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions?
- Das, A.¹ Agrawal, H.² Zitnick, C.L.³ Parikh, D.⁴ Batra, D.⁵

11
- 85011809824
- Statistical significance tests for machine translation evaluation
- Koehn, P.: Statistical significance tests for machine translation evaluation. In: EMNLP, pp. 388-395 (2004)
- (2004) EMNLP , pp. 388-395
- Koehn, P.¹

12
- 84893361786
- Re-evaluating the role of BLEU in machine translation research
- Callison-Burch, C., Osborne, M., Koehn, P.: Re-evaluating the role of BLEU in machine translation research. In: EACL, vol. 6, pp. 249-256 (2006)
- (2006) EACL , vol.6 , pp. 249-256
- Callison-Burch, C.¹ Osborne, M.² Koehn, P.³

13
- 84956979389
- CoRR abs/1410.0210
- Malinowski, M., Fritz, M.: A multi-world approach to question answering about real-world scenes based on uncertain input. CoRR abs/1410.0210 (2014)
- (2014) A Multi-World Approach to Question Answering about Real-World Scenes Based on Uncertain Input
- Malinowski, M.¹ Fritz, M.²

14
- 84937834115
- Microsoft COCO: Common objects in context
- Lin, T.Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Dollar, P., Zitnick, C.: Microsoft COCO: common objects in context. In: Proceedings of the European Conference on Computer Vision (2014)
- (2014) Proceedings of the European Conference on Computer Vision
- Lin, T.Y.¹ Maire, M.² Belongie, S.³ Hays, J.⁴ Perona, P.⁵ Ramanan, D.⁶ Dollar, P.⁷ Zitnick, C.⁸

15
- 84973896625
- Ask your neurons: A neural-based approach to answering questions about images
- Malinowski, M., Rohrbach, M., Fritz, M.: Ask your neurons: a neural-based approach to answering questions about images. In: Proceedings of the International Conference on Computer Vision (2015)
- (2015) Proceedings of the International Conference on Computer Vision
- Malinowski, M.¹ Rohrbach, M.² Fritz, M.³

16
- 84990062072
- arXiv:1603.02814
- Wu, Q., Shen, C., van den Hengel, A., Wang, P., Dick, A.: Image captioning and visual question answering based on attributes and their related external knowledge. arXiv:1603.02814 (2016)
- (2016) Image Captioning and Visual Question Answering Based on Attributes and Their Related External Knowledge
- Wu, Q.¹ Shen, C.² van Den Hengel, A.³ Wang, P.⁴ Dick, A.⁵

17
- 49949092526
- DBpedia: A nucleus for a web of open data
- Aberer, K., Choi, K.-S., Noy, N., Allemang, D., Lee, K.-I., Nixon, L., Golbeck, J., Mika, P., Maynard, D., Mizoguchi, R., Schreiber, G., Cudré-Mauroux, P. (eds.), Springer, Heidelberg
- Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.: DBpedia: a nucleus for a web of open data. In: Aberer, K., Choi, K.-S., Noy, N., Allemang, D., Lee, K.-I., Nixon, L., Golbeck, J., Mika, P., Maynard, D., Mizoguchi, R., Schreiber, G., Cudré-Mauroux, P. (eds.) ASWC/ISWC-2007. LNCS, vol. 4825, pp. 722-735. Springer, Heidelberg (2007). doi:10.1007/978-3-540-76298-0_52
- (2007) ASWC/ISWC-2007. LNCS , vol.4825 , pp. 722-735
- Auer, S.¹ Bizer, C.² Kobilarov, G.³ Lehmann, J.⁴ Cyganiak, R.⁵ Ives, Z.⁶

18
- 84965148420
- Are you talking to a machine? Dataset and methods for multilingual image question answering
- Gao, H., Mao, J., Zhou, J., Huang, Z., Wang, L., Xu, W.: Are you talking to a machine? Dataset and methods for multilingual image question answering. In: Advances in Neural Information Processing Systems (2015)
- (2015) Advances in Neural Information Processing Systems
- Gao, H.¹ Mao, J.² Zhou, J.³ Huang, Z.⁴ Wang, L.⁵ Xu, W.⁶

19
- 84957021783
- arXiv:1506.00333
- Ma, L., Lu, Z., Li, H.: Learning to answer questions from image using convolutional neural network. arXiv:1506.00333 (2015)
- (2015) Learning to Answer Questions from Image Using Convolutional Neural Network
- Ma, L.¹ Lu, Z.² Li, H.³

20
- 84990021264
- arXiv:1511.02799
- Andreas, J., Rohrbach, M., Darrell, T., Klein, D.: Deep compositional question answering with neural module networks. arXiv:1511.02799 (2015)
- (2015) Deep Compositional Question Answering with Neural Module Networks
- Andreas, J.¹ Rohrbach, M.² Darrell, T.³ Klein, D.⁴

21
- 84990060711
- Fukui, A., Huk Park, D., Yang, D., Rohrbach, A., Darrell, T., Rohrbach, M.: Multimodal compact bilinear pooling for visual question answering and visual grounding (2016)
- (2016) Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
- Fukui, A.¹ Huk Park, D.² Yang, D.³ Rohrbach, A.⁴ Darrell, T.⁵ Rohrbach, M.⁶

22
- 84990020800
- Lu, J., Yang, J., Batra, D., Parikh, D.: Hierarchical question-image co-attention for visual question answering (2016)
- (2016) Hierarchical Question-Image Co-Attention for Visual Question Answering
- Lu, J.¹ Yang, J.² Batra, D.³ Parikh, D.⁴

23
- 84990054468
- arXiv:1511.07394
- Shih, K.J., Singh, S., Hoiem, D.: Where to look: Focus regions for visual question answering. arXiv:1511.07394 (2016)
- (2016) Where to Look: Focus Regions for Visual Question Answering
- Shih, K.J.¹ Singh, S.² Hoiem, D.³

24
- 84904163933
- Dropout: A simple way to prevent neural networks from overfitting
- Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., Salakhutdinov, R.: Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15(1), 1929-1958 (2014)
- (2014) J. Mach. Learn. Res , vol.15 , Issue.1 , pp. 1929-1958
- Srivastava, N.¹ Hinton, G.² Krizhevsky, A.³ Sutskever, I.⁴ Salakhutdinov, R.⁵

25
- 84990056975
- Gross, S., Wilber, M.: Training and investigating residual nets (2016)
- (2016) Training and Investigating Residual Nets
- Gross, S.¹ Wilber, M.²

26
- 85083951332
- arXiv:1301.3781
- Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. arXiv:1301.3781 (2013)
- (2013) Efficient Estimation of Word Representations in Vector Space
- Mikolov, T.¹ Chen, K.² Corrado, G.³ Dean, J.⁴

27
- 84876231242
- Imagenet classification with deep convolutional neural networks
- Krizhevsky, A., Sutskever, I., Hinton, G.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems (2012)
- (2012) Advances in Neural Information Processing Systems
- Krizhevsky, A.¹ Sutskever, I.² Hinton, G.³

28
- 0031573117
- Long short-term memory
- Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735-1780 (1997)
- (1997) Neural Comput , vol.9 , Issue.8 , pp. 1735-1780
- Hochreiter, S.¹ Schmidhuber, J.²

29
- 84979046490
- arXiv:1511.0225
- Joulin, A., van der Maaten, L., Jabri, A., Vasilache, N.: Learning visual features from large weakly supervised data. arXiv:1511.0225 (2015)
- (2015) Learning Visual Features from Large Weakly Supervised Data
- Joulin, A.¹ van Der Maaten, L.² Jabri, A.³ Vasilache, N.⁴

30
- 84906506420
- arXiv:1403.6382
- Razavian, A.S., Azizpour, H., Sullivan, J., Carlsson, S.: CNN features off-the-shelf: an astounding baseline for recognition. arXiv:1403.6382 (2014)
- (2014) CNN Features Off-The-Shelf: An Astounding Baseline for Recognition
- Razavian, A.S.¹ Azizpour, H.² Sullivan, J.³ Carlsson, S.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.