Volume 2015-January, 2015, Pages 2296-2304

Are you talking to a machine? Dataset and methods for multilingual image question answering

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; INFORMATION SCIENCE; LINGUISTICS; NEURAL NETWORKS; STATISTICAL TESTS;

EID: 84965148420     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 450

References (44)
  • 3
    • 85083954148
    • Semantic image segmentation with deep convolutional nets and fully connected crfs
    • L.-C. Chen, G. Papandreou, I. Kokkinos, K. Murphy, and A. L. Yuille. Semantic image segmentation with deep convolutional nets and fully connected crfs. ICLR, 2015.
    • (2015) ICLR
    • Chen, L.-C.1    Papandreou, G.2    Kokkinos, I.3    Murphy, K.4    Yuille, A.L.5
  • 4
    • 84957029470
    • Learning a recurrent visual representation for image caption generation
    • X. Chen and C. L. Zitnick. Learning a recurrent visual representation for image caption generation. In CVPR, 2015.
    • (2015) CVPR
    • Chen, X.1    Zitnick, C.L.2
  • 7
    • 26444565569
    • Finding structure in time
    • J. L. Elman. Finding structure in time. Cognitive science, 14(2):179-211, 1990.
    • (1990) Cognitive Science , vol.14 , Issue.2 , pp. 179-211
    • Elman, J.L.1
  • 9
    • 84925422907
    • Visual turing test for computer vision systems
    • D. Geman, S. Geman, N. Hallonquist, and L. Younes. Visual turing test for computer vision systems. PNAS, 112(12):3618-3623, 2015.
    • (2015) PNAS , vol.112 , Issue.12 , pp. 3618-3623
    • Geman, D.1    Geman, S.2    Hallonquist, N.3    Younes, L.4
  • 10
    • 84911400494
    • Rich feature hierarchies for accurate object detection and semantic segmentation
    • R. Girshick, J. Donahue, T. Darrell, and J. Malik. Rich feature hierarchies for accurate object detection and semantic segmentation. In CVPR, 2014.
    • (2014) CVPR
    • Girshick, R.1    Donahue, J.2    Darrell, T.3    Malik, J.4
  • 13
    • 84926283798
    • Recurrent continuous translation models
    • N. Kalchbrenner and P. Blunsom. Recurrent continuous translation models. In EMNLP, pages 1700-1709, 2013.
    • (2013) EMNLP , pp. 1700-1709
    • Kalchbrenner, N.1    Blunsom, P.2
  • 14
    • 84946734827
    • Deep visual-semantic alignments for generating image descriptions
    • A. Karpathy and L. Fei-Fei. Deep visual-semantic alignments for generating image descriptions. In CVPR, 2015.
    • (2015) CVPR
    • Karpathy, A.1    Fei-Fei, L.2
  • 15
    • 84952349298
    • Unifying visual-semantic embeddings with multimodal neural language models
    • R. Kiros, R. Salakhutdinov, and R. S. Zemel. Unifying visual-semantic embeddings with multimodal neural language models. TACL, 2015.
    • (2015) TACL
    • Kiros, R.1    Salakhutdinov, R.2    Zemel, R.S.3
  • 17
    • 84876231242
    • Imagenet classification with deep convolutional neural networks
    • A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. In NIPS, 2012.
    • (2012) NIPS
    • Krizhevsky, A.1    Sutskever, I.2    Hinton, G.E.3
  • 18
    • 85120046073
    • Meteor: An automatic metric for mt evaluation with high levels of correlation with human judgements
    • Association for Computational Linguistics
    • A. Lavie and A. Agarwal. Meteor: An automatic metric for mt evaluation with high levels of correlation with human judgements. In Workshop on Statistical Machine Translation, pages 228-231. Association for Computational Linguistics, 2007.
    • (2007) Workshop on Statistical Machine Translation , pp. 228-231
    • Lavie, A.1    Agarwal, A.2
  • 22
    • 84937822746
    • A multi-world approach to question answering about real-world scenes based on uncertain input
    • M. Malinowski and M. Fritz. A multi-world approach to question answering about real-world scenes based on uncertain input. In Advances in Neural Information Processing Systems, pages 1682-1690, 2014.
    • (2014) Advances in Neural Information Processing Systems , pp. 1682-1690
    • Malinowski, M.1    Fritz, M.2
  • 24
    • 85083950512
    • Deep captioning with multimodal recurrent neural networks (m-rnn)
    • J. Mao, W. Xu, Y. Yang, J. Wang, Z. Huang, and A. Yuille. Deep captioning with multimodal recurrent neural networks (m-rnn). In ICLR, 2015.
    • (2015) ICLR
    • Mao, J.1    Xu, W.2    Yang, Y.3    Wang, J.4    Huang, Z.5    Yuille, A.6
  • 29
    • 84898956512
    • Distributed representations of words and phrases and their compositionality
    • T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean. Distributed representations of words and phrases and their compositionality. In NIPS, pages 3111-3119, 2013.
    • (2013) NIPS , pp. 3111-3119
    • Mikolov, T.1    Sutskever, I.2    Chen, K.3    Corrado, G.S.4    Dean, J.5
  • 30
    • 77956509090
    • Rectified linear units improve restricted boltzmann machines
    • V. Nair and G. E. Hinton. Rectified linear units improve restricted boltzmann machines. In ICML, pages 807-814, 2010.
    • (2010) ICML , pp. 807-814
    • Nair, V.1    Hinton, G.E.2
  • 31
    • 85133336275
    • Bleu: A method for automatic evaluation of machine translation
    • K. Papineni, S. Roukos, T. Ward, and W.-J. Zhu. Bleu: a method for automatic evaluation of machine translation. In ACL, pages 311-318, 2002.
    • (2002) ACL , pp. 311-318
    • Papineni, K.1    Roukos, S.2    Ward, T.3    Zhu, W.-J.4
  • 34
    • 85083953063
    • Very deep convolutional networks for large-scale image recognition
    • K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. In ICLR, 2015.
    • (2015) ICLR
    • Simonyan, K.1    Zisserman, A.2
  • 35
    • 84928547704
    • Sequence to sequence learning with neural networks
    • I. Sutskever, O. Vinyals, and Q. V. Le. Sequence to sequence learning with neural networks. In NIPS, pages 3104-3112, 2014.
    • (2014) NIPS , pp. 3104-3112
    • Sutskever, I.1    Vinyals, O.2    Le, Q.V.3
  • 37
    • 84901405262
    • Joint video and text parsing for understanding events and answering queries
    • K. Tu, M. Meng, M. W. Lee, T. E. Choe, and S.-C. Zhu. Joint video and text parsing for understanding events and answering queries. MultiMedia, IEEE, 21(2):42-70, 2014.
    • (2014) MultiMedia, IEEE , vol.21 , Issue.2 , pp. 42-70
    • Tu, K.1    Meng, M.2    Lee, M.W.3    Choe, T.E.4    Zhu, S.-C.5
  • 38
    • 0002988210
    • Computing machinery and intelligence
    • A. M. Turing. Computing machinery and intelligence. Mind, pages 433-460, 1950.
    • (1950) Mind , pp. 433-460
    • Turing, A.M.1
  • 39
    • 84956980995
    • Cider: Consensus-based image description evaluation
    • R. Vedantam, C. L. Zitnick, and D. Parikh. Cider: Consensus-based image description evaluation. In CVPR, 2015.
    • (2015) CVPR
    • Vedantam, R.1    Zitnick, C.L.2    Parikh, D.3
  • 40
    • 84946747440
    • Show and tell: A neural image caption generator
    • O. Vinyals, A. Toshev, S. Bengio, and D. Erhan. Show and tell: A neural image caption generator. In CVPR, 2015.
    • (2015) CVPR
    • Vinyals, O.1    Toshev, A.2    Bengio, S.3    Erhan, D.4
  • 41
    • 85146676791
    • Verbs semantics and lexical selection
    • Z. Wu and M. Palmer. Verbs semantics and lexical selection. In ACL, pages 133-138, 1994.
    • (1994) ACL , pp. 133-138
    • Wu, Z.1    Palmer, M.2
  • 43
    • 84906494296
    • From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions
    • P. Young, A. Lai, M. Hodosh, and J. Hockenmaier. From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions. In ACL, pages 479-488, 2014.
    • (2014) ACL , pp. 479-488
    • Young, P.1    Lai, A.2    Hodosh, M.3    Hockenmaier, J.4
  • 44
    • 84937851238
    • Learning from weakly supervised data by the expectation loss SVM (e-SVM) algorithm
    • J. Zhu, J. Mao, and A. L. Yuille. Learning from weakly supervised data by the expectation loss svm (e-svm) algorithm. In NIPS, pages 1125-1133, 2014.
    • (2014) NIPS , pp. 1125-1133
    • Zhu, J.1    Mao, J.2    Yuille, A.L.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.