메뉴 건너뛰기




Volumn 2015 International Conference on Computer Vision, ICCV 2015, Issue , 2015, Pages 2407-2415

Guiding the long-short term memory model for image caption generation

Author keywords

[No Author keywords available]

Indexed keywords

BRAIN; SEMANTICS;

EID: 84973917813     PISSN: 15505499     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICCV.2015.277     Document Type: Conference Paper
Times cited : (417)

References (40)
  • 1
    • 85083953689 scopus 로고    scopus 로고
    • Neural machine translation by jointly learning to align and translate
    • 1, 2, 3, 4
    • D. Bahdanau, K. Cho, and Y. Bengio. Neural machine translation by jointly learning to align and translate. In ICLR, 2015. 1, 2, 3, 4
    • (2015) ICLR
    • Bahdanau, D.1    Cho, K.2    Bengio, Y.3
  • 3
    • 84957029470 scopus 로고    scopus 로고
    • Mind's eye: A recurrent visual representation for image caption generation
    • 6
    • X. Chen and C. L. Zitnick. Mind's eye: A recurrent visual representation for image caption generation. In CVPR, 2015. 6
    • (2015) CVPR
    • Chen, X.1    Zitnick, C.L.2
  • 10
    • 84894905366 scopus 로고    scopus 로고
    • A multi-view embedding space for modeling internet images, tags, and their semantics
    • 4, 6
    • Y. Gong, Q. Ke, M. Isard, and S. Lazebnik. A multi-view embedding space for modeling internet images, tags, and their semantics. IJCV, 106 (2): 210-233, 2014. 4, 6
    • (2014) IJCV , vol.106 , Issue.2 , pp. 210-233
    • Gong, Y.1    Ke, Q.2    Isard, M.3    Lazebnik, S.4
  • 14
    • 0031573117 scopus 로고    scopus 로고
    • Long short-term memory
    • 2, 3
    • S. Hochreiter and J. Schmidhuber. Long short-term memory. Neural Comput., 9 (8): 1735-1780, 1997. 2, 3
    • (1997) Neural Comput. , vol.9 , Issue.8 , pp. 1735-1780
    • Hochreiter, S.1    Schmidhuber, J.2
  • 15
    • 84883394520 scopus 로고    scopus 로고
    • Framing image description as a ranking task: Data, models and evaluation metrics
    • 6
    • M. Hodosh, P. Young, and J. Hockenmaier. Framing image description as a ranking task: Data, models and evaluation metrics. JAIR, 47: 853-899, 2013. 6
    • (2013) JAIR , vol.47 , pp. 853-899
    • Hodosh, M.1    Young, P.2    Hockenmaier, J.3
  • 16
    • 0000107975 scopus 로고
    • Relations between two sets of variates
    • 4
    • H. Hotelling. Relations between two sets of variates. Biometrika, pages 321-377, 1936. 4
    • (1936) Biometrika , pp. 321-377
    • Hotelling, H.1
  • 17
    • 84946734827 scopus 로고    scopus 로고
    • Deep visual-semantic alignments for generating image descriptions
    • 1, 2, 3, 6, 8
    • A. Karpathy and L. Fei-Fei. Deep visual-semantic alignments for generating image descriptions. In CVPR, 2015. 1, 2, 3, 6, 8
    • (2015) CVPR
    • Karpathy, A.1    Fei-Fei, L.2
  • 18
    • 84937843643 scopus 로고    scopus 로고
    • Deep fragment embeddings for bidirectional image sentence mapping
    • 6
    • A. Karpathy, A. Joulin, and F. Li. Deep fragment embeddings for bidirectional image sentence mapping. In NIPS, 2014. 6
    • (2014) NIPS
    • Karpathy, A.1    Joulin, A.2    Li, F.3
  • 19
  • 20
    • 84887601544 scopus 로고    scopus 로고
    • Babytalk: Understanding and generating simple image descriptions
    • 1, 2
    • G. Kulkarni, V. Premraj, V. Ordonez, S. Dhar, S. Li, Y. Choi, A. C. Berg, and T. L. Berg. Babytalk: Understanding and generating simple image descriptions. TPAMI, 35 (12): 2891-2903, 2013. 1, 2
    • (2013) TPAMI , vol.35 , Issue.12 , pp. 2891-2903
    • Kulkarni, G.1    Premraj, V.2    Ordonez, V.3    Dhar, S.4    Li, S.5    Choi, Y.6    Berg, A.C.7    Berg, T.L.8
  • 22
    • 84907331257 scopus 로고    scopus 로고
    • Generalizing image captions for image-text parallel corpus
    • 1, 2
    • P. Kuznetsova, V. Ordonez, A. C. Berg, T. L. Berg, and Y. Choi. Generalizing image captions for image-text parallel corpus. In ACL, 2013. 1, 2
    • (2013) ACL
    • Kuznetsova, P.1    Ordonez, V.2    Berg, A.C.3    Berg, T.L.4    Choi, Y.5
  • 23
    • 84934873221 scopus 로고    scopus 로고
    • Treetalk: Composition and compression of trees for image descriptions
    • 1, 2
    • P. Kuznetsova, V. Ordonez, T. Berg, and Y. Choi. Treetalk: Composition and compression of trees for image descriptions. TACL, 2: 351-362, 2014. 1, 2
    • (2014) TACL , vol.2 , pp. 351-362
    • Kuznetsova, P.1    Ordonez, V.2    Berg, T.3    Choi, Y.4
  • 24
    • 52149112996 scopus 로고    scopus 로고
    • Meteor: An automatic metric for mt evaluation with high levels of correlation with human judgments
    • 6
    • A. Lavie and A. Agarwal. Meteor: An automatic metric for mt evaluation with high levels of correlation with human judgments. In Second Workshop on Statistical Machine Translation, 2007. 6
    • (2007) Second Workshop on Statistical Machine Translation
    • Lavie, A.1    Agarwal, A.2
  • 26
    • 85083950512 scopus 로고    scopus 로고
    • Deep captioning with multimodal recurrent neural networks (mrnn)
    • 1, 2, 3, 8
    • J. Mao, W. Xu, Y. Yang, J. Wang, and A. L. Yuille. Deep captioning with multimodal recurrent neural networks (mrnn). In ICLR, 2015. 1, 2, 3, 8
    • (2015) ICLR
    • Mao, J.1    Xu, W.2    Yang, Y.3    Wang, J.4    Yuille, A.L.5
  • 27
    • 84906925144 scopus 로고    scopus 로고
    • Nonparametric method for datadriven image captioning
    • 1, 2
    • R. Mason and E. Charniak. Nonparametric method for datadriven image captioning. In ACL, 2014. 1, 2
    • (2014) ACL
    • Mason, R.1    Charniak, E.2
  • 29
    • 85133336275 scopus 로고    scopus 로고
    • Bleu: A method for automatic evaluation of machine translation
    • 6
    • K. Papineni, S. Roukos, T. Ward, and W. Zhu. Bleu: A method for automatic evaluation of machine translation. In ACL, 2002. 6
    • (2002) ACL
    • Papineni, K.1    Roukos, S.2    Ward, T.3    Zhu, W.4
  • 30
    • 85083953063 scopus 로고    scopus 로고
    • Very deep convolutional networks for large-scale image recognition
    • 6, 8
    • K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. In ICLR, 2015. 6, 8
    • (2015) ICLR
    • Simonyan, K.1    Zisserman, A.2
  • 31
    • 80053459857 scopus 로고    scopus 로고
    • Generating text with recurrent neural networks
    • 2
    • I. Sutskever, J. Martens, and G. Hinton. Generating text with recurrent neural networks. In ICML, 2011. 2
    • (2011) ICML
    • Sutskever, I.1    Martens, J.2    Hinton, G.3
  • 32
    • 84928547704 scopus 로고    scopus 로고
    • Sequence to sequence learning with neural networks
    • 1, 2, 3, 5
    • I. Sutskever, O. Vinyals, and Q. V. Le. Sequence to sequence learning with neural networks. In NIPS, 2014. 1, 2, 3, 5
    • (2014) NIPS
    • Sutskever, I.1    Vinyals, O.2    Le, Q.V.3
  • 36
    • 84956980995 scopus 로고    scopus 로고
    • Cider: Consensus-based image description evaluation
    • 6
    • R. Vedantam, C. L. Zitnick, and D. Parikh. Cider: Consensus-based image description evaluation. In CVPR, 2015. 6
    • (2015) CVPR
    • Vedantam, R.1    Zitnick, C.L.2    Parikh, D.3
  • 37
    • 84946747440 scopus 로고    scopus 로고
    • Show and tell: A neural image caption generator
    • 1, 2, 3, 4, 6, 8
    • O. Vinyals, A. Toshev, S. Bengio, and D. Erhan. Show and tell: A neural image caption generator. In CVPR, 2015. 1, 2, 3, 4, 6, 8
    • (2015) CVPR
    • Vinyals, O.1    Toshev, A.2    Bengio, S.3    Erhan, D.4
  • 39
    • 80053258778 scopus 로고    scopus 로고
    • Corpusguided sentence generation of natural images
    • 1, 2
    • Y. Yang, C. L. Teo, H. D. III, and Y. Aloimonos. Corpusguided sentence generation of natural images. In EMNLP, 2011. 1, 2
    • (2011) EMNLP
    • Yang, Y.1    Teo, C.L.2    Aloimonos, Y.3
  • 40
    • 84906494296 scopus 로고    scopus 로고
    • From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions
    • 6
    • P. Young, A. Lai, M. Hodosh, and J. Hockenmaier. From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions. TACL, 2: 67-78, 2014. 6
    • (2014) TACL , vol.2 , pp. 67-78
    • Young, P.1    Lai, A.2    Hodosh, M.3    Hockenmaier, J.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.