메뉴 건너뛰기




Volumn 2017-January, Issue , 2017, Pages 1170-1178

Captioning images with diverse objects

Author keywords

[No Author keywords available]

Indexed keywords

CHARACTER RECOGNITION; COMPUTER VISION; OBJECT RECOGNITION; SEMANTICS;

EID: 85044269789     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/CVPR.2017.130     Document Type: Conference Paper
Times cited : (154)

References (27)
  • 6
    • 84986274465 scopus 로고    scopus 로고
    • Deep residual learning for image recognition
    • K. He, X. Zhang, S. Ren, and J. Sun. Deep residual learning for image recognition. In CVPR, 2016.
    • (2016) CVPR
    • He, K.1    Zhang, X.2    Ren, S.3    Sun, J.4
  • 7
    • 84986274522 scopus 로고    scopus 로고
    • Deep compositional captioning: Describing novel object categories without paired training data
    • L. A. Hendricks, S. Venugopalan, M. Rohrbach, R. Mooney, K. Saenko, and T. Darrell. Deep compositional captioning: Describing novel object categories without paired training data. In CVPR, 2016.
    • (2016) CVPR
    • Hendricks, L.A.1    Venugopalan, S.2    Rohrbach, M.3    Mooney, R.4    Saenko, K.5    Darrell, T.6
  • 8
    • 84946734827 scopus 로고    scopus 로고
    • Deep visual-semantic alignments for generating image descriptions
    • A. Karpathy and L. Fei-Fei. Deep visual-semantic alignments for generating image descriptions. In CVPR, 2015.
    • (2015) CVPR
    • Karpathy, A.1    Fei-Fei, L.2
  • 10
    • 84952349298 scopus 로고    scopus 로고
    • Unifying visual-semantic embeddings with multimodal neural language models
    • R. Kiros, R. Salakhutdinov, and R. S. Zemel. Unifying visual-semantic embeddings with multimodal neural language models. TACL, 2015.
    • (2015) TACL
    • Kiros, R.1    Salakhutdinov, R.2    Zemel, R.S.3
  • 11
    • 84934873221 scopus 로고    scopus 로고
    • Treetalk: Composition and compression of trees for image descriptions
    • P. Kuznetsova, V. Ordonez, T. L. Berg, U. C. Hill, and Y. Choi. Treetalk: Composition and compression of trees for image descriptions. In TACL, 2014.
    • (2014) TACL
    • Kuznetsova, P.1    Ordonez, V.2    Berg, T.L.3    Hill, U.C.4    Choi, Y.5
  • 12
    • 84906927509 scopus 로고    scopus 로고
    • Is this a wampimuk? Cross-modal mapping between distributional semantics and the visual world
    • A. Lazaridou, E. Bruni, and M. Baroni. Is this a wampimuk? cross-modal mapping between distributional semantics and the visual world. In ACL, 2014.
    • (2014) ACL
    • Lazaridou, A.1    Bruni, E.2    Baroni, M.3
  • 15
    • 85083950512 scopus 로고    scopus 로고
    • Deep captioning with multimodal recurrent neural networks (m-rnn)
    • J. Mao, W. Xu, Y. Yang, J. Wang, Z. Huang, and A. Yuille. Deep captioning with multimodal recurrent neural networks (m-rnn). In ICLR, 2015.
    • (2015) ICLR
    • Mao, J.1    Xu, W.2    Yang, Y.3    Wang, J.4    Huang, Z.5    Yuille, A.6
  • 16
    • 84973863256 scopus 로고    scopus 로고
    • Learning like a child: Fast novel visual concept learning from sentence descriptions of images
    • J. Mao, W. Xu, Y. Yang, J. Wang, Z. Huang, and A. L. Yuille. Learning like a child: Fast novel visual concept learning from sentence descriptions of images. In ICCV, 2015.
    • (2015) ICCV
    • Mao, J.1    Xu, W.2    Yang, Y.3    Wang, J.4    Huang, Z.5    Yuille, A.L.6
  • 17
    • 84898956512 scopus 로고    scopus 로고
    • Distributed representations of words and phrases and their compositionality
    • T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean. Distributed representations of words and phrases and their compositionality. In NIPS, 2013.
    • (2013) NIPS
    • Mikolov, T.1    Sutskever, I.2    Chen, K.3    Corrado, G.S.4    Dean, J.5
  • 22
    • 84933585162 scopus 로고    scopus 로고
    • Very deep convolutional networks for large-scale image recognition
    • K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. CoRR, abs/1409.1556, 2014.
    • (2014) CoRR, abs/1409.1556
    • Simonyan, K.1    Zisserman, A.2
  • 23
    • 84906925854 scopus 로고    scopus 로고
    • Grounded compositional semantics for finding and describing images with sentences
    • R. Socher, A. Karpathy, Q. V. Le, C. D. Manning, and A. Y. Ng. Grounded compositional semantics for finding and describing images with sentences. TACL, 2014.
    • (2014) TACL
    • Socher, R.1    Karpathy, A.2    Le, Q.V.3    Manning, C.D.4    Ng, A.Y.5
  • 25
    • 85072843664 scopus 로고    scopus 로고
    • Improving LSTM-based video description with linguistic knowledge mined from text
    • S. Venugopalan, L. A. Hendricks, R. Mooney, and K. Saenko. Improving LSTM-based video description with linguistic knowledge mined from text. In EMNLP, 2016.
    • (2016) EMNLP
    • Venugopalan, S.1    Hendricks, L.A.2    Mooney, R.3    Saenko, K.4
  • 26
    • 84946747440 scopus 로고    scopus 로고
    • Show and tell: A neural image caption generator
    • O. Vinyals, A. Toshev, S. Bengio, and D. Erhan. Show and tell: A neural image caption generator. In CVPR, 2015.
    • (2015) CVPR
    • Vinyals, O.1    Toshev, A.2    Bengio, S.3    Erhan, D.4
  • 27
    • 80053258778 scopus 로고    scopus 로고
    • Corpus-guided sentence generation of natural images
    • Y. Yang, C. L. Teo, H. Daumé III, and Y. Aloimonos. Corpus-guided sentence generation of natural images. In EMNLP, 2011.
    • (2011) EMNLP
    • Yang, Y.1    Teo, C.L.2    Daumé, H.3    Aloimonos, Y.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.