



Volume , Issue , 2017, Pages 936-945

Guided open vocabulary image captioning with constrained beam search

Author keywords

[No Author keywords available]

Indexed keywords

LEARNING ALGORITHMS; NATURAL LANGUAGE PROCESSING SYSTEMS;

EID: 85048487879     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.18653/v1/d17-1098     Document Type: Conference Paper
Times cited : (235)

References (32)
  • 2
    • 85021678581
    • SPICE: Semantic propositional image caption evaluation
    • Peter Anderson, Basura Fernando, Mark Johnson, and Stephen Gould. 2016. SPICE: Semantic propositional image caption evaluation. In ECCV.
    • (2016) ECCV
    • Anderson, P.1    Fernando, B.2    Johnson, M.3    Gould, S.4
  • 8
    • 84943812736
    • Describing images using inferred visual dependency representations
    • Desmond Elliott and Arjen P. de Vries. 2015. Describing images using inferred visual dependency representations. In ACL.
    • (2015) ACL
    • Elliott, D.1    de Vries, A.P.2
  • 12
    • 84986274465
    • Deep residual learning for image recognition
    • Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In CVPR.
    • (2016) CVPR
    • He, K.1    Zhang, X.2    Ren, S.3    Sun, J.4
  • 13
    • 84986274522
    • Deep compositional captioning: Describing novel object categories without paired training data
    • Lisa Anne Hendricks, Subhashini Venugopalan, Marcus Rohrbach, Raymond Mooney, Kate Saenko, and Trevor Darrell. 2016. Deep compositional captioning: Describing novel object categories without paired training data. In CVPR.
    • (2016) CVPR
    • Hendricks, L.A.1    Venugopalan, S.2    Rohrbach, M.3    Mooney, R.4    Saenko, K.5    Darrell, T.6
  • 16
    • 84946734827
    • Deep visual-semantic alignments for generating image descriptions
    • Andrej Karpathy and Li Fei-Fei. 2015. Deep visual-semantic alignments for generating image descriptions. In CVPR.
    • (2015) CVPR
    • Karpathy, A.1    Fei-Fei, L.2
  • 17
    • 49449108990
    • Statistical Machine Translation
    • Philipp Koehn. 2010. Statistical Machine Translation. Cambridge University Press, New York, NY, USA, 1st edition.
    • (2010) Statistical Machine Translation
    • Koehn, P.1
  • 20
    • 85083950512
    • Deep captioning with multimodal recurrent neural networks (m-RNN)
    • Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, and Alan L. Yuille. 2015. Deep captioning with multimodal recurrent neural networks (m-RNN). In ICLR.
    • (2015) ICLR
    • Mao, J.1    Xu, W.2    Yang, Y.3    Wang, J.4    Yuille, A.L.5
  • 21
    • 84961289992
    • GloVe: Global vectors for word representation
    • Jeffrey Pennington, Richard Socher, and Christopher D. Manning. 2014. GloVe: Global vectors for word representation. In EMNLP.
    • (2014) EMNLP
    • Pennington, J.1    Socher, R.2    Manning, C.D.3
  • 22
    • 84960980241
    • Faster R-CNN: Towards real-time object detection with region proposal networks
    • Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2015. Faster R-CNN: Towards real-time object detection with region proposal networks. In NIPS.
    • (2015) NIPS
    • Ren, S.1    He, K.2    Girshick, R.3    Sun, J.4
  • 24
    • 85083953063
    • Very deep convolutional networks for large-scale image recognition
    • Karen Simonyan and Andrew Zisserman. 2015. Very deep convolutional networks for large-scale image recognition. In ICLR.
    • (2015) ICLR
    • Simonyan, K.1    Zisserman, A.2
  • 27
    • 84956980995
    • CIDEr: Consensus-based image description evaluation
    • Ramakrishna Vedantam, C. Lawrence Zitnick, and Devi Parikh. 2015. CIDEr: Consensus-based image description evaluation. In CVPR.
    • (2015) CVPR
    • Vedantam, R.1    Zitnick, C.L.2    Parikh, D.3
  • 29
    • 84946747440
    • Show and tell: A neural image caption generator
    • Oriol Vinyals, Alexander Toshev, Samy Bengio, and Dumitru Erhan. 2015. Show and tell: A neural image caption generator. In CVPR.
    • (2015) CVPR
    • Vinyals, O.1    Toshev, A.2    Bengio, S.3    Erhan, D.4
  • 30
    • 84986301177
    • What value do explicit high level concepts have in vision to language problems?
    • Q. Wu, C. Shen, L. Liu, A. Dick, and A. van den Hengel. 2016. What Value Do Explicit High Level Concepts Have in Vision to Language Problems? In CVPR.
    • (2016) CVPR
    • Wu, Q.1    Shen, C.2    Liu, L.3    Dick, A.4    van den Hengel, A.5
  • 31
    • 84906494296
    • From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions
    • Peter Young, Alice Lai, Micah Hodosh, and Julia Hockenmaier. 2014. From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions. TACL.
    • (2014) TACL
    • Young, P.1    Lai, A.2    Hodosh, M.3    Hockenmaier, J.4
  • 32
    • 84986272569
    • Fast zero-shot image tagging
    • Yang Zhang, Boqing Gong, and Mubarak Shah. 2016. Fast zero-shot image tagging. In CVPR.
    • (2016) CVPR
    • Zhang, Y.1    Gong, B.2    Shah, M.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.