Volume , Issue , 2016, Pages 1-8

Exploiting scene context for image captioning

Author keywords

[No Author keywords available]

Indexed keywords

BENCHMARKING; COMPUTATIONAL LINGUISTICS; NEURAL NETWORKS;

EID: 84995460741     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2983563.2983571     Document Type: Conference Paper
Times cited: (9)

References (47)
  • 6
    • 85107661995
    • Meteor universal: Language specific translation evaluation for any target language
    • M. Denkowski and A. Lavie. Meteor universal: Language specific translation evaluation for any target language. In EACL, 2014.
    • (2014) EACL
    • Denkowski, M.1    Lavie, A.2
  • 9
    • 84943812736
    • Describing images using inferred visual dependency representations
    • D. Elliott and A. P. de Vries. Describing images using inferred visual dependency representations. In ACL, 2015.
    • (2015) ACL
    • Elliott, D.1    De Vries, A.P.2
  • 10
    • 84906929591
    • Image description using visual dependency representations
    • D. Elliott and F. Keller. Image description using visual dependency representations. In EMNLP, 2013.
    • (2013) EMNLP
    • Elliott, D.1    Keller, F.2
  • 15
    • 84986274465
    • Deep residual learning for image recognition
    • K. He, X. Zhang, S. Ren, and J. Sun. Deep residual learning for image recognition. In CVPR, 2016.
    • (2016) CVPR
    • He, K.1    Zhang, X.2    Ren, S.3    Sun, J.4
  • 17
    • 84883394520
    • Framing image description as a ranking task: Data, models and evaluation metrics
    • M. Hodosh, P. Young, and J. Hockenmaier. Framing image description as a ranking task: Data, models and evaluation metrics. JAIR, 2013.
    • (2013) JAIR
    • Hodosh, M.1    Young, P.2    Hockenmaier, J.3
  • 19
    • 84946734827
    • Deep visual-semantic alignments for generating image descriptions
    • A. Karpathy and L. Fei-Fei. Deep visual-semantic alignments for generating image descriptions. In CVPR, 2015.
    • (2015) CVPR
    • Karpathy, A.1    Fei-Fei, L.2
  • 20
    • 84937843643
    • Deep fragment embeddings for bidirectional image sentence mapping
    • A. Karpathy, A. Joulin, and L. Fei-Fei. Deep fragment embeddings for bidirectional image sentence mapping. In NIPS, 2014.
    • (2014) NIPS
    • Karpathy, A.1    Joulin, A.2    Fei-Fei, L.3
  • 21
    • 84961376850
    • Convolutional neural networks for sentence classification
    • Y. Kim. Convolutional neural networks for sentence classification. In EMNLP, 2014.
    • (2014) EMNLP
    • Kim, Y.1
  • 24
    • 84913582676
    • Convolutional network features for scene recognition
    • M. Koskela and J. Laaksonen. Convolutional network features for scene recognition. In ACMMM, 2014.
    • (2014) ACMMM
    • Koskela, M.1    Laaksonen, J.2
  • 25
    • 84862279067
    • Composing simple image descriptions using web-scale n-grams
    • S. Li, G. Kulkarni, T. L. Berg, A. C. Berg, and Y. Choi. Composing simple image descriptions using web-scale n-grams. In CoNLL, 2011.
    • (2011) CoNLL
    • Li, S.1    Kulkarni, G.2    Berg, T.L.3    Berg, A.C.4    Choi, Y.5
  • 28
    • 84898956512
    • Distributed representations of words and phrases and their compositionality
    • T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean. Distributed representations of words and phrases and their compositionality. In NIPS, 2013.
    • (2013) NIPS
    • Mikolov, T.1    Sutskever, I.2    Chen, K.3    Corrado, G.S.4    Dean, J.5
  • 29
    • 85162522202
    • Im2text: Describing images using 1 million captioned photographs
    • V. Ordonez, G. Kulkarni, and T. L. Berg. Im2text: Describing images using 1 million captioned photographs. In NIPS, 2011.
    • (2011) NIPS
    • Ordonez, V.1    Kulkarni, G.2    Berg, T.L.3
  • 30
    • 85133336275
    • Bleu: A method for automatic evaluation of machine translation
    • K. Papineni, S. Roukos, T. Ward, and W.-J. Zhu. Bleu: A method for automatic evaluation of machine translation. In ACL, 2002.
    • (2002) ACL
    • Papineni, K.1    Roukos, S.2    Ward, T.3    Zhu, W.-J.4
  • 31
    • 84961289992
    • Glove: Global vectors for word representation
    • J. Pennington, R. Socher, and C. D. Manning. Glove: Global vectors for word representation. In EMNLP, 2014.
    • (2014) EMNLP
    • Pennington, J.1    Socher, R.2    Manning, C.D.3
  • 35
    • 84977650097
    • Video captioning with recurrent networks based on frame- and video-level features and visual content classification
    • abs/1512.02949
    • R. Shetty and J. Laaksonen. Video captioning with recurrent networks based on frame- and video-level features and visual content classification. ICCV Workshop on LSMDC, abs/1512.02949, 2015.
    • (2015) ICCV Workshop on LSMDC
    • Shetty, R.1    Laaksonen, J.2
  • 36
    • 84994666053
    • Frame- and segment-level features and candidate pool evaluation for video caption generation
    • R. Shetty and J. Laaksonen. Frame- and segment-level features and candidate pool evaluation for video caption generation. In ACMMM Multimedia Grand Challenge Solutions, 2016.
    • (2016) ACMMM Multimedia Grand Challenge Solutions
    • Shetty, R.1    Laaksonen, J.2
  • 38
  • 39
    • 84956980995
    • CIDEr: Consensus-based image description evaluation
    • R. Vedantam, C. L. Zitnick, and D. Parikh. CIDEr: Consensus-based image description evaluation. In CVPR, 2015.
    • (2015) CVPR
    • Vedantam, R.1    Zitnick, C.L.2    Parikh, D.3
  • 40
    • 84946747440
    • Show and tell: A neural image caption generator
    • O. Vinyals, A. Toshev, S. Bengio, and D. Erhan. Show and tell: A neural image caption generator. In CVPR, 2015.
    • (2015) CVPR
    • Vinyals, O.1    Toshev, A.2    Bengio, S.3    Erhan, D.4
  • 41
    • 84924067462
    • Sun database: Exploring a large collection of scene categories
    • J. Xiao, K. A. Ehinger, J. Hays, A. Torralba, and A. Oliva. Sun database: Exploring a large collection of scene categories. IJCV, 2014.
    • (2014) IJCV
    • Xiao, J.1    Ehinger, K.A.2    Hays, J.3    Torralba, A.4    Oliva, A.5
  • 42
    • 77955988947
    • SUN database: Large-scale scene recognition from abbey to zoo
    • J. Xiao, J. Hays, K. Ehinger, A. Oliva, and A. Torralba. SUN database: Large-scale scene recognition from abbey to zoo. In CVPR, 2010.
    • (2010) CVPR
    • Xiao, J.1    Hays, J.2    Ehinger, K.3    Oliva, A.4    Torralba, A.5
  • 45
    • 84906494296
    • From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions
    • P. Young, A. Lai, M. Hodosh, and J. Hockenmaier. From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions. TACL, 2014.
    • (2014) TACL
    • Young, P.1    Lai, A.2    Hodosh, M.3    Hockenmaier, J.4
  • 47
    • 84937964578
    • Learning deep features for scene recognition using places database
    • B. Zhou, A. Lapedriza, J. Xiao, A. Torralba, and A. Oliva. Learning deep features for scene recognition using places database. In NIPS, 2014.
    • (2014) NIPS
    • Zhou, B.1    Lapedriza, A.2    Xiao, J.3    Torralba, A.4    Oliva, A.5


* This information was extracted from Elsevier's SCOPUS database through analysis by KISTI.