SCOPUS 정보 검색 플랫폼

EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings

Volumn , Issue , 2017, Pages 936-945

Guided open vocabulary image captioning with constrained beam search

(4) Anderson, Peter a Fernando, Basura a Johnson, Mark b Gould, Stephen a

a AUSTRALIAN NATIONAL UNIVERSITY (Australia)

b MACQUARIE UNIVERSITY (Australia)

Author keywords

[No Author keywords available]

Indexed keywords

LEARNING ALGORITHMS; NATURAL LANGUAGE PROCESSING SYSTEMS;

CONSTRAINED BEAMS; GROUND TRUTH; IMAGE CAPTIONING; REAL-WORLD; STATE OF THE ART; TEST TIME; VOCABULARY EXPANSIONS;

IMAGE ENHANCEMENT;

EID: 85048487879 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.18653/v1/d17-1098 Document Type: Conference Paper

Times cited : (235)

References (32)

1
- 84910073639
- Pushdown automata in statistical machine translation
- Cyril Allauzen, Bill Byrne, Adrià de Gispert, Gonzalo Iglesias, and Michael Riley. 2014. Pushdown automata in statistical machine translation. Computational Linguistics 40(3):687–723.
- (2014) Computational Linguistics , vol.40 , Issue.3 , pp. 687-723
- Allauzen, C.¹ Byrne, B.² de Gispert, A.³ Iglesias, G.⁴ Riley, M.⁵

2
- 85021678581
- SPICE: Semantic propositional image caption evaluation
- Peter Anderson, Basura Fernando, Mark Johnson, and Stephen Gould. 2016. SPICE: Semantic propositional image caption evaluation. In ECCV.
- (2016) ECCV
- Anderson, P.¹ Fernando, B.² Johnson, M.³ Gould, S.⁴

3
- 84897530662
- Fast image tagging
- Minmin Chen, Alice X Zheng, and Kilian Q Weinberger. 2013. Fast image tagging. In ICML.
- (2013) ICML
- Chen, M.¹ Zheng, A.X.² Weinberger, K.Q.³

4
- 84952349295
- arXiv preprint
- Xinlei Chen, Tsung-Yi Lin Hao Fang, Ramakrishna Vedantam, Saurabh Gupta, Piotr Dollar, and C. Lawrence Zitnick. 2015. Microsoft COCO captions: Data collection and evaluation server. arXiv preprint arXiv:1504.00325 .
- (2015) Microsoft COCO Captions: Data Collection and Evaluation Server
- Chen, X.¹ Fang, T.-Y.L.H.² Vedantam, R.³ Gupta, S.⁴ Dollar, P.⁵ Zitnick, C.L.⁶

5
- 85107661995
- Meteor universal: Language specific translation evaluation for any target language
- Michael Denkowski and Alon Lavie. 2014. Meteor universal: Language specific translation evaluation for any target language. In Proceedings of the EACL 2014 Workshop on Statistical Machine Translation.
- (2014) Proceedings of the EACL 2014 Workshop on Statistical Machine Translation
- Denkowski, M.¹ Lavie, A.²

6
- 84944096380
- Language models for image captioning: The quirks and what works
- Jacob Devlin, Hao Cheng, Hao Fang, Saurabh Gupta, Li Deng, Xiaodong He, Geoffrey Zweig, and Margaret Mitchell. 2015. Language models for image captioning: The quirks and what works. In ACL.
- (2015) ACL
- Devlin, J.¹ Cheng, H.² Fang, H.³ Gupta, S.⁴ Deng, L.⁵ He, X.⁶ Zweig, G.⁷ Mitchell, M.⁸

7
- 84959236502
- Long-term recurrent convolutional networks for visual recognition and description
- Jeffrey Donahue, Lisa A. Hendricks, Sergio Guadarrama, Marcus Rohrbach, Subhashini Venugopalan, Kate Saenko, and Trevor Darrell. 2015. Long-term recurrent convolutional networks for visual recognition and description. In CVPR.
- (2015) CVPR
- Donahue, J.¹ Hendricks, L.A.² Guadarrama, S.³ Rohrbach, M.⁴ Venugopalan, S.⁵ Saenko, K.⁶ Darrell, T.⁷

8
- 84943812736
- Describing images using inferred visual dependency representations
- Desmond Elliot and Arjen P. de Vries. 2015. Describing images using inferred visual dependency representations. In ACL.
- (2015) ACL
- Elliot, D.¹ de Vries, A.P.²

9
- 84959250180
- From captions to visual concepts and back
- Hao Fang, Saurabh Gupta, Forrest N. Iandola, Rupesh Srivastava, Li Deng, Piotr Dollar, Jianfeng Gao, Xiaodong He, Margaret Mitchell, John C. Platt, C. Lawrence Zitnick, and Geoffrey Zweig. 2015. From captions to visual concepts and back. In CVPR.
- (2015) CVPR
- Fang, H.¹ Gupta, S.² Iandola, F.N.³ Srivastava, R.⁴ Deng, L.⁵ Dollar, P.⁶ Gao, J.⁷ He, X.⁸ Mitchell, M.⁹ Platt, J.C.¹⁰ Zitnick, C.L.¹¹ Zweig, G.¹²

10
- 0004289791
- Bradford Books
- Christiane Fellbaum. 1998. WordNet: An Electronic Lexical Database. Bradford Books.
- (1998) WordNet: An Electronic Lexical Database
- Fellbaum, C.¹

11
- 85054983947
- Generating topical poetry
- Marjan Ghazvininejad, Xing Shi, Yejin Choi, and Kevin Knight. 2016. Generating topical poetry. In EMNLP.
- (2016) EMNLP
- Ghazvininejad, M.¹ Shi, X.² Choi, Y.³ Knight, K.⁴

12
- 84986274465
- Deep residual learning for image recognition
- Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In CVPR.
- (2016) CVPR
- He, K.¹ Zhang, X.² Ren, S.³ Sun, J.⁴

13
- 84986274522
- Deep compositional captioning: Describing novel object categories without paired training data
- Lisa Anne Hendricks, Subhashini Venugopalan, Marcus Rohrbach, Raymond Mooney, Kate Saenko, and Trevor Darrell. 2016. Deep compositional captioning: Describing novel object categories without paired training data. In CVPR.
- (2016) CVPR
- Hendricks, L.A.¹ Venugopalan, S.² Rohrbach, M.³ Mooney, R.⁴ Saenko, K.⁵ Darrell, T.⁶

14
- 0031573117
- Long short-term memory
- Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural Computation 9(8).
- (1997) Neural Computation , vol.9 , Issue.8
- Hochreiter, S.¹ Schmidhuber, J.²

15
- 84913555165
- arXiv preprint
- Yangqing Jia, Evan Shelhamer, Jeff Donahue, Sergey Karayev, Jonathan Long, Ross Girshick, Sergio Guadarrama, and Trevor Darrell. 2014. Caffe: Convolutional architecture for fast feature embedding. arXiv preprint arXiv:1408.5093 .
- (2014) Caffe: Convolutional Architecture for Fast Feature Embedding
- Jia, Y.¹ Shelhamer, E.² Donahue, J.³ Karayev, S.⁴ Long, J.⁵ Girshick, R.⁶ Guadarrama, S.⁷ Darrell, T.⁸

16
- 84946734827
- Deep visual-semantic alignments for generating image descriptions
- Andrej Karpathy and Li Fei-Fei. 2015. Deep visual-semantic alignments for generating image descriptions. In CVPR.
- (2015) CVPR
- Karpathy, A.¹ Fei-Fei, L.²

17
- 49449108990
- Cambridge University Press, New York, NY, USA, 1st edition
- Philipp Koehn. 2010. Statistical Machine Translation. Cambridge University Press, New York, NY, USA, 1st edition.
- (2010) Statistical Machine Translation
- Koehn, P.¹

18
- 85044305404
- The unreasonable effectiveness of noisy data for fine-grained recognition
- Jonathan Krause, Benjamin Sapp, Andrew Howard, Howard Zhou, Alexander Toshev, Tom Duerig, James Philbin, and Li Fei-Fei. 2016. The unreasonable effectiveness of noisy data for fine-grained recognition. In ECCV.
- (2016) ECCV
- Krause, J.¹ Sapp, B.² Howard, A.³ Zhou, H.⁴ Toshev, A.⁵ Duerig, T.⁶ Philbin, J.⁷ Fei-Fei, L.⁸

19
- 84937834115
- Microsoft COCO: Common objects in context
- T.Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollar, and C. L. Zitnick. 2014. Microsoft COCO: Common objects in context. In ECCV.
- (2014) ECCV
- Lin, T.Y.¹ Maire, M.² Belongie, S.³ Hays, J.⁴ Perona, P.⁵ Ramanan, D.⁶ Dollar, P.⁷ Zitnick, C.L.⁸

20
- 85083950512
- Deep captioning with multimodal recurrent neural networks (m-RNN)
- Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, and Alan L. Yuille. 2015. Deep captioning with multimodal recurrent neural networks (m-RNN). In ICLR.
- (2015) ICLR
- Mao, J.¹ Xu, W.² Yang, Y.³ Wang, J.⁴ Yuille, A.L.⁵

21
- 84961289992
- Glove: Global vectors for word representation
- Jeffrey Pennington, Richard Socher, and Christopher D. Manning. 2014. GloVe: Global vectors for word representation. In EMNLP.
- (2014) EMNLP
- Pennington, J.¹ Socher, R.² Manning, C.D.³

22
- 84960980241
- Faster R-CNN: Towards real-time object detection with region proposal networks
- Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2015. Faster R-CNN: Towards real-time object detection with region proposal networks. In NIPS.
- (2015) NIPS
- Ren, S.¹ He, K.² Girshick, R.³ Sun, J.⁴

23
- 84947041871
- Imagenet large scale visual recognition challenge
- Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, Alexander C. Berg, and Li Fei-Fei. 2015. Imagenet large scale visual recognition challenge. International Journal of Computer Vision (IJCV) 115(3):211–252.
- (2015) International Journal of Computer Vision (IJCV) , vol.115 , Issue.3 , pp. 211-252
- Russakovsky, O.¹ Deng, J.² Su, H.³ Krause, J.⁴ Satheesh, S.⁵ Ma, S.⁶ Huang, Z.⁷ Karpathy, A.⁸ Khosla, A.⁹ Bernstein, M.¹⁰ Berg, A.C.¹¹ Fei-Fei, L.¹²

24
- 85083953063
- Very deep convolutional networks for large-scale image recognition
- Karen Simonyan and Andrew Zisserman. 2015. Very deep convolutional networks for large-scale image recognition. In ICLR.
- (2015) ICLR
- Simonyan, K.¹ Zisserman, A.²

25
- 0004276049
- Cengage Learning, 3rd edition
- Michael Sipser. 2012. Introduction to the Theory of Computation. Cengage Learning, 3rd edition.
- (2012) Introduction to the Theory of Computation
- Sipser, M.¹

26
- 85010205139
- Rich image captioning in the wild
- Kenneth Tran, Xiaodong He, Lei Zhang, Jian Sun, Cornelia Carapcea, Chris Thrasher, Chris Buehler, and Chris Sienkiewicz. 2016. Rich image captioning in the wild. In CVPR Workshop.
- (2016) CVPR Workshop
- Tran, K.¹ He, X.² Zhang, L.³ Sun, J.⁴ Carapcea, C.⁵ Thrasher, C.⁶ Buehler, C.⁷ Sienkiewicz, C.⁸

27
- 84956980995
- CiDer: Consensus-based image description evaluation
- Ramakrishna Vedantam, C. Lawrence Zitnick, and Devi Parikh. 2015. CIDEr: Consensus-based image description evaluation. In CVPR.
- (2015) CVPR
- Vedantam, R.¹ Zitnick, C.L.² Parikh, D.³

28
- 85034846838
- arXiv preprint
- Subhashini Venugopalan, Lisa Anne Hendricks, Marcus Rohrbach, Raymond J. Mooney, Trevor Darrell, and Kate Saenko. 2016. Captioning images with diverse objects. arXiv preprint arXiv:1606.07770 .
- (2016) Captioning Images with Diverse Objects
- Venugopalan, S.¹ Hendricks, L.A.² Rohrbach, M.³ Mooney, R.J.⁴ Darrell, T.⁵ Saenko, K.⁶

29
- 84946747440
- Show and tell: A neural image caption generator
- Oriol Vinyals, Alexander Toshev, Samy Bengio, and Dumitru Erhan. 2015. Show and tell: A neural image caption generator. In CVPR.
- (2015) CVPR
- Vinyals, O.¹ Toshev, A.² Bengio, S.³ Erhan, D.⁴

30
- 84986301177
- What value do explicit high level concepts have in vision to language problems?
- Q. Wu, C. Shen, L. Liu, A. Dick, and A. van den Hengel. 2016. What Value Do Explicit High Level Concepts Have in Vision to Language Problems? In CVPR.
- (2016) CVPR
- Wu, Q.¹ Shen, C.² Liu, L.³ Dick, A.⁴ van den Hengel, A.⁵

31
- 84906494296
- From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions
- Peter Young, Alice Lai, Micah Hodosh, and Julia Hockenmaier. 2014. From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions. TACL .
- (2014) TACL
- Young, P.¹ Lai, A.² Hodosh, M.³ Hockenmaier, J.⁴

32
- 84986272569
- Fast zero-shot image tagging
- Yang Zhang, Boqing Gong, and Mubarak Shah. 2016. Fast zero-shot image tagging. In CVPR.
- (2016) CVPR
- Zhang, Y.¹ Gong, B.² Shah, M.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.