메뉴 건너뛰기




Volumn 2017-October, Issue , 2017, Pages 1251-1259

Areas of Attention for Image Captioning

Author keywords

[No Author keywords available]

Indexed keywords

OBJECT DETECTION;

EID: 85041899820     PISSN: 15505499     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICCV.2017.140     Document Type: Conference Paper
Times cited : (214)

References (39)
  • 1
    • 85083953689 scopus 로고    scopus 로고
    • Neural machine translation by jointly learning to align and translate
    • D. Bahdanau, K. Cho, and Y. Bengio. Neural machine translation by jointly learning to align and translate. In ICLR, 2015
    • (2015) ICLR
    • Bahdanau, D.1    Cho, K.2    Bengio, Y.3
  • 2
    • 84965179228 scopus 로고    scopus 로고
    • Scheduled sampling for sequence prediction with recurrent neural networks
    • S. Bengio, O. Vinyals, N. Jaitly, and N. Shazeer. Scheduled sampling for sequence prediction with recurrent neural networks. In NIPS, 2015
    • (2015) NIPS
    • Bengio, S.1    Vinyals, O.2    Jaitly, N.3    Shazeer, N.4
  • 3
    • 84986269551 scopus 로고    scopus 로고
    • Weakly supervised deep detection networks
    • H. Bilen and A. Vedaldi. Weakly supervised deep detection networks. In CVPR, 2016
    • (2016) CVPR
    • Bilen, H.1    Vedaldi, A.2
  • 5
  • 6
    • 84911376072 scopus 로고    scopus 로고
    • Multi-fold MIL training for weakly supervised object localization
    • R. Cinbis, J. Verbeek, and C. Schmid. Multi-fold MIL training for weakly supervised object localization. In CVPR, 2014
    • (2014) CVPR
    • Cinbis, R.1    Verbeek, J.2    Schmid, C.3
  • 10
    • 85029359197 scopus 로고    scopus 로고
    • Fast r-cnn
    • R. Girshick. Fast R-CNN. In ICCV, 2015
    • (2015) ICCV
    • Girshick, R.1
  • 11
  • 12
    • 84928278589 scopus 로고    scopus 로고
    • Spatial pyramid pooling in deep convolutional networks for visual recognition
    • K. He, X. Zhang, S. Ren, and J. Sun. Spatial pyramid pooling in deep convolutional networks for visual recognition. In ECCV, 2014
    • (2014) ECCV
    • He, K.1    Zhang, X.2    Ren, S.3    Sun, J.4
  • 16
    • 84986245786 scopus 로고    scopus 로고
    • DenseCap: Fully convolutional localization networks for dense captioning
    • J. Johnson, A. Karpathy, and L. Fei-Fei. DenseCap: Fully convolutional localization networks for dense captioning. In CVPR, 2016
    • (2016) CVPR
    • Johnson, J.1    Karpathy, A.2    Fei-Fei, L.3
  • 17
    • 84946734827 scopus 로고    scopus 로고
    • Deep visual-semantic alignments for generating image descriptions
    • A. Karpathy and L. Fei-Fei. Deep visual-semantic alignments for generating image descriptions. In CVPR, 2015
    • (2015) CVPR
    • Karpathy, A.1    Fei-Fei, L.2
  • 18
    • 85083951076 scopus 로고    scopus 로고
    • Adam: A method for stochastic optimization
    • D. Kingma and J. Ba. Adam: A method for stochastic optimization. In ICLR, 2015
    • (2015) ICLR
    • Kingma, D.1    Ba, J.2
  • 21
    • 85030472316 scopus 로고    scopus 로고
    • Attention correctness in neural image captioning
    • C. Liu, J. Mao, F. Sha, and A. Yuille. Attention correctness in neural image captioning. In AAAI, 2017
    • (2017) AAAI
    • Liu, C.1    Mao, J.2    Sha, F.3    Yuille, A.4
  • 23
    • 85083950512 scopus 로고    scopus 로고
    • Deep captioning with multimodal recurrent neural networks (m-RNN)
    • J. Mao, W. Xu, Y. Yang, J. Wang, Z. Huang, and A. Yuille. Deep captioning with multimodal recurrent neural networks (m-RNN). ICLR, 2015
    • (2015) ICLR
    • Mao, J.1    Xu, W.2    Yang, Y.3    Wang, J.4    Huang, Z.5    Yuille, A.6
  • 24
    • 84973856017 scopus 로고    scopus 로고
    • Flickr30k entities: Collecting region-to-phrase correspondences for richer image-tosentence models
    • B. Plummer, L. Wang, C. Cervantes, J. Caicedo, J. Hockenmaier, and S. Lazebnik. Flickr30k entities: Collecting region-to-phrase correspondences for richer image-tosentence models. In ICCV, 2015
    • (2015) ICCV
    • Plummer, B.1    Wang, L.2    Cervantes, C.3    Caicedo, J.4    Hockenmaier, J.5    Lazebnik, S.6
  • 25
    • 85083951479 scopus 로고    scopus 로고
    • Sequence level training with recurrent neural networks
    • M. Ranzato, S. Chopra, M. Auli, andW. Zaremba. Sequence level training with recurrent neural networks. In ICLR, 2016
    • (2016) ICLR
    • Ranzato, M.1    Chopra, S.2    Auli, M.3    Zaremba, W.4
  • 26
    • 84960980241 scopus 로고    scopus 로고
    • Faster R-CNN: Towards real-time object detection with region proposal networks
    • S. Ren, K. He, R. Girshick, and J. Sun. Faster R-CNN: Towards real-time object detection with region proposal networks. In NIPS, 2015
    • (2015) NIPS
    • Ren, S.1    He, K.2    Girshick, R.3    Sun, J.4
  • 28
    • 84885881090 scopus 로고    scopus 로고
    • Objectcentric spatial pooling for image classification
    • O. Russakovsky, Y. Lin, K. Yu, and L. Fei-Fei. Objectcentric spatial pooling for image classification. In ECCV, 2012
    • (2012) ECCV
    • Russakovsky, O.1    Lin, Y.2    Yu, K.3    Fei-Fei, L.4
  • 29
    • 85083953063 scopus 로고    scopus 로고
    • Very deep convolutional networks for large-scale image recognition
    • K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. In ICLR, 2015
    • (2015) ICLR
    • Simonyan, K.1    Zisserman, A.2
  • 30
    • 84928547704 scopus 로고    scopus 로고
    • Sequence to sequence learning with neural networks
    • I. Sutskever, O. Vinyals, and Q. Le. Sequence to sequence learning with neural networks. In NIPS, 2014
    • (2014) NIPS
    • Sutskever, I.1    Vinyals, O.2    Le, Q.3
  • 31
    • 84881160857 scopus 로고    scopus 로고
    • Selective search for object recognition
    • J. Uijlings, K. van de Sande, T. Gevers, and A. Smeulders. Selective search for object recognition. IJCV, 104(2):154-171, 2013
    • (2013) IJCV , vol.104 , Issue.2 , pp. 154-171
    • Uijlings, J.1    Van de Sande, K.2    Gevers, T.3    Smeulders, A.4
  • 32
    • 84946747440 scopus 로고    scopus 로고
    • Show and tell: A neural image caption generator
    • O. Vinyals, A. Toshev, S. Bengio, and D. Erhan. Show and tell: A neural image caption generator. In CVPR, 2015
    • (2015) CVPR
    • Vinyals, O.1    Toshev, A.2    Bengio, S.3    Erhan, D.4
  • 33
    • 84986301177 scopus 로고    scopus 로고
    • What value do explicit high level concepts have in vision to language problems
    • Q. Wu, C. Shen, L. Liu, A. Dick, and A. van den Hengel. What value do explicit high level concepts have in vision to language problems In CVPR, 2016
    • (2016) CVPR
    • Wu, Q.1    Shen, C.2    Liu, L.3    Dick, A.4    Van den Hengel, A.5
  • 35
    • 85030211479 scopus 로고    scopus 로고
    • Encode, review, and decode: Reviewer module for caption generation
    • Z. Yang, Y. Yuan, Y. Wu, R. Salakhutdinov, and W. Cohen. Encode, review, and decode: Reviewer module for caption generation. In NIPS, 2016
    • (2016) NIPS
    • Yang, Z.1    Yuan, Y.2    Wu, Y.3    Salakhutdinov, R.4    Cohen, W.5
  • 38
    • 84986317307 scopus 로고    scopus 로고
    • Image captioning with semantic attention
    • Q. You, H. Jin, Z. Wang, C. Fang, and J. Luo. Image captioning with semantic attention. In CVPR, 2016
    • (2016) CVPR
    • You, Q.1    Jin, H.2    Wang, Z.3    Fang, C.4    Luo, J.5
  • 39
    • 84952018709 scopus 로고    scopus 로고
    • Edge boxes: Locating object proposals from edges
    • C. Zitnick and P. Dollár. Edge boxes: locating object proposals from edges. In ECCV, 2014.
    • (2014) ECCV
    • Zitnick, C.1    Dollár, P.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.