메뉴 건너뛰기




Volumn , Issue , 2018, Pages 1029-1037

Decoupled novel object captioner

Author keywords

Image captioning; Novel object; Novel object captioning

Indexed keywords

DETECTION MODELS; IMAGE CAPTIONING; NOVEL OBJECT; NOVEL OBJECT CAPTIONING; OBJECT CATEGORIES; OBJECT DESCRIPTION; SEQUENCE MODELING; VISUAL INFORMATION;

EID: 85058217998     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/3240508.3240640     Document Type: Conference Paper
Times cited : (64)

References (43)
  • 2
    • 85048487879 scopus 로고    scopus 로고
    • Guided open vocabulary image captioning with constrained beam search
    • Peter Anderson, Basura Fernando, Mark Johnson, and Stephen Gould. 2017. Guided open vocabulary image captioning with constrained beam search. In EMNLP.
    • (2017) EMNLP
    • Anderson, P.1    Fernando, B.2    Johnson, M.3    Gould, S.4
  • 4
    • 85116156579 scopus 로고    scopus 로고
    • Meteor: An automatic metric for MT evaluation with improved correlation with human judgments
    • Satanjeev Banerjee and Alon Lavie. 2005. METEOR: An automatic metric for MT evaluation with improved correlation with human judgments. In ACL-W. 65-72.
    • (2005) ACL-W , pp. 65-72
    • Banerjee, S.1    Lavie, A.2
  • 5
    • 84965179228 scopus 로고    scopus 로고
    • Scheduled sampling for sequence prediction with recurrent neural networks
    • Samy Bengio, Oriol Vinyals, Navdeep Jaitly, and Noam Shazeer. 2015. Scheduled sampling for sequence prediction with recurrent neural networks. In NIPS. 1171-1179.
    • (2015) NIPS , pp. 1171-1179
    • Bengio, S.1    Vinyals, O.2    Jaitly, N.3    Shazeer, N.4
  • 7
    • 85058215742 scopus 로고    scopus 로고
    • Fast Parameter Adaptation for Few-shot Image Captioning and Visual Question Answering
    • Xuanyi Dong, Linchao Zhu, De Zhang, Yi Yang, and Fei Wu. 2018. Fast Parameter Adaptation for Few-shot Image Captioning and Visual Question Answering. In ACM on Multimedia.
    • (2018) ACM on Multimedia
    • Dong, X.1    Zhu, L.2    Zhang, D.3    Yang, Y.4    Wu, F.5
  • 9
    • 85046762258 scopus 로고    scopus 로고
    • Model-agnostic meta-learning for fast adaptation of deep networks
    • Chelsea Finn, Pieter Abbeel, and Sergey Levine. 2017. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks. In ICML. 1126-1135.
    • (2017) ICML , pp. 1126-1135
    • Finn, C.1    Abbeel, P.2    Levine, S.3
  • 10
    • 0031573117 scopus 로고    scopus 로고
    • Long short-term memory
    • 1997
    • Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural computation 9, 8 (1997), 1735-1780.
    • (1997) Neural Computation , vol.9 , Issue.8 , pp. 1735-1780
    • Hochreiter, S.1    Schmidhuber, J.2
  • 12
    • 84962449644 scopus 로고    scopus 로고
    • Bridging the ultimate semantic gap: A semantic search engine for internet videos
    • Lu Jiang, Shoou-I Yu, Deyu Meng, Teruko Mitamura, and Alexander G Hauptmann. 2015. Bridging the ultimate semantic gap: A semantic search engine for internet videos. In ICMR. 27-34.
    • (2015) ICMR , pp. 27-34
    • Jiang, L.1    Yu, S.-I.2    Meng, D.3    Mitamura, T.4    Hauptmann, A.G.5
  • 14
    • 84986245786 scopus 로고    scopus 로고
    • DenseCap: Fully convolutional localization networks for dense captioning
    • Justin Johnson, Andrej Karpathy, and Li Fei-Fei. 2016. Densecap: Fully convolutional localization networks for dense captioning. In CVPR. 4565-4574.
    • (2016) CVPR , pp. 4565-4574
    • Johnson, J.1    Karpathy, A.2    Fei-Fei, L.3
  • 15
    • 84946734827 scopus 로고    scopus 로고
    • Deep visual-semantic alignments for generating image descriptions
    • Andrej Karpathy and Li Fei-Fei. 2015. Deep visual-semantic alignments for generating image descriptions. In CVPR. 3128-3137.
    • (2015) CVPR , pp. 3128-3137
    • Karpathy, A.1    Fei-Fei, L.2
  • 16
    • 85083951076 scopus 로고    scopus 로고
    • ADaM: A method for stochastic optimization
    • Diederik P Kingma and Jimmy Ba. 2015. Adam: A method for stochastic optimization. In ICLR.
    • (2015) ICLR
    • Kingma, D.P.1    Ba, J.2
  • 17
    • 84929363334 scopus 로고    scopus 로고
    • Multimodal neural language models
    • Ryan Kiros, Ruslan Salakhutdinov, and Rich Zemel. 2014. Multimodal neural language models. In ICML. 595-603.
    • (2014) ICML , pp. 595-603
    • Kiros, R.1    Salakhutdinov, R.2    Zemel, R.3
  • 21
    • 85058234384 scopus 로고    scopus 로고
    • Neural baby talk
    • Jiasen Lu, Jianwei Yang, Dhruv Batra, and Devi Parikh. 2018. Neural Baby Talk. In CVPR. 7219-7228.
    • (2018) CVPR , pp. 7219-7228
    • Lu, J.1    Yang, J.2    Batra, D.3    Parikh, D.4
  • 22
    • 84973863256 scopus 로고    scopus 로고
    • Learning like a child: Fast novel visual concept learning from sentence descriptions of images
    • Junhua Mao, Xu Wei, Yi Yang, Jiang Wang, Zhiheng Huang, and Alan L Yuille. 2015. Learning like a child: Fast novel visual concept learning from sentence descriptions of images. In ICCV. 2533-2541.
    • (2015) ICCV , pp. 2533-2541
    • Mao, J.1    Wei, X.2    Yang, Y.3    Wang, J.4    Huang, Z.5    Yuille, A.L.6
  • 23
    • 85083950512 scopus 로고    scopus 로고
    • Deep captioning with multimodal recurrent neural networks (m-RNN)
    • 2015
    • Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Zhiheng Huang, and Alan Yuille. 2015. Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN). ICLR (2015).
    • (2015) ICLR
    • Mao, J.1    Xu, W.2    Yang, Y.3    Wang, J.4    Huang, Z.5    Yuille, A.6
  • 26
    • 85162522202 scopus 로고    scopus 로고
    • Im2Text: Describing images using 1 million captioned photographs
    • Vicente Ordonez, Girish Kulkarni, and Tamara L Berg. 2011. Im2text: Describing images using 1 million captioned photographs. In NIPS. 1143-1151.
    • (2011) NIPS , pp. 1143-1151
    • Ordonez, V.1    Kulkarni, G.2    Berg, T.L.3
  • 27
    • 85083951479 scopus 로고    scopus 로고
    • Sequence level training with recurrent neural networks
    • Marc'Aurelio Ranzato, Sumit Chopra, Michael Auli, and Wojciech Zaremba. 2016. Sequence level training with recurrent neural networks. In ICLR.
    • (2016) ICLR
    • Ranzato, M.1    Chopra, S.2    Auli, M.3    Zaremba, W.4
  • 28
    • 84960980241 scopus 로고    scopus 로고
    • Faster R-CNN: Towards real-time object detection with region proposal networks
    • Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2015. Faster r-cnn: Towards real-time object detection with region proposal networks. In NIPS. 91-99.
    • (2015) NIPS , pp. 91-99
    • Ren, S.1    He, K.2    Girshick, R.3    Sun, J.4
  • 29
    • 80052892795 scopus 로고    scopus 로고
    • Evaluating knowledge transfer and zero-shot learning in a large-scale setting
    • Marcus Rohrbach, Michael Stark, and Bernt Schiele. 2011. Evaluating knowledge transfer and zero-shot learning in a large-scale setting. In CVPR. 1641-1648.
    • (2011) CVPR , pp. 1641-1648
    • Rohrbach, M.1    Stark, M.2    Schiele, B.3
  • 31
    • 85083953063 scopus 로고    scopus 로고
    • Very deep convolutional networks for large-scale image recognition
    • Karen Simonyan and Andrew Zisserman. 2015. Very deep convolutional networks for large-scale image recognition. In ICLR.
    • (2015) ICLR
    • Simonyan, K.1    Zisserman, A.2
  • 32
    • 85028013193 scopus 로고    scopus 로고
    • Inception-v4, inception-resnet and the impact of residual connections on learning
    • Christian Szegedy, Sergey Ioffe, Vincent Vanhoucke, and Alexander A Alemi. 2017. Inception-v4, inception-resnet and the impact of residual connections on learning. In AAAI.
    • (2017) AAAI
    • Szegedy, C.1    Ioffe, S.2    Vanhoucke, V.3    Alemi, A.A.4
  • 33
    • 85041928364 scopus 로고    scopus 로고
    • Paying attention to descriptions generated by image captioning models
    • Hamed R Tavakoliy, Rakshith Shetty, Ali Borji, and Jorma Laaksonen. 2017. Paying Attention to Descriptions Generated by Image Captioning Models. In ICCV. 2506-2515.
    • (2017) ICCV , pp. 2506-2515
    • Tavakoliy, H.R.1    Shetty, R.2    Borji, A.3    Laaksonen, J.4
  • 36
    • 85018863845 scopus 로고    scopus 로고
    • Matching networks for one shot learning
    • Oriol Vinyals, Charles Blundell, Tim Lillicrap, Daan Wierstra, et al. 2016. Matching networks for one shot learning. In NIPS. 3630-3638.
    • (2016) NIPS , pp. 3630-3638
    • Vinyals, O.1    Blundell, C.2    Lillicrap, T.3    Wierstra, D.4
  • 37
    • 84946747440 scopus 로고    scopus 로고
    • Show and tell: A neural image caption generator
    • Oriol Vinyals, Alexander Toshev, Samy Bengio, and Dumitru Erhan. 2015. Show and tell: A neural image caption generator. In CVPR. 3156-3164.
    • (2015) CVPR , pp. 3156-3164
    • Vinyals, O.1    Toshev, A.2    Bengio, S.3    Erhan, D.4
  • 40
    • 84970002232 scopus 로고    scopus 로고
    • Show, attend and tell: Neural image caption generation with visual attention
    • Kelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhudinov, Rich Zemel, and Yoshua Bengio. 2015. Show, attend and tell: Neural image caption generation with visual attention. In ICML. 2048-2057.
    • (2015) ICML , pp. 2048-2057
    • Xu, K.1    Ba, J.2    Kiros, R.3    Cho, K.4    Courville, A.5    Salakhudinov, R.6    Zemel, R.7    Bengio, Y.8
  • 41
    • 85029391966 scopus 로고    scopus 로고
    • Incorporating copying mechanism in image captioning for learning novel objects
    • Ting Yao, Yingwei Pan, Yehao Li, and Tao Mei. 2017. Incorporating copying mechanism in image captioning for learning novel objects. In CVPR. 5263-5271.
    • (2017) CVPR , pp. 5263-5271
    • Yao, T.1    Pan, Y.2    Li, Y.3    Mei, T.4
  • 42
    • 84986317307 scopus 로고    scopus 로고
    • Image captioning with semantic attention
    • Quanzeng You, Hailin Jin, Zhaowen Wang, Chen Fang, and Jiebo Luo. 2016. Image captioning with semantic attention. In CVPR. 4651-4659.
    • (2016) CVPR , pp. 4651-4659
    • You, Q.1    Jin, H.2    Wang, Z.3    Fang, C.4    Luo, J.5
  • 43
    • 85023773582 scopus 로고    scopus 로고
    • Uncovering the temporal context for video question answering
    • 01 Sep 2017
    • Linchao Zhu, Zhongwen Xu, Yi Yang, and Alexander G. Hauptmann. 2017. Uncovering the Temporal Context for Video Question Answering. International Journal of Computer Vision 124, 3 (01 Sep 2017), 409-421. https://doi.org/10.1007/s11263-017-1033-7
    • (2017) International Journal of Computer Vision , vol.124 , Issue.3 , pp. 409-421
    • Zhu, L.1    Xu, Z.2    Yang, Y.3    Hauptmann, A.G.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.