메뉴 건너뛰기




Volumn 2016-December, Issue , 2016, Pages 1-10

Deep compositional captioning: Describing novel object categories without paired training data

Author keywords

[No Author keywords available]

Indexed keywords

CHARACTER RECOGNITION; COMPUTER VISION; KNOWLEDGE MANAGEMENT; OBJECT RECOGNITION;

EID: 84986274522     PISSN: 10636919     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/CVPR.2016.8     Document Type: Conference Paper
Times cited : (305)

References (37)
  • 2
    • 84859089502 scopus 로고    scopus 로고
    • Collecting highly parallel data for paraphrase evaluation
    • D. L. Chen and W. B. Dolan. Collecting highly parallel data for paraphrase evaluation. In ACL, 2011.
    • (2011) ACL
    • Chen, D.L.1    Dolan, W.B.2
  • 3
    • 80052876786 scopus 로고    scopus 로고
    • What does classifying more than 10, 000 image categories tell us
    • J. Deng, A. Berg, K. Li, and L. Fei-Fei. What does classifying more than 10, 000 image categories tell us In ECCV, 2010.
    • (2010) ECCV
    • Deng, J.1    Berg, A.2    Li, K.3    Fei-Fei, L.4
  • 9
    • 70450202741 scopus 로고    scopus 로고
    • Understanding videos, constructing plots learning a visually grounded storyline model from annotated videos
    • A. Guptal, P. Srinivasan, J. Shi, and L. Davis. Understanding videos, constructing plots learning a visually grounded storyline model from annotated videos. In CVPR, 2009.
    • (2009) CVPR
    • Guptal, A.1    Srinivasan, P.2    Shi, J.3    Davis, L.4
  • 11
    • 84906494296 scopus 로고    scopus 로고
    • From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions
    • P. Hodosh, A. Young, M. Lai, and J. Hockenmaier. From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions. In TACL, 2014.
    • (2014) TACL
    • Hodosh, P.1    Young, A.2    Lai, M.3    Hockenmaier, J.4
  • 15
    • 84946734827 scopus 로고    scopus 로고
    • Deep visual-semantic alignments for generating image descriptions
    • A. Karpathy and L. Fei-Fei. Deep visual-semantic alignments for generating image descriptions. CVPR, 2015.
    • (2015) CVPR
    • Karpathy, A.1    Fei-Fei, L.2
  • 16
    • 84952349298 scopus 로고    scopus 로고
    • Unifying visual-semantic embeddings with multimodal neural language models
    • R. Kiros, R. Salakhuditnov, and R. S. Zemel. Unifying visual-semantic embeddings with multimodal neural language models. TACL, 2015.
    • (2015) TACL
    • Kiros, R.1    Salakhuditnov, R.2    Zemel, R.S.3
  • 19
    • 84894522762 scopus 로고    scopus 로고
    • Attributebased classification for zero-shot visual object categorization
    • C. Lampert, H. Nickisch, and S. Harmeling. Attributebased classification for zero-shot visual object categorization. TPAMI, 2014.
    • (2014) TPAMI
    • Lampert, C.1    Nickisch, H.2    Harmeling, S.3
  • 21
    • 84973863256 scopus 로고    scopus 로고
    • Learning like a child: Fast novel visual concept learning from sentence descriptions of images
    • J. Mao, W. Xu, Y. Yang, J. Wang, Z. Huang, and A. L. Yuille. Learning like a child: Fast novel visual concept learning from sentence descriptions of images. In ICCV, 2015.
    • (2015) ICCV
    • Mao, J.1    Xu, W.2    Yang, Y.3    Wang, J.4    Huang, Z.5    Yuille, A.L.6
  • 22
    • 85083951332 scopus 로고    scopus 로고
    • Efficient estimation of word representations in vector space
    • T. Mikolov, K. Chen, G. Corrado, and J. Dean. Efficient estimation of word representations in vector space. ICLR Workshop, 2013.
    • (2013) ICLR Workshop
    • Mikolov, T.1    Chen, K.2    Corrado, G.3    Dean, J.4
  • 23
    • 85133336275 scopus 로고    scopus 로고
    • BLEU: A method for automatic evaluation of machine translation
    • K. Papineni, S. Roukos, T. Ward, and W.-J. Zhu. BLEU: A method for automatic evaluation of machine translation. In ACL, 2002.
    • (2002) ACL
    • Papineni, K.1    Roukos, S.2    Ward, T.3    Zhu, W.-J.4
  • 24
  • 26
    • 84973887740 scopus 로고    scopus 로고
    • The long-short story of movie description
    • A. Rohrbach, M. Rohrbach, and B. Schiele. The long-short story of movie description. GCPR, 2015.
    • (2015) GCPR
    • Rohrbach, A.1    Rohrbach, M.2    Schiele, B.3
  • 27
    • 77955989949 scopus 로고    scopus 로고
    • What helps Where-and Why Semantic Relatedness for Knowledge Transfer
    • M. Rohrbach, M. Stark, G. Szarvas, I. Gurevych, and B. Schiele. What helps Where-and Why Semantic Relatedness for Knowledge Transfer. In CVPR, 2010.
    • (2010) CVPR
    • Rohrbach, M.1    Stark, M.2    Szarvas, G.3    Gurevych, I.4    Schiele, B.5
  • 29
    • 85083953063 scopus 로고    scopus 로고
    • Very deep convolutional networks for large-scale image recognition
    • K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. ICLR, 2015.
    • (2015) ICLR
    • Simonyan, K.1    Zisserman, A.2
  • 30
    • 84898938559 scopus 로고    scopus 로고
    • Zero-shot learning through cross-modal transfer
    • R. Socher, M. Ganjoo, C. D. Manning, and A. Ng. Zero-shot learning through cross-modal transfer. In NIPS. 2013.
    • (2013) NIPS.
    • Socher, R.1    Ganjoo, M.2    Manning, C.D.3    Ng, A.4
  • 31
    • 84959932469 scopus 로고    scopus 로고
    • Integrating language and vision to generate natural language descriptions of videos in the wild
    • J. Thomason, S. Venugopalan, S. Guadarrama, K. Saenko, and R. J. Mooney. Integrating language and vision to generate natural language descriptions of videos in the wild. In COLING, 2014.
    • (2014) COLING
    • Thomason, J.1    Venugopalan, S.2    Guadarrama, S.3    Saenko, K.4    Mooney, R.J.5
  • 32
    • 84905470734 scopus 로고    scopus 로고
    • Overview of the imageclef 2012 flickr photo annotation and retrieval task
    • B. Thomee and A. Popescu. Overview of the imageclef 2012 flickr photo annotation and retrieval task. In CLEF (Online Working Notes/Labs/Workshop), volume 12, 2012.
    • (2012) CLEF (Online Working Notes/Labs/Workshop) , vol.12
    • Thomee, B.1    Popescu, A.2
  • 33
    • 84983470508 scopus 로고    scopus 로고
    • Feature-rich part-of-speech tagging with a cyclic dependency network
    • K. Toutanova, D. Klein, C. D. Manning, and Y. Singer. Feature-rich part-of-speech tagging with a cyclic dependency network. In NAACL, 2003.
    • (2003) NAACL
    • Toutanova, K.1    Klein, D.2    Manning, C.D.3    Singer, Y.4
  • 36
    • 84946747440 scopus 로고    scopus 로고
    • Show and tell: A neural image caption generator
    • O. Vinyals, A. Toshev, S. Bengio, and D. Erhan. Show and tell: A neural image caption generator. CVPR, 2015.
    • (2015) CVPR
    • Vinyals, O.1    Toshev, A.2    Bengio, S.3    Erhan, D.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.