Volume 2015-January, 2015, Pages 73-81

Expressing an image stream with a sequence of natural sentences

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER HARDWARE DESCRIPTION LANGUAGES; CONVOLUTION; INFORMATION SCIENCE; NEURAL NETWORKS

EID: 84965149840     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 102

References (32)
  • 1
    • 41549147844
    • Modeling local coherence: An entity-based approach
    • R. Barzilay and M. Lapata. Modeling Local Coherence: An Entity-Based Approach. In ACL, 2008.
    • (2008) ACL
    • Barzilay, R.1    Lapata, M.2
  • 3
    • 84957029470
    • Mind's eye: A recurrent visual representation for image caption generation
    • X. Chen and C. L. Zitnick. Mind's Eye: A Recurrent Visual Representation for Image Caption Generation. In CVPR, 2015.
    • (2015) CVPR
    • Chen, X.1    Zitnick, C.L.2
  • 4
  • 6
    • 84959243872
    • Improving image-sentence embeddings using large weakly annotated photo collections
    • Y. Gong, L. Wang, M. Hodosh, J. Hockenmaier, and S. Lazebnik. Improving Image-Sentence Embeddings Using Large Weakly Annotated Photo Collections. In ECCV, 2014.
    • (2014) ECCV
    • Gong, Y.1    Wang, L.2    Hodosh, M.3    Hockenmaier, J.4    Lazebnik, S.5
  • 8
    • 84883394520
    • Framing image description as a ranking task: Data, models and evaluation metrics
    • M. Hodosh, P. Young, and J. Hockenmaier. Framing Image Description as a Ranking Task: Data, Models and Evaluation Metrics. JAIR, 47:853-899, 2013.
    • (2013) JAIR , vol.47 , pp. 853-899
    • Hodosh, M.1    Young, P.2    Hockenmaier, J.3
  • 9
    • 84946734827
    • Deep visual-semantic alignments for generating image descriptions
    • A. Karpathy and L. Fei-Fei. Deep Visual-Semantic Alignments for Generating Image Descriptions. In CVPR, 2015.
    • (2015) CVPR
    • Karpathy, A.1    Fei-Fei, L.2
  • 10
    • 84959191227
    • Joint photo stream and blog post summarization and exploration
    • G. Kim, S. Moon, and L. Sigal. Joint Photo Stream and Blog Post Summarization and Exploration. In CVPR, 2015.
    • (2015) CVPR
    • Kim, G.1    Moon, S.2    Sigal, L.3
  • 11
    • 84959189488
    • Ranking and retrieval of image sequences from multiple paragraph queries
    • G. Kim, S. Moon, and L. Sigal. Ranking and Retrieval of Image Sequences from Multiple Paragraph Queries. In CVPR, 2015.
    • (2015) CVPR
    • Kim, G.1    Moon, S.2    Sigal, L.3
  • 13
    • 84876231242
    • Imagenet classification with deep convolutional neural networks
    • A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet Classification with Deep Convolutional Neural Networks. In NIPS, 2012.
    • (2012) NIPS
    • Krizhevsky, A.1    Sutskever, I.2    Hinton, G.E.3
  • 15
    • 84934873221
    • TreeTalk: Composition and compression of trees for image descriptions
    • P. Kuznetsova, V. Ordonez, T. L. Berg, and Y. Choi. TreeTalk: Composition and Compression of Trees for Image Descriptions. In TACL, 2014.
    • (2014) TACL
    • Kuznetsova, P.1    Ordonez, V.2    Berg, T.L.3    Choi, Y.4
  • 16
    • 33847226906
    • METEOR: An automatic metric for MT evaluation with improved correlation with human judgments
    • S. Banerjee and A. Lavie. METEOR: An Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments. In ACL, 2005.
    • (2005) ACL
    • Banerjee, S.1    Lavie, A.2
  • 17
    • 84919829999
    • Distributed representations of sentences and documents
    • Q. Le and T. Mikolov. Distributed Representations of Sentences and Documents. In ICML, 2014.
    • (2014) ICML
    • Le, Q.1    Mikolov, T.2
  • 19
    • 85083950512
    • Deep captioning with multimodal recurrent neural networks (m-RNN)
    • J. Mao, W. Xu, Y. Yang, J. Wang, Z. Huang, and A. L. Yuille. Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN). In ICLR, 2015.
    • (2015) ICLR
    • Mao, J.1    Xu, W.2    Yang, Y.3    Wang, J.4    Huang, Z.5    Yuille, A.L.6
  • 21
    • 85162522202
    • Im2Text: Describing images using 1 million captioned photographs
    • V. Ordonez, G. Kulkarni, and T. L. Berg. Im2Text: Describing Images Using 1 Million Captioned Photographs. In NIPS, 2011.
    • (2011) NIPS
    • Ordonez, V.1    Kulkarni, G.2    Berg, T.L.3
  • 22
    • 85133336275
    • BLEU: A method for automatic evaluation of machine translation
    • K. Papineni, S. Roukos, T. Ward, and W.-J. Zhu. BLEU: A Method for Automatic Evaluation of Machine Translation. In ACL, 2002.
    • (2002) ACL
    • Papineni, K.1    Roukos, S.2    Ward, T.3    Zhu, W.-J.4
  • 24
    • 84965144884
    • Bidirectional recurrent neural networks
    • M. Schuster and K. K. Paliwal. Bidirectional Recurrent Neural Networks. In IEEE TSP, 1997.
    • (1997) IEEE TSP
    • Schuster, M.1    Paliwal, K.K.2
  • 25
    • 85083953063
    • Very deep convolutional networks for large-scale image recognition
    • K. Simonyan and A. Zisserman. Very Deep Convolutional Networks for Large-Scale Image Recognition. In ICLR, 2015.
    • (2015) ICLR
    • Simonyan, K.1    Zisserman, A.2
  • 26
    • 84906925854
    • Grounded compositional semantics for finding and describing images with sentences
    • R. Socher, A. Karpathy, Q. V. Le, C. D. Manning, and A. Y. Ng. Grounded Compositional Semantics for Finding and Describing Images with Sentences. In TACL, 2013.
    • (2013) TACL
    • Socher, R.1    Karpathy, A.2    Le, Q.V.3    Manning, C.D.4    Ng, A.Y.5
  • 27
    • 84877724347
    • Multimodal learning with deep boltzmann machines
    • N. Srivastava and R. Salakhutdinov. Multimodal Learning with Deep Boltzmann Machines. In NIPS, 2012.
    • (2012) NIPS
    • Srivastava, N.1    Salakhutdinov, R.2
  • 30
    • 84946747440
    • Show and tell: A neural image caption generator
    • O. Vinyals, A. Toshev, S. Bengio, and D. Erhan. Show and Tell: A Neural Image Caption Generator. In CVPR, 2015.
    • (2015) CVPR
    • Vinyals, O.1    Toshev, A.2    Bengio, S.3    Erhan, D.4
  • 31
    • 0000903748
    • Generalization of backpropagation with application to a recurrent gas market model
    • P. J. Werbos. Generalization of Backpropagation with Application to a Recurrent Gas Market Model. Neural Networks, 1:339-356, 1988.
    • (1988) Neural Networks , vol.1 , pp. 339-356
    • Werbos, P.J.1
  • 32
    • 84952349307
    • Jointly modeling deep video and compositional text to bridge vision and language in a unified framework
    • R. Xu, C. Xiong, W. Chen, and J. J. Corso. Jointly Modeling Deep Video and Compositional Text to Bridge Vision and Language in a Unified Framework. In AAAI, 2015.
    • (2015) AAAI
    • Xu, R.1    Xiong, C.2    Chen, W.3    Corso, J.J.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.