



Volume 9358, 2015, Pages 209-221

The long-short story of movie description

Author keywords

[No Author keywords available]

Indexed keywords

HUMAN ROBOT INTERACTION; MOTION PICTURES; ROBOTS;

EID: 84952308628     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-319-24947-6_17     Document Type: Conference Paper
Times cited: 84

References (40)
  • 2
    • 84859089502
    • Collecting highly parallel data for paraphrase evaluation
    • Chen, D., Dolan, W.: Collecting highly parallel data for paraphrase evaluation. In: ACL (2011)
    • (2011) ACL
    • Chen, D.1    Dolan, W.2
  • 4
    • 84887345951
    • Thousand frames in just a few words: Lingual description of videos through latent topics and sparse object stitching
    • Das, P., Xu, C., Doell, R., Corso, J.: Thousand frames in just a few words: lingual description of videos through latent topics and sparse object stitching. In: CVPR (2013)
    • (2013) CVPR
    • Das, P.1    Xu, C.2    Doell, R.3    Corso, J.4
  • 7
    • 84906929591
    • Image description using visual dependency representations
    • Elliott, D., Keller, F.: Image description using visual dependency representations. In: EMNLP, pp. 1292-1302 (2013)
    • (2013) EMNLP , pp. 1292-1302
    • Elliott, D.1    Keller, F.2
  • 9
    • 78149311145
    • Every picture tells a story: Generating sentences from images
    • In: Daniilidis, K., Maragos, P., Paragios, N. (eds.), Springer, Heidelberg
    • Farhadi, A., Hejrati, M., Sadeghi, M.A., Young, P., Rashtchian, C., Hockenmaier, J., Forsyth, D.: Every picture tells a story: generating sentences from images. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part IV. LNCS, vol. 6314, pp. 15-29. Springer, Heidelberg (2010)
    • (2010) ECCV 2010, Part IV. LNCS , vol.6314 , pp. 15-29
    • Farhadi, A.1    Hejrati, M.2    Sadeghi, M.A.3    Young, P.4    Rashtchian, C.5    Hockenmaier, J.6    Forsyth, D.7
  • 13
  • 16
    • 84946734827
    • Deep visual-semantic alignments for generating image descriptions
    • Karpathy, A., Fei-Fei, L.: Deep visual-semantic alignments for generating image descriptions. In: CVPR (2015)
    • (2015) CVPR
    • Karpathy, A.1    Fei-Fei, L.2
  • 17
    • 84952349298
    • Unifying visual-semantic embeddings with multimodal neural language models
    • Kiros, R., Salakhutdinov, R., Zemel, R.S.: Unifying visual-semantic embeddings with multimodal neural language models. TACL (2015)
    • (2015) TACL
    • Kiros, R.1    Salakhutdinov, R.2    Zemel, R.S.3
  • 18
    • 0036843382
    • Natural language description of human activities from video images based on concept hierarchy of actions
    • Kojima, A., Tamura, T., Fukunaga, K.: Natural language description of human activities from video images based on concept hierarchy of actions. IJCV 50(2), 171-184 (2002)
    • (2002) IJCV , vol.50 , Issue.2 , pp. 171-184
    • Kojima, A.1    Tamura, T.2    Fukunaga, K.3
  • 20
    • 84934873221
    • Treetalk: Composition and compression of trees for image descriptions
    • Kuznetsova, P., Ordonez, V., Berg, T.L., Hill, U.C., Choi, Y.: Treetalk: composition and compression of trees for image descriptions. In: TACL (2014)
    • (2014) TACL
    • Kuznetsova, P.1    Ordonez, V.2    Berg, T.L.3    Hill, U.C.4    Choi, Y.5
  • 21
    • 85107661995
    • Meteor universal: Language specific translation evaluation for any target language
    • Denkowski, M., Lavie, A.: Meteor universal: language specific translation evaluation for any target language. In: ACL 2014, p. 376 (2014)
    • (2014) ACL 2014 , p. 376
    • Denkowski, M.1    Lavie, A.2
  • 22
    • 85083950512
    • Deep captioning with multimodal recurrent neural networks (M-RNN)
    • Mao, J., Xu, W., Yang, Y., Wang, J., Huang, Z., Yuille, A.: Deep captioning with multimodal recurrent neural networks (m-RNN). In: ICLR (2015)
    • (2015) ICLR
    • Mao, J.1    Xu, W.2    Yang, Y.3    Wang, J.4    Huang, Z.5    Yuille, A.6
  • 25
    • 85133336275
    • BLEU: A method for automatic evaluation of machine translation
    • Papineni, K., Roukos, S., Ward, T., Zhu, W.J.: BLEU: a method for automatic evaluation of machine translation. In: ACL (2002)
    • (2002) ACL
    • Papineni, K.1    Roukos, S.2    Ward, T.3    Zhu, W.J.4
  • 26
    • 84908670256
    • Coherent multi-sentence video description with variable level of detail
    • In: Jiang, X., Hornegger, J., Koch, R. (eds.), Springer, Heidelberg
    • Rohrbach, A., Rohrbach, M., Qiu, W., Friedrich, A., Pinkal, M., Schiele, B.: Coherent multi-sentence video description with variable level of detail. In: Jiang, X., Hornegger, J., Koch, R. (eds.) GCPR 2014. LNCS, vol. 8753, pp. 184-195. Springer, Heidelberg (2014)
    • (2014) GCPR 2014. LNCS , vol.8753 , pp. 184-195
    • Rohrbach, A.1    Rohrbach, M.2    Qiu, W.3    Friedrich, A.4    Pinkal, M.5    Schiele, B.6
  • 30
    • 84959932469
    • Integrating language and vision to generate natural language descriptions of videos in the wild
    • Thomason, J., Venugopalan, S., Guadarrama, S., Saenko, K., Mooney, R.J.: Integrating language and vision to generate natural language descriptions of videos in the wild. In: COLING (2014)
    • (2014) COLING
    • Thomason, J.1    Venugopalan, S.2    Guadarrama, S.3    Saenko, K.4    Mooney, R.J.5
  • 31
    • 84952349304
    • arXiv:1503.01070v1
    • Torabi, A., Pal, C., Larochelle, H., Courville, A.: Using descriptive video services to create a large data source for video annotation research (2015). arXiv:1503.01070v1
    • Torabi, A.1    Pal, C.2    Larochelle, H.3    Courville, A.4
  • 32
    • 84956980995
    • Cider: Consensus-based image description evaluation
    • Vedantam, R., Zitnick, C.L., Parikh, D.: Cider: Consensus-based image description evaluation. In: CVPR (2015)
    • (2015) CVPR
    • Vedantam, R.1    Zitnick, C.L.2    Parikh, D.3
  • 35
    • 84946747440
    • Show and tell: A neural image caption generator
    • Vinyals, O., Toshev, A., Bengio, S., Erhan, D.: Show and tell: A neural image caption generator. In: CVPR (2015)
    • (2015) CVPR
    • Vinyals, O.1    Toshev, A.2    Bengio, S.3    Erhan, D.4
  • 36
    • 84898805910
    • Action recognition with improved trajectories
    • Wang, H., Schmid, C.: Action recognition with improved trajectories. In: ICCV (2013)
    • (2013) ICCV
    • Wang, H.1    Schmid, C.2
  • 37
    • 84952349307
    • Jointly modeling deep video and compositional text to bridge vision and language in a unified framework
    • Xu, R., Xiong, C., Chen, W., Corso, J.J.: Jointly modeling deep video and compositional text to bridge vision and language in a unified framework. In: AAAI (2015)
    • (2015) AAAI
    • Xu, R.1    Xiong, C.2    Chen, W.3    Corso, J.J.4
  • 39
    • 84906494296
    • From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions
    • Young, P., Lai, A., Hodosh, M., Hockenmaier, J.: From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions. TACL 2, 67-78 (2014)
    • (2014) TACL , vol.2 , pp. 67-78
    • Young, P.1    Lai, A.2    Hodosh, M.3    Hockenmaier, J.4
  • 40
    • 84937964578
    • Learning Deep Features for Scene Recognition using Places Database
    • Zhou, B., Lapedriza, A., Xiao, J., Torralba, A., Oliva, A.: Learning Deep Features for Scene Recognition using Places Database. In: NIPS (2014)
    • (2014) NIPS
    • Zhou, B.1    Lapedriza, A.2    Xiao, J.3    Torralba, A.4    Oliva, A.5


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.