메뉴 건너뛰기




Volumn 2016-December, Issue , 2016, Pages 5288-5296

MSR-VTT: A large video description dataset for bridging video and language

Author keywords

[No Author keywords available]

Indexed keywords

MULTIMEDIA SYSTEMS; NATURAL LANGUAGE PROCESSING SYSTEMS; PATTERN RECOGNITION; SEARCH ENGINES;

EID: 84986260127     PISSN: 10636919     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/CVPR.2016.571     Document Type: Conference Paper
Times cited : (2111)

References (42)
  • 1
    • 85116156579 scopus 로고    scopus 로고
    • METEOR: An automatic metric for mt evaluation with improved correlation with human judgments
    • S. Banerjee and A. Lavie. METEOR: An automatic metric for mt evaluation with improved correlation with human judgments. In Proceedings of ACL Workshop, pages 65-72, 2005.
    • (2005) Proceedings of ACL Workshop , pp. 65-72
    • Banerjee, S.1    Lavie, A.2
  • 3
    • 84859089502 scopus 로고    scopus 로고
    • Collecting highly parallel data for paraphrase evaluation
    • D. L. Chen and W. B. Dolan. Collecting highly parallel data for paraphrase evaluation. In Proceedings of ACL, pages 190-200, 2011.
    • (2011) Proceedings of ACL , pp. 190-200
    • Chen, D.L.1    Dolan, W.B.2
  • 4
    • 84957029470 scopus 로고    scopus 로고
    • Mind's Eye: A recurrent visual representation for image caption generation
    • X. Chen and C. L. Zitnick. Mind's Eye: A recurrent visual representation for image caption generation. In Proceedings of CVPR, 2015.
    • (2015) Proceedings of CVPR
    • Chen, X.1    Zitnick, C.L.2
  • 5
    • 84887345951 scopus 로고    scopus 로고
    • A thousand frames in just a few words: Lingual description of videos through latent topics and sparse object stitching
    • P. Das, C. Xu, R. F. Doell, and J. J. Corso. A thousand frames in just a few words: Lingual description of videos through latent topics and sparse object stitching. In Proceedings of CVPR, pages 2634-2641, 2013.
    • (2013) Proceedings of CVPR , pp. 2634-2641
    • Das, P.1    Xu, C.2    Doell, R.F.3    Corso, J.J.4
  • 11
    • 84946734827 scopus 로고    scopus 로고
    • Deep visual-semantic alignments for generating image descriptions
    • A. Karpathy and L. Fei-Fei. Deep visual-semantic alignments for generating image descriptions. In Proceedings of CVPR, 2015.
    • (2015) Proceedings of CVPR
    • Karpathy, A.1    Fei-Fei, L.2
  • 14
    • 84952349298 scopus 로고    scopus 로고
    • Unifying visualsemantic embeddings with multimodal neural language models
    • R. Kiros, R. Salakhutdinov, and R. S. Zemel. Unifying visualsemantic embeddings with multimodal neural language models. TACL, 2015.
    • (2015) TACL
    • Kiros, R.1    Salakhutdinov, R.2    Zemel, R.S.3
  • 15
    • 0036843382 scopus 로고    scopus 로고
    • Natural language description of human activities from video images based on concept hierarchy of actions
    • A. Kojima, T. Tamura, and K. Fukunaga. Natural language description of human activities from video images based on concept hierarchy of actions. International Journal of Computer Vision, 50(2):171-184, 2002.
    • (2002) International Journal of Computer Vision , vol.50 , Issue.2 , pp. 171-184
    • Kojima, A.1    Tamura, T.2    Fukunaga, K.3
  • 17
    • 84876231242 scopus 로고    scopus 로고
    • Imagenet classification with deep convolutional neural networks
    • A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. In Proceedings of NIPS, pages 1097-1105, 2012.
    • (2012) Proceedings of NIPS , pp. 1097-1105
    • Krizhevsky, A.1    Sutskever, I.2    Hinton, G.E.3
  • 23
    • 84893956152 scopus 로고    scopus 로고
    • Multimedia search reranking: A literature survey
    • T. Mei, Y. Rui, S. Li, and Q. Tian. Multimedia search reranking: A literature survey. ACM Computing Surveys (CSUR), 46(3):38, 2014.
    • (2014) ACM Computing Surveys (CSUR) , vol.46 , Issue.3 , pp. 38
    • Mei, T.1    Rui, Y.2    Li, S.3    Tian, Q.4
  • 24
    • 85133336275 scopus 로고    scopus 로고
    • BLEU: A method for automatic evaluation of machine translation
    • K. Papineni, S. Roukos, T. Ward, and W.-J. Zhu. BLEU: a method for automatic evaluation of machine translation. In Proceedings of ACL, pages 311-318, 2002.
    • (2002) Proceedings of ACL , pp. 311-318
    • Papineni, K.1    Roukos, S.2    Ward, T.3    Zhu, W.-J.4
  • 29
    • 85083953063 scopus 로고    scopus 로고
    • Very deep convolutional networks for large-scale image recognition
    • K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. In Proceedings of ICLR, 2015.
    • (2015) Proceedings of ICLR
    • Simonyan, K.1    Zisserman, A.2
  • 30
    • 84973888835 scopus 로고    scopus 로고
    • Automatic concept discovery from parallel text and visual corpora
    • C. Sun, C. Gan, and R. Nevatia. Automatic concept discovery from parallel text and visual corpora. In ICCV, pages 2596-2604, 2015.
    • (2015) ICCV , pp. 2596-2604
    • Sun, C.1    Gan, C.2    Nevatia, R.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.