메뉴 건너뛰기




Volumn 2015 International Conference on Computer Vision, ICCV 2015, Issue , 2015, Pages 4534-4542

Sequence to sequence - Video to text

Author keywords

[No Author keywords available]

Indexed keywords

COMPLEX NETWORKS; RECURRENT NEURAL NETWORKS;

EID: 84973882730     PISSN: 15505499     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICCV.2015.515     Document Type: Conference Paper
Times cited : (1483)

References (44)
  • 1
    • 77951155435 scopus 로고    scopus 로고
    • Video2text: Learning to annotate video content
    • H. Aradhye, G. Toderici, and J. Yagnik. Video2text: Learning to annotate video content. In ICDMW, 2009.
    • (2009) ICDMW
    • Aradhye, H.1    Toderici, G.2    Yagnik, J.3
  • 2
    • 35048833329 scopus 로고    scopus 로고
    • High accuracy optical flow estimation based on a theory for warping
    • T. Brox, A. Bruhn, N. Papenberg, and J. Weickert. High accuracy optical flow estimation based on a theory for warping. In ECCV, pages 25-36, 2004.
    • (2004) ECCV , pp. 25-36
    • Brox, T.1    Bruhn, A.2    Papenberg, N.3    Weickert, J.4
  • 3
    • 84859089502 scopus 로고    scopus 로고
    • Collecting highly parallel data for paraphrase evaluation
    • D. L. Chen and W. B. Dolan. Collecting highly parallel data for paraphrase evaluation. In ACL, 2011.
    • (2011) ACL
    • Chen, D.L.1    Dolan, W.B.2
  • 5
    • 84957029470 scopus 로고    scopus 로고
    • Learning a recurrent visual representation for image caption generation
    • X. Chen and C. L. Zitnick. Learning a recurrent visual representation for image caption generation. CVPR, 2015.
    • (2015) CVPR
    • Chen, X.1    Zitnick, C.L.2
  • 7
    • 85107661995 scopus 로고    scopus 로고
    • Meteor universal: Language specific translation evaluation for any target language
    • M. Denkowski and A. Lavie. Meteor universal: Language specific translation evaluation for any target language. In EACL, 2014.
    • (2014) EACL
    • Denkowski, M.1    Lavie, A.2
  • 10
    • 84919832465 scopus 로고    scopus 로고
    • Towards end-to-end speech recognition with recurrent neural networks
    • A. Graves and N. Jaitly. Towards end-to-end speech recognition with recurrent neural networks. In ICML, 2014.
    • (2014) ICML
    • Graves, A.1    Jaitly, N.2
  • 13
    • 84906494296 scopus 로고    scopus 로고
    • From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions
    • P. Hodosh, A. Young, M. Lai, and J. Hockenmaier. From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions. In TACL, 2014.
    • (2014) TACL
    • Hodosh, P.1    Young, A.2    Lai, M.3    Hockenmaier, J.4
  • 14
    • 85072312550 scopus 로고    scopus 로고
    • A multi-modal clustering method for web videos
    • H. Huang, Y. Lu, F. Zhang, and S. Sun. A multi-modal clustering method for web videos. In ISCTCS. 2013.
    • (2013) ISCTCS.
    • Huang, H.1    Lu, Y.2    Zhang, F.3    Sun, S.4
  • 16
    • 84946734827 scopus 로고    scopus 로고
    • Deep visual-semantic alignments for generating image descriptions
    • A. Karpathy and L. Fei-Fei. Deep visual-semantic alignments for generating image descriptions. CVPR, 2015.
    • (2015) CVPR
    • Karpathy, A.1    Fei-Fei, L.2
  • 20
    • 84934873221 scopus 로고    scopus 로고
    • Treetalk: Composition and compression of trees for image descriptions
    • P. Kuznetsova, V. Ordonez, T. L. Berg, U. C. Hill, and Y. Choi. Treetalk: Composition and compression of trees for image descriptions. In TACL, 2014.
    • (2014) TACL
    • Kuznetsova, P.1    Ordonez, V.2    Berg, T.L.3    Hill, U.C.4    Choi, Y.5
  • 26
    • 85133336275 scopus 로고    scopus 로고
    • Bleu: A method for automatic evaluation of machine translation
    • K. Papineni, S. Roukos, T. Ward, and W.-J. Zhu. Bleu: A method for automatic evaluation of machine translation. In ACL, 2002.
    • (2002) ACL
    • Papineni, K.1    Roukos, S.2    Ward, T.3    Zhu, W.-J.4
  • 27
    • 84973887740 scopus 로고    scopus 로고
    • The long-short story of movie description
    • A. Rohrbach, M. Rohrbach, and B. Schiele. The long-short story of movie description. GCPR, 2015.
    • (2015) GCPR
    • Rohrbach, A.1    Rohrbach, M.2    Schiele, B.3
  • 31
    • 84937862424 scopus 로고    scopus 로고
    • Two-stream convolutional networks for action recognition in videos
    • K. Simonyan and A. Zisserman. Two-stream convolutional networks for action recognition in videos. In NIPS, 2014.
    • (2014) NIPS
    • Simonyan, K.1    Zisserman, A.2
  • 33
    • 84969544782 scopus 로고    scopus 로고
    • Unsupervised learning of video representations using LSTMs
    • N. Srivastava, E. Mansimov, and R. Salakhutdinov. Unsupervised learning of video representations using LSTMs. ICML, 2015.
    • (2015) ICML
    • Srivastava, N.1    Mansimov, E.2    Salakhutdinov, R.3
  • 34
    • 84928547704 scopus 로고    scopus 로고
    • Sequence to sequence learning with neural networks
    • I. Sutskever, O. Vinyals, and Q. V. Le. Sequence to sequence learning with neural networks. In NIPS, 2014.
    • (2014) NIPS
    • Sutskever, I.1    Vinyals, O.2    Le, Q.V.3
  • 36
    • 84959932469 scopus 로고    scopus 로고
    • Integrating language and vision to generate natural language descriptions of videos in the wild
    • J. Thomason, S. Venugopalan, S. Guadarrama, K. Saenko, and R. J. Mooney. Integrating language and vision to generate natural language descriptions of videos in the wild. In COLING, 2014.
    • (2014) COLING
    • Thomason, J.1    Venugopalan, S.2    Guadarrama, S.3    Saenko, K.4    Mooney, R.J.5
  • 38
    • 84956980995 scopus 로고    scopus 로고
    • CIDEr: Consensus-based image description evaluation
    • R. Vedantam, C. L. Zitnick, and D. Parikh. CIDEr: Consensus-based image description evaluation. CVPR, 2015.
    • (2015) CVPR
    • Vedantam, R.1    Zitnick, C.L.2    Parikh, D.3
  • 40
    • 84946747440 scopus 로고    scopus 로고
    • Show and tell: A neural image caption generator
    • O. Vinyals, A. Toshev, S. Bengio, and D. Erhan. Show and tell: A neural image caption generator. CVPR, 2015.
    • (2015) CVPR
    • Vinyals, O.1    Toshev, A.2    Bengio, S.3    Erhan, D.4
  • 41
    • 84898805910 scopus 로고    scopus 로고
    • Action recognition with improved trajectories
    • IEEE
    • H. Wang and C. Schmid. Action recognition with improved trajectories. In ICCV, pages 3551-3558. IEEE, 2013.
    • (2013) ICCV , pp. 3551-3558
    • Wang, H.1    Schmid, C.2
  • 42
    • 77954177620 scopus 로고    scopus 로고
    • Multimodal fusion for video search reranking
    • S. Wei, Y. Zhao, Z. Zhu, and N. Liu. Multimodal fusion for video search reranking. TKDE, 2010.
    • (2010) TKDE
    • Wei, S.1    Zhao, Y.2    Zhu, Z.3    Liu, N.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.