메뉴 건너뛰기




Volumn 2016-December, Issue , 2016, Pages 1029-1038

Hierarchical recurrent neural encoder for video representation with application to captioning

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER VISION; NEURAL NETWORKS; VIDEO RECORDING;

EID: 84986290372     PISSN: 10636919     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/CVPR.2016.117     Document Type: Conference Paper
Times cited : (441)

References (45)
  • 1
    • 85083953689 scopus 로고    scopus 로고
    • Neural machine translation by jointly learning to align and translate
    • D. Bahdanau, K. Cho, and Y. Bengio. Neural machine translation by jointly learning to align and translate. In ICLR, 2015.
    • (2015) ICLR
    • Bahdanau, D.1    Cho, K.2    Bengio, Y.3
  • 3
    • 0028392483 scopus 로고
    • Learning long-term dependencies with gradient descent is difficult
    • Y. Bengio, P. Simard, and P. Frasconi. Learning long-term dependencies with gradient descent is difficult. Neural Networks, IEEE Transactions on, 5 (2): 157-166, 1994.
    • (1994) Neural Networks, IEEE Transactions on , vol.5 , Issue.2 , pp. 157-166
    • Bengio, Y.1    Simard, P.2    Frasconi, P.3
  • 5
    • 84859089502 scopus 로고    scopus 로고
    • Collecting highly parallel data for paraphrase evaluation
    • D. L. Chen and W. B. Dolan. Collecting highly parallel data for paraphrase evaluation. In ACL, 2011.
    • (2011) ACL
    • Chen, D.L.1    Dolan, W.B.2
  • 8
    • 85107661995 scopus 로고    scopus 로고
    • Meteor universal: Language specific translation evaluation for any target language
    • M. Denkowski and A. Lavie. Meteor universal: Language specific translation evaluation for any target language. In EACL, 2014.
    • (2014) EACL
    • Denkowski, M.1    Lavie, A.2
  • 12
    • 84969584486 scopus 로고    scopus 로고
    • Batch normalization: Accelerating deep network training by reducing internal covariate shift
    • S. Ioffe and C. Szegedy. Batch normalization: Accelerating deep network training by reducing internal covariate shift. ICML, 2015.
    • (2015) ICML
    • Ioffe, S.1    Szegedy, C.2
  • 13
    • 77956004473 scopus 로고    scopus 로고
    • Aggregating local descriptors into a compact image representation
    • H. Jégou, M. Douze, C. Schmid, and P. Pérez. Aggregating local descriptors into a compact image representation. In CVPR, 2010.
    • (2010) CVPR
    • Jégou, H.1    Douze, M.2    Schmid, C.3    Pérez, P.4
  • 14
    • 84870183903 scopus 로고    scopus 로고
    • 3d convolutional neural networks for human action recognition
    • S. Ji, W. Xu, M. Yang, and K. Yu. 3d convolutional neural networks for human action recognition. TPAMI, 35 (1): 221-231, 2013.
    • (2013) TPAMI , vol.35 , Issue.1 , pp. 221-231
    • Ji, S.1    Xu, W.2    Yang, M.3    Yu, K.4
  • 16
    • 85083951076 scopus 로고    scopus 로고
    • ADAM: A method for stochastic optimization
    • D. Kingma and J. Ba. ADAM: A method for stochastic optimization. In ICLR, 2015.
    • (2015) ICLR
    • Kingma, D.1    Ba, J.2
  • 17
    • 84876231242 scopus 로고    scopus 로고
    • ImageNet classification with deep convolutional neural networks
    • A. Krizhevsky, I. Sutskever, and G. E. Hinton. ImageNet classification with deep convolutional neural networks. In NIPS, 2012.
    • (2012) NIPS
    • Krizhevsky, A.1    Sutskever, I.2    Hinton, G.E.3
  • 18
    • 26944501715 scopus 로고    scopus 로고
    • ROUGE: A package for automatic evaluation of summaries
    • C.-Y. Lin. ROUGE: A package for automatic evaluation of summaries. In ACL workshop, 2004.
    • (2004) ACL Workshop
    • Lin, C.-Y.1
  • 21
    • 84986332702 scopus 로고    scopus 로고
    • Jointly modeling embedding and translation to bridge video and language
    • Y. Pan, T. Mei, T. Yao, H. Li, and Y. Rui. Jointly modeling embedding and translation to bridge video and language. CVPR, 2016.
    • (2016) CVPR
    • Pan, Y.1    Mei, T.2    Yao, T.3    Li, H.4    Rui, Y.5
  • 22
    • 85133336275 scopus 로고    scopus 로고
    • BLEU: A method for automatic evaluation of machine translation
    • K. Papineni, S. Roukos, T. Ward, and W.-J. Zhu. BLEU: A method for automatic evaluation of machine translation. In ACL, 2002.
    • (2002) ACL
    • Papineni, K.1    Roukos, S.2    Ward, T.3    Zhu, W.-J.4
  • 25
    • 84883487458 scopus 로고    scopus 로고
    • Image classification with the fisher vector: Theory and practice
    • J. Sánchez, F. Perronnin, T. Mensink, and J. Verbeek. Image classification with the fisher vector: Theory and practice. IJCV, 105 (3): 222-245, 2013.
    • (2013) IJCV , vol.105 , Issue.3 , pp. 222-245
    • Sánchez, J.1    Perronnin, F.2    Mensink, T.3    Verbeek, J.4
  • 26
    • 84937862424 scopus 로고    scopus 로고
    • Two-stream convolutional networks for action recognition in videos
    • K. Simonyan and A. Zisserman. Two-stream convolutional networks for action recognition in videos. In NIPS, 2014.
    • (2014) NIPS
    • Simonyan, K.1    Zisserman, A.2
  • 27
    • 85083953063 scopus 로고    scopus 로고
    • Very deep convolutional networks for large-scale image recognition
    • K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. In ICLR, 2015.
    • (2015) ICLR
    • Simonyan, K.1    Zisserman, A.2
  • 28
    • 51449089344 scopus 로고    scopus 로고
    • Video google: A text retrieval approach to object matching in videos
    • J. Sivic and A. Zisserman. Video google: A text retrieval approach to object matching in videos. In CVPR, 2003.
    • (2003) CVPR
    • Sivic, J.1    Zisserman, A.2
  • 29
    • 84958256008 scopus 로고    scopus 로고
    • A hierarchical recurrent encoder-decoder for generative context-aware query suggestion
    • A. Sordoni, Y. Bengio, H. Vahabi, C. Lioma, J. G. Simonsen, and J.-Y. Nie. A hierarchical recurrent encoder-decoder for generative context-aware query suggestion. In CIKM, 2015.
    • (2015) CIKM
    • Sordoni, A.1    Bengio, Y.2    Vahabi, H.3    Lioma, C.4    Simonsen, J.G.5    Nie, J.-Y.6
  • 30
    • 84904163933 scopus 로고    scopus 로고
    • Dropout: A simple way to prevent neural networks from overfitting
    • N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov. Dropout: A simple way to prevent neural networks from overfitting. JMLR, 15 (1): 1929-1958, 2014.
    • (2014) JMLR , vol.15 , Issue.1 , pp. 1929-1958
    • Srivastava, N.1    Hinton, G.2    Krizhevsky, A.3    Sutskever, I.4    Salakhutdinov, R.5
  • 31
    • 84928547704 scopus 로고    scopus 로고
    • Sequence to sequence learning with neural networks
    • I. Sutskever, O. Vinyals, and Q. V. Le. Sequence to sequence learning with neural networks. In NIPS, 2014.
    • (2014) NIPS
    • Sutskever, I.1    Vinyals, O.2    Le, Q.V.3
  • 33
    • 84959932469 scopus 로고    scopus 로고
    • Integrating language and vision to generate natural language descriptions of videos in the wild
    • J. Thomason, S. Venugopalan, S. Guadarrama, K. Saenko, and R. Mooney. Integrating language and vision to generate natural language descriptions of videos in the wild. In COLING, 2014.
    • (2014) COLING
    • Thomason, J.1    Venugopalan, S.2    Guadarrama, S.3    Saenko, K.4    Mooney, R.5
  • 36
    • 84956980995 scopus 로고    scopus 로고
    • CIDEr: Consensus-based image description evaluation
    • R. Vedantam, C. L. Zitnick, and D. Parikh. CIDEr: Consensus-based image description evaluation. In CVPR, 2015.
    • (2015) CVPR
    • Vedantam, R.1    Zitnick, C.L.2    Parikh, D.3
  • 40
    • 84898805910 scopus 로고    scopus 로고
    • Action recognition with improved trajectories
    • H. Wang and C. Schmid. Action recognition with improved trajectories. In ICCV, 2013.
    • (2013) ICCV
    • Wang, H.1    Schmid, C.2
  • 41
    • 84959226659 scopus 로고    scopus 로고
    • A discriminative CNN video representation for event detection
    • Z. Xu, Y. Yang, and A. G. Hauptmann. A discriminative CNN video representation for event detection. In CVPR, 2015.
    • (2015) CVPR
    • Xu, Z.1    Yang, Y.2    Hauptmann, A.G.3
  • 43
    • 84986275061 scopus 로고    scopus 로고
    • Video paragraph captioning using hierarchical recurrent neural networks
    • H. Yu, J. Wang, Z. Huang, Y. Yang, and W. Xu. Video paragraph captioning using hierarchical recurrent neural networks. CVPR, 2016.
    • (2016) CVPR
    • Yu, H.1    Wang, J.2    Huang, Z.3    Yang, Y.4    Xu, W.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.