



Volume 2015 International Conference on Computer Vision, ICCV 2015, 2015, Pages 4507-4515

Describing videos by exploiting temporal structure

Author keywords

[No Author keywords available]

Indexed keywords

BEHAVIORAL RESEARCH; COMPUTATIONAL LINGUISTICS; COMPUTER VISION; MODELING LANGUAGES; NEURAL NETWORKS; RECURRENT NEURAL NETWORKS

EID: 84973884896     PISSN: 15505499     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICCV.2015.512     Document Type: Conference Paper
Times cited: 1129

References (43)
  • 1
    • 85083953689
    • Neural machine translation by jointly learning to align and translate
    • D. Bahdanau, K. Cho, and Y. Bengio. Neural machine translation by jointly learning to align and translate. ICLR, 2015.
    • (2015) ICLR
    • Bahdanau, D.1    Cho, K.2    Bengio, Y.3
  • 6
    • 84859089502
    • Collecting highly parallel data for paraphrase evaluation
    • D. L. Chen and W. B. Dolan. Collecting highly parallel data for paraphrase evaluation. In ACL, 2011.
    • (2011) ACL
    • Chen, D.L.1    Dolan, W.B.2
  • 8
    • 84961291190
    • Learning phrase representations using RNN encoder-decoder for statistical machine translation
    • K. Cho, B. van Merrienboer, C. Gulcehre, F. Bougares, H. Schwenk, and Y. Bengio. Learning phrase representations using RNN encoder-decoder for statistical machine translation. In EMNLP, Oct. 2014.
    • (2014) EMNLP
    • Cho, K.1    Van Merrienboer, B.2    Gulcehre, C.3    Bougares, F.4    Schwenk, H.5    Bengio, Y.6
  • 9
    • 34948855444
    • Human detection using oriented histograms of flow and appearance
    • N. Dalal, B. Triggs, and C. Schmid. Human detection using oriented histograms of flow and appearance. In ECCV, 2006.
    • (2006) ECCV
    • Dalal, N.1    Triggs, B.2    Schmid, C.3
  • 10
    • 84926007060
    • Meteor universal: Language specific translation evaluation for any target language
    • M. Denkowski and A. Lavie. Meteor universal: Language specific translation evaluation for any target language. In EACL Workshop, 2014.
    • (2014) EACL Workshop
    • Denkowski, M.1    Lavie, A.2
  • 12
    • 84973872525
    • Temporal localization of actions with actoms
    • A. Gaidon, Z. Harchaoui, and C. Schmid. Temporal localization of actions with actoms. PAMI, 2013.
    • (2013) PAMI
    • Gaidon, A.1    Harchaoui, Z.2    Schmid, C.3
  • 15
    • 84870183903
    • 3d convolutional neural networks for human action recognition
    • S. Ji, W. Xu, M. Yang, and K. Yu. 3d convolutional neural networks for human action recognition. PAMI, 2013.
    • (2013) PAMI
    • Ji, S.1    Xu, W.2    Yang, M.3    Yu, K.4
  • 17
    • 84952902559
    • Deep visual-semantic alignments for generating image descriptions
    • A. Karpathy and L. Fei-Fei. Deep visual-semantic alignments for generating image descriptions. In CVPR, 2014.
    • (2014) CVPR
    • Karpathy, A.1    Fei-Fei, L.2
  • 19
    • 84952349298
    • Unifying visual-semantic embeddings with multimodal neural language models
    • R. Kiros, R. Salakhutdinov, and R. S. Zemel. Unifying visual-semantic embeddings with multimodal neural language models. ACL, 2014.
    • (2014) ACL
    • Kiros, R.1    Salakhutdinov, R.2    Zemel, R.S.3
  • 20
    • 0036843382
    • Natural language description of human activities from video images based on concept hierarchy of actions
    • A. Kojima, T. Tamura, and K. Fukunaga. Natural language description of human activities from video images based on concept hierarchy of actions. IJCV, 2002.
    • (2002) IJCV
    • Kojima, A.1    Tamura, T.2    Fukunaga, K.3
  • 21
    • 84876231242
    • Imagenet classification with deep convolutional neural networks
    • A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. In NIPS, 2012.
    • (2012) NIPS
    • Krizhevsky, A.1    Sutskever, I.2    Hinton, G.E.3
  • 23
    • 85133336275
    • Bleu: A method for automatic evaluation of machine translation
    • K. Papineni, S. Roukos, T. Ward, and W.-J. Zhu. Bleu: A method for automatic evaluation of machine translation. In ACL, 2002.
    • (2002) ACL
    • Papineni, K.1    Roukos, S.2    Ward, T.3    Zhu, W.-J.4
  • 27
    • 85083951635
    • Overfeat: Integrated recognition, localization and detection using convolutional networks
    • P. Sermanet, D. Eigen, X. Zhang, M. Mathieu, R. Fergus, and Y. LeCun. Overfeat: Integrated recognition, localization and detection using convolutional networks. ICLR, 2014.
    • (2014) ICLR
    • Sermanet, P.1    Eigen, D.2    Zhang, X.3    Mathieu, M.4    Fergus, R.5    LeCun, Y.6
  • 28
    • 84937862424
    • Two-stream convolutional networks for action recognition in videos
    • K. Simonyan and A. Zisserman. Two-stream convolutional networks for action recognition in videos. NIPS, 2014.
    • (2014) NIPS
    • Simonyan, K.1    Zisserman, A.2
  • 30
    • 84928547704
    • Sequence to sequence learning with neural networks
    • I. Sutskever, O. Vinyals, and Q. V. Le. Sequence to sequence learning with neural networks. In NIPS, 2014.
    • (2014) NIPS
    • Sutskever, I.1    Vinyals, O.2    Le, Q.V.3
  • 32
    • 84887372329
    • Learning latent temporal structure for complex event detection
    • K. Tang, L. Fei-Fei, and D. Koller. Learning latent temporal structure for complex event detection. In CVPR. IEEE, 2012.
    • (2012) CVPR. IEEE
    • Tang, K.1    Fei-Fei, L.2    Koller, D.3
  • 34
    • 84959932469
    • Integrating language and vision to generate natural language descriptions of videos in the wild
    • J. Thomason, S. Venugopalan, S. Guadarrama, K. Saenko, and R. Mooney. Integrating language and vision to generate natural language descriptions of videos in the wild. In COLING, 2014.
    • (2014) COLING
    • Thomason, J.1    Venugopalan, S.2    Guadarrama, S.3    Saenko, K.4    Mooney, R.5
  • 37
    • 84956980995
    • CIDEr: Consensus-based image description evaluation
    • R. Vedantam, C. L. Zitnick, and D. Parikh. CIDEr: Consensus-based image description evaluation. CVPR, 2015.
    • (2015) CVPR
    • Vedantam, R.1    Zitnick, C.L.2    Parikh, D.3
  • 39
    • 84946747440
    • Show and tell: A neural image caption generator
    • O. Vinyals, A. Toshev, S. Bengio, and D. Erhan. Show and tell: A neural image caption generator. CVPR, 2015.
    • (2015) CVPR
    • Vinyals, O.1    Toshev, A.2    Bengio, S.3    Erhan, D.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.