



Volume 2016-December, 2016, Pages 4594-4602

Jointly modeling embedding and translation to bridge video and language

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER VISION; PATTERN RECOGNITION; SEMANTICS

EID: 84986332702     PISSN: 10636919     EISSN: None     Source Type: Conference Proceeding
DOI: 10.1109/CVPR.2016.497     Document Type: Conference Paper
Times cited: 620

References (35)
  • 4
    • 84859089502
    • Collecting highly parallel data for paraphrase evaluation
    • D. L. Chen and W. B. Dolan. Collecting highly parallel data for paraphrase evaluation. In ACL, 2011.
    • (2011) ACL
    • Chen, D.L.1    Dolan, W.B.2
  • 10
    • 84856653718
    • Learning cross-modality similarity for multinomial data
    • Y. Jia, M. Salzmann, and T. Darrell. Learning cross-modality similarity for multinomial data. In ICCV, 2011.
    • (2011) ICCV
    • Jia, Y.1    Salzmann, M.2    Darrell, T.3
  • 13
    • 84952349298
    • Unifying visual-semantic embeddings with multimodal neural language models
    • R. Kiros, R. Salakhutdinov, and R. S. Zemel. Unifying visual-semantic embeddings with multimodal neural language models. TACL, 2015.
    • (2015) TACL
    • Kiros, R.1    Salakhutdinov, R.2    Zemel, R.S.3
  • 14
    • 84876231242
    • Imagenet classification with deep convolutional neural networks
    • A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. In NIPS, 2012.
    • (2012) NIPS
    • Krizhevsky, A.1    Sutskever, I.2    Hinton, G.E.3
  • 18
    • 84904538400
    • Clickthrough-based cross-view learning for image search
    • Y. Pan, T. Yao, T. Mei, H. Li, C.-W. Ngo, and Y. Rui. Clickthrough-based cross-view learning for image search. In SIGIR, 2014.
    • (2014) SIGIR
    • Pan, Y.1    Yao, T.2    Mei, T.3    Li, H.4    Ngo, C.-W.5    Rui, Y.6
  • 19
    • 85133336275
    • Bleu: A method for automatic evaluation of machine translation
    • K. Papineni, S. Roukos, T. Ward, and W.-J. Zhu. Bleu: A method for automatic evaluation of machine translation. In ACL, 2002.
    • (2002) ACL
    • Papineni, K.1    Roukos, S.2    Ward, T.3    Zhu, W.-J.4
  • 23
    • 85083953063
    • Very deep convolutional networks for large-scale image recognition
    • K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. In ICLR, 2015.
    • (2015) ICLR
    • Simonyan, K.1    Zisserman, A.2
  • 24
    • 77955998009
    • Connecting modalities: Semisupervised segmentation and annotation of images using unaligned text corpora
    • R. Socher and L. Fei-Fei. Connecting modalities: Semisupervised segmentation and annotation of images using unaligned text corpora. In CVPR, 2010.
    • (2010) CVPR
    • Socher, R.1    Fei-Fei, L.2
  • 26
    • 84959932469
    • Integrating language and vision to generate natural language descriptions of videos in the wild
    • J. Thomason, S. Venugopalan, S. Guadarrama, K. Saenko, and R. Mooney. Integrating language and vision to generate natural language descriptions of videos in the wild. In COLING, 2014.
    • (2014) COLING
    • Thomason, J.1    Venugopalan, S.2    Guadarrama, S.3    Saenko, K.4    Mooney, R.5
  • 28
    • 84973865953
    • Learning spatiotemporal features with 3d convolutional networks
    • D. Tran, L. Bourdev, R. Fergus, L. Torresani, and M. Paluri. Learning spatiotemporal features with 3d convolutional networks. In ICCV, 2015.
    • (2015) ICCV
    • Tran, D.1    Bourdev, L.2    Fergus, R.3    Torresani, L.4    Paluri, M.5
  • 31
    • 84946747440
    • Show and tell: A neural image caption generator
    • O. Vinyals, A. Toshev, S. Bengio, and D. Erhan. Show and tell: A neural image caption generator. In CVPR, 2015.
    • (2015) CVPR
    • Vinyals, O.1    Toshev, A.2    Bengio, S.3    Erhan, D.4
  • 32
    • 84962921420
    • Modeling spatial-temporal clues in a hybrid deep learning framework for video classification
    • Z. Wu, X. Wang, Y.-G. Jiang, H. Ye, and X. Xue. Modeling spatial-temporal clues in a hybrid deep learning framework for video classification. MM, 2015.
    • (2015) MM
    • Wu, Z.1    Wang, X.2    Jiang, Y.-G.3    Ye, H.4    Xue, X.5
  • 33
    • 84952349307
    • Jointly modeling deep video and compositional text to bridge vision and language in a unified framework
    • R. Xu, C. Xiong, W. Chen, and J. J. Corso. Jointly modeling deep video and compositional text to bridge vision and language in a unified framework. In AAAI, 2015.
    • (2015) AAAI
    • Xu, R.1    Xiong, C.2    Chen, W.3    Corso, J.J.4
  • 35
    • 84973907734
    • Learning query and image similarities with ranking canonical correlation analysis
    • T. Yao, T. Mei, and C.-W. Ngo. Learning query and image similarities with ranking canonical correlation analysis. In ICCV, 2015.
    • (2015) ICCV
    • Yao, T.1    Mei, T.2    Ngo, C.-W.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.