Volume 2015 International Conference on Computer Vision, ICCV 2015, 2015, Pages 2641-2649

Flickr30k entities: Collecting region-to-phrase correspondences for richer image-to-sentence models

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER VISION;

EID: 84973856017     PISSN: 15505499     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICCV.2015.303     Document Type: Conference Paper
Times cited: 1701

References (41)
  • 5
  • 8
    • 84887365305
    • A sentence is worth a thousand pixels
    • S. Fidler, A. Sharma, and R. Urtasun. A sentence is worth a thousand pixels. In CVPR, 2013.
    • (2013) CVPR
    • Fidler, S.; Sharma, A.; Urtasun, R.
  • 9
    • 84894905366
    • A multi-view embedding space for modeling internet images, tags, and their semantics
    • Y. Gong, Q. Ke, M. Isard, and S. Lazebnik. A multi-view embedding space for modeling internet images, tags, and their semantics. IJCV, 106(2): 210-233, 2014.
    • (2014) IJCV, vol. 106, issue 2, pp. 210-233
    • Gong, Y.; Ke, Q.; Isard, M.; Lazebnik, S.
  • 10
    • 84959243872
    • Improving image-sentence embeddings using large weakly annotated photo collections
    • Y. Gong, L. Wang, M. Hodosh, J. Hockenmaier, and S. Lazebnik. Improving image-sentence embeddings using large weakly annotated photo collections. In ECCV, 2014.
    • (2014) ECCV
    • Gong, Y.; Wang, L.; Hodosh, M.; Hockenmaier, J.; Lazebnik, S.
  • 11
    • 38049183286
    • The IAPR TC-12 benchmark: A new evaluation resource for visual information systems
    • M. Grubinger, P. Clough, H. Müller, and T. Deselaers. The IAPR TC-12 benchmark: A new evaluation resource for visual information systems. In International Workshop OntoImage, pages 13-23, 2006.
    • (2006) International Workshop OntoImage, pp. 13-23
    • Grubinger, M.; Clough, P.; Müller, H.; Deselaers, T.
  • 12
    • 84883394520
    • Framing image description as a ranking task: Data, models and evaluation metrics
    • M. Hodosh, P. Young, and J. Hockenmaier. Framing image description as a ranking task: Data, models and evaluation metrics. JAIR, 2013.
    • (2013) JAIR
    • Hodosh, M.; Young, P.; Hockenmaier, J.
  • 13
    • 84862286506
    • Cross-caption coreference resolution for automatic image understanding
    • M. Hodosh, P. Young, C. Rashtchian, and J. Hockenmaier. Cross-caption coreference resolution for automatic image understanding. In CoNLL, pages 162-171. ACL, 2010.
    • (2010) CoNLL, pp. 162-171
    • Hodosh, M.; Young, P.; Rashtchian, C.; Hockenmaier, J.
  • 14
    • 0000107975
    • Relations between two sets of variates
    • H. Hotelling. Relations between two sets of variates. Biometrika, pages 321-377, 1936.
    • (1936) Biometrika, pp. 321-377
    • Hotelling, H.
  • 17
    • 84937843643
    • Deep fragment embeddings for bidirectional image sentence mapping
    • A. Karpathy, A. Joulin, and L. Fei-Fei. Deep fragment embeddings for bidirectional image sentence mapping. In NIPS, 2014.
    • (2014) NIPS
    • Karpathy, A.; Joulin, A.; Fei-Fei, L.
  • 18
    • 84943540775
    • ReferItGame: Referring to objects in photographs of natural scenes
    • S. Kazemzadeh, V. Ordonez, M. Matten, and T. Berg. ReferItGame: Referring to objects in photographs of natural scenes. In EMNLP, 2014.
    • (2014) EMNLP
    • Kazemzadeh, S.; Ordonez, V.; Matten, M.; Berg, T.
  • 20
    • 84965125568
    • Fisher vectors derived from hybrid Gaussian-Laplacian mixture models for image annotation
    • B. Klein, G. Lev, G. Sadeh, and L. Wolf. Fisher vectors derived from hybrid Gaussian-Laplacian mixture models for image annotation. CVPR, 2015.
    • (2015) CVPR
    • Klein, B.; Lev, G.; Sadeh, G.; Wolf, L.
  • 21
    • 84911370987
    • What are you talking about? Text-to-image coreference
    • C. Kong, D. Lin, M. Bansal, R. Urtasun, and S. Fidler. What are you talking about? Text-to-image coreference. In CVPR, 2014.
    • (2014) CVPR
    • Kong, C.; Lin, D.; Bansal, M.; Urtasun, R.; Fidler, S.
  • 27
    • 84898956512
    • Distributed representations of words and phrases and their compositionality
    • T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean. Distributed representations of words and phrases and their compositionality. In NIPS, 2013.
    • (2013) NIPS
    • Mikolov, T.; Sutskever, I.; Chen, K.; Corrado, G.S.; Dean, J.
  • 28
    • 85162522202
    • Im2Text: Describing images using 1 million captioned photographs
    • V. Ordonez, G. Kulkarni, and T. L. Berg. Im2Text: Describing images using 1 million captioned photographs. NIPS, 2011.
    • (2011) NIPS
    • Ordonez, V.; Kulkarni, G.; Berg, T.L.
  • 29
    • 79959771606
    • Improving the Fisher kernel for large-scale image classification
    • F. Perronnin, J. Sánchez, and T. Mensink. Improving the Fisher kernel for large-scale image classification. In ECCV, 2010.
    • (2010) ECCV
    • Perronnin, F.; Sánchez, J.; Mensink, T.
  • 30
    • 84943782750
    • Linking people in videos with "their" names using coreference resolution
    • V. Ramanathan, A. Joulin, P. Liang, and L. Fei-Fei. Linking people in videos with "their" names using coreference resolution. In ECCV, 2014.
    • (2014) ECCV
    • Ramanathan, V.; Joulin, A.; Liang, P.; Fei-Fei, L.
  • 33
    • 84893795422
    • Parsing with compositional vector grammars
    • R. Socher, J. Bauer, C. D. Manning, and A. Y. Ng. Parsing with compositional vector grammars. In ACL, 2013.
    • (2013) ACL
    • Socher, R.; Bauer, J.; Manning, C.D.; Ng, A.Y.
  • 34
    • 0039891959
    • A machine learning approach to coreference resolution of noun phrases
    • W. M. Soon, H. T. Ng, and D. C. Y. Lim. A machine learning approach to coreference resolution of noun phrases. Computational Linguistics, 27(4): 521-544, 2001.
    • (2001) Computational Linguistics, vol. 27, issue 4, pp. 521-544
    • Soon, W.M.; Ng, H.T.; Lim, D.C.Y.
  • 35
    • 52049123532
    • Utility data annotation with Amazon Mechanical Turk
    • A. Sorokin and D. Forsyth. Utility data annotation with Amazon Mechanical Turk. Internet Vision Workshop, 2008.
    • (2008) Internet Vision Workshop
    • Sorokin, A.; Forsyth, D.
  • 39
    • 77954862144
    • I2T: Image parsing to text description
    • B. Yao, X. Yang, L. Lin, M. W. Lee, and S.-C. Zhu. I2T: Image parsing to text description. Proc. IEEE, 98(8): 1485-1508, 2010.
    • (2010) Proc. IEEE, vol. 98, issue 8, pp. 1485-1508
    • Yao, B.; Yang, X.; Lin, L.; Lee, M.W.; Zhu, S.-C.
  • 40
    • 84906494296
    • From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions
    • P. Young, A. Lai, M. Hodosh, and J. Hockenmaier. From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions. TACL, 2: 67-78, 2014.
    • (2014) TACL, vol. 2, pp. 67-78
    • Young, P.; Lai, A.; Hodosh, M.; Hockenmaier, J.
  • 41
    • 84952018709
    • Edge boxes: Locating object proposals from edges
    • C. L. Zitnick and P. Dollár. Edge boxes: Locating object proposals from edges. In ECCV, 2014.
    • (2014) ECCV
    • Zitnick, C.L.; Dollár, P.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.