



Volume 2015 International Conference on Computer Vision, ICCV 2015, 2015, Pages 2668-2676

Common subspace for model and similarity: Phrase learning for caption generation from images

Author keywords

[No Author keywords available]

Indexed keywords

NATURAL LANGUAGE PROCESSING SYSTEMS; SAMPLING; VECTORS;

EID: 84973861187     PISSN: 15505499     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICCV.2015.306     Document Type: Conference Paper
Times cited: 57

References (48)
  • 1
    • 77951458444
    • An online algorithm for large scale image similarity learning
    • 3
    • G. Chechik, U. Shalit, V. Sharma, and S. Bengio. An online algorithm for large scale image similarity learning. In NIPS, 2009. 3
    • (2009) NIPS
    • Chechik, G.1    Shalit, U.2    Sharma, V.3    Bengio, S.4
  • 2
    • 84957029470
    • Mind's eye: A recurrent visual representation for image caption generation
    • 3, 7
    • X. Chen and C. L. Zitnick. Mind's eye: A recurrent visual representation for image caption generation. In CVPR, 2015. 3, 7
    • (2015) CVPR
    • Chen, X.1    Zitnick, C.L.2
  • 3
    • 84455207551
    • Automatic evaluation of machine translation quality using n-gram co-occurrence statistics
    • 6
    • G. Doddington. Automatic evaluation of machine translation quality using n-gram co-occurrence statistics. In HLT, 2002. 6
    • (2002) HLT
    • Doddington, G.1
  • 7
    • 84887839738
    • Phrasal recognition
    • 1, 3
    • A. Farhadi and M. A. Sadeghi. Phrasal recognition. PAMI, 35 (12): 2854-65, 2013. 1, 3
    • (2013) PAMI , vol.35 , Issue.12 , pp. 2854-2865
    • Farhadi, A.1    Sadeghi, M.A.2
  • 9
    • 84973931408
    • From image annotation to image description
    • 5, 6, 13
    • A. Gupta and P. Mannem. From image annotation to image description. In ICONIP, 2012. 5, 6, 13
    • (2012) ICONIP
    • Gupta, A.1    Mannem, P.2
  • 10
    • 85059866463
    • Choosing linguistics over vision to describe images
    • 1, 3, 5, 7
    • A. Gupta, Y. Verma, and C. V. Jawahar. Choosing linguistics over vision to describe images. In AAAI, 2012. 1, 3, 5, 7
    • (2012) AAAI
    • Gupta, A.1    Verma, Y.2    Jawahar, C.V.3
  • 12
    • 84883394520
    • Framing image description as a ranking task: Data, models and evaluation metrics
    • 2
    • M. Hodosh, P. Young, and J. Hockenmaier. Framing image description as a ranking task: Data, models and evaluation metrics. JAIR, 47: 853-899, 2013. 2
    • (2013) JAIR , vol.47 , pp. 853-899
    • Hodosh, M.1    Young, P.2    Hockenmaier, J.3
  • 14
    • 84946734827
    • Deep visual-semantic alignments for generating image descriptions
    • 3, 7, 8
    • A. Karpathy and L. Fei-Fei. Deep visual-semantic alignments for generating image descriptions. In CVPR, 2015. 3, 7, 8
    • (2015) CVPR
    • Karpathy, A.1    Fei-Fei, L.2
  • 15
    • 84944113729
    • Unifying visual-semantic embeddings with multimodal neural language models
    • 3
    • R. Kiros, R. Salakhutdinov, and R. S. Zemel. Unifying visual-semantic embeddings with multimodal neural language models. In NIPS, 2014. 3
    • (2014) NIPS
    • Kiros, R.1    Salakhutdinov, R.2    Zemel, R.S.3
  • 16
    • 85146417759
    • Accurate unlexicalized parsing
    • 5
    • D. Klein and C. D. Manning. Accurate unlexicalized parsing. In ACL, 2003. 5
    • (2003) ACL
    • Klein, D.1    Manning, C.D.2
  • 17
    • 84876231242
    • Imagenet classification with deep convolutional neural networks
    • 1, 3, 5, 7
    • A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. In NIPS, 2012. 1, 3, 5, 7
    • (2012) NIPS
    • Krizhevsky, A.1    Sutskever, I.2    Hinton, G.E.3
  • 20
    • 84878189119
    • Collective generation of natural image descriptions
    • 1, 2, 5, 7
    • P. Kuznetsova, V. Ordonez, A. C. Berg, T. L. Berg, and Y. Choi. Collective generation of natural image descriptions. In ACL, 2012. 1, 2, 5, 7
    • (2012) ACL
    • Kuznetsova, P.1    Ordonez, V.2    Berg, A.C.3    Berg, T.L.4    Choi, Y.5
  • 21
    • 52149112996
    • Meteor: An automatic metric for MT evaluation with high levels of correlation with human judgments
    • 7
    • A. Lavie and A. Agarwal. Meteor: An automatic metric for MT evaluation with high levels of correlation with human judgments. In ACL WMT, 2007. 7
    • (2007) ACL WMT
    • Lavie, A.1    Agarwal, A.2
  • 22
    • 84862279067
    • Composing simple image descriptions using web-scale n-grams
    • 1, 2, 3
    • S. Li, G. Kulkarni, T. L. Berg, A. C. Berg, and Y. Choi. Composing simple image descriptions using web-scale n-grams. In CoNLL, 2011. 1, 2, 3
    • (2011) CoNLL
    • Li, S.1    Kulkarni, G.2    Berg, T.L.3    Berg, A.C.4    Choi, Y.5
  • 24
    • 3042535216
    • Distinctive image features from scale-invariant keypoints
    • 5, 10
    • D. G. Lowe. Distinctive image features from scale-invariant keypoints. IJCV, (2): 91-110, 2004. 5, 10
    • (2004) IJCV , Issue.2 , pp. 91-110
    • Lowe, D.G.1
  • 25
    • 34948830130
    • Semantic hierarchies for visual object recognition
    • 2
    • M. Marszalek and C. Schmid. Semantic hierarchies for visual object recognition. In CVPR, 2007. 2
    • (2007) CVPR
    • Marszalek, M.1    Schmid, C.2
  • 27
    • 84883488616
    • Metric learning for large scale image classification: Generalizing to new classes at near-zero cost
    • 2, 3, 11
    • T. Mensink, J. Verbeek, F. Perronnin, and G. Csurka. Metric learning for large scale image classification: Generalizing to new classes at near-zero cost. In ECCV, 2012. 2, 3, 11
    • (2012) ECCV
    • Mensink, T.1    Verbeek, J.2    Perronnin, F.3    Csurka, G.4
  • 29
    • 85162522202
    • Im2text: Describing images using 1 million captioned photographs
    • 2, 5, 7
    • V. Ordonez, G. Kulkarni, and T. L. Berg. Im2text: Describing images using 1 million captioned photographs. In NIPS, 2011. 2, 5, 7
    • (2011) NIPS
    • Ordonez, V.1    Kulkarni, G.2    Berg, T.L.3
  • 30
    • 85133336275
    • Bleu: A method for automatic evaluation of machine translation
    • 6
    • K. Papineni, S. Roukos, T. Ward, and W.-J. Zhu. Bleu: A method for automatic evaluation of machine translation. In ACL, 2002. 6
    • (2002) ACL
    • Papineni, K.1    Roukos, S.2    Ward, T.3    Zhu, W.-J.4
  • 31
    • 79959771606
    • Improving the fisher kernel for large-scale image classification
    • 5
    • F. Perronnin, J. Sánchez, and T. Mensink. Improving the fisher kernel for large-scale image classification. In ECCV, 2010. 5
    • (2010) ECCV
    • Perronnin, F.1    Sánchez, J.2    Mensink, T.3
  • 35
    • 80052889458
    • Recognition using visual phrases
    • 1, 3
    • M. A. Sadeghi and A. Farhadi. Recognition using visual phrases. In CVPR, 2011. 1, 3
    • (2011) CVPR
    • Sadeghi, M.A.1    Farhadi, A.2
  • 36
    • 80052905403
    • Learning to share visual appearance for multiclass object detection
    • 2
    • R. Salakhutdinov, A. Torralba, and J. Tenenbaum. Learning to share visual appearance for multiclass object detection. In CVPR, 2011. 2
    • (2011) CVPR
    • Salakhutdinov, R.1    Torralba, A.2    Tenenbaum, J.3
  • 37
    • 80052885179
    • High-dimensional signature compression for large-scale image classification
    • 1, 3
    • J. Sánchez and F. Perronnin. High-dimensional signature compression for large-scale image classification. In CVPR, 2011. 1, 3
    • (2011) CVPR
    • Sánchez, J.1    Perronnin, F.2
  • 38
    • 0031268931
    • Bidirectional recurrent neural networks
    • 7
    • M. Schuster and K. K. Paliwal. Bidirectional recurrent neural networks. TSP, 45 (11): 2673-2681, 1997. 7
    • (1997) TSP , vol.45 , Issue.11 , pp. 2673-2681
    • Schuster, M.1    Paliwal, K.K.2
  • 39
    • 84943761635
    • Very deep convolutional networks for large-scale image recognition
    • 5, 7
    • K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. In CVPR, 2015. 5, 7
    • (2015) CVPR
    • Simonyan, K.1    Zisserman, A.2
  • 41
    • 84871392832
    • Efficient image annotation for automatic sentence generation
    • 1, 3, 4, 5, 6, 7, 11, 12
    • Y. Ushiku, T. Harada, and Y. Kuniyoshi. Efficient image annotation for automatic sentence generation. In ACMMM, 2012. 1, 3, 4, 5, 6, 7, 11, 12
    • (2012) ACMMM
    • Ushiku, Y.1    Harada, T.2    Kuniyoshi, Y.3
  • 42
    • 25844477556
    • Less: A model-based classifier for sparse subspaces
    • 2, 3
    • C. J. Veenman and D. M. Tax. Less: A model-based classifier for sparse subspaces. PAMI, 27 (9): 1496-500, 2005. 2, 3
    • (2005) PAMI , vol.27 , Issue.9 , pp. 1496-1500
    • Veenman, C.J.1    Tax, D.M.2
  • 44
    • 84946747440
    • Show and tell: A neural image caption generator
    • 3, 7, 8
    • O. Vinyals, A. Toshev, S. Bengio, and D. Erhan. Show and tell: A neural image caption generator. In CVPR, 2015. 3, 7, 8
    • (2015) CVPR
    • Vinyals, O.1    Toshev, A.2    Bengio, S.3    Erhan, D.4
  • 45
    • 33749550361
    • Distance metric learning for large margin nearest neighbor classification
    • 3, 7
    • K. Q. Weinberger, J. Blitzer, and L. K. Saul. Distance metric learning for large margin nearest neighbor classification. In NIPS, 2006. 3, 7
    • (2006) NIPS
    • Weinberger, K.Q.1    Blitzer, J.2    Saul, L.K.3
  • 46
    • 77955654853
    • Large scale image annotation: Learning to rank with joint word-image embeddings
    • 4, 11
    • J. Weston, S. Bengio, and N. Usunier. Large scale image annotation: Learning to rank with joint word-image embeddings. Machine Learning, 81: 21-35, 2010. 4, 11
    • (2010) Machine Learning , vol.81 , pp. 21-35
    • Weston, J.1    Bengio, S.2    Usunier, N.3
  • 47
    • 84867117593
    • Wsabie: Scaling up to large vocabulary image annotation
    • 2, 3, 4, 11
    • J. Weston, S. Bengio, and N. Usunier. Wsabie: Scaling up to large vocabulary image annotation. In IJCAI, 2011. 2, 3, 4, 11
    • (2011) IJCAI
    • Weston, J.1    Bengio, S.2    Usunier, N.3
  • 48
    • 80053258778
    • Corpus-guided sentence generation of natural images
    • 1, 2, 5, 7, 8
    • Y. Yang, C. L. Teo, H. Daumé III, and Y. Aloimonos. Corpus-guided sentence generation of natural images. In EMNLP, 2011. 1, 2, 5, 7, 8
    • (2011) EMNLP
    • Yang, Y.1    Teo, C.L.2    Daumé, H.3    Aloimonos, Y.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.