메뉴 건너뛰기




Volumn , Issue , 2014, Pages

Im2Text and Text2Im: Associating images and texts for cross-modal retrieval

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER VISION; FORECASTING; SEMANTICS; STATISTICS; SUPPORT VECTOR MACHINES;

EID: 85088059797     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.5244/c.28.97     Document Type: Conference Paper
Times cited : (41)

References (40)
  • 1
    • 9444259451 scopus 로고    scopus 로고
    • Latent dirichlet allocation
    • D. Blei, A. Ng, and M. Jordan. Latent dirichlet allocation. JMLR, 12(1):234-278, 2003.
    • (2003) JMLR , vol.12 , Issue.1 , pp. 234-278
    • Blei, D.1    Ng, A.2    Jordan, M.3
  • 2
    • 43249093335 scopus 로고    scopus 로고
    • Image retrieval: Ideas, influences and trends of new age
    • R. Datta, D. Joshi, J. Li, and J. Wang. Image retrieval: Ideas, influences and trends of new age. ACM Computing Surveys, 40(2):1-60, 2008.
    • (2008) ACM Computing Surveys , vol.40 , Issue.2 , pp. 1-60
    • Datta, R.1    Joshi, D.2    Li, J.3    Wang, J.4
  • 3
    • 84911372708 scopus 로고    scopus 로고
    • Multimodal learning in looselyorganized web images
    • Kun Duan, David J. Crandall, and Dhruv Batra. Multimodal learning in looselyorganized web images. In CVPR, 2014.
    • (2014) CVPR
    • Duan, K.1    Crandall, D.J.2    Batra, D.3
  • 4
    • 0038401728 scopus 로고    scopus 로고
    • Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary
    • P. Duygulu, K. Barnard, J. F. G. de Freitas, and D. A. Forsyth. Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary. In ECCV, 2002.
    • (2002) ECCV
    • Duygulu, P.1    Barnard, K.2    De Freitas, J.F.G.3    Forsyth, D.A.4
  • 6
    • 5044225521 scopus 로고    scopus 로고
    • Multiple Bernoulli relevance models for image and video annotation
    • S. L. Feng, R. Manmatha, and V. Lavrenko. Multiple Bernoulli relevance models for image and video annotation. In CVPR, 2004.
    • (2004) CVPR
    • Feng, S.L.1    Manmatha, R.2    Lavrenko, V.3
  • 7
    • 84894905366 scopus 로고    scopus 로고
    • A multi-view embedding space for modeling internet images, tags, and their semantics
    • Yunchao Gong, Qifa Ke, Michael Isard, and Svetlana Lazebnik. A multi-view embedding space for modeling internet images, tags, and their semantics. IJCV, 106(2): 210-233, 2013.
    • (2013) IJCV , vol.106 , Issue.2 , pp. 210-233
    • Gong, Y.1    Ke, Q.2    Isard, M.3    Lazebnik, S.4
  • 9
    • 84898773262 scopus 로고    scopus 로고
    • YouTube2Text: Recognizing and describing arbitrary activities using semantic hierarchies and zero-shot recognition
    • Sergio Guadarrama, Niveda Krishnamoorthy, Girish Malkarnenkar, Subhashini Venugopalan, Raymond Mooney, Trevor Darrell, and Kate Saenko. YouTube2Text: Recognizing and describing arbitrary activities using semantic hierarchies and zero-shot recognition. In ICCV, 2013.
    • (2013) ICCV
    • Guadarrama, S.1    Krishnamoorthy, N.2    Malkarnenkar, G.3    Venugopalan, S.4    Mooney, R.5    Darrell, T.6    Saenko, K.7
  • 10
    • 77953202699 scopus 로고    scopus 로고
    • Tagprop: Discriminative metric learning in nearest neighbour models for image auto-annotation
    • M. Guillaumin, T. Mensink, J. Verbeek, and C. Schmid. Tagprop: Discriminative metric learning in nearest neighbour models for image auto-annotation. In ICCV, 2009.
    • (2009) ICCV
    • Guillaumin, M.1    Mensink, T.2    Verbeek, J.3    Schmid, C.4
  • 11
    • 85059866463 scopus 로고    scopus 로고
    • Choosing linguistics over vision to describe images
    • Ankush Gupta, Yashaswi Verma, and C. V. Jawahar. Choosing linguistics over vision to describe images. In AAAI, 2012.
    • (2012) AAAI
    • Gupta, A.1    Verma, Y.2    Jawahar, C.V.3
  • 12
    • 84883394520 scopus 로고    scopus 로고
    • Framing image description as a ranking task: Data, models and evaluation metrics
    • Micah Hodosh, Peter Young, and Julia Hockenmaier. Framing image description as a ranking task: Data, models and evaluation metrics. JAIR, 47:853-899, 2013.
    • (2013) JAIR , vol.47 , pp. 853-899
    • Hodosh, M.1    Young, P.2    Hockenmaier, J.3
  • 13
    • 0000107975 scopus 로고
    • Relations between two sets of variates
    • H. Hotelling. Relations between two sets of variates. Biometrika, 28:321-377, 1936.
    • (1936) Biometrika , vol.28 , pp. 321-377
    • Hotelling, H.1
  • 14
    • 80052901011 scopus 로고    scopus 로고
    • Baby Talk: Understanding and generating simple image descriptions
    • Girish Kulkarni, Visruth Premraj, Sagnik Dhar, Siming Li, Yejin Choi, Alexander C. Berg, and Tamara L. Berg. Baby Talk: Understanding and generating simple image descriptions. In CVPR, 2011.
    • (2011) CVPR
    • Kulkarni, G.1    Premraj, V.2    Dhar, S.3    Li, S.4    Choi, Y.5    Berg, A.C.6    Berg, T.L.7
  • 15
    • 84878189119 scopus 로고    scopus 로고
    • Collective generation of natural image descriptions
    • Polina Kuznetsova, Vicente Ordonez, Alexander C. Berg, Tamara L. Berg, and Yejin Choi. Collective generation of natural image descriptions. In ACL, 2012.
    • (2012) ACL
    • Kuznetsova, P.1    Ordonez, V.2    Berg, A.C.3    Berg, T.L.4    Choi, Y.5
  • 16
    • 84862279067 scopus 로고    scopus 로고
    • Composing simple image descriptions using web-scale n-grams
    • Siming Li, Girish Kulkarni, Tamara L. Berg, Alexander C. Berg, and Yejin Choi. Composing simple image descriptions using web-scale n-grams. In CoNLL, 2011.
    • (2011) CoNLL
    • Li, S.1    Kulkarni, G.2    Berg, T.L.3    Berg, A.C.4    Choi, Y.5
  • 17
    • 85016508365 scopus 로고    scopus 로고
    • Automatic evaluation of summaries using n-gram co-occurrence statistics
    • C.-Y. Lin and E. Hovy. Automatic evaluation of summaries using n-gram co-occurrence statistics. In NAACLHLT, 2003.
    • (2003) NAACLHLT
    • Lin, C.-Y.1    Hovy, E.2
  • 18
    • 3042535216 scopus 로고    scopus 로고
    • Distinctive image features from scale-invariant keypoints
    • David G. Lowe. Distinctive image features from scale-invariant keypoints. IJCV, 60 (2):91-110, 2004.
    • (2004) IJCV , vol.60 , Issue.2 , pp. 91-110
    • Lowe, D.G.1
  • 19
    • 70449580491 scopus 로고    scopus 로고
    • A new baseline for image annotation
    • Ameesh Makadia, Vladimir Pavlovic, and Sanjiv Kumar. A new baseline for image annotation. In ECCV, 2008.
    • (2008) ECCV
    • Makadia, A.1    Pavlovic, V.2    Kumar, S.3
  • 22
    • 85162522202 scopus 로고    scopus 로고
    • Im2text: Describing images using 1 million captioned photographs
    • Vicente Ordonez, Girish Kulkarni, and Tamara L. Berg. Im2text: Describing images using 1 million captioned photographs. In NIPS, 2011.
    • (2011) NIPS
    • Ordonez, V.1    Kulkarni, G.2    Berg, T.L.3
  • 23
    • 85133336275 scopus 로고    scopus 로고
    • Bleu: A method for automatic evaluation of machine translation
    • K. Papineni, S. Roukos, T. Ward, and W. Zhu. Bleu: A method for automatic evaluation of machine translation. In ACL, 2002.
    • (2002) ACL
    • Papineni, K.1    Roukos, S.2    Ward, T.3    Zhu, W.4
  • 24
    • 77955899888 scopus 로고    scopus 로고
    • Diversity in photo retrieval: Overview of the imageclefphoto task 2009
    • M. Paramita, M. Sanderson, and P. Clough. Diversity in photo retrieval: overview of the imageclefphoto task 2009. CLEF working notes, 2009.
    • (2009) CLEF Working Notes
    • Paramita, M.1    Sanderson, M.2    Clough, P.3
  • 27
    • 84898493831 scopus 로고    scopus 로고
    • Label embedding for text recognition
    • Jose Rodriguez and Florent Perronnin. Label embedding for text recognition. In BMVC, 2013.
    • (2013) BMVC
    • Rodriguez, J.1    Perronnin, F.2
  • 28
    • 84898775239 scopus 로고    scopus 로고
    • Translating video content to natural language descriptions
    • Marcus Rohrbach, Wei Qiu, and Ivan Titov. Translating video content to natural language descriptions. In ICCV, 2013.
    • (2013) ICCV
    • Rohrbach, M.1    Qiu, W.2    Titov, I.3
  • 29
    • 80052889458 scopus 로고    scopus 로고
    • Recognition using visual phrases
    • M. A. Sadeghi and A. Farhadi. Recognition using visual phrases. In CVPR, 2011.
    • (2011) CVPR
    • Sadeghi, M.A.1    Farhadi, A.2
  • 31
    • 0034498523 scopus 로고    scopus 로고
    • Content-based image retrieval at the end of the early years
    • A. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain. Content-based image retrieval at the end of the early years. PAMI, 22(12):1349-1380, 2000.
    • (2000) PAMI , vol.22 , Issue.12 , pp. 1349-1380
    • Smeulders, A.1    Worring, M.2    Santini, S.3    Gupta, A.4    Jain, R.5
  • 32
    • 14344250451 scopus 로고    scopus 로고
    • Support vector machine learning for interdependent and structured output spaces
    • Ioannis Tsochantaridis, Thomas Hofmann, Thorsten Joachims, and Yasemin Altun. Support vector machine learning for interdependent and structured output spaces. In ICML, 2004.
    • (2004) ICML
    • Tsochantaridis, I.1    Hofmann, T.2    Joachims, T.3    Altun, Y.4
  • 33
    • 84919753222 scopus 로고    scopus 로고
    • Understanding images with natural sentences
    • Yoshitaka Ushiku, Tatsuya Harada, and Yasuo Kuniyoshi. Understanding images with natural sentences. In ACM MM, 2011.
    • (2011) ACM MM
    • Ushiku, Y.1    Harada, T.2    Kuniyoshi, Y.3
  • 35
    • 84885412937 scopus 로고    scopus 로고
    • Image annotation using metric learning in semantic neighbourhoods
    • Yashaswi Verma and C. V. Jawahar. Image annotation using metric learning in semantic neighbourhoods. In ECCV, 2012.
    • (2012) ECCV
    • Verma, Y.1    Jawahar, C.V.2
  • 36
    • 84898490664 scopus 로고    scopus 로고
    • Exploring SVM for image annotation in presence of confusing labels
    • Yashaswi Verma and C. V. Jawahar. Exploring SVM for image annotation in presence of confusing labels. In BMVC, 2013.
    • (2013) BMVC
    • Verma, Y.1    Jawahar, C.V.2
  • 38
    • 84867117593 scopus 로고    scopus 로고
    • WSABIE: Scaling up to large vocabulary image annotation
    • Jason Weston, Samy Bengio, and Nicolas Usunier. WSABIE: Scaling up to large vocabulary image annotation. In IJCAI, 2011.
    • (2011) IJCAI
    • Weston, J.1    Bengio, S.2    Usunier, N.3
  • 39
    • 80053258778 scopus 로고    scopus 로고
    • Corpus-guided sentence generation of natural images
    • Y. Yang, C. L. Teo, Hal Daumé III, and Y. Aloimonos. Corpus-guided sentence generation of natural images. In EMNLP, 2011.
    • (2011) EMNLP
    • Yang, Y.1    Teo, C.L.2    Daumé, H.3    Aloimonos, Y.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.