SCOPUS 정보 검색 플랫폼

BMVC 2014 - Proceedings of the British Machine Vision Conference 2014

Volumn , Issue , 2014, Pages

Im2Text and Text2Im: Associating images and texts for cross-modal retrieval

(2) Verma, Yashaswi a Jawahar, C V a

a INTERNATIONAL INSTITUTE OF INFORMATION TECHNOLOGY (India)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER VISION; FORECASTING; SEMANTICS; STATISTICS; SUPPORT VECTOR MACHINES;

CANONICAL CORRELATION ANALYSIS; LATENT DIRICHLET ALLOCATION; PREDICTION TASKS; QUERY IMAGES; RETRIEVAL FRAMEWORKS; SEMANTIC ASSOCIATIONS; TEXTUAL DATA; UNIFIED FORMULATIONS;

PROBABILITY DISTRIBUTIONS;

EID: 85088059797 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.5244/c.28.97 Document Type: Conference Paper

Times cited : (41)

References (40)

1
- 9444259451
- Latent dirichlet allocation
- D. Blei, A. Ng, and M. Jordan. Latent dirichlet allocation. JMLR, 12(1):234-278, 2003.
- (2003) JMLR , vol.12 , Issue.1 , pp. 234-278
- Blei, D.¹ Ng, A.² Jordan, M.³

2
- 43249093335
- Image retrieval: Ideas, influences and trends of new age
- R. Datta, D. Joshi, J. Li, and J. Wang. Image retrieval: Ideas, influences and trends of new age. ACM Computing Surveys, 40(2):1-60, 2008.
- (2008) ACM Computing Surveys , vol.40 , Issue.2 , pp. 1-60
- Datta, R.¹ Joshi, D.² Li, J.³ Wang, J.⁴

3
- 84911372708
- Multimodal learning in looselyorganized web images
- Kun Duan, David J. Crandall, and Dhruv Batra. Multimodal learning in looselyorganized web images. In CVPR, 2014.
- (2014) CVPR
- Duan, K.¹ Crandall, D.J.² Batra, D.³

4
- 0038401728
- Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary
- P. Duygulu, K. Barnard, J. F. G. de Freitas, and D. A. Forsyth. Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary. In ECCV, 2002.
- (2002) ECCV
- Duygulu, P.¹ Barnard, K.² De Freitas, J.F.G.³ Forsyth, D.A.⁴

5
- 80051961229
- Every picture tells a story: Generating sentences for images
- Ali Farhadi, Mohsen Hejrati, Amin Sadeghi, Peter Young, Cyrus Rashtchian, Julia Hockenmaier, and David Forsyth. Every picture tells a story: Generating sentences for images. In ECCV, 2010.
- (2010) ECCV
- Farhadi, A.¹ Hejrati, M.² Sadeghi, A.³ Young, P.⁴ Rashtchian, C.⁵ Hockenmaier, J.⁶ Forsyth, D.⁷

6
- 5044225521
- Multiple Bernoulli relevance models for image and video annotation
- S. L. Feng, R. Manmatha, and V. Lavrenko. Multiple Bernoulli relevance models for image and video annotation. In CVPR, 2004.
- (2004) CVPR
- Feng, S.L.¹ Manmatha, R.² Lavrenko, V.³

7
- 84894905366
- A multi-view embedding space for modeling internet images, tags, and their semantics
- Yunchao Gong, Qifa Ke, Michael Isard, and Svetlana Lazebnik. A multi-view embedding space for modeling internet images, tags, and their semantics. IJCV, 106(2): 210-233, 2013.
- (2013) IJCV , vol.106 , Issue.2 , pp. 210-233
- Gong, Y.¹ Ke, Q.² Isard, M.³ Lazebnik, S.⁴

8
- 70349458852
- PhD thesis, Victoria University, Melbourne, Australia
- M. Grubinger. Analysis and Evaluation of Visual Information Systems Performance. PhD thesis, Victoria University, Melbourne, Australia, 2007.
- (2007) Analysis and Evaluation of Visual Information Systems Performance
- Grubinger, M.¹

9
- 84898773262
- YouTube2Text: Recognizing and describing arbitrary activities using semantic hierarchies and zero-shot recognition
- Sergio Guadarrama, Niveda Krishnamoorthy, Girish Malkarnenkar, Subhashini Venugopalan, Raymond Mooney, Trevor Darrell, and Kate Saenko. YouTube2Text: Recognizing and describing arbitrary activities using semantic hierarchies and zero-shot recognition. In ICCV, 2013.
- (2013) ICCV
- Guadarrama, S.¹ Krishnamoorthy, N.² Malkarnenkar, G.³ Venugopalan, S.⁴ Mooney, R.⁵ Darrell, T.⁶ Saenko, K.⁷

10
- 77953202699
- Tagprop: Discriminative metric learning in nearest neighbour models for image auto-annotation
- M. Guillaumin, T. Mensink, J. Verbeek, and C. Schmid. Tagprop: Discriminative metric learning in nearest neighbour models for image auto-annotation. In ICCV, 2009.
- (2009) ICCV
- Guillaumin, M.¹ Mensink, T.² Verbeek, J.³ Schmid, C.⁴

11
- 85059866463
- Choosing linguistics over vision to describe images
- Ankush Gupta, Yashaswi Verma, and C. V. Jawahar. Choosing linguistics over vision to describe images. In AAAI, 2012.
- (2012) AAAI
- Gupta, A.¹ Verma, Y.² Jawahar, C.V.³

12
- 84883394520
- Framing image description as a ranking task: Data, models and evaluation metrics
- Micah Hodosh, Peter Young, and Julia Hockenmaier. Framing image description as a ranking task: Data, models and evaluation metrics. JAIR, 47:853-899, 2013.
- (2013) JAIR , vol.47 , pp. 853-899
- Hodosh, M.¹ Young, P.² Hockenmaier, J.³

13
- 0000107975
- Relations between two sets of variates
- H. Hotelling. Relations between two sets of variates. Biometrika, 28:321-377, 1936.
- (1936) Biometrika , vol.28 , pp. 321-377
- Hotelling, H.¹

14
- 80052901011
- Baby Talk: Understanding and generating simple image descriptions
- Girish Kulkarni, Visruth Premraj, Sagnik Dhar, Siming Li, Yejin Choi, Alexander C. Berg, and Tamara L. Berg. Baby Talk: Understanding and generating simple image descriptions. In CVPR, 2011.
- (2011) CVPR
- Kulkarni, G.¹ Premraj, V.² Dhar, S.³ Li, S.⁴ Choi, Y.⁵ Berg, A.C.⁶ Berg, T.L.⁷

15
- 84878189119
- Collective generation of natural image descriptions
- Polina Kuznetsova, Vicente Ordonez, Alexander C. Berg, Tamara L. Berg, and Yejin Choi. Collective generation of natural image descriptions. In ACL, 2012.
- (2012) ACL
- Kuznetsova, P.¹ Ordonez, V.² Berg, A.C.³ Berg, T.L.⁴ Choi, Y.⁵

16
- 84862279067
- Composing simple image descriptions using web-scale n-grams
- Siming Li, Girish Kulkarni, Tamara L. Berg, Alexander C. Berg, and Yejin Choi. Composing simple image descriptions using web-scale n-grams. In CoNLL, 2011.
- (2011) CoNLL
- Li, S.¹ Kulkarni, G.² Berg, T.L.³ Berg, A.C.⁴ Choi, Y.⁵

17
- 85016508365
- Automatic evaluation of summaries using n-gram co-occurrence statistics
- C.-Y. Lin and E. Hovy. Automatic evaluation of summaries using n-gram co-occurrence statistics. In NAACLHLT, 2003.
- (2003) NAACLHLT
- Lin, C.-Y.¹ Hovy, E.²

18
- 3042535216
- Distinctive image features from scale-invariant keypoints
- David G. Lowe. Distinctive image features from scale-invariant keypoints. IJCV, 60 (2):91-110, 2004.
- (2004) IJCV , vol.60 , Issue.2 , pp. 91-110
- Lowe, D.G.¹

19
- 70449580491
- A new baseline for image annotation
- Ameesh Makadia, Vladimir Pavlovic, and Sanjiv Kumar. A new baseline for image annotation. In ECCV, 2008.
- (2008) ECCV
- Makadia, A.¹ Pavlovic, V.² Kumar, S.³

20
- 0003596936
- Emerald Group Pub Ltd
- C. Meadow, B. Boyce, D. Kraft, and C. Barry. Text information retrieval systems. Emerald Group Pub Ltd, 2007.
- (2007) Text Information Retrieval Systems
- Meadow, C.¹ Boyce, B.² Kraft, D.³ Barry, C.⁴

21
- 85034832841
- Midge: Generating image descriptions from computer vision detections
- Margaret Mitchell, Jesse Dodge, Amit Goyal, Kota Yamaguchi, Karl Sratos, Xufeng Han, Alysssa Mensch, Alexander C. Berg, Tamara L. Berg, and Hal Daumé III. Midge: Generating image descriptions from computer vision detections. In EACL, 2012.
- (2012) EACL
- Mitchell, M.¹ Dodge, J.² Goyal, A.³ Yamaguchi, K.⁴ Sratos, K.⁵ Han, X.⁶ Mensch, A.⁷ Berg, A.C.⁸ Berg, T.L.⁹ Daumé, H.¹⁰

22
- 85162522202
- Im2text: Describing images using 1 million captioned photographs
- Vicente Ordonez, Girish Kulkarni, and Tamara L. Berg. Im2text: Describing images using 1 million captioned photographs. In NIPS, 2011.
- (2011) NIPS
- Ordonez, V.¹ Kulkarni, G.² Berg, T.L.³

23
- 85133336275
- Bleu: A method for automatic evaluation of machine translation
- K. Papineni, S. Roukos, T. Ward, and W. Zhu. Bleu: A method for automatic evaluation of machine translation. In ACL, 2002.
- (2002) ACL
- Papineni, K.¹ Roukos, S.² Ward, T.³ Zhu, W.⁴

24
- 77955899888
- Diversity in photo retrieval: Overview of the imageclefphoto task 2009
- M. Paramita, M. Sanderson, and P. Clough. Diversity in photo retrieval: overview of the imageclefphoto task 2009. CLEF working notes, 2009.
- (2009) CLEF Working Notes
- Paramita, M.¹ Sanderson, M.² Clough, P.³

25
- 85090348677
- Collecting image annotation using amazon's mechanical turk
- C. Rashtchian, P. Young, M. Hodosh, and J. Hockenmaier. Collecting image annotation using amazon's mechanical turk. In NAACLHLT Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, 2010.
- (2010) NAACLHLT Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk
- Rashtchian, C.¹ Young, P.² Hodosh, M.³ Hockenmaier, J.⁴

26
- 84887454767
- A new approach to cross-modal multimedia retrieval
- N. Rasiwasia, J. C. Pereira, E. Coviello, G. Doyle, G. R. G. Lanckriet, R. Levy, and N. Vasconcelos. A new approach to cross-modal multimedia retrieval. In ACM MM, 2010.
- (2010) ACM MM
- Rasiwasia, N.¹ Pereira, J.C.² Coviello, E.³ Doyle, G.⁴ Lanckriet, G.R.G.⁵ Levy, R.⁶ Vasconcelos, N.⁷

27
- 84898493831
- Label embedding for text recognition
- Jose Rodriguez and Florent Perronnin. Label embedding for text recognition. In BMVC, 2013.
- (2013) BMVC
- Rodriguez, J.¹ Perronnin, F.²

28
- 84898775239
- Translating video content to natural language descriptions
- Marcus Rohrbach, Wei Qiu, and Ivan Titov. Translating video content to natural language descriptions. In ICCV, 2013.
- (2013) ICCV
- Rohrbach, M.¹ Qiu, W.² Titov, I.³

29
- 80052889458
- Recognition using visual phrases
- M. A. Sadeghi and A. Farhadi. Recognition using visual phrases. In CVPR, 2011.
- (2011) CVPR
- Sadeghi, M.A.¹ Farhadi, A.²

30
- 34547401486
- Evaluation campaigns and trecvid
- A. F. Smeaton, P. Over, andW. Kraaij. Evaluation campaigns and trecvid. In MIR: Proceedings of the 8th ACM International Workshop on Multimedia Information Retrieval, 2006.
- (2006) MIR: Proceedings of the 8th ACM International Workshop on Multimedia Information Retrieval
- Smeaton, A.F.¹ Over, P.² Kraaij, W.³

31
- 0034498523
- Content-based image retrieval at the end of the early years
- A. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain. Content-based image retrieval at the end of the early years. PAMI, 22(12):1349-1380, 2000.
- (2000) PAMI , vol.22 , Issue.12 , pp. 1349-1380
- Smeulders, A.¹ Worring, M.² Santini, S.³ Gupta, A.⁴ Jain, R.⁵

32
- 14344250451
- Support vector machine learning for interdependent and structured output spaces
- Ioannis Tsochantaridis, Thomas Hofmann, Thorsten Joachims, and Yasemin Altun. Support vector machine learning for interdependent and structured output spaces. In ICML, 2004.
- (2004) ICML
- Tsochantaridis, I.¹ Hofmann, T.² Joachims, T.³ Altun, Y.⁴

33
- 84919753222
- Understanding images with natural sentences
- Yoshitaka Ushiku, Tatsuya Harada, and Yasuo Kuniyoshi. Understanding images with natural sentences. In ACM MM, 2011.
- (2011) ACM MM
- Ushiku, Y.¹ Harada, T.² Kuniyoshi, Y.³

34
- 84897541533
- A. Vedaldi. A MATLAB wrapper of SVMstruct. http://www.vlfeat.org/~vedaldi/code/svm-struct-matlab.html, 2011.
- (2011) A MATLAB Wrapper of SVMstruct
- Vedaldi, A.¹

35
- 84885412937
- Image annotation using metric learning in semantic neighbourhoods
- Yashaswi Verma and C. V. Jawahar. Image annotation using metric learning in semantic neighbourhoods. In ECCV, 2012.
- (2012) ECCV
- Verma, Y.¹ Jawahar, C.V.²

36
- 84898490664
- Exploring SVM for image annotation in presence of confusing labels
- Yashaswi Verma and C. V. Jawahar. Exploring SVM for image annotation in presence of confusing labels. In BMVC, 2013.
- (2013) BMVC
- Verma, Y.¹ Jawahar, C.V.²

37
- 84884963254
- Generating image descriptions using semantic similarities in the output space
- Yashaswi Verma, Ankush Gupta, Prashanth Mannem, and C. V. Jawahar. Generating image descriptions using semantic similarities in the output space. In V&L Net Workshop on Language for Vision, in conjunction with CVPR, 2013.
- (2013) V&L Net Workshop on Language for Vision, in Conjunction with CVPR
- Verma, Y.¹ Gupta, A.² Mannem, P.³ Jawahar, C.V.⁴

38
- 84867117593
- WSABIE: Scaling up to large vocabulary image annotation
- Jason Weston, Samy Bengio, and Nicolas Usunier. WSABIE: Scaling up to large vocabulary image annotation. In IJCAI, 2011.
- (2011) IJCAI
- Weston, J.¹ Bengio, S.² Usunier, N.³

39
- 80053258778
- Corpus-guided sentence generation of natural images
- Y. Yang, C. L. Teo, Hal Daumé III, and Y. Aloimonos. Corpus-guided sentence generation of natural images. In EMNLP, 2011.
- (2011) EMNLP
- Yang, Y.¹ Teo, C.L.² Daumé, H.³ Aloimonos, Y.⁴

40
- 84885873069
- I2T: Image parsing to text description
- B. Z. Yao, X. Yang, L. Lin, M. W. Lee, and S.-C. Zhu. I2T: Image parsing to text description. In Proceedings of the IEEE, 2008.
- (2008) Proceedings of the IEEE
- Yao, B.Z.¹ Yang, X.² Lin, L.³ Lee, M.W.⁴ Zhu, S.-C.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.