-
1
-
-
9444259451
-
Latent dirichlet allocation
-
D. Blei, A. Ng, and M. Jordan. Latent dirichlet allocation. JMLR, 12(1):234-278, 2003.
-
(2003)
JMLR
, vol.12
, Issue.1
, pp. 234-278
-
-
Blei, D.1
Ng, A.2
Jordan, M.3
-
2
-
-
43249093335
-
Image retrieval: Ideas, influences and trends of new age
-
R. Datta, D. Joshi, J. Li, and J. Wang. Image retrieval: Ideas, influences and trends of new age. ACM Computing Surveys, 40(2):1-60, 2008.
-
(2008)
ACM Computing Surveys
, vol.40
, Issue.2
, pp. 1-60
-
-
Datta, R.1
Joshi, D.2
Li, J.3
Wang, J.4
-
3
-
-
84911372708
-
Multimodal learning in looselyorganized web images
-
Kun Duan, David J. Crandall, and Dhruv Batra. Multimodal learning in looselyorganized web images. In CVPR, 2014.
-
(2014)
CVPR
-
-
Duan, K.1
Crandall, D.J.2
Batra, D.3
-
4
-
-
0038401728
-
Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary
-
P. Duygulu, K. Barnard, J. F. G. de Freitas, and D. A. Forsyth. Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary. In ECCV, 2002.
-
(2002)
ECCV
-
-
Duygulu, P.1
Barnard, K.2
De Freitas, J.F.G.3
Forsyth, D.A.4
-
5
-
-
80051961229
-
Every picture tells a story: Generating sentences for images
-
Ali Farhadi, Mohsen Hejrati, Amin Sadeghi, Peter Young, Cyrus Rashtchian, Julia Hockenmaier, and David Forsyth. Every picture tells a story: Generating sentences for images. In ECCV, 2010.
-
(2010)
ECCV
-
-
Farhadi, A.1
Hejrati, M.2
Sadeghi, A.3
Young, P.4
Rashtchian, C.5
Hockenmaier, J.6
Forsyth, D.7
-
6
-
-
5044225521
-
Multiple Bernoulli relevance models for image and video annotation
-
S. L. Feng, R. Manmatha, and V. Lavrenko. Multiple Bernoulli relevance models for image and video annotation. In CVPR, 2004.
-
(2004)
CVPR
-
-
Feng, S.L.1
Manmatha, R.2
Lavrenko, V.3
-
7
-
-
84894905366
-
A multi-view embedding space for modeling internet images, tags, and their semantics
-
Yunchao Gong, Qifa Ke, Michael Isard, and Svetlana Lazebnik. A multi-view embedding space for modeling internet images, tags, and their semantics. IJCV, 106(2): 210-233, 2013.
-
(2013)
IJCV
, vol.106
, Issue.2
, pp. 210-233
-
-
Gong, Y.1
Ke, Q.2
Isard, M.3
Lazebnik, S.4
-
9
-
-
84898773262
-
YouTube2Text: Recognizing and describing arbitrary activities using semantic hierarchies and zero-shot recognition
-
Sergio Guadarrama, Niveda Krishnamoorthy, Girish Malkarnenkar, Subhashini Venugopalan, Raymond Mooney, Trevor Darrell, and Kate Saenko. YouTube2Text: Recognizing and describing arbitrary activities using semantic hierarchies and zero-shot recognition. In ICCV, 2013.
-
(2013)
ICCV
-
-
Guadarrama, S.1
Krishnamoorthy, N.2
Malkarnenkar, G.3
Venugopalan, S.4
Mooney, R.5
Darrell, T.6
Saenko, K.7
-
10
-
-
77953202699
-
Tagprop: Discriminative metric learning in nearest neighbour models for image auto-annotation
-
M. Guillaumin, T. Mensink, J. Verbeek, and C. Schmid. Tagprop: Discriminative metric learning in nearest neighbour models for image auto-annotation. In ICCV, 2009.
-
(2009)
ICCV
-
-
Guillaumin, M.1
Mensink, T.2
Verbeek, J.3
Schmid, C.4
-
11
-
-
85059866463
-
Choosing linguistics over vision to describe images
-
Ankush Gupta, Yashaswi Verma, and C. V. Jawahar. Choosing linguistics over vision to describe images. In AAAI, 2012.
-
(2012)
AAAI
-
-
Gupta, A.1
Verma, Y.2
Jawahar, C.V.3
-
12
-
-
84883394520
-
Framing image description as a ranking task: Data, models and evaluation metrics
-
Micah Hodosh, Peter Young, and Julia Hockenmaier. Framing image description as a ranking task: Data, models and evaluation metrics. JAIR, 47:853-899, 2013.
-
(2013)
JAIR
, vol.47
, pp. 853-899
-
-
Hodosh, M.1
Young, P.2
Hockenmaier, J.3
-
13
-
-
0000107975
-
Relations between two sets of variates
-
H. Hotelling. Relations between two sets of variates. Biometrika, 28:321-377, 1936.
-
(1936)
Biometrika
, vol.28
, pp. 321-377
-
-
Hotelling, H.1
-
14
-
-
80052901011
-
Baby Talk: Understanding and generating simple image descriptions
-
Girish Kulkarni, Visruth Premraj, Sagnik Dhar, Siming Li, Yejin Choi, Alexander C. Berg, and Tamara L. Berg. Baby Talk: Understanding and generating simple image descriptions. In CVPR, 2011.
-
(2011)
CVPR
-
-
Kulkarni, G.1
Premraj, V.2
Dhar, S.3
Li, S.4
Choi, Y.5
Berg, A.C.6
Berg, T.L.7
-
15
-
-
84878189119
-
Collective generation of natural image descriptions
-
Polina Kuznetsova, Vicente Ordonez, Alexander C. Berg, Tamara L. Berg, and Yejin Choi. Collective generation of natural image descriptions. In ACL, 2012.
-
(2012)
ACL
-
-
Kuznetsova, P.1
Ordonez, V.2
Berg, A.C.3
Berg, T.L.4
Choi, Y.5
-
16
-
-
84862279067
-
Composing simple image descriptions using web-scale n-grams
-
Siming Li, Girish Kulkarni, Tamara L. Berg, Alexander C. Berg, and Yejin Choi. Composing simple image descriptions using web-scale n-grams. In CoNLL, 2011.
-
(2011)
CoNLL
-
-
Li, S.1
Kulkarni, G.2
Berg, T.L.3
Berg, A.C.4
Choi, Y.5
-
17
-
-
85016508365
-
Automatic evaluation of summaries using n-gram co-occurrence statistics
-
C.-Y. Lin and E. Hovy. Automatic evaluation of summaries using n-gram co-occurrence statistics. In NAACLHLT, 2003.
-
(2003)
NAACLHLT
-
-
Lin, C.-Y.1
Hovy, E.2
-
18
-
-
3042535216
-
Distinctive image features from scale-invariant keypoints
-
David G. Lowe. Distinctive image features from scale-invariant keypoints. IJCV, 60 (2):91-110, 2004.
-
(2004)
IJCV
, vol.60
, Issue.2
, pp. 91-110
-
-
Lowe, D.G.1
-
19
-
-
70449580491
-
A new baseline for image annotation
-
Ameesh Makadia, Vladimir Pavlovic, and Sanjiv Kumar. A new baseline for image annotation. In ECCV, 2008.
-
(2008)
ECCV
-
-
Makadia, A.1
Pavlovic, V.2
Kumar, S.3
-
21
-
-
85034832841
-
Midge: Generating image descriptions from computer vision detections
-
Margaret Mitchell, Jesse Dodge, Amit Goyal, Kota Yamaguchi, Karl Sratos, Xufeng Han, Alysssa Mensch, Alexander C. Berg, Tamara L. Berg, and Hal Daumé III. Midge: Generating image descriptions from computer vision detections. In EACL, 2012.
-
(2012)
EACL
-
-
Mitchell, M.1
Dodge, J.2
Goyal, A.3
Yamaguchi, K.4
Sratos, K.5
Han, X.6
Mensch, A.7
Berg, A.C.8
Berg, T.L.9
Daumé, H.10
-
22
-
-
85162522202
-
Im2text: Describing images using 1 million captioned photographs
-
Vicente Ordonez, Girish Kulkarni, and Tamara L. Berg. Im2text: Describing images using 1 million captioned photographs. In NIPS, 2011.
-
(2011)
NIPS
-
-
Ordonez, V.1
Kulkarni, G.2
Berg, T.L.3
-
23
-
-
85133336275
-
Bleu: A method for automatic evaluation of machine translation
-
K. Papineni, S. Roukos, T. Ward, and W. Zhu. Bleu: A method for automatic evaluation of machine translation. In ACL, 2002.
-
(2002)
ACL
-
-
Papineni, K.1
Roukos, S.2
Ward, T.3
Zhu, W.4
-
24
-
-
77955899888
-
Diversity in photo retrieval: Overview of the imageclefphoto task 2009
-
M. Paramita, M. Sanderson, and P. Clough. Diversity in photo retrieval: overview of the imageclefphoto task 2009. CLEF working notes, 2009.
-
(2009)
CLEF Working Notes
-
-
Paramita, M.1
Sanderson, M.2
Clough, P.3
-
26
-
-
84887454767
-
A new approach to cross-modal multimedia retrieval
-
N. Rasiwasia, J. C. Pereira, E. Coviello, G. Doyle, G. R. G. Lanckriet, R. Levy, and N. Vasconcelos. A new approach to cross-modal multimedia retrieval. In ACM MM, 2010.
-
(2010)
ACM MM
-
-
Rasiwasia, N.1
Pereira, J.C.2
Coviello, E.3
Doyle, G.4
Lanckriet, G.R.G.5
Levy, R.6
Vasconcelos, N.7
-
27
-
-
84898493831
-
Label embedding for text recognition
-
Jose Rodriguez and Florent Perronnin. Label embedding for text recognition. In BMVC, 2013.
-
(2013)
BMVC
-
-
Rodriguez, J.1
Perronnin, F.2
-
28
-
-
84898775239
-
Translating video content to natural language descriptions
-
Marcus Rohrbach, Wei Qiu, and Ivan Titov. Translating video content to natural language descriptions. In ICCV, 2013.
-
(2013)
ICCV
-
-
Rohrbach, M.1
Qiu, W.2
Titov, I.3
-
29
-
-
80052889458
-
Recognition using visual phrases
-
M. A. Sadeghi and A. Farhadi. Recognition using visual phrases. In CVPR, 2011.
-
(2011)
CVPR
-
-
Sadeghi, M.A.1
Farhadi, A.2
-
31
-
-
0034498523
-
Content-based image retrieval at the end of the early years
-
A. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain. Content-based image retrieval at the end of the early years. PAMI, 22(12):1349-1380, 2000.
-
(2000)
PAMI
, vol.22
, Issue.12
, pp. 1349-1380
-
-
Smeulders, A.1
Worring, M.2
Santini, S.3
Gupta, A.4
Jain, R.5
-
32
-
-
14344250451
-
Support vector machine learning for interdependent and structured output spaces
-
Ioannis Tsochantaridis, Thomas Hofmann, Thorsten Joachims, and Yasemin Altun. Support vector machine learning for interdependent and structured output spaces. In ICML, 2004.
-
(2004)
ICML
-
-
Tsochantaridis, I.1
Hofmann, T.2
Joachims, T.3
Altun, Y.4
-
33
-
-
84919753222
-
Understanding images with natural sentences
-
Yoshitaka Ushiku, Tatsuya Harada, and Yasuo Kuniyoshi. Understanding images with natural sentences. In ACM MM, 2011.
-
(2011)
ACM MM
-
-
Ushiku, Y.1
Harada, T.2
Kuniyoshi, Y.3
-
35
-
-
84885412937
-
Image annotation using metric learning in semantic neighbourhoods
-
Yashaswi Verma and C. V. Jawahar. Image annotation using metric learning in semantic neighbourhoods. In ECCV, 2012.
-
(2012)
ECCV
-
-
Verma, Y.1
Jawahar, C.V.2
-
36
-
-
84898490664
-
Exploring SVM for image annotation in presence of confusing labels
-
Yashaswi Verma and C. V. Jawahar. Exploring SVM for image annotation in presence of confusing labels. In BMVC, 2013.
-
(2013)
BMVC
-
-
Verma, Y.1
Jawahar, C.V.2
-
37
-
-
84884963254
-
Generating image descriptions using semantic similarities in the output space
-
Yashaswi Verma, Ankush Gupta, Prashanth Mannem, and C. V. Jawahar. Generating image descriptions using semantic similarities in the output space. In V&L Net Workshop on Language for Vision, in conjunction with CVPR, 2013.
-
(2013)
V&L Net Workshop on Language for Vision, in Conjunction with CVPR
-
-
Verma, Y.1
Gupta, A.2
Mannem, P.3
Jawahar, C.V.4
-
38
-
-
84867117593
-
WSABIE: Scaling up to large vocabulary image annotation
-
Jason Weston, Samy Bengio, and Nicolas Usunier. WSABIE: Scaling up to large vocabulary image annotation. In IJCAI, 2011.
-
(2011)
IJCAI
-
-
Weston, J.1
Bengio, S.2
Usunier, N.3
-
39
-
-
80053258778
-
Corpus-guided sentence generation of natural images
-
Y. Yang, C. L. Teo, Hal Daumé III, and Y. Aloimonos. Corpus-guided sentence generation of natural images. In EMNLP, 2011.
-
(2011)
EMNLP
-
-
Yang, Y.1
Teo, C.L.2
Daumé, H.3
Aloimonos, Y.4
-
40
-
-
84885873069
-
I2T: Image parsing to text description
-
B. Z. Yao, X. Yang, L. Lin, M. W. Lee, and S.-C. Zhu. I2T: Image parsing to text description. In Proceedings of the IEEE, 2008.
-
(2008)
Proceedings of the IEEE
-
-
Yao, B.Z.1
Yang, X.2
Lin, L.3
Lee, M.W.4
Zhu, S.-C.5
|