-
2
-
-
0041876117
-
Matching words and pictures
-
K. Barnard, P. Duygulu, D. Forsyth, N. De Freitas, D. Blei, and M. Jordan. Matching words and pictures. The Journal of Machine Learning Research, 3:1107-1135, 2003.
-
(2003)
The Journal of Machine Learning Research
, vol.3
, pp. 1107-1135
-
-
Barnard, K.1
Duygulu, P.2
Forsyth, D.3
De Freitas, N.4
Blei, D.5
Jordan, M.6
-
5
-
-
33847419773
-
Supervised learning of semantic classes for image annotation and retrieval
-
DOI 10.1109/TPAMI.2007.61
-
G. Carneiro, A. Chan, P. Moreno, and N. Vasconcelos. Supervised learning of semantic classes for image annotation and retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, pages 394-410, 2007. (Pubitemid 46336416)
-
(2007)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.29
, Issue.3
, pp. 394-410
-
-
Carneiro, G.1
Chan, A.B.2
Moreno, P.J.3
Vasconcelos, N.4
-
7
-
-
33645146449
-
Histograms of oriented gradients for human detection
-
DOI 10.1109/CVPR.2005.177, 1467360, Proceedings - 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005
-
N. Dalal and B. Triggs. Histograms of oriented gradients for human detection. In IEEE Conference on Computer Vision and Pattern Recognition, volume 1, pages 886-893. IEEE, 2005. (Pubitemid 43897286)
-
(2005)
Proceedings - 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005
, vol.I
, pp. 886-893
-
-
Dalal, N.1
Triggs, B.2
-
8
-
-
84898427335
-
Recognizing human actions in still images: A study of bag-of-features and part-based representations
-
V. Delaitre, I. Laptev, and J. Sivic. Recognizing human actions in still images: a study of bag-of-features and part-based representations. In British Machine Vision Conference, 2009.
-
(2009)
British Machine Vision Conference
-
-
Delaitre, V.1
Laptev, I.2
Sivic, J.3
-
9
-
-
78149311145
-
Every picture tells a story: Generating sentences from images
-
A. Farhadi, M. Hejrati, M. Sadeghi, P. Young, C. Rashtchian, J. Hockenmaier, and D. Forsyth. Every Picture Tells a Story: Generating Sentences from Images. ECCV 2010, pages 15-29, 2010.
-
(2010)
ECCV 2010
, pp. 15-29
-
-
Farhadi, A.1
Hejrati, M.2
Sadeghi, M.3
Young, P.4
Rashtchian, C.5
Hockenmaier, J.6
Forsyth, D.7
-
10
-
-
77950676180
-
What, where and who? Telling the story of an image by activity classification, scene recognition and object categorization
-
L. Fei-Fei and L. Li. What, Where and Who? Telling the Story of an Image by Activity Classification, Scene Recognition and Object Categorization. Computer Vision, pages 157-171, 2010.
-
(2010)
Computer Vision
, pp. 157-171
-
-
Fei-Fei, L.1
Li, L.2
-
13
-
-
70450202741
-
Understanding videos, constructing plots: Learning a visually grounded storyline model from annotated videos
-
Citeseer
-
A. Gupta, P. Srinivasan, J. Shi, and L. Davis. Understanding videos, constructing plots: Learning a visually grounded storyline model from annotated videos. In IEEE Conference on Computer Vision and Pattern Recognition, pages 2012-2019. Citeseer, 2009.
-
(2009)
IEEE Conference on Computer Vision and Pattern Recognition
, pp. 2012-2019
-
-
Gupta, A.1
Srinivasan, P.2
Shi, J.3
Davis, L.4
-
14
-
-
0000107975
-
Relations between two sets of variates
-
H. Hotelling. Relations between two sets of variates. Biometrika, 28(3-4):321, 1936.
-
(1936)
Biometrika
, vol.28
, Issue.3-4
, pp. 321
-
-
Hotelling, H.1
-
16
-
-
84865688064
-
Combining image captions and visual analysis for image concept classification
-
ACM
-
T. Kliegr, K. Chandramouli, J. Nemrava, V. Svatek, and E. Izquierdo. Combining image captions and visual analysis for image concept classification. In Proceedings of the 9th International Workshop on Multimedia Data Mining: held in conjunction with the ACM SIGKDD 2008, pages 8-17. ACM, 2008.
-
(2008)
Proceedings of the 9th International Workshop on Multimedia Data Mining: Held in Conjunction with the ACM SIGKDD 2008
, pp. 8-17
-
-
Kliegr, T.1
Chandramouli, K.2
Nemrava, J.3
Svatek, V.4
Izquierdo, E.5
-
17
-
-
33845572523
-
Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories
-
DOI 10.1109/CVPR.2006.68, 1641019, Proceedings - 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2006
-
S. Lazebnik, C. Schmid, and J. Ponce. Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In IEEE Conference on Computer Vision and Pattern Recognition, volume 2, pages 2169-2178. IEEE, 2006. (Pubitemid 44931582)
-
(2006)
Proceedings - 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2006
, vol.2
, pp. 2169-2178
-
-
Lazebnik, S.1
Schmid, C.2
Ponce, J.3
-
18
-
-
77951297833
-
Optimol: Automatic online picture collection via incremental model learning
-
L. Li and L. Fei-Fei. Optimol: automatic online picture collection via incremental model learning. International Journal of Computer Vision, 88(2):147-168, 2010.
-
(2010)
International Journal of Computer Vision
, vol.88
, Issue.2
, pp. 147-168
-
-
Li, L.1
Fei-Fei, L.2
-
21
-
-
33749236045
-
Chapter 2 Building the gist of a scene: The role of global image features in recognition
-
DOI 10.1016/S0079-6123(06)55002-2, PII S0079612306550022, Visual Perception Fundamentals of Awareness: Multi-Sensory Integration and High-Order Perception
-
A. Oliva and A. Torralba. Building the gist of a scene: The role of global image features in recognition. Progress in Brain Research, 155:23-36, 2006. (Pubitemid 44485189)
-
(2006)
Progress in Brain Research
, vol.155 B
, pp. 23-36
-
-
Oliva, A.1
Torralba, A.2
-
22
-
-
63749115565
-
Latent semantic fusion model for image retrieval and annotation
-
ACM
-
T. Pham, N. Maillot, J. Lim, and J. Chevallet. Latent semantic fusion model for image retrieval and annotation. In Proceedings of the 16th ACM Conference on Information and Knowledge Management, pages 439-444. ACM, 2007.
-
(2007)
Proceedings of the 16th ACM Conference on Information and Knowledge Management
, pp. 439-444
-
-
Pham, T.1
Maillot, N.2
Lim, J.3
Chevallet, J.4
-
24
-
-
84977916396
-
MEAD - A platform for multidocument multilingual text summarization
-
Lisbon, Portugal, May
-
D. Radev, T. Allison, S. Blair-Goldensohn, J. Blitzer, A. Çelebi, S. Dimitrov, E. Drabek, A. Hakim, W. Lam, D. Liu, J. Otterbacher, H. Qi, H. Saggion, S. Teufel, M. Topper, A. Winkel, and Z. Zhang. MEAD - a platform for multidocument multilingual text summarization. In LREC 2004, Lisbon, Portugal, May 2004.
-
(2004)
LREC 2004
-
-
Radev, D.1
Allison, T.2
Blair-Goldensohn, S.3
Blitzer, J.4
Çelebi, A.5
Dimitrov, S.6
Drabek, E.7
Hakim, A.8
Lam, W.9
Liu, D.10
Otterbacher, J.11
Qi, H.12
Saggion, H.13
Teufel, S.14
Topper, M.15
Winkel, A.16
Zhang, Z.17
-
25
-
-
78650967345
-
A new approach to cross-modal multimedia retrieval
-
ACM
-
N. Rasiwasia, J. Pereira, E. Coviello, G. Doyle, G. Lanckriet, R. Levy, and N. Vasconcelos. A New Approach to Cross-Modal Multimedia Retrieval. In Proceedings of ACM International Conference on Multimedia. ACM, 2010.
-
(2010)
Proceedings of ACM International Conference on Multimedia
-
-
Rasiwasia, N.1
Pereira, J.2
Coviello, E.3
Doyle, G.4
Lanckriet, G.5
Levy, R.6
Vasconcelos, N.7
-
26
-
-
77953196456
-
Multiple kernels for object detection
-
IEEE
-
A. Vedaldi, V. Gulshan, M. Varma, and A. Zisserman. Multiple kernels for object detection. In IEEE International Conference on Computer Vision, pages 606-613. IEEE, 2010.
-
(2010)
IEEE International Conference on Computer Vision
, pp. 606-613
-
-
Vedaldi, A.1
Gulshan, V.2
Varma, M.3
Zisserman, A.4
-
29
-
-
77954862144
-
I2T: Image parsing to text description
-
B. Yao, X. Yang, L. Lin, M. Lee, and S. Zhu. I2T: Image parsing to text description. Proceedings of the IEEE, 98(8):1485-1508, 2010.
-
(2010)
Proceedings of the IEEE
, vol.98
, Issue.8
, pp. 1485-1508
-
-
Yao, B.1
Yang, X.2
Lin, L.3
Lee, M.4
Zhu, S.5