-
1
-
-
78149311145
-
Every picture tells a story: Generating sentences from images
-
Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part IV. Springer, Heidelberg
-
Farhadi, A., Hejrati, M., Sadeghi, M.A., Young, P., Rashtchian, C., Hockenmaier, J., Forsyth, D.: Every picture tells a story: Generating sentences from images. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part IV. LNCS, vol. 6314, pp. 15-29. Springer, Heidelberg (2010)
-
(2010)
LNCS
, vol.6314
, pp. 15-29
-
-
Farhadi, A.1
Hejrati, M.2
Sadeghi, M.A.3
Young, P.4
Rashtchian, C.5
Hockenmaier, J.6
Forsyth, D.7
-
2
-
-
80052901011
-
Baby talk: Understanding and generating image descriptions
-
Kulkarni, G., Premraj, V., Dhar, S., Li, S., Choi, Y., Berg, A.C., Berg, T.L.: Baby talk: Understanding and generating image descriptions. In: CVPR (2011)
-
(2011)
CVPR
-
-
Kulkarni, G.1
Premraj, V.2
Dhar, S.3
Li, S.4
Choi, Y.5
Berg, A.C.6
Berg, T.L.7
-
3
-
-
84862279067
-
Composing simple image descriptions using web-scale n-grams
-
Li, S., Kulkarni, G., Berg, T.L., Berg, A.C., Choi, Y.: Composing simple image descriptions using web-scale n-grams. In: CoNLL (2011)
-
(2011)
CoNLL
-
-
Li, S.1
Kulkarni, G.2
Berg, T.L.3
Berg, A.C.4
Choi, Y.5
-
4
-
-
85034832841
-
Midge: Generating image descriptions from computer vision detections
-
Mitchell, M., Han, X., Dodge, J., Mensch, A., Goyal, A., Berg, A., Yamaguchi, K., Berg, T., Stratos, K., Daumé III, H.: Midge: Generating image descriptions from computer vision detections. In: EACL (2012)
-
(2012)
EACL
-
-
Mitchell, M.1
Han, X.2
Dodge, J.3
Mensch, A.4
Goyal, A.5
Berg, A.6
Yamaguchi, K.7
Berg, T.8
Stratos, K.9
Daumé III, H.10
-
5
-
-
84887365305
-
A sentence is worth a thousand pixels
-
Fidler, S., Sharma, A., Urtasun, R.: A sentence is worth a thousand pixels. In: CVPR (2013)
-
(2013)
CVPR
-
-
Fidler, S.1
Sharma, A.2
Urtasun, R.3
-
6
-
-
77954862144
-
I2T: Image parsing to text description
-
Yao, B.Z., Yang, X., Lin, L., Lee, M.W., Zhu, S.C.: I2T: Image parsing to text description. Proceedings of the IEEE 98 (2010)
-
(2010)
Proceedings of the IEEE
, vol.98
-
-
Yao, B.Z.1
Yang, X.2
Lin, L.3
Lee, M.W.4
Zhu, S.C.5
-
7
-
-
84883394520
-
Framing image description as a ranking task: Data, models and evaluation metrics
-
Hodosh, M., Young, P., Hockenmaier, J.: Framing image description as a ranking task: Data, models and evaluation metrics. Journal of Artificial Intelligence Research (2013)
-
(2013)
Journal of Artificial Intelligence Research
-
-
Hodosh, M.1
Young, P.2
Hockenmaier, J.3
-
8
-
-
85162522202
-
Im2Text: Describing images using 1 million captioned photographs
-
Ordonez, V., Kulkarni, G., Berg, T.L.: Im2Text: Describing images using 1 million captioned photographs. In: NIPS (2011)
-
(2011)
NIPS
-
-
Ordonez, V.1
Kulkarni, G.2
Berg, T.L.3
-
9
-
-
84906925854
-
Grounded compositional semantics for finding and describing images with sentences
-
Socher, R., Le, Q.V., Manning, C.D., Ng, A.Y.: Grounded compositional semantics for finding and describing images with sentences. In: ACL (2013)
-
(2013)
ACL
-
-
Socher, R.1
Le, Q.V.2
Manning, C.D.3
Ng, A.Y.4
-
10
-
-
84878189119
-
Collective generation of natural image descriptions
-
Kuznetsova, P., Ordonez, V., Berg, A.C., Berg, T.L., Choi, Y.: Collective generation of natural image descriptions. In: ACL (2012)
-
(2012)
ACL
-
-
Kuznetsova, P.1
Ordonez, V.2
Berg, A.C.3
Berg, T.L.4
Choi, Y.5
-
11
-
-
85133336275
-
Bleu: A method for automatic evaluation of machine translation
-
Papineni, K., Roukos, S., Ward, T., Zhu, W.J.: Bleu: A method for automatic evaluation of machine translation. In: ACL, pp. 311-318 (2002)
-
(2002)
ACL
, pp. 311-318
-
-
Papineni, K.1
Roukos, S.2
Ward, T.3
Zhu, W.J.4
-
12
-
-
10044285992
-
Canonical correlation analysis; an overview with application to learning methods
-
Hardoon, D., Szedmak, S., Shawe-Taylor, J.: Canonical correlation analysis; an overview with application to learning methods. Neural Computation 16 (2004)
-
(2004)
Neural Computation
, vol.16
-
-
Hardoon, D.1
Szedmak, S.2
Shawe-Taylor, J.3
-
13
-
-
84906498766
-
A multi-view embedding space for modeling internet images, tags, and their semantics
-
Gong, Y., Ke, Q., Isard, M., Lazebnik, S.: A multi-view embedding space for modeling internet images, tags, and their semantics. IJCV (2013)
-
(2013)
IJCV
-
-
Gong, Y.1
Ke, Q.2
Isard, M.3
Lazebnik, S.4
-
14
-
-
84897476317
-
Connecting the dots with landmarks: Discriminatively learning domain-invariant features for unsupervised domain adaptation
-
Gong, B., Grauman, K., Sha, F.: Connecting the dots with landmarks: Discriminatively learning domain-invariant features for unsupervised domain adaptation. In: ICML, pp. 222-230 (2013)
-
(2013)
ICML
, pp. 222-230
-
-
Gong, B.1
Grauman, K.2
Sha, F.3
-
15
-
-
78149318752
-
Adapting visual category models to new domains
-
Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part IV. Springer, Heidelberg
-
Saenko, K., Kulis, B., Fritz, M., Darrell, T.: Adapting visual category models to new domains. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part IV. LNCS, vol. 6314, pp. 213-226. Springer, Heidelberg (2010)
-
(2010)
LNCS
, vol.6314
, pp. 213-226
-
-
Saenko, K.1
Kulis, B.2
Fritz, M.3
Darrell, T.4
-
16
-
-
82455188167
-
Data-driven visual similarity for cross-domain image matching
-
Shrivastava, A., Malisiewicz, T., Gupta, A., Efros, A.A.: Data-driven visual similarity for cross-domain image matching. ACM SIGGRAPH ASIA 30(6) (2011)
-
(2011)
ACM SIGGRAPH ASIA
, vol.30
, Issue.6
-
-
Shrivastava, A.1
Malisiewicz, T.2
Gupta, A.3
Efros, A.A.4
-
18
-
-
84866661767
-
Large-scale knowledge transfer for object localization in ImageNet
-
Guillaumin, M., Ferrari, V.: Large-scale knowledge transfer for object localization in ImageNet. In: CVPR, pp. 3202-3209 (2012)
-
(2012)
CVPR
, pp. 3202-3209
-
-
Guillaumin, M.1
Ferrari, V.2
-
19
-
-
77956006653
-
Multimodal semi-supervised learning for image classification
-
Guillaumin, M., Verbeek, J., Schmid, C.: Multimodal semi-supervised learning for image classification. In: CVPR, pp. 902-909 (2010)
-
(2010)
CVPR
, pp. 902-909
-
-
Guillaumin, M.1
Verbeek, J.2
Schmid, C.3
-
20
-
-
35148862171
-
Learning visual representations using images with captions
-
Quattoni, A., Collins, M., Darrell, T.: Learning visual representations using images with captions. In: CVPR (2007)
-
(2007)
CVPR
-
-
Quattoni, A.1
Collins, M.2
Darrell, T.3
-
21
-
-
70450207253
-
Building text features for object image classification
-
Wang, G., Hoiem, D., Forsyth, D.: Building text features for object image classification. In: CVPR (2009)
-
(2009)
CVPR
-
-
Wang, G.1
Hoiem, D.2
Forsyth, D.3
-
22
-
-
84906494296
-
From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions
-
Young, P., Lai, A., Hodosh, M., Hockenmaier, J.: From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions. In: TACL (2014)
-
(2014)
TACL
-
-
Young, P.1
Lai, A.2
Hodosh, M.3
Hockenmaier, J.4
-
23
-
-
0035328421
-
Modeling the shape of the scene: A holistic representation of the spatial envelope
-
Oliva, A., Torralba, A.: Modeling the shape of the scene: A holistic representation of the spatial envelope. IJCV (2001)
-
(2001)
IJCV
-
-
Oliva, A.1
Torralba, A.2
-
24
-
-
77955426203
-
Evaluating color descriptors for object and scene recognition
-
van de Sande, K.E.A., Gevers, T., Snoek, C.G.M.: Evaluating color descriptors for object and scene recognition. PAMI 32(9), 1582-1596 (2010)
-
(2010)
PAMI
, vol.32
, Issue.9
, pp. 1582-1596
-
-
van de Sande, K.E.A.1
Gevers, T.2
Snoek, C.G.M.3
-
25
-
-
33645146449
-
Histograms of oriented gradients for human detection
-
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: CVPR (2005)
-
(2005)
CVPR
-
-
Dalal, N.1
Triggs, B.2
-
26
-
-
77956004473
-
Aggregating local descriptors into a compact image representation
-
Jégou, H., Douze, M., Schmid, C., Perez, P.: Aggregating local descriptors into a compact image representation. In: CVPR (2010)
-
(2010)
CVPR
-
-
Jégou, H.1
Douze, M.2
Schmid, C.3
Perez, P.4
-
27
-
-
84876231242
-
ImageNet classification with deep convolutional neural networks
-
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: NIPS (2012)
-
(2012)
NIPS
-
-
Krizhevsky, A.1
Sutskever, I.2
Hinton, G.E.3
-
28
-
-
84906504048
-
DeCAF: A deep convolutional activation feature for generic visual recognition
-
abs/1310.1531
-
Donahue, J., Jia, Y., Vinyals, O., Hoffman, J., Zhang, N., Tzeng, E., Darrell, T.: DeCAF: A deep convolutional activation feature for generic visual recognition. CoRR abs/1310.1531 (2013)
-
(2013)
CoRR
-
-
Donahue, J.1
Jia, Y.2
Vinyals, O.3
Hoffman, J.4
Zhang, N.5
Tzeng, E.6
Darrell, T.7
-
29
-
-
85198028989
-
ImageNet: A large-scale hierarchical image database
-
Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: ImageNet: A large-scale hierarchical image database. In: CVPR (2009)
-
(2009)
CVPR
-
-
Deng, J.1
Dong, W.2
Socher, R.3
Li, L.J.4
Li, K.5
Fei-Fei, L.6
-
31
-
-
84867117593
-
Wsabie: Scaling up to large vocabulary image annotation
-
Weston, J., Bengio, S., Usunier, N.: Wsabie: Scaling up to large vocabulary image annotation. In: IJCAI (2011)
-
(2011)
IJCAI
-
-
Weston, J.1
Bengio, S.2
Usunier, N.3
-
32
-
-
80052250414
-
Adaptive subgradient methods for online learning and stochastic optimization
-
Duchi, J., Hazan, E., Singer, Y.: Adaptive subgradient methods for online learning and stochastic optimization. JMLR (2011)
-
(2011)
JMLR
-
-
Duchi, J.1
Hazan, E.2
Singer, Y.3
-
34
-
-
84898938559
-
Zero-shot learning through cross-modal transfer
-
Socher, R., Ganjoo, M., Sridhar, H., Bastani, O., Manning, C.D., Ng, A.Y.: Zero-shot learning through cross-modal transfer. In: NIPS (2013)
-
(2013)
NIPS
-
-
Socher, R.1
Ganjoo, M.2
Sridhar, H.3
Bastani, O.4
Manning, C.D.5
Ng, A.Y.6
-
35
-
-
0000107975
-
Relations between two sets of variables
-
Hotelling, H.: Relations between two sets of variables. Biometrika 28, 312-377 (1936)
-
(1936)
Biometrika
, vol.28
, pp. 312-377
-
-
Hotelling, H.1
-
36
-
-
84866699225
-
Leveraging category-level labels for instance-level image retrieval
-
Gordo, A., Rodriguez-Serrano, J.A., Perronnin, F., Valveny, E.: Leveraging category-level labels for instance-level image retrieval. In: CVPR (2012)
-
(2012)
CVPR
-
-
Gordo, A.1
Rodriguez-Serrano, J.A.2
Perronnin, F.3
Valveny, E.4
-
37
-
-
84863396387
-
Domain adaptation for object recognition: An unsupervised approach
-
Gopalan, R., Li, R., Chellappa, R.: Domain adaptation for object recognition: An unsupervised approach. In: ICCV (2011)
-
(2011)
ICCV
-
-
Gopalan, R.1
Li, R.2
Chellappa, R.3
-
38
-
-
84906513179
-
From sBoW to dCoT: Marginalized encoders for text representation
-
Xu, Z., Chen, M., Weinberger, K.Q., Sha, F.: From sBoW to dCoT: Marginalized encoders for text representation. In: CIKM (2011)
-
(2011)
CIKM
-
-
Xu, Z.1
Chen, M.2
Weinberger, K.Q.3
Sha, F.4
-
39
-
-
77953218689
-
Random features for large-scale kernel machines
-
Rahimi, A., Recht, B.: Random features for large-scale kernel machines. In: NIPS (2007)
-
(2007)
NIPS
-
-
Rahimi, A.1
Recht, B.2
-
40
-
-
56449089103
-
Extracting and composing robust features with denoising autoencoders
-
Vincent, P., Larochelle, H., Bengio, Y., Manzagol, P.A.: Extracting and composing robust features with denoising autoencoders. In: ICML, pp. 1096-1103 (2008)
-
(2008)
ICML
, pp. 1096-1103
-
-
Vincent, P.1
Larochelle, H.2
Bengio, Y.3
Manzagol, P.A.4