-
2
-
-
85026926619
-
Understanding and predicting importance in images
-
A. C. Berg, T. L. Berg, H. D. III, J. Dodge, A. Goyal, X. Han, A. Mensch, M. Mitchell, A. Sood, K. Stratos, and K. Yamaguchi. Understanding and predicting importance in images. In CVPR. IEEE, 2012.
-
(2012)
CVPR. IEEE
-
-
Berg, A.C.1
Berg, T.L.2
Dodge, J.3
Goyal, A.4
Han, X.5
Mensch, A.6
Mitchell, M.7
Sood, A.8
Stratos, K.9
Yamaguchi, K.10
-
3
-
-
33750347385
-
The physics of optimal decision making: A formal analysis of models of performance in two-alternative forcedchoice tasks
-
Oct.
-
R. Bogacz, E. Brown, J. Moehlis, P. Holmes, and J. D. Cohen. The physics of optimal decision making: A formal analysis of models of performance in two-alternative forcedchoice tasks. Psychol Rev, 113(4):700-765, Oct. 2006.
-
(2006)
Psychol Rev
, vol.113
, Issue.4
, pp. 700-765
-
-
Bogacz, R.1
Brown, E.2
Moehlis, J.3
Holmes, P.4
Cohen, J.D.5
-
4
-
-
84893361786
-
Re-evaluating the role of bleu in machine translation research
-
C. Callison-burch and M. Osborne. Re-evaluating the role of bleu in machine translation research. In In EACL, pages 249-256, 2006.
-
(2006)
EACL
, pp. 249-256
-
-
Callison-Burch, C.1
Osborne, M.2
-
5
-
-
84952349295
-
-
ArXiv e-prints, Apr.
-
X. Chen, H. Fang, T.-Y. Lin, R. Vedantam, S. Gupta, P. Dollar, and C. L. Zitnick. Microsoft COCO Captions: Data Collection and Evaluation Server. ArXiv e-prints, Apr. 2015.
-
(2015)
Microsoft COCO Captions: Data Collection and Evaluation Server
-
-
Chen, X.1
Fang, H.2
Lin, T.-Y.3
Vedantam, R.4
Gupta, S.5
Dollar, P.6
Zitnick, C.L.7
-
6
-
-
84944115859
-
Learning a recurrent visual representation for image caption generation
-
X. Chen and C. L. Zitnick. Learning a recurrent visual representation for image caption generation. CoRR, abs/1411. 5654, 2014.
-
(2014)
CoRR, abs/1411. 5654
-
-
Chen, X.1
Zitnick, C.L.2
-
7
-
-
72249100259
-
ImageNet: A large-scale hierarchical image database
-
J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei. ImageNet: A Large-Scale Hierarchical Image Database. In CVPR09, 2009.
-
(2009)
CVPR09
-
-
Deng, J.1
Dong, W.2
Socher, R.3
Li, L.-J.4
Li, K.5
Fei-Fei, L.6
-
10
-
-
84946802546
-
Long-term recurrent convolutional networks for visual recognition and description
-
J. Donahue, L. A. Hendricks, S. Guadarrama, M. Rohrbach, S. Venugopalan, K. Saenko, and T. Darrell. Long-term recurrent convolutional networks for visual recognition and description. CoRR, abs/1411. 4389, 2014.
-
(2014)
CoRR, abs/1411. 4389
-
-
Donahue, J.1
Hendricks, L.A.2
Guadarrama, S.3
Rohrbach, M.4
Venugopalan, S.5
Saenko, K.6
Darrell, T.7
-
11
-
-
84906929591
-
Image description using visual dependency representations
-
D. Elliott and F. Keller. Image description using visual dependency representations. In EMNLP, pages 1292-1302. ACL, 2013.
-
(2013)
EMNLP 1292-1302. ACL
-
-
Elliott, D.1
Keller, F.2
-
14
-
-
80052017343
-
Every picture tells a story: Generating sentences from images
-
A. Farhadi, M. Hejrati, M. A. Sadeghi, P. Young, C. Rashtchian, J. Hockenmaier, and D. Forsyth. Every picture tells a story: Generating sentences from images. In Pro-ceedings of the 11th European Conference on Computer Vi-sion: Part IV, ECCV'10, 2010.
-
(2010)
Pro-ceedings of the 11th European Conference on Computer VI-sion: Part IV, ECCV'10
-
-
Farhadi, A.1
Hejrati, M.2
Sadeghi, M.A.3
Young, P.4
Rashtchian, C.5
Hockenmaier, J.6
Forsyth, D.7
-
15
-
-
77955422240
-
Object detection with discriminatively trained part based models
-
P. F. Felzenszwalb, R. B. Girshick, D. McAllester, and D. Ramanan. Object detection with discriminatively trained part based models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32(9):1627-1645, 2010.
-
(2010)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.32
, Issue.9
, pp. 1627-1645
-
-
Felzenszwalb, P.F.1
Girshick, R.B.2
McAllester, D.3
Ramanan, D.4
-
16
-
-
57149125139
-
Beyond nouns: Exploiting prepositions and comparative adjectives for learning visual classifiers
-
D. A. Forsyth, P. H. S. Torr, and A. Zisserman, editors, , of Lecture Notes in Com-puter Science. Springer
-
A. Gupta and L. S. Davis. Beyond nouns: Exploiting prepositions and comparative adjectives for learning visual classifiers. In D. A. Forsyth, P. H. S. Torr, and A. Zisserman, editors, ECCV (1), volume 5302 of Lecture Notes in Com-puter Science, pages 16-29. Springer, 2008.
-
(2008)
ECCV
, vol.5302
, Issue.1
, pp. 16-29
-
-
Gupta, A.1
Davis, L.S.2
-
18
-
-
84883394520
-
Framing image description as a ranking task: Data, models and evaluation metrics
-
M. Hodosh, P. Young, and J. Hockenmaier. Framing image description as a ranking task: Data, models and evaluation metrics. J. Artif. Intell. Res. (JAIR), 47:853-899, 2013.
-
(2013)
J. Artif. Intell. Res. (JAIR)
, vol.47
, pp. 853-899
-
-
Hodosh, M.1
Young, P.2
Hockenmaier, J.3
-
19
-
-
84959099868
-
Deep visual-semantic alignments for generating image descriptions
-
A. Karpathy and L. Fei-Fei. Deep visual-semantic alignments for generating image descriptions. CoRR, abs/1412. 2306, 2014.
-
(2014)
CoRR, abs/1412. 2306
-
-
Karpathy, A.1
Fei-Fei, L.2
-
20
-
-
84959252592
-
Deep fragment embeddings for bidirectional image sentence mapping
-
A. Karpathy, A. Joulin, and L. Fei-Fei. Deep fragment embeddings for bidirectional image sentence mapping. CoRR, 2014.
-
(2014)
CoRR
-
-
Karpathy, A.1
Joulin, A.2
Fei-Fei, L.3
-
21
-
-
84946802533
-
Unifying visual-semantic embeddings with multimodal neural language models
-
R. Kiros, R. Salakhutdinov, and R. S. Zemel. Unifying visual-semantic embeddings with multimodal neural language models. CoRR, abs/1411. 2539, 2014.
-
(2014)
CoRR, abs/1411. 2539
-
-
Kiros, R.1
Salakhutdinov, R.2
Zemel, R.S.3
-
22
-
-
80052901011
-
Baby talk: Understanding and generating image descriptions
-
G. Kulkarni, V. Premraj, S. Dhar, S. Li, Y. Choi, A. C. Berg, and T. L. Berg. Baby talk: Understanding and generating image descriptions. In Proceedings of the 24th CVPR, 2011.
-
(2011)
Proceedings of the 24th CVPR
-
-
Kulkarni, G.1
Premraj, V.2
Dhar, S.3
Li, S.4
Choi, Y.5
Berg, A.C.6
Berg, T.L.7
-
23
-
-
70450172710
-
Learning to detect unseen object classes by betweenclass attribute transfer
-
C. H. Lampert, H. Nickisch, and S. Harmeling. Learning to detect unseen object classes by betweenclass attribute transfer. In In CVPR, 2009.
-
(2009)
CVPR
-
-
Lampert, C.H.1
Nickisch, H.2
Harmeling, S.3
-
24
-
-
84862279067
-
Composing simple image descriptions using web-scale n-grams
-
Stroudsburg, PA, USA, Association for Computational Linguistics
-
S. Li, G. Kulkarni, T. L. Berg, A. C. Berg, and Y. Choi. Composing simple image descriptions using web-scale n-grams. In Proceedings of the Fifteenth Conference on Computa-tional Natural Language Learning, CoNLL '11, pages 220-228, Stroudsburg, PA, USA, 2011. Association for Computational Linguistics.
-
(2011)
Proceedings of the Fifteenth Conference on Computa-tional Natural Language Learning, CoNLL '11
, pp. 220-228
-
-
Li, S.1
Kulkarni, G.2
Berg, T.L.3
Berg, A.C.4
Choi, Y.5
-
25
-
-
84937834115
-
Microsoft COCO: Common objects in context
-
T. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollár, and C. L. Zitnick. Microsoft COCO: Common objects in context. In ECCV, 2014.
-
(2014)
ECCV
-
-
Lin, T.1
Maire, M.2
Belongie, S.3
Hays, J.4
Perona, P.5
Ramanan, D.6
Dollár, P.7
Zitnick, C.L.8
-
27
-
-
84951072975
-
Explain images with multimodal recurrent neural networks
-
J. Mao, W. Xu, Y. Yang, J. Wang, and A. L. Yuille. Explain images with multimodal recurrent neural networks. CoRR, abs/1410. 1090, 2014.
-
(2014)
CoRR, abs/1410. 1090
-
-
Mao, J.1
Xu, W.2
Yang, Y.3
Wang, J.4
Yuille, A.L.5
-
28
-
-
0034850577
-
A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics
-
July
-
D. Martin, C. Fowlkes, D. Tal, and J. Malik. A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics. In Proc. 8th Int'l Conf. Computer Vision, volume 2, pages 416-423, July 2001.
-
(2001)
Proc. 8th Int'l Conf. Computer Vision
, vol.2
, pp. 416-423
-
-
Martin, D.1
Fowlkes, C.2
Tal, D.3
Malik, J.4
-
29
-
-
84959231566
-
Midge: Generating descriptions of images
-
Stroudsburg, PA, USA. Association for Computational Linguistics
-
M. Mitchell, X. Han, and J. Hayes. Midge: Generating descriptions of images. In Proceedings of the Seventh Interna-tional Natural Language Generation Conference, INLG '12, pages 131-133, Stroudsburg, PA, USA, 2012. Association for Computational Linguistics.
-
(2012)
Proceedings of the Seventh Interna-tional Natural Language Generation Conference, INLG '12
, pp. 131-133
-
-
Mitchell, M.1
Han, X.2
Hayes, J.3
-
30
-
-
79951843401
-
-
Springer Publishing Company, Incorporated, 1st edition
-
H. Mller, P. Clough, T. Deselaers, and B. Caputo. Image-CLEF: Experimental Evaluation in Visual Information Re-trieval. Springer Publishing Company, Incorporated, 1st edition, 2010.
-
(2010)
Image-CLEF: Experimental Evaluation in Visual Information Re-trieval
-
-
Mller, H.1
Clough, P.2
Deselaers, T.3
Caputo, B.4
-
31
-
-
85013202438
-
Evaluating content selection in summarization: The pyramid method
-
A. Nenkova and R. J. Passonneau. Evaluating content selection in summarization: The pyramid method. In HLT-NAACL, pages 145-152, 2004.
-
(2004)
HLT-NAACL
, pp. 145-152
-
-
Nenkova, A.1
Passonneau, R.J.2
-
33
-
-
85133336275
-
Bleu: A method for automatic evaluation of machine translation
-
Stroudsburg, PA, USA. Association for Computational Linguistics
-
K. Papineni, S. Roukos, T. Ward, and W.-J. Zhu. Bleu: A method for automatic evaluation of machine translation. In Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, ACL '02, pages 311-318, Stroudsburg, PA, USA, 2002. Association for Computational Linguistics.
-
(2002)
Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, ACL '02
, pp. 311-318
-
-
Papineni, K.1
Roukos, S.2
Ward, T.3
Zhu, W.-J.4
-
35
-
-
85090348677
-
Collecting image annotations using amazon's mechanical turk
-
Stroudsburg, PA, USA. Association for Computational Linguistics
-
C. Rashtchian, P. Young, M. Hodosh, and J. Hockenmaier. Collecting image annotations using amazon's mechanical turk. In Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, CSLDAMT '10, Stroudsburg, PA, USA, 2010. Association for Computational Linguistics.
-
(2010)
Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, CSLDAMT '10
-
-
Rashtchian, C.1
Young, P.2
Hodosh, M.3
Hockenmaier, J.4
-
36
-
-
8844253324
-
Understanding inverse document frequency: On theoretical arguments for idf
-
S. Robertson. Understanding inverse document frequency: On theoretical arguments for idf. Journal of Documentation, 60:2004, 2004.
-
(2004)
Journal of Documentation
, vol.60
, pp. 2004
-
-
Robertson, S.1
-
37
-
-
84898775239
-
Translating video content to natural language descriptions
-
December
-
M. Rohrbach, W. Qiu, I. Titov, S. Thater, M. Pinkal, and B. Schiele. Translating video content to natural language descriptions. In IEEE International Conference on Computer Vision (ICCV), December 2013.
-
(2013)
IEEE International Conference on Computer Vision (ICCV)
-
-
Rohrbach, M.1
Qiu, W.2
Titov, I.3
Thater, S.4
Pinkal, M.5
Schiele, B.6
-
39
-
-
0036537472
-
A taxonomy and evaluation of dense two-frame stereo correspondence algorithms
-
D. Scharstein and R. Szeliski. A taxonomy and evaluation of dense two-frame stereo correspondence algorithms. Int. J. Comput. Vision, 2002.
-
(2002)
Int. J. Comput. Vision
-
-
Scharstein, D.1
Szeliski, R.2
-
41
-
-
80053456767
-
Adaptively learning the crowd kernel
-
O. Tamuz, C. Liu, S. Belongie, O. Shamir, and A. T. Kalai. Adaptively learning the crowd kernel. In In ICML11, 2011.
-
(2011)
ICML11
-
-
Tamuz, O.1
Liu, C.2
Belongie, S.3
Shamir, O.4
Kalai, A.T.5
-
43
-
-
84951910303
-
Show and tell: A neural image caption generator
-
O. Vinyals, A. Toshev, S. Bengio, and D. Erhan. Show and tell: A neural image caption generator. CoRR, abs/1411. 4555, 2014.
-
(2014)
CoRR, abs/1411. 4555
-
-
Vinyals, O.1
Toshev, A.2
Bengio, S.3
Erhan, D.4
-
44
-
-
85026931000
-
Corpusguided sentence generation of natural images
-
Y. Yang, C. L. Teo, H. D. III, and Y. Aloimonos. Corpusguided sentence generation of natural images. In EMNLP. ACL, 2011.
-
(2011)
EMNLP. ACL
-
-
Yang, Y.1
Teo, C.L.2
Aloimonos, Y.3
-
45
-
-
85026937926
-
See no evil, say no evil: Description generation from densely labeled images
-
Association for Computational Linguistics and Dublin City University, Dublin, Ireland, August
-
M. Yatskar, M. Galley, L. Vanderwende, and L. Zettlemoyer. See no evil, say no evil: Description generation from densely labeled images. In Proceedings of the Third Joint Conference on Lexical and Computational Semantics (SEM 2014), page 110120, Dublin, Ireland, August 2014. Association for Computational Linguistics and Dublin City University.
-
(2014)
Proceedings of the Third Joint Conference on Lexical and Computational Semantics (SEM 2014)
, pp. 110120
-
-
Yatskar, M.1
Galley, M.2
Vanderwende, L.3
Zettlemoyer, L.4
-
47
-
-
84887338442
-
Bringing semantics into focus using visual abstraction
-
C. L. Zitnick and D. Parikh. Bringing semantics into focus using visual abstraction. In CVPR, 2013.
-
(2013)
CVPR
-
-
Zitnick, C.L.1
Parikh, D.2
|