-
1
-
-
84885996388
-
Video in sentences out
-
Andrei Barbu, Alexander Bridge, Zachary Burchill, Dan Coroian, Sven Dickinson, Sanja Fidler, Aaron Michaux, Sam Mussman, Siddharth Narayanaswamy, Dhaval Salvi, Lara Schmidt, Jiangnan Shangguan, Jeffrey Mark Siskind, Jarrell Waggoner, Song Wang, Jinlian Wei, Yifan Yin, and Zhiqi Zhang. 2012. Video in sentences out. In Association for Uncertainty in Artificial Intelligence (UAI).
-
(2012)
Association for Uncertainty in Artificial Intelligence (UAI)
-
-
Barbu, A.1
Bridge, A.2
Burchill, Z.3
Coroian, D.4
Dickinson, S.5
Fidler, S.6
Michaux, A.7
Mussman, S.8
Narayanaswamy, S.9
Salvi, D.10
Schmidt, L.11
Shangguan, J.12
Siskind, J.M.13
Waggoner, J.14
Wang, S.15
Wei, J.16
Yin, Y.17
Zhang, Z.18
-
9
-
-
84904482223
-
-
arXiv preprint arXiv:1310.1531
-
Jeff Donahue, Yangqing Jia, Oriol Vinyals, Judy Hoffman, Ning Zhang, Eric Tzeng, and Trevor Darrell. 2013. DeCAF: A deep convolutional activation feature for generic visual recognition. arXiv preprint arXiv:1310.1531.
-
(2013)
DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition
-
-
Donahue, J.1
Jia, Y.2
Vinyals, O.3
Hoffman, J.4
Zhang, N.5
Tzeng, E.6
Darrell, T.7
-
10
-
-
77951298115
-
The PASCAL visual object classes (VOC) challenge
-
June
-
Mark Everingham, Luc Van Gool, Christopher K. I. Williams, John Winn, and Andrew Zisserman. 2010. The PASCAL visual object classes (VOC) challenge. International Journal of Computer Vision (IJCV), 88(2):303-338, June.
-
(2010)
International Journal of Computer Vision (IJCV)
, vol.88
, Issue.2
, pp. 303-338
-
-
Everingham, M.1
Gool, L.V.2
Williams, C.K.I.3
Winn, J.4
Zisserman, A.5
-
12
-
-
84898773262
-
Youtube2text: Recognizing and describing arbitrary activities using semantic hierarchies and zero-shot recognition
-
December
-
Sergio Guadarrama, Niveda Krishnamoorthy, Girish Malkarnenkar, Subhashini Venugopalan, Raymond Mooney, Trevor Darrell, and Kate Saenko. 2013. Youtube2text: Recognizing and describing arbitrary activities using semantic hierarchies and zero-shot recognition. In IEEE International Conference on Computer Vision (ICCV), December.
-
(2013)
IEEE International Conference on Computer Vision (ICCV)
-
-
Guadarrama, S.1
Krishnamoorthy, N.2
Malkarnenkar, G.3
Venugopalan, S.4
Mooney, R.5
Darrell, T.6
Saenko, K.7
-
13
-
-
84893398951
-
Generating natural-language video descriptions using text-mined knowledge
-
Niveda Krishnamoorthy, Girish Malkarnenkar, Raymond J. Mooney, Kate Saenko, and Sergio Guadarrama. 2013. Generating natural-language video descriptions using text-mined knowledge. In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), pages 541-547.
-
(2013)
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI)
, pp. 541-547
-
-
Krishnamoorthy, N.1
Malkarnenkar, G.2
Mooney, R.J.3
Saenko, K.4
Guadarrama, S.5
-
14
-
-
80052901011
-
Baby talk: Understanding and generating image descriptions
-
IEEE
-
Girish Kulkarni, Visruth Premraj, Sagnik Dhar, Siming Li, Alexander Berg, Yejin Choi, and Tamara Berg. 2011. Baby talk: Understanding and generating image descriptions. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE.
-
(2011)
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
-
-
Kulkarni, G.1
Premraj, V.2
Dhar, S.3
Li, S.4
Berg, A.5
Choi, Y.6
Berg, T.7
-
17
-
-
84862279067
-
Composing simple image descriptions using web-scale n-grams
-
Stroudsburg, PA, USA. Association for Computational Linguistics
-
Siming Li, Girish Kulkarni, Tamara L. Berg, Alexander C. Berg, and Yejin Choi. 2011. Composing simple image descriptions using web-scale n-grams. In Proceedings of the Fifteenth Conference on Computational Natural Language Learning (CoNLL), pages 220-228, Stroudsburg, PA, USA. Association for Computational Linguistics.
-
(2011)
Proceedings of the Fifteenth Conference on Computational Natural Language Learning (CoNLL)
, pp. 220-228
-
-
Li, S.1
Kulkarni, G.2
Berg, T.L.3
Berg, A.C.4
Choi, Y.5
-
21
-
-
0003243224
-
Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods
-
MIT Press
-
John C. Platt. 1999. Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. In Advances in Large Margin Classifiers, pages 61-74. MIT Press.
-
(1999)
Advances in Large Margin Classifiers
, pp. 61-74
-
-
Platt, J.C.1
-
22
-
-
84898775239
-
Translating video content to natural language descriptions
-
Marcus Rohrbach, Wei Qiu, Ivan Titov, Stefan Thater, Manfred Pinkal, and Bernt Schiele. 2013. Translating video content to natural language descriptions. In IEEE International Conference on Computer Vision (ICCV).
-
(2013)
IEEE International Conference on Computer Vision (ICCV)
-
-
Rohrbach, M.1
Qiu, W.2
Titov, I.3
Thater, S.4
Pinkal, M.5
Schiele, B.6
-
23
-
-
84908684165
-
-
arXiv preprint arXiv:1403.6173
-
Anna Senina, Marcus Rohrbach, Wei Qiu, Annemarie Friedrich, Sikandar Amin, Mykhaylo Andriluka, Manfred Pinkal, and Bernt Schiele. 2014. Coherent multi-sentence video description with variable level of detail. arXiv preprint arXiv:1403.6173.
-
(2014)
Coherent Multi-sentence Video Description with Variable Level of Detail
-
-
Senina, A.1
Rohrbach, M.2
Qiu, W.3
Friedrich, A.4
Amin, S.5
Andriluka, M.6
Pinkal, M.7
Schiele, B.8
-
25
-
-
77955988947
-
SUN database: Large-scale scene recognition from abbey to zoo
-
Jianxiong Xiao, James Hays, Krista A. Ehinger, Aude Oliva, and Antonio Torralba. 2010. SUN database: Large-scale scene recognition from abbey to zoo. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 3485-3492.
-
(2010)
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 3485-3492
-
-
Xiao, J.1
Hays, J.2
Ehinger, K.A.3
Oliva, A.4
Torralba, A.5
-
28
-
-
33846580425
-
Local features and kernels for classification of texture and object categories: A comprehensive study
-
Jianguo Zhang, Marcin Marszałek, Svetlana Lazebnik, and Cordelia Schmid. 2007. Local features and kernels for classification of texture and object categories: A comprehensive study. International Journal of Computer Vision (IJCV), 73(2):213-238.
-
(2007)
International Journal of Computer Vision (IJCV)
, vol.73
, Issue.2
, pp. 213-238
-
-
Zhang, J.1
Marszałek, M.2
Lazebnik, S.3
Schmid, C.4