-
1
-
-
77951155435
-
Video2text: Learning to annotate video content
-
ICDMW '09 IEEE International Conference on
-
H. Aradhye, G. Toderici, and J. Yagnik. Video2text: Learning to annotate video content. In Data Mining Workshops, 2009. ICDMW '09. IEEE International Conference on, pages 144-151, 2009.
-
(2009)
In Data Mining Workshops 2009
, pp. 144-151
-
-
Aradhye, H.1
Toderici, G.2
Yagnik, J.3
-
2
-
-
84885996388
-
Video in sentences out
-
A. Barbu, A. Bridge, Z. Burchill, D. Coroian, S. Dickinson, S. Fidler, A. Michaux, S. Mussman, S. Narayanaswamy, D. Salvi, et al. Video in sentences out. In Proceedings of the 28th Conference on Uncertainty in Artificial Intelligence (UAI), pages 102-112, 2012.
-
(2012)
In Proceedings of the 28th Conference on Uncertainty in Artificial Intelligence (UAI)
, pp. 102-112
-
-
Barbu, A.1
Bridge, A.2
Burchill, Z.3
Coroian, D.4
Dickinson, S.5
Fidler, S.6
Michaux, A.7
Mussman, S.8
Narayanaswamy, S.9
Salvi, D.10
-
4
-
-
84859089502
-
Collecting highly parallel data for paraphrase evaluation
-
Portland, Oregon
-
D. L. Chen and W. B. Dolan. Collecting highly parallel data for paraphrase evaluation. In Proceedings of ACL, 2011, pages 190-200, Portland, Oregon, 2011.
-
(2011)
In Proceedings of ACL 2011
, pp. 190-200
-
-
Chen, D.L.1
Dolan, W.B.2
-
5
-
-
84887345951
-
A thousand frames in just a few words: Lingual description of videos through latent topics and sparse object stitching
-
IEEE Computer Society
-
P. Das, C. Xu, R. F. Doell, and J. J. Corso. A thousand frames in just a few words: Lingual description of videos through latent topics and sparse object stitching. In Computer Vision and Pattern Recognition (CVPR), 2013, pages 2634-2641. IEEE Computer Society, 2013.
-
(2013)
In Computer Vision and Pattern Recognition (CVPR 2013)
, pp. 2634-2641
-
-
Das, P.1
Xu, C.2
Doell, R.F.3
Corso, J.J.4
-
6
-
-
84866674680
-
Hedging your bets: Optimizing accuracy-specificity trade-offs in large scale visual recognition
-
IEEE
-
J. Deng, J. Krause, A. C. Berg, and L. Fei-Fei. Hedging your bets: Optimizing accuracy-specificity trade-offs in large scale visual recognition. In Computer Vision and Pattern Recognition (CVPR), 2012, pages 3450-3457. IEEE, 2012.
-
(2012)
In Computer Vision and Pattern Recognition (CVPR 2012)
, pp. 3450-3457
-
-
Deng, J.1
Krause, J.2
Berg, A.C.3
Fei-Fei, L.4
-
7
-
-
84946590544
-
Construction and analysis of a large scale image ontology
-
J. Deng, K. Li, M. Do, H. Su, and L. Fei-Fei. Construction and Analysis of a Large Scale Image Ontology. In Vision Sciences Society, 2009.
-
(2009)
In Vision Sciences Society
-
-
Deng, J.1
Li, K.2
Do, M.3
Su, H.4
Fei-Fei, L.5
-
8
-
-
84864139941
-
Beyond audio and video retrieval: Towards multimedia summarization
-
ACM
-
D. Ding, F. Metze, S. Rawat, P. Schulam, S. Burger, E. Younessian, L. Bao, M. Christel, and A. Hauptmann. Beyond audio and video retrieval: towards multimedia summarization. In Proceedings of the 2nd ACM International Conference on Multimedia Retrieval, page 2. ACM, 2012.
-
(2012)
In Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
, pp. 2
-
-
Ding, D.1
Metze, F.2
Rawat, S.3
Schulam, P.4
Burger, S.5
Younessian, E.6
Bao, L.7
Christel, M.8
Hauptmann, A.9
-
9
-
-
77951298115
-
The pascal visual object classes (voc) challenge
-
June
-
M. Everingham, L. Van Gool, C. K. I. Williams, J. Winn, and A. Zisserman. The pascal visual object classes (voc) challenge. International Journal of Computer Vision, 88(2):303-338, June 2010.
-
(2010)
International Journal of Computer Vision
, vol.88
, Issue.2
, pp. 303-338
-
-
Everingham, M.1
Van Gool, L.2
Williams, C.K.I.3
Winn, J.4
Zisserman, A.5
-
10
-
-
78149311145
-
Every picture tells a story: Generating sentences from images
-
A. Farhadi, M. Hejrati, M. Sadeghi, P. Young, C. Rashtchian, J. Hockenmaier, and D. Forsyth. Every picture tells a story: Generating sentences from images. Computer Vision-ECCV 2010, pages 15-29, 2010.
-
(2010)
Computer Vision-ECCV 2010
, pp. 15-29
-
-
Farhadi, A.1
Hejrati, M.2
Sadeghi, M.3
Young, P.4
Rashtchian, C.5
Hockenmaier, J.6
Forsyth, D.7
-
11
-
-
77955422240
-
Object detection with discriminatively trained part-based models
-
P. F. Felzenszwalb, R. B. Girshick, D. McAllester, and D. Ramanan. Object detection with discriminatively trained part-based models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32(9):1627-1645, 2010.
-
(2010)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.32
, Issue.9
, pp. 1627-1645
-
-
Felzenszwalb, P.F.1
Girshick, R.B.2
McAllester, D.3
Ramanan, D.4
-
13
-
-
0036843382
-
Natural language description of human activities from video images based on concept hierarchy of actions
-
A. Kojima, T. Tamura, and K. Fukunaga. Natural language description of human activities from video images based on concept hierarchy of actions. International Journal of Computer Vision, 50(2):171-184, 2002.
-
(2002)
International Journal of Computer Vision
, vol.50
, Issue.2
, pp. 171-184
-
-
Kojima, A.1
Tamura, T.2
Fukunaga, K.3
-
14
-
-
84893398951
-
Generating natural-language video descriptions using text-mined knowledge
-
N. Krishnamoorthy, G. Malkarnenkar, R. J. Mooney, K. Saenko, and S. Guadarrama. Generating natural-language video descriptions using text-mined knowledge. In Proceedings of AAAI, 2013.
-
(2013)
In Proceedings of AAAI 2013
-
-
Krishnamoorthy, N.1
Malkarnenkar, G.2
Mooney, R.J.3
Saenko, K.4
Guadarrama, S.5
-
15
-
-
80052901011
-
Baby talk: Understanding and generating simple image descriptions
-
IEEE
-
G. Kulkarni, V. Premraj, S. Dhar, S. Li, Y. Choi, A. Berg, and T. Berg. Baby talk: Understanding and generating simple image descriptions. In Computer Vision and Pattern Recognition (CVPR), 2011, pages 1601-1608. IEEE, 2011.
-
(2011)
In Computer Vision and Pattern Recognition (CVPR 2011)
, pp. 1601-1608
-
-
Kulkarni, G.1
Premraj, V.2
Dhar, S.3
Li, S.4
Choi, Y.5
Berg, A.6
Berg, T.7
-
17
-
-
51849094354
-
Save: A framework for semantic annotation of visual events
-
CVPRW'08 IEEE
-
M. Lee, A. Hakeem, N. Haering, and S. Zhu. Save: A framework for semantic annotation of visual events. In Computer Vision and Pattern Recognition Workshops, 2008. CVPRW'08, pages 1-8. IEEE, 2008.
-
(2008)
In Computer Vision and Pattern Recognition Workshops 2008
, pp. 1-8
-
-
Lee, M.1
Hakeem, A.2
Haering, N.3
Zhu, S.4
-
18
-
-
85162513516
-
Object bank: A high-level image representation for scene classification and semantic feature sparsification
-
L. Li, H. Su, E. Xing, and L. Fei-Fei. Object bank: A high-level image representation for scene classification and semantic feature sparsification. Advances in Neural Information Processing Systems, 24, 2010.
-
(2010)
Advances in Neural Information Processing Systems
, vol.24
-
-
Li, L.1
Su, H.2
Xing, E.3
Fei-Fei, L.4
-
19
-
-
84862279067
-
Composing simple image descriptions using web-scale n-grams
-
Association for Computational Linguistics
-
S. Li, G. Kulkarni, T. Berg, A. Berg, and Y. Choi. Composing simple image descriptions using web-scale n-grams. In Proceedings of the Fifteenth Conference on Computational Natural Language Learning, pages 220-228. Association for Computational Linguistics, 2011.
-
(2011)
In Proceedings of the Fifteenth Conference on Computational Natural Language Learning
, pp. 220-228
-
-
Li, S.1
Kulkarni, G.2
Berg, T.3
Berg, A.4
Choi, Y.5
-
21
-
-
85081941118
-
WordNet::Similarity: Measuring the relatedness of concepts
-
Association for Computational Linguistics
-
T. Pedersen, S. Patwardhan, and J. Michelizzi. WordNet::Similarity: Measuring the relatedness of concepts. In Demonstration Papers at HLT-NAACL 2004, pages 38-41. Association for Computational Linguistics, 2004.
-
(2004)
In Demonstration Papers at HLT-NAACL 2004
, pp. 38-41
-
-
Pedersen, T.1
Patwardhan, S.2
Michelizzi, J.3
-
23
-
-
0003243224
-
Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods
-
MIT Press
-
J. C. Platt. Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. In Advances in Large Margin Classifiers, pages 61-74. MIT Press, 1999.
-
(1999)
In Advances in Large Margin Classifiers
, pp. 61-74
-
-
Platt, J.C.1
-
24
-
-
84879550059
-
Recognizing 50 human action categories of web videos
-
K. Reddy and M. Shah. Recognizing 50 human action categories of web videos. Machine Vision and Applications, pages 1-11, 2012.
-
(2012)
Machine Vision and Applications
, pp. 1-11
-
-
Reddy, K.1
Shah, M.2
-
26
-
-
10044233701
-
Recognizing human actions: A local svm approach
-
ICPR 2004 IEEE
-
C. Schuldt, I. Laptev, and B. Caputo. Recognizing human actions: A local svm approach. In Pattern Recognition, 2004. ICPR 2004, volume 3, pages 32-36. IEEE, 2004.
-
(2004)
In Pattern Recognition 2004
, vol.3
, pp. 32-36
-
-
Schuldt, C.1
Laptev, I.2
Caputo, B.3
-
27
-
-
80052877143
-
Action recognition by dense trajectories
-
IEEE
-
H. Wang, A. Klaser, C. Schmid, and C.-L. Liu. Action recognition by dense trajectories. In Computer Vision and Pattern Recognition (CVPR), 2011, pages 3169-3176. IEEE, 2011.
-
(2011)
In Computer Vision and Pattern Recognition (CVPR 2011)
, pp. 3169-3176
-
-
Wang, H.1
Klaser, A.2
Schmid, C.3
Liu, C.-L.4
-
29
-
-
80053258778
-
Corpus-guided sentence generation of natural images
-
EMNLP '11
-
Y. Yang, C. L. Teo, H. Daumé III, and Y. Aloimonos. Corpus-guided sentence generation of natural images. In Proc. of the Conference on Empirical Methods in Natural Language Processing, EMNLP '11, pages 444-454, 2011.
-
(2011)
In Proc. of the Conference on Empirical Methods in Natural Language Processing
, pp. 444-454
-
-
Yang, Y.1
Teo, C.L.2
Daumé III, H.3
Aloimonos, Y.4
-
30
-
-
77954862144
-
I2t: Image parsing to text description
-
B. Yao, X. Yang, L. Lin, M. Lee, and S. Zhu. I2t: Image parsing to text description. Proceedings of the IEEE, 98(8):1485-1508, 2010.
-
(2010)
Proceedings of the IEEE
, vol.98
, Issue.8
, pp. 1485-1508
-
-
Yao, B.1
Yang, X.2
Lin, L.3
Lee, M.4
Zhu, S.5
-
31
-
-
33846580425
-
Local features and kernels for classification of texture and object categories: A comprehensive study
-
J. Zhang, M. Marszałek, S. Lazebnik, and C. Schmid. Local features and kernels for classification of texture and object categories: A comprehensive study. International Journal of Computer Vision, 73(2):213-238, 2007.
-
(2007)
International Journal of Computer Vision
, vol.73
, Issue.2
, pp. 213-238
-
-
Zhang, J.1
Marszałek, M.2
Lazebnik, S.3
Schmid, C.4