-
1
-
-
84885996388
-
Video in sentences out
-
Barbu, A.; Bridge, A.; Burchill, Z.; Coroian, D.; Dickinson, S.; Fidler, S.; Michaux, A.; Mussman, S.; Narayanaswamy, S.; Salvi, D.; et al. 2012. Video in sentences out. In UAI, 102-112.
-
(2012)
UAI
, pp. 102-112
-
-
Barbu, A.1
Bridge, A.2
Burchill, Z.3
Coroian, D.4
Dickinson, S.5
Fidler, S.6
Michaux, A.7
Mussman, S.8
Narayanaswamy, S.9
Salvi, D.10
-
3
-
-
0033329799
-
An empirical study of smoothing techniques for language modeling
-
Chen, S., and Goodman, J. 1999. An empirical study of smoothing techniques for language modeling. Computer Speech & Language 13(4):359-393.
-
(1999)
Computer Speech & Language
, vol.13
, Issue.4
, pp. 359-393
-
-
Chen, S.1
Goodman, J.2
-
4
-
-
84893397324
-
Collecting highly parallel data for paraphrase evaluation
-
Chen, D.; Dolan, W.; Raghavan, S.; Huynh, T.; Mooney, R.; Blythe, J.; Hobbs, J.; Domingos, P.; Kate, R.; Garrette, D.; et al. 2010. Collecting highly parallel data for paraphrase evaluation. JAIR 37:397-435.
-
(2010)
JAIR
, vol.37
, pp. 397-435
-
-
Chen, D.1
Dolan, W.2
Raghavan, S.3
Huynh, T.4
Mooney, R.5
Blythe, J.6
Hobbs, J.7
Domingos, P.8
Kate, R.9
Garrette, D.10
-
5
-
-
85037338954
-
Generating typed dependency parses from phrase structure parses
-
De Marneffe, M.; MacCartney, B.; and Manning, C. 2006. Generating typed dependency parses from phrase structure parses. In Proceedings of LREC, volume 6, 449-454.
-
(2006)
Proceedings of LREC
, vol.6
, pp. 449-454
-
-
De Marneffe, M.1
MacCartney, B.2
Manning, C.3
-
6
-
-
84864139941
-
Beyond audio and video retrieval: Towards multimedia summarization
-
ACM
-
Ding, D.; Metze, F.; Rawat, S.; Schulam, P.; Burger, S.; Younessian, E.; Bao, L.; Christel, M.; and Hauptmann, A. 2012. Beyond audio and video retrieval: towards multimedia summarization. In Proceedings of the 2nd ACM International Conference on Multimedia Retrieval, 2. ACM.
-
(2012)
Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
, vol.2
-
-
Ding, D.1
Metze, F.2
Rawat, S.3
Schulam, P.4
Burger, S.5
Younessian, E.6
Bao, L.7
Christel, M.8
Hauptmann, A.9
-
7
-
-
78149311145
-
Every picture tells a story: Generating sentences from images
-
Farhadi, A.; Hejrati, M.; Sadeghi, M.; Young, P.; Rashtchian, C.; Hockenmaier, J.; and Forsyth, D. 2010. Every picture tells a story: Generating sentences from images. Computer Vision-ECCV 2010 15-29.
-
(2010)
Computer Vision-ECCV 2010
, pp. 15-29
-
-
Farhadi, A.1
Hejrati, M.2
Sadeghi, M.3
Young, P.4
Rashtchian, C.5
Hockenmaier, J.6
Forsyth, D.7
-
8
-
-
51949101231
-
A discriminatively trained, multiscale, deformable part model
-
IEEE
-
Felzenszwalb, P.; McAllester, D.; and Ramanan, D. 2008. A discriminatively trained, multiscale, deformable part model. In Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on, 1-8. IEEE.
-
(2008)
Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on
, pp. 1-8
-
-
Felzenszwalb, P.1
McAllester, D.2
Ramanan, D.3
-
9
-
-
84898785322
-
Describing video contents in natural language
-
Khan, M., and Gotoh, Y. 2012. Describing video contents in natural language. EACL 2012 27.
-
(2012)
EACL 2012
, pp. 27
-
-
Khan, M.1
Gotoh, Y.2
-
10
-
-
0036843382
-
Natural language description of human activities from video images based on concept hierarchy of actions
-
Kojima, A.; Tamura, T.; and Fukunaga, K. 2002. Natural language description of human activities from video images based on concept hierarchy of actions. International Journal of Computer Vision 50(2):171-184.
-
(2002)
International Journal of Computer Vision
, vol.50
, Issue.2
, pp. 171-184
-
-
Kojima, A.1
Tamura, T.2
Fukunaga, K.3
-
11
-
-
80052901011
-
Baby talk: Understanding and generating simple image descriptions
-
IEEE
-
Kulkarni, G.; Premraj, V.; Dhar, S.; Li, S.; Choi, Y.; Berg, A.; and Berg, T. 2011. Baby talk: Understanding and generating simple image descriptions. In CVPR, 1601-1608. IEEE.
-
(2011)
CVPR
, pp. 1601-1608
-
-
Kulkarni, G.1
Premraj, V.2
Dhar, S.3
Li, S.4
Choi, Y.5
Berg, A.6
Berg, T.7
-
13
-
-
51949083365
-
Learning realistic human actions from movies
-
IEEE
-
Laptev, I.; Marszałek, M.; Schmid, C.; and Rozenfeld, B. 2008. Learning realistic human actions from movies. In CVPR, 1-8. IEEE.
-
(2008)
CVPR
, pp. 1-8
-
-
Laptev, I.1
Marszałek, M.2
Schmid, C.3
Rozenfeld, B.4
-
14
-
-
51849094354
-
Save: A framework for semantic annotation of visual events
-
IEEE
-
Lee, M.; Hakeem, A.; Haering, N.; and Zhu, S. 2008. Save: A framework for semantic annotation of visual events. In CVPRW, 1-8. IEEE.
-
(2008)
CVPRW
, pp. 1-8
-
-
Lee, M.1
Hakeem, A.2
Haering, N.3
Zhu, S.4
-
15
-
-
85162513516
-
Object bank: A high-level image representation for scene classification and semantic feature sparsification
-
Li, L.; Su, H.; Xing, E.; and Fei-Fei, L. 2010. Object bank: A high-level image representation for scene classification and semantic feature sparsification. Advances in Neural Information Processing Systems 24.
-
(2010)
Advances in Neural Information Processing Systems
, vol.24
-
-
Li, L.1
Su, H.2
Xing, E.3
Fei-Fei, L.4
-
16
-
-
84862279067
-
Composing simple image descriptions using web-scale ngrams
-
Association for Computational Linguistics
-
Li, S.; Kulkarni, G.; Berg, T.; Berg, A.; and Choi, Y. 2011. Composing simple image descriptions using web-scale ngrams. In Proceedings of the Fifteenth Conference on Computational Natural Language Learning, 220-228. Association for Computational Linguistics.
-
(2011)
Proceedings of the Fifteenth Conference on Computational Natural Language Learning
, pp. 220-228
-
-
Li, S.1
Kulkarni, G.2
Berg, T.3
Berg, A.4
Choi, Y.5
-
17
-
-
85095712904
-
Syntactic annotations for the Google Books Ngram Corpus
-
Lin, Y.; Michel, J.; Aiden, E.; Orwant, J.; Brockman, W.; and Petrov, S. 2012. Syntactic annotations for the Google Books Ngram Corpus. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics.
-
(2012)
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics
-
-
Lin, Y.1
Michel, J.2
Aiden, E.3
Orwant, J.4
Brockman, W.5
Petrov, S.6
-
18
-
-
84959182849
-
Improving video activity recognition using object recognition and text mining
-
Motwani, T., and Mooney, R. 2012. Improving video activity recognition using object recognition and text mining. ECAI.
-
(2012)
ECAI
-
-
Motwani, T.1
Mooney, R.2
-
19
-
-
85162522202
-
Im2text: Describing images using 1 million captioned photographs
-
Ordonez, V.; Kulkarni, G.; and Berg, T. 2011. Im2text: Describing images using 1 million captioned photographs. In Proceedings of NIPS.
-
(2011)
Proceedings of NIPS
-
-
Ordonez, V.1
Kulkarni, G.2
Berg, T.3
-
20
-
-
84866717619
-
A combined pose, object, and feature model for action understanding
-
IEEE
-
Packer, B.; Saenko, K.; and Koller, D. 2012. A combined pose, object, and feature model for action understanding. In CVPR, 1378-1385. IEEE.
-
(2012)
CVPR
, pp. 1378-1385
-
-
Packer, B.1
Saenko, K.2
Koller, D.3
-
22
-
-
85081941118
-
WordNet::Similarity: Measuring the relatedness of concepts
-
Association for Computational Linguistics
-
Pedersen, T.; Patwardhan, S.; and Michelizzi, J. 2004. WordNet::Similarity: Measuring the relatedness of concepts. In Demonstration Papers at HLT-NAACL 2004, 38-41. Association for Computational Linguistics.
-
(2004)
Demonstration Papers at HLT-NAACL 2004
, pp. 38-41
-
-
Pedersen, T.1
Patwardhan, S.2
Michelizzi, J.3
-
23
-
-
84879550059
-
Recognizing 50 human action categories of web videos
-
Reddy, K., and Shah, M. 2012. Recognizing 50 human action categories of web videos. Machine Vision and Applications 1-11.
-
(2012)
Machine Vision and Applications
, pp. 1-11
-
-
Reddy, K.1
Shah, M.2
-
24
-
-
10044233701
-
Recognizing human actions: A local SVM approach
-
IEEE
-
Schuldt, C.; Laptev, I.; and Caputo, B. 2004. Recognizing human actions: A local SVM approach. In Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on, volume 3, 32-36. IEEE.
-
(2004)
Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on
, vol.3
, pp. 32-36
-
-
Schuldt, C.1
Laptev, I.2
Caputo, B.3
-
25
-
-
80052877143
-
Action recognition by dense trajectories
-
IEEE
-
Wang, H.; Kläser, A.; Schmid, C.; and Liu, C.-L. 2011. Action recognition by dense trajectories. In CVPR, 3169-3176. IEEE.
-
(2011)
CVPR
, pp. 3169-3176
-
-
Wang, H.1
Kläser, A.2
Schmid, C.3
Liu, C.-L.4
-
26
-
-
80053258778
-
Corpus-guided sentence generation of natural images
-
Association for Computational Linguistics
-
Yang, Y.; Teo, C. L.; Daumé III, H.; and Aloimonos, Y. 2011. Corpus-guided sentence generation of natural images. In EMNLP '11, 444-454. Association for Computational Linguistics.
-
(2011)
EMNLP '11
, pp. 444-454
-
-
Yang, Y.1
Teo, C.L.2
Daumé III, H.3
Aloimonos, Y.4
-
27
-
-
77955988492
-
Modeling mutual context of object and human pose in human-object interaction activities
-
Yao, B., and Fei-Fei, L. 2010. Modeling mutual context of object and human pose in human-object interaction activities. In CVPR.
-
(2010)
CVPR
-
-
Yao, B.1
Fei-Fei, L.2
-
28
-
-
77954862144
-
I2t: Image parsing to text description
-
Yao, B.; Yang, X.; Lin, L.; Lee, M.; and Zhu, S. 2010. I2t: Image parsing to text description. Proceedings of the IEEE 98(8):1485-1508.
-
(2010)
Proceedings of the IEEE
, vol.98
, Issue.8
, pp. 1485-1508
-
-
Yao, B.1
Yang, X.2
Lin, L.3
Lee, M.4
Zhu, S.5
-
29
-
-
33846580425
-
Local features and kernels for classification of texture and object categories: A comprehensive study
-
Zhang, J.; Marszałek, M.; Lazebnik, S.; and Schmid, C. 2007. Local features and kernels for classification of texture and object categories: A comprehensive study. International Journal of Computer Vision 73(2):213-238.
-
(2007)
International Journal of Computer Vision
, vol.73
, Issue.2
, pp. 213-238
-
-
Zhang, J.1
Marszałek, M.2
Lazebnik, S.3
Schmid, C.4