-
1
-
-
84959210562
-
-
British amazon
-
British amazon. http://www.amazon.co.uk/, 2014
-
(2014)
-
-
-
3
-
-
85026927356
-
-
Makemkv. http://www.makemkv.com/, 2014
-
(2014)
-
-
-
4
-
-
85026933692
-
-
Subtitle edit. http://www.nikse.dk/SubtitleEdit/, 2014
-
(2014)
-
-
-
5
-
-
85026924402
-
-
Xmedia recode. http://www.xmedia-recode.de/, 2014
-
(2014)
-
-
-
8
-
-
84885996388
-
Video in sentences out
-
A. Barbu, A. Bridge, Z. Burchill, D. Coroian, S. Dickinson, S. Fidler, A. Michaux, S. Mussman, S. Narayanaswamy, D. Salvi, L. Schmidt, J. Shangguan, J. M. Siskind, J. Waggoner, S. Wang, J. Wei, Y. Yin, and Z. Zhang. Video in sentences out. In Proceedings of the conference on Uncertainty in Artificial Intelligence (UAI), 2012
-
(2012)
Proceedings of the Conference on Uncertainty in Artificial Intelligence (UAI)
-
-
Barbu, A.1
Bridge, A.2
Burchill, Z.3
Coroian, D.4
Dickinson, S.5
Fidler, S.6
Michaux, A.7
Mussman, S.8
Narayanaswamy, S.9
Salvi, D.10
Schmidt, L.11
Shangguan, J.12
Siskind, J.M.13
Waggoner, J.14
Wang, S.15
Wei, J.16
Yin, Y.17
Zhang, Z.18
-
10
-
-
84898792367
-
Finding actors and actions in movies
-
P. Bojanowski, F. Bach, I. Laptev, J. Ponce, C. Schmid, and J. Sivic. Finding actors and actions in movies. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2013
-
(2013)
Proceedings of the IEEE International Conference on Computer Vision (ICCV)
-
-
Bojanowski, P.1
Bach, F.2
Laptev, I.3
Ponce, J.4
Schmid, C.5
Sivic, J.6
-
11
-
-
84943800045
-
Weakly supervised action labeling in videos under ordering constraints
-
P. Bojanowski, R. Lajugie, F. Bach, I. Laptev, J. Ponce, C. Schmid, and J. Sivic. Weakly supervised action labeling in videos under ordering constraints. In Proceedings of the European Conference on Computer Vision (ECCV), 2014
-
(2014)
Proceedings of the European Conference on Computer Vision (ECCV)
-
-
Bojanowski, P.1
Lajugie, R.2
Bach, F.3
Laptev, I.4
Ponce, J.5
Schmid, C.6
Sivic, J.7
-
19
-
-
72249100259
-
Imagenet: A large-scale hierarchical image database
-
J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei. Imagenet: A large-scale hierarchical image database. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2009
-
(2009)
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
-
-
Deng, J.1
Dong, W.2
Socher, R.3
Li, L.-J.4
Li, K.5
Fei-Fei, L.6
-
20
-
-
84959236502
-
Long-term recurrent convolutional networks for visual recognition and description
-
J. Donahue, L. A. Hendricks, S. Guadarrama, M. Rohrbach, S. Venugopalan, K. Saenko, and T. Darrell. Long-term recurrent convolutional networks for visual recognition and description. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
-
(2015)
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
-
-
Donahue, J.1
Hendricks, L.A.2
Guadarrama, S.3
Rohrbach, M.4
Venugopalan, S.5
Saenko, K.6
Darrell, T.7
-
21
-
-
85081863350
-
Automatic annotation of human actions in video
-
O. Duchenne, I. Laptev, J. Sivic, F. Bach, and J. Ponce. Automatic annotation of human actions in video. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2009
-
(2009)
Proceedings of the IEEE International Conference on Computer Vision (ICCV)
-
-
Duchenne, O.1
Laptev, I.2
Sivic, J.3
Bach, F.4
Ponce, J.5
-
24
-
-
84944115860
-
-
arXiv:1411.4952
-
H. Fang, S. Gupta, F. N. Iandola, R. Srivastava, L. Deng, P. Dollár, J. Gao, X. He, M. Mitchell, J. C. Platt, C. L. Zitnick, and G. Zweig. From captions to visual concepts and back. arXiv:1411.4952, 2014
-
(2014)
From Captions to Visual Concepts and Back
-
-
Fang, H.1
Gupta, S.2
Iandola, F.N.3
Srivastava, R.4
Deng, L.5
Dollár, P.6
Gao, J.7
He, X.8
Mitchell, M.9
Platt, J.C.10
Zitnick, C.L.11
Zweig, G.12
-
25
-
-
80052017343
-
Every picture tells a story: Generating sentences from images
-
A. Farhadi, M. Hejrati, M. Sadeghi, P. Young, C. Rashtchian, J. Hockenmaier, and D. Forsyth. Every picture tells a story: Generating sentences from images. In Proceedings of the European Conference on Computer Vision (ECCV), 2010
-
(2010)
Proceedings of the European Conference on Computer Vision (ECCV)
-
-
Farhadi, A.1
Hejrati, M.2
Sadeghi, M.3
Young, P.4
Rashtchian, C.5
Hockenmaier, J.6
Forsyth, D.7
-
27
-
-
77956527163
-
A computer-vision-assisted system for videodescription scripting
-
L. Gagnon, C. Chapdelaine, D. Byrns, S. Foucher, M. Heritier, and V. Gupta. A computer-vision-assisted system for videodescription scripting. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPR Workshops), 2010
-
(2010)
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPR Workshops)
-
-
Gagnon, L.1
Chapdelaine, C.2
Byrns, D.3
Foucher, S.4
Heritier, M.5
Gupta, V.6
-
28
-
-
84898773262
-
Youtube2text: Recognizing and describing arbitrary activities using semantic hierarchies and zero-shot recognition
-
S. Guadarrama, N. Krishnamoorthy, G. Malkarnenkar, S. Venugopalan, R. Mooney, T. Darrell, and K. Saenko. Youtube2text: Recognizing and describing arbitrary activities using semantic hierarchies and zero-shot recognition. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2013
-
(2013)
Proceedings of the IEEE International Conference on Computer Vision (ICCV)
-
-
Guadarrama, S.1
Krishnamoorthy, N.2
Malkarnenkar, G.3
Venugopalan, S.4
Mooney, R.5
Darrell, T.6
Saenko, K.7
-
29
-
-
70450202741
-
Understanding videos, constructing plots: Learning a visually grounded storyline model from annotated videos
-
A. Gupta, P. Srinivasan, J. Shi, and L. Davis. Understanding videos, constructing plots: Learning a visually grounded storyline model from annotated videos. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2009
-
(2009)
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
-
-
Gupta, A.1
Srinivasan, P.2
Shi, J.3
Davis, L.4
-
32
-
-
84924803045
-
LSDA: Large scale detection through adaptation
-
J. Hoffman, S. Guadarrama, E. Tzeng, J. Donahue, R. Girshick, T. Darrell, and K. Saenko. LSDA: Large scale detection through adaptation. In Advances in Neural Information Processing Systems (NIPS), 2014
-
(2014)
Advances in Neural Information Processing Systems (NIPS)
-
-
Hoffman, J.1
Guadarrama, S.2
Tzeng, E.3
Donahue, J.4
Girshick, R.5
Darrell, T.6
Saenko, K.7
-
38
-
-
85110867932
-
Moses: Open source toolkit for statistical machine translation
-
P. Koehn, H. Hoang, A. Birch, C. Callison-Burch, M. Federico, N. Bertoldi, B. Cowan, W. Shen, C. Moran, R. Zens, C. Dyer, O. Bojar, A. Constantin, and E. Herbst. Moses: Open source toolkit for statistical machine translation. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2007
-
(2007)
Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL)
-
-
Koehn, P.1
Hoang, H.2
Birch, A.3
Callison-Burch, C.4
Federico, M.5
Bertoldi, N.6
Cowan, B.7
Shen, W.8
Moran, C.9
Zens, R.10
Dyer, C.11
Bojar, O.12
Constantin, A.13
Herbst, E.14
-
41
-
-
80052901011
-
Baby talk: Understanding and generating simple image descriptions
-
G. Kulkarni, V. Premraj, S. Dhar, S. Li, Y. Choi, A. C. Berg, and T. L. Berg. Baby talk: Understanding and generating simple image descriptions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2011
-
(2011)
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
-
-
Kulkarni, G.1
Premraj, V.2
Dhar, S.3
Li, S.4
Choi, Y.5
Berg, A.C.6
Berg, T.L.7
-
42
-
-
84878189119
-
Collective generation of natural image descriptions
-
P. Kuznetsova, V. Ordonez, A. C. Berg, T. L. Berg, and Y. Choi. Collective generation of natural image descriptions. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2012
-
(2012)
Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL)
-
-
Kuznetsova, P.1
Ordonez, V.2
Berg, A.C.3
Berg, T.L.4
Choi, Y.5
-
43
-
-
84934873221
-
Treetalk: Composition and compression of trees for image descriptions
-
P. Kuznetsova, V. Ordonez, T. L. Berg, U. C. Hill, and Y. Choi. Treetalk: Composition and compression of trees for image descriptions. In Transactions of the Association for Computational Linguistics (TACL), 2014
-
(2014)
Transactions of the Association for Computational Linguistics (TACL)
-
-
Kuznetsova, P.1
Ordonez, V.2
Berg, T.L.3
Hill, U.C.4
Choi, Y.5
-
47
-
-
84862279067
-
Composing simple image descriptions using web-scale N-grams
-
S. Li, G. Kulkarni, T. Berg, A. Berg, and Y. Choi. Composing simple image descriptions using web-scale N-grams. In Proceedings of the Fifteenth Conference on Computational Natural Language Learning (CoNLL). Association for Computational Linguistics, 2011
-
(2011)
Proceedings of the Fifteenth Conference on Computational Natural Language Learning (CoNLL). Association for Computational Linguistics
-
-
Li, S.1
Kulkarni, G.2
Berg, T.3
Berg, A.4
Choi, Y.5
-
49
-
-
84937834115
-
Microsoft coco: Common objects in context
-
T.-Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollár, and C. L. Zitnick. Microsoft coco: Common objects in context. In Proceedings of the European Conference on Computer Vision (ECCV), 2014
-
(2014)
Proceedings of the European Conference on Computer Vision (ECCV)
-
-
Lin, T.-Y.1
Maire, M.2
Belongie, S.3
Hays, J.4
Perona, P.5
Ramanan, D.6
Dollár, P.7
Zitnick, C.L.8
-
50
-
-
84939821073
-
-
arXiv:1412.6632
-
J. Mao, W. Xu, Y. Yang, J. Wang, and A. L. Yuille. Deep captioning with multimodal recurrent neural networks (mrnn). arXiv:1412.6632, 2014
-
(2014)
Deep Captioning with Multimodal Recurrent Neural Networks (Mrnn)
-
-
Mao, J.1
Xu, W.2
Yang, Y.3
Wang, J.4
Yuille, A.L.5
-
52
-
-
85034832841
-
Generating image descriptions from computer vision detections
-
M. Mitchell, J. Dodge, A. Goyal, K. Yamaguchi, K. Stratos, X. Han, A. Mensch, A. C. Berg, T. L. Berg, and H. D. III. Midge: Generating image descriptions from computer vision detections. In Proceedings of the Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2012
-
(2012)
Proceedings of the Conference of the European Chapter of the Association for Computational Linguistics (EACL)
-
-
Mitchell, M.1
Dodge, J.2
Goyal, A.3
Yamaguchi, K.4
Stratos, K.5
Han, X.6
Mensch, A.7
Berg, A.C.8
Berg, T.L.9
Daumé III, H.10
-
54
-
-
84905274625
-
Trecvid 2012-an overview of the goals, tasks, data, evaluation mechanisms and metrics
-
USA
-
P. Over, G. Awad, M. Michel, J. Fiscus, G. Sanders, B. Shaw, A. F. Smeaton, and G. Quénot. Trecvid 2012-an overview of the goals, tasks, data, evaluation mechanisms and metrics. In Proceedings of TRECVID 2012. NIST, USA, 2012
-
(2012)
Proceedings of TRECVID 2012. NIST
-
-
Over, P.1
Awad, G.2
Michel, M.3
Fiscus, J.4
Sanders, G.5
Shaw, B.6
Smeaton, A.F.7
Quénot, G.8
-
57
-
-
84898785648
-
Grounding Action Descriptions in Videos
-
M. Regneri, M. Rohrbach, D. Wetzel, S. Thater, B. Schiele, and M. Pinkal. Grounding Action Descriptions in Videos. Transactions of the Association for Computational Linguistics (TACL), 1, 2013
-
(2013)
Transactions of the Association for Computational Linguistics (TACL)
, vol.1
-
-
Regneri, M.1
Rohrbach, M.2
Wetzel, D.3
Thater, S.4
Schiele, B.5
Pinkal, M.6
-
58
-
-
84960170289
-
Coherent multi-sentence video description with variable level of detail
-
September
-
A. Rohrbach, M. Rohrbach, W. Qiu, A. Friedrich, M. Pinkal, and B. Schiele. Coherent multi-sentence video description with variable level of detail. In Proceedings of the German Conference on Pattern Recognition (GCPR), September 2014
-
(2014)
Proceedings of the German Conference on Pattern Recognition (GCPR)
-
-
Rohrbach, A.1
Rohrbach, M.2
Qiu, W.3
Friedrich, A.4
Pinkal, M.5
Schiele, B.6
-
59
-
-
84898775239
-
Translating video content to natural language descriptions
-
M. Rohrbach, W. Qiu, I. Titov, S. Thater, M. Pinkal, and B. Schiele. Translating video content to natural language descriptions. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2013
-
(2013)
Proceedings of the IEEE International Conference on Computer Vision (ICCV)
-
-
Rohrbach, M.1
Qiu, W.2
Titov, I.3
Thater, S.4
Pinkal, M.5
Schiele, B.6
-
60
-
-
84909978410
-
-
O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. Bernstein, A. C. Berg, and L. Fei-Fei. ImageNet Large Scale Visual Recognition Challenge, 2014
-
(2014)
ImageNet Large Scale Visual Recognition Challenge
-
-
Russakovsky, O.1
Deng, J.2
Su, H.3
Krause, J.4
Satheesh, S.5
Ma, S.6
Huang, Z.7
Karpathy, A.8
Khosla, A.9
Bernstein, M.10
Berg, A.C.11
Fei-Fei, L.12
-
63
-
-
79960117324
-
Verbnet overview, extensions, mappings and applications
-
K. K. Schuler, A. Korhonen, and S. W. Brown. Verbnet overview, extensions, mappings and applications. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2009
-
(2009)
Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL)
-
-
Schuler, K.K.1
Korhonen, A.2
Brown, S.W.3
-
68
-
-
84959932469
-
Integrating language and vision to generate natural language descriptions of videos in the wild
-
J. Thomason, S. Venugopalan, S. Guadarrama, K. Saenko, and R. J. Mooney. Integrating language and vision to generate natural language descriptions of videos in the wild. In Proceedings of the International Conference on Computational Linguistics (COLING), 2014
-
(2014)
Proceedings of the International Conference on Computational Linguistics (COLING)
-
-
Thomason, J.1
Venugopalan, S.2
Guadarrama, S.3
Saenko, K.4
Mooney, R.J.5
-
70
-
-
84959876769
-
Translating videos to natural language using deep recurrent neural networks
-
S. Venugopalan, H. Xu, J. Donahue, M. Rohrbach, R. Mooney, and K. Saenko. Translating videos to natural language using deep recurrent neural networks. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2015
-
(2015)
Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL)
-
-
Venugopalan, S.1
Xu, H.2
Donahue, J.3
Rohrbach, M.4
Mooney, R.5
Saenko, K.6
-
73
-
-
77955988947
-
Sun database: Large-scale scene recognition from abbey to zoo
-
J. Xiao, J. Hays, K. A. Ehinger, A. Oliva, and A. Torralba. Sun database: Large-scale scene recognition from abbey to zoo. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2010
-
(2010)
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
-
-
Xiao, J.1
Hays, J.2
Ehinger, K.A.3
Oliva, A.4
Torralba, A.5
-
74
-
-
84959223725
-
-
arXiv:1502.08029v3
-
L. Yao, A. Torabi, K. Cho, N. Ballas, C. Pal, H. Larochelle, and A. Courville. Video description generation incorporating spatio-temporal features and a soft-attention mechanism. arXiv:1502.08029v3, 2015
-
(2015)
Video Description Generation Incorporating Spatio-temporal Features and A Soft-attention Mechanism
-
-
Yao, L.1
Torabi, A.2
Cho, K.3
Ballas, N.4
Pal, C.5
Larochelle, H.6
Courville, A.7
-
76
-
-
84937964578
-
Learning deep features for scene recognition using places database
-
B. Zhou, A. Lapedriza, J. Xiao, A. Torralba, and A. Oliva. Learning deep features for scene recognition using places database. In Advances in Neural Information Processing Systems (NIPS), 2014
-
(2014)
Advances in Neural Information Processing Systems (NIPS)
-
-
Zhou, B.1
Lapedriza, A.2
Xiao, J.3
Torralba, A.4
Oliva, A.5