-
1
-
-
84887345951
-
Thousand frames in just a few words: Lingual description of videos through latent topics and sparse object stitching
-
Das, P., Xu, C., Doell, R.F., Corso, J.: Thousand frames in just a few words: Lingual description of videos through latent topics and sparse object stitching. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2013)
-
(2013)
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
-
-
Das, P.1
Xu, C.2
Doell, R.F.3
Corso, J.4
-
3
-
-
78149311145
-
Every picture tells a story: Generating sentences from images
-
In: Daniilidis, K., Maragos, P., Paragios, N. (eds.), Springer, Heidelberg
-
Farhadi, A., Hejrati, M., Sadeghi, M.A., Young, P., Rashtchian, C., Hockenmaier, J., Forsyth, D.: Every picture tells a story: generating sentences from images. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part IV. LNCS, vol. 6314, pp. 15–29. Springer, Heidelberg (2010)
-
(2010)
ECCV 2010, Part IV. LNCS
, vol.6314
, pp. 15-29
-
-
Farhadi, A.1
Hejrati, M.2
Sadeghi, M.A.3
Young, P.4
Rashtchian, C.5
Hockenmaier, J.6
Forsyth, D.7
-
4
-
-
84898773262
-
Youtube2text: Recognizing and describing arbitrary activities using semantic hierarchies and zero-shoot recognition
-
Guadarrama, S., Krishnamoorthy, N., Malkarnenkar, G., Mooney, R., Darrell, T., Saenko, K.: Youtube2text: Recognizing and describing arbitrary activities using semantic hierarchies and zero-shoot recognition. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV) (2013)
-
(2013)
Proceedings of the IEEE International Conference on Computer Vision (ICCV)
-
-
Guadarrama, S.1
Krishnamoorthy, N.2
Malkarnenkar, G.3
Mooney, R.4
Darrell, T.5
Saenko, K.6
-
5
-
-
70450202741
-
Understanding videos, constructing plots learning a visually grounded storyline model from annotated videos
-
Gupta, A., Srinivasan, P., Shi, J.B., Davis, L.: Understanding videos, constructing plots learning a visually grounded storyline model from annotated videos. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2009)
-
(2009)
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
-
-
Gupta, A.1
Srinivasan, P.2
Shi, J.B.3
Davis, L.4
-
7
-
-
85110867932
-
Moses: Open source toolkit for statistical machine translation
-
Koehn, P., Hoang, H., Birch, A., Callison-Burch, C., Federico, M., Bertoldi, N., Cowan, B., Shen, W., Moran, C., Zens, R., Dyer, C., Bojar, O., Constantin, A., Herbst, E.: Moses: Open source toolkit for statistical machine translation. In: Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (demo) (2007)
-
(2007)
Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (demo)
-
-
Koehn, P.1
Hoang, H.2
Birch, A.3
Callison-Burch, C.4
Federico, M.5
Bertoldi, N.6
Cowan, B.7
Shen, W.8
Moran, C.9
Zens, R.10
Dyer, C.11
Bojar, O.12
Constantin, A.13
Herbst, E.14
-
8
-
-
0036843382
-
Natural language description of human activities from video images based on concept hierarchy of actions
-
Kojima, A., Tamura, T., Fukunaga, K.: Natural language description of human activities from video images based on concept hierarchy of actions. Int. J. Comput. Vis. (IJCV) 50, 171–184 (2002)
-
(2002)
Int. J. Comput. Vis. (IJCV)
, vol.50
, pp. 171-184
-
-
Kojima, A.1
Tamura, T.2
Fukunaga, K.3
-
9
-
-
84893398951
-
Generating natural-language video descriptions using text-mined knowledge
-
Krishnamoorthy, N., Malkarnenkar, G., Mooney, R.J., Saenko, K., Guadarrama, S.: Generating natural-language video descriptions using text-mined knowledge. In: AAAI Conference on Artificial Intelligence (AAAI) (2013)
-
(2013)
AAAI Conference on Artificial Intelligence (AAAI)
-
-
Krishnamoorthy, N.1
Malkarnenkar, G.2
Mooney, R.J.3
Saenko, K.4
Guadarrama, S.5
-
10
-
-
80052901011
-
Baby talk: Understanding and generating simple image descriptions
-
Kulkarni, G., Premraj, V., Dhar, S., Li, S., Choi, Y., Berg, A.C., Berg, T.L.: Baby talk: Understanding and generating simple image descriptions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2011)
-
(2011)
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
-
-
Kulkarni, G.1
Premraj, V.2
Dhar, S.3
Li, S.4
Choi, Y.5
Berg, A.C.6
Berg, T.L.7
-
11
-
-
84878189119
-
Collective generation of natural image descriptions
-
Kuznetsova, P., Ordonez, V., Berg, A.C., Berg, T.L., Choi, Y.: Collective generation of natural image descriptions. In: Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL) (2012)
-
(2012)
Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL)
-
-
Kuznetsova, P.1
Ordonez, V.2
Berg, A.C.3
Berg, T.L.4
Choi, Y.5
-
12
-
-
85034832841
-
Midge: Generating image descriptions from computer vision detections
-
Mitchell, M., Dodge, J., Goyal, A., Yamaguchi, K., Stratos, K., Han, X., Mensch, A., Berg, A.C., Berg, T.L., III H.D.: Midge: Generating image descriptions from computer vision detections. In: Proceedings of the Conference of the European Chapter of the Association for Computational Linguistics (EACL) (2012)
-
(2012)
Proceedings of the Conference of the European Chapter of the Association for Computational Linguistics (EACL)
-
-
Mitchell, M.1
Dodge, J.2
Goyal, A.3
Yamaguchi, K.4
Stratos, K.5
Han, X.6
Mensch, A.7
Berg, A.C.8
Berg, T.L.9
-
13
-
-
84898785648
-
Grounding action descriptions in videos
-
Regneri, M., Rohrbach, M., Wetzel, D., Thater, S., Schiele, B., Pinkal, M.: Grounding action descriptions in videos. Trans. Assoc. Comput. Linguist. (TACL) 1, 25–36 (2013)
-
(2013)
Trans. Assoc. Comput. Linguist. (TACL)
, vol.1
, pp. 25-36
-
-
Regneri, M.1
Rohrbach, M.2
Wetzel, D.3
Thater, S.4
Schiele, B.5
Pinkal, M.6
-
14
-
-
84898775239
-
Translating video content to natural language descriptions
-
Rohrbach, M., Qiu, W., Titov, I., Thater, S., Pinkal, M., Schiele, B.: Translating video content to natural language descriptions. In: IEEE International Conference on Computer Vision (ICCV) (2013)
-
(2013)
IEEE International Conference on Computer Vision (ICCV)
-
-
Rohrbach, M.1
Qiu, W.2
Titov, I.3
Thater, S.4
Pinkal, M.5
Schiele, B.6
-
15
-
-
84867726359
-
Script data for attribute-based recognition of composite activities
-
In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.), Springer, Heidelberg
-
Rohrbach, M., Regneri, M., Andriluka, M., Amin, S., Pinkal, M., Schiele, B.: Script data for attribute-based recognition of composite activities. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part I. LNCS, vol. 7572, pp. 144–157. Springer, Heidelberg (2012)
-
(2012)
ECCV 2012, Part I. LNCS
, vol.7572
, pp. 144-157
-
-
Rohrbach, M.1
Regneri, M.2
Andriluka, M.3
Amin, S.4
Pinkal, M.5
Schiele, B.6
-
17
-
-
84908684165
-
-
arXiv:1403.6173
-
Senina, A., Rohrbach, M., Qiu, W., Friedrich, A., Amin, S., Andriluka, M., Pinkal, M., Schiele, B.: Coherent multi-sentence video description with variable level of detail. arXiv:1403.6173 (2014)
-
(2014)
Coherent multi-sentence video description with variable level of detail
-
-
Senina, A.1
Rohrbach, M.2
Qiu, W.3
Friedrich, A.4
Amin, S.5
Andriluka, M.6
Pinkal, M.7
Schiele, B.8
-
19
-
-
84455192418
-
Towards textually describing complex video contents with audio-visual concept classifiers
-
Tan, C.C., Jiang, Y.G., Ngo, C.W.: Towards textually describing complex video contents with audio-visual concept classifiers. In: ACM Multimedia (2011)
-
(2011)
ACM Multimedia
-
-
Tan, C.C.1
Jiang, Y.G.2
Ngo, C.W.3
-
21
-
-
84876945537
-
Dense trajectories and motion boundary descriptors for action recognition
-
Wang, H., Kläser, A., Schmid, C., Liu, C.: Dense trajectories and motion boundary descriptors for action recognition. Int. J. Comput. Vis. (IJCV) 103, 60–79 (2013)
-
(2013)
Int. J. Comput. Vis. (IJCV)
, vol.103
, pp. 60-79
-
-
Wang, H.1
Kläser, A.2
Schmid, C.3
Liu, C.4
-
23
-
-
0035030120
-
Natural language processing and user modeling: Synergies and limitations
-
Zukerman, I., Litman, D.: Natural language processing and user modeling: Synergies and limitations. User Model. User-Adap. Inter. 11, 129–158 (2001)
-
(2001)
User Model. User-Adap. Inter
, vol.11
, pp. 129-158
-
-
Zukerman, I.1
Litman, D.2
|