SCOPUS Information Search Platform

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volume 8753, 2014, Pages 184-195

Coherent multi-sentence video description with variable level of detail

(6) Rohrbach, Anna (a); Rohrbach, Marcus (a, b); Qiu, Wei (a, c); Friedrich, Annemarie (c); Pinkal, Manfred (c); Schiele, Bernt (a)

a MAX PLANCK INSTITUTE FOR INFORMATICS (Germany)

b UNIVERSITY OF CALIFORNIA (United States)

c SAARLAND UNIVERSITY (Germany)

Author keywords

[No Author keywords available]

Indexed keywords

LEVEL OF DETAIL;

EID: 84908670256
PISSN: 03029743
EISSN: 16113349
Source Type: Book Series
DOI: 10.1007/978-3-319-11752-2_15
Document Type: Conference Paper

Times cited: 166

References (23)

1
- 84887345951
- Thousand frames in just a few words: Lingual description of videos through latent topics and sparse object stitching
- Das, P., Xu, C., Doell, R.F., Corso, J.: Thousand frames in just a few words: Lingual description of videos through latent topics and sparse object stitching. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2013)
- (2013) Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
- Das, P.¹ Xu, C.² Doell, R.F.³ Corso, J.⁴

2
- 80053264667
- Generalizing word lattice translation
- Dyer, C., Muresan, S., Resnik, P.: Generalizing word lattice translation. In: Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL) (2008)
- (2008) Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL)
- Dyer, C.¹ Muresan, S.² Resnik, P.³

3
- 78149311145
- Every picture tells a story: Generating sentences from images
- In: Daniilidis, K., Maragos, P., Paragios, N. (eds.), Springer, Heidelberg
- Farhadi, A., Hejrati, M., Sadeghi, M.A., Young, P., Rashtchian, C., Hockenmaier, J., Forsyth, D.: Every picture tells a story: generating sentences from images. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part IV. LNCS, vol. 6314, pp. 15–29. Springer, Heidelberg (2010)
- (2010) ECCV 2010, Part IV. LNCS , vol.6314 , pp. 15-29
- Farhadi, A.¹ Hejrati, M.² Sadeghi, M.A.³ Young, P.⁴ Rashtchian, C.⁵ Hockenmaier, J.⁶ Forsyth, D.⁷

4
- 84898773262
- Youtube2text: Recognizing and describing arbitrary activities using semantic hierarchies and zero-shot recognition
- Guadarrama, S., Krishnamoorthy, N., Malkarnenkar, G., Mooney, R., Darrell, T., Saenko, K.: Youtube2text: Recognizing and describing arbitrary activities using semantic hierarchies and zero-shot recognition. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV) (2013)
- (2013) Proceedings of the IEEE International Conference on Computer Vision (ICCV)
- Guadarrama, S.¹ Krishnamoorthy, N.² Malkarnenkar, G.³ Mooney, R.⁴ Darrell, T.⁵ Saenko, K.⁶

5
- 70450202741
- Understanding videos, constructing plots: Learning a visually grounded storyline model from annotated videos
- Gupta, A., Srinivasan, P., Shi, J.B., Davis, L.: Understanding videos, constructing plots: Learning a visually grounded storyline model from annotated videos. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2009)
- (2009) Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
- Gupta, A.¹ Srinivasan, P.² Shi, J.B.³ Davis, L.⁴

6
- 84863029475
- Human focused video description
- Khan, M.U.G., Zhang, L., Gotoh, Y.: Human focused video description. In: Proceedings of the IEEE International Conference on Computer Vision Workshops (ICCV Workshops) (2011)
- (2011) Proceedings of the IEEE International Conference on Computer Vision Workshops (ICCV Workshops)
- Khan, M.U.G.¹ Zhang, L.² Gotoh, Y.³

7
- 85110867932
- Moses: Open source toolkit for statistical machine translation
- Koehn, P., Hoang, H., Birch, A., Callison-Burch, C., Federico, M., Bertoldi, N., Cowan, B., Shen, W., Moran, C., Zens, R., Dyer, C., Bojar, O., Constantin, A., Herbst, E.: Moses: Open source toolkit for statistical machine translation. In: Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (demo) (2007)
- (2007) Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (demo)
- Koehn, P.¹ Hoang, H.² Birch, A.³ Callison-Burch, C.⁴ Federico, M.⁵ Bertoldi, N.⁶ Cowan, B.⁷ Shen, W.⁸ Moran, C.⁹ Zens, R.¹⁰ Dyer, C.¹¹ Bojar, O.¹² Constantin, A.¹³ Herbst, E.¹⁴

8
- 0036843382
- Natural language description of human activities from video images based on concept hierarchy of actions
- Kojima, A., Tamura, T., Fukunaga, K.: Natural language description of human activities from video images based on concept hierarchy of actions. Int. J. Comput. Vis. (IJCV) 50, 171–184 (2002)
- (2002) Int. J. Comput. Vis. (IJCV) , vol.50 , pp. 171-184
- Kojima, A.¹ Tamura, T.² Fukunaga, K.³

9
- 84893398951
- Generating natural-language video descriptions using text-mined knowledge
- Krishnamoorthy, N., Malkarnenkar, G., Mooney, R.J., Saenko, K., Guadarrama, S.: Generating natural-language video descriptions using text-mined knowledge. In: AAAI Conference on Artificial Intelligence (AAAI) (2013)
- (2013) AAAI Conference on Artificial Intelligence (AAAI)
- Krishnamoorthy, N.¹ Malkarnenkar, G.² Mooney, R.J.³ Saenko, K.⁴ Guadarrama, S.⁵

10
- 80052901011
- Baby talk: Understanding and generating simple image descriptions
- Kulkarni, G., Premraj, V., Dhar, S., Li, S., Choi, Y., Berg, A.C., Berg, T.L.: Baby talk: Understanding and generating simple image descriptions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2011)
- (2011) Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
- Kulkarni, G.¹ Premraj, V.² Dhar, S.³ Li, S.⁴ Choi, Y.⁵ Berg, A.C.⁶ Berg, T.L.⁷

11
- 84878189119
- Collective generation of natural image descriptions
- Kuznetsova, P., Ordonez, V., Berg, A.C., Berg, T.L., Choi, Y.: Collective generation of natural image descriptions. In: Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL) (2012)
- (2012) Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL)
- Kuznetsova, P.¹ Ordonez, V.² Berg, A.C.³ Berg, T.L.⁴ Choi, Y.⁵

12
- 85034832841
- Midge: Generating image descriptions from computer vision detections
- Mitchell, M., Dodge, J., Goyal, A., Yamaguchi, K., Stratos, K., Han, X., Mensch, A., Berg, A.C., Berg, T.L., Daumé III, H.: Midge: Generating image descriptions from computer vision detections. In: Proceedings of the Conference of the European Chapter of the Association for Computational Linguistics (EACL) (2012)
- (2012) Proceedings of the Conference of the European Chapter of the Association for Computational Linguistics (EACL)
- Mitchell, M.¹ Dodge, J.² Goyal, A.³ Yamaguchi, K.⁴ Stratos, K.⁵ Han, X.⁶ Mensch, A.⁷ Berg, A.C.⁸ Berg, T.L.⁹

13
- 84898785648
- Grounding action descriptions in videos
- Regneri, M., Rohrbach, M., Wetzel, D., Thater, S., Schiele, B., Pinkal, M.: Grounding action descriptions in videos. Trans. Assoc. Comput. Linguist. (TACL) 1, 25–36 (2013)
- (2013) Trans. Assoc. Comput. Linguist. (TACL) , vol.1 , pp. 25-36
- Regneri, M.¹ Rohrbach, M.² Wetzel, D.³ Thater, S.⁴ Schiele, B.⁵ Pinkal, M.⁶

14
- 84898775239
- Translating video content to natural language descriptions
- Rohrbach, M., Qiu, W., Titov, I., Thater, S., Pinkal, M., Schiele, B.: Translating video content to natural language descriptions. In: IEEE International Conference on Computer Vision (ICCV) (2013)
- (2013) IEEE International Conference on Computer Vision (ICCV)
- Rohrbach, M.¹ Qiu, W.² Titov, I.³ Thater, S.⁴ Pinkal, M.⁵ Schiele, B.⁶

15
- 84867726359
- Script data for attribute-based recognition of composite activities
- In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.), Springer, Heidelberg
- Rohrbach, M., Regneri, M., Andriluka, M., Amin, S., Pinkal, M., Schiele, B.: Script data for attribute-based recognition of composite activities. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part I. LNCS, vol. 7572, pp. 144–157. Springer, Heidelberg (2012)
- (2012) ECCV 2012, Part I. LNCS , vol.7572 , pp. 144-157
- Rohrbach, M.¹ Regneri, M.² Andriluka, M.³ Amin, S.⁴ Pinkal, M.⁵ Schiele, B.⁶

16
- 84872942522
- Schmidt, M.: UGM: Matlab code for undirected graphical models (2013). http://www.di.ens.fr/~mschmidt/Software/UGM.html
- UGM: Matlab code for undirected graphical models (2013)
- Schmidt, M.¹

17
- 84908684165
- arXiv:1403.6173
- Senina, A., Rohrbach, M., Qiu, W., Friedrich, A., Amin, S., Andriluka, M., Pinkal, M., Schiele, B.: Coherent multi-sentence video description with variable level of detail. arXiv:1403.6173 (2014)
- (2014) Coherent multi-sentence video description with variable level of detail
- Senina, A.¹ Rohrbach, M.² Qiu, W.³ Friedrich, A.⁴ Amin, S.⁵ Andriluka, M.⁶ Pinkal, M.⁷ Schiele, B.⁸

18
- 84911429020
- Seeing what you're told: Sentence-guided activity recognition in video
- Siddharth, N., Barbu, A., Siskind, J.M.: Seeing what you're told: Sentence-guided activity recognition in video. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2014)
- (2014) Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
- Siddharth, N.¹ Barbu, A.² Siskind, J.M.³

19
- 84455192418
- Towards textually describing complex video contents with audio-visual concept classifiers
- Tan, C.C., Jiang, Y.G., Ngo, C.W.: Towards textually describing complex video contents with audio-visual concept classifiers. In: ACM Multimedia (2011)
- (2011) ACM Multimedia
- Tan, C.C.¹ Jiang, Y.G.² Ngo, C.W.³

20
- 70349362313
- Vedaldi, A., Fulkerson, B.: VLFeat: An open and portable library of computer vision algorithms (2008). http://www.vlfeat.org/
- (2008) VLFeat: An open and portable library of computer vision algorithms
- Vedaldi, A.¹ Fulkerson, B.²

21
- 84876945537
- Dense trajectories and motion boundary descriptors for action recognition
- Wang, H., Kläser, A., Schmid, C., Liu, C.: Dense trajectories and motion boundary descriptors for action recognition. Int. J. Comput. Vis. (IJCV) 103, 60–79 (2013)
- (2013) Int. J. Comput. Vis. (IJCV) , vol.103 , pp. 60-79
- Wang, H.¹ Kläser, A.² Schmid, C.³ Liu, C.⁴

22
- 84897743886
- Grounded language learning from videos described with sentences
- Yu, H., Siskind, J.M.: Grounded language learning from videos described with sentences. In: Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL) (2013)
- (2013) Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL)
- Yu, H.¹ Siskind, J.M.²

23
- 0035030120
- Natural language processing and user modeling: Synergies and limitations
- Zukerman, I., Litman, D.: Natural language processing and user modeling: Synergies and limitations. User Model. User-Adap. Inter. 11, 129–158 (2001)
- (2001) User Model. User-Adap. Inter , vol.11 , pp. 129-158
- Zukerman, I.¹ Litman, D.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.