SCOPUS 정보 검색 플랫폼

Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

Volumn , Issue , 2013, Pages 2634-2641

A thousand frames in just a few words: Lingual description of videos through latent topics and sparse object stitching

(4) Das, Pradipto a Xu, Chenliang a Doell, Richard F a Corso, Jason J a

a the State University of New York (United States)

Author keywords

multimodal topic model; natural language; video to text; video understanding

Indexed keywords

BOTTOM-UP AND TOP-DOWN; IMAGE DESCRIPTIONS; NATURAL LANGUAGES; NEAREST NEIGHBOR TECHNIQUE; TOPIC MODELING; VIDEO TO TEXT; VIDEO UNDERSTANDING; VISION COMMUNITIES;

COMPUTATIONAL LINGUISTICS; HYBRID SYSTEMS;

PATTERN RECOGNITION;

EID: 84887345951 PISSN: 10636919 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/CVPR.2013.340 Document Type: Conference Paper

Times cited : (318)

References (34)

1
- 84885996388
- Video in sentences out
- A. Barbu, A. Bridge, Z. Burchill, D. Coroian, S. J. Dickinson, S. Fidler, A. Michaux, S. Mussman, S. Narayanaswamy, D. Salvi, L. Schmidt, J. Shangguan, J. M. Siskind, J. W. Waggoner, S. Wang, J. Wei, Y. Yin, and Z. Zhang. Video in sentences out. In UAI, 2012.
- (2012) UAI
- Barbu, A.¹ Bridge, A.² Burchill, Z.³ Coroian, D.⁴ Dickinson, S.J.⁵ Fidler, S.⁶ Michaux, A.⁷ Mussman, S.⁸ Narayanaswamy, S.⁹ Salvi, D.¹⁰ Schmidt, L.¹¹ Shangguan, J.¹² Siskind, J.M.¹³ Waggoner, J.W.¹⁴ Wang, S.¹⁵ Wei, J.¹⁶ Yin, Y.¹⁷ Zhang, Z.¹⁸

2
- 84893365704
- Comparing automatic and human evaluation of nlg systems
- A. Belz and E. Reiter. Comparing automatic and human evaluation of nlg systems. In EACL, 2006.
- (2006) EACL
- Belz, A.¹ Reiter, E.²

3
- 84864999445
- Evaluation of local descriptors for action recognition in videos
- P. Bilinski and F. Bremond. Evaluation of local descriptors for action recognition in videos. In ICCV, 2011.
- (2011) ICCV
- Bilinski, P.¹ Bremond, F.²

4
- 1542287501
- Modeling annotated data
- D. M. Blei and M. I. Jordan. Modeling annotated data. In SIGIR, 2003.
- (2003) SIGIR
- Blei, D.M.¹ Jordan, M.I.²

5
- 0141607824
- Latent dirichlet allocation
- D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. JMLR, 2003.
- (2003) JMLR
- Blei, D.M.¹ Ng, A.Y.² Jordan, M.I.³

6
- 50649087214
- Spatially coherent latent topic model for concurrent segmentation and classification of objects and scenes
- L. Cao and L. Fei-Fei. Spatially coherent latent topic model for concurrent segmentation and classification of objects and scenes. In ICCV, 2007.
- (2007) ICCV
- Cao, L.¹ Fei-Fei, L.²

7
- 84887329824
- Translating related words to videos and back through latent topics
- P. Das, R. K. Srihari, and J. J. Corso. Translating related words to videos and back through latent topics. In ACM WSDM, 2013.
- (2013) ACM WSDM
- Das, P.¹ Srihari, R.K.² Corso, J.J.³

8
- 77951298115
- The Pascal Visual Object Classes (VOC) Challenge
- M. Everingham, L. Van Gool, C. K. I. Williams, J. Winn, and A. Zisserman. The Pascal Visual Object Classes (VOC) Challenge. IJCV, 2010.
- (2010) IJCV
- Everingham, M.¹ Van Gool, L.² Williams, C.K.I.³ Winn, J.⁴ Zisserman, A.⁵

9
- 80052017343
- Every picture tells a story: Generating sentences from images
- A. Farhadi, M. Hejrati, M. A. Sadeghi, P. Young, C. Rashtchian, J. Hockenmaier, and D. Forsyth. Every picture tells a story: generating sentences from images. In ECCV, 2010.
- (2010) ECCV
- Farhadi, A.¹ Hejrati, M.² Sadeghi, M.A.³ Young, P.⁴ Rashtchian, C.⁵ Hockenmaier, J.⁶ Forsyth, D.⁷

10
- 77955422240
- Object detection with discriminatively trained part based models
- P. F. Felzenszwalb, R. B. Girshick, D. McAllester, and D. Ramanan. Object detection with discriminatively trained part based models. TPAMI, 2010.
- (2010) IntPAMI
- Felzenszwalb, P.F.¹ Girshick, R.B.² McAllester, D.³ Ramanan, D.⁴

11
- 80053231413
- Topic models for image annotation and text illustration
- Y. Feng and M. Lapata. Topic models for image annotation and text illustration. In NAACL HLT, 2010.
- (2010) NAACL HLT
- Feng, Y.¹ Lapata, M.²

12
- 77953202699
- Tagprop: Discriminative metric learning in nearest neighbor models for image auto-annotation
- M. Guillaumin, T. Mensink, J. Verbeek, and C. Schmid. Tagprop: Discriminative metric learning in nearest neighbor models for image auto-annotation. In ICCV, 2009.
- (2009) ICCV
- Guillaumin, M.¹ Mensink, T.² Verbeek, J.³ Schmid, C.⁴

13
- 85085175172
- A markov clustering topic model for mining behaviour in video
- T. M. Hospedales, S. Gong, and T. Xiang. A markov clustering topic model for mining behaviour in video. In ICCV, 2009.
- (2009) ICCV
- Hospedales, T.M.¹ Gong, S.² Xiang, T.³

14
- 84863075153
- Towards coherent natural language description of video streams
- M. U. G. Khan, L. Zhang, and Y. Gotoh. Towards coherent natural language description of video streams. In ICCVW, 2011.
- (2011) ICCVW
- Khan, M.U.G.¹ Zhang, L.² Gotoh, Y.³

15
- 84898426452
- A spatio-temporal descriptor based on 3d-gradients
- A. Klaser, M. Marszalek, and C. Schmid. A spatio-temporal descriptor based on 3d-gradients. In BMVC, 2008.
- (2008) BMVC
- Klaser, A.¹ Marszalek, M.² Schmid, C.³

16
- 80052901011
- Baby talk: Understanding and generating simple image descriptions
- G. Kulkarni, V. Premraj, S. Dhar, S. Li, Y. Choi, A. C. Berg, and T. L. Berg. Baby talk: Understanding and generating simple image descriptions. In CVPR, 2011.
- (2011) CVPR
- Kulkarni, G.¹ Premraj, V.² Dhar, S.³ Li, S.⁴ Choi, Y.⁵ Berg, A.C.⁶ Berg, T.L.⁷

17
- 29344465396
- Automatic evaluation of summaries using n-gram co-occurrence statistics
- C.-Y. Lin and E. Hovy. Automatic evaluation of summaries using n-gram co-occurrence statistics. In NAACL HLT, 2003.
- (2003) NAACL HLT
- Lin, C.-Y.¹ Hovy, E.²

18
- 70449580491
- A new baseline for image annotation
- A. Makadia, V. Pavlovic, and S. Kumar. A new baseline for image annotation. In ECCV, 2008.
- (2008) ECCV
- Makadia, A.¹ Pavlovic, V.² Kumar, S.³

19
- 84893398951
- Generating natural-language video descriptions using text-mined knowledge
- G. Malkarnenkar, N. Krishnamoorthy, S. Guadarrama, K. Saenko, and R. Mooney. Generating natural-language video descriptions using text-mined knowledge. In AAAI, 2013.
- (2013) AAAI
- Malkarnenkar, G.¹ Krishnamoorthy, N.² Guadarrama, S.³ Saenko, K.⁴ Mooney, R.⁵

20
- 84905274625
- Trecvid 2012-an overview of the goals, tasks, data, evaluation mechanisms and metrics
- P. Over, G. Awad, M. Michel, J. Fiscus, G. Sanders, B. Shaw, W. Kraaij, A. F. Smeaton, and G. Quéenot. Trecvid 2012-an overview of the goals, tasks, data, evaluation mechanisms and metrics. In TRECVID 2012, 2012.
- (2012) IntRECVID 2012
- Over, P.¹ Awad, G.² Michel, M.³ Fiscus, J.⁴ Sanders, G.⁵ Shaw, B.⁶ Kraaij, W.⁷ Smeaton, A.F.⁸ Quéenot, G.⁹

21
- 85133336275
- Bleu: A method for automatic evaluation of machine translation
- K. Papineni, S. Roukos, T. Ward, and W.-J. Zhu. Bleu: a method for automatic evaluation of machine translation. In ACL, 2002.
- (2002) ACL
- Papineni, K.¹ Roukos, S.² Ward, T.³ Zhu, W.-J.⁴

22
- 77955999239
- Topic regression multi-modal latent dirichlet allocation for image annotation
- D. Putthividhya, H. T. Attias, and S. S. Nagarajan. Topic regression multi-modal latent dirichlet allocation for image annotation. In CVPR, 2010.
- (2010) CVPR
- Putthividhya, D.¹ Attias, H.T.² Nagarajan, S.S.³

23
- 85090348677
- Collecting image annotations using amazon's mechanical turk
- C. Rashtchian, P. Young, M. Hodosh, and J. Hockenmaier. Collecting image annotations using amazon's mechanical turk. In NAACL HLT 2010, 2010.
- (2010) NAACL HLT 2010
- Rashtchian, C.¹ Young, P.² Hodosh, M.³ Hockenmaier, J.⁴

24
- 84887351648
- Script data for attribute-based recognition of composite activities
- M. Rohrbach, M. Regneri, M. Andriluka, S. Amin, M. Pinkal, and B. Schiele. Script data for attribute-based recognition of composite activities. In ECCV, 2012.
- (2012) ECCV
- Rohrbach, M.¹ Regneri, M.² Andriluka, M.³ Amin, S.⁴ Pinkal, M.⁵ Schiele, B.⁶

25
- 84866718894
- Action bank: A high-level representation of activity in video
- S. Sadanand and J. J. Corso. Action bank: A high-level representation of activity in video. In CVPR, 2012.
- (2012) CVPR
- Sadanand, S.¹ Corso, J.J.²

26
- 80052889458
- Recognition using visual phrases
- M. A. Sadeghi and A. Farhadi. Recognition using visual phrases. In CVPR, 2011.
- (2011) CVPR
- Sadeghi, M.A.¹ Farhadi, A.²

27
- 80052908300
- Unbiased look at dataset bias
- A. Torralba and A. A. Efros. Unbiased look at dataset bias. In CVPR, 2011.
- (2011) CVPR
- Torralba, A.¹ Efros, A.A.²

28
- 77955426203
- Evaluating color descriptors for object and scene recognition
- K. E. A. van de Sande, T. Gevers, and C. G. M. Snoek. Evaluating color descriptors for object and scene recognition. TPAMI, 2010.
- (2010) IntPAMI
- Sande De Van A., K.E.¹ Gevers, T.² Snoek, C.G.M.³

29
- 84877615350
- Efficiently scaling up crowdsourced video annotation
- C. Vondrick, D. Patterson, and D. Ramanan. Efficiently scaling up crowdsourced video annotation. IJCV.
- IJCV
- Vondrick, C.¹ Patterson, D.² Ramanan, D.³

30
- 78149294370
- Now Publishers Inc
- M. J. Wainwright and M. I. Jordan. Graphical Models, Exponential Families, and Variational Inference. Now Publishers Inc., 2008.
- (2008) Graphical Models, Exponential Families, and Variational Inference
- Wainwright, M.J.¹ Jordan, M.I.²

31
- 79952129745
- Rethinking LDA: Why priors matter
- H. M. Wallach, D. Mimno, and A. McCallum. Rethinking LDA: Why priors matter. In NIPS, 2009.
- (2009) NIPS
- Wallach, H.M.¹ Mimno, D.² McCallum, A.³

32
- 70450178502
- Simultaneous image classification and annotation
- C. Wang, D. M. Blei, and F.-F. Li. Simultaneous image classification and annotation. In CVPR, 2009.
- (2009) CVPR
- Wang, C.¹ Blei, D.M.² Li, F.-F.³

33
- 77952406197
- Topic models for semantics-preserving video compression
- J. Wanke, A. Ulges, C. H. Lampert, and T. M. Breuel. Topic models for semantics-preserving video compression. In MIR, 2010.
- (2010) MIR
- Wanke, J.¹ Ulges, A.² Lampert, C.H.³ Breuel, T.M.⁴

34
- 80053258778
- Corpus-guided sentence generation of natural images
- Y. Yang, C. L. Teo, H. Daumé III, and Y. Aloimonos. Corpus-guided sentence generation of natural images. In EMNLP, 2011.
- (2011) EMNLP
- Yang, Y.¹ Teo, C.L.² Daumé III, H.³ Aloimonos, Y.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.