SCOPUS Information Retrieval Platform

Proceedings of the 27th AAAI Conference on Artificial Intelligence, AAAI 2013

Volume , Issue , 2013, Pages 541-547

Generating natural-language video descriptions using text-mined knowledge

(5) Krishnamoorthy, Niveda (a); Malkarnenkar, Girish (a); Mooney, Raymond (a); Saenko, Kate (b); Guadarrama, Sergio (c)

(a) UNIVERSITY OF TEXAS AT AUSTIN (United States)

(b) UNIVERSITY OF MASSACHUSETTS LOWELL (United States)

(c) UNIVERSITY OF CALIFORNIA (United States)

Author keywords

[No Author keywords available]

Indexed keywords

CONTEXTUAL INFORMATION; REAL-WORLD; SELECTION ALGORITHM; TEXT CORPORA; ARTIFICIAL INTELLIGENCE

EID: 84893398951
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited: 192

References (29)

1
- 84885996388
- Video in sentences out
- Barbu, A.; Bridge, A.; Burchill, Z.; Coroian, D.; Dickinson, S.; Fidler, S.; Michaux, A.; Mussman, S.; Narayanaswamy, S.; Salvi, D.; et al. 2012. Video in sentences out. In UAI, 102-112.
- (2012) UAI , pp. 102-112
- Barbu, A.¹ Bridge, A.² Burchill, Z.³ Coroian, D.⁴ Dickinson, S.⁵ Fidler, S.⁶ Michaux, A.⁷ Mussman, S.⁸ Narayanaswamy, S.⁹ Salvi, D.¹⁰

2
- 79955702502
- LIBSVM: A library for support vector machines
- Chang, C., and Lin, C. 2011. LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology (TIST) 2(3):27.
- (2011) ACM Transactions on Intelligent Systems and Technology (TIST) , vol.2 , Issue.3 , pp. 27
- Chang, C.¹ Lin, C.²

3
- 0033329799
- An empirical study of smoothing techniques for language modeling
- Chen, S., and Goodman, J. 1999. An empirical study of smoothing techniques for language modeling. Computer Speech & Language 13(4):359-393.
- (1999) Computer Speech & Language , vol.13 , Issue.4 , pp. 359-393
- Chen, S.¹ Goodman, J.²

4
- 84893397324
- Collecting highly parallel data for paraphrase evaluation
- Chen, D.; Dolan, W.; Raghavan, S.; Huynh, T.; Mooney, R.; Blythe, J.; Hobbs, J.; Domingos, P.; Kate, R.; Garrette, D.; et al. 2010. Collecting highly parallel data for paraphrase evaluation. JAIR 37:397-435.
- (2010) JAIR , vol.37 , pp. 397-435
- Chen, D.¹ Dolan, W.² Raghavan, S.³ Huynh, T.⁴ Mooney, R.⁵ Blythe, J.⁶ Hobbs, J.⁷ Domingos, P.⁸ Kate, R.⁹ Garrette, D.¹⁰

5
- 85037338954
- Generating typed dependency parses from phrase structure parses
- De Marneffe, M.; MacCartney, B.; and Manning, C. 2006. Generating typed dependency parses from phrase structure parses. In Proceedings of LREC, volume 6, 449-454.
- (2006) Proceedings of LREC , vol.6 , pp. 449-454
- De Marneffe, M.¹ MacCartney, B.² Manning, C.³

6
- 84864139941
- Beyond audio and video retrieval: Towards multimedia summarization
- ACM
- Ding, D.; Metze, F.; Rawat, S.; Schulam, P.; Burger, S.; Younessian, E.; Bao, L.; Christel, M.; and Hauptmann, A. 2012. Beyond audio and video retrieval: towards multimedia summarization. In Proceedings of the 2nd ACM International Conference on Multimedia Retrieval, 2. ACM.
- (2012) Proceedings of the 2nd ACM International Conference on Multimedia Retrieval , vol.2
- Ding, D.¹ Metze, F.² Rawat, S.³ Schulam, P.⁴ Burger, S.⁵ Younessian, E.⁶ Bao, L.⁷ Christel, M.⁸ Hauptmann, A.⁹

7
- 78149311145
- Every picture tells a story: Generating sentences from images
- Farhadi, A.; Hejrati, M.; Sadeghi, M.; Young, P.; Rashtchian, C.; Hockenmaier, J.; and Forsyth, D. 2010. Every picture tells a story: Generating sentences from images. Computer Vision-ECCV 2010 15-29.
- (2010) Computer Vision-ECCV 2010 , pp. 15-29
- Farhadi, A.¹ Hejrati, M.² Sadeghi, M.³ Young, P.⁴ Rashtchian, C.⁵ Hockenmaier, J.⁶ Forsyth, D.⁷

8
- 51949101231
- A discriminatively trained, multiscale, deformable part model
- IEEE
- Felzenszwalb, P.; McAllester, D.; and Ramanan, D. 2008. A discriminatively trained, multiscale, deformable part model. In Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on, 1-8. IEEE.
- (2008) Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on , pp. 1-8
- Felzenszwalb, P.¹ McAllester, D.² Ramanan, D.³

9
- 84898785322
- Describing video contents in natural language
- Khan, M., and Gotoh, Y. 2012. Describing video contents in natural language. EACL 2012 27.
- (2012) EACL 2012 , pp. 27
- Khan, M.¹ Gotoh, Y.²

10
- 0036843382
- Natural language description of human activities from video images based on concept hierarchy of actions
- Kojima, A.; Tamura, T.; and Fukunaga, K. 2002. Natural language description of human activities from video images based on concept hierarchy of actions. International Journal of Computer Vision 50(2):171-184.
- (2002) International Journal of Computer Vision , vol.50 , Issue.2 , pp. 171-184
- Kojima, A.¹ Tamura, T.² Fukunaga, K.³

11
- 80052901011
- Baby talk: Understanding and generating simple image descriptions
- IEEE
- Kulkarni, G.; Premraj, V.; Dhar, S.; Li, S.; Choi, Y.; Berg, A.; and Berg, T. 2011. Baby talk: Understanding and generating simple image descriptions. In CVPR, 1601-1608. IEEE.
- (2011) CVPR , pp. 1601-1608
- Kulkarni, G.¹ Premraj, V.² Dhar, S.³ Li, S.⁴ Choi, Y.⁵ Berg, A.⁶ Berg, T.⁷

12
- 50649122769
- Retrieving actions in movies
- IEEE
- Laptev, I., and Perez, P. 2007. Retrieving actions in movies. In Computer Vision, 2007. ICCV 2007. IEEE 11th International Conference on, 1-8. IEEE.
- (2007) Computer Vision, 2007. ICCV 2007. IEEE 11th International Conference on , pp. 1-8
- Laptev, I.¹ Perez, P.²

13
- 51949083365
- Learning realistic human actions from movies
- IEEE
- Laptev, I.; Marszalek, M.; Schmid, C.; and Rozenfeld, B. 2008. Learning realistic human actions from movies. In CVPR, 1-8. IEEE.
- (2008) CVPR , pp. 1-8
- Laptev, I.¹ Marszalek, M.² Schmid, C.³ Rozenfeld, B.⁴

14
- 51849094354
- Save: A framework for semantic annotation of visual events
- IEEE
- Lee, M.; Hakeem, A.; Haering, N.; and Zhu, S. 2008. Save: A framework for semantic annotation of visual events. In CVPRW, 1-8. IEEE.
- (2008) CVPRW , pp. 1-8
- Lee, M.¹ Hakeem, A.² Haering, N.³ Zhu, S.⁴

15
- 85162513516
- Object bank: A high-level image representation for scene classification and semantic feature sparsification
- Li, L.; Su, H.; Xing, E.; and Fei-Fei, L. 2010. Object bank: A high-level image representation for scene classification and semantic feature sparsification. Advances in Neural Information Processing Systems 24.
- (2010) Advances in Neural Information Processing Systems , vol.24
- Li, L.¹ Su, H.² Xing, E.³ Fei-Fei, L.⁴

16
- 84862279067
- Composing simple image descriptions using web-scale ngrams
- Association for Computational Linguistics
- Li, S.; Kulkarni, G.; Berg, T.; Berg, A.; and Choi, Y. 2011. Composing simple image descriptions using web-scale ngrams. In Proceedings of the Fifteenth Conference on Computational Natural Language Learning, 220-228. Association for Computational Linguistics.
- (2011) Proceedings of the Fifteenth Conference on Computational Natural Language Learning , pp. 220-228
- Li, S.¹ Kulkarni, G.² Berg, T.³ Berg, A.⁴ Choi, Y.⁵

17
- 85095712904
- Syntactic annotations for the Google books ngram corpus
- Lin, Y.; Michel, J.; Aiden, E.; Orwant, J.; Brockman, W.; and Petrov, S. 2012. Syntactic annotations for the google books ngram corpus. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics.
- (2012) Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics
- Lin, Y.¹ Michel, J.² Aiden, E.³ Orwant, J.⁴ Brockman, W.⁵ Petrov, S.⁶

18
- 84959182849
- Improving video activity recognition using object recognition and text mining
- Motwani, T., and Mooney, R. 2012. Improving video activity recognition using object recognition and text mining. ECAI.
- (2012) ECAI
- Motwani, T.¹ Mooney, R.²

19
- 85162522202
- Im2text: Describing images using 1 million captioned photographs
- Ordonez, V.; Kulkarni, G.; and Berg, T. 2011. Im2text: Describing images using 1 million captioned photographs. In Proceedings of NIPS.
- (2011) Proceedings of NIPS
- Ordonez, V.¹ Kulkarni, G.² Berg, T.³

20
- 84866717619
- A combined pose, object, and feature model for action understanding
- IEEE
- Packer, B.; Saenko, K.; and Koller, D. 2012. A combined pose, object, and feature model for action understanding. In CVPR, 1378-1385. IEEE.
- (2012) CVPR , pp. 1378-1385
- Packer, B.¹ Saenko, K.² Koller, D.³

21
- 84455177089
- Faster and smaller n-gram language models
- Pauls, A., and Klein, D. 2011. Faster and smaller n-gram language models. In Proceedings of the 49th annual meeting of the Association for Computational Linguistics: Human Language Technologies, volume 1, 258-267.
- (2011) Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies , vol.1 , pp. 258-267
- Pauls, A.¹ Klein, D.²

22
- 85081941118
- WordNet::Similarity: Measuring the relatedness of concepts
- Association for Computational Linguistics
- Pedersen, T.; Patwardhan, S.; and Michelizzi, J. 2004. WordNet::Similarity: Measuring the relatedness of concepts. In Demonstration Papers at HLT-NAACL 2004, 38-41. Association for Computational Linguistics.
- (2004) Demonstration Papers at HLT-NAACL 2004 , pp. 38-41
- Pedersen, T.¹ Patwardhan, S.² Michelizzi, J.³

23
- 84879550059
- Recognizing 50 human action categories of web videos
- Reddy, K., and Shah, M. 2012. Recognizing 50 human action categories of web videos. Machine Vision and Applications 1-11.
- (2012) Machine Vision and Applications , pp. 1-11
- Reddy, K.¹ Shah, M.²

24
- 10044233701
- Recognizing human actions: A local SVM approach
- IEEE
- Schuldt, C.; Laptev, I.; and Caputo, B. 2004. Recognizing human actions: A local SVM approach. In Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on, volume 3, 32-36. IEEE.
- (2004) Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on , vol.3 , pp. 32-36
- Schuldt, C.¹ Laptev, I.² Caputo, B.³

25
- 80052877143
- Action recognition by dense trajectories
- IEEE
- Wang, H.; Klaser, A.; Schmid, C.; and Liu, C.-L. 2011. Action recognition by dense trajectories. In CVPR, 3169-3176. IEEE.
- (2011) CVPR , pp. 3169-3176
- Wang, H.¹ Klaser, A.² Schmid, C.³ Liu, C.-L.⁴

26
- 80053258778
- Corpus-guided sentence generation of natural images
- Association for Computational Linguistics
- Yang, Y.; Teo, C. L.; Daumé, III, H.; and Aloimonos, Y. 2011. Corpus-guided sentence generation of natural images. In EMNLP, EMNLP '11, 444-454. Association for Computational Linguistics.
- (2011) EMNLP, EMNLP '11 , pp. 444-454
- Yang, Y.¹ Teo, C.L.² Daumé III, H.³ Aloimonos, Y.⁴

27
- 77955988492
- Modeling mutual context of object and human pose in human-object interaction activities
- Yao, B., and Fei-Fei, L. 2010. Modeling mutual context of object and human pose in human-object interaction activities. In CVPR.
- (2010) CVPR
- Yao, B.¹ Fei-Fei, L.²

28
- 77954862144
- I2t: Image parsing to text description
- Yao, B.; Yang, X.; Lin, L.; Lee, M.; and Zhu, S. 2010. I2t: Image parsing to text description. Proceedings of the IEEE 98(8):1485-1508.
- (2010) Proceedings of the IEEE , vol.98 , Issue.8 , pp. 1485-1508
- Yao, B.¹ Yang, X.² Lin, L.³ Lee, M.⁴ Zhu, S.⁵

29
- 33846580425
- Local features and kernels for classification of texture and object categories: A comprehensive study
- Zhang, J.; Marszałek, M.; Lazebnik, S.; and Schmid, C. 2007. Local features and kernels for classification of texture and object categories: A comprehensive study. International Journal of Computer Vision 73(2):213-238.
- (2007) International Journal of Computer Vision , vol.73 , Issue.2 , pp. 213-238
- Zhang, J.¹ Marszałek, M.² Lazebnik, S.³ Schmid, C.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.