SCOPUS 정보 검색 플랫폼

Proceedings of the IEEE International Conference on Computer Vision

Volumn 2015 International Conference on Computer Vision, ICCV 2015, Issue , 2015, Pages 4588-4596

Objects2action: Classifying and localizing actions without any video example

(4) Jain, Mihir a Gemert, Jan C Van a,c Mensink, Thomas a Snoek, Cees G M a,b

a UNIVERSITY OF AMSTERDAM (Netherlands)

b DELFT UNIVERSITY OF TECHNOLOGY (Netherlands)

c Qualcomm Research (Netherlands)

Author keywords

[No Author keywords available]

Indexed keywords

SEMANTICS;

ATTRIBUTE MAPPINGS; AUTOMATED SELECTION; CONVEX COMBINATIONS; GRAM MODELS; OBJECT CATEGORIES; OBJECT ENCODING; SEMANTIC EMBEDDING; SPATIO TEMPORAL;

COMPUTER VISION;

EID: 84973868024 PISSN: 15505499 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICCV.2015.521 Document Type: Conference Paper

Times cited : (172)

References (49)

1
- 84887338331
- Labelembedding for attribute-based classification
- Z. Akata, F. Perronnin, Z. Harchaoui, and C. Schmid. Labelembedding for attribute-based classification. In CVPR, 2013.
- (2013) CVPR
- Akata, Z.¹ Perronnin, F.² Harchaoui, Z.³ Schmid, C.⁴

2
- 77955989314
- Cross-dataset action detection
- L. Cao, Z. Liu, and T. S. Huang. Cross-dataset action detection. In CVPR, 2010.
- (2010) CVPR
- Cao, L.¹ Liu, Z.² Huang, T.S.³

3
- 84899755756
- Event-driven semantic concept discovery by exploiting weakly tagged internet images
- J. Chen, Y. Cui, G. Ye, D. Liu, and S.-F. Chang. Event-driven semantic concept discovery by exploiting weakly tagged internet images. In ICMR, 2014.
- (2014) ICMR
- Chen, J.¹ Cui, Y.² Ye, G.³ Liu, D.⁴ Chang, S.-F.⁵

4
- 84959216393
- Textual similarity with a bag-ofembedded-words model
- S. Clinchant and F. Perronnin. Textual similarity with a bag-ofembedded-words model. In ICTIR, 2013.
- (2013) ICTIR
- Clinchant, S.¹ Perronnin, F.²

5
- 85198028989
- Imagenet: A large-scale hierarchical image database
- J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei. ImageNet: A Large-Scale Hierarchical Image Database. In CVPR, 2009.
- (2009) CVPR
- Deng, J.¹ Dong, W.² Socher, R.³ Li, L.-J.⁴ Li, K.⁵ Fei-Fei, L.⁶

6
- 84898803425
- Write a classifier: Zeroshot learning using purely textual descriptions
- M. Elhoseiny, B. Saleh, and A. Elgammal. Write a classifier: Zeroshot learning using purely textual descriptions. In ICCV, 2013.
- (2013) ICCV
- Elhoseiny, M.¹ Saleh, B.² Elgammal, A.³

7
- 84900116262
- Evaluation of color spatio-temporal interest points for human action recognition
- I. Everts, J. C. van Gemert, and T. Gevers. Evaluation of color spatio-temporal interest points for human action recognition. TIP, 23(4):1569-1580, 2014.
- (2014) TIP , vol.23 , Issue.4 , pp. 1569-1580
- Everts, I.¹ Van Gemert, J.C.² Gevers, T.³

8
- 70450207704
- Describing objects by their attributes
- A. Farhadi, I. Endres, D. Hoiem, and D. Forsyth. Describing objects by their attributes. In CVPR, 2009.
- (2009) CVPR
- Farhadi, A.¹ Endres, I.² Hoiem, D.³ Forsyth, D.⁴

9
- 84898958665
- Devise: A deep visual-semantic embedding model
- A. FRome, G. Corrado, J. Shlens, S. Bengio, J. Dean, M. Ranzato, and T. Mikolov. Devise: A deep visual-semantic embedding model. In NIPS, 2013.
- (2013) NIPS
- Rome, F.A.¹ Corrado, G.² Shlens, J.³ Bengio, S.⁴ Dean, J.⁵ Ranzato, M.⁶ Mikolov, T.⁷

10
- 84898773262
- Youtube2text: Recognizing and describing arbitrary activities using semantic hierarchies and zero-shot recognition
- S. Guadarrama, N. Krishnamoorthy, G. Malkarnenkar, S. Venugopalan, R. Mooney, T. Darrell, and K. Saenko. Youtube2text: Recognizing and describing arbitrary activities using semantic hierarchies and zero-shot recognition. In ICCV, 2013.
- (2013) ICCV
- Guadarrama, S.¹ Krishnamoorthy, N.² Malkarnenkar, G.³ Venugopalan, S.⁴ Mooney, R.⁵ Darrell, T.⁶ Saenko, K.⁷

11
- 84899708619
- Composite concept discovery for zero-shot video event detection
- A. Habibian, T. Mensink, and C. Snoek. Composite concept discovery for zero-shot video event detection. In ICMR, 2014.
- (2014) ICMR
- Habibian, A.¹ Mensink, T.² Snoek, C.³

12
- 84887398298
- Better exploiting motion for better action recognition
- M. Jain, H. Jégou, and P. Bouthemy. Better exploiting motion for better action recognition. In CVPR, 2013.
- (2013) CVPR
- Jain, M.¹ Jégou, H.² Bouthemy, P.³

13
- 84911453664
- Action localization with tubelets from motion
- M. Jain, J. van Gemert, H. Jégou, P. Bouthemy, and C. Snoek. Action localization with tubelets from motion. In CVPR, 2014.
- (2014) CVPR
- Jain, M.¹ Van Gemert, J.² Jégou, H.³ Bouthemy, P.⁴ Snoek, C.⁵

14
- 84959235126
- What do 15, 000 object categories tell us about classifying and localizing actions? in
- M. Jain, J. van Gemert, and C. Snoek. What do 15, 000 object categories tell us about classifying and localizing actions? In CVPR, 2015.
- (2015) CVPR
- Jain, M.¹ Van Gemert, J.² Snoek, C.³

15
- 84899746163
- Zero-example event search using multimodal pseudo relevance feedback
- L. Jiang, T. Mitamura, S. Yu, and A. Hauptmann. Zero-example event search using multimodal pseudo relevance feedback. In ICMR, 2014.
- (2014) ICMR
- Jiang, L.¹ Mitamura, T.² Yu, S.³ Hauptmann, A.⁴

16
- 84905052261
- Y.-G. Jiang, J. Liu, A. Roshan Zamir, G. Toderici, I. Laptev, M. Shah, and R. Sukthankar. THUMOS challenge: Action recognition with a large number of classes. http://crcv. ucf. edu/THUMOS14/, 2014.
- (2014) THUMOS Challenge: Action Recognition with A Large Number of Classes.
- Jiang, Y.-G.¹ Liu, J.² Roshan Zamir, A.³ Toderici, G.⁴ Laptev, I.⁵ Shah, M.⁶ Sukthankar, R.⁷

17
- 84911364368
- Large-scale video classification with convolutional neural networks
- A. Karpathy, G. Toderici, S. Shetty, T. Leung, R. Sukthankar, and L. Fei-Fei. Large-scale video classification with convolutional neural networks. In CVPR, 2014.
- (2014) CVPR
- Karpathy, A.¹ Toderici, G.² Shetty, S.³ Leung, T.⁴ Sukthankar, R.⁵ Fei-Fei, L.⁶

18
- 84876231242
- Imagenet classification with deep convolutional neural networks
- A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. In NIPS, 2012.
- (2012) NIPS
- Krizhevsky, A.¹ Sutskever, I.² Hinton, G.E.³

19
- 84856682691
- HMDB: A large video database for human motion recognition
- H. Kuehne, H. Jhuang, E. Garrote, T. Poggio, and T. Serre. HMDB: A large video database for human motion recognition. In ICCV, 2011.
- (2011) ICCV
- Kuehne, H.¹ Jhuang, H.² Garrote, E.³ Poggio, T.⁴ Serre, T.⁵

20
- 70450172710
- Learning to detect unseen object classes by between-class attribute transfer
- C. Lampert, H. Nickisch, and S. Harmeling. Learning to detect unseen object classes by between-class attribute transfer. In CVPR, 2009.
- (2009) CVPR
- Lampert, C.¹ Nickisch, H.² Harmeling, S.³

21
- 84863083227
- Discriminative figure-centric models for joint action localization and recognition
- T. Lan, Y. Wang, and G. Mori. Discriminative figure-centric models for joint action localization and recognition. In ICCV, 2011.
- (2011) ICCV
- Lan, T.¹ Wang, Y.² Mori, G.³

22
- 84919829999
- Distributed representations of sentences and documents
- Q. Le and T. Mikolov. Distributed representations of sentences and documents. In ICML, 2014.
- (2014) ICML
- Le, Q.¹ Mikolov, T.²

23
- 84894424062
- Object bank: An objectlevel image representation for high-level visual recognition
- L.-J. Li, H. Su, Y. Lim, and L. Fei-Fei. Object bank: An objectlevel image representation for high-level visual recognition. IJCV, 107(1):20-39, 2014.
- (2014) IJCV , vol.107 , Issue.1 , pp. 20-39
- Li, L.-J.¹ Su, H.² Lim, Y.³ Fei-Fei, L.⁴

24
- 80052915325
- Recognizing human actions by attributes
- J. Liu, B. Kuipers, and S. Savarese. Recognizing human actions by attributes. In CVPR, 2011.
- (2011) CVPR
- Liu, J.¹ Kuipers, B.² Savarese, S.³

25
- 84911410734
- Costa: Co-occurrence statistics for zero-shot classification
- T. Mensink, E. Gavves, and C. Snoek. Costa: Co-occurrence statistics for zero-shot classification. In CVPR, 2014.
- (2014) CVPR
- Mensink, T.¹ Gavves, E.² Snoek, C.³

26
- 85083951332
- Efficient estimation of word representations in vector space
- T. Mikolov, K. Chen, G. Corrado, and J. Dean. Efficient estimation of word representations in vector space. In ICLR, 2013.
- (2013) ICLR
- Mikolov, T.¹ Chen, K.² Corrado, G.³ Dean, J.⁴

27
- 84898956512
- Distributed representations of words and phrases and their compositionality
- T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean. Distributed representations of words and phrases and their compositionality. In NIPS, 2013.
- (2013) NIPS
- Mikolov, T.¹ Sutskever, I.² Chen, K.³ Corrado, G.S.⁴ Dean, J.⁵

28
- 84926034201
- Evaluating neural word representations in tensor-based compositional settings
- D. Milajevs, D. Kartsaklis, M. Sadrzadeh, and M. Purver. Evaluating neural word representations in tensor-based compositional settings. In EMNLP, 2014.
- (2014) EMNLP
- Milajevs, D.¹ Kartsaklis, D.² Sadrzadeh, M.³ Purver, M.⁴

29
- 85083952206
- Zero-shot learning by convex combination of semantic embeddings
- M. Norouzi, T. Mikolov, S. Bengio, Y. Singer, J. Shlens, A. FRome, G. Corrado, and J. Dean. Zero-shot learning by convex combination of semantic embeddings. In ICLR, 2014.
- (2014) ICLR
- Norouzi, M.¹ Mikolov, T.² Bengio, S.³ Singer, Y.⁴ Shlens, J.⁵ Frome, A.⁶ Corrado, G.⁷ Dean, J.⁸

30
- 84939802616
- Spatio-temporal object detection proposals
- D. Oneata, J. Revaud, J. Verbeek, and C. Schmid. Spatio-temporal object detection proposals. In ECCV, 2014.
- (2014) ECCV
- Oneata, D.¹ Revaud, J.² Verbeek, J.³ Schmid, C.⁴

31
- 85085788280
- Trecvid 2013-an introduction to the goals, tasks, data, evaluation mechanisms, and metrics
- P. Over, G. Awad, J. Fiscus, and G. Sanders. Trecvid 2013-an introduction to the goals, tasks, data, evaluation mechanisms, and metrics. In TRECVID Workshop, 2013.
- (2013) TRECVID Workshop
- Over, P.¹ Awad, G.² Fiscus, J.³ Sanders, G.⁴

32
- 84856670612
- Relative attributes
- D. Parikh and K. Grauman. Relative attributes. In ICCV, 2011.
- (2011) ICCV
- Parikh, D.¹ Grauman, K.²

33
- 84959242896
- Boosting vlad with supervised dictionary learning and high-order statistics
- X. Peng, L. Wang, Y. Qiao, and Q. Peng. Boosting vlad with supervised dictionary learning and high-order statistics. In ECCV, 2014.
- (2014) ECCV
- Peng, X.¹ Wang, L.² Qiao, Y.³ Peng, Q.⁴

34
- 84947130265
- Action recognition with stacked fisher vectors
- X. Peng, C. Zou, Y. Qiao, and Q. Peng. Action recognition with stacked fisher vectors. In ECCV, 2014.
- (2014) ECCV
- Peng, X.¹ Zou, C.² Qiao, Y.³ Peng, Q.⁴

35
- 51949084792
- Action Mach: A spatiotemporal maximum average correlation height filter for action recognition
- M. D. Rodriguez, J. Ahmed, and M. Shah. Action MACH: A spatiotemporal maximum average correlation height filter for action recognition. In CVPR, 2008.
- (2008) CVPR
- Rodriguez, M.D.¹ Ahmed, J.² Shah, M.³

36
- 80052892795
- Evaluating knowledge transfer and zero-shot learning in a large-scale setting
- M. Rohrbach, M. Stark, and B. Schiele. Evaluating knowledge transfer and zero-shot learning in a large-scale setting. In CVPR, 2011.
- (2011) CVPR
- Rohrbach, M.¹ Stark, M.² Schiele, B.³

37
- 84866718894
- Action bank: A high-level representation of activity in video
- S. Sadanand and J. J. Corso. Action bank: A high-level representation of activity in video. In CVPR, 2012.
- (2012) CVPR
- Sadanand, S.¹ Corso, J.J.²

38
- 84883487458
- Image classification with the fisher vector: Theory and practice
- J. Sánchez, F. Perronnin, T. Mensink, and J. Verbeek. Image classification with the fisher vector: Theory and practice. IJCV, 2013.
- (2013) IJCV
- Sánchez, J.¹ Perronnin, F.² Mensink, T.³ Verbeek, J.⁴

39
- 84937862424
- Two-stream convolutional networks for action recognition in videos
- K. Simonyan and A. Zisserman. Two-stream convolutional networks for action recognition in videos. In NIPS, 2014.
- (2014) NIPS
- Simonyan, K.¹ Zisserman, A.²

40
- 85085787180
- MediaMill at TRECVID 2013: Searching concepts, objects, instances and events in video
- C. Snoek et al. MediaMill at TRECVID 2013: Searching concepts, objects, instances and events in video. In TRECVID, 2013.
- (2013) TRECVID
- Snoek, C.¹

41
- 84898938559
- Zero-shot learning through cross-modal transfer
- R. Socher, M. Ganjoo, C. D. Manning, and A. Ng. Zero-shot learning through cross-modal transfer. In NIPS, 2013.
- (2013) NIPS
- Socher, R.¹ Ganjoo, M.² Manning, C.D.³ Ng, A.⁴

42
- 84893702065
- UCF101: A dataset of 101 human actions classes from videos in the wild
- K. Soomro, A. R. Zamir, and M. Shah. UCF101: A dataset of 101 human actions classes from videos in the wild. CoRR, 2012.
- (2012) CoRR
- Soomro, K.¹ Zamir, A.R.² Shah, M.³

43
- 84973858597
- Semantic aware video transcription using random forest classifiers
- C. Sun and R. Nevatia. Semantic aware video transcription using random forest classifiers. In ECCV, 2014.
- (2014) ECCV
- Sun, C.¹ Nevatia, R.²

44
- 84949572890
- arXiv preprint arXiv:1503. 01817
- B. Thomee, D. A. Shamma, G. Friedland, B. Elizalde, K. Ni, D. Poland, D. Borth, and L.-J. Li. The new data and new challenges in multimedia research. ArXiv preprint arXiv:1503. 01817, 2015.
- (2015) The New Data and New Challenges in Multimedia Research
- Thomee, B.¹ Shamma, D.A.² Friedland, G.³ Elizalde, B.⁴ Ni, K.⁵ Poland, D.⁶ Borth, D.⁷ Li, L.-J.⁸

45
- 84887356306
- Spatiotemporal deformable part models for action detection
- Y. Tian, R. Sukthankar, and M. Shah. Spatiotemporal deformable part models for action detection. In CVPR, 2013.
- (2013) CVPR
- Tian, Y.¹ Sukthankar, R.² Shah, M.³

46
- 84973913561
- APT: Action localization proposals from dense trajectories
- J. van Gemert, M. Jain, E. Gati, and C. Snoek. APT: Action localization proposals from dense trajectories. In BMVC, 2015.
- (2015) BMVC
- Van Gemert, J.¹ Jain, M.² Gati, E.³ Snoek, C.⁴

47
- 84898805910
- Action recognition with improved trajectories
- H. Wang and C. Schmid. Action recognition with improved trajectories. In ICCV, 2013.
- (2013) ICCV
- Wang, H.¹ Schmid, C.²

48
- 84911434661
- Zeroshot event detection using multi-modal fusion of weakly supervised concepts
- S. Wu, S. Bondugula, F. Luisier, X. Zhuang, and P. Natarajan. Zeroshot event detection using multi-modal fusion of weakly supervised concepts. In CVPR, 2014.
- (2014) CVPR
- Wu, S.¹ Bondugula, S.² Luisier, F.³ Zhuang, X.⁴ Natarajan, P.⁵

49
- 84959226659
- A discriminative CNN video representation for event detection
- Z. Xu, Y. Yang, and A. G. Hauptmann. A discriminative CNN video representation for event detection. In CVPR, 2015.
- (2015) CVPR
- Xu, Z.¹ Yang, Y.² Hauptmann, A.G.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.