SCOPUS Information Search Platform

Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

Volume 07-12-June-2015, 2015, Pages 1600-1609

Recognize complex events from static images by fusing deep channels

(4) Xiong, Yuanjun (a); Zhu, Kai (a); Lin, Dahua (a); Tang, Xiaoou (a, b)

(a) CHINESE UNIVERSITY OF HONG KONG (Hong Kong)

(b) SHENZHEN INSTITUTES OF ADVANCED TECHNOLOGY (China)

Author keywords

[No Author keywords available]

Indexed keywords

COMPLEX NETWORKS; COMPUTER VISION; SEMANTICS;

EVENT RECOGNITION; INDIVIDUAL OBJECTS; NOVEL STRATEGIES; PERSONAL LIVES; SEMANTIC FUSION; SOCIAL ACTIVITIES; STATE OF THE ART; VISUAL APPEARANCE;

PATTERN RECOGNITION;

EID: 84959226544 PISSN: 10636919 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/CVPR.2015.7298768 Document Type: Conference Paper

Times cited : (139)

References (41)

1
- 80052870289
- Probabilistic event logic for interval-based event recognition
- June
- W. Brendel, A. Fern, and S. Todorovic. Probabilistic event logic for interval-based event recognition. In Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on, pages 3329-3336, June 2011
- (2011) Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on , pp. 3329-3336
- Brendel, W.¹ Fern, A.² Todorovic, S.³

2
- 84867846313
- Detecting actions, poses, and objects with relational phraselets
- Springer
- C. Desai and D. Ramanan. Detecting actions, poses, and objects with relational phraselets. In Computer Vision-ECCV 2012, pages 158-172. Springer, 2012
- (2012) Computer Vision-ECCV 2012 , pp. 158-172
- Desai, C.¹ Ramanan, D.²

3
- 84903622275
- Fast feature pyramids for object detection
- Aug
- P. Dollár, R. Appel, S. Belongie, and P. Perona. Fast feature pyramids for object detection. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 36(8):1532-1545, Aug 2014
- (2014) Pattern Analysis and Machine Intelligence, IEEE Transactions on , vol.36 , Issue.8 , pp. 1532-1545
- Dollár, P.¹ Appel, R.² Belongie, S.³ Perona, P.⁴

4
- 33846622081
- Behavior recognition via sparse spatio-temporal features
- IEEE
- P. Dollár, V. Rabaud, G. Cottrell, and S. Belongie. Behavior recognition via sparse spatio-temporal features. In Visual Surveillance and Performance Evaluation of Tracking and Surveillance, 2005. 2nd Joint IEEE International Workshop on, pages 65-72. IEEE, 2005
- (2005) Visual Surveillance and Performance Evaluation of Tracking and Surveillance, 2005. 2nd Joint IEEE International Workshop on , pp. 65-72
- Dollár, P.¹ Rabaud, V.² Cottrell, G.³ Belongie, S.⁴

5
- 84865579385
- Visual event recognition in videos by learning from web data
- Sept
- L. Duan, D. Xu, I.-H. Tsang, and J. Luo. Visual event recognition in videos by learning from web data. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 34(9):1667-1680, Sept 2012
- (2012) Pattern Analysis and Machine Intelligence, IEEE Transactions on , vol.34 , Issue.9 , pp. 1667-1680
- Duan, L.¹ Xu, D.² Tsang, I.-H.³ Luo, J.⁴

6
- 84876258641
- Learning hierarchical features for scene labeling
- C. Farabet, C. Couprie, L. Najman, and Y. LeCun. Learning hierarchical features for scene labeling. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 35(8):1915-1929, 2013
- (2013) Pattern Analysis and Machine Intelligence, IEEE Transactions on , vol.35 , Issue.8 , pp. 1915-1929
- Farabet, C.¹ Couprie, C.² Najman, L.³ LeCun, Y.⁴

7
- 84911400494
- Rich feature hierarchies for accurate object detection and semantic segmentation
- June
- R. Girshick, J. Donahue, T. Darrell, and J. Malik. Rich feature hierarchies for accurate object detection and semantic segmentation. In Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on, pages 580-587, June 2014
- (2014) Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on , pp. 580-587
- Girshick, R.¹ Donahue, J.² Darrell, T.³ Malik, J.⁴

8
- 84906344142
- Learning rich features from rgb-d images for object detection and segmentation
- Springer International Publishing
- S. Gupta, R. Girshick, P. Arbeláez, and J. Malik. Learning rich features from rgb-d images for object detection and segmentation. In Computer Vision ECCV 2014, volume 8695 of Lecture Notes in Computer Science, pages 345-360. Springer International Publishing, 2014
- (2014) Computer Vision ECCV 2014, Volume 8695 of Lecture Notes in Computer Science , pp. 345-360
- Gupta, S.¹ Girshick, R.² Arbeláez, P.³ Malik, J.⁴

9
- 33746649771
- Semantic analysis of soccer video using dynamic Bayesian network
- C.-L. Huang, H.-C. Shih, and C.-Y. Chao. Semantic analysis of soccer video using dynamic Bayesian network. Multimedia, IEEE Transactions on, 8(4):749-760, 2006
- (2006) Multimedia, IEEE Transactions on , vol.8 , Issue.4 , pp. 749-760
- Huang, C.-L.¹ Shih, H.-C.² Chao, C.-Y.³

10
- 77957969222
- Recognizing actions from still images
- IEEE
- N. Ikizler, R. G. Cinbis, S. Pehlivan, and P. Duygulu. Recognizing actions from still images. In Pattern Recognition, 2008. ICPR 2008. 19th International Conference on, pages 1-4. IEEE, 2008
- (2008) Pattern Recognition, 2008. ICPR 2008. 19th International Conference on , pp. 1-4
- Ikizler, N.¹ Cinbis, R.G.² Pehlivan, S.³ Duygulu, P.⁴

11
- 84913555165
- arXiv preprint arXiv:1408.5093
- Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long, R. Girshick, S. Guadarrama, and T. Darrell. Caffe: Convolutional architecture for fast feature embedding. arXiv preprint arXiv:1408.5093, 2014
- (2014) Caffe: Convolutional Architecture for Fast Feature Embedding
- Jia, Y.¹ Shelhamer, E.² Donahue, J.³ Karayev, S.⁴ Long, J.⁵ Girshick, R.⁶ Guadarrama, S.⁷ Darrell, T.⁸

12
- 84876231242
- Imagenet classification with deep convolutional neural networks
- Curran Associates, Inc.
- A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems 25, pages 1097-1105. Curran Associates, Inc., 2012
- (2012) Advances in Neural Information Processing Systems , vol.25 , pp. 1097-1105
- Krizhevsky, A.¹ Sutskever, I.² Hinton, G.E.³

13
- 33845572523
- Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories
- IEEE
- S. Lazebnik, C. Schmid, and J. Ponce. Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In Computer Vision and Pattern Recognition, 2006 IEEE Computer Society Conference on, volume 2, pages 2169-2178. IEEE, 2006
- (2006) Computer Vision and Pattern Recognition, 2006 IEEE Computer Society Conference on , vol.2 , pp. 2169-2178
- Lazebnik, S.¹ Schmid, C.² Ponce, J.³

14
- 84887340599
- Learning surf cascade for fast and accurate object detection
- IEEE
- J. Li and Y. Zhang. Learning surf cascade for fast and accurate object detection. In Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on, pages 3468-3475. IEEE, 2013
- (2013) Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on , pp. 3468-3475
- Li, J.¹ Zhang, Y.²

15
- 50649103674
- What, where and who? Classifying events by scene and object recognition
- Oct
- L.-J. Li and L. Fei-Fei. What, where and who? classifying events by scene and object recognition. In Computer Vision, 2007. ICCV 2007. IEEE 11th International Conference on, pages 1-8, Oct 2007
- (2007) Computer Vision, 2007. ICCV 2007. IEEE 11th International Conference on , pp. 1-8
- Li, L.-J.¹ Fei-Fei, L.²

16
- 70450219021
- Towards total scene understanding: Classification, annotation and segmentation in an automatic framework
- June
- L.-J. Li, R. Socher, and L. Fei-Fei. Towards total scene understanding: Classification, annotation and segmentation in an automatic framework. In Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on, pages 2036-2043, June 2009
- (2009) Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on , pp. 2036-2043
- Li, L.-J.¹ Socher, R.² Fei-Fei, L.³

17
- 85162513516
- Object bank: A high-level image representation for scene classification & semantic feature sparsification
- L.-J. Li, H. Su, L. Fei-Fei, and E. P. Xing. Object bank: A high-level image representation for scene classification & semantic feature sparsification. In Advances in neural information processing systems, pages 1378-1386, 2010
- (2010) Advances in Neural Information Processing Systems , pp. 1378-1386
- Li, L.-J.¹ Su, H.² Fei-Fei, L.³ Xing, E.P.⁴

18
- 70350581485
- Exploiting multi-modal interactions: A unified framework
- M. Li, X.-B. Xue, and Z.-H. Zhou. Exploiting multi-modal interactions: A unified framework. In IJCAI, pages 1120-1125, 2009
- (2009) IJCAI , pp. 1120-1125
- Li, M.¹ Xue, X.-B.² Zhou, Z.-H.³

19
- 84906486177
- Exploiting privileged information from web data for image categorization
- Springer International Publishing
- W. Li, L. Niu, and D. Xu. Exploiting privileged information from web data for image categorization. In Computer Vision ECCV 2014, volume 8693 of Lecture Notes in Computer Science, pages 437-452. Springer International Publishing, 2014
- (2014) Computer Vision ECCV 2014, Volume 8693 of Lecture Notes in Computer Science , pp. 437-452
- Li, W.¹ Niu, L.² Xu, D.³

20
- 84898770979
- Pedestrian parsing via deep decompositional network
- IEEE
- P. Luo, X. Wang, and X. Tang. Pedestrian parsing via deep decompositional network. In Computer Vision (ICCV), 2013 IEEE International Conference on, pages 2648-2655. IEEE, 2013
- (2013) Computer Vision (ICCV), 2013 IEEE International Conference on , pp. 2648-2655
- Luo, P.¹ Wang, X.² Tang, X.³

21
- 80052880806
- Action recognition from a distributed representation of pose and appearance
- IEEE
- S. Maji, L. Bourdev, and J. Malik. Action recognition from a distributed representation of pose and appearance. In Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on, pages 3177-3184. IEEE, 2011
- (2011) Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on , pp. 3177-3184
- Maji, S.¹ Bourdev, L.² Malik, J.³

22
- 33747626730
- Large-scale concept ontology for multimedia
- M. Naphade, J. R. Smith, J. Tesic, S.-F. Chang, W. Hsu, L. Kennedy, A. Hauptmann, and J. Curtis. Large-scale concept ontology for multimedia. MultiMedia, IEEE, 13(3):86-91, 2006
- (2006) MultiMedia, IEEE , vol.13 , Issue.3 , pp. 86-91
- Naphade, M.¹ Smith, J.R.² Tesic, J.³ Chang, S.-F.⁴ Hsu, W.⁵ Kennedy, L.⁶ Hauptmann, A.⁷ Curtis, J.⁸

23
- 80053437179
- Multimodal deep learning
- J. Ngiam, A. Khosla, M. Kim, J. Nam, H. Lee, and A. Y. Ng. Multimodal deep learning. In Proceedings of the 28th International Conference on Machine Learning (ICML-11), pages 689-696, 2011
- (2011) Proceedings of the 28th International Conference on Machine Learning (ICML-11) , pp. 689-696
- Ngiam, J.¹ Khosla, A.² Kim, M.³ Nam, J.⁴ Lee, H.⁵ Ng, A.Y.⁶

24
- 0035328421
- Modeling the shape of the scene: A holistic representation of the spatial envelope
- A. Oliva and A. Torralba. Modeling the shape of the scene: A holistic representation of the spatial envelope. International journal of computer vision, 42(3):145-175, 2001
- (2001) International Journal of Computer Vision , vol.42 , Issue.3 , pp. 145-175
- Oliva, A.¹ Torralba, A.²

25
- 85077311325
- Trecvid 2014-an overview of the goals, tasks, data, evaluation mechanisms and metrics
- USA
- P. Over, G. Awad, M. Michel, J. Fiscus, G. Sanders, W. Kraaij, A. F. Smeaton, and G. Quenot. Trecvid 2014-an overview of the goals, tasks, data, evaluation mechanisms and metrics. In Proceedings of TRECVID 2014. NIST, USA, 2014
- (2014) Proceedings of TRECVID 2014. NIST
- Over, P.¹ Awad, G.² Michel, M.³ Fiscus, J.⁴ Sanders, G.⁵ Kraaij, W.⁶ Smeaton, A.F.⁷ Quenot, G.⁸

26
- 84856650974
- Scene recognition and weakly supervised object localization with deformable part-based models
- Nov
- M. Pandey and S. Lazebnik. Scene recognition and weakly supervised object localization with deformable part-based models. In Computer Vision (ICCV), 2011 IEEE International Conference on, pages 1307-1314, Nov 2011
- (2011) Computer Vision (ICCV), 2011 IEEE International Conference on , pp. 1307-1314
- Pandey, M.¹ Lazebnik, S.²

27
- 78149349613
- Tracklet descriptors for action modeling and video analysis
- Springer
- M. Raptis and S. Soatto. Tracklet descriptors for action modeling and video analysis. In Computer Vision-ECCV 2010, pages 577-590. Springer, 2010
- (2010) Computer Vision-ECCV 2010 , pp. 577-590
- Raptis, M.¹ Soatto, S.²

28
- 84909978410
- CoRR, abs/1409.0575
- O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. S. Bernstein, A. C. Berg, and L. Fei-Fei. Imagenet large scale visual recognition challenge. CoRR, abs/1409.0575, 2014
- (2014) Imagenet Large Scale Visual Recognition Challenge
- Russakovsky, O.¹ Deng, J.² Su, H.³ Krause, J.⁴ Satheesh, S.⁵ Ma, S.⁶ Huang, Z.⁷ Karpathy, A.⁸ Khosla, A.⁹ Bernstein, M.S.¹⁰ Berg, A.C.¹¹ Fei-Fei, L.¹²

29
- 84924949081
- CoRR, abs/1406.2199
- K. Simonyan and A. Zisserman. Two-stream convolutional networks for action recognition in videos. CoRR, abs/1406.2199, 2014
- (2014) Two-stream Convolutional Networks for Action Recognition in Videos
- Simonyan, K.¹ Zisserman, A.²

30
- 84877724347
- Multimodal learning with deep boltzmann machines
- Curran Associates, Inc.
- N. Srivastava and R. Salakhutdinov. Multimodal learning with deep boltzmann machines. In Advances in Neural Information Processing Systems 25, pages 2222-2230. Curran Associates, Inc., 2012
- (2012) Advances in Neural Information Processing Systems , vol.25 , pp. 2222-2230
- Srivastava, N.¹ Salakhutdinov, R.²

31
- 84906338496
- CoRR, abs/1406.4773
- Y. Sun, X. Wang, and X. Tang. Deep learning face representation by joint identification-verification. CoRR, abs/1406.4773, 2014
- (2014) Deep Learning Face Representation by Joint Identification-verification
- Sun, Y.¹ Wang, X.² Tang, X.³

32
- 84866658784
- Learning latent temporal structure for complex event detection
- June
- K. Tang, L. Fei-Fei, and D. Koller. Learning latent temporal structure for complex event detection. In Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on, pages 1250-1257, June 2012
- (2012) Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on , pp. 1250-1257
- Tang, K.¹ Fei-Fei, L.² Koller, D.³

33
- 84881160857
- Selective search for object recognition
- J. R. Uijlings, K. E. van de Sande, T. Gevers, and A. W. Smeulders. Selective search for object recognition. International journal of computer vision, 104(2):154-171, 2013
- (2013) International Journal of Computer Vision , vol.104 , Issue.2 , pp. 154-171
- Uijlings, J.R.¹ van de Sande, K.E.² Gevers, T.³ Smeulders, A.W.⁴

34
- 84898890371
- Evaluation of local spatio-temporal features for action recognition
- H. Wang, M. M. Ullah, A. Klaser, I. Laptev, C. Schmid, et al. Evaluation of local spatio-temporal features for action recognition. In BMVC 2009-British Machine Vision Conference, 2009
- (2009) BMVC 2009-British Machine Vision Conference
- Wang, H.¹ Ullah, M.M.² Klaser, A.³ Laptev, I.⁴ Schmid, C.⁵

35
- 84911434661
- Zero-shot event detection using multi-modal fusion of weakly supervised concepts
- IEEE
- S. Wu, S. Bondugula, F. Luisier, X. Zhuang, and P. Natarajan. Zero-shot event detection using multi-modal fusion of weakly supervised concepts. In Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on, pages 2665-2672. IEEE, 2014
- (2014) Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on , pp. 2665-2672
- Wu, S.¹ Bondugula, S.² Luisier, F.³ Zhuang, X.⁴ Natarajan, P.⁵

36
- 77955988947
- Sun database: Large-scale scene recognition from abbey to zoo
- June
- J. Xiao, J. Hays, K. Ehinger, A. Oliva, and A. Torralba. Sun database: Large-scale scene recognition from abbey to zoo. In Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on, pages 3485-3492, June 2010
- (2010) Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on , pp. 3485-3492
- Xiao, J.¹ Hays, J.² Ehinger, K.³ Oliva, A.⁴ Torralba, A.⁵

37
- 85060905667
- Recognizing human action in time-sequential images using hidden markov model
- IEEE
- J. Yamato, J. Ohya, and K. Ishii. Recognizing human action in time-sequential images using hidden markov model. In Computer Vision and Pattern Recognition, 1992. Proceedings CVPR'92., 1992 IEEE Computer Society Conference on, pages 379-385. IEEE, 1992
- (1992) Computer Vision and Pattern Recognition, 1992. Proceedings CVPR'92., 1992 IEEE Computer Society Conference on , pp. 379-385
- Yamato, J.¹ Ohya, J.² Ishii, K.³

38
- 77955996308
- Recognizing human actions from still images with latent poses
- IEEE
- W. Yang, Y. Wang, and G. Mori. Recognizing human actions from still images with latent poses. In Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on, pages 2030-2037. IEEE, 2010
- (2010) Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on , pp. 2030-2037
- Yang, W.¹ Wang, Y.² Mori, G.³

39
- 84867886443
- Complex events detection using data-driven concepts
- Springer Berlin Heidelberg
- Y. Yang and M. Shah. Complex events detection using data-driven concepts. In Computer Vision ECCV 2012, volume 7574, pages 722-735. Springer Berlin Heidelberg, 2012
- (2012) Computer Vision ECCV 2012 , vol.7574 , pp. 722-735
- Yang, Y.¹ Shah, M.²

40
- 84865593256
- Recognizing human-object interactions in still images by modeling the mutual context of objects and human poses
- B. Yao and L. Fei-Fei. Recognizing human-object interactions in still images by modeling the mutual context of objects and human poses. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 34(9):1691-1703, 2012
- (2012) Pattern Analysis and Machine Intelligence, IEEE Transactions on , vol.34 , Issue.9 , pp. 1691-1703
- Yao, B.¹ Fei-Fei, L.²

41
- 84906489617
- Edge boxes: Locating object proposals from edges
- Springer International Publishing
- C. Zitnick and P. Dollár. Edge boxes: Locating object proposals from edges. In Computer Vision ECCV 2014, volume 8693, pages 391-405. Springer International Publishing, 2014
- (2014) Computer Vision ECCV 2014 , vol.8693 , pp. 391-405
- Zitnick, C.¹ Dollár, P.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.