SCOPUS 정보 검색 플랫폼

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volumn 9908 LNCS, Issue , 2016, Pages 744-759

Multi-region two-stream R-CNN for action detection

(2) Peng, Xiaojiang a Schmid, Cordelia a

a INRIA (France)

Author keywords

Action detection; Faster R CNN; Multi region CNNs; Two stream R CNN

Indexed keywords

VITERBI ALGORITHM;

FASTER R-CNN; HIGH QUALITY; LEVEL DETECTIONS; MOTION REGION; MULTI-REGION CNNS; STATE OF THE ART; SUB-ARRAYS; TWO-STREAM;

CONVOLUTIONAL NEURAL NETWORKS;

EID: 84990036931 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-319-46493-0_45 Document Type: Conference Paper

Times cited : (328)

References (44)

1
- 84898805910
- Action recognition with improved trajectories
- Wang, H., Schmid, C.: Action recognition with improved trajectories. In: ICCV, pp. 3551-3558 (2013)
- (2013) ICCV , pp. 3551-3558
- Wang, H.¹ Schmid, C.²

2
- 84887337772
- Representing videos using mid-level discriminative patches
- Jain, A., Gupta, A., Rodriguez, M., Davis, L.: Representing videos using mid-level discriminative patches. In: CVPR, pp. 2571-2578 (2013)
- (2013) CVPR , pp. 2571-2578
- Jain, A.¹ Gupta, A.² Rodriguez, M.³ Davis, L.⁴

3
- 84906510060
- Action recognition with stacked fisher vectors
- Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.), Springer, Heidelberg
- Peng, X., Zou, C., Qiao, Y., Peng, Q.: Action recognition with stacked fisher vectors. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014, Part V. LNCS, vol. 8693, pp. 581-595. Springer, Heidelberg (2014)
- (2014) ECCV 2014, Part V , vol.8693 , pp. 581-595
- Peng, X.¹ Zou, C.² Qiao, Y.³ Peng, Q.⁴

4
- 84955282488
- Action recognition with trajectory-pooled deepconvolutional descriptors
- Wang, L., Qiao, Y., Tang, X.: Action recognition with trajectory-pooled deepconvolutional descriptors. In: CVPR, pp. 4305-4314 (2015)
- (2015) CVPR , pp. 4305-4314
- Wang, L.¹ Qiao, Y.² Tang, X.³

5
- 84887356306
- Spatiotemporal deformable part models for action detection
- Tian, Y., Sukthankar, R., Shah, M.: Spatiotemporal deformable part models for action detection. In: CVPR, pp. 2642-2649 (2013)
- (2013) CVPR , pp. 2642-2649
- Tian, Y.¹ Sukthankar, R.² Shah, M.³

6
- 84906484374
- Video action detection with relational dynamicposelets
- Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.), Springer, Heidelberg
- Wang, L., Qiao, Y., Tang, X.: Video action detection with relational dynamicposelets. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014, Part V. LNCS, vol. 8693, pp. 565-580. Springer, Heidelberg (2014)
- (2014) ECCV 2014, Part V , vol.8693 , pp. 565-580
- Wang, L.¹ Qiao, Y.² Tang, X.³

7
- 84959196122
- Finding action tubes
- Gkioxari, G., Malik, J.: Finding action tubes. In: CVPR, pp. 759-768 (2015)
- (2015) CVPR , pp. 759-768
- Gkioxari, G.¹ Malik, J.²

8
- 84973931775
- Learning to track for spatio-temporal action localization
- Weinzaepfel, P., Harchaoui, Z., Schmid, C.: Learning to track for spatio-temporal action localization. In: ICCV, pp. 3164-3172 (2015)
- (2015) ICCV , pp. 3164-3172
- Weinzaepfel, P.¹ Harchaoui, Z.² Schmid, C.³

9
- 84884557275
- Temporal localization of actions with actoms
- Gaidon, A., Harchaoui, Z., Schmid, C.: Temporal localization of actions with actoms. PAMI 35(11), 2782-2795 (2013)
- (2013) PAMI , vol.35 , Issue.11 , pp. 2782-2795
- Gaidon, A.¹ Harchaoui, Z.² Schmid, C.³

10
- 84911423364
- Efficient action localization with approximately normalized Fisher vectors
- Oneata, D., Verbeek, J., Schmid, C.: Efficient action localization with approximately normalized Fisher vectors. In: CVPR, pp. 2545-2552 (2014)
- (2014) CVPR , pp. 2545-2552
- Oneata, D.¹ Verbeek, J.² Schmid, C.³

11
- 84925310875
- ChaLearn looking at people challenge 2014: Dataset and results
- Agapito, L., Bronstein, M.M., Rother, C. (eds.), Springer, Heidelberg (2015)
- Escalera, S., et al.: ChaLearn looking at people challenge 2014: dataset and results. In: Agapito, L., Bronstein, M.M., Rother, C. (eds.) ECCV 2014. LNCS, vol. 8925, pp. 459-473. Springer, Heidelberg (2015). doi:10.1007/978-3-319-16178-5_32
- ECCV 2014 , vol.8925 , pp. 459-473
- Escalera, S.¹

12
- 51949101231
- A discriminatively trained, multiscale, deformable part model
- Felzenszwalb, P., McAllester, D., Ramanan, D.: A discriminatively trained, multiscale, deformable part model. In: CVPR, pp. 1-8 (2008)
- (2008) CVPR , pp. 1-8
- Felzenszwalb, P.¹ McAllester, D.² Ramanan, D.³

13
- 85112851150
- Poselets: Body part detectors trained using 3D human pose annotations
- Bourdev, L., Malik, J.: Poselets: body part detectors trained using 3D human pose annotations. In: ICCV, pp. 1365-1372 (2009)
- (2009) ICCV , pp. 1365-1372
- Bourdev, L.¹ Malik, J.²

14
- 84906489617
- Edge boxes: Locating object proposals from edges
- Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.), Springer, Heidelberg
- Zitnick, C.L., Dollar, P.: Edge boxes: locating object proposals from edges. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014, Part V. LNCS, vol. 8693, pp. 391-405. Springer, Heidelberg (2014)
- (2014) ECCV 2014, Part V , vol.8693 , pp. 391-405
- Zitnick, C.L.¹ Dollar, P.²

15
- 84973872492
- Contextual action recognition with R*CNN
- Gkioxari, G., Girshick, R., Malik, J.: Contextual action recognition with R*CNN. In: ICCV, pp. 1080-1088 (2015)
- (2015) ICCV , pp. 1080-1088
- Gkioxari, G.¹ Girshick, R.² Malik, J.³

16
- 84911400494
- Rich feature hierarchies for accurate object detection and semantic segmentation
- Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: CVPR, pp. 580-587 (2014)
- (2014) CVPR , pp. 580-587
- Girshick, R.¹ Donahue, J.² Darrell, T.³ Malik, J.⁴

17
- 84960980241
- Faster R-CNN: Towards real-time object detection with region proposal networks
- Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: NIPS, pp. 91-99 (2015)
- (2015) NIPS , pp. 91-99
- Ren, S.¹ He, K.² Girshick, R.³ Sun, J.⁴

18
- 84973864191
- Object detection via a multi-region and semantic segmentation-aware CNN model
- Gidaris, S., Komodakis, N.: Object detection via a multi-region and semantic segmentation-aware CNN model. In: ICCV, pp. 1134-1142 (2015)
- (2015) ICCV , pp. 1134-1142
- Gidaris, S.¹ Komodakis, N.²

19
- 84881160857
- Selective search for object recognition
- Uijlings, J.R., van de Sande, K.E., Gevers, T., Smeulders, A.W.: Selective search for object recognition. IJCV 104(2), 154-171 (2013)
- (2013) IJCV , vol.104 , Issue.2 , pp. 154-171
- Uijlings, J.R.¹ Van De Sande, K.E.² Gevers, T.³ Smeulders, A.W.⁴

20
- 84898819791
- Towards understanding action recognition
- Jhuang, H., Gall, J., Zuffi, S., Schmid, C., Black, M.: Towards understanding action recognition. In: ICCV, pp. 3192-3199 (2013)
- (2013) ICCV , pp. 3192-3199
- Jhuang, H.¹ Gall, J.² Zuffi, S.³ Schmid, C.⁴ Black, M.⁵

21
- 84973879622
- P-CNN: Pose-based CNN features for action recognition
- Cheron, G., Laptev, I., Schmid, C.: P-CNN: pose-based CNN features for action recognition. In: ICCV, pp. 3218-3226 (2015)
- (2015) ICCV , pp. 3218-3226
- Cheron, G.¹ Laptev, I.² Schmid, C.³

22
- 84937862424
- Two-stream convolutional networks for action recognition in videos
- Simonyan, K., Zisserman, A.: Two-stream convolutional networks for action recognition in videos. In: NIPS, pp. 568-576 (2014)
- (2014) NIPS , pp. 568-576
- Simonyan, K.¹ Zisserman, A.²

23
- 79955649703
- Human activity analysis: A review
- Aggarwal, J.K., Ryoo, M.S.: Human activity analysis: a review. ACM Comput. Surv. (CSUR) 43(3), 16 (2011)
- (2011) ACM Comput. Surv. (CSUR) , vol.43 , Issue.3 , pp. 16
- Aggarwal, J.K.¹ Ryoo, M.S.²

24
- 84930630277
- LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature 521(7553), 436-444 (2015)
- (2015) Deep Learning. Nature , vol.521 , Issue.7553 , pp. 436-444
- Lecun, Y.¹ Bengio, Y.² Hinton, G.³

25
- 0345414182
- Video Google: A text retrieval approach to object matching in videos
- Sivic, J., Zisserman, A.: Video Google: a text retrieval approach to object matching in videos. In: ICCV, pp. 1470-1477 (2003)
- (2003) ICCV , pp. 1470-1477
- Sivic, J.¹ Zisserman, A.²

26
- 84911364368
- Largescale video classification with convolutional neural networks
- Karpathy, A., Toderici, G., Shetty, S., Leung, T., Sukthankar, R., Fei-Fei, L.: Largescale video classification with convolutional neural networks. In: CVPR, pp. 1725-1732 (2014)
- (2014) CVPR , pp. 1725-1732
- Karpathy, A.¹ Toderici, G.² Shetty, S.³ Leung, T.⁴ Sukthankar, R.⁵ Fei-Fei, L.⁶

27
- 84911395416
- DL-SFA: Deeply-learned slow feature analysis for action recognition
- Sun, L., Jia, K., Chan, T.H., Fang, Y., Wang, G., Yan, S.: DL-SFA: deeply-learned slow feature analysis for action recognition. In: CVPR, pp. 2625-2632 (2014)
- (2014) CVPR , pp. 2625-2632
- Sun, L.¹ Jia, K.² Chan, T.H.³ Fang, Y.⁴ Wang, G.⁵ Yan, S.⁶

28
- 78149348137
- Improving the fisher kernel for large-scale image lassification
- Daniilidis, K., Maragos, P., Paragios, N. (eds.), Springer, Heidelberg
- Perronnin, F., Sanchez, J., Mensink, T.: Improving the fisher kernel for large-scale image lassification. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part IV. LNCS, vol. 6314, pp. 143-156. Springer, Heidelberg (2010)
- (2010) ECCV 2010, Part IV , vol.6314 , pp. 143-156
- Perronnin, F.¹ Sanchez, J.² Mensink, T.³

29
- 50649122769
- Retrieving actions in movies
- Laptev, I., Pérez, P.: Retrieving actions in movies. In: ICCV 2007, pp. 1-8 (2007)
- (2007) ICCV 2007 , pp. 1-8
- Laptev, I.¹ Pérez, P.²

30
- 70450164163
- Discriminative subvolume search for efficient action detection
- Yuan, J., Liu, Z., Wu, Y.: Discriminative subvolume search for efficient action detection. In: CVPR, pp. 2442-2449 (2009)
- (2009) CVPR , pp. 2442-2449
- Yuan, J.¹ Liu, Z.² Wu, Y.³

31
- 51949084792
- Action MACH a spatio-temporal maximum average correlation height filter for action recognition
- Rodriguez, M.D., Ahmed, J., Shah, M.: Action MACH a spatio-temporal maximum average correlation height filter for action recognition. In: CVPR, pp. 1-8 (2008)
- (2008) CVPR , pp. 1-8
- Rodriguez, M.D.¹ Ahmed, J.² Shah, M.³

32
- 77955992066
- Efficient action spotting based on a spacetime oriented structure representation
- Derpanis, K.G., Sizintsev, M., Cannons, K., Wildes, R.P.: Efficient action spotting based on a spacetime oriented structure representation. In: CVPR, pp. 1990-1997 (2010)
- (2010) CVPR , pp. 1990-1997
- Derpanis, K.G.¹ Sizintsev, M.² Cannons, K.³ Wildes, R.P.⁴

33
- 84891607575
- Video event detection: From subvolume localization to spatiotemporal path search
- Tran, D., Yuan, J., Forsyth, D.: Video event detection: from subvolume localization to spatiotemporal path search. PAMI 36(2), 404-416 (2014)
- (2014) PAMI , vol.36 , Issue.2 , pp. 404-416
- Tran, D.¹ Yuan, J.² Forsyth, D.³

34
- 84876231242
- ImageNet classification with deep convolutional neural networks
- Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: NIPS, pp. 1097-1105 (2012)
- (2012) NIPS , pp. 1097-1105
- Krizhevsky, A.¹ Sutskever, I.² Hinton, G.E.³

35
- 84939247735
- Spatial pyramid pooling in deep convolutional networks for visual recognition
- He, K., Zhang, X., Ren, S., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition. PAMI 37(9), 1904-1916 (2015)
- (2015) PAMI , vol.37 , Issue.9 , pp. 1904-1916
- He, K.¹ Zhang, X.² Ren, S.³ Sun, J.⁴

36
- 84964588182
- Fast R-CNN
- Girshick, R.: Fast R-CNN. In: ICCV, pp. 1440-1448 (2015)
- (2015) ICCV , pp. 1440-1448
- Girshick, R.¹

37
- 84935113569
- Error bounds for convolutional codes and an asymptotically optimum decoding algorithm
- Viterbi, A.J.: Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. Inf. Theory 13(2), 260-269 (1967)
- (1967) Inf. Theory , vol.13 , Issue.2 , pp. 260-269
- Viterbi, A.J.¹

38
- 84925410541
- arXiv:1409.1556
- Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556 (2014)
- (2014) Very Deep Convolutional Networks for Large-Scale Image Recognition
- Simonyan, K.¹ Zisserman, A.²

39
- 84947041871
- ImageNet large scale visual recognition challenge
- Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., Huang, Z., Karpathy, A., Khosla, A., Bernstein, M., et al.: ImageNet large scale visual recognition challenge. IJCV 115(3), 211-252 (2015)
- (2015) IJCV , vol.115 , Issue.3 , pp. 211-252
- Russakovsky, O.¹ Deng, J.² Su, H.³ Krause, J.⁴ Satheesh, S.⁵ Ma, S.⁶ Huang, Z.⁷ Karpathy, A.⁸ Khosla, A.⁹ Bernstein, M.¹⁰

40
- 84959216100
- Convolutional feature masking for joint object and stuff segmentation
- Dai, J., He, K., Sun, J.: Convolutional feature masking for joint object and stuff segmentation. In: CVPR, pp. 3992-4000 (2015)
- (2015) CVPR , pp. 3992-4000
- Dai, J.¹ He, K.² Sun, J.³

41
- 84976702883
- Programming pearls: Algorithm design techniques
- Bentley, J.: Programming pearls: algorithm design techniques. Commun. ACM 27(9), 865-873 (1984)
- (1984) Commun. ACM , vol.27 , Issue.9 , pp. 865-873
- Bentley, J.¹

42
- 84984657335
- arXiv:1212.0402
- Soomro, K., Zamir, A.R., Shah, M.: UCF101: a dataset of 101 human actions classes from videos in the wild. arXiv:1212.0402 (2012)
- (2012) UCF101: A Dataset of 101 Human Actions Classes from Videos in the Wild
- Soomro, K.¹ Zamir, A.R.² Shah, M.³

43
- 35048833329
- High accuracy optical flow estimation based on a theory for warping
- Pajdla, T., Matas, J.G. (eds.), Springer, Heidelberg
- Brox, T., Bruhn, A., Papenberg, N., Weickert, J.: High accuracy optical flow estimation based on a theory for warping. In: Pajdla, T., Matas, J.G. (eds.) ECCV 2004. LNCS, vol. 3024, pp. 25-36. Springer, Heidelberg (2004)
- (2004) ECCV 2004 , vol.3024 , pp. 25-36
- Brox, T.¹ Bruhn, A.² Papenberg, N.³ Weickert, J.⁴

44
- 84959191147
- Fast action proposals for human action detection and search
- Yu, G., Yuan, J.: Fast action proposals for human action detection and search. In: CVPR, pp. 1302-1311 (2015)
- (2015) CVPR , pp. 1302-1311
- Yu, G.¹ Yuan, J.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.