메뉴 건너뛰기




Volumn 9908 LNCS, Issue , 2016, Pages 744-759

Multi-region two-stream R-CNN for action detection

Author keywords

Action detection; Faster R CNN; Multi region CNNs; Two stream R CNN

Indexed keywords

VITERBI ALGORITHM;

EID: 84990036931     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-319-46493-0_45     Document Type: Conference Paper
Times cited : (328)

References (44)
  • 1
    • 84898805910 scopus 로고    scopus 로고
    • Action recognition with improved trajectories
    • Wang, H., Schmid, C.: Action recognition with improved trajectories. In: ICCV, pp. 3551-3558 (2013)
    • (2013) ICCV , pp. 3551-3558
    • Wang, H.1    Schmid, C.2
  • 2
    • 84887337772 scopus 로고    scopus 로고
    • Representing videos using mid-level discriminative patches
    • Jain, A., Gupta, A., Rodriguez, M., Davis, L.: Representing videos using mid-level discriminative patches. In: CVPR, pp. 2571-2578 (2013)
    • (2013) CVPR , pp. 2571-2578
    • Jain, A.1    Gupta, A.2    Rodriguez, M.3    Davis, L.4
  • 3
    • 84906510060 scopus 로고    scopus 로고
    • Action recognition with stacked fisher vectors
    • Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.), Springer, Heidelberg
    • Peng, X., Zou, C., Qiao, Y., Peng, Q.: Action recognition with stacked fisher vectors. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014, Part V. LNCS, vol. 8693, pp. 581-595. Springer, Heidelberg (2014)
    • (2014) ECCV 2014, Part V , vol.8693 , pp. 581-595
    • Peng, X.1    Zou, C.2    Qiao, Y.3    Peng, Q.4
  • 4
    • 84955282488 scopus 로고    scopus 로고
    • Action recognition with trajectory-pooled deepconvolutional descriptors
    • Wang, L., Qiao, Y., Tang, X.: Action recognition with trajectory-pooled deepconvolutional descriptors. In: CVPR, pp. 4305-4314 (2015)
    • (2015) CVPR , pp. 4305-4314
    • Wang, L.1    Qiao, Y.2    Tang, X.3
  • 5
    • 84887356306 scopus 로고    scopus 로고
    • Spatiotemporal deformable part models for action detection
    • Tian, Y., Sukthankar, R., Shah, M.: Spatiotemporal deformable part models for action detection. In: CVPR, pp. 2642-2649 (2013)
    • (2013) CVPR , pp. 2642-2649
    • Tian, Y.1    Sukthankar, R.2    Shah, M.3
  • 6
    • 84906484374 scopus 로고    scopus 로고
    • Video action detection with relational dynamicposelets
    • Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.), Springer, Heidelberg
    • Wang, L., Qiao, Y., Tang, X.: Video action detection with relational dynamicposelets. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014, Part V. LNCS, vol. 8693, pp. 565-580. Springer, Heidelberg (2014)
    • (2014) ECCV 2014, Part V , vol.8693 , pp. 565-580
    • Wang, L.1    Qiao, Y.2    Tang, X.3
  • 7
    • 84959196122 scopus 로고    scopus 로고
    • Finding action tubes
    • Gkioxari, G., Malik, J.: Finding action tubes. In: CVPR, pp. 759-768 (2015)
    • (2015) CVPR , pp. 759-768
    • Gkioxari, G.1    Malik, J.2
  • 8
    • 84973931775 scopus 로고    scopus 로고
    • Learning to track for spatio-temporal action localization
    • Weinzaepfel, P., Harchaoui, Z., Schmid, C.: Learning to track for spatio-temporal action localization. In: ICCV, pp. 3164-3172 (2015)
    • (2015) ICCV , pp. 3164-3172
    • Weinzaepfel, P.1    Harchaoui, Z.2    Schmid, C.3
  • 9
    • 84884557275 scopus 로고    scopus 로고
    • Temporal localization of actions with actoms
    • Gaidon, A., Harchaoui, Z., Schmid, C.: Temporal localization of actions with actoms. PAMI 35(11), 2782-2795 (2013)
    • (2013) PAMI , vol.35 , Issue.11 , pp. 2782-2795
    • Gaidon, A.1    Harchaoui, Z.2    Schmid, C.3
  • 10
    • 84911423364 scopus 로고    scopus 로고
    • Efficient action localization with approximately normalized Fisher vectors
    • Oneata, D., Verbeek, J., Schmid, C.: Efficient action localization with approximately normalized Fisher vectors. In: CVPR, pp. 2545-2552 (2014)
    • (2014) CVPR , pp. 2545-2552
    • Oneata, D.1    Verbeek, J.2    Schmid, C.3
  • 11
    • 84925310875 scopus 로고    scopus 로고
    • ChaLearn looking at people challenge 2014: Dataset and results
    • Agapito, L., Bronstein, M.M., Rother, C. (eds.), Springer, Heidelberg (2015)
    • Escalera, S., et al.: ChaLearn looking at people challenge 2014: dataset and results. In: Agapito, L., Bronstein, M.M., Rother, C. (eds.) ECCV 2014. LNCS, vol. 8925, pp. 459-473. Springer, Heidelberg (2015). doi:10.1007/978-3-319-16178-5_32
    • ECCV 2014 , vol.8925 , pp. 459-473
    • Escalera, S.1
  • 12
    • 51949101231 scopus 로고    scopus 로고
    • A discriminatively trained, multiscale, deformable part model
    • Felzenszwalb, P., McAllester, D., Ramanan, D.: A discriminatively trained, multiscale, deformable part model. In: CVPR, pp. 1-8 (2008)
    • (2008) CVPR , pp. 1-8
    • Felzenszwalb, P.1    McAllester, D.2    Ramanan, D.3
  • 13
    • 85112851150 scopus 로고    scopus 로고
    • Poselets: Body part detectors trained using 3D human pose annotations
    • Bourdev, L., Malik, J.: Poselets: body part detectors trained using 3D human pose annotations. In: ICCV, pp. 1365-1372 (2009)
    • (2009) ICCV , pp. 1365-1372
    • Bourdev, L.1    Malik, J.2
  • 14
    • 84906489617 scopus 로고    scopus 로고
    • Edge boxes: Locating object proposals from edges
    • Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.), Springer, Heidelberg
    • Zitnick, C.L., Dollar, P.: Edge boxes: locating object proposals from edges. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014, Part V. LNCS, vol. 8693, pp. 391-405. Springer, Heidelberg (2014)
    • (2014) ECCV 2014, Part V , vol.8693 , pp. 391-405
    • Zitnick, C.L.1    Dollar, P.2
  • 15
    • 84973872492 scopus 로고    scopus 로고
    • Contextual action recognition with R*CNN
    • Gkioxari, G., Girshick, R., Malik, J.: Contextual action recognition with R*CNN. In: ICCV, pp. 1080-1088 (2015)
    • (2015) ICCV , pp. 1080-1088
    • Gkioxari, G.1    Girshick, R.2    Malik, J.3
  • 16
    • 84911400494 scopus 로고    scopus 로고
    • Rich feature hierarchies for accurate object detection and semantic segmentation
    • Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: CVPR, pp. 580-587 (2014)
    • (2014) CVPR , pp. 580-587
    • Girshick, R.1    Donahue, J.2    Darrell, T.3    Malik, J.4
  • 17
    • 84960980241 scopus 로고    scopus 로고
    • Faster R-CNN: Towards real-time object detection with region proposal networks
    • Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: NIPS, pp. 91-99 (2015)
    • (2015) NIPS , pp. 91-99
    • Ren, S.1    He, K.2    Girshick, R.3    Sun, J.4
  • 18
    • 84973864191 scopus 로고    scopus 로고
    • Object detection via a multi-region and semantic segmentation-aware CNN model
    • Gidaris, S., Komodakis, N.: Object detection via a multi-region and semantic segmentation-aware CNN model. In: ICCV, pp. 1134-1142 (2015)
    • (2015) ICCV , pp. 1134-1142
    • Gidaris, S.1    Komodakis, N.2
  • 20
    • 84898819791 scopus 로고    scopus 로고
    • Towards understanding action recognition
    • Jhuang, H., Gall, J., Zuffi, S., Schmid, C., Black, M.: Towards understanding action recognition. In: ICCV, pp. 3192-3199 (2013)
    • (2013) ICCV , pp. 3192-3199
    • Jhuang, H.1    Gall, J.2    Zuffi, S.3    Schmid, C.4    Black, M.5
  • 21
    • 84973879622 scopus 로고    scopus 로고
    • P-CNN: Pose-based CNN features for action recognition
    • Cheron, G., Laptev, I., Schmid, C.: P-CNN: pose-based CNN features for action recognition. In: ICCV, pp. 3218-3226 (2015)
    • (2015) ICCV , pp. 3218-3226
    • Cheron, G.1    Laptev, I.2    Schmid, C.3
  • 22
    • 84937862424 scopus 로고    scopus 로고
    • Two-stream convolutional networks for action recognition in videos
    • Simonyan, K., Zisserman, A.: Two-stream convolutional networks for action recognition in videos. In: NIPS, pp. 568-576 (2014)
    • (2014) NIPS , pp. 568-576
    • Simonyan, K.1    Zisserman, A.2
  • 25
    • 0345414182 scopus 로고    scopus 로고
    • Video Google: A text retrieval approach to object matching in videos
    • Sivic, J., Zisserman, A.: Video Google: a text retrieval approach to object matching in videos. In: ICCV, pp. 1470-1477 (2003)
    • (2003) ICCV , pp. 1470-1477
    • Sivic, J.1    Zisserman, A.2
  • 26
  • 27
    • 84911395416 scopus 로고    scopus 로고
    • DL-SFA: Deeply-learned slow feature analysis for action recognition
    • Sun, L., Jia, K., Chan, T.H., Fang, Y., Wang, G., Yan, S.: DL-SFA: deeply-learned slow feature analysis for action recognition. In: CVPR, pp. 2625-2632 (2014)
    • (2014) CVPR , pp. 2625-2632
    • Sun, L.1    Jia, K.2    Chan, T.H.3    Fang, Y.4    Wang, G.5    Yan, S.6
  • 28
    • 78149348137 scopus 로고    scopus 로고
    • Improving the fisher kernel for large-scale image lassification
    • Daniilidis, K., Maragos, P., Paragios, N. (eds.), Springer, Heidelberg
    • Perronnin, F., Sanchez, J., Mensink, T.: Improving the fisher kernel for large-scale image lassification. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part IV. LNCS, vol. 6314, pp. 143-156. Springer, Heidelberg (2010)
    • (2010) ECCV 2010, Part IV , vol.6314 , pp. 143-156
    • Perronnin, F.1    Sanchez, J.2    Mensink, T.3
  • 29
    • 50649122769 scopus 로고    scopus 로고
    • Retrieving actions in movies
    • Laptev, I., Pérez, P.: Retrieving actions in movies. In: ICCV 2007, pp. 1-8 (2007)
    • (2007) ICCV 2007 , pp. 1-8
    • Laptev, I.1    Pérez, P.2
  • 30
    • 70450164163 scopus 로고    scopus 로고
    • Discriminative subvolume search for efficient action detection
    • Yuan, J., Liu, Z., Wu, Y.: Discriminative subvolume search for efficient action detection. In: CVPR, pp. 2442-2449 (2009)
    • (2009) CVPR , pp. 2442-2449
    • Yuan, J.1    Liu, Z.2    Wu, Y.3
  • 31
    • 51949084792 scopus 로고    scopus 로고
    • Action MACH a spatio-temporal maximum average correlation height filter for action recognition
    • Rodriguez, M.D., Ahmed, J., Shah, M.: Action MACH a spatio-temporal maximum average correlation height filter for action recognition. In: CVPR, pp. 1-8 (2008)
    • (2008) CVPR , pp. 1-8
    • Rodriguez, M.D.1    Ahmed, J.2    Shah, M.3
  • 32
    • 77955992066 scopus 로고    scopus 로고
    • Efficient action spotting based on a spacetime oriented structure representation
    • Derpanis, K.G., Sizintsev, M., Cannons, K., Wildes, R.P.: Efficient action spotting based on a spacetime oriented structure representation. In: CVPR, pp. 1990-1997 (2010)
    • (2010) CVPR , pp. 1990-1997
    • Derpanis, K.G.1    Sizintsev, M.2    Cannons, K.3    Wildes, R.P.4
  • 33
    • 84891607575 scopus 로고    scopus 로고
    • Video event detection: From subvolume localization to spatiotemporal path search
    • Tran, D., Yuan, J., Forsyth, D.: Video event detection: from subvolume localization to spatiotemporal path search. PAMI 36(2), 404-416 (2014)
    • (2014) PAMI , vol.36 , Issue.2 , pp. 404-416
    • Tran, D.1    Yuan, J.2    Forsyth, D.3
  • 34
    • 84876231242 scopus 로고    scopus 로고
    • ImageNet classification with deep convolutional neural networks
    • Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: NIPS, pp. 1097-1105 (2012)
    • (2012) NIPS , pp. 1097-1105
    • Krizhevsky, A.1    Sutskever, I.2    Hinton, G.E.3
  • 35
    • 84939247735 scopus 로고    scopus 로고
    • Spatial pyramid pooling in deep convolutional networks for visual recognition
    • He, K., Zhang, X., Ren, S., Sun, J.: Spatial pyramid pooling in deep convolutional networks for visual recognition. PAMI 37(9), 1904-1916 (2015)
    • (2015) PAMI , vol.37 , Issue.9 , pp. 1904-1916
    • He, K.1    Zhang, X.2    Ren, S.3    Sun, J.4
  • 36
    • 84964588182 scopus 로고    scopus 로고
    • Fast R-CNN
    • Girshick, R.: Fast R-CNN. In: ICCV, pp. 1440-1448 (2015)
    • (2015) ICCV , pp. 1440-1448
    • Girshick, R.1
  • 37
    • 84935113569 scopus 로고
    • Error bounds for convolutional codes and an asymptotically optimum decoding algorithm
    • Viterbi, A.J.: Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. Inf. Theory 13(2), 260-269 (1967)
    • (1967) Inf. Theory , vol.13 , Issue.2 , pp. 260-269
    • Viterbi, A.J.1
  • 40
    • 84959216100 scopus 로고    scopus 로고
    • Convolutional feature masking for joint object and stuff segmentation
    • Dai, J., He, K., Sun, J.: Convolutional feature masking for joint object and stuff segmentation. In: CVPR, pp. 3992-4000 (2015)
    • (2015) CVPR , pp. 3992-4000
    • Dai, J.1    He, K.2    Sun, J.3
  • 41
    • 84976702883 scopus 로고
    • Programming pearls: Algorithm design techniques
    • Bentley, J.: Programming pearls: algorithm design techniques. Commun. ACM 27(9), 865-873 (1984)
    • (1984) Commun. ACM , vol.27 , Issue.9 , pp. 865-873
    • Bentley, J.1
  • 43
    • 35048833329 scopus 로고    scopus 로고
    • High accuracy optical flow estimation based on a theory for warping
    • Pajdla, T., Matas, J.G. (eds.), Springer, Heidelberg
    • Brox, T., Bruhn, A., Papenberg, N., Weickert, J.: High accuracy optical flow estimation based on a theory for warping. In: Pajdla, T., Matas, J.G. (eds.) ECCV 2004. LNCS, vol. 3024, pp. 25-36. Springer, Heidelberg (2004)
    • (2004) ECCV 2004 , vol.3024 , pp. 25-36
    • Brox, T.1    Bruhn, A.2    Papenberg, N.3    Weickert, J.4
  • 44
    • 84959191147 scopus 로고    scopus 로고
    • Fast action proposals for human action detection and search
    • Yu, G., Yuan, J.: Fast action proposals for human action detection and search. In: CVPR, pp. 1302-1311 (2015)
    • (2015) CVPR , pp. 1302-1311
    • Yu, G.1    Yuan, J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.