메뉴 건너뛰기




Volumn 72, Issue , 2017, Pages 504-516

Human action recognition in RGB-D videos using motion sequence information and deep learning

Author keywords

Deep learning; Extreme learning machines; Motion information; Multi modal action recognition

Indexed keywords

EDUCATION; IMAGE RECOGNITION; LEARNING SYSTEMS; NEURAL NETWORKS; VIDEO STREAMING;

EID: 85023621585     PISSN: 00313203     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.patcog.2017.07.013     Document Type: Article
Times cited : (146)

References (52)
  • 1
    • 84890399408 scopus 로고    scopus 로고
    • Enhanced computer vision with microsoft kinect sensor: a review.
    • Han, J., Shao, L., Xu, D., Shotton, J., Enhanced computer vision with microsoft kinect sensor: a review. IEEE Trans. Cybern. 43:5 (2013), 1318–1334.
    • (2013) IEEE Trans. Cybern. , vol.43 , Issue.5 , pp. 1318-1334
    • Han, J.1    Shao, L.2    Xu, D.3    Shotton, J.4
  • 2
    • 85027480235 scopus 로고    scopus 로고
    • Rgb-d-based action recognition datasets: a survey
    • Zhang, J., Li, W., Ogunbona, P.O., Wang, P., Tang, C., Rgb-d-based action recognition datasets: a survey. CoRR, abs/1601.05511, 2016.
    • (2016) CoRR , vol.abs/1601.05511
    • Zhang, J.1    Li, W.2    Ogunbona, P.O.3    Wang, P.4    Tang, C.5
  • 3
    • 85027481933 scopus 로고    scopus 로고
    • RGBD datasets: past, present and future
    • Firman, M., RGBD datasets: past, present and future. CoRR, abs/1604.00999, 2016.
    • (2016) CoRR , vol.abs/1604.00999
    • Firman, M.1
  • 4
    • 85027483782 scopus 로고    scopus 로고
    • Ntu rgb+d: a large scale dataset for 3d human activity analysis
    • Shahroudy, A., Liu, J., Ng, T., Wang, G., Ntu rgb+d: a large scale dataset for 3d human activity analysis. CoRR, abs/1604.02808, 2016.
    • (2016) CoRR , vol.abs/1604.02808
    • Shahroudy, A.1    Liu, J.2    Ng, T.3    Wang, G.4
  • 5
    • 84956626439 scopus 로고    scopus 로고
    • Utd-mhad: a multimodal dataset for human action recognition utilizing a depth camera and a wearable inertial sensor
    • Chen, C., Jafari, R., Kehtarnavaz, N., Utd-mhad: a multimodal dataset for human action recognition utilizing a depth camera and a wearable inertial sensor. IEEE International Conference on Image Processing (ICIP), 2015, 168–172, 10.1109/ICIP.2015.7350781.
    • (2015) IEEE International Conference on Image Processing (ICIP) , pp. 168-172
    • Chen, C.1    Jafari, R.2    Kehtarnavaz, N.3
  • 6
    • 84885873397 scopus 로고    scopus 로고
    • Rgbd-hudaact: a color-depth video database for human daily activity recognition.
    • A. Fossati J. Gall H. Grabner X. Ren K. Konolige Consumer Depth Cameras for Computer Vision (Advances in Computer Vision and Pattern Recognition) Springer
    • Ni, B., Wang, G., Moulin, P., Rgbd-hudaact: a color-depth video database for human daily activity recognition. Fossati, A., Gall, J., Grabner, H., Ren, X., Konolige, K., (eds.) (Trans.) Consumer Depth Cameras for Computer Vision (Advances in Computer Vision and Pattern Recognition), 2013, Springer, 193–208.
    • (2013) , pp. 193-208
    • Ni, B.1    Wang, G.2    Moulin, P.3
  • 7
    • 84867697201 scopus 로고    scopus 로고
    • Human daily action analysis with multi-view and color-depth data.
    • A. Fusiello V. Murino R. Cucchiara Springer
    • Cheng, Z., Qin, L., Ye, Y., Huang, Q., Tian, Q., Human daily action analysis with multi-view and color-depth data. Fusiello, A., Murino, V., Cucchiara, R., (eds.) ECCV Workshops (2) Lecture Notes in Computer Science, 7584, 2012, Springer, 52–61.
    • (2012) ECCV Workshops (2), Lecture Notes in Computer Science , vol.7584 , pp. 52-61
    • Cheng, Z.1    Qin, L.2    Ye, Y.3    Huang, Q.4    Tian, Q.5
  • 9
    • 84977961716 scopus 로고    scopus 로고
    • Heterogeneous discriminant analysis for cross-view action recognition
    • Sui, W., Wu, X., Feng, Y., Jia, Y., Heterogeneous discriminant analysis for cross-view action recognition. Neurocomputing 191 (2016), 286–295, 10.1016/j.neucom.2016.01.051.
    • (2016) Neurocomputing , vol.191 , pp. 286-295
    • Sui, W.1    Wu, X.2    Feng, Y.3    Jia, Y.4
  • 10
    • 84919627765 scopus 로고    scopus 로고
    • Single/multi-view human action recognition via regularized multi-task learning
    • Part 2.
    • Liu, A.-A., Xu, N., Su, Y.-T., Lin, H., Hao, T., Yang, Z.-X., Single/multi-view human action recognition via regularized multi-task learning. Neurocomputing 151 (2015), 544–553 Part 2.
    • (2015) Neurocomputing , vol.151 , pp. 544-553
    • Liu, A.-A.1    Xu, N.2    Su, Y.-T.3    Lin, H.4    Hao, T.5    Yang, Z.-X.6
  • 11
    • 84908180127 scopus 로고    scopus 로고
    • A survey on multi-view learning
    • Xu, C., Tao, D., Xu, C., A survey on multi-view learning. CoRR, abs/1304.5634, 2013.
    • (2013) CoRR , vol.abs/1304.5634
    • Xu, C.1    Tao, D.2    Xu, C.3
  • 13
    • 85026329063 scopus 로고    scopus 로고
    • Co-occurrence feature learning for skeleton based action recognition using regularized deep lstm networks
    • Zhu, W., Lan, C., Xing, J., Zeng, W., Li, Y., Shen, L., Xie, X., Co-occurrence feature learning for skeleton based action recognition using regularized deep lstm networks. CoRR, abs/1603.07772, 2016.
    • (2016) CoRR , vol.abs/1603.07772
    • Zhu, W.1    Lan, C.2    Xing, J.3    Zeng, W.4    Li, Y.5    Shen, L.6    Xie, X.7
  • 20
    • 84884571014 scopus 로고    scopus 로고
    • Label consistent k-svd: learning a discriminative dictionary for recognition.
    • Jiang, Z., Lin, Z., Davis, L.S., Label consistent k-svd: learning a discriminative dictionary for recognition. IEEE Trans. Pattern Anal. Mach. Intell. 35:11 (2013), 2651–2664.
    • (2013) IEEE Trans. Pattern Anal. Mach. Intell. , vol.35 , Issue.11 , pp. 2651-2664
    • Jiang, Z.1    Lin, Z.2    Davis, L.S.3
  • 21
    • 84887597255 scopus 로고    scopus 로고
    • A heat-map-based algorithm for recognizing group activities in videos.
    • Lin, W., Chu, H., Wu, J., Sheng, B., Chen, Z., A heat-map-based algorithm for recognizing group activities in videos. IEEE Trans. Circuits Syst. Video Technol. 23:11 (2013), 1980–1992.
    • (2013) IEEE Trans. Circuits Syst. Video Technol. , vol.23 , Issue.11 , pp. 1980-1992
    • Lin, W.1    Chu, H.2    Wu, J.3    Sheng, B.4    Chen, Z.5
  • 22
    • 84874537383 scopus 로고    scopus 로고
    • Explicit modeling of human-object interactions in realistic videos
    • Prest, A., Ferrari, V., Schmid, C., Explicit modeling of human-object interactions in realistic videos. IEEE Trans. Pattern Anal. Mach. Intell. 35:4 (2013), 835–848.
    • (2013) IEEE Trans. Pattern Anal. Mach. Intell. , vol.35 , Issue.4 , pp. 835-848
    • Prest, A.1    Ferrari, V.2    Schmid, C.3
  • 23
    • 84870183903 scopus 로고    scopus 로고
    • 3d convolutional neural networks for human action recognition.
    • Ji, S., Xu, W., Yang, M., Yu, K., 3d convolutional neural networks for human action recognition. IEEE Trans. Pattern Anal. Mach. Intell. 35:1 (2013), 221–231.
    • (2013) IEEE Trans. Pattern Anal. Mach. Intell. , vol.35 , Issue.1 , pp. 221-231
    • Ji, S.1    Xu, W.2    Yang, M.3    Yu, K.4
  • 26
    • 0032203257 scopus 로고    scopus 로고
    • Gradient-based learning applied to document recognition
    • Lecun, Y., Bottou, L., Bengio, Y., Haffner, P., Gradient-based learning applied to document recognition. Proc. IEEE 86:11 (1998), 2278–2324, 10.1109/5.726791.
    • (1998) Proc. IEEE , vol.86 , Issue.11 , pp. 2278-2324
    • Lecun, Y.1    Bottou, L.2    Bengio, Y.3    Haffner, P.4
  • 27
    • 84879854889 scopus 로고    scopus 로고
    • Representation learning: a review and new perspectives
    • Bengio, Y., Courville, A., Vincent, P., Representation learning: a review and new perspectives. IEEE Trans. Pattern Anal. Mach. Intell. 35:8 (2013), 1798–1828, 10.1109/TPAMI.2013.50.
    • (2013) IEEE Trans. Pattern Anal. Mach. Intell. , vol.35 , Issue.8 , pp. 1798-1828
    • Bengio, Y.1    Courville, A.2    Vincent, P.3
  • 28
    • 84937508363 scopus 로고    scopus 로고
    • How transferable are features in deep neural networks?
    • Z. Ghahramani M. Welling C. Cortes N. Lawrence K. Weinberger Curran Associates, Inc.
    • Yosinski, J., Clune, J., Bengio, Y., Lipson, H., How transferable are features in deep neural networks?. Ghahramani, Z., Welling, M., Cortes, C., Lawrence, N., Weinberger, K., (eds.) Advances in Neural Information Processing Systems (NIPS) 27, 2014, Curran Associates, Inc., 3320–3328.
    • (2014) Advances in Neural Information Processing Systems (NIPS) 27 , pp. 3320-3328
    • Yosinski, J.1    Clune, J.2    Bengio, Y.3    Lipson, H.4
  • 29
    • 33745903481 scopus 로고    scopus 로고
    • Extreme learning machine: theory and applications
    • Neural Networks Selected Papers from the 7th Brazilian Symposium on Neural Networks (SBRN ’04).
    • Extreme learning machine: theory and applications. Neurocomputing 70:13 (2006), 489–501 Neural Networks Selected Papers from the 7th Brazilian Symposium on Neural Networks (SBRN ’04).
    • (2006) Neurocomputing , vol.70 , Issue.13 , pp. 489-501
  • 33
    • 84887029098 scopus 로고    scopus 로고
    • Recognition of Human Actions from RGB-D Videos Using a Reject Option, Springer Berlin Heidelberg, Berlin, Heidelberg
    • V. Carletti, P. Foggia, G. Percannella, A. Saggese, M. Vento, Recognition of Human Actions from RGB-D Videos Using a Reject Option, Springer Berlin Heidelberg, Berlin, Heidelberg, pp. 436–445. 10.1007/978-3-642-41190-8_47.
    • Carletti, V.1    Foggia, P.2    Percannella, G.3    Saggese, A.4    Vento, M.5
  • 37
    • 84983483288 scopus 로고    scopus 로고
    • Continuous body and hand gesture recognition for natural human-computer interaction
    • Song, Y., Demirdjian, D., Davis, R., Continuous body and hand gesture recognition for natural human-computer interaction. ACM Trans. Interact. Intell. Syst. 2:1 (2012), 5:1–5:28.
    • (2012) ACM Trans. Interact. Intell. Syst. , vol.2 , Issue.1 , pp. 51-5:28
    • Song, Y.1    Demirdjian, D.2    Davis, R.3
  • 39
    • 85027484703 scopus 로고    scopus 로고
    • A deep structured model with radius-margin bound for 3d human activity recognition
    • Lin, L., Wang, K., Zuo, W., Wang, M., Luo, J., Zhang, L., A deep structured model with radius-margin bound for 3d human activity recognition. CoRR, abs/1512.01642, 2015.
    • (2015) CoRR , vol.abs/1512.01642
    • Lin, L.1    Wang, K.2    Zuo, W.3    Wang, M.4    Luo, J.5    Zhang, L.6
  • 40
    • 85030330214 scopus 로고    scopus 로고
    • An end-to-end spatio-temporal attention model for human action recognition from skeleton data
    • Song, S., Lan, C., Xing, J., Zeng, W., Liu, J., An end-to-end spatio-temporal attention model for human action recognition from skeleton data. CoRR, abs/1611.06067, 2016.
    • (2016) CoRR , vol.abs/1611.06067
    • Song, S.1    Lan, C.2    Xing, J.3    Zeng, W.4    Liu, J.5
  • 41
    • 84990059379 scopus 로고    scopus 로고
    • Spatio-Temporal LSTM with Trust Gates for 3D Human Action Recognition, Springer International Publishing, Cham
    • J. Liu, A. Shahroudy, D. Xu, G. Wang, Spatio-Temporal LSTM with Trust Gates for 3D Human Action Recognition, Springer International Publishing, Cham, pp. 816–833. 10.1007/978-3-319-46487-9_50.
    • Liu, J.1    Shahroudy, A.2    Xu, D.3    Wang, G.4
  • 43
    • 85027482209 scopus 로고    scopus 로고
    • Generating local temporal poses from gestures with aligned cluster analysis for human action recognition
    • G.K.L. Tam BMVA Press
    • Edwards, M., Xie, X., Generating local temporal poses from gestures with aligned cluster analysis for human action recognition. Tam, G.K.L., (eds.) UK Computer Vision Student Workshop (BMVW), 2015, BMVA Press, 1.1–1.12.
    • (2015) UK Computer Vision Student Workshop (BMVW) , pp. 1.1-1.12
    • Edwards, M.1    Xie, X.2
  • 46
    • 46749104831 scopus 로고    scopus 로고
    • Similarity by composition
    • B. Schȵlkopf J. Platt T. Hoffman MIT Press Cambridge, MA
    • Boiman, O., Irani, M., Similarity by composition. Schȵlkopf, B., Platt, J., Hoffman, T., (eds.) Advances in Neural Information Processing Systems (NIPS), 2006, MIT Press, Cambridge, MA, 177–184.
    • (2006) Advances in Neural Information Processing Systems (NIPS) , pp. 177-184
    • Boiman, O.1    Irani, M.2
  • 48
    • 69549129405 scopus 로고    scopus 로고
    • Human action recognition by semilatent topic models
    • Wang, Y., Mori, G., Human action recognition by semilatent topic models. IEEE Trans. Pattern Anal. Mach. Intell. (PAMI) 31:10 (2009), 1762–1774.
    • (2009) IEEE Trans. Pattern Anal. Mach. Intell. (PAMI) , vol.31 , Issue.10 , pp. 1762-1774
    • Wang, Y.1    Mori, G.2
  • 49
    • 84859007933 scopus 로고    scopus 로고
    • Extreme learning machine for regression and multiclass classification
    • Huang, G.B., Zhou, H., Ding, X., Zhang, R., Extreme learning machine for regression and multiclass classification. IEEE Trans. Syst. Man Cybern.Part B (Cybernetics) 42:2 (2012), 513–529, 10.1109/TSMCB.2011.2168604.
    • (2012) IEEE Trans. Syst. Man Cybern.Part B (Cybernetics) , vol.42 , Issue.2 , pp. 513-529
    • Huang, G.B.1    Zhou, H.2    Ding, X.3    Zhang, R.4
  • 50
    • 57249084011 scopus 로고    scopus 로고
    • Visualizing data using t-sne
    • Maaten, L.v.d., Hinton, G., Visualizing data using t-sne. J. Mach. Learn. Res. 9:November (2008), 2579–2605.
    • (2008) J. Mach. Learn. Res. , vol.9 , Issue.November , pp. 2579-2605
    • Maaten, L.V.D.1    Hinton, G.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.