메뉴 건너뛰기




Volumn 118, Issue 2, 2016, Pages 256-273

A Deep Structured Model with Radius–Margin Bound for 3D Human Activity Recognition

Author keywords

Deep learning; Human action and activity; RGB depth analysis; Structured model

Indexed keywords

ITERATIVE METHODS; LEARNING ALGORITHMS; NETWORK LAYERS; NEURAL NETWORKS; PATTERN RECOGNITION;

EID: 84952023029     PISSN: 09205691     EISSN: 15731405     Source Type: Journal    
DOI: 10.1007/s11263-015-0876-z     Document Type: Article
Times cited : (90)

References (57)
  • 1
    • 84866674206 scopus 로고    scopus 로고
    • Sum-product networks for modeling activities with stochastic structure
    • Amer, M. R., & Todorovic, S. (2012). Sum-product networks for modeling activities with stochastic structure. In CVPR, pp 1314–1321
    • (2012) In CVPR , pp. 1314-1321
    • Amer, M.R.1    Todorovic, S.2
  • 2
    • 85083953811 scopus 로고    scopus 로고
    • Bayer, J., Osendorfer, C., Korhammer, D., Chen, N., Urban, S., & van der Smagt, P. In Proc. ICLR
    • Bayer, J., Osendorfer, C., Korhammer, D., Chen, N., Urban, S., & van der Smagt, P. (2014). On fast dropout and its applicability to recurrent networks. In Proc. ICLR
    • (2014) On fast dropout and its applicability to recurrent networks.
  • 3
    • 84856661125 scopus 로고    scopus 로고
    • Learning spatiotemporal graphs of human activities. In: ICCV
    • Brendel, W., & Todorovic, S. (2011). Learning spatiotemporal graphs of human activities. In: ICCV, pp 778–785
    • (2011) pp 778–785
    • Brendel, W.1    Todorovic, S.2
  • 4
    • 0036161011 scopus 로고    scopus 로고
    • Choosing multiple parameters for support vector machines
    • Chapelle, O., Vapnik, V., Bousquet, O., & Mukherjee, S. (2002). Choosing multiple parameters for support vector machines. Machine Learning, 46(1–3), 131–159.
    • (2002) Machine Learning , vol.46 , Issue.1-3 , pp. 131-159
    • Chapelle, O.1    Vapnik, V.2    Bousquet, O.3    Mukherjee, S.4
  • 6
    • 84455205109 scopus 로고    scopus 로고
    • Human group activity analysis with fusion of motion and appearance information
    • Cheng, Z., Qin, L., Huang, Q., Jiang, S., Yan, S., & Tian, Q. (2011). Human group activity analysis with fusion of motion and appearance information. In ACM Multimedia, pp 1401–1404
    • (2011) In ACM Multimedia , pp. 1401-1404
    • Cheng, Z.1    Qin, L.2    Huang, Q.3    Jiang, S.4    Yan, S.5    Tian, Q.6
  • 7
    • 0141430928 scopus 로고    scopus 로고
    • Radius margin bounds for support vector machines with the rbf kernel
    • Chung, K. M., Kao, W. C., Sun, C. L., Wang, L. L., & Lin, C. J. (2003). Radius margin bounds for support vector machines with the rbf kernel. Neural Computation, 15(11), 2643–2681.
    • (2003) Neural Computation , vol.15 , Issue.11 , pp. 2643-2681
    • Chung, K.M.1    Kao, W.C.2    Sun, C.L.3    Wang, L.L.4    Lin, C.J.5
  • 8
    • 84897556574 scopus 로고    scopus 로고
    • Convex formulations of radius-margin based support vector machines
    • Do, H., & Kalousis, A. (2013). Convex formulations of radius-margin based support vector machines. In: ICML
    • (2013) In: ICML
    • Do, H.1    Kalousis, A.2
  • 12
    • 85119025686 scopus 로고    scopus 로고
    • Girshick, R., Donahue, J., Darrell, T., & Malik, J. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
    • Girshick, R., Donahue, J., Darrell, T., & Malik, J. (2014). Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
    • (2014) Rich feature hierarchies for accurate object detection and semantic segmentation.
  • 14
    • 33746600649 scopus 로고    scopus 로고
    • Reducing the dimensionality of data with neural networks
    • Hinton, G. E., & Salakhutdinov, R. R. (2006). Reducing the dimensionality of data with neural networks. Science, 313(5786), 504–507.
    • (2006) Science , vol.313 , Issue.5786 , pp. 504-507
    • Hinton, G.E.1    Salakhutdinov, R.R.2
  • 15
    • 33845597145 scopus 로고    scopus 로고
    • Large-scale learning with svm and convolutional for generic object categorization
    • Huang, F. J., & LeCun, Y. (2006). Large-scale learning with svm and convolutional for generic object categorization. In CVPR, pp 284–291
    • (2006) In CVPR , pp. 284-291
    • Huang, F.J.1    LeCun, Y.2
  • 16
    • 84870183903 scopus 로고    scopus 로고
    • 3d convolutional neural networks for human action recognition
    • Ji, S., Xu, W., Yang, M., & Yu, K. (2013). 3d convolutional neural networks for human action recognition. IEEE Trans Pattern Anal Mach Intell, 35(1), 221–231.
    • (2013) IEEE Trans Pattern Anal Mach Intell , vol.35 , Issue.1 , pp. 221-231
    • Ji, S.1    Xu, W.2    Yang, M.3    Yu, K.4
  • 17
    • 84957580317 scopus 로고    scopus 로고
    • Karpathy, A., Toderici, G., Shetty, S., Leung, T., Sukthankar, R., & Fei-Fei, L. In CVPR
    • Karpathy, A., Toderici, G., Shetty, S., Leung, T., Sukthankar, R., & Fei-Fei, L. (2014). Large-scale video classification with convolutional neural networks. In CVPR
    • (2014) Large-scale video classification with convolutional neural networks.
  • 19
    • 84897508000 scopus 로고    scopus 로고
    • Learning spatio-temporal structure from rgb-d videos for human activity detection and anticipation
    • Koppula, H. S., & Saxena, A. (2013). Learning spatio-temporal structure from rgb-d videos for human activity detection and anticipation. In ICML pp 792–800
    • (2013) In ICML , pp. 792-800
    • Koppula, H.S.1    Saxena, A.2
  • 21
    • 84869785889 scopus 로고
    • LeCun, Y., Boser, B., Denker, J., Henderson, D., Howard, R., Hubbard, W., & Jackel ea, L. D. In Advances in neural information processing systems
    • LeCun, Y., Boser, B., Denker, J., Henderson, D., Howard, R., Hubbard, W., & Jackel ea, L. D. (1990). Handwritten digit recognition with a back-propagation network. In Advances in neural information processing systems
    • (1990) Handwritten digit recognition with a back-propagation network.
  • 22
    • 84887476984 scopus 로고    scopus 로고
    • Learning latent spatio-temporal compositional model for human action recognition
    • Liang, X., Lin, L., & Cao, L. (2013). Learning latent spatio-temporal compositional model for human action recognition. In ACM Multimedia, pp 263–272
    • (2013) In ACM Multimedia , pp. 263-272
    • Liang, X.1    Lin, L.2    Cao, L.3
  • 24
    • 62349137210 scopus 로고    scopus 로고
    • A stochastic graph grammar for compositional object representation and recognition
    • Lin, L., Wu, T., Porway, J., & Xu, Z. (2009). A stochastic graph grammar for compositional object representation and recognition. Pattern Recognition, 42(7), 1297–1307.
    • (2009) Pattern Recognition , vol.42 , Issue.7 , pp. 1297-1307
    • Lin, L.1    Wu, T.2    Porway, J.3    Xu, Z.4
  • 26
    • 84898796864 scopus 로고    scopus 로고
    • A deep sum-product architecture for robust facial attributes analysis
    • Luo, P., Wang, X., & Tang, X. (2013a). A deep sum-product architecture for robust facial attributes analysis. In ICCV, pp 2864–2871
    • (2013) In ICCV , pp. 2864-2871
    • Luo, P.1    Wang, X.2    Tang, X.3
  • 27
    • 84898770979 scopus 로고    scopus 로고
    • Pedestrian parsing via deep decompositional neural network
    • Luo, P., Wang, X., & Tang, X. (2013b). Pedestrian parsing via deep decompositional neural network. In ICCV, pp 2648–2655
    • (2013) In ICCV , pp. 2648-2655
    • Luo, P.1    Wang, X.2    Tang, X.3
  • 28
    • 84977441029 scopus 로고    scopus 로고
    • Rgbd-hudaact: A color-depth video database for human daily activity recognition. Consumer Depth Cameras for Computer Vision, Lecture Notes in Computer Science (pp. 193–208)
    • Ni, B., Wang, G., & Moulin, P. (2013a). Rgbd-hudaact: A color-depth video database for human daily activity recognition. Consumer Depth Cameras for Computer Vision, Lecture Notes in Computer Science (pp. 193–208). Springer.
    • (2013) Springer
    • Ni, B.1    Wang, G.2    Moulin, P.3
  • 30
    • 84887375927 scopus 로고    scopus 로고
    • Hon4d: Histogram of oriented 4d normals for activity recognition from depth sequences
    • Oreifej, O., & Liu, Z. (2013). Hon4d: Histogram of oriented 4d normals for activity recognition from depth sequences. In CVPR, pp 716–723
    • (2013) In CVPR , pp. 716-723
    • Oreifej, O.1    Liu, Z.2
  • 31
    • 84866717619 scopus 로고    scopus 로고
    • A combined pose, object, and feature model for action understanding
    • Packer, B., Saenko, K., & Koller, D. (2012). A combined pose, object, and feature model for action understanding. In CVPR, pp 1378–1385
    • (2012) In CVPR , pp. 1378-1385
    • Packer, B.1    Saenko, K.2    Koller, D.3
  • 32
    • 84856646751 scopus 로고    scopus 로고
    • Parsing video events with goal inference and intent prediction
    • Pei, M., Jia, Y., & Zhu, S. (2011). Parsing video events with goal inference and intent prediction. In ICCV, pp 487–494
    • (2011) In ICCV , pp. 487-494
    • Pei, M.1    Jia, Y.2    Zhu, S.3
  • 33
    • 84866718894 scopus 로고    scopus 로고
    • Action bank: A high-level representation of activity in video
    • Sadanand, S., & Corso, J. J. (2012). Action bank: A high-level representation of activity in video. In CVPR, pp 1234–1241
    • (2012) In CVPR , pp. 1234-1241
    • Sadanand, S.1    Corso, J.J.2
  • 34
    • 37849037402 scopus 로고    scopus 로고
    • A 3-dimensional sift descriptor and its application to action recognition
    • Scovanner, P., Ali, S., & Shah, M. (2007) A 3-dimensional sift descriptor and its application to action recognition. In ACM Multimedia, pp 357–360
    • (2007) In ACM Multimedia , pp. 357-360
    • Scovanner, P.1    Ali, S.2    Shah, M.3
  • 37
    • 84864487638 scopus 로고    scopus 로고
    • Unstructured human activity detection from rgbd images
    • Sung, J., Ponce, C., Selman, B., & Saxena, A. (2012) Unstructured human activity detection from rgbd images. In ICRA, pp 842–849
    • (2012) In ICRA , pp. 842-849
    • Sung, J.1    Ponce, C.2    Selman, B.3    Saxena, A.4
  • 38
    • 84866658784 scopus 로고    scopus 로고
    • Learning latent temporal structure for complex event detection
    • Tang, K., Fei-Fei, L., & Koller, D. (2012). Learning latent temporal structure for complex event detection. In CVPR, pp 1250–1257
    • (2012) In CVPR , pp. 1250-1257
    • Tang, K.1    Fei-Fei, L.2    Koller, D.3
  • 39
    • 84901405262 scopus 로고    scopus 로고
    • Joint video and text parsing for understanding events and answering queries
    • Tu, K., Meng, M., Lee, M. W., Choi, T., & Zhu, S. (2014). Joint video and text parsing for understanding events and answering queries. IEEE Transactions on Multimedia, 21(2), 42–70.
    • (2014) IEEE Transactions on Multimedia , vol.21 , Issue.2 , pp. 42-70
    • Tu, K.1    Meng, M.2    Lee, M.W.3    Choi, T.4    Zhu, S.5
  • 41
    • 84944069490 scopus 로고    scopus 로고
    • Venugopalan, S., Xu, H., Donahue, J., Rohrbach, M., Mooney, R., & Saenko, K. In North American Chapter of the Association for Computational Linguistics
    • Venugopalan, S., Xu, H., Donahue, J., Rohrbach, M., Mooney, R., & Saenko, K. (2015). Translating videos to natural language using deep recurrent neural networks. In North American Chapter of the Association for Computational Linguistics
    • (2015) Translating videos to natural language using deep recurrent neural networks.
  • 42
    • 84866672692 scopus 로고    scopus 로고
    • Mining actionlet ensemble for action recognition with depth cameras
    • Wang, J., Liu, Z., Wu, Y., & Yuan, J. (2012). Mining actionlet ensemble for action recognition with depth cameras. In CVPR, pp 1290–1297
    • (2012) In CVPR , pp. 1290-1297
    • Wang, J.1    Liu, Z.2    Wu, Y.3    Yuan, J.4
  • 43
    • 79957467077 scopus 로고    scopus 로고
    • Hidden part models for human action recognition: Probabilistic vs. max-margin
    • Wang, Y., & Mori, G. (2011). Hidden part models for human action recognition: Probabilistic vs. max-margin. IEEE Trans Pattern Anal Mach Intell, 33(7), 1310–1323.
    • (2011) IEEE Trans Pattern Anal Mach Intell , vol.33 , Issue.7 , pp. 1310-1323
    • Wang, Y.1    Mori, G.2
  • 45
    • 84887346790 scopus 로고    scopus 로고
    • An approach to pose-based action recognition
    • Wang, C., Wang, Y., & Yuille, A. L. (2013). An approach to pose-based action recognition. In CVPR, pp 915–922
    • (2013) In CVPR , pp. 915-922
    • Wang, C.1    Wang, Y.2    Yuille, A.L.3
  • 46
    • 84898794902 scopus 로고    scopus 로고
    • Learning maximum margin temporal warping for action recognition
    • Wang, J., & Wu, Y. (2013) Learning maximum margin temporal warping for action recognition. In ICCV, pp 2688–2695
    • (2013) In ICCV , pp. 2688-2695
    • Wang, J.1    Wu, Y.2
  • 47
    • 0002210265 scopus 로고
    • On the convergence properties of the em algorithm
    • Wu, C. F. J. (1983). On the convergence properties of the em algorithm. Annals of Statistics, 11(1), 95–103.
    • (1983) Annals of Statistics , vol.11 , Issue.1 , pp. 95-103
    • Wu, C.F.J.1
  • 48
    • 84887419657 scopus 로고    scopus 로고
    • Online multimodal deep similarity learning with application to image retrieval
    • Wu, P., Hoi, S., Xia, H., Zhao, P., Wang, D., & Miao, C. (2013) Online multimodal deep similarity learning with application to image retrieval. In ACM Mutilmedia, pp 153–162
    • (2013) In ACM Mutilmedia , pp. 153-162
    • Wu, P.1    Hoi, S.2    Xia, H.3    Zhao, P.4    Wang, D.5    Miao, C.6
  • 49
    • 84887324355 scopus 로고    scopus 로고
    • Spatio-temporal depth cuboid similarity feature for activity recognition using depth camera
    • Xia, L., & Aggarwal, J. (2013) Spatio-temporal depth cuboid similarity feature for activity recognition using depth camera. In CVPR, pp 2834–2841
    • (2013) In CVPR , pp. 2834-2841
    • Xia, L.1    Aggarwal, J.2
  • 50
    • 84865033379 scopus 로고    scopus 로고
    • View invariant human action recognition using histograms of 3d joints. In Computer Vision and Pattern Recognition Workshops (CVPRW), 2012 IEEE Computer Society Conference on
    • Xia, L., Chen, C., & Aggarwal, J. (2012a). View invariant human action recognition using histograms of 3d joints. In Computer Vision and Pattern Recognition Workshops (CVPRW), 2012 IEEE Computer Society Conference on, IEEE, pp 20–27
    • (2012) IEEE , pp. 20-27
    • Xia, L.1    Chen, C.2    Aggarwal, J.3
  • 51
    • 84865033379 scopus 로고    scopus 로고
    • View invariant human action recognition using histograms of 3d joints
    • Xia, L., Chen, C., & Aggarwal, J. K. (2012b). View invariant human action recognition using histograms of 3d joints. In CVPRW, pp 20–27
    • (2012) In CVPRW , pp. 20-27
    • Xia, L.1    Chen, C.2    Aggarwal, J.K.3
  • 52
    • 84871394796 scopus 로고    scopus 로고
    • Recognizing actions using depth motion maps-based histograms of oriented gradients
    • Yang, X., Zhang, C., & Tian, Y. (2012). Recognizing actions using depth motion maps-based histograms of oriented gradients. In ACM Multimedia, pp 1057–1060
    • (2012) In ACM Multimedia , pp. 1057-1060
    • Yang, X.1    Zhang, C.2    Tian, Y.3
  • 53
    • 80052889296 scopus 로고    scopus 로고
    • Learning image representations from the pixel level via hierarchical sparse coding
    • Yu, K., Lin, Y., & Lafferty, J. (2011). Learning image representations from the pixel level via hierarchical sparse coding. In CVPR, pp 1713–1720
    • (2011) In CVPR , pp. 1713-1720
    • Yu, K.1    Lin, Y.2    Lafferty, J.3
  • 54
    • 84865015840 scopus 로고    scopus 로고
    • Yun, K., Honorio, J., Chattopadhyay, D., Berg, T. L., & Samaras, D.In Computer Vision and Pattern Recognition Workshops (CVPRW), 2012 IEEE Computer Society Conference on, IEEE
    • Yun, K., Honorio, J., Chattopadhyay, D., Berg, T. L., & Samaras, D. (2012) Two-person interaction detection using body-pose features and multiple instance learning. In Computer Vision and Pattern Recognition Workshops (CVPRW), 2012 IEEE Computer Society Conference on, IEEE
    • (2012) Two-person interaction detection using body-pose features and multiple instance learning.
  • 55
    • 84887474318 scopus 로고    scopus 로고
    • Exploring discriminative pose sub-patterns for effective action classification. In: ACM Multimedia
    • Zhao, X., Liu, Y., & Fu, Y. (2013). Exploring discriminative pose sub-patterns for effective action classification. In: ACM Multimedia, pp 273–282
    • (2013) pp 273–282
    • Zhao, X.1    Liu, Y.2    Fu, Y.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.