Volume , Issue , 2014, Pages 97-106

3D Human activity recognition with reconfigurable convolutional neural networks

Author keywords

3D activity; Deep learning; Structured model; Video parsing

Indexed keywords

BACKPROPAGATION; CONVOLUTION; DEEP LEARNING; IMAGE SEGMENTATION; ITERATIVE METHODS; PATTERN RECOGNITION;

EID: 84913584483     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2647868.2654912     Document Type: Conference Paper
Times cited: 88

References (41)
  • 1
    • 84866674206
    • Sum-product networks for modeling activities with stochastic structure
    • M. R. Amer and S. Todorovic. Sum-product networks for modeling activities with stochastic structure. In CVPR, pages 1314-1321, 2012.
    • (2012) CVPR , pp. 1314-1321
    • Amer, M.R.1    Todorovic, S.2
  • 2
    • 84856661125
    • Learning spatiotemporal graphs of human activities
    • W. Brendel and S. Todorovic. Learning spatiotemporal graphs of human activities. In ICCV, pages 778-785, 2011.
    • (2011) ICCV , pp. 778-785
    • Brendel, W.1    Todorovic, S.2
  • 4
    • 84455205109
    • Human group activity analysis with fusion of motion and appearance information
    • Z. Cheng, L. Qin, Q. Huang, S. Jiang, S. Yan, and Q. Tian. Human group activity analysis with fusion of motion and appearance information. In ACM Multimedia, pages 1401-1404, 2011.
    • (2011) ACM Multimedia , pp. 1401-1404
    • Cheng, Z.1    Qin, L.2    Huang, Q.3    Jiang, S.4    Yan, S.5    Tian, Q.6
  • 5
    • 84911400494
    • Rich feature hierarchies for accurate object detection and semantic segmentation
    • R. Girshick, J. Donahue, T. Darrell, and J. Malik. Rich feature hierarchies for accurate object detection and semantic segmentation. In CVPR, 2014.
    • (2014) CVPR
    • Girshick, R.1    Donahue, J.2    Darrell, T.3    Malik, J.4
  • 7
    • 33746600649
    • Reducing the dimensionality of data with neural networks
    • G. E. Hinton and R. R. Salakhutdinov. Reducing the dimensionality of data with neural networks. Science, 313(5786):504-507, 2006.
    • (2006) Science , vol.313 , Issue.5786 , pp. 504-507
    • Hinton, G.E.1    Salakhutdinov, R.R.2
  • 8
    • 84870183903
    • 3d convolutional neural networks for human action recognition
    • S. Ji, W. Xu, M. Yang, and K. Yu. 3d convolutional neural networks for human action recognition. IEEE Trans. Pattern Anal. Mach. Intell., 35(1):221-231, 2013.
    • (2013) IEEE Trans. Pattern Anal. Mach. Intell. , vol.35 , Issue.1 , pp. 221-231
    • Ji, S.1    Xu, W.2    Yang, M.3    Yu, K.4
  • 10
    • 84926464781
    • Learning spatio-temporal structure from rgb-d videos for human activity detection and anticipation
    • H. S. Koppula and A. Saxena. Learning spatio-temporal structure from rgb-d videos for human activity detection and anticipation. In ICML, pages 792-800, 2013.
    • (2013) ICML , pp. 792-800
    • Koppula, H.S.1    Saxena, A.2
  • 12
    • 80052874098
    • Learning hierarchical invariant spatio-temporal features for action recognition with independent subspace analysis
    • Q. V. Le, W. Y. Zou, S. Y. Yeung, and A. Y. Ng. Learning hierarchical invariant spatio-temporal features for action recognition with independent subspace analysis. In CVPR, pages 3361-3368, 2011.
    • (2011) CVPR , pp. 3361-3368
    • Le, Q.V.1    Zou, W.Y.2    Yeung, S.Y.3    Ng, A.Y.4
  • 14
    • 84887476984
    • Learning latent spatio-temporal compositional model for human action recognition
    • X. Liang, L. Lin, and L. Cao. Learning latent spatio-temporal compositional model for human action recognition. In ACM Multimedia, pages 263-272, 2013.
    • (2013) ACM Multimedia , pp. 263-272
    • Liang, X.1    Lin, L.2    Cao, L.3
  • 15
    • 56049121516
    • Semantic event representation and recognition using syntactic attribute graph grammar
    • L. Lin, H. Gong, L. Li, and L. Wang. Semantic event representation and recognition using syntactic attribute graph grammar. Pattern Recognition Letters, 30(2):180-186, 2009.
    • (2009) Pattern Recognition Letters , vol.30 , Issue.2 , pp. 180-186
    • Lin, L.1    Gong, H.2    Li, L.3    Wang, L.4
  • 16
    • 62349137210
    • A stochastic graph grammar for compositional object representation and recognition
    • L. Lin, T. Wu, J. Porway, and Z. Xu. A stochastic graph grammar for compositional object representation and recognition. Pattern Recognition, 42(7):1297-1307, 2009.
    • (2009) Pattern Recognition , vol.42 , Issue.7 , pp. 1297-1307
    • Lin, L.1    Wu, T.2    Porway, J.3    Xu, Z.4
  • 17
    • 84898796864
    • A deep sum-product architecture for robust facial attributes analysis
    • P. Luo, X. Wang, and X. Tang. A deep sum-product architecture for robust facial attributes analysis. In ICCV, pages 2864-2871, 2013.
    • (2013) ICCV , pp. 2864-2871
    • Luo, P.1    Wang, X.2    Tang, X.3
  • 18
    • 84898770979
    • Pedestrian parsing via deep decompositional neural network
    • P. Luo, X. Wang, and X. Tang. Pedestrian parsing via deep decompositional neural network. In ICCV, pages 2648-2655, 2013.
    • (2013) ICCV , pp. 2648-2655
    • Luo, P.1    Wang, X.2    Tang, X.3
  • 20
    • 84887375927
    • Hon4d: Histogram of oriented 4d normals for activity recognition from depth sequences
    • O. Oreifej and Z. Liu. Hon4d: Histogram of oriented 4d normals for activity recognition from depth sequences. In CVPR, pages 716-723, 2013.
    • (2013) CVPR , pp. 716-723
    • Oreifej, O.1    Liu, Z.2
  • 21
    • 84866717619
    • A combined pose, object, and feature model for action understanding
    • B. Packer, K. Saenko, and D. Koller. A combined pose, object, and feature model for action understanding. In CVPR, pages 1378-1385, 2012.
    • (2012) CVPR , pp. 1378-1385
    • Packer, B.1    Saenko, K.2    Koller, D.3
  • 22
    • 84856646751
    • Parsing video events with goal inference and intent prediction
    • M. Pei, Y. Jia, and S. Zhu. Parsing video events with goal inference and intent prediction. In ICCV, pages 487-494, 2011.
    • (2011) ICCV , pp. 487-494
    • Pei, M.1    Jia, Y.2    Zhu, S.3
  • 23
    • 84866718894
    • Action bank: A high-level representation of activity in video
    • S. Sadanand and J. J. Corso. Action bank: A high-level representation of activity in video. In CVPR, pages 1234-1241, 2012.
    • (2012) CVPR , pp. 1234-1241
    • Sadanand, S.1    Corso, J.J.2
  • 24
    • 37849037402
    • A 3-dimensional sift descriptor and its application to action recognition
    • P. Scovanner, S. Ali, and M. Shah. A 3-dimensional sift descriptor and its application to action recognition. In ACM Multimedia, pages 357-360, 2007.
    • (2007) ACM Multimedia , pp. 357-360
    • Scovanner, P.1    Ali, S.2    Shah, M.3
  • 25
    • 84864487638
    • Unstructured human activity detection from rgbd images
    • J. Sung, C. Ponce, B. Selman, and A. Saxena. Unstructured human activity detection from rgbd images. In ICRA, pages 842-849, 2012.
    • (2012) ICRA , pp. 842-849
    • Sung, J.1    Ponce, C.2    Selman, B.3    Saxena, A.4
  • 26
    • 84866658784
    • Learning latent temporal structure for complex event detection
    • K. Tang, L. Fei-Fei, and D. Koller. Learning latent temporal structure for complex event detection. In CVPR, pages 1250-1257, 2012.
    • (2012) CVPR , pp. 1250-1257
    • Tang, K.1    Fei-Fei, L.2    Koller, D.3
  • 27
    • 78149336740
    • Convolutional learning of spatio-temporal features
    • G. W. Taylor, R. Fergus, Y. Le Cun, and C. Bregler. Convolutional learning of spatio-temporal features. In ECCV, pages 140-153, 2010.
    • (2010) ECCV , pp. 140-153
    • Taylor, G.W.1    Fergus, R.2    Le Cun, Y.3    Bregler, C.4
  • 28
    • 84901405262
    • Joint video and text parsing for understanding events and answering queries
    • K. Tu, M. Meng, M. W. Lee, T. Choi, and S. Zhu. Joint video and text parsing for understanding events and answering queries. IEEE MultiMedia, 21(2):42-70, 2014.
    • (2014) IEEE MultiMedia , vol.21 , Issue.2 , pp. 42-70
    • Tu, K.1    Meng, M.2    Lee, M.W.3    Choi, T.4    Zhu, S.5
  • 29
    • 84887346790
    • An approach to pose-based action recognition
    • C. Wang, Y. Wang, and A. L. Yuille. An approach to pose-based action recognition. In CVPR, pages 915-922, 2013.
    • (2013) CVPR , pp. 915-922
    • Wang, C.1    Wang, Y.2    Yuille, A.L.3
  • 30
    • 84866672692
    • Mining actionlet ensemble for action recognition with depth cameras
    • J. Wang, Z. Liu, Y. Wu, and J. Yuan. Mining actionlet ensemble for action recognition with depth cameras. In CVPR, pages 1290-1297, 2012.
    • (2012) CVPR , pp. 1290-1297
    • Wang, J.1    Liu, Z.2    Wu, Y.3    Yuan, J.4
  • 31
    • 84898794902
    • Learning maximum margin temporal warping for action recognition
    • J. Wang and Y. Wu. Learning maximum margin temporal warping for action recognition. In ICCV, pages 2688-2695, 2013.
    • (2013) ICCV , pp. 2688-2695
    • Wang, J.1    Wu, Y.2
  • 32
    • 84887381206
    • Incorporating structural alternatives and sharing into hierarchy for multiclass object recognition and detection
    • X. Wang, L. Lin, L. Huang, and S. Yan. Incorporating structural alternatives and sharing into hierarchy for multiclass object recognition and detection. In CVPR, pages 3334-3341, 2013.
    • (2013) CVPR , pp. 3334-3341
    • Wang, X.1    Lin, L.2    Huang, L.3    Yan, S.4
  • 33
    • 79957467077
    • Hidden part models for human action recognition: Probabilistic vs. max-margin
    • Y. Wang and G. Mori. Hidden part models for human action recognition: Probabilistic vs. max-margin. IEEE Trans. Pattern Anal. Mach. Intell., 33(7):1310-1323, 2011.
    • (2011) IEEE Trans. Pattern Anal. Mach. Intell. , vol.33 , Issue.7 , pp. 1310-1323
    • Wang, Y.1    Mori, G.2
  • 34
    • 84887419657
    • Online multimodal deep similarity learning with application to image retrieval
    • P. Wu, S. Hoi, H. Xia, P. Zhao, D. Wang, and C. Miao. Online multimodal deep similarity learning with application to image retrieval. In ACM Multimedia, pages 153-162, 2013.
    • (2013) ACM Multimedia , pp. 153-162
    • Wu, P.1    Hoi, S.2    Xia, H.3    Zhao, P.4    Wang, D.5    Miao, C.6
  • 35
    • 84887324355
    • Spatio-temporal depth cuboid similarity feature for activity recognition using depth camera
    • L. Xia and J. Aggarwal. Spatio-temporal depth cuboid similarity feature for activity recognition using depth camera. In CVPR, pages 2834-2841, 2013.
    • (2013) CVPR , pp. 2834-2841
    • Xia, L.1    Aggarwal, J.2
  • 36
    • 84865033379
    • View invariant human action recognition using histograms of 3d joints
    • L. Xia, C. Chen, and J. K. Aggarwal. View invariant human action recognition using histograms of 3d joints. In CVPRW, pages 20-27, 2012.
    • (2012) CVPRW , pp. 20-27
    • Xia, L.1    Chen, C.2    Aggarwal, J.K.3
  • 37
    • 84871394796
    • Recognizing actions using depth motion maps-based histograms of oriented gradients
    • X. Yang, C. Zhang, and Y. Tian. Recognizing actions using depth motion maps-based histograms of oriented gradients. In ACM Multimedia, pages 1057-1060, 2012.
    • (2012) ACM Multimedia , pp. 1057-1060
    • Yang, X.1    Zhang, C.2    Tian, Y.3
  • 38
    • 80052889296
    • Learning image representations from the pixel level via hierarchical sparse coding
    • K. Yu, Y. Lin, and J. Lafferty. Learning image representations from the pixel level via hierarchical sparse coding. In CVPR, pages 1713-1720, 2011.
    • (2011) CVPR , pp. 1713-1720
    • Yu, K.1    Lin, Y.2    Lafferty, J.3
  • 39
    • 84887474318
    • Exploring discriminative pose sub-patterns for effective action classification
    • X. Zhao, Y. Liu, and Y. Fu. Exploring discriminative pose sub-patterns for effective action classification. In ACM Multimedia, pages 273-282, 2013.
    • (2013) ACM Multimedia , pp. 273-282
    • Zhao, X.1    Liu, Y.2    Fu, Y.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.