2013, Pages 263-272

Learning latent spatio-temporal compositional model for human action recognition

Author keywords

Action recognition; And-Or graph; Structural learning; Video understanding

Indexed keywords

ACTION RECOGNITION; AND-OR GRAPH; COMPOSITIONAL MODELING; HUMAN-ACTION RECOGNITION; SPATIO-TEMPORAL STRUCTURES; STRUCTURAL CONFIGURATIONS; STRUCTURAL LEARNING; VIDEO UNDERSTANDING

EID: 84887476984     PISSN: None     EISSN: None     Source Type: Conference Proceeding
DOI: 10.1145/2502081.2502089     Document Type: Conference Paper
Times cited: 32

References (47)
  • 2
    • 77957182145
    • Actions and events in interval temporal logic
    • J. F. Allen and G. Ferguson. Actions and events in interval temporal logic. J. Log. Comput., 4(5):531-579, 1994.
    • (1994) J. Log. Comput. , vol.4 , Issue.5 , pp. 531-579
    • Allen, J.F.1    Ferguson, G.2
  • 3
    • 84867859826
    • Cost-sensitive top-down/bottom-up inference for multiscale activity recognition
    • M. R. Amer, D. Xie, M. Zhao, S. Todorovic, and S. C. Zhu. Cost-sensitive top-down/bottom-up inference for multiscale activity recognition. In ECCV (4), pages 187-200, 2012.
    • (2012) ECCV , Issue.4 , pp. 187-200
    • Amer, M.R.1    Xie, D.2    Zhao, M.3    Todorovic, S.4    Zhu, S.C.5
  • 4
    • 84856661125
    • Learning spatiotemporal graphs of human activities
    • W. Brendel and S. Todorovic. Learning spatiotemporal graphs of human activities. In ICCV, pages 778-785, 2011.
    • (2011) ICCV , pp. 778-785
    • Brendel, W.1    Todorovic, S.2
  • 5
    • 77955989314
    • Cross-dataset action detection
    • L. Cao, Z. Liu, and T. S. Huang. Cross-dataset action detection. In CVPR, pages 1998-2005, 2010.
    • (2010) CVPR , pp. 1998-2005
    • Cao, L.1    Liu, Z.2    Huang, T.S.3
  • 6
    • 80052874112
    • Learning context for collective activity recognition
    • W. Choi, K. Shahid, and S. Savarese. Learning context for collective activity recognition. In CVPR, 2011.
    • (2011) CVPR
    • Choi, W.1    Shahid, K.2    Savarese, S.3
  • 7
    • 33846622081
    • Behavior recognition via sparse spatio-temporal features
    • P. Dollár, V. Rabaud, G. Cottrell, and S. Belongie. Behavior recognition via sparse spatio-temporal features. In VS-PETS, October 2005.
    • (2005) VS-PETS
    • Dollár, P.1    Rabaud, V.2    Cottrell, G.3    Belongie, S.4
  • 8
    • 84871399506
    • Discovering video shot categories by unsupervised stochastic graph partition
    • X. Duan, L. Lin, and H. Chao. Discovering video shot categories by unsupervised stochastic graph partition. IEEE Transactions on Multimedia, 15(1):167-180, 2013.
    • (2013) IEEE Transactions on Multimedia , vol.15 , Issue.1 , pp. 167-180
    • Duan, X.1    Lin, L.2    Chao, H.3
  • 9
    • 33745164643
    • Activity recognition and abnormality detection with the switching hidden semi-markov model
    • T. V. Duong, H. H. Bui, D. Q. Phung, and S. Venkatesh. Activity recognition and abnormality detection with the switching hidden semi-markov model. In CVPR (1), pages 838-845, 2005.
    • (2005) CVPR , Issue.1 , pp. 838-845
    • Duong, T.V.1    Bui, H.H.2    Phung, D.Q.3    Venkatesh, S.4
  • 11
    • 70450202741
    • Understanding videos, constructing plots: Learning a visually grounded storyline model from annotated videos
    • A. Gupta, P. Srinivasan, J. Shi, and L. S. Davis. Understanding videos, constructing plots: Learning a visually grounded storyline model from annotated videos. In CVPR, pages 2012-2019, 2009.
    • (2009) CVPR , pp. 2012-2019
    • Gupta, A.1    Srinivasan, P.2    Shi, J.3    Davis, L.S.4
  • 12
    • 78149348487
    • Object, scene and actions: Combining multiple features for human action recognition
    • N. Ikizler-Cinbis and S. Sclaroff. Object, scene and actions: Combining multiple features for human action recognition. In ECCV (1), pages 494-507, 2010.
    • (2010) ECCV , Issue.1 , pp. 494-507
    • Ikizler-Cinbis, N.1    Sclaroff, S.2
  • 13
    • 84871359352
    • Leveraging high-level and low-level features for multimedia event detection
    • L. Jiang, A. G. Hauptmann, and G. Xiang. Leveraging high-level and low-level features for multimedia event detection. In ACM Multimedia, pages 449-458, 2012.
    • (2012) ACM Multimedia , pp. 449-458
    • Jiang, L.1    Hauptmann, A.G.2    Xiang, G.3
  • 14
    • 84867849524
    • Trajectory-based modeling of human actions with motion reference points
    • Y.-G. Jiang, Q. Dai, X. Xue, W. Liu, and C.-W. Ngo. Trajectory-based modeling of human actions with motion reference points. In ECCV (5), 2012.
    • (2012) ECCV , Issue.5
    • Jiang, Y.-G.1    Dai, Q.2    Xue, X.3    Liu, W.4    Ngo, C.-W.5
  • 15
    • 84898426452
    • A spatio-temporal descriptor based on 3d-gradients
    • A. Kläser, M. Marszalek, and C. Schmid. A spatio-temporal descriptor based on 3d-gradients. In BMVC, 2008.
    • (2008) BMVC
    • Kläser, A.1    Marszalek, M.2    Schmid, C.3
  • 18
    • 70350660670
    • Real-time human action recognition by luminance field trajectory analysis
    • Z. Li, Y. Fu, T. S. Huang, and S. Yan. Real-time human action recognition by luminance field trajectory analysis. In ACM Multimedia, pages 671-676, 2008.
    • (2008) ACM Multimedia , pp. 671-676
    • Li, Z.1    Fu, Y.2    Huang, T.S.3    Yan, S.4
  • 19
    • 56049121516
    • Semantic event representation and recognition using syntactic attribute graph grammar
    • L. Lin, H. Gong, L. Li, and L. Wang. Semantic event representation and recognition using syntactic attribute graph grammar. Pattern Recognition Letters, 30(2):180-186, 2009.
    • (2009) Pattern Recognition Letters , vol.30 , Issue.2 , pp. 180-186
    • Lin, L.1    Gong, H.2    Li, L.3    Wang, L.4
  • 20
    • 84866698552
    • Learning contour-fragment-based shape model with and-or tree representation
    • L. Lin, X. Wang, W. Yang, and J. Lai. Learning contour-fragment-based shape model with and-or tree representation. In CVPR, pages 135-142, 2012.
    • (2012) CVPR , pp. 135-142
    • Lin, L.1    Wang, X.2    Yang, W.3    Lai, J.4
  • 21
    • 62349137210
    • A stochastic graph grammar for compositional object representation and recognition
    • L. Lin, T. Wu, J. Porway, and Z. Xu. A stochastic graph grammar for compositional object representation and recognition. Pattern Recognition, 42(7):1297-1307, 2009.
    • (2009) Pattern Recognition , vol.42 , Issue.7 , pp. 1297-1307
    • Lin, L.1    Wu, T.2    Porway, J.3    Xu, Z.4
  • 22
    • 70450203660
    • Recognizing realistic actions from videos
    • J. Liu, J. Luo, and M. Shah. Recognizing realistic actions from videos. In CVPR, pages 1996-2003, 2009.
    • (2009) CVPR , pp. 1996-2003
    • Liu, J.1    Luo, J.2    Shah, M.3
  • 23
    • 84871363788
    • Knowledge adaptation for ad hoc multimedia event detection with few exemplars
    • Z. Ma, Y. Yang, Y. Cai, N. Sebe, and A. G. Hauptmann. Knowledge adaptation for ad hoc multimedia event detection with few exemplars. In ACM Multimedia, pages 469-478, 2012.
    • (2012) ACM Multimedia , pp. 469-478
    • Ma, Z.1    Yang, Y.2    Cai, Y.3    Sebe, N.4    Hauptmann, A.G.5
  • 25
    • 78149353400
    • Modeling temporal structure of decomposable motion segments for activity classification
    • J. C. Niebles, C.-W. Chen, and F.-F. Li. Modeling temporal structure of decomposable motion segments for activity classification. In ECCV (2), pages 392-405, 2010.
    • (2010) ECCV , Issue.2 , pp. 392-405
    • Niebles, J.C.1    Chen, C.-W.2    Li, F.-F.3
  • 26
    • 84866661728
    • Discovering discriminative action parts from mid-level video representations
    • M. Raptis, I. Kokkinos, and S. Soatto. Discovering discriminative action parts from mid-level video representations. In CVPR, pages 1242-1249, 2012.
    • (2012) CVPR , pp. 1242-1249
    • Raptis, M.1    Kokkinos, I.2    Soatto, S.3
  • 27
    • 77953187842
    • Spatio-temporal relationship match: Video structure comparison for recognition of complex human activities
    • M. S. Ryoo and J. K. Aggarwal. Spatio-temporal relationship match: Video structure comparison for recognition of complex human activities. In ICCV, pages 1593-1600, 2009.
    • (2009) ICCV , pp. 1593-1600
    • Ryoo, M.S.1    Aggarwal, J.K.2
  • 28
    • 84866718894
    • Action bank: A high-level representation of activity in video
    • S. Sadanand and J. J. Corso. Action bank: A high-level representation of activity in video. In CVPR, pages 1234-1241, 2012.
    • (2012) CVPR , pp. 1234-1241
    • Sadanand, S.1    Corso, J.J.2
  • 29
    • 78149294036
    • Modeling the temporal extent of actions
    • S. Satkin and M. Hebert. Modeling the temporal extent of actions. In ECCV (1), pages 536-548, 2010.
    • (2010) ECCV , Issue.1 , pp. 536-548
    • Satkin, S.1    Hebert, M.2
  • 30
    • 37849037402
    • A 3-dimensional sift descriptor and its application to action recognition
    • P. Scovanner, S. Ali, and M. Shah. A 3-dimensional sift descriptor and its application to action recognition. In ACM Multimedia, pages 357-360, 2007.
    • (2007) ACM Multimedia , pp. 357-360
    • Scovanner, P.1    Ali, S.2    Shah, M.3
  • 31
    • 84856636962
    • Unsupervised learning of event and-or grammar and semantics from video
    • Z. Si, M. Pei, B. Yao, and S.-C. Zhu. Unsupervised learning of event and-or grammar and semantics from video. In ICCV, pages 41-48, 2011.
    • (2011) ICCV , pp. 41-48
    • Si, Z.1    Pei, M.2    Yao, B.3    Zhu, S.-C.4
  • 32
    • 84866720929
    • Multi-view latent variable discriminative models for action recognition
    • Y. Song, L.-P. Morency, and R. Davis. Multi-view latent variable discriminative models for action recognition. In CVPR, pages 2120-2127, 2012.
    • (2012) CVPR , pp. 2120-2127
    • Song, Y.1    Morency, L.-P.2    Davis, R.3
  • 33
    • 84887413550
    • Exploring probabilistic localized video representation for human action recognition
    • Y. Song, S. Tang, Y.-T. Zheng, T.-S. Chua, Y. Zhang, and S. Lin. Exploring probabilistic localized video representation for human action recognition. Multimedia Tools Appl., 58(3):663-685, 2012.
    • (2012) Multimedia Tools Appl , vol.58 , Issue.3 , pp. 663-685
    • Song, Y.1    Tang, S.2    Zheng, Y.-T.3    Chua, T.-S.4    Zhang, Y.5    Lin, S.6
  • 34
    • 70450214829
    • Hierarchical spatio-temporal context modeling for action recognition
    • J. Sun, X. Wu, S. Yan, L. F. Cheong, T.-S. Chua, and J. Li. Hierarchical spatio-temporal context modeling for action recognition. In CVPR, pages 2004-2011, 2009.
    • (2009) CVPR , pp. 2004-2011
    • Sun, J.1    Wu, X.2    Yan, S.3    Cheong, L.F.4    Chua, T.-S.5    Li, J.6
  • 35
    • 84866658784
    • Learning latent temporal structure for complex event detection
    • K. Tang, F.-F. Li, and D. Koller. Learning latent temporal structure for complex event detection. In CVPR, pages 1250-1257, 2012.
    • (2012) CVPR , pp. 1250-1257
    • Tang, K.1    Li, F.-F.2    Koller, D.3
  • 36
    • 80052877143
    • Action recognition by dense trajectories
    • H. Wang, A. Kläser, C. Schmid, and C.-L. Liu. Action recognition by dense trajectories. In CVPR, pages 3169-3176, 2011.
    • (2011) CVPR , pp. 3169-3176
    • Wang, H.1    Kläser, A.2    Schmid, C.3    Liu, C.-L.4
  • 37
    • 84866674455
    • Action recognition by exploring data distribution and feature correlation
    • S. Wang, Y. Yang, Z. Ma, X. Li, C. Pang, and A. G. Hauptmann. Action recognition by exploring data distribution and feature correlation. In CVPR, pages 1370-1377, 2012.
    • (2012) CVPR , pp. 1370-1377
    • Wang, S.1    Yang, Y.2    Ma, Z.3    Li, X.4    Pang, C.5    Hauptmann, A.G.6
  • 38
    • 84877770379
    • Dynamical and-or graph learning for object shape modeling and detection
    • X. Wang and L. Lin. Dynamical and-or graph learning for object shape modeling and detection. In NIPS, pages 242-250, 2012.
    • (2012) NIPS , pp. 242-250
    • Wang, X.1    Lin, L.2
  • 39
    • 79957467077
    • Hidden part models for human action recognition: Probabilistic versus max margin
    • Y. Wang and G. Mori. Hidden part models for human action recognition: Probabilistic versus max margin. IEEE Trans. Pattern Anal. Mach. Intell., 33(7):1310-1323, 2011.
    • (2011) IEEE Trans. Pattern Anal. Mach. Intell. , vol.33 , Issue.7 , pp. 1310-1323
    • Wang, Y.1    Mori, G.2
  • 40
    • 77953218032
    • Learning deformable action templates from cluttered videos
    • B. Yao and S. C. Zhu. Learning deformable action templates from cluttered videos. In ICCV, pages 1507-1514, 2009.
    • (2009) ICCV , pp. 1507-1514
    • Yao, B.1    Zhu, S.C.2
  • 41
    • 84455206064
    • Real-time human action search using random forest based hough voting
    • G. Yu, J. Yuan, and Z. Liu. Real-time human action search using random forest based hough voting. In ACM Multimedia, pages 1149-1152, 2011.
    • (2011) ACM Multimedia , pp. 1149-1152
    • Yu, G.1    Yuan, J.2    Liu, Z.3
  • 42
    • 80051863221
    • Discriminative video pattern search for efficient action detection
    • J. Yuan, Z. Liu, and Y. Wu. Discriminative video pattern search for efficient action detection. IEEE Trans. Pattern Anal. Mach. Intell., 33(9):1728-1743, 2011.
    • (2011) IEEE Trans. Pattern Anal. Mach. Intell. , vol.33 , Issue.9 , pp. 1728-1743
    • Yuan, J.1    Liu, Z.2    Wu, Y.3
  • 43
    • 0037686659
    • The concave-convex procedure
    • A. L. Yuille and A. Rangarajan. The concave-convex procedure. Neural Computation, 15(4):915-936, 2003.
    • (2003) Neural Computation , vol.15 , Issue.4 , pp. 915-936
    • Yuille, A.L.1    Rangarajan, A.2
  • 44
    • 84867850268
    • Spatio-temporal phrases for activity recognition
    • Y. Zhang, X. Liu, M.-C. Chang, W. Ge, and T. Chen. Spatio-temporal phrases for activity recognition. In ECCV (3), pages 707-721, 2012.
    • (2012) ECCV , Issue.3 , pp. 707-721
    • Zhang, Y.1    Liu, X.2    Chang, M.-C.3    Ge, W.4    Chen, T.5
  • 46
    • 72449171990
    • Detecting video events based on action recognition in complex scenes using spatio-temporal descriptor
    • G. Zhu, M. Yang, K. Yu, W. Xu, and Y. Gong. Detecting video events based on action recognition in complex scenes using spatio-temporal descriptor. In ACM Multimedia, pages 165-174, 2009.
    • (2009) ACM Multimedia , pp. 165-174
    • Zhu, G.1    Yang, M.2    Yu, K.3    Xu, W.4    Gong, Y.5


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.