



Volume 43, Issue 4, 2013, Pages 875-885

Realistic human action recognition with multimodal feature selection and fusion

Author keywords

Fuzzy integral; Multimodal fusion; Multiple kernel learning (MKL); Realistic human action recognition

Indexed keywords

ACTION RECOGNITION; DYNAMIC BACKGROUND; FUZZY INTEGRAL; HUMAN-ACTION RECOGNITION; MULTI-MODAL FUSION; MULTIMODAL FEATURES; MULTIPLE KERNEL LEARNING; REALISTIC SCENARIO; CONTROLLED CONDITIONS;

EID: 84887050628     PISSN: 10834427     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSMCA.2012.2226575     Document Type: Article
Times cited: 73

References (51)
  • 2
    • 77949275097
    • A survey on vision-based human action recognition
    • Jun. 2010
    • R. Poppe, A survey on vision-based human action recognition, Image Vis. Comput., vol. 28, no. 6, pp. 976-990, Jun. 2010.
    • Image Vis. Comput. , vol.28 , Issue.6 , pp. 976-990
    • Poppe, R.1
  • 3
    • 79955649703
    • Human activity analysis: A review
    • Apr. 2011
    • J. Aggarwal and M. Ryoo, Human activity analysis: A review, ACM Comput. Surv., vol. 43, no. 3, p. 16, Apr. 2011.
    • ACM Comput. Surv. , vol.43 , Issue.3 , p. 16
    • Aggarwal, J.1    Ryoo, M.2
  • 4
    • 10044233701
    • Recognizing human actions: A local SVM approach
    • C. Schüldt, I. Laptev, and B. Caputo, Recognizing human actions: A local SVM approach, in Proc. IEEE ICPR, 2004, vol. 3, pp. 32-36.
    • (2004) Proc. IEEE ICPR , vol.3 , pp. 32-36
    • Schüldt, C.1    Laptev, I.2    Caputo, B.3
  • 7
    • 70450192896
    • Dense saliency-based spatiotemporal feature points for action recognition
    • K. Rapantzikos, Y. Avrithis, and S. Kollias, Dense saliency-based spatiotemporal feature points for action recognition, in Proc. IEEE CVPR, 2009, pp. 1454-1461.
    • (2009) Proc. IEEE CVPR , pp. 1454-1461
    • Rapantzikos, K.1    Avrithis, Y.2    Kollias, S.3
  • 8
    • 70450214829
    • Hierarchical spatio-temporal context modeling for action recognition
    • J. Sun, X. Wu, S. Yan, L.-F. Cheong, T.-S. Chua, and J. Li, Hierarchical spatio-temporal context modeling for action recognition, in Proc. IEEE CVPR, 2009, pp. 2004-2011.
    • (2009) Proc. IEEE CVPR , pp. 2004-2011
    • Sun, J.1    Wu, X.2    Yan, S.3    Cheong, L.-F.4    Chua, T.-S.5    Li, J.6
  • 9
    • 70450203660
    • Recognizing realistic actions from videos "in the wild"
    • J. Liu, J. Luo, and M. Shah, Recognizing realistic actions from videos "in the wild", in Proc. IEEE CVPR, 2009, pp. 1996-2003.
    • (2009) Proc. IEEE CVPR , pp. 1996-2003
    • Liu, J.1    Luo, J.2    Shah, M.3
  • 10
    • 72449171990
    • Detecting video events based on action recognition in complex scenes using spatio-temporal descriptor
    • Oct. 2009
    • G. Zhu, M. Yang, K. Yu, W. Xu, and Y. Gong, Detecting video events based on action recognition in complex scenes using spatio-temporal descriptor, in Proc. ACM Int. Conf. Multimedia, Oct. 2009, pp. 165-174.
    • Proc. ACM Int. Conf. Multimedia , pp. 165-174
    • Zhu, G.1    Yang, M.2    Yu, K.3    Xu, W.4    Gong, Y.5
  • 11
    • 77953208298
    • Selection and context for action recognition
    • Oct. 2009
    • D. Han, L. Bo, and C. Sminchisescu, Selection and context for action recognition, in Proc. IEEE ICCV, Oct. 2009, pp. 1933-1940.
    • Proc. IEEE ICCV , pp. 1933-1940
    • Han, D.1    Bo, L.2    Sminchisescu, C.3
  • 13
    • 65449154061
    • Meaningful auditory information enhances perception of visual biological motion
    • Apr. 2009
    • R. Arrighi, F. Marini, and D. Burr, Meaningful auditory information enhances perception of visual biological motion, J. Vis., vol. 9, no. 4, pp. 25-1-25-7, Apr. 2009.
    • J. Vis. , vol.9 , Issue.4 , pp. 25-1-25-7
    • Arrighi, R.1    Marini, F.2    Burr, D.3
  • 14
    • 48749108678
    • Real-time highlight detection in baseball video for TVs with time-shift function
    • May 2008
    • H.-G. Kim, J. Jeong, J.-H. Kim, and J. Kim, Real-time highlight detection in baseball video for TVs with time-shift function, IEEE Trans. Consum. Electron., vol. 54, no. 2, pp. 831-838, May 2008.
    • IEEE Trans. Consum. Electron. , vol.54 , Issue.2 , pp. 831-838
    • Kim, H.-G.1    Jeong, J.2    Kim, J.-H.3    Kim, J.4
  • 15
    • 4944245163
    • Maximum entropy model-based baseball highlight detection and classification
    • Nov. 2004
    • Y. Gong, M. Han, W. Hua, and W. Xu, Maximum entropy model-based baseball highlight detection and classification, Comput. Vis. Image Understand., vol. 96, no. 2, pp. 181-199, Nov. 2004.
    • Comput. Vis. Image Understand. , vol.96 , Issue.2 , pp. 181-199
    • Gong, Y.1    Han, M.2    Hua, W.3    Xu, W.4
  • 16
    • 2542497936
    • Semantic indexing of soccer audio-visual sequences: A multimodal approach based on controlled Markov chains
    • May 2004
    • R. Leonardi, P. Migliorati, and M. Prandini, Semantic indexing of soccer audio-visual sequences: A multimodal approach based on controlled Markov chains, IEEE Trans. Circuits Syst. Video Technol., vol. 14, no. 5, pp. 634-643, May 2004.
    • IEEE Trans. Circuits Syst. Video Technol. , vol.14 , Issue.5 , pp. 634-643
    • Leonardi, R.1    Migliorati, P.2    Prandini, M.3
  • 17
    • 33846623313
    • Audio-visual event recognition in surveillance video sequences
    • Feb. 2007
    • M. Cristani, M. Bicego, and V. Murino, Audio-visual event recognition in surveillance video sequences, IEEE Trans. Multimedia, vol. 9, no. 2, pp. 257-267, Feb. 2007.
    • IEEE Trans. Multimedia , vol.9 , Issue.2 , pp. 257-267
    • Cristani, M.1    Bicego, M.2    Murino, V.3
  • 18
    • 59349103114
    • A framework for flexible summarization of racquet sports video using multiple modalities
    • Mar. 2009
    • C. Liu, Q. Huang, S. Jiang, L. Xing, Q. Ye, and W. Gao, A framework for flexible summarization of racquet sports video using multiple modalities, Comput. Vis. Image Understand., vol. 113, no. 3, pp. 415-424, Mar. 2009.
    • Comput. Vis. Image Understand. , vol.113 , Issue.3 , pp. 415-424
    • Liu, C.1    Huang, Q.2    Jiang, S.3    Xing, L.4    Ye, Q.5    Gao, W.6
  • 19
    • 19944366235
    • Video data mining: Semantic indexing and event detection from the association perspective
    • May 2005
    • X. Zhu, X. Wu, A. Elmagarmid, Z. Feng, and L. Wu, Video data mining: Semantic indexing and event detection from the association perspective, IEEE Trans. Knowl. Data Eng., vol. 17, no. 5, pp. 665-677, May 2005.
    • IEEE Trans. Knowl. Data Eng. , vol.17 , Issue.5 , pp. 665-677
    • Zhu, X.1    Wu, X.2    Elmagarmid, A.3    Feng, Z.4    Wu, L.5
  • 20
    • 4043050072
    • Content-based movie analysis and indexing based on audiovisual cues
    • Aug. 2004
    • Y. Li, S. Narayanan, and C.-C. J. Kuo, Content-based movie analysis and indexing based on audiovisual cues, IEEE Trans. Circuits Syst. Video Technol., vol. 14, no. 8, pp. 1073-1085, Aug. 2004.
    • IEEE Trans. Circuits Syst. Video Technol. , vol.14 , Issue.8 , pp. 1073-1085
    • Li, Y.1    Narayanan, S.2    Kuo, C.-C.J.3
  • 21
    • 79951644084
    • Realistic human action recognition with audio context
    • Dec. 2010
    • Q. Wu, Z. Wang, F. Deng, and D. D. Feng, Realistic human action recognition with audio context, in Proc. Int. Conf. DICTA, Dec. 2010, pp. 288-293.
    • Proc. Int. Conf. DICTA , pp. 288-293
    • Wu, Q.1    Wang, Z.2    Deng, F.3    Feng, D.D.4
  • 22
    • 17044405923
    • Toward integrating feature selection algorithms for classification and clustering
    • Apr. 2005
    • H. Liu and L. Yu, Toward integrating feature selection algorithms for classification and clustering, IEEE Trans. Knowl. Data Eng., vol. 17, no. 4, pp. 491-502, Apr. 2005.
    • IEEE Trans. Knowl. Data Eng. , vol.17 , Issue.4 , pp. 491-502
    • Liu, H.1    Yu, L.2
  • 23
    • 77955856351
    • Environment recognition using selected MPEG-7 audio features and mel-frequency cepstral coefficients
    • Jun. 2010
    • G. Muhammad, Y. A. Alotaibi, M. Alsulaiman, and M. N. Huda, Environment recognition using selected MPEG-7 audio features and mel-frequency cepstral coefficients, in Proc. IEEE ICDT, Jun. 2010, pp. 11-16.
    • Proc. IEEE ICDT , pp. 11-16
    • Muhammad, G.1    Alotaibi, Y.A.2    Alsulaiman, M.3    Huda, M.N.4
  • 24
    • 66749087087
    • Using GA-based feature selection for emotion recognition from physiological signals
    • Feb. 2009
    • Y. Gu, S. Tan, K. Wong, M. Ho, and L. Qu, Using GA-based feature selection for emotion recognition from physiological signals, in Proc. ISPACS, Feb. 2009, pp. 1-4.
    • Proc. ISPACS , pp. 1-4
    • Gu, Y.1    Tan, S.2    Wong, K.3    Ho, M.4    Qu, L.5
  • 25
    • 71149100224
    • More generality in efficient multiple kernel learning
    • M. Varma and B. R. Babu, More generality in efficient multiple kernel learning, in Proc. ICML, 2009, pp. 1065-1072.
    • (2009) Proc. ICML , pp. 1065-1072
    • Varma, M.1    Babu, B.R.2
  • 26
    • 38349121660
    • Fuzzy integral based information fusion for classification of highly confusable non-speech sounds
    • May 2008
    • A. Temko, D. Macho, and C. Nadeu, Fuzzy integral based information fusion for classification of highly confusable non-speech sounds, Pattern Recognit., vol. 41, no. 5, pp. 1814-1823, May 2008.
    • Pattern Recognit. , vol.41 , Issue.5 , pp. 1814-1823
    • Temko, A.1    Macho, D.2    Nadeu, C.3
  • 28
    • 27644547620
    • A performance evaluation of local descriptors
    • Oct. 2005
    • K. Mikolajczyk and C. Schmid, A performance evaluation of local descriptors, IEEE Trans. Pattern Anal. Mach. Intell., vol. 27, no. 10, pp. 1615-1630, Oct. 2005.
    • IEEE Trans. Pattern Anal. Mach. Intell. , vol.27 , Issue.10 , pp. 1615-1630
    • Mikolajczyk, K.1    Schmid, C.2
  • 29
    • 0344551869
    • Space-time interest points
    • I. Laptev and T. Lindeberg, Space-time interest points, in Proc. IEEE ICCV, 2003, vol. 1, pp. 432-439.
    • (2003) Proc. IEEE ICCV , vol.1 , pp. 432-439
    • Laptev, I.1    Lindeberg, T.2
  • 31
    • 70450196535
    • Recognising action as clouds of space-time interest points
    • M. Bregonzio, S. Gong, and T. Xiang, Recognising action as clouds of space-time interest points, in Proc. IEEE CVPR, 2009, pp. 1948-1955.
    • (2009) Proc. IEEE CVPR , pp. 1948-1955
    • Bregonzio, M.1    Gong, S.2    Xiang, T.3
  • 32
    • 10044236762
    • Multimodal video indexing: A review of the state-of-the-art
    • Jan. 2005
    • C. G. Snoek and M. Worring, Multimodal video indexing: A review of the state-of-the-art, Multimedia Tools Appl., vol. 25, no. 1, pp. 5-35, Jan. 2005.
    • Multimedia Tools Appl. , vol.25 , Issue.1 , pp. 5-35
    • Snoek, C.G.1    Worring, M.2
  • 33
    • 72549099611
    • Short-term audiovisual atoms for generic video concept classification
    • Beijing, China, Oct. 2009
    • W. Jiang, C. Cotton, S.-F. Chang, D. Ellis, and A. Loui, Short-term audiovisual atoms for generic video concept classification, in Proc. ACM Int. Conf. Multimedia, Beijing, China, Oct. 2009, pp. 5-14.
    • Proc. ACM Int. Conf. Multimedia , pp. 5-14
    • Jiang, W.1    Cotton, C.2    Chang, S.-F.3    Ellis, D.4    Loui, A.5
  • 34
    • 70450273199
    • Information theoretic feature extraction for audio-visual speech recognition
    • Dec. 2009
    • M. Gurban and J.-P. Thiran, Information theoretic feature extraction for audio-visual speech recognition, IEEE Trans. Signal Process., vol. 57, no. 12, pp. 4765-4776, Dec. 2009.
    • IEEE Trans. Signal Process. , vol.57 , Issue.12 , pp. 4765-4776
    • Gurban, M.1    Thiran, J.-P.2
  • 35
    • 68149163531
    • Environmental sound recognition with time-frequency audio features
    • Aug. 2009
    • S. Chu, S. Narayanan, and C.-C. Kuo, Environmental sound recognition with time-frequency audio features, IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 6, pp. 1142-1158, Aug. 2009.
    • IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.6 , pp. 1142-1158
    • Chu, S.1    Narayanan, S.2    Kuo, C.-C.3
  • 36
    • 70449395311
    • Representation and feature selection using multiple kernel learning
    • Jun. 2009
    • A. D. Dileep and C. C. Sekhar, Representation and feature selection using multiple kernel learning, in Proc. IJCNN, Jun. 2009, pp. 717-722.
    • Proc. IJCNN , pp. 717-722
    • Dileep, A.D.1    Sekhar, C.C.2
  • 37
    • 0010739663
    • Filters, wrappers and a boosting-based hybrid for feature selection
    • S. Das, Filters, wrappers and a boosting-based hybrid for feature selection, in Proc. IEEE ICML, 2001, pp. 98-101.
    • (2001) Proc. IEEE ICML , pp. 98-101
    • Das, S.1
  • 38
    • 0003076895
    • Feature selection for high-dimensional genomic microarray data
    • E. P. Xing, M. I. Jordan, and R. M. Karp, Feature selection for high-dimensional genomic microarray data, in Proc. IEEE ICML, 2001, pp. 601-608.
    • (2001) Proc. IEEE ICML , pp. 601-608
    • Xing, E.P.1    Jordan, M.I.2    Karp, R.M.3
  • 39
    • 78049469733
    • Multimodal fusion for multimedia analysis: A survey
    • Nov. 2010
    • P. K. Atrey, M. A. Hossain, A. E. Saddik, and M. S. Kankanhalli, Multimodal fusion for multimedia analysis: A survey, Multimedia Syst., vol. 16, no. 6, pp. 345-379, Nov. 2010.
    • Multimedia Syst. , vol.16 , Issue.6 , pp. 345-379
    • Atrey, P.K.1    Hossain, M.A.2    Saddik, A.E.3    Kankanhalli, M.S.4
  • 40
    • 77956978396
    • Audiovisual information fusion in human-computer interfaces and intelligent environments: A survey
    • Oct. 2010
    • S. T. Shivappa, M. M. Trivedi, and B. D. Rao, Audiovisual information fusion in human-computer interfaces and intelligent environments: A survey, Proc. IEEE, vol. 98, no. 10, pp. 1692-1715, Oct. 2010.
    • Proc. IEEE , vol.98 , Issue.10 , pp. 1692-1715
    • Shivappa, S.T.1    Trivedi, M.M.2    Rao, B.D.3
  • 41
    • 38849120236
    • Tracking humans using multi-modal fusion
    • Jun. 2005
    • X. Zou and B. Bhanu, Tracking humans using multi-modal fusion, in Proc. IEEE Int. Workshop CVPR, Jun. 2005, p. 4.
    • Proc. IEEE Int. Workshop CVPR , p. 4
    • Zou, X.1    Bhanu, B.2
  • 42
    • 51949105885
    • Recognizing human actions using multiple features
    • Jun. 2008
    • J. Liu, S. Ali, and M. Shah, Recognizing human actions using multiple features, in Proc. IEEE Int. Conf. CVPR, Jun. 2008, pp. 1-8.
    • Proc. IEEE Int. Conf. CVPR , pp. 1-8
    • Liu, J.1    Ali, S.2    Shah, M.3
  • 44
    • 77956006653
    • Multimodal semi-supervised learning for image classification
    • Jun. 2010
    • M. Guillaumin, J. Verbeek, and C. Schmid, Multimodal semi-supervised learning for image classification, in Proc. IEEE Int. Conf. CVPR, Jun. 2010, pp. 902-909.
    • Proc. IEEE Int. Conf. CVPR , pp. 902-909
    • Guillaumin, M.1    Verbeek, J.2    Schmid, C.3
  • 46
    • 50549100416
    • An empirical study of statistical properties of the Choquet and Sugeno integrals
    • Aug. 2008
    • M. Grabisch and E. Raufaste, An empirical study of statistical properties of the Choquet and Sugeno integrals, IEEE Trans. Fuzzy Syst., vol. 16, no. 4, pp. 839-850, Aug. 2008.
    • IEEE Trans. Fuzzy Syst. , vol.16 , Issue.4 , pp. 839-850
    • Grabisch, M.1    Raufaste, E.2
  • 47
    • 24944451092
    • On space-time interest points
    • Sep. 2005
    • I. Laptev, On space-time interest points, Int. J. Comput. Vis., vol. 64, no. 2, pp. 107-123, Sep. 2005.
    • Int. J. Comput. Vis. , vol.64 , Issue.2 , pp. 107-123
    • Laptev, I.1
  • 49
    • 50649115912
    • Learning the discriminative power-invariance trade-off
    • Oct. 2007
    • M. Varma and D. Ray, Learning the discriminative power-invariance trade-off, in Proc. IEEE ICCV, Oct. 2007, pp. 1-8.
    • Proc. IEEE ICCV , pp. 1-8
    • Varma, M.1    Ray, D.2
  • 51
    • 0029212628
    • A new algorithm for identifying fuzzy measures and its application to pattern recognition
    • Mar. 1995
    • M. Grabisch, A new algorithm for identifying fuzzy measures and its application to pattern recognition, in Proc. IEEE Int. Conf. Fuzzy Syst., Mar. 1995, vol. 1, pp. 145-150.
    • Proc. IEEE Int. Conf. Fuzzy Syst. , vol.1 , pp. 145-150
    • Grabisch, M.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.