Volume 25, Issue 3, 2016, Pages 1033-1046

Deep Fusion of Multiple Semantic Cues for Complex Event Recognition

Author keywords

deep learning; fusion; multimedia event recognition

Indexed keywords

FUSION REACTIONS; LEARNING SYSTEMS;

EID: 84962692397     PISSN: 10577149     EISSN: None     Source Type: Journal    
DOI: 10.1109/TIP.2015.2511585     Document Type: Article
Times cited : (57)

References (48)
  • 1
    • 80052915321
    • Actom sequence models for efficient action detection
    • A. Gaidon, Z. Harchaoui, and C. Schmid, "Actom sequence models for efficient action detection," in Proc. CVPR, 2011, pp. 3201-3208.
    • (2011) Proc. CVPR , pp. 3201-3208
    • Gaidon, A.1    Harchaoui, Z.2    Schmid, C.3
  • 2
    • 84875599426
    • Video event recognition using concept attributes
    • J. Liu et al., "Video event recognition using concept attributes," in Proc. WACV, 2013, pp. 339-346.
    • (2013) Proc. WACV , pp. 339-346
    • Liu, J.1
  • 3
    • 84899755756
    • Event-driven semantic concept discovery by exploiting weakly tagged Internet images
    • J. Chen, Y. Cui, G. Ye, D. Liu, and S.-F. Chang, "Event-driven semantic concept discovery by exploiting weakly tagged Internet images," in Proc. ICMR, 2014, p. 1.
    • (2014) Proc. ICMR , pp. 1
    • Chen, J.1    Cui, Y.2    Ye, G.3    Liu, D.4    Chang, S.-F.5
  • 4
    • 84898775956
    • ACTIVE: Activity concept transitions in video event classification
    • C. Sun and R. Nevatia, "ACTIVE: Activity concept transitions in video event classification," in Proc. ICCV, 2013, pp. 913-920.
    • (2013) Proc. ICCV , pp. 913-920
    • Sun, C.1    Nevatia, R.2
  • 5
    • 84899713776
    • ISOMER: Informative segment observations for multimedia event recounting
    • C. Sun et al., "ISOMER: Informative segment observations for multimedia event recounting," in Proc. ICMR, 2014, p. 241.
    • (2014) Proc. ICMR , pp. 241
    • Sun, C.1
  • 8
    • 85162513516
    • Object bank: A high-level image representation for scene classification & semantic feature sparsification
    • L.-J. Li, H. Su, L. Fei-Fei, and E. P. Xing, "Object bank: A high-level image representation for scene classification & semantic feature sparsification," in Proc. NIPS, 2010, pp. 1378-1386.
    • (2010) Proc. NIPS , pp. 1378-1386
    • Li, L.-J.1    Su, H.2    Fei-Fei, L.3    Xing, E.P.4
  • 10
    • 0000107975
    • Relations between two sets of variates
    • H. Hotelling, "Relations between two sets of variates," Biometrika, vol. 28, nos. 3-4, pp. 321-377, 1936.
    • (1936) Biometrika , vol.28 , Issue.3-4 , pp. 321-377
    • Hotelling, H.1
  • 12
    • 69349090197
    • Learning deep architectures for AI
    • Y. Bengio, "Learning deep architectures for AI," Found. Trends Mach. Learn., vol. 2, no. 1, pp. 1-127, 2009.
    • (2009) Found. Trends Mach. Learn , vol.2 , Issue.1 , pp. 1-127
    • Bengio, Y.1
  • 13
    • 84866707906
    • Evaluation of low-level features and their combinations for complex event detection in open source videos
    • A. Tamrakar et al., "Evaluation of low-level features and their combinations for complex event detection in open source videos," in Proc. CVPR, 2012, pp. 3681-3688.
    • (2012) Proc. CVPR , pp. 3681-3688
    • Tamrakar, A.1
  • 14
    • 78149348487
    • Object, scene and actions: Combining multiple features for human action recognition
    • N. Ikizler-Cinbis and S. Sclaroff, "Object, scene and actions: Combining multiple features for human action recognition," in Proc. ECCV, 2010, pp. 494-507.
    • (2010) Proc. ECCV , pp. 494-507
    • Ikizler-Cinbis, N.1    Sclaroff, S.2
  • 15
    • 84898791167
    • Action and event recognition with Fisher vectors on a compact feature set
    • D. Oneata, J. Verbeek, and C. Schmid, "Action and event recognition with Fisher vectors on a compact feature set," in Proc. ICCV, 2013, pp. 1817-1824.
    • (2013) Proc. ICCV , pp. 1817-1824
    • Oneata, D.1    Verbeek, J.2    Schmid, C.3
  • 16
    • 84875607338
    • Large-scale Web video event classification by use of Fisher vectors
    • C. Sun and R. Nevatia, "Large-scale Web video event classification by use of Fisher vectors," in Proc. WACV, 2013, pp. 15-22.
    • (2013) Proc. WACV , pp. 15-22
    • Sun, C.1    Nevatia, R.2
  • 17
    • 84866712341
    • Multimodal feature fusion for robust event detection in Web videos
    • P. Natarajan et al., "Multimodal feature fusion for robust event detection in Web videos," in Proc. CVPR, 2012, pp. 1298-1305.
    • (2012) Proc. CVPR , pp. 1298-1305
    • Natarajan, P.1
  • 18
    • 84864120582
    • Multimodal knowledge-based analysis in multimedia event detection
    • E. Younessian, T. Mitamura, and A. Hauptmann, "Multimodal knowledge-based analysis in multimedia event detection," in Proc. ICMR, 2012, Art. ID 51.
    • (2012) Proc. ICMR
    • Younessian, E.1    Mitamura, T.2    Hauptmann, A.3
  • 19
    • 84867886443
    • Complex events detection using data-driven concepts
    • Y. Yang and M. Shah, "Complex events detection using data-driven concepts," in Proc. ECCV, 2012, pp. 722-735.
    • (2012) Proc. ECCV , pp. 722-735
    • Yang, Y.1    Shah, M.2
  • 20
    • 84911434661
    • Zero-shot event detection using multi-modal fusion of weakly supervised concepts
    • S. Wu, S. Bondugula, F. Luisier, X. Zhuang, and P. Natarajan, "Zero-shot event detection using multi-modal fusion of weakly supervised concepts," in Proc. CVPR, 2014, pp. 2665-2672.
    • (2014) Proc. CVPR , pp. 2665-2672
    • Wu, S.1    Bondugula, S.2    Luisier, F.3    Zhuang, X.4    Natarajan, P.5
  • 21
    • 84871359352
    • Leveraging high-level and low-level features for multimedia event detection
    • L. Jiang, A. G. Hauptmann, and G. Xiang, "Leveraging high-level and low-level features for multimedia event detection," in Proc. MM, 2012, pp. 449-458.
    • (2012) Proc. MM , pp. 449-458
    • Jiang, L.1    Hauptmann, A.G.2    Xiang, G.3
  • 22
    • 84867889550
    • Recognizing complex events using large margin joint low-level event model
    • H. Izadinia and M. Shah, "Recognizing complex events using large margin joint low-level event model," in Proc. ECCV, 2012, pp. 430-444.
    • (2012) Proc. ECCV , pp. 430-444
    • Izadinia, H.1    Shah, M.2
  • 23
    • 78149355981
    • Efficient object category recognition using classemes
    • L. Torresani, M. Szummer, and A. Fitzgibbon, "Efficient object category recognition using classemes," in Proc. ECCV, 2010, pp. 776-789.
    • (2010) Proc. ECCV , pp. 776-789
    • Torresani, L.1    Szummer, M.2    Fitzgibbon, A.3
  • 25
    • 84894905430
    • Evaluating multimedia features and fusion for example-based event detection
    • G. K. Myers et al., "Evaluating multimedia features and fusion for example-based event detection," Mach. Vis. Appl., vol. 25, no. 1, pp. 17-32, 2014.
    • (2014) Mach. Vis. Appl , vol.25 , Issue.1 , pp. 17-32
    • Myers, G.K.1
  • 26
    • 84883126733
    • Early versus late fusion in semantic video analysis
    • C. G. Snoek, M. Worring, and A. W. M. Smeulders, "Early versus late fusion in semantic video analysis," in Proc. MM, 2005, pp. 399-402.
    • (2005) Proc. MM , pp. 399-402
    • Snoek, C.G.1    Worring, M.2    Smeulders, A.W.M.3
  • 28
    • 84866712367
    • Robust late fusion with rank minimization
    • G. Ye, D. Liu, I.-H. Jhuo, and S.-F. Chang, "Robust late fusion with rank minimization," in Proc. CVPR, 2012, pp. 3021-3028.
    • (2012) Proc. CVPR , pp. 3021-3028
    • Ye, G.1    Liu, D.2    Jhuo, I.-H.3    Chang, S.-F.4
  • 29
    • 84897724955
    • Multi-feature fusion via hierarchical regression for multimedia analysis
    • Y. Yang, J. Song, Z. Huang, Z. Ma, N. Sebe, and A. G. Hauptmann, "Multi-feature fusion via hierarchical regression for multimedia analysis," IEEE Trans. Multimedia, vol. 15, no. 3, pp. 572-581, Apr. 2013.
    • (2013) IEEE Trans. Multimedia , vol.15 , Issue.3 , pp. 572-581
    • Yang, Y.1    Song, J.2    Huang, Z.3    Ma, Z.4    Sebe, N.5    Hauptmann, A.G.6
  • 30
    • 84913586072
    • Exploring inter-feature and inter-class relationships with deep neural networks for video classification
    • Z. Wu, Y.-G. Jiang, J. Wang, J. Pu, and X. Xue, "Exploring inter-feature and inter-class relationships with deep neural networks for video classification," in Proc. 22nd ACM Int. Conf. Multimedia, 2014, pp. 167-176.
    • (2014) Proc. 22nd ACM Int. Conf. Multimedia , pp. 167-176
    • Wu, Z.1    Jiang, Y.-G.2    Wang, J.3    Pu, J.4    Xue, X.5
  • 32
    • 84877724347
    • Multimodal learning with deep Boltzmann machines
    • N. Srivastava and R. Salakhutdinov, "Multimodal learning with deep Boltzmann machines," in Proc. NIPS, 2012, pp. 2949-2980.
    • (2012) Proc. NIPS , pp. 2949-2980
    • Srivastava, N.1    Salakhutdinov, R.2
  • 34
    • 0036505670
    • A comparison of methods for multiclass support vector machines
    • C.-W. Hsu and C.-J. Lin, "A comparison of methods for multiclass support vector machines," IEEE Trans. Neural Netw., vol. 13, no. 2, pp. 415-425, Mar. 2002.
    • (2002) IEEE Trans. Neural Netw , vol.13 , Issue.2 , pp. 415-425
    • Hsu, C.-W.1    Lin, C.-J.2
  • 35
    • 84905223557
    • TRECVID 2010-An overview of the goals, tasks, data, evaluation mechanisms, and metrics
    • A. Over et al., "TRECVID 2010-An overview of the goals, tasks, data, evaluation mechanisms, and metrics," in Proc. TRECVid, 2011, pp. 10-12.
    • (2011) Proc. TRECVid , pp. 10-12
    • Over, A.1
  • 36
    • 84864116485
    • SUPER: Towards real-time event recognition in Internet videos
    • Y.-G. Jiang, "SUPER: Towards real-time event recognition in Internet videos," in Proc. ICMR, 2012, Art. ID 7.
    • (2012) Proc. ICMR
    • Jiang, Y.-G.1
  • 37
    • 84898775557
    • Video event understanding using natural language descriptions
    • V. Ramanathan, P. Liang, and L. Fei-Fei, "Video event understanding using natural language descriptions," in Proc. ICCV, 2013, pp. 905-912.
    • (2013) Proc. ICCV , pp. 905-912
    • Ramanathan, V.1    Liang, P.2    Fei-Fei, L.3
  • 39
    • 77955988947
    • SUN database: Large-scale scene recognition from abbey to zoo
    • J. Xiao, J. Hays, K. A. Ehinger, A. Oliva, and A. Torralba, "SUN database: Large-scale scene recognition from abbey to zoo," in Proc. CVPR, 2010, pp. 3485-3492.
    • (2010) Proc. CVPR , pp. 3485-3492
    • Xiao, J.1    Hays, J.2    Ehinger, K.A.3    Oliva, A.4    Torralba, A.5
  • 41
    • 0035328421
    • Modeling the shape of the scene: A holistic representation of the spatial envelope
    • A. Oliva and A. Torralba, "Modeling the shape of the scene: A holistic representation of the spatial envelope," Int. J. Comput. Vis., vol. 42, no. 3, pp. 145-175, 2001.
    • (2001) Int. J. Comput. Vis , vol.42 , Issue.3 , pp. 145-175
    • Oliva, A.1    Torralba, A.2
  • 42
    • 0036647193
    • Multiresolution gray-scale and rotation invariant texture classification with local binary patterns
    • T. Ojala, M. Pietikäinen, and T. Mäenpää, "Multiresolution gray-scale and rotation invariant texture classification with local binary patterns," IEEE Trans. Pattern Anal. Mach. Intell., vol. 24, no. 7, pp. 971-987, Jul. 2002.
    • (2002) IEEE Trans. Pattern Anal. Mach. Intell , vol.24 , Issue.7 , pp. 971-987
    • Ojala, T.1    Pietikäinen, M.2    Mäenpää, T.3
  • 43
    • 33845572523
    • Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories
    • S. Lazebnik, C. Schmid, and J. Ponce, "Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories," in Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., vol. 2, Jun. 2006, pp. 2169-2178.
    • (2006) Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit , vol.2 , pp. 2169-2178
    • Lazebnik, S.1    Schmid, C.2    Ponce, J.3
  • 44
    • 84898805910
    • Action recognition with improved trajectories
    • H. Wang and C. Schmid, "Action recognition with improved trajectories," in Proc. ICCV, 2013, pp. 3551-3558.
    • (2013) Proc. ICCV , pp. 3551-3558
    • Wang, H.1    Schmid, C.2
  • 45
    • 24944451092
    • On space-time interest points
    • I. Laptev, "On space-time interest points," Int. J. Comput. Vis., vol. 64, no. 2, pp. 107-123, 2005.
    • (2005) Int. J. Comput. Vis , vol.64 , Issue.2 , pp. 107-123
    • Laptev, I.1
  • 46
    • 56449086223
    • Training restricted Boltzmann machines using approximations to the likelihood gradient
    • T. Tieleman, "Training restricted Boltzmann machines using approximations to the likelihood gradient," in Proc. ICML, 2008.
    • (2008) Proc. ICML
    • Tieleman, T.1
  • 47
    • 84898834622
    • Feature weighting via optimal thresholding for video analysis
    • Z. Xu, Y. Yang, I. Tsang, N. Sebe, and A. G. Hauptmann, "Feature weighting via optimal thresholding for video analysis," in Proc. ICCV, 2013, pp. 3440-3447.
    • (2013) Proc. ICCV , pp. 3440-3447
    • Xu, Z.1    Yang, Y.2    Tsang, I.3    Sebe, N.4    Hauptmann, A.G.5
  • 48
    • 84894901576
    • Discovering joint audio-visual codewords for video event detection
    • I.-H. Jhuo et al., "Discovering joint audio-visual codewords for video event detection," Mach. Vis. Appl., vol. 25, no. 1, pp. 33-47, 2014.
    • (2014) Mach. Vis. Appl , vol.25 , Issue.1 , pp. 33-47
    • Jhuo, I.-H.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.