메뉴 건너뛰기




Volumn 39, Issue 8, 2017, Pages 1617-1632

Semantic Pooling for Complex Event Analysis in Untrimmed Videos

Author keywords

Complex event detection; event recognition; event recounting; nearly isotonic SVM; semantic saliency

Indexed keywords

IMAGE RETRIEVAL; SUPPORT VECTOR MACHINES;

EID: 85015732763     PISSN: 01628828     EISSN: None     Source Type: Journal    
DOI: 10.1109/TPAMI.2016.2608901     Document Type: Article
Times cited : (327)

References (75)
  • 1
    • 84886571480 scopus 로고    scopus 로고
    • Multimedia event detection using a classifier-specific intermedi ate representation
    • Nov
    • Z. Ma, Y. Yang, N. Sebe, K. Zheng, and A. G. Hauptmann, "Multimedia event detection using a classifier-specific intermedi ate representation, " IEEE Trans. Multimedia, vol. 15, no. 7, pp. 1628-1637, Nov. 2013.
    • (2013) IEEE Trans. Multimedia , vol.15 , Issue.7 , pp. 1628-1637
    • Ma, Z.1    Yang, Y.2    Sebe, N.3    Zheng, K.4    Hauptmann, A.G.5
  • 2
    • 84866707906 scopus 로고    scopus 로고
    • Evaluation of low-level features and their combinations for complex event detection in open source vid-eos
    • A. Tamrakar, et al., "Evaluation of low-level features and their combinations for complex event detection in open source vid-eos, " in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2012, pp. 3681-3688.
    • (2012) Proc. IEEE Conf. Comput. Vis. Pattern Recognit. , pp. 3681-3688
    • Tamrakar, A.1
  • 3
    • 84905573114 scopus 로고    scopus 로고
    • Knowledge adaptation with partially shared features for event detection using few exemplars
    • Sep
    • Z. Ma, Y. Yang, N. Sebe, and A. G. Hauptmann, "Knowledge adaptation with partially shared features for event detection using few exemplars, " IEEE Trans. Pattern Anal. Mach. Intell., vol. 36, no. 9, pp. 1789-1802, Sep. 2014.
    • (2014) IEEE Trans. Pattern Anal. Mach. Intell. , vol.36 , Issue.9 , pp. 1789-1802
    • Ma, Z.1    Yang, Y.2    Sebe, N.3    Hauptmann, A.G.4
  • 4
    • 84863116061 scopus 로고    scopus 로고
    • A multimedia retrieval framework based on semi-supervised ranking and relevance feedback
    • Apr
    • Y. Yang, F. Nie, D. Xu, J. Luo, Y. Zhuang, and Y. Pan, "A multimedia retrieval framework based on semi-supervised ranking and relevance feedback, " IEEE Trans. Pattern Anal. Mach. Intell., vol. 34, no. 4, pp. 723-742, Apr. 2012.
    • (2012) IEEE Trans. Pattern Anal. Mach. Intell. , vol.34 , Issue.4 , pp. 723-742
    • Yang, Y.1    Nie, F.2    Xu, D.3    Luo, J.4    Zhuang, Y.5    Pan, Y.6
  • 5
    • 85085787361 scopus 로고    scopus 로고
    • The AXES submissions at TrecVid 2013
    • R. Aly, et al., "The AXES submissions at TrecVid 2013, " in TREC-VID 2013.
    • (2013) TREC-VID
    • Aly, R.1
  • 6
    • 85112190132 scopus 로고    scopus 로고
    • Informedia@TRECVID 2014 MED and MER
    • S.-I. Yu, et al., "Informedia@TRECVID 2014 MED and MER, " in TRECVID 2014.
    • (2014) TRECVID
    • Yu, S.-I.1
  • 9
    • 0022388528 scopus 로고
    • Shifts in selective visual attention: Towards the underlying neural circuitry
    • C. Koch and S. Ullman, "Shifts in selective visual attention: Towards the underlying neural circuitry, " Human Neurobiology, vol. 4, pp. 219-227, 1985.
    • (1985) Human Neurobiology , vol.4 , pp. 219-227
    • Koch, C.1    Ullman, S.2
  • 11
  • 14
    • 84957922397 scopus 로고    scopus 로고
    • YFCC100M: The new data in multimedia research
    • B. Thomee, et al., "YFCC100M: The new data in multimedia research, " Comm. ACM, vol. 59, no. 2, pp. 64-73, 2016.
    • (2016) Comm. ACM , vol.59 , Issue.2 , pp. 64-73
    • Thomee, B.1
  • 16
    • 84905580784 scopus 로고    scopus 로고
    • Proximal alternating linearized minimization for nonconvex and nonsmooth problems
    • J. Bolte, S. Sabach, and M. Teboulle, "Proximal alternating linearized minimization for nonconvex and nonsmooth problems, " Math. Program. Series A, vol. 146, pp. 459-494, 2014.
    • (2014) Math. Program. Series A , vol.146 , pp. 459-494
    • Bolte, J.1    Sabach, S.2    Teboulle, M.3
  • 17
    • 84969822870 scopus 로고    scopus 로고
    • Complex event detection using semantic saliency and nearly-isotonic SVM
    • X. Chang, Y. Yang, E. P. Xing, and Y. Yu, "Complex event detection using semantic saliency and nearly-isotonic SVM, " in Proc. 32nd Int. Conf. Mach. Learn., 2015, pp. 1348-1357.
    • (2015) Proc. 32nd Int. Conf. Mach. Learn. , pp. 1348-1357
    • Chang, X.1    Yang, Y.2    Xing, E.P.3    Yu, Y.4
  • 18
    • 84865579385 scopus 로고    scopus 로고
    • Visual event recognition in videos by learning from Web data
    • Sep
    • L. Duan, D. Xu, I. W. Tsang, and J. Luo, "Visual event recognition in videos by learning from Web data, " IEEE Trans. Pattern Anal. Mach. Intell., vol. 34, no. 9, pp. 1667-1680, Sep. 2012.
    • (2012) IEEE Trans. Pattern Anal. Mach. Intell. , vol.34 , Issue.9 , pp. 1667-1680
    • Duan, L.1    Xu, D.2    Tsang, I.W.3    Luo, J.4
  • 19
    • 68549126868 scopus 로고    scopus 로고
    • Tensor-based transductive learning for multimodality video semantic concept detection
    • Aug
    • F. Wu, Y. Liu, and Y. Zhuang, "Tensor-based transductive learning for multimodality video semantic concept detection, " IEEE Trans. Multimedia, vol. 11, no. 5, pp. 868-878, Aug. 2009.
    • (2009) IEEE Trans. Multimedia , vol.11 , Issue.5 , pp. 868-878
    • Wu, F.1    Liu, Y.2    Zhuang, Y.3
  • 20
    • 3042535216 scopus 로고    scopus 로고
    • Distinctive image features from scale-invariant key-points
    • D. G. Lowe, "Distinctive image features from scale-invariant key-points, " Int. J. Comput. Vis., vol. 60, no. 2, pp. 91-110, 2004.
    • (2004) Int. J. Comput. Vis. , vol.60 , Issue.2 , pp. 91-110
    • Lowe, D.G.1
  • 21
    • 35548930762 scopus 로고    scopus 로고
    • Local veloc-ity-adapted motion events for spatio-temporal recognition
    • I. Laptev, B. Caputo, C. Schuldt, and T. Lindeberg, "Local veloc-ity-adapted motion events for spatio-temporal recognition, " Comput. Vis. Image Understanding, vol. 108, no. 3, pp. 207-229, 2007.
    • (2007) Comput. Vis. Image Understanding , vol.108 , Issue.3 , pp. 207-229
    • Laptev, I.1    Caputo, B.2    Schuldt, C.3    Lindeberg, T.4
  • 22
    • 84898805910 scopus 로고    scopus 로고
    • Action recognition with improved trajectories
    • H. Wang and C. Schmid, "Action recognition with improved trajectories, " in Proc. IEEE Int. Conf. Comput. Vis., 2013, pp. 3551-3558.
    • (2013) Proc. IEEE Int. Conf. Comput. Vis. , pp. 3551-3558
    • Wang, H.1    Schmid, C.2
  • 23
    • 84937862424 scopus 로고    scopus 로고
    • Two-stream convolutional net-works for action recognition in videos
    • K. Simonyan and A. Zisserman, "Two-stream convolutional net-works for action recognition in videos, " in Proc. Advances Neural Inf. Process. Syst., 2014, pp. 568-576.
    • (2014) Proc. Advances Neural Inf. Process. Syst. , pp. 568-576
    • Simonyan, K.1    Zisserman, A.2
  • 27
    • 84962921420 scopus 로고    scopus 로고
    • Modeling spatial-temporal clues in a hybrid deep learning framework for video classification
    • Z. Wu, X. Wang, Y. Jiang, H. Ye, and X. Xue, "Modeling spatial-temporal clues in a hybrid deep learning framework for video classification, " in Proc. 23rd ACM Conf. Multimedia Conf., 2015, pp. 461-470.
    • (2015) Proc. 23rd ACM Conf. Multimedia Conf. , pp. 461-470
    • Wu, Z.1    Wang, X.2    Jiang, Y.3    Ye, H.4    Xue, X.5
  • 28
    • 84978690061 scopus 로고    scopus 로고
    • Event Fisher vectors: Robust encoding visual diversity of visual streams
    • M. Nagel, T. Mensink, and C. G. Snoek, "Event Fisher vectors: Robust encoding visual diversity of visual streams, " in Proc. 26th British Mach. Vis. Conf., 2015, pp. 178.1-178.12.
    • (2015) Proc. 26th British Mach. Vis. Conf. , pp. 1781-17812
    • Nagel, M.1    Mensink, T.2    Snoek, C.G.3
  • 29
    • 84898791167 scopus 로고    scopus 로고
    • Action and event recognition with Fisher vectors on a compact feature set
    • D. Oneata, J. Verbeek, and C. Schmid, "Action and event recognition with Fisher vectors on a compact feature set, " in Proc. IEEE Int. Conf. Comput. Vis., 2013, pp. 1817-1824.
    • (2013) Proc. IEEE Int. Conf. Comput. Vis. , pp. 1817-1824
    • Oneata, D.1    Verbeek, J.2    Schmid, C.3
  • 30
    • 84911429593 scopus 로고    scopus 로고
    • DISCOVER: Discovering important segments for classification of video events and recounting
    • C. Sun and R. Nevatia, "DISCOVER: Discovering important segments for classification of video events and recounting, " in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2014, pp. 2569-2576.
    • (2014) Proc. IEEE Conf. Comput. Vis. Pattern Recognit. , pp. 2569-2576
    • Sun, C.1    Nevatia, R.2
  • 32
    • 84866712341 scopus 로고    scopus 로고
    • Multimodal feature fusion for robust event detection in Web videos
    • P. Natarajan, et al., "Multimodal feature fusion for robust event detection in Web videos, " in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2012, pp. 1298-1305.
    • (2012) Proc. IEEE Conf. Comput. Vis. Pattern Recognit. , pp. 1298-1305
    • Natarajan, P.1
  • 34
    • 84867855153 scopus 로고    scopus 로고
    • Local expert forest of score fusion for video event classification
    • J. Liu, S. McCloskey, and Y. Liu, "Local expert forest of score fusion for video event classification, " in Proc. 12th Eur. Conf. Comput. Vis., 2012, pp. 397-410.
    • (2012) Proc. 12th Eur. Conf. Comput. Vis. , pp. 397-410
    • Liu, J.1    McCloskey, S.2    Liu, Y.3
  • 36
    • 84898817119 scopus 로고    scopus 로고
    • Compositional models for video event detection: A multiple ker-nel learning latent variable approach
    • A. Vahdat, K. Cannons, G. Mori, S. Oh, and I. Kim, "Compositional models for video event detection: A multiple ker-nel learning latent variable approach, " in Proc. IEEE Int. Conf. Comput. Vis., 2013, pp. 1185-1192.
    • (2013) Proc. IEEE Int. Conf. Comput. Vis. , pp. 1185-1192
    • Vahdat, A.1    Cannons, K.2    Mori, G.3    Oh, S.4    Kim, I.5
  • 37
    • 84906508190 scopus 로고    scopus 로고
    • Recognizing com-plex events in videos by learning key static-dynamic evidences
    • K.-T. Lai, D. Liu, M.-S. Chen, and S.-F. Chang, "Recognizing com-plex events in videos by learning key static-dynamic evidences, " in Proc. 13th Eur. Conf. Comput. Vis., 2014, pp. 675-688.
    • (2014) Proc. 13th Eur. Conf. Comput. Vis. , pp. 675-688
    • Lai, K.-T.1    Liu, D.2    Chen, M.-S.3    Chang, S.-F.4
  • 39
    • 84970002232 scopus 로고    scopus 로고
    • Show, attend and tell: Neural image caption genera-tion with visual attention
    • K. Xu, et al., "Show, attend and tell: Neural image caption genera-tion with visual attention, " in Proc. 32nd Int. Conf. Mach. Learn., 2015, pp. 2048-2057.
    • (2015) Proc. 32nd Int. Conf. Mach. Learn. , pp. 2048-2057
    • Xu, K.1
  • 41
    • 50649103674 scopus 로고    scopus 로고
    • What, where and who? Classifying events by scene and object recognition
    • L. Li and F. Li, "What, where and who? Classifying events by scene and object recognition, " in Proc. IEEE 11th Int. Conf. Comput. Vis., 2007, pp. 1-8.
    • (2007) Proc. IEEE 11th Int. Conf. Comput. Vis. , pp. 1-8
    • Li, L.1    Li, F.2
  • 42
    • 84867887689 scopus 로고    scopus 로고
    • Image labeling on a network: Using social-network metadata for image classification
    • J. J. McAuley and J. Leskovec, "Image labeling on a network: Using social-network metadata for image classification, " in Proc. 12th Eur. Conf. Comput. Vis., 2012, pp. 828-841.
    • (2012) Proc. 12th Eur. Conf. Comput. Vis. , pp. 828-841
    • McAuley, J.J.1    Leskovec, J.2
  • 44
    • 84867889550 scopus 로고    scopus 로고
    • Recognizing complex events using large margin joint low-level event model
    • H. Izadinia and M. Shah, "Recognizing complex events using large margin joint low-level event model, " in Proc. 12th Eur. Conf. Comput. Vis., 2012, pp. 430-444.
    • (2012) Proc. 12th Eur. Conf. Comput. Vis. , pp. 430-444
    • Izadinia, H.1    Shah, M.2
  • 45
    • 84939510589 scopus 로고    scopus 로고
    • Enhancing video event recognition using automatically constructed semantic-visual knowledge base
    • Sep
    • X. Zhang, et al., "Enhancing video event recognition using automatically constructed semantic-visual knowledge base, " IEEE Trans. Multimedia, vol. 17, no. 9, pp. 1562-1575, Sep. 2015.
    • (2015) IEEE Trans. Multimedia , vol.17 , Issue.9 , pp. 1562-1575
    • Zhang, X.1
  • 46
    • 84875599426 scopus 로고    scopus 로고
    • Video event recognition using concept attributes
    • J. Liu, et al., "Video event recognition using concept attributes, " in Proc. IEEE Workshop Appl. Comput. Vis., 2013, pp. 339-346.
    • (2013) Proc. IEEE Workshop Appl. Comput. Vis. , pp. 339-346
    • Liu, J.1
  • 47
    • 84899713776 scopus 로고    scopus 로고
    • ISOMER: Informative segment observations for multimedia event recounting
    • Art. no. 241
    • C. Sun, et al., "ISOMER: Informative segment observations for multimedia event recounting, " in Proc. Int. Conf. Multimedia Retrieval, 2014, Art. no. 241.
    • (2014) Proc. Int. Conf. Multimedia Retrieval
    • Sun, C.1
  • 48
    • 84455192418 scopus 로고    scopus 로고
    • Towards textually describing complex video contents with audio-visual concept classifiers
    • C. C. Tan, Y. Jiang, and C. Ngo, "Towards textually describing complex video contents with audio-visual concept classifiers, " in Proc. 19th ACM Int. Conf. Multimedia, 2011, pp. 655-658.
    • (2011) Proc. 19th ACM Int. Conf. Multimedia , pp. 655-658
    • Tan, C.C.1    Jiang, Y.2    Ngo, C.3
  • 49
    • 70450202741 scopus 로고    scopus 로고
    • Understanding videos, constructing plots learning a visually grounded storyline model from annotated videos
    • A. Gupta, P. Srinivasan, J. Shi, and L. S. Davis, "Understanding videos, constructing plots learning a visually grounded storyline model from annotated videos, " in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2009, pp. 2012-2019.
    • (2009) Proc. IEEE Conf. Comput. Vis. Pattern Recognit. , pp. 2012-2019
    • Gupta, A.1    Srinivasan, P.2    Shi, J.3    Davis, L.S.4
  • 50
    • 84962907282 scopus 로고    scopus 로고
    • Searching persuasively: Joint event detection and evidence recounting with limited supervision
    • X. Chang, Y.-L. Yu, Y. Yang, and A. G Hauptmann, "Searching persuasively: Joint event detection and evidence recounting with limited supervision, " in Proc. 23rd ACM Int. Conf. Multimedia, 2015, pp. 581-590.
    • (2015) Proc. 23rd ACM Int. Conf. Multimedia , pp. 581-590
    • Chang, X.1    Yu, Y.-L.2    Yang, Y.3    Hauptmann, A.G.4
  • 51
    • 33847765090 scopus 로고    scopus 로고
    • A formal study of shot boundary detection
    • Feb
    • J. Yuan, et al., "A formal study of shot boundary detection, " IEEE Trans. Circuits Syst. Video Technol., vol. 17, no. 2, pp. 168-186, Feb. 2007.
    • (2007) IEEE Trans. Circuits Syst. Video Technol. , vol.17 , Issue.2 , pp. 168-186
    • Yuan, J.1
  • 52
    • 85083953063 scopus 로고    scopus 로고
    • Very deep convolutional net-works for large-scale image recognition
    • K. Simonyan and A. Zisserman, "Very deep convolutional net-works for large-scale image recognition, " in Proc. Int. Conf. Learn. Representations, 2015, pp. 1245-1258.
    • (2015) Proc. Int. Conf. Learn. Representations , pp. 1245-1258
    • Simonyan, K.1    Zisserman, A.2
  • 54
    • 84883487458 scopus 로고    scopus 로고
    • Image classification with the Fisher vector: Theory and practice
    • J. Sanchez, F. Perronnin, T. Mensink, and J. J. Verbeek, "Image classification with the Fisher vector: Theory and practice, " Int. J. Comput. Vis., vol. 105, no. 3, pp. 222-245, 2013.
    • (2013) Int. J. Comput. Vis. , vol.105 , Issue.3 , pp. 222-245
    • Sanchez, J.1    Perronnin, F.2    Mensink, T.3    Verbeek, J.J.4
  • 56
    • 78650994992 scopus 로고    scopus 로고
    • VLFeat: An open and portable library of computer vision algorithms
    • A. Vedaldi and B. Fulkerson, "VLFeat: An open and portable library of computer vision algorithms, " in Proc. 18th ACM Int. Conf. Multimedia, 2010, pp. 1469-1472.
    • (2010) Proc. 18th ACM Int. Conf. Multimedia , pp. 1469-1472
    • Vedaldi, A.1    Fulkerson, B.2
  • 62
    • 44049111982 scopus 로고
    • Nonlinear total variation based noise removal algorithms
    • L. I. Rudin, S. Osher, and E. Fatemi, "Nonlinear total variation based noise removal algorithms, " Physica D, vol. 60, pp. 259-268, 1992.
    • (1992) Physica D , vol.60 , pp. 259-268
    • Rudin, L.I.1    Osher, S.2    Fatemi, E.3
  • 63
    • 0033592606 scopus 로고    scopus 로고
    • Learning the parts of objects by non-negative matrix factorization
    • D. D. Lee and H. S. Seung, "Learning the parts of objects by non-negative matrix factorization, " Nature, vol. 401, pp. 788-791, 1999.
    • (1999) Nature , vol.401 , pp. 788-791
    • Lee, D.D.1    Seung, H.S.2
  • 65
    • 78149297677 scopus 로고    scopus 로고
    • Weighted sums of random kitchen sinks: Replacing minimization with randomization in learning
    • A. Rahimi and B. Recht, "Weighted sums of random kitchen sinks: Replacing minimization with randomization in learning, " in Proc. Advances Neural Inf. Process. Syst., 2006, pp. 1313-1320.
    • (2006) Proc. Advances Neural Inf. Process. Syst. , pp. 1313-1320
    • Rahimi, A.1    Recht, B.2
  • 66
    • 0019602085 scopus 로고
    • A generalized proximal point algorithm for certain non-convex minimization problems
    • M. Fukushima and H. Mine, "A generalized proximal point algorithm for certain non-convex minimization problems, " Int. J. Syst. Sci., vol. 12, no. 8, pp. 989-1000, 1981.
    • (1981) Int. J. Syst. Sci. , vol.12 , Issue.8 , pp. 989-1000
    • Fukushima, M.1    Mine, H.2
  • 68
    • 0035608644 scopus 로고    scopus 로고
    • Local extremes, runs, strings and multiresolution
    • P. L. Davies and A. Kovac, "Local extremes, runs, strings and multiresolution, " Ann. Statist., vol. 29, no. 1, pp. 1-65, 2001.
    • (2001) Ann. Statist. , vol.29 , Issue.1 , pp. 1-65
    • Davies, P.L.1    Kovac, A.2
  • 69
    • 0010442827 scopus 로고    scopus 로고
    • On the algorithmic implementation of multiclass kernel-based vector machines
    • K. Crammer and Y. Singer, "On the algorithmic implementation of multiclass kernel-based vector machines, " J. Mach. Learn. Res., vol. 2, pp. 265-292, 2001.
    • (2001) J. Mach. Learn. Res. , vol.2 , pp. 265-292
    • Crammer, K.1    Singer, Y.2
  • 70
    • 79959766559 scopus 로고    scopus 로고
    • Consumer video understanding: A benchmark database and an evaluation of human and machine performance
    • Art. no. 29
    • Y. Jiang, G. Ye, S. Chang, D. P. W. Ellis, and A. C. Loui, "Consumer video understanding: A benchmark database and an evaluation of human and machine performance, " in Proc. 1st ACM Int. Conf. Multimedia Retrieval, 2011, Art. no. 29.
    • (2011) Proc. 1st ACM Int. Conf. Multimedia Retrieval
    • Jiang, Y.1    Ye, G.2    Chang, S.3    Ellis, D.P.W.4    Loui, A.C.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.