메뉴 건너뛰기




Volumn 155, Issue , 2017, Pages 1-23

The THUMOS challenge on action recognition for videos “in the wild”

Author keywords

Action detection; Action localization; Action recognition; Benchmark; Dataset; THUMOS; UCF101; Untrimmed videos

Indexed keywords

BENCHMARKING; IMAGE UNDERSTANDING;

EID: 85006106345     PISSN: 10773142     EISSN: 1090235X     Source Type: Journal    
DOI: 10.1016/j.cviu.2016.10.018     Document Type: Article
Times cited : (588)

References (64)
  • 6
    • 85072028231 scopus 로고    scopus 로고
    • Return of the devil in the details: Delving deep into convolutional nets
    • Chatfield, K., Simonyan, K., Vedaldi, A., Zisserman, A., Return of the devil in the details: Delving deep into convolutional nets. BMVC, 2014.
    • (2014) BMVC
    • Chatfield, K.1    Simonyan, K.2    Vedaldi, A.3    Zisserman, A.4
  • 10
    • 85043218536 scopus 로고    scopus 로고
    • Joint segmentation and classification of human actions in video
    • Hoai, M., Lan, Z.-Z., De la Torre, F., Joint segmentation and classification of human actions in video. IEEE CVPR, 2011.
    • (2011) IEEE CVPR
    • Hoai, M.1    Lan, Z.-Z.2    De la Torre, F.3
  • 12
    • 84977913021 scopus 로고    scopus 로고
    • Aggregating local descriptors into a compact image representation
    • Jégou, H., Douze, M., Schmid, C., Pérez, P., Aggregating local descriptors into a compact image representation. IEEE CVPR, 2010.
    • (2010) IEEE CVPR
    • Jégou, H.1    Douze, M.2    Schmid, C.3    Pérez, P.4
  • 13
    • 85009464127 scopus 로고    scopus 로고
    • THUMOS’13: ICCV workshop on action recognition with a large number of classes
    • Jiang, Y.-G., Liu, J., Zamir, A.R., Laptev, I., Piccardi, M., Shah, M., Sukthankar, R., THUMOS’13: ICCV workshop on action recognition with a large number of classes. 2013 http://crcv.ucf.edu/ICCV13-Action-Workshop/.
    • (2013)
    • Jiang, Y.-G.1    Liu, J.2    Zamir, A.R.3    Laptev, I.4    Piccardi, M.5    Shah, M.6    Sukthankar, R.7
  • 14
    • 85009460460 scopus 로고    scopus 로고
    • THUMOS’14: ECCV workshop on action recognition with a large number of classes
    • Jiang, Y.-G., Liu, J., Zamir, A.R., Toderici, G., Laptev, I., Shah, M., Sukthankar, R., THUMOS’14: ECCV workshop on action recognition with a large number of classes. 2014 http://crcv.ucf.edu/THUMOS14/.
    • (2014)
    • Jiang, Y.-G.1    Liu, J.2    Zamir, A.R.3    Toderici, G.4    Laptev, I.5    Shah, M.6    Sukthankar, R.7
  • 16
    • 38049168073 scopus 로고    scopus 로고
    • Efficient visual event detection using volumetric features
    • Ke, Y., Sukthankar, R., Hebert, M., Efficient visual event detection using volumetric features. IEEE ICCV, 2005.
    • (2005) IEEE ICCV
    • Ke, Y.1    Sukthankar, R.2    Hebert, M.3
  • 20
    • 85006752800 scopus 로고    scopus 로고
    • Beyond gaussian pyramid: multi-skip feature stacking for action recognition
    • Lan, Z., Lin, M., Li, X., Hauptmann, A.G., Raj, B., Beyond gaussian pyramid: multi-skip feature stacking for action recognition. IEEE CVPR, 2015.
    • (2015) IEEE CVPR
    • Lan, Z.1    Lin, M.2    Li, X.3    Hauptmann, A.G.4    Raj, B.5
  • 22
    • 85009495464 scopus 로고    scopus 로고
    • Retrieving actions in movies
    • Laptev, I., Pérez, P., Retrieving actions in movies. IEEE ICCV, 2007.
    • (2007) IEEE ICCV
    • Laptev, I.1    Pérez, P.2
  • 23
    • 84856329825 scopus 로고    scopus 로고
    • Recognizing realistic actions from videos “in the wild”
    • Liu, J., Luo, J., Shah, M., Recognizing realistic actions from videos “in the wild”. IEEE CVPR, 2009.
    • (2009) IEEE CVPR
    • Liu, J.1    Luo, J.2    Shah, M.3
  • 26
    • 80052874353 scopus 로고    scopus 로고
    • Modeling temporal structure of decomposable motion segments for activity classification
    • Niebles, J.C., Chen, C.-W., Li, F.-F., Modeling temporal structure of decomposable motion segments for activity classification. ECCV, 2010.
    • (2010) ECCV
    • Niebles, J.C.1    Chen, C.-W.2    Li, F.-F.3
  • 29
    • 84899710214 scopus 로고    scopus 로고
    • Action and event recognition with Fisher vectors on a compact feature set
    • Oneata, D., Verbeek, J., Schmid, C., Action and event recognition with Fisher vectors on a compact feature set. IEEE ICCV, 2013.
    • (2013) IEEE ICCV
    • Oneata, D.1    Verbeek, J.2    Schmid, C.3
  • 32
    • 79959771606 scopus 로고    scopus 로고
    • Improving the fisher kernel for large-scale image classification
    • Perronnin, F., Sánchez, J., Mensink, T., Improving the fisher kernel for large-scale image classification. ECCV, 2010.
    • (2010) ECCV
    • Perronnin, F.1    Sánchez, J.2    Mensink, T.3
  • 33
    • 84887832683 scopus 로고    scopus 로고
    • Detecting activities of daily living in first-person camera views
    • Pirsiavash, H., Ramanan, D., Detecting activities of daily living in first-person camera views. IEEE CVPR, 2012.
    • (2012) IEEE CVPR
    • Pirsiavash, H.1    Ramanan, D.2
  • 34
    • 85009492207 scopus 로고    scopus 로고
    • Parsing videos of actions with segmental grammars
    • Pirsiavash, H., Ramanan, D., Parsing videos of actions with segmental grammars. IEEE CVPR, 2014.
    • (2014) IEEE CVPR
    • Pirsiavash, H.1    Ramanan, D.2
  • 36
    • 85009439718 scopus 로고    scopus 로고
    • Poselet key-framing: a model for human activity recognition
    • Raptis, M., Sigal, L., Poselet key-framing: a model for human activity recognition. IEEE CVPR, 2013.
    • (2013) IEEE CVPR
    • Raptis, M.1    Sigal, L.2
  • 37
    • 84879553900 scopus 로고    scopus 로고
    • Recognizing 50 human action categories of web videos
    • Reddy, K.K., Shah, M., Recognizing 50 human action categories of web videos. Mach. Vis Appl. 24:5 (2013), 971–981.
    • (2013) Mach. Vis Appl. , vol.24 , Issue.5 , pp. 971-981
    • Reddy, K.K.1    Shah, M.2
  • 38
    • 85009460474 scopus 로고    scopus 로고
    • Temporal action detection using a statistical language model
    • Richard, A., Gall, J., Temporal action detection using a statistical language model. 2016.
    • (2016)
    • Richard, A.1    Gall, J.2
  • 39
    • 51949084792 scopus 로고    scopus 로고
    • Action MACH: a spatio-temporal maximum average correlation height filter for action recognition
    • Rodriguez, M., Ahmed, J., Shah, M., Action MACH: a spatio-temporal maximum average correlation height filter for action recognition. IEEE CVPR, 2008.
    • (2008) IEEE CVPR
    • Rodriguez, M.1    Ahmed, J.2    Shah, M.3
  • 40
    • 84970904400 scopus 로고    scopus 로고
    • First-person activity recognition: what are they doing to me?
    • Ryoo, M.S., Matthies, L., First-person activity recognition: what are they doing to me?. IEEE CVPR, 2013.
    • (2013) IEEE CVPR
    • Ryoo, M.S.1    Matthies, L.2
  • 41
    • 84883487458 scopus 로고    scopus 로고
    • Image classification with the fisher vector: theory and practice
    • Sánchez, J., Perronnin, F., Mensink, T., Verbeek, J., Image classification with the fisher vector: theory and practice. IJCV 105:3 (2013), 222–245.
    • (2013) IJCV , vol.105 , Issue.3 , pp. 222-245
    • Sánchez, J.1    Perronnin, F.2    Mensink, T.3    Verbeek, J.4
  • 42
    • 80052901415 scopus 로고    scopus 로고
    • Modeling the temporal extent of actions
    • Satkin, S., Hebert, M., Modeling the temporal extent of actions. ECCV, 2010.
    • (2010) ECCV
    • Satkin, S.1    Hebert, M.2
  • 43
    • 10044233701 scopus 로고    scopus 로고
    • Recognizing human actions: a local SVM approach
    • Schuldt, C., Laptev, I., Caputo, B., Recognizing human actions: a local SVM approach. ICPR, 2004.
    • (2004) ICPR
    • Schuldt, C.1    Laptev, I.2    Caputo, B.3
  • 44
    • 85047469861 scopus 로고    scopus 로고
    • Action temporal localization in untrimmed videos via multi-stage cnns
    • Shou, Z., Wang, D., Chang, S.-F., Action temporal localization in untrimmed videos via multi-stage cnns. IEEE CVPR, 2016.
    • (2016) IEEE CVPR
    • Shou, Z.1    Wang, D.2    Chang, S.-F.3
  • 45
    • 84937862424 scopus 로고    scopus 로고
    • Two-stream convolutional networks for action recognition in videos
    • Simonyan, K., Zisserman, A., Two-stream convolutional networks for action recognition in videos. NIPS, 2014.
    • (2014) NIPS
    • Simonyan, K.1    Zisserman, A.2
  • 46
  • 47
    • 85009454535 scopus 로고    scopus 로고
    • Action localization in videos through context walk
    • Soomro, K., Idrees, H., Shah, M., Action localization in videos through context walk. IEEE ICCV, 2015.
    • (2015) IEEE ICCV
    • Soomro, K.1    Idrees, H.2    Shah, M.3
  • 48
    • 84986246311 scopus 로고    scopus 로고
    • Predicting the where and what of actors and actions through online action localization
    • Soomro, K., Idrees, H., Shah, M., Predicting the where and what of actors and actions through online action localization. CVPR, 2016.
    • (2016) CVPR
    • Soomro, K.1    Idrees, H.2    Shah, M.3
  • 49
    • 84884955228 scopus 로고    scopus 로고
    • UCF101: A Dataset of 101 Human Action Classes from Videos in the Wild
    • UCF
    • Soomro, K., Zamir, A.R., Shah, M., UCF101: A Dataset of 101 Human Action Classes from Videos in the Wild. Technical Report CRCV-TR-12-01, 2012, UCF.
    • (2012) Technical Report CRCV-TR-12-01
    • Soomro, K.1    Zamir, A.R.2    Shah, M.3
  • 52
    • 84887372329 scopus 로고    scopus 로고
    • Learning latent temporal structure for complex event detection
    • Tang, K., Fei-Fei, L., Koller, D., Learning latent temporal structure for complex event detection. IEEE CVPR, 2012.
    • (2012) IEEE CVPR
    • Tang, K.1    Fei-Fei, L.2    Koller, D.3
  • 53
    • 85009499368 scopus 로고    scopus 로고
    • Spatiotemporal deformable part models for action detection
    • Tian, Y., Sukthankar, R., Shah, M., Spatiotemporal deformable part models for action detection. IEEE CVPR, 2013.
    • (2013) IEEE CVPR
    • Tian, Y.1    Sukthankar, R.2    Shah, M.3
  • 54
    • 84973865953 scopus 로고    scopus 로고
    • Learning spatiotemporal features with 3d convolutional networks
    • Tran, D., Bourdev, L., Fergus, R., Torresani, L., Paluri, M., Learning spatiotemporal features with 3d convolutional networks. ICCV, 2015.
    • (2015) ICCV
    • Tran, D.1    Bourdev, L.2    Fergus, R.3    Torresani, L.4    Paluri, M.5
  • 55
    • 84931075884 scopus 로고    scopus 로고
    • Action recognition with improved trajectories
    • Wang, H., Schmid, C., Action recognition with improved trajectories. IEEE ICCV, 2013.
    • (2013) IEEE ICCV
    • Wang, H.1    Schmid, C.2
  • 58
    • 84937225744 scopus 로고    scopus 로고
    • A discriminative cnn video representation for event detection
    • Xu, Z., Yang, Y., Hauptmann, A.G., A discriminative cnn video representation for event detection. IEEE CVPR, 2015.
    • (2015) IEEE CVPR
    • Xu, Z.1    Yang, Y.2    Hauptmann, A.G.3
  • 62
    • 80155196334 scopus 로고    scopus 로고
    • Discriminative subvolume search for efficient action detection
    • Yuan, J., Liu, Z., Wu, Y., Discriminative subvolume search for efficient action detection. IEEE CVPR, 2009.
    • (2009) IEEE CVPR
    • Yuan, J.1    Liu, Z.2    Wu, Y.3
  • 64
    • 84921476116 scopus 로고    scopus 로고
    • Visualizing and understanding convolutional networks
    • Zeiler, M.D., Fergus, R., Visualizing and understanding convolutional networks. ECCV, 2014.
    • (2014) ECCV
    • Zeiler, M.D.1    Fergus, R.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.