SCOPUS Information Retrieval Platform

Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017

Volume 2017-January, 2017, Pages 1417-1426

CDC: Convolutional-de-convolutional networks for precise temporal action localization in untrimmed videos

Authors (5): Shou, Zheng (a); Chan, Jonathan (a); Zareian, Alireza (a); Miyazawa, Kazuyuki (b); Chang, Shih Fu (a)

a Och Spine at New York Presbyterian Hospitals (United States)

b MITSUBISHI ELECTRIC CORPORATION (Japan)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER VISION; PATTERN RECOGNITION; SEMANTICS; SIGNAL SAMPLING;

ACTION SEMANTICS; COMPLEX BACKGROUND; CONVOLUTIONAL NETWORKS; FINE GRANULARITY; FRAMES PER SECONDS; HIGH-EFFICIENCY; STATE-OF-THE-ART SYSTEM; TEMPORAL DYNAMICS;

CONVOLUTION;

EID: 85044270610 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/CVPR.2017.155 Document Type: Conference Paper

Times cited : (621)

References (79)

1
- 85044309526
- Mexaction2. http://mexculture.cnam.fr/xwiki/bin/view/Datasets/Mex+action+dataset, 2015.
- (2015)

2
- 85044280305
- Activitynet challenge 2016. http://activity-net.org/challenges/2016/, 2016.
- (2016) Activitynet Challenge 2016

3
- 79955649703
- Human activity analysis: A review
- J. K. Aggarwal and M. S. Ryoo. Human activity analysis: A review. In ACM Computing Surveys, 2011.
- (2011) ACM Computing Surveys
- Aggarwal, J.K.¹ Ryoo, M.S.²

4
- 85038956512
- Segnet: A deep convolutional encoder-decoder architecture for image segmentation
- V. Badrinarayanan, A. Kendall, and R. Cipolla. Segnet: A deep convolutional encoder-decoder architecture for image segmentation. TPAMI, 2016.
- (2016) TPAMI
- Badrinarayanan, V.¹ Kendall, A.² Cipolla, R.³

5
- 85083954148
- Semantic image segmentation with deep convolutional nets and fully connected crfs
- L.-C. Chen, G. Papandreou, I. Kokkinos, K. Murphy, and A. L. Yuille. Semantic image segmentation with deep convolutional nets and fully connected crfs. In ICLR, 2015.
- (2015) ICLR
- Chen, L.-C.¹ Papandreou, G.² Kokkinos, I.³ Murphy, K.⁴ Yuille, A.L.⁵

6
- 84990051868
- L.-C. Chen, G. Papandreou, I. Kokkinos, K. Murphy, and A. L. Yuille. Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. 2016.
- (2016) Deeplab: Semantic Image Segmentation With Deep Convolutional Nets, Atrous Convolution, and Fully Connected Crfs
- Chen, L.-C.¹ Papandreou, G.² Kokkinos, I.³ Murphy, K.⁴ Yuille, A.L.⁵

7
- 84959236502
- Long-term recurrent convolutional networks for visual recognition and description
- J. Donahue, L. A. Hendricks, S. Guadarrama, M. Rohrbach, S. Venugopalan, K. Saenko, and T. Darrell. Long-term recurrent convolutional networks for visual recognition and description. In CVPR, 2015.
- (2015) CVPR
- Donahue, J.¹ Hendricks, L.A.² Guadarrama, S.³ Rohrbach, M.⁴ Venugopalan, S.⁵ Saenko, K.⁶ Darrell, T.⁷

8
- 80054908266
- Automatic annotation of human actions in video
- O. Duchenne, I. Laptev, J. Sivic, F. Bach, and J. Ponce. Automatic annotation of human actions in video. In ICCV, 2007.
- (2007) ICCV
- Duchenne, O.¹ Laptev, I.² Sivic, J.³ Bach, F.⁴ Ponce, J.⁵

9
- 85026309914
- Daps: Deep action proposals for action understanding
- V. Escorcia, F. C. Heilbron, J. C. Niebles, and B. Ghanem. Daps: Deep action proposals for action understanding. In ECCV, 2016.
- (2016) ECCV
- Escorcia, V.¹ Heilbron, F.C.² Niebles, J.C.³ Ghanem, B.⁴

10
- 84986266741
- Convolutional two-stream network fusion for video action recognition
- C. Feichtenhofer, A. Pinz, and A. Zisserman. Convolutional two-stream network fusion for video action recognition. In CVPR, 2016.
- (2016) CVPR
- Feichtenhofer, C.¹ Pinz, A.² Zisserman, A.³

11
- 80052915321
- Actom sequence models for efficient action detection
- A. Gaidon, Z. Harchaoui, and C. Schmid. Actom sequence models for efficient action detection. In CVPR, 2011.
- (2011) CVPR
- Gaidon, A.¹ Harchaoui, Z.² Schmid, C.³

12
- 84973872525
- Temporal localization of actions with actoms
- A. Gaidon, Z. Harchaoui, and C. Schmid. Temporal localization of actions with actoms. In TPAMI, 2013.
- (2013) TPAMI
- Gaidon, A.¹ Harchaoui, Z.² Schmid, C.³

13
- 84959230113
- Devnet: A deep event network for multimedia event detection and evidence recounting
- C. Gan, N. Wang, Y. Yang, D.-Y. Yeung, and A. G. Hauptmann. Devnet: A deep event network for multimedia event detection and evidence recounting. In CVPR, 2015.
- (2015) CVPR
- Gan, C.¹ Wang, N.² Yang, Y.³ Yeung, D.-Y.⁴ Hauptmann, A.G.⁵

14
- 84959196122
- Finding action tubes
- G. Gkioxari and J. Malik. Finding action tubes. In CVPR, 2015.
- (2015) CVPR
- Gkioxari, G.¹ Malik, J.²

15
- 84961136088
- A. Gorban, H. Idrees, Y.-G. Jiang, A. R. Zamir, I. Laptev, M. Shah, and R. Sukthankar. THUMOS challenge: Action recognition with a large number of classes. http://www.thumos.info/, 2015.
- (2015) THUMOS Challenge: Action Recognition With a Large Number of Classes
- Gorban, A.¹ Idrees, H.² Jiang, Y.-G.³ Zamir, A.R.⁴ Laptev, I.⁵ Shah, M.⁶ Sukthankar, R.⁷

16
- 84986274465
- Deep residual learning for image recognition
- K. He, X. Zhang, S. Ren, and J. Sun. Deep residual learning for image recognition. In CVPR, 2016.
- (2016) CVPR
- He, K.¹ Zhang, X.² Ren, S.³ Sun, J.⁴

17
- 84959216468
- Activitynet: A large-scale video benchmark for human activity understanding
- F. C. Heilbron, V. Escorcia, B. Ghanem, and J. C. Niebles. Activitynet: A large-scale video benchmark for human activity understanding. In CVPR, 2015.
- (2015) CVPR
- Heilbron, F.C.¹ Escorcia, V.² Ghanem, B.³ Niebles, J.C.⁴

18
- 84986275821
- Fast temporal activity proposals for efficient detection of human actions in untrimmed videos
- F. C. Heilbron, J. C. Niebles, and B. Ghanem. Fast temporal activity proposals for efficient detection of human actions in untrimmed videos. In CVPR, 2016.
- (2016) CVPR
- Heilbron, F.C.¹ Niebles, J.C.² Ghanem, B.³

19
- 84965099276
- Decoupled deep neural network for semi-supervised semantic segmentation
- S. Hong, H. Noh, and B. Han. Decoupled deep neural network for semi-supervised semantic segmentation. In NIPS, 2015.
- (2015) NIPS
- Hong, S.¹ Noh, H.² Han, B.³

20
- 84911453664
- Action localization with tubelets from motion
- M. Jain, J. van Gemert, H. Jégou, P. Bouthemy, and C. Snoek. Action localization with tubelets from motion. In CVPR, 2014.
- (2014) CVPR
- Jain, M.¹ Van Gemert, J.² Jégou, H.³ Bouthemy, P.⁴ Snoek, C.⁵

21
- 84973868024
- Objects2action: Classifying and localizing actions without any video example
- M. Jain, J. van Gemert, T. Mensink, and C. Snoek. Objects2action: Classifying and localizing actions without any video example. In ICCV, 2015.
- (2015) ICCV
- Jain, M.¹ Van Gemert, J.² Mensink, T.³ Snoek, C.⁴

22
- 84959235126
- What do 15,000 object categories tell us about classifying and localizing actions?
- M. Jain, J. van Gemert, and C. Snoek. What do 15,000 object categories tell us about classifying and localizing actions? In CVPR, 2015.
- (2015) CVPR
- Jain, M.¹ Van Gemert, J.² Snoek, C.³

23
- 77956004473
- Aggregating local descriptors into a compact image representation
- H. Jégou, M. Douze, C. Schmid, and P. Pérez. Aggregating local descriptors into a compact image representation. In CVPR, 2010.
- (2010) CVPR
- Jégou, H.¹ Douze, M.² Schmid, C.³ Pérez, P.⁴

24
- 85009867858
- Caffe: Convolutional architecture for fast feature embedding
- Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long, R. Girshick, S. Guadarrama, and T. Darrell. Caffe: Convolutional architecture for fast feature embedding. In ACM MM, 2014.
- (2014) ACM MM
- Jia, Y.¹ Shelhamer, E.² Donahue, J.³ Karayev, S.⁴ Long, J.⁵ Girshick, R.⁶ Guadarrama, S.⁷ Darrell, T.⁸

25
- 84905052261
- Y.-G. Jiang, J. Liu, A. R. Zamir, G. Toderici, I. Laptev, M. Shah, and R. Sukthankar. THUMOS challenge: Action recognition with a large number of classes. http://crcv.ucf.edu/THUMOS14/, 2014.
- (2014) THUMOS Challenge: Action Recognition With a Large Number of Classes
- Jiang, Y.-G.¹ Liu, J.² Zamir, A.R.³ Toderici, G.⁴ Laptev, I.⁵ Shah, M.⁶ Sukthankar, R.⁷

26
- 84986316707
- Fast saliency based pooling of fisher encoded dense trajectories
- S. Karaman, L. Seidenari, and A. D. Bimbo. Fast saliency based pooling of fisher encoded dense trajectories. In ECCV THUMOS Workshop, 2014.
- (2014) ECCV THUMOS Workshop
- Karaman, S.¹ Seidenari, L.² Bimbo, A.D.³

27
- 84911364368
- Large-scale video classification with convolutional neural networks
- A. Karpathy, G. Toderici, S. Shetty, T. Leung, R. Sukthankar, and L. Fei-Fei. Large-scale video classification with convolutional neural networks. In CVPR, 2014.
- (2014) CVPR
- Karpathy, A.¹ Toderici, G.² Shetty, S.³ Leung, T.⁴ Sukthankar, R.⁵ Fei-Fei, L.⁶

28
- 84871122577
- Human focused action localization in video
- A. Kläser, M. Marszałek, C. Schmid, and A. Zisserman. Human focused action localization in video. In Trends and Topics in Computer Vision, 2012.
- (2012) Trends and Topics in Computer Vision
- Kläser, A.¹ Marszałek, M.² Schmid, C.³ Zisserman, A.⁴

29
- 84876231242
- Imagenet classification with deep convolutional neural networks
- A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. In NIPS, 2012.
- (2012) NIPS
- Krizhevsky, A.¹ Sutskever, I.² Hinton, G.E.³

30
- 50649122769
- Retrieving actions in movies
- I. Laptev and P. Pérez. Retrieving actions in movies. In ICCV, 2007.
- (2007) ICCV
- Laptev, I.¹ Pérez, P.²

31
- 85044291629
- Segmental spatiotemporal cnns for fine-grained action segmentation
- C. Lea, A. Reiter, R. Vidal, and G. D. Hager. Segmental spatiotemporal cnns for fine-grained action segmentation. In ECCV, 2016.
- (2016) ECCV
- Lea, C.¹ Reiter, A.² Vidal, R.³ Hager, G.D.⁴

32
- 84986261676
- Efficient piecewise training of deep structured models for semantic segmentation
- G. Lin, C. Shen, A. van den Hengel, and I. Reid. Efficient piecewise training of deep structured models for semantic segmentation. In CVPR, 2016.
- (2016) CVPR
- Lin, G.¹ Shen, C.² Van Den Hengel, A.³ Reid, I.⁴

33
- 84986256919
- Multi-scale patch aggregation (mpa) for simultaneous detection and segmentation
- S. Liu, X. Qi, J. Shi, H. Zhang, and J. Jia. Multi-scale patch aggregation (mpa) for simultaneous detection and segmentation. In CVPR, 2016.
- (2016) CVPR
- Liu, S.¹ Qi, X.² Shi, J.³ Zhang, H.⁴ Jia, J.⁵

34
- 84959205572
- Fully convolutional networks for semantic segmentation
- J. Long, E. Shelhamer, and T. Darrell. Fully convolutional networks for semantic segmentation. In CVPR, 2015.
- (2015) CVPR
- Long, J.¹ Shelhamer, E.² Darrell, T.³

35
- 84866710901
- A database for fine grained activity detection of cooking activities
- M. A. M. Rohrbach, S. Amin and B. Schiele. A database for fine grained activity detection of cooking activities. In CVPR, 2012.
- (2012) CVPR
- Rohrbach, M.A.M.¹ Amin, S.² Schiele, B.³

36
- 84994583262
- Spot on: Action localization from pointly-supervised proposals
- P. Mettes, J. van Gemert, and C. Snoek. Spot on: Action localization from pointly-supervised proposals. In ECCV, 2016.
- (2016) ECCV
- Mettes, P.¹ Van Gemert, J.² Snoek, C.³

37
- 84973879016
- Learning deconvolution network for semantic segmentation
- H. Noh, S. Hong, and B. Han. Learning deconvolution network for semantic segmentation. In ICCV, 2015.
- (2015) ICCV
- Noh, H.¹ Hong, S.² Han, B.³

38
- 84898791167
- Action and event recognition with fisher vectors on a compact feature set
- D. Oneata, J. Verbeek, and C. Schmid. Action and event recognition with fisher vectors on a compact feature set. In ICCV, 2013.
- (2013) ICCV
- Oneata, D.¹ Verbeek, J.² Schmid, C.³

39
- 84973924620
- The lear submission at thumos 2014
- D. Oneata, J. Verbeek, and C. Schmid. The lear submission at thumos 2014. In ECCV THUMOS Workshop, 2014.
- (2014) ECCV THUMOS Workshop
- Oneata, D.¹ Verbeek, J.² Schmid, C.³

40
- 79959771606
- Improving the fisher kernel for large-scale image classification
- F. Perronnin, J. Sánchez, and T. Mensink. Improving the fisher kernel for large-scale image classification. In ECCV, 2010.
- (2010) ECCV
- Perronnin, F.¹ Sánchez, J.² Mensink, T.³

41
- 77949275097
- A survey on vision-based human action recognition
- R. Poppe. A survey on vision-based human action recognition. In Image and vision computing, 2010.
- (2010) Image and Vision Computing
- Poppe, R.¹

42
- 84973879045
- Unsupervised tube extraction using transductive learning and dense trajectories
- M. M. Puscas, E. Sangineto, D. Culibrk, and N. Sebe. Unsupervised tube extraction using transductive learning and dense trajectories. In ICCV, 2015.
- (2015) ICCV
- Puscas, M.M.¹ Sangineto, E.² Culibrk, D.³ Sebe, N.⁴

43
- 84986270053
- Temporal action detection using a statistical language model
- A. Richard and J. Gall. Temporal action detection using a statistical language model. In CVPR, 2016.
- (2016) CVPR
- Richard, A.¹ Gall, J.²

44
- 84947041871
- ImageNet large scale visual recognition challenge
- O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. Bernstein, A. C. Berg, and L. Fei-Fei. ImageNet Large Scale Visual Recognition Challenge. IJCV, 2015.
- (2015) IJCV
- Russakovsky, O.¹ Deng, J.² Su, H.³ Krause, J.⁴ Satheesh, S.⁵ Ma, S.⁶ Huang, Z.⁷ Karpathy, A.⁸ Khosla, A.⁹ Bernstein, M.¹⁰ Berg, A.C.¹¹ Fei-Fei, L.¹²

45
- 85011076500
- Fully convolutional networks for semantic segmentation
- E. Shelhamer, J. Long, and T. Darrell. Fully convolutional networks for semantic segmentation. TPAMI, 2016.
- (2016) TPAMI
- Shelhamer, E.¹ Long, J.² Darrell, T.³

46
- 85044270610
- Z. Shou, J. Chan, A. Zareian, K. Miyazawa, and S.-F. Chang. Cdc: Convolutional-de-convolutional networks for precise temporal action localization in untrimmed videos. arXiv preprint arXiv:1703.01515, 2017.
- (2017) CDC: Convolutional-de-convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos
- Shou, Z.¹ Chan, J.² Zareian, A.³ Miyazawa, K.⁴ Chang, S.-F.⁵

47
- 84986268774
- Temporal action localization in untrimmed videos via multi-stage cnns
- Z. Shou, D. Wang, and S.-F. Chang. Temporal action localization in untrimmed videos via multi-stage cnns. In CVPR, 2016.
- (2016) CVPR
- Shou, Z.¹ Wang, D.² Chang, S.-F.³

48
- 85167598864
- Much ado about time: Exhaustive annotation of temporal data
- G. A. Sigurdsson, O. Russakovsky, A. Farhadi, I. Laptev, and A. Gupta. Much ado about time: Exhaustive annotation of temporal data. In HCOMP, 2016.
- (2016) HCOMP
- Sigurdsson, G.A.¹ Russakovsky, O.² Farhadi, A.³ Laptev, I.⁴ Gupta, A.⁵

49
- 85041903747
- Hollywood in homes: Crowdsourcing data collection for activity understanding
- G. A. Sigurdsson, G. Varol, X. Wang, A. Farhadi, I. Laptev, and A. Gupta. Hollywood in homes: Crowdsourcing data collection for activity understanding. In ECCV, 2016.
- (2016) ECCV
- Sigurdsson, G.A.¹ Varol, G.² Wang, X.³ Farhadi, A.⁴ Laptev, I.⁵ Gupta, A.⁶

50
- 84937862424
- Two-stream convolutional networks for action recognition in videos
- K. Simonyan and A. Zisserman. Two-stream convolutional networks for action recognition in videos. In NIPS, 2014.
- (2014) NIPS
- Simonyan, K.¹ Zisserman, A.²

51
- 85083953063
- Very deep convolutional networks for large-scale image recognition
- K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. In International Conference on Learning Representations, 2015.
- (2015) International Conference on Learning Representations
- Simonyan, K.¹ Zisserman, A.²

52
- 84986328004
- A multi-stream bi-directional recurrent neural network for fine-grained action detection
- B. Singh, T. K. Marks, M. Jones, O. Tuzel, and M. Shao. A multi-stream bi-directional recurrent neural network for fine-grained action detection. In CVPR, 2016.
- (2016) CVPR
- Singh, B.¹ Marks, T.K.² Jones, M.³ Tuzel, O.⁴ Shao, M.⁵

53
- 85044257995
- Untrimmed classification for activity detection: Submission to activitynet challenge
- G. Singh and F. Cuzzolin. Untrimmed classification for activity detection: submission to activitynet challenge. In CVPR ActivityNet Workshop, 2016.
- (2016) CVPR ActivityNet Workshop
- Singh, G.¹ Cuzzolin, F.²

54
- 84973931629
- Action localization in videos through context walk
- K. Soomro, H. Idrees, and M. Shah. Action localization in videos through context walk. In ICCV, 2015.
- (2015) ICCV
- Soomro, K.¹ Idrees, H.² Shah, M.³

55
- 84986246311
- Predicting the where and what of actors and actions through online action localization
- K. Soomro, H. Idrees, and M. Shah. Predicting the where and what of actors and actions through online action localization. In CVPR, 2016.
- (2016) CVPR
- Soomro, K.¹ Idrees, H.² Shah, M.³

56
- 84986255283
- Fast action localization in large scale video archives
- A. Stoian, M. Ferecatu, J. Benois-Pineau, and M. Crucianu. Fast action localization in large scale video archives. In TCSVT, 2015.
- (2015) TCSVT
- Stoian, A.¹ Ferecatu, M.² Benois-Pineau, J.³ Crucianu, M.⁴

57
- 84956616423
- Scalable action localization with kernel-space hashing
- A. Stoian, M. Ferecatu, J. Benois-Pineau, and M. Crucianu. Scalable action localization with kernel-space hashing. In ICIP, 2015.
- (2015) ICIP
- Stoian, A.¹ Ferecatu, M.² Benois-Pineau, J.³ Crucianu, M.⁴

58
- 84986265065
- What if we do not have multiple videos of the same action? - Video action localization using web images
- W. Sultani and M. Shah. What if we do not have multiple videos of the same action? - video action localization using web images. In CVPR, 2016.
- (2016) CVPR
- Sultani, W.¹ Shah, M.²

59
- 84986290264
- Temporal localization of fine-grained actions in videos by domain transfer from web images
- C. Sun, S. Shetty, R. Sukthankar, and R. Nevatia. Temporal localization of fine-grained actions in videos by domain transfer from web images. In ACM MM, 2015.
- (2015) ACM MM
- Sun, C.¹ Shetty, S.² Sukthankar, R.³ Nevatia, R.⁴

60
- 84973865953
- Learning spatiotemporal features with 3d convolutional networks
- D. Tran, L. Bourdev, R. Fergus, L. Torresani, and M. Paluri. Learning spatiotemporal features with 3d convolutional networks. In ICCV, 2015.
- (2015) ICCV
- Tran, D.¹ Bourdev, L.² Fergus, R.³ Torresani, L.⁴ Paluri, M.⁵

61
- 85010192577
- Deep end2end voxel2voxel prediction
- D. Tran, L. Bourdev, R. Fergus, L. Torresani, and M. Paluri. Deep end2end voxel2voxel prediction. In CVPR Workshop on Deep Learning in Computer Vision, 2016.
- (2016) CVPR Workshop on Deep Learning in Computer Vision
- Tran, D.¹ Bourdev, L.² Fergus, R.³ Torresani, L.⁴ Paluri, M.⁵

62
- 84973913561
- Apt: Action localization proposals from dense trajectories
- J. van Gemert, M. Jain, E. Gati, and C. Snoek. Apt: Action localization proposals from dense trajectories. In BMVC, 2015.
- (2015) BMVC
- Van Gemert, J.¹ Jain, M.² Gati, E.³ Snoek, C.⁴

63
- 80052877143
- Action recognition by dense trajectories
- H. Wang, A. Kläser, C. Schmid, and C.-L. Liu. Action Recognition by Dense Trajectories. In CVPR, 2011.
- (2011) CVPR
- Wang, H.¹ Kläser, A.² Schmid, C.³ Liu, C.-L.⁴

64
- 84898805910
- Action recognition with improved trajectories
- H. Wang and C. Schmid. Action recognition with improved trajectories. In ICCV, 2013.
- (2013) ICCV
- Wang, H.¹ Schmid, C.²

65
- 84986274451
- Action recognition and detection by combining motion and appearance features
- L. Wang, Y. Qiao, and X. Tang. Action recognition and detection by combining motion and appearance features. In ECCV THUMOS Workshop, 2014.
- (2014) ECCV THUMOS Workshop
- Wang, L.¹ Qiao, Y.² Tang, X.³

66
- 85019099168
- Temporal segment networks: Towards good practices for deep action recognition
- L. Wang, Y. Xiong, Z. Wang, Y. Qiao, D. Lin, X. Tang, and L. V. Gool. Temporal segment networks: Towards good practices for deep action recognition. In ECCV, 2016.
- (2016) ECCV
- Wang, L.¹ Xiong, Y.² Wang, Z.³ Qiao, Y.⁴ Lin, D.⁵ Tang, X.⁶ Gool, L.V.⁷

67
- 85035244549
- Uts at activitynet 2016
- R. Wang and D. Tao. Uts at activitynet 2016. In CVPR ActivityNet Workshop, 2016.
- (2016) CVPR ActivityNet Workshop
- Wang, R.¹ Tao, D.²

68
- 78751648503
- A survey of vision-based methods for action representation, segmentation and recognition
- D. Weinland, R. Ronfard, and E. Boyer. A survey of vision-based methods for action representation, segmentation and recognition. In Computer Vision and Image Understanding, 2011.
- (2011) Computer Vision and Image Understanding
- Weinland, D.¹ Ronfard, R.² Boyer, E.³

69
- 84973931775
- Learning to track for spatio-temporal action localization
- P. Weinzaepfel, Z. Harchaoui, and C. Schmid. Learning to track for spatio-temporal action localization. In ICCV, 2015.
- (2015) ICCV
- Weinzaepfel, P.¹ Harchaoui, Z.² Schmid, C.³

70
- 84986313829
- Actor-action semantic segmentation with grouping process models
- C. Xu and J. J. Corso. Actor-action semantic segmentation with grouping process models. In CVPR, 2016.
- (2016) CVPR
- Xu, C.¹ Corso, J.J.²

71
- 84959226659
- A discriminative cnn video representation for event detection
- Z. Xu, Y. Yang, and A. G. Hauptmann. A discriminative cnn video representation for event detection. In CVPR, 2015.
- (2015) CVPR
- Xu, Z.¹ Yang, Y.² Hauptmann, A.G.³

72
- 84986240394
- S. Yeung, O. Russakovsky, N. Jin, M. Andriluka, G. Mori, and L. Fei-Fei. Every moment counts: Dense detailed labeling of actions in complex videos. arXiv preprint arXiv:1507.05738, 2015.
- (2015) Every Moment Counts: Dense Detailed Labeling of Actions in Complex Videos
- Yeung, S.¹ Russakovsky, O.² Jin, N.³ Andriluka, M.⁴ Mori, G.⁵ Fei-Fei, L.⁶

73
- 84986253505
- End-to-end learning of action detection from frame glimpses in videos
- S. Yeung, O. Russakovsky, G. Mori, and L. Fei-Fei. End-to-end learning of action detection from frame glimpses in videos. In CVPR, 2016.
- (2016) CVPR
- Yeung, S.¹ Russakovsky, O.² Mori, G.³ Fei-Fei, L.⁴

74
- 85083952059
- Multi-scale context aggregation by dilated convolutions
- F. Yu and V. Koltun. Multi-scale context aggregation by dilated convolutions. In ICLR, 2016.
- (2016) ICLR
- Yu, F.¹ Koltun, V.²

75
- 84959191147
- Fast action proposals for human action detection and search
- G. Yu and J. Yuan. Fast action proposals for human action detection and search. In CVPR, 2015.
- (2015) CVPR
- Yu, G.¹ Yuan, J.²

76
- 84986267340
- Temporal action localization with pyramid of score distribution features
- J. Yuan, B. Ni, X. Yang, and A. Kassim. Temporal action localization with pyramid of score distribution features. In CVPR, 2016.
- (2016) CVPR
- Yuan, J.¹ Ni, B.² Yang, X.³ Kassim, A.⁴

77
- 84921476116
- Visualizing and understanding convolutional networks
- M. Zeiler and R. Fergus. Visualizing and understanding convolutional networks. In ECCV, 2014.
- (2014) ECCV
- Zeiler, M.¹ Fergus, R.²

78
- 77956001004
- Deconvolutional networks
- M. Zeiler, D. Krishnan, G. W. Taylor, and R. Fergus. Deconvolutional networks. In CVPR, 2010.
- (2010) CVPR
- Zeiler, M.¹ Krishnan, D.² Taylor, G.W.³ Fergus, R.⁴

79
- 84973861983
- Conditional random fields as recurrent neural networks
- S. Zheng, S. Jayasumana, B. Romera-Paredes, V. Vineet, Z. Su, D. Du, C. Huang, and P. H. S. Torr. Conditional random fields as recurrent neural networks. In ICCV, 2015.
- (2015) ICCV
- Zheng, S.¹ Jayasumana, S.² Romera-Paredes, B.³ Vineet, V.⁴ Su, Z.⁵ Du, D.⁶ Huang, C.⁷ Torr, P.H.S.⁸

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.