SCOPUS 정보 검색 플랫폼

Advances in Neural Information Processing Systems

Volumn 0, Issue , 2016, Pages 3476-3484

Spatiotemporal residual networks for video action recognition

(3) Feichtenhofer, Christoph a Pinz, Axel a Wildes, Richard P b

a GRAZ UNIVERSITY OF TECHNOLOGY (Austria)

b YORK UNIVERSITY (Canada)

Author keywords

[No Author keywords available]

Indexed keywords

CONVOLUTION;

CONVOLUTIONAL NETWORKS; HIERARCHICAL LEARNING; HUMAN-ACTION RECOGNITION; SPATIO TEMPORAL FEATURES; SPATIO-TEMPORAL DOMAINS; SPATIOTEMPORAL INTERACTIONS; SPATIOTEMPORAL NETWORKS; SPATIOTEMPORAL RECEPTIVE FIELD;

NETWORK ARCHITECTURE;

EID: 85019227137 PISSN: 10495258 EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (746)

References (32)

1
- 85083954507
- Delving deeper into convolutional networks for learning video representations
- Nicolas Ballas, Li Yao, Chris Pal, and Aaron Courville. Delving deeper into convolutional networks for learning video representations. In Proc. ICLR, 2016.
- (2016) Proc. ICLR
- Ballas, N.¹ Yao, L.² Pal, C.³ Courville, A.⁴

2
- 84986334053
- Dynamic image networks for action recognition
- H. Bilen, B. Fernando, E. Gavves, A. Vedaldi, and S. Gould. Dynamic image networks for action recognition. In Proc. CVPR, 2016.
- (2016) Proc. CVPR
- Bilen, H.¹ Fernando, B.² Gavves, E.³ Vedaldi, A.⁴ Gould, S.⁵

3
- 0026682661
- Segregation of global and local motion processing in primate middle temporal visual area
- Richard T Born and Roger BH Tootell. Segregation of global and local motion processing in primate middle temporal visual area. Nature, 357(6378): 497-499, 1992.
- (1992) Nature , vol.357 , Issue.6378 , pp. 497-499
- Born, R.T.¹ Tootell, R.B.H.²

4
- 84959236502
- Long-term recurrent convolutional networks for visual recognition and description
- Jeff Donahue, Lisa Anne Hendricks, Sergio Guadarrama, Marcus Rohrbach, Subhashini Venugopalan, Kate Saenko, and Trevor Darrell. Long-term recurrent convolutional networks for visual recognition and description. In Proc. CVPR, 2015.
- (2015) Proc. CVPR
- Donahue, J.¹ Hendricks, L.A.² Guadarrama, S.³ Rohrbach, M.⁴ Venugopalan, S.⁵ Saenko, K.⁶ Darrell, T.⁷

5
- 85066903360
- Convolutional two-stream network fusion for video action recognition
- 20116
- Christoph Feichtenhofer, Axel Pinz, and Andrew Zisserman. Convolutional two-stream network fusion for video action recognition. In Proc. CVPR, 20116.
- Proc. CVPR
- Feichtenhofer, C.¹ Pinz, A.² Zisserman, A.³

6
- 0026565214
- Separate visual pathways for perception and action
- M. A. Goodale and A. D. Milner. Separate visual pathways for perception and action. Trends in Neurosciences, 15(1): 20-25, 1992.
- (1992) Trends in Neurosciences , vol.15 , Issue.1 , pp. 20-25
- Goodale, M.A.¹ Milner, A.D.²

7
- 84973901576
- Unsupervised feature learning from temporal data
- Ross Goroshin, Joan Bruna, Jonathan Tompson, David Eigen, and Yann LeCun. Unsupervised feature learning from temporal data. In Proc. ICCV, 2015.
- (2015) Proc. ICCV
- Goroshin, R.¹ Bruna, J.² Tompson, J.³ Eigen, D.⁴ LeCun, Y.⁵

8
- 84958589374
- arXiv preprint arXiv: 1512.03385
- Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep residual learning for image recognition. arXiv preprint arXiv: 1512.03385, 2015.
- (2015) Deep Residual Learning for Image Recognition
- He, K.¹ Zhang, X.² Ren, S.³ Sun, J.⁴

9
- 84990068011
- arXiv preprint arXiv: 1603.05027
- Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Identity mappings in deep residual networks. arXiv preprint arXiv: 1603.05027, 2016.
- (2016) Identity Mappings in Deep Residual Networks
- He, K.¹ Zhang, X.² Ren, S.³ Sun, J.⁴

10
- 85083950527
- Training cnns with low-rank filters for efficient image classification
- Yani Ioannou, Duncan Robertson, Jamie Shotton, Roberto Cipolla, and Antonio Criminisi. Training cnns with low-rank filters for efficient image classification. In Proc. ICLR, 2016.
- (2016) Proc. ICLR
- Ioannou, Y.¹ Robertson, D.² Shotton, J.³ Cipolla, R.⁴ Criminisi, A.⁵

11
- 84969584486
- Batch normalization: Accelerating deep network training by reducing internal covariate shift
- Sergey Ioffe and Christian Szegedy. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In Proc. ICML, 2015.
- (2015) Proc. ICML
- Ioffe, S.¹ Szegedy, C.²

12
- 84870183903
- 3D convolutional neural networks for human action recognition
- S. Ji, W. Xu, M. Yang, and K. Yu. 3D convolutional neural networks for human action recognition. IEEE PAMI, 35(1): 221-231, 2013.
- (2013) IEEE PAMI , vol.35 , Issue.1 , pp. 221-231
- Ji, S.¹ Xu, W.² Yang, M.³ Yu, K.⁴

13
- 84911364368
- Large-scale video classification with convolutional neural networks
- A. Karpathy, G. Toderici, S. Shetty, T. Leung, R. Sukthankar, and L. Fei-Fei. Large-scale video classification with convolutional neural networks. In Proc. CVPR, 2014.
- (2014) Proc. CVPR
- Karpathy, A.¹ Toderici, G.² Shetty, S.³ Leung, T.⁴ Sukthankar, R.⁵ Fei-Fei, L.⁶

14
- 84876231242
- ImageNet classification with deep convolutional neural networks
- A. Krizhevsky, I. Sutskever, and G. E. Hinton. ImageNet classification with deep convolutional neural networks. In NIPS, 2012.
- (2012) NIPS
- Krizhevsky, A.¹ Sutskever, I.² Hinton, G.E.³

15
- 84856682691
- HMDB: A large video database for human motion recognition
- Hildegard Kuehne, Hueihan Jhuang, Estíbaliz Garrote, Tomaso Poggio, and Thomas Serre. HMDB: a large video database for human motion recognition. In Proc. ICCV, 2011.
- (2011) Proc. ICCV
- Kuehne, H.¹ Jhuang, H.² Garrote, E.³ Poggio, T.⁴ Serre, T.⁵

16
- 80052874098
- Learning hierarchical invariant spatiotemporal features for action recognition with independent subspace analysis
- Quoc V Le, Will Y Zou, Serena Y Yeung, and Andrew Y Ng. Learning hierarchical invariant spatiotemporal features for action recognition with independent subspace analysis. In Proc. CVPR, 2011.
- (2011) Proc. CVPR
- Le, Q.V.¹ Zou, W.Y.² Yeung, S.Y.³ Ng, A.Y.⁴

17
- 85019254484
- Behrooz Mahasseni and Sinisa Todorovic. Regularizing long short term memory with 3D human-skeleton sequences for action recognition.
- Regularizing Long Short Term Memory with 3D Human-skeleton Sequences for Action Recognition
- Mahasseni, B.¹ Todorovic, S.²

18
- 84959228762
- Beyond short snippets: Deep networks for video classification
- Joe Yue-Hei Ng, Matthew Hausknecht, Sudheendra Vijayanarasimhan, Oriol Vinyals, Rajat Monga, and George Toderici. Beyond short snippets: Deep networks for video classification. In Proc. CVPR, 2015.
- (2015) Proc. CVPR
- Ng, J.Y.-H.¹ Hausknecht, M.² Vijayanarasimhan, S.³ Vinyals, O.⁴ Monga, R.⁵ Toderici, G.⁶

19
- 84978908606
- Action recognition using visual attention
- Shikhar Sharma, Ryan Kiros, and Ruslan Salakhutdinov. Action recognition using visual attention. In NIPS workshop on Time Series. 2015.
- (2015) NIPS Workshop on Time Series
- Sharma, S.¹ Kiros, R.² Salakhutdinov, R.³

20
- 84937862424
- Two-stream convolutional networks for action recognition in videos
- K. Simonyan and A. Zisserman. Two-stream convolutional networks for action recognition in videos. In NIPS, 2014.
- (2014) NIPS
- Simonyan, K.¹ Zisserman, A.²

21
- 84925410541
- Very deep convolutional networks for large-scale image recognition
- Karen Simonyan and Andrew Zisserman. Very deep convolutional networks for large-scale image recognition. In Proc. ICLR, 2014.
- (2014) Proc. ICLR
- Simonyan, K.¹ Zisserman, A.²

22
- 84884955228
- UCF101: A dataset of 101 human actions calsses from videos in the wild
- Khurram Soomro, Amir Roshan Zamir, and Mubarak Shah. UCF101: A dataset of 101 human actions calsses from videos in the wild. Technical Report CRCV-TR-12-01, 2012.
- (2012) Technical Report CRCV-TR-12-01
- Soomro, K.¹ Zamir, A.R.² Shah, M.³

23
- 84990032289
- arXiv preprint arXiv: 1512.00567
- Christian Szegedy, Vincent Vanhoucke, Sergey Ioffe, Jonathon Shlens, and Zbigniew Wojna. Rethinking the inception architecture for computer vision. arXiv preprint arXiv: 1512.00567, 2015.
- (2015) Rethinking the Inception Architecture for Computer Vision
- Szegedy, C.¹ Vanhoucke, V.² Ioffe, S.³ Shlens, J.⁴ Wojna, Z.⁵

24
- 84983383396
- arXiv preprint arXiv: 1602.07261
- Christian Szegedy, Sergey Ioffe, and Vincent Vanhoucke. Inception-v4, inception-resnet and the impact of residual connections on learning. arXiv preprint arXiv: 1602.07261, 2016.
- (2016) Inception-v4, Inception-resnet and the Impact of Residual Connections on Learning
- Szegedy, C.¹ Ioffe, S.² Vanhoucke, V.³

25
- 84867652321
- Convolutional learning of spatio-temporal features
- G. W. Taylor, R. Fergus, Y. LeCun, and C. Bregler. Convolutional learning of spatio-temporal features. In Proc. ECCV, 2010.
- (2010) Proc. ECCV
- Taylor, G.W.¹ Fergus, R.² LeCun, Y.³ Bregler, C.⁴

26
- 84973865953
- Learning spatiotemporal features with 3D convolutional networks
- D. Tran, L. Bourdev, R. Fergus, L. Torresani, and M. Paluri. Learning spatiotemporal features with 3D convolutional networks. In Proc. ICCV, 2015.
- (2015) Proc. ICCV
- Tran, D.¹ Bourdev, L.² Fergus, R.³ Torresani, L.⁴ Paluri, M.⁵

27
- 0028088013
- Neural mechanisms of form and motion processing in the primate visual system
- David C Van Essen and Jack L Gallant. Neural mechanisms of form and motion processing in the primate visual system. Neuron, 13(1): 1-10, 1994.
- (1994) Neuron , vol.13 , Issue.1 , pp. 1-10
- Van Essen, D.C.¹ Gallant, J.L.²

28
- 84962815548
- MatConvNet - Convolutional neural networks for MATLAB
- A. Vedaldi and K. Lenc. MatConvNet - convolutional neural networks for MATLAB. In Proceeding of the ACM Int. Conf. on Multimedia, 2015.
- (2015) Proceeding of the ACM Int. Conf. on Multimedia
- Vedaldi, A.¹ Lenc, K.²

29
- 84898805910
- Action recognition with improved trajectories
- Heng Wang and Cordelia Schmid. Action recognition with improved trajectories. In Proc. ICCV, 2013.
- (2013) Proc. ICCV
- Wang, H.¹ Schmid, C.²

30
- 84955282488
- Action recognition with trajectory-pooled deep-convolutional descriptors
- Limin Wang, Yu Qiao, and Xiaoou Tang. Action recognition with trajectory-pooled deep-convolutional descriptors. In Proc. CVPR, 2015.
- (2015) Proc. CVPR
- Wang, L.¹ Qiao, Y.² Tang, X.³

31
- 84986268683
- Actions ~ transformations
- Xiaolong Wang, Ali Farhadi, and Abhinav Gupta. Actions ~ transformations. In Proc. CVPR, 2016.
- (2016) Proc. CVPR
- Wang, X.¹ Farhadi, A.² Gupta, A.³

32
- 38349007037
- A duality based approach for realtime TV-L1 optical flow
- C. Zach, T. Pock, and H. Bischof. A duality based approach for realtime TV-L1 optical flow. In Proc. DAGM, pages 214-223, 2007.
- (2007) Proc. DAGM , pp. 214-223
- Zach, C.¹ Pock, T.² Bischof, H.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.