-
1
-
-
85015681977
-
Distributed Video Sensor Networks
-
Springer London London Ch. Motion Analysis: Past, Present and Future
-
[1] Aggarwal, J.K., Distributed Video Sensor Networks. 2011, Springer London, London, 27–39 Ch. Motion Analysis: Past, Present and Future.
-
(2011)
, pp. 27-39
-
-
Aggarwal, J.K.1
-
2
-
-
84861642569
-
Motion history image: its variants and applications
-
[2] Ahad, M.A.R., Tan, J.K., Kim, H., Ishikawa, S., Motion history image: its variants and applications. Mach. Vis. Appl., 23, 2012.
-
(2012)
Mach. Vis. Appl.
, vol.23
-
-
Ahad, M.A.R.1
Tan, J.K.2
Kim, H.3
Ishikawa, S.4
-
3
-
-
84887354935
-
All About VLAD
-
June
-
[3] Arandjelovic, R., Zisserman, A., All About VLAD. Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on, June 2013, 1578–1585.
-
(2013)
Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on
, pp. 1578-1585
-
-
Arandjelovic, R.1
Zisserman, A.2
-
4
-
-
81855221241
-
Sequential deep learning for human action recognition
-
[4] Baccouche, M., Mamalet, F., Wolf, C., Garcia, C., Baskurt, A., Sequential deep learning for human action recognition. Proceedings of the Second International Conference on Human Behavior Understanding. HBU’11, 2011, 29–39.
-
(2011)
Proceedings of the Second International Conference on Human Behavior Understanding. HBU’11
, pp. 29-39
-
-
Baccouche, M.1
Mamalet, F.2
Wolf, C.3
Garcia, C.4
Baskurt, A.5
-
5
-
-
0028392483
-
Learning long-term dependencies with gradient descent is difficult
-
[5] Bengio, Y., Simard, P., Frasconi, P., Learning long-term dependencies with gradient descent is difficult. IEEE Trans. Neural Netw. 5:2 (1994), 157–166.
-
(1994)
IEEE Trans. Neural Netw.
, vol.5
, Issue.2
, pp. 157-166
-
-
Bengio, Y.1
Simard, P.2
Frasconi, P.3
-
6
-
-
33745891801
-
Actions as space–time shapes
-
[6] Blank, M., Gorelick, L., Shechtman, E., Irani, M., Basri, R., Actions as space–time shapes. Proc. Int. Conference on Computer Vision (ICCV), vol. 2, 2005, 1395–1402.
-
(2005)
Proc. Int. Conference on Computer Vision (ICCV)
, vol.2
, pp. 1395-1402
-
-
Blank, M.1
Gorelick, L.2
Shechtman, E.3
Irani, M.4
Basri, R.5
-
7
-
-
0035279879
-
The recognition of human movement using temporal templates
-
[7] Bobick, A.F., Davis, J.W., The recognition of human movement using temporal templates. IEEE Trans. Pattern Anal. Mach. Intell. 23:3 (2001), 257–267.
-
(2001)
IEEE Trans. Pattern Anal. Mach. Intell.
, vol.23
, Issue.3
, pp. 257-267
-
-
Bobick, A.F.1
Davis, J.W.2
-
8
-
-
84886686429
-
Joint action segmentation and classification by an extended hidden Markov model
-
[8] Borzeshi, E.Z., Concha, O.P., Xu, R.Y.D., Piccardi, M., Joint action segmentation and classification by an extended hidden Markov model. IEEE Signal Process Lett. 20:12 (2013), 1207–1210.
-
(2013)
IEEE Signal Process Lett.
, vol.20
, Issue.12
, pp. 1207-1210
-
-
Borzeshi, E.Z.1
Concha, O.P.2
Xu, R.Y.D.3
Piccardi, M.4
-
9
-
-
31744440684
-
Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information
-
[9] Candès, E.J., Romberg, J., Tao, T., Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information. IEEE Trans. Inf. Theory 52:2 (2006), 489–509.
-
(2006)
IEEE Trans. Inf. Theory
, vol.52
, Issue.2
, pp. 489-509
-
-
Candès, E.J.1
Romberg, J.2
Tao, T.3
-
10
-
-
84929223025
-
Free-form region description with second-order pooling
-
June
-
[10] Carreira, J., Caseiro, R., Batista, J., Sminchisescu, C., Free-form region description with second-order pooling. IEEE Trans. Pattern Anal. Mach. Intell. 37:6 (June 2015), 1177–1189.
-
(2015)
IEEE Trans. Pattern Anal. Mach. Intell.
, vol.37
, Issue.6
, pp. 1177-1189
-
-
Carreira, J.1
Caseiro, R.2
Batista, J.3
Sminchisescu, C.4
-
11
-
-
85015614393
-
Joint Recognition and Segmentation of Actions via Probabilistic Integration of Spatio-Temporal Fisher Vectors
-
CoRR abs/1602.01601
-
[11] Carvajal, J., McCool, C., Lovell, B.C., Sanderson, C., Joint Recognition and Segmentation of Actions via Probabilistic Integration of Spatio-Temporal Fisher Vectors. 2016 CoRR abs/1602.01601.
-
(2016)
-
-
Carvajal, J.1
McCool, C.2
Lovell, B.C.3
Sanderson, C.4
-
12
-
-
84985996075
-
Multi-action recognition via stochastic modelling of optical flow and gradients
-
[12] Carvajal, J., Sanderson, C., McCool, C., Lovell, B.C., Multi-action recognition via stochastic modelling of optical flow and gradients. Proceedings of the MLSDA 2014 2nd Workshop on Machine Learning for Sensory Data Analysis, MLSDA’14, 2014, 19:19–19:24.
-
(2014)
Proceedings of the MLSDA 2014 2nd Workshop on Machine Learning for Sensory Data Analysis, MLSDA’14
, pp. 1919-1924
-
-
Carvajal, J.1
Sanderson, C.2
McCool, C.3
Lovell, B.C.4
-
13
-
-
84861190873
-
A review on vision techniques applied to human behaviour analysis for ambient-assisted living
-
[13] Chaaraoui, A.A., Climent-Pérez, P., Flórez-Revuelta, F., A review on vision techniques applied to human behaviour analysis for ambient-assisted living. Expert Syst. Appl. 39:12 (2012), 10873–10888.
-
(2012)
Expert Syst. Appl.
, vol.39
, Issue.12
, pp. 10873-10888
-
-
Chaaraoui, A.A.1
Climent-Pérez, P.2
Flórez-Revuelta, F.3
-
14
-
-
85072028231
-
Return of the devil in the details: delving deep into convolutional nets
-
[14] Chatfield, K., Simonyan, K., Vedaldi, A., Zisserman, A., Return of the devil in the details: delving deep into convolutional nets. British Machine Vision Conference, 2014.
-
(2014)
British Machine Vision Conference
-
-
Chatfield, K.1
Simonyan, K.2
Vedaldi, A.3
Zisserman, A.4
-
15
-
-
24644436425
-
Learning a similarity metric discriminatively, with application to face verification
-
[15] Chopra, S., Hadsell, R., LeCun, Y., Learning a similarity metric discriminatively, with application to face verification. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), CVPR'05, 2005, 539–546.
-
(2005)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), CVPR'05
, pp. 539-546
-
-
Chopra, S.1
Hadsell, R.2
LeCun, Y.3
-
16
-
-
24644524200
-
Visual categorization with bags of keypoints
-
[16] Csurka, G., Dance, C.R., Fan, L., Willamowski, J., Bray, C., Visual categorization with bags of keypoints. In Workshop on Statistical Learning in Computer Vision, ECCV, 2004, 1–22.
-
(2004)
In Workshop on Statistical Learning in Computer Vision, ECCV
, pp. 1-22
-
-
Csurka, G.1
Dance, C.R.2
Fan, L.3
Willamowski, J.4
Bray, C.5
-
17
-
-
33645146449
-
Histograms of oriented gradients for human detection
-
vol. 1
-
[17] Dalal, N., Triggs, B., Histograms of oriented gradients for human detection. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol. 1, 2005, 886–893 vol. 1.
-
(2005)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, vol.1
, pp. 886-893
-
-
Dalal, N.1
Triggs, B.2
-
18
-
-
33745821718
-
Human detection using oriented histograms of flow and appearance
-
[18] Dalal, N., Triggs, B., Schmid, C., Human detection using oriented histograms of flow and appearance. Proc. European Conference on Computer Vision (ECCV), Lecture Notes in Computer Science (LNCS), vol. 3952, 2006, 428–441.
-
(2006)
Proc. European Conference on Computer Vision (ECCV), Lecture Notes in Computer Science (LNCS)
, vol.3952
, pp. 428-441
-
-
Dalal, N.1
Triggs, B.2
Schmid, C.3
-
19
-
-
84990031871
-
Sympathy for the details: dense trajectories and hybrid classification architectures for action recognition
-
[19] de Souza, C.R., Gaidon, A., Vig, E., López, A.M., Sympathy for the details: dense trajectories and hybrid classification architectures for action recognition. Proc. European Conference on Computer Vision (ECCV), 2016, 697–716.
-
(2016)
Proc. European Conference on Computer Vision (ECCV)
, pp. 697-716
-
-
de Souza, C.R.1
Gaidon, A.2
Vig, E.3
López, A.M.4
-
20
-
-
33846622081
-
Behavior recognition via sparse spatio-temporal features
-
[20] Dollar, P., Rabaud, V., Cottrell, G., Belongie, S., Behavior recognition via sparse spatio-temporal features. Proceedings of the 14th International Conference on Computer Communications and Networks, 2005, 65–72.
-
(2005)
Proceedings of the 14th International Conference on Computer Communications and Networks
, pp. 65-72
-
-
Dollar, P.1
Rabaud, V.2
Cottrell, G.3
Belongie, S.4
-
21
-
-
84959236502
-
Long-term recurrent convolutional networks for visual recognition and description
-
[21] Donahue, J., Hendricks, L.A., Guadarrama, S., Rohrbach, M., Venugopalan, S., Saenko, K., Darrell, T., Long-term recurrent convolutional networks for visual recognition and description. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015, 2625–2634.
-
(2015)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 2625-2634
-
-
Donahue, J.1
Hendricks, L.A.2
Guadarrama, S.3
Rohrbach, M.4
Venugopalan, S.5
Saenko, K.6
Darrell, T.7
-
22
-
-
84919881041
-
DeCAF: a deep convolutional activation feature for generic visual recognition
-
[22] Donahue, J., Jia, Y., Vinyals, O., Hoffman, J., Zhang, N., Tzeng, E., Darrell, T., DeCAF: a deep convolutional activation feature for generic visual recognition. International Conference in Machine Learning (ICML), 2014.
-
(2014)
International Conference in Machine Learning (ICML)
-
-
Donahue, J.1
Jia, Y.2
Vinyals, O.3
Hoffman, J.4
Zhang, N.5
Tzeng, E.6
Darrell, T.7
-
23
-
-
33645712892
-
Compressed sensing
-
[23] Donoho, D.L., Compressed sensing. IEEE Trans. Inf. Theory 52:4 (2006), 1289–1306.
-
(2006)
IEEE Trans. Inf. Theory
, vol.52
, Issue.4
, pp. 1289-1306
-
-
Donoho, D.L.1
-
24
-
-
0037312530
-
Dynamic textures
-
[24] Doretto, G., Chiuso, A., Wu, Y.N., Soatto, S., Dynamic textures. Int. J. Comput. Vis. 51:2 (2003), 91–109.
-
(2003)
Int. J. Comput. Vis.
, vol.51
, Issue.2
, pp. 91-109
-
-
Doretto, G.1
Chiuso, A.2
Wu, Y.N.3
Soatto, S.4
-
25
-
-
84959217041
-
Hierarchical recurrent neural network for skeleton based action recognition
-
June
-
[25] Du, Y., Wang, W., Wang, L., Hierarchical recurrent neural network for skeleton based action recognition. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2015, 1110–1118.
-
(2015)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 1110-1118
-
-
Du, Y.1
Wang, W.2
Wang, L.3
-
26
-
-
84892329327
-
Sparse and Redundant Representations — From Theory to Applications in Signal and Image Processing
-
Springer
-
[26] Elad, M., Sparse and Redundant Representations — From Theory to Applications in Signal and Image Processing. 2010, Springer.
-
(2010)
-
-
Elad, M.1
-
27
-
-
84884541998
-
Sparse subspace clustering: algorithm, theory, and applications
-
[27] Elhamifar, E., Vidal, R., Sparse subspace clustering: algorithm, theory, and applications. IEEE Trans. Pattern Anal. Mach. Intell. 35:11 (2013), 2765–2781.
-
(2013)
IEEE Trans. Pattern Anal. Mach. Intell.
, vol.35
, Issue.11
, pp. 2765-2781
-
-
Elhamifar, E.1
Vidal, R.2
-
28
-
-
33745155436
-
A Bayesian hierarchical model for learning natural scene categories
-
vol. 2
-
[28] Fei-Fei, L., Perona, P., A Bayesian hierarchical model for learning natural scene categories. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol. 2, 2005, 524–531 vol. 2.
-
(2005)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, vol.2
, pp. 524-531
-
-
Fei-Fei, L.1
Perona, P.2
-
29
-
-
84986266741
-
Convolutional two-stream network fusion for video action recognition
-
[29] Feichtenhofer, C., Pinz, A., Zisserman, A., Convolutional two-stream network fusion for video action recognition. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, 1933–1941.
-
(2016)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 1933-1941
-
-
Feichtenhofer, C.1
Pinz, A.2
Zisserman, A.3
-
30
-
-
84986290213
-
Discriminative hierarchical rank pooling for activity recognition
-
[30] Fernando, B., Anderson, P., Hutter, M., Gould, S., Discriminative hierarchical rank pooling for activity recognition. The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
-
(2016)
The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
-
-
Fernando, B.1
Anderson, P.2
Hutter, M.3
Gould, S.4
-
31
-
-
84959223985
-
Modeling video evolution for action recognition
-
[31] Fernando, B., Gavves, E., Oramas, M.J., Ghodrati, A., Tuytelaars, T., Modeling video evolution for action recognition. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015, 5378–5387.
-
(2015)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 5378-5387
-
-
Fernando, B.1
Gavves, E.2
Oramas, M.J.3
Ghodrati, A.4
Tuytelaars, T.5
-
32
-
-
84998887168
-
Learning end-to-end video classification with rank-pooling
-
[32] Fernando, B., Gould, S., Learning end-to-end video classification with rank-pooling. ICML, 2016.
-
(2016)
ICML
-
-
Fernando, B.1
Gould, S.2
-
33
-
-
80052915321
-
Actom sequence models for efficient action detection
-
[33] Gaidon, A., Harchaoui, Z., Schmid, C., Actom sequence models for efficient action detection. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2011, 3201–3208.
-
(2011)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 3201-3208
-
-
Gaidon, A.1
Harchaoui, Z.2
Schmid, C.3
-
34
-
-
84897461487
-
Activity representation with motion hierarchies
-
[34] Gaidon, A., Harchaoui, Z., Schmid, C., Activity representation with motion hierarchies. Int. J. Comput. Vis. 107:3 (2014), 219–238.
-
(2014)
Int. J. Comput. Vis.
, vol.107
, Issue.3
, pp. 219-238
-
-
Gaidon, A.1
Harchaoui, Z.2
Schmid, C.3
-
35
-
-
70349227947
-
The application of hidden Markov models in speech recognition
-
[35] Gales, M., Young, S., The application of hidden Markov models in speech recognition. Found. Trends Signal Process. 1:3 (2007), 195–304.
-
(2007)
Found. Trends Signal Process.
, vol.1
, Issue.3
, pp. 195-304
-
-
Gales, M.1
Young, S.2
-
36
-
-
85015679944
-
1 2 Separate Visual Pathways for Perception and Action. Essential Sources in the Scientific Study of Consciousness
-
[36] Goodale, M.A., Milner, A.D., 1 2 Separate Visual Pathways for Perception and Action. Essential Sources in the Scientific Study of Consciousness. 2003, 175.
-
(2003)
, pp. 175
-
-
Goodale, M.A.1
Milner, A.D.2
-
37
-
-
84937849144
-
Generative adversarial nets
-
[37] Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., Bengio, Y., Generative adversarial nets. Proc. Advances in Neural Information Processing Systems (NIPS), 2014, 2672–2680.
-
(2014)
Proc. Advances in Neural Information Processing Systems (NIPS)
, pp. 2672-2680
-
-
Goodfellow, I.1
Pouget-Abadie, J.2
Mirza, M.3
Xu, B.4
Warde-Farley, D.5
Ozair, S.6
Courville, A.7
Bengio, Y.8
-
38
-
-
84973902378
-
Unsupervised learning of spatiotemporally coherent metrics
-
[38] Goroshin, R., Bruna, J., Tompson, J., Eigen, D., LeCun, Y., Unsupervised learning of spatiotemporally coherent metrics. Proc. Int. Conference on Computer Vision (ICCV), 2015, 4086–4093.
-
(2015)
Proc. Int. Conference on Computer Vision (ICCV)
, pp. 4086-4093
-
-
Goroshin, R.1
Bruna, J.2
Tompson, J.3
Eigen, D.4
LeCun, Y.5
-
39
-
-
84862649818
-
Learning sparse representations for human action recognition
-
[39] Guha, T., Ward, R.K., Learning sparse representations for human action recognition. IEEE Trans. Pattern Anal. Mach. Intell. 34:8 (2012), 1576–1588.
-
(2012)
IEEE Trans. Pattern Anal. Mach. Intell.
, vol.34
, Issue.8
, pp. 1576-1588
-
-
Guha, T.1
Ward, R.K.2
-
40
-
-
84911424665
-
Bregman divergences for infinite dimensional covariance matrices
-
June
-
[40] Harandi, M., Salzmann, M., Porikli, F., Bregman divergences for infinite dimensional covariance matrices. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2014, 1003–1010.
-
(2014)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 1003-1010
-
-
Harandi, M.1
Salzmann, M.2
Porikli, F.3
-
42
-
-
84986274465
-
Deep residual learning for image recognition
-
[42] He, K., Zhang, X., Ren, S., Sun, J., Deep residual learning for image recognition. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
-
(2016)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
-
-
He, K.1
Zhang, X.2
Ren, S.3
Sun, J.4
-
44
-
-
0031573117
-
Long short-term memory
-
[44] Hochreiter, S., Schmidhuber, J., Long short-term memory. Neural Comput. 9:8 (1997), 1735–1780.
-
(1997)
Neural Comput.
, vol.9
, Issue.8
, pp. 1735-1780
-
-
Hochreiter, S.1
Schmidhuber, J.2
-
45
-
-
84902292428
-
Asymmetric and category invariant feature transformations for domain adaptation
-
[45] Hoffman, J., Rodner, E., Donahue, J., Kulis, B., Saenko, K., Asymmetric and category invariant feature transformations for domain adaptation. Int. J. Comput. Vis. 109:1–2 (2014), 28–41.
-
(2014)
Int. J. Comput. Vis.
, vol.109
, Issue.1-2
, pp. 28-41
-
-
Hoffman, J.1
Rodner, E.2
Donahue, J.3
Kulis, B.4
Saenko, K.5
-
46
-
-
0020968792
-
Model-based vision: a program to see a walking person
-
[46] Hogg, D., Model-based vision: a program to see a walking person. Image Vision Comput. 1 (1983), 5–20.
-
(1983)
Image Vision Comput.
, vol.1
, pp. 5-20
-
-
Hogg, D.1
-
47
-
-
0345414187
-
Large-scale event detection using semi-hidden Markov models
-
[47] Hongeng, S., Nevatia, R., Large-scale event detection using semi-hidden Markov models. Proc. Int. Conference on Computer Vision (ICCV), vol. 2, 2003, 1455–1462.
-
(2003)
Proc. Int. Conference on Computer Vision (ICCV)
, vol.2
, pp. 1455-1462
-
-
Hongeng, S.1
Nevatia, R.2
-
48
-
-
84986250512
-
Sparse coding and dictionary learning with linear dynamical systems
-
[48] Huang, W., Sun, F., Cao, L., Zhao, D., Liu, H., Harandi, M., Sparse coding and dictionary learning with linear dynamical systems. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
-
(2016)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
-
-
Huang, W.1
Sun, F.2
Cao, L.3
Zhao, D.4
Liu, H.5
Harandi, M.6
-
49
-
-
84898982939
-
Exploiting generative models in discriminative classifiers
-
MIT Press
-
[49] Jaakkola, T., Haussler, D., Exploiting generative models in discriminative classifiers. Proc. Advances in Neural Information Processing Systems (NIPS), 1998, MIT Press, 487–493.
-
(1998)
Proc. Advances in Neural Information Processing Systems (NIPS)
, pp. 487-493
-
-
Jaakkola, T.1
Haussler, D.2
-
50
-
-
84887398298
-
Better exploiting motion for better action recognition
-
[50] Jain, M., Jgou, H., Bouthemy, P., Better exploiting motion for better action recognition. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2013, 2555–2562.
-
(2013)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 2555-2562
-
-
Jain, M.1
Jgou, H.2
Bouthemy, P.3
-
51
-
-
77956004473
-
Aggregating local descriptors into a compact image representation
-
June
-
[51] Jgou, H., Douze, M., Schmid, C., Prez, P., Aggregating local descriptors into a compact image representation. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2010, 3304–3311.
-
(2010)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 3304-3311
-
-
Jgou, H.1
Douze, M.2
Schmid, C.3
Prez, P.4
-
52
-
-
84870183903
-
3D convolutional neural networks for human action recognition
-
[52] Ji, S., Xu, W., Yang, M., Yu, K., 3D convolutional neural networks for human action recognition. IEEE Trans. Pattern Anal. Mach. Intell. 35:1 (2013), 221–231.
-
(2013)
IEEE Trans. Pattern Anal. Mach. Intell.
, vol.35
, Issue.1
, pp. 221-231
-
-
Ji, S.1
Xu, W.2
Yang, M.3
Yu, K.4
-
53
-
-
84938629614
-
Human action recognition in unconstrained videos by explicit motion modeling
-
[53] Jiang, Y.G., Dai, Q., Liu, W., Xue, X., Ngo, C.W., Human action recognition in unconstrained videos by explicit motion modeling. IEEE Trans. Image Process. 24:11 (2015), 3781–3795.
-
(2015)
IEEE Trans. Image Process.
, vol.24
, Issue.11
, pp. 3781-3795
-
-
Jiang, Y.G.1
Dai, Q.2
Liu, W.3
Xue, X.4
Ngo, C.W.5
-
54
-
-
84867849524
-
Trajectory-based modeling of human actions with motion reference points
-
[54] Jiang, Y.-G., Dai, Q., Xue, X., Liu, W., Ngo, C.-W., Trajectory-based modeling of human actions with motion reference points. Proc. European Conference on Computer Vision (ECCV), 2012, 425–438.
-
(2012)
Proc. European Conference on Computer Vision (ECCV)
, pp. 425-438
-
-
Jiang, Y.-G.1
Dai, Q.2
Xue, X.3
Liu, W.4
Ngo, C.-W.5
-
55
-
-
84911441074
-
Efficient feature extraction, encoding, and classification for action recognition
-
[55] Kantorov, V., Laptev, I., Efficient feature extraction, encoding, and classification for action recognition. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014, 2593–2600.
-
(2014)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 2593-2600
-
-
Kantorov, V.1
Laptev, I.2
-
56
-
-
84911364368
-
Large-scale video classification with convolutional neural networks
-
[56] Karpathy, A., Toderici, G., Shetty, S., Leung, T., Sukthankar, R., Fei-Fei, L., Large-scale video classification with convolutional neural networks. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014, 1725–1732.
-
(2014)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 1725-1732
-
-
Karpathy, A.1
Toderici, G.2
Shetty, S.3
Leung, T.4
Sukthankar, R.5
Fei-Fei, L.6
-
57
-
-
84898491875
-
Human activity recognition using a dynamic texture based method
-
[57] Kellokumpu, V., Zhao, G., Pietikinen, M., Human activity recognition using a dynamic texture based method. British Machine Vision Conference, 2008, 885–894.
-
(2008)
British Machine Vision Conference
, pp. 885-894
-
-
Kellokumpu, V.1
Zhao, G.2
Pietikinen, M.3
-
58
-
-
84898426452
-
A spatio-temporal descriptor based on 3D-gradients
-
[58] Kläser, A., MarszaÅek, M., Schmid, C., A spatio-temporal descriptor based on 3D-gradients. In BMVC08, 2008, 275:1-10.
-
(2008)
In BMVC08
, pp. 2751-10
-
-
Kläser, A.1
MarszaÅek, M.2
Schmid, C.3
-
59
-
-
84867849228
-
Motion interchange patterns for action recognition in unconstrained videos
-
[59] Kliper-Gross, O., Gurovich, Y., Hassner, T., Wolf, L., Motion interchange patterns for action recognition in unconstrained videos. Proc. European Conference on Computer Vision (ECCV), 2012, 256–269.
-
(2012)
Proc. European Conference on Computer Vision (ECCV)
, pp. 256-269
-
-
Kliper-Gross, O.1
Gurovich, Y.2
Hassner, T.3
Wolf, L.4
-
60
-
-
84990026420
-
Tensor representations via kernel linearization for action recognition from 3D skeletons
-
[60] Koniusz, P., Cherian, A., Porikli, F., Tensor representations via kernel linearization for action recognition from 3D skeletons. Proc. European Conference on Computer Vision (ECCV), 2016, 37–53.
-
(2016)
Proc. European Conference on Computer Vision (ECCV)
, pp. 37-53
-
-
Koniusz, P.1
Cherian, A.2
Porikli, F.3
-
62
-
-
84876231242
-
ImageNet classification with deep convolutional neural networks
-
[62] Krizhevsky, A., Sutskever, I., Hinton, G.E., ImageNet classification with deep convolutional neural networks. Proc. Advances in Neural Information Processing Systems (NIPS), 2012, 1097–1105.
-
(2012)
Proc. Advances in Neural Information Processing Systems (NIPS)
, pp. 1097-1105
-
-
Krizhevsky, A.1
Sutskever, I.2
Hinton, G.E.3
-
63
-
-
84929272905
-
High Performance Computing in Science and Engineering ‘12: Transactions of the High Performance Computing Center, Stuttgart (HLRS) 2012
-
Springer Berlin Heidelberg Ch. HMDB51: A Large Video Database for Human Motion Recognition
-
[63] Kuehne, H., Jhuang, H., Stiefelhagen, R., Serre, T., High Performance Computing in Science and Engineering ‘12: Transactions of the High Performance Computing Center, Stuttgart (HLRS) 2012. 2013, Springer, Berlin Heidelberg, 571–582 Ch. HMDB51: A Large Video Database for Human Motion Recognition.
-
(2013)
, pp. 571-582
-
-
Kuehne, H.1
Jhuang, H.2
Stiefelhagen, R.3
Serre, T.4
-
64
-
-
0142192295
-
Conditional random fields: probabilistic models for segmenting and labeling sequence data
-
Morgan Kaufmann Publishers Inc. San Francisco, CA, USA
-
[64] Lafferty, J.D., McCallum, A., Pereira, F.C.N., Conditional random fields: probabilistic models for segmenting and labeling sequence data. Proc. Int. Conference on Machine Learning (ICML), ICML'01, 2001, Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 282–289.
-
(2001)
Proc. Int. Conference on Machine Learning (ICML), ICML'01
, pp. 282-289
-
-
Lafferty, J.D.1
McCallum, A.2
Pereira, F.C.N.3
-
65
-
-
84959241532
-
Beyond Gaussian pyramid: multi-skip feature stacking for action recognition
-
[65] Lan, Z., Lin, M., Li, X., Hauptmann, A.G., Raj, B., Beyond Gaussian pyramid: multi-skip feature stacking for action recognition. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015, 204–212.
-
(2015)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 204-212
-
-
Lan, Z.1
Lin, M.2
Li, X.3
Hauptmann, A.G.4
Raj, B.5
-
66
-
-
24944451092
-
On space–time interest points
-
[66] Laptev, I., On space–time interest points. Int. J. Comput. Vis. 64:2 (2005), 107–123.
-
(2005)
Int. J. Comput. Vis.
, vol.64
, Issue.2
, pp. 107-123
-
-
Laptev, I.1
-
67
-
-
51949083365
-
Learning realistic human actions from movies
-
June
-
[67] Laptev, I., Marszalek, M., Schmid, C., Rozenfeld, B., Learning realistic human actions from movies. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2008, 1–8.
-
(2008)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 1-8
-
-
Laptev, I.1
Marszalek, M.2
Schmid, C.3
Rozenfeld, B.4
-
68
-
-
84990062965
-
Segmental spatiotemporal CNNs for fine-grained action segmentation
-
[68] Lea, C., Reiter, A., Vidal, R., Hager, G.D., Segmental spatiotemporal CNNs for fine-grained action segmentation. Proc. European Conference on Computer Vision (ECCV), 2016, 36–52.
-
(2016)
Proc. European Conference on Computer Vision (ECCV)
, pp. 36-52
-
-
Lea, C.1
Reiter, A.2
Vidal, R.3
Hager, G.D.4
-
69
-
-
0032203257
-
Gradient-based learning applied to document recognition
-
[69] Lecun, Y., Bottou, L., Bengio, Y., Haffner, P., Gradient-based learning applied to document recognition. Proc. IEEE 86:11 (1998), 2278–2324.
-
(1998)
Proc. IEEE
, vol.86
, Issue.11
, pp. 2278-2324
-
-
Lecun, Y.1
Bottou, L.2
Bengio, Y.3
Haffner, P.4
-
70
-
-
85162513516
-
Object bank: a high-level image representation for scene classification & semantic feature sparsification
-
[70] Li, L.-J., Su, H., Fei-fei, L., Xing, E.P., Object bank: a high-level image representation for scene classification & semantic feature sparsification. Proc. Advances in Neural Information Processing Systems (NIPS), 2010, 1378–1386.
-
(2010)
Proc. Advances in Neural Information Processing Systems (NIPS)
, pp. 1378-1386
-
-
Li, L.-J.1
Su, H.2
Fei-fei, L.3
Xing, E.P.4
-
71
-
-
84986305500
-
VLAD3: encoding dynamics of deep features for action recognition
-
[71] Li, Y., Li, W., Mahadevan, V., Vasconcelos, N., VLAD3: encoding dynamics of deep features for action recognition. The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
-
(2016)
The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
-
-
Li, Y.1
Li, W.2
Mahadevan, V.3
Vasconcelos, N.4
-
72
-
-
70450203660
-
Recognizing realistic actions from videos “in the wild”
-
[72] Liu, J., Luo, J., Shah, M., Recognizing realistic actions from videos “in the wild”. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2009, 1996–2003.
-
(2009)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 1996-2003
-
-
Liu, J.1
Luo, J.2
Shah, M.3
-
73
-
-
84990059379
-
Spatio-temporal LSTM with trust gates for 3D human action recognition
-
Springer International Publishing
-
[73] Liu, J., Shahroudy, A., Xu, D., Wang, G., Spatio-temporal LSTM with trust gates for 3D human action recognition. Proc. European Conference on Computer Vision (ECCV), 2016, Springer International Publishing, 816–833.
-
(2016)
Proc. European Conference on Computer Vision (ECCV)
, pp. 816-833
-
-
Liu, J.1
Shahroudy, A.2
Xu, D.3
Wang, G.4
-
74
-
-
84987664752
-
Nonlinear metric learning for visual tracking
-
[74] Lu, J., Hu, J., Tan, Y.P., Nonlinear metric learning for visual tracking. 2016 IEEE International Conference on Multimedia and Expo (ICME), 2016, 1–6.
-
(2016)
2016 IEEE International Conference on Multimedia and Expo (ICME)
, pp. 1-6
-
-
Lu, J.1
Hu, J.2
Tan, Y.P.3
-
75
-
-
39149089704
-
Sparse representation for color image restoration
-
[75] Mairal, J., Elad, M., Sapiro, G., Sparse representation for color image restoration. IEEE Trans. Image Process. (TIP) 17:1 (2008), 53–69.
-
(2008)
IEEE Trans. Image Process. (TIP)
, vol.17
, Issue.1
, pp. 53-69
-
-
Mairal, J.1
Elad, M.2
Sapiro, G.3
-
76
-
-
0020073967
-
Representation and recognition of the movements of shapes
-
[76] Marr, D., Vaina, L., Representation and recognition of the movements of shapes. Proc. R. Soc. Lond. B Biol. Sci. 214:1197 (1982), 501–524.
-
(1982)
Proc. R. Soc. Lond. B Biol. Sci.
, vol.214
, Issue.1197
, pp. 501-524
-
-
Marr, D.1
Vaina, L.2
-
77
-
-
70450177757
-
Actions in context
-
[77] Marszalek, M., Laptev, I., Schmid, C., Actions in context. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2009, 2929–2936.
-
(2009)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 2929-2936
-
-
Marszalek, M.1
Laptev, I.2
Schmid, C.3
-
78
-
-
84990053441
-
Deep Multi-Scale Video Prediction Beyond Mean Square Error
-
CoRR
-
[78] Mathieu, M., Couprie, C., LeCun, Y., Deep Multi-Scale Video Prediction Beyond Mean Square Error. 2015, CoRR.
-
(2015)
-
-
Mathieu, M.1
Couprie, C.2
LeCun, Y.3
-
79
-
-
77953178862
-
Trajectons: action recognition through the motion analysis of tracked features
-
[79] Matikainen, P., Hebert, M., Sukthankar, R., Trajectons: action recognition through the motion analysis of tracked features. Computer Vision Workshops (ICCV Workshops), 2009 IEEE 12th International Conference on, Sept. 2009, 514–521.
-
(2009)
Computer Vision Workshops (ICCV Workshops), 2009 IEEE 12th International Conference on
, pp. 514-521
-
-
Matikainen, P.1
Hebert, M.2
Sukthankar, R.3
-
80
-
-
85121365374
-
Early results for named entity recognition with conditional random fields, feature induction and web-enhanced lexicons
-
Association for Computational Linguistics Stroudsburg, PA, USA
-
[80] McCallum, A., Li, W., Early results for named entity recognition with conditional random fields, feature induction and web-enhanced lexicons. Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL 2003 - Volume 4, CONLL'03, 2003, Association for Computational Linguistics, Stroudsburg, PA, USA, 188–191.
-
(2003)
Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL 2003 - Volume 4, CONLL'03
, pp. 188-191
-
-
McCallum, A.1
Li, W.2
-
81
-
-
77953182943
-
Activity recognition using the velocity histories of tracked keypoints
-
[81] Messing, R., Pal, C., Kautz, H., Activity recognition using the velocity histories of tracked keypoints. Proc. Int. Conference on Computer Vision (ICCV), 2009, 104–111.
-
(2009)
Proc. Int. Conference on Computer Vision (ICCV)
, pp. 104-111
-
-
Messing, R.1
Pal, C.2
Kautz, H.3
-
82
-
-
84886716503
-
A review of motion analysis methods for human nonverbal communication computing
-
[82] Metaxas, D., Zhang, S., A review of motion analysis methods for human nonverbal communication computing. Image Vision Comput. 31:6–7 (2013), 421–433.
-
(2013)
Image Vision Comput.
, vol.31
, Issue.6-7
, pp. 421-433
-
-
Metaxas, D.1
Zhang, S.2
-
84
-
-
85015698797
-
Unsupervised Learning Using Sequential Verification for Action Recognition
-
arXiv preprint arXiv:1603.08561
-
[84] Misra, I., Zitnick, C.L., Hebert, M., Unsupervised Learning Using Sequential Verification for Action Recognition. 2016 arXiv preprint arXiv:1603.08561.
-
(2016)
-
-
Misra, I.1
Zitnick, C.L.2
Hebert, M.3
-
85
-
-
33749990780
-
A survey of advances in vision-based human motion capture and analysis
-
[85] Moeslund, T.B., Granum, E., A survey of advances in vision-based human motion capture and analysis. Comput. Vis. Image Underst. 104:3 (2006), 90–127.
-
(2006)
Comput. Vis. Image Underst.
, vol.104
, Issue.3
, pp. 90-127
-
-
Moeslund, T.B.1
Granum, E.2
-
86
-
-
84959228762
-
Beyond short snippets: deep networks for video classification
-
[86] Ng, J.Y.-H., Hausknecht, M., Vijayanarasimhan, S., Vinyals, O., Monga, R., Toderici, G., Beyond short snippets: deep networks for video classification. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015, 4694–4702.
-
(2015)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 4694-4702
-
-
Ng, J.Y.-H.1
Hausknecht, M.2
Vijayanarasimhan, S.3
Vinyals, O.4
Monga, R.5
Toderici, G.6
-
87
-
-
84986331364
-
Progressively parsing interactional objects for fine grained action detection
-
[87] Ni, B., Yang, X., Gao, S., Progressively parsing interactional objects for fine grained action detection. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
-
(2016)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
-
-
Ni, B.1
Yang, X.2
Gao, S.3
-
88
-
-
78149353400
-
Modeling temporal structure of decomposable motion segments for activity classification
-
[88] Niebles, J.C., Chen, C.-W., Fei-Fei, L., Modeling temporal structure of decomposable motion segments for activity classification. Proc. European Conference on Computer Vision (ECCV), 2010, 392–405.
-
(2010)
Proc. European Conference on Computer Vision (ECCV)
, pp. 392-405
-
-
Niebles, J.C.1
Chen, C.-W.2
Fei-Fei, L.3
-
89
-
-
84867875030
-
Directional space–time oriented gradients for 3D visual pattern analysis
-
[89] Norouznezhad, E., Harandi, M.T., Bigdeli, A., Baktash, M., Postula, A., Lovell, B.C., Directional space–time oriented gradients for 3D visual pattern analysis. Proc. European Conference on Computer Vision (ECCV), 2012, 736–749.
-
(2012)
Proc. European Conference on Computer Vision (ECCV)
, pp. 736-749
-
-
Norouznezhad, E.1
Harandi, M.T.2
Bigdeli, A.3
Baktash, M.4
Postula, A.5
Lovell, B.C.6
-
90
-
-
33745835368
-
Sampling strategies for bag-of-features image classification
-
Springer Berlin Heidelberg Berlin, Heidelberg
-
[90] Nowak, E., Jurie, F., Triggs, B., Sampling strategies for bag-of-features image classification. Proc. European Conference on Computer Vision (ECCV), 2006, Springer Berlin Heidelberg, Berlin, Heidelberg, 490–503.
-
(2006)
Proc. European Conference on Computer Vision (ECCV)
, pp. 490-503
-
-
Nowak, E.1
Jurie, F.2
Triggs, B.3
-
91
-
-
0036647193
-
Multiresolution gray-scale and rotation invariant texture classification with local binary patterns
-
[91] Ojala, T., Pietikainen, M., Maenpaa, T., Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans. Pattern Anal. Mach. Intell. 24:7 (2002), 971–987.
-
(2002)
IEEE Trans. Pattern Anal. Mach. Intell.
, vol.24
, Issue.7
, pp. 971-987
-
-
Ojala, T.1
Pietikainen, M.2
Maenpaa, T.3
-
92
-
-
0030779611
-
Sparse coding with an overcomplete basis set: a strategy employed by V1?
-
[92] Olshausen, B.A., Field, D.J., Sparse coding with an overcomplete basis set: a strategy employed by V1?. Vis. Res. 37:23 (1997), 3311–3325.
-
(1997)
Vis. Res.
, vol.37
, Issue.23
, pp. 3311-3325
-
-
Olshausen, B.A.1
Field, D.J.2
-
93
-
-
84898791167
-
Action and event recognition with Fisher vectors on a compact feature set
-
[93] Oneata, D., Verbeek, J., Schmid, C., Action and event recognition with Fisher vectors on a compact feature set. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2013, 1817–1824.
-
(2013)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 1817-1824
-
-
Oneata, D.1
Verbeek, J.2
Schmid, C.3
-
95
-
-
84906511738
-
Bag of Visual Words and Fusion Methods for Action Recognition: Comprehensive Study and Good Practice
-
CoRR abs/1405.4506
-
[95] Peng, X., Wang, L., Wang, X., Qiao, Y., Bag of Visual Words and Fusion Methods for Action Recognition: Comprehensive Study and Good Practice. 2014 CoRR abs/1405.4506.
-
(2014)
-
-
Peng, X.1
Wang, L.2
Wang, X.3
Qiao, Y.4
-
96
-
-
84906510060
-
Action recognition with stacked Fisher vectors
-
[96] Peng, X., Zou, C., Qiao, Y., Peng, Q., Action recognition with stacked Fisher vectors. Proc. European Conference on Computer Vision (ECCV), 2014, 581–595.
-
(2014)
Proc. European Conference on Computer Vision (ECCV)
, pp. 581-595
-
-
Peng, X.1
Zou, C.2
Qiao, Y.3
Peng, Q.4
-
98
-
-
77949275097
-
A survey on vision-based human action recognition
-
[98] Poppe, R., A survey on vision-based human action recognition. Image Vision Comput. 28:6 (2010), 976–990.
-
(2010)
Image Vision Comput.
, vol.28
, Issue.6
, pp. 976-990
-
-
Poppe, R.1
-
99
-
-
36049014768
-
Hidden conditional random fields
-
[99] Quattoni, A., Wang, S., Morency, L.-P., Collins, M., Darrell, T., Hidden conditional random fields. IEEE Trans. Pattern Anal. Mach. Intell. 29:10 (2007), 1848–1852.
-
(2007)
IEEE Trans. Pattern Anal. Mach. Intell.
, vol.29
, Issue.10
, pp. 1848-1852
-
-
Quattoni, A.1
Wang, S.2
Morency, L.-P.3
Collins, M.4
Darrell, T.5
-
100
-
-
0024610919
-
A tutorial on hidden Markov models and selected applications in speech recognition
-
[100] Rabiner, L.R., A tutorial on hidden Markov models and selected applications in speech recognition. Proc. IEEE 77:2 (Feb. 1989), 257–286.
-
(1989)
Proc. IEEE
, vol.77
, Issue.2
, pp. 257-286
-
-
Rabiner, L.R.1
-
103
-
-
84965108042
-
Video (language) Modeling: A Baseline for Generative Models of Natural Videos
-
CoRR
-
[103] Ranzato, M., Szlam, A., Bruna, J., Mathieu, M., Collobert, R., Chopra, S., Video (language) Modeling: A Baseline for Generative Models of Natural Videos. 2014, CoRR.
-
(2014)
-
-
Ranzato, M.1
Szlam, A.2
Bruna, J.3
Mathieu, M.4
Collobert, R.5
Chopra, S.6
-
104
-
-
84879553900
-
Recognizing 50 human action categories of web videos
-
[104] Reddy, K.K., Shah, M., Recognizing 50 human action categories of web videos. Mach. Vis. Appl. 24:5 (2013), 971–981.
-
(2013)
Mach. Vis. Appl.
, vol.24
, Issue.5
, pp. 971-981
-
-
Reddy, K.K.1
Shah, M.2
-
105
-
-
0018015137
-
Modeling by shortest data description
-
[105] Rissanen, J., Modeling by shortest data description. Automatica 14:5 (1978), 465–471.
-
(1978)
Automatica
, vol.14
, Issue.5
, pp. 465-471
-
-
Rissanen, J.1
-
107
-
-
51949084792
-
Action MACH a spatio-temporal maximum average correlation height filter for action recognition
-
[107] Rodriguez, M.D., Ahmed, J., Shah, M., Action MACH a spatio-temporal maximum average correlation height filter for action recognition. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2008, 1–8.
-
(2008)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 1-8
-
-
Rodriguez, M.D.1
Ahmed, J.2
Shah, M.3
-
108
-
-
0027928914
-
Towards model-based recognition of human movements in image sequences
-
[108] Rohr, K., Towards model-based recognition of human movements in image sequences. CVGIP: Image Underst. 59:1 (Jan. 1994), 94–115.
-
(1994)
CVGIP: Image Underst.
, vol.59
, Issue.1
, pp. 94-115
-
-
Rohr, K.1
-
109
-
-
84866710901
-
A database for fine grained activity detection of cooking activities
-
[109] Rohrbach, M., Amin, S., Andriluka, M., Schiele, B., A database for fine grained activity detection of cooking activities. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2012, 1194–1201.
-
(2012)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 1194-1201
-
-
Rohrbach, M.1
Amin, S.2
Andriluka, M.3
Schiele, B.4
-
110
-
-
84947041871
-
ImageNet large scale visual recognition challenge
-
[110] Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., Huang, Z., Karpathy, A., Khosla, A., Bernstein, M., Berg, A.C., Fei-Fei, L., ImageNet large scale visual recognition challenge. Int. J. Comput. Vis. 115:3 (2015), 211–252.
-
(2015)
Int. J. Comput. Vis.
, vol.115
, Issue.3
, pp. 211-252
-
-
Russakovsky, O.1
Deng, J.2
Su, H.3
Krause, J.4
Satheesh, S.5
Ma, S.6
Huang, Z.7
Karpathy, A.8
Khosla, A.9
Bernstein, M.10
Berg, A.C.11
Fei-Fei, L.12
-
112
-
-
84875633658
-
Spatio-temporal covariance descriptors for action and gesture recognition
-
[112] Sanin, A., Sanderson, C., Harandi, M.T., Lovell, B.C., Spatio-temporal covariance descriptors for action and gesture recognition. IEEE Workshop on Applications of Computer Vision, 2013, 103–110.
-
(2013)
IEEE Workshop on Applications of Computer Vision
, pp. 103-110
-
-
Sanin, A.1
Sanderson, C.2
Harandi, M.T.3
Lovell, B.C.4
-
113
-
-
10044233701
-
Recognizing human actions: a local SVM approach
-
[113] Schuldt, C., Laptev, I., Caputo, B., Recognizing human actions: a local SVM approach. Proc. Int. Conference on Pattern Recognition (ICPR), ICPR'04, 2004, 32–36.
-
(2004)
Proc. Int. Conference on Pattern Recognition (ICPR), ICPR'04
, pp. 32-36
-
-
Schuldt, C.1
Laptev, I.2
Caputo, B.3
-
114
-
-
84901263896
-
Spatio-temporal Laplacian pyramid coding for action recognition
-
[114] Shao, L., Zhen, X., Tao, D., Li, X., Spatio-temporal Laplacian pyramid coding for action recognition. IEEE Trans. Cybern. 44:6 (2014), 817–827.
-
(2014)
IEEE Trans. Cybern.
, vol.44
, Issue.6
, pp. 817-827
-
-
Shao, L.1
Zhen, X.2
Tao, D.3
Li, X.4
-
116
-
-
84986328004
-
A multi-stream bi-directional recurrent neural network for fine-grained action detection
-
[116] Singh, B., Marks, T.K., Jones, M., Tuzel, O., Shao, M., A multi-stream bi-directional recurrent neural network for fine-grained action detection. The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
-
(2016)
The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
-
-
Singh, B.1
Marks, T.K.2
Jones, M.3
Tuzel, O.4
Shao, M.5
-
117
-
-
84899639091
-
Action recognition using global spatio-temporal features derived from sparse representations
-
[117] Somasundaram, G., Cherian, A., Morellas, V., Papanikolopoulos, N., Action recognition using global spatio-temporal features derived from sparse representations. Comput. Vis. Image Underst. 123 (2014), 1–13.
-
(2014)
Comput. Vis. Image Underst.
, vol.123
, pp. 1-13
-
-
Somasundaram, G.1
Cherian, A.2
Morellas, V.3
Papanikolopoulos, N.4
-
118
-
-
84887335980
-
Action recognition by hierarchical sequence summarization
-
[118] Song, Y., Morency, L.P., Davis, R., Action recognition by hierarchical sequence summarization. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2013, 3562–3569.
-
(2013)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 3562-3569
-
-
Song, Y.1
Morency, L.P.2
Davis, R.3
-
119
-
-
84884955228
-
UCF101: A Dataset of 101 Human Actions Classes From Videos in the Wild
-
CoRR abs/1212.0402
-
[119] Soomro, K., Zamir, A.R., Shah, M., UCF101: A Dataset of 101 Human Actions Classes From Videos in the Wild. 2012 CoRR abs/1212.0402.
-
(2012)
-
-
Soomro, K.1
Zamir, A.R.2
Shah, M.3
-
120
-
-
84944082890
-
Unsupervised Learning of Video Representations Using LSTMs
-
CoRR
-
[120] Srivastava, N., Mansimov, E., Salakhutdinov, R., Unsupervised Learning of Video Representations Using LSTMs. 2015, CoRR.
-
(2015)
-
-
Srivastava, N.1
Mansimov, E.2
Salakhutdinov, R.3
-
121
-
-
84965164720
-
Training very deep networks
-
[121] Srivastava, R.K., Greff, K., Schmidhuber, J., Training very deep networks. Proc. Advances in Neural Information Processing Systems (NIPS), 2015, 2377–2385.
-
(2015)
Proc. Advances in Neural Information Processing Systems (NIPS)
, pp. 2377-2385
-
-
Srivastava, R.K.1
Greff, K.2
Schmidhuber, J.3
-
122
-
-
84990062624
-
Hierarchical dynamic parsing and encoding for action recognition
-
[122] Su, B., Zhou, J., Ding, X., Wang, H., Wu, Y., Hierarchical dynamic parsing and encoding for action recognition. Proc. European Conference on Computer Vision (ECCV), 2016, 202–217.
-
(2016)
Proc. European Conference on Computer Vision (ECCV)
, pp. 202-217
-
-
Su, B.1
Zhou, J.2
Ding, X.3
Wang, H.4
Wu, Y.5
-
124
-
-
84875607338
-
Large-scale web video event classification by use of Fisher Vectors
-
[124] Sun, C., Nevatia, R., Large-scale web video event classification by use of Fisher Vectors. Applications of Computer Vision (WACV), 2013 IEEE Workshop on, 2013, 15–22.
-
(2013)
Applications of Computer Vision (WACV), 2013 IEEE Workshop on
, pp. 15-22
-
-
Sun, C.1
Nevatia, R.2
-
125
-
-
84973863239
-
Human action recognition using factorized spatio-temporal convolutional networks
-
[125] Sun, L., Jia, K., Yeung, D.Y., Shi, B.E., Human action recognition using factorized spatio-temporal convolutional networks. Proc. Int. Conference on Computer Vision (ICCV), 2015, 4597–4605.
-
(2015)
Proc. Int. Conference on Computer Vision (ICCV)
, pp. 4597-4605
-
-
Sun, L.1
Jia, K.2
Yeung, D.Y.3
Shi, B.E.4
-
126
-
-
84928547704
-
Sequence to sequence learning with neural networks
-
[126] Sutskever, I., Vinyals, O., Le, Q.V., Sequence to sequence learning with neural networks. Proc. Advances in Neural Information Processing Systems (NIPS), 2014, 3104–3112.
-
(2014)
Proc. Advances in Neural Information Processing Systems (NIPS)
, pp. 3104-3112
-
-
Sutskever, I.1
Vinyals, O.2
Le, Q.V.3
-
127
-
-
84937522268
-
Going deeper with convolutions
-
[127] Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A., Going deeper with convolutions. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015, 1–9.
-
(2015)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 1-9
-
-
Szegedy, C.1
Liu, W.2
Jia, Y.3
Sermanet, P.4
Reed, S.5
Anguelov, D.6
Erhan, D.7
Vanhoucke, V.8
Rabinovich, A.9
-
128
-
-
84866658784
-
Learning latent temporal structure for complex event detection
-
[128] Tang, K., Fei-Fei, L., Koller, D., Learning latent temporal structure for complex event detection. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2012, 1250–1257.
-
(2012)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 1250-1257
-
-
Tang, K.1
Fei-Fei, L.2
Koller, D.3
-
129
-
-
84860220275
-
Hierarchical filtered motion for action recognition in crowded videos
-
[129] Tian, Y., Cao, L., Liu, Z., Zhang, Z., Hierarchical filtered motion for action recognition in crowded videos. IEEE Trans. Syst. Man Cybern. Part C Appl. Rev. 42:3 (2012), 313–323.
-
(2012)
IEEE Trans. Syst. Man Cybern. Part C Appl. Rev.
, vol.42
, Issue.3
, pp. 313-323
-
-
Tian, Y.1
Cao, L.2
Liu, Z.3
Zhang, Z.4
-
130
-
-
84973865953
-
Learning spatiotemporal features with 3D convolutional networks
-
[130] Tran, D., Bourdev, L., Fergus, R., Torresani, L., Paluri, M., Learning spatiotemporal features with 3D convolutional networks. Proc. Int. Conference on Computer Vision (ICCV), 2015, 4489–4497.
-
(2015)
Proc. Int. Conference on Computer Vision (ICCV)
, pp. 4489-4497
-
-
Tran, D.1
Bourdev, L.2
Fergus, R.3
Torresani, L.4
Paluri, M.5
-
131
-
-
84986301251
-
Learning cross-domain landmarks for heterogeneous domain adaptation
-
[131] Hubert Tsai, Y.-H., Yeh, Y.-R., Frank Wang, Y.-C., Learning cross-domain landmarks for heterogeneous domain adaptation. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
-
(2016)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
-
-
Hubert Tsai, Y.-H.1
Yeh, Y.-R.2
Frank Wang, Y.-C.3
-
132
-
-
55149089260
-
Machine recognition of human activities: a survey
-
[132] Turaga, P., Chellappa, R., Subrahmanian, V.S., Udrea, O., Machine recognition of human activities: a survey. IEEE Trans. Circuits Syst. Video Technol. 18:11 (2008), 1473–1488.
-
(2008)
IEEE Trans. Circuits Syst. Video Technol.
, vol.18
, Issue.11
, pp. 1473-1488
-
-
Turaga, P.1
Chellappa, R.2
Subrahmanian, V.S.3
Udrea, O.4
-
133
-
-
33745818927
-
Region covariance: a fast descriptor for detection and classification
-
[133] Tuzel, O., Porikli, F., Meer, P., Region covariance: a fast descriptor for detection and classification. Proc. European Conference on Computer Vision (ECCV), 2006, 589–600.
-
(2006)
Proc. European Conference on Computer Vision (ECCV)
, pp. 589-600
-
-
Tuzel, O.1
Porikli, F.2
Meer, P.3
-
134
-
-
50249124717
-
Pedestrian detection via classification on Riemannian manifolds
-
[134] Tuzel, O., Porikli, F., Meer, P., Pedestrian detection via classification on Riemannian manifolds. IEEE Trans. Pattern Anal. Mach. Intell. 30:10 (2008), 1713–1727.
-
(2008)
IEEE Trans. Pattern Anal. Mach. Intell.
, vol.30
, Issue.10
, pp. 1713-1727
-
-
Tuzel, O.1
Porikli, F.2
Meer, P.3
-
135
-
-
84990038547
-
A Siamese long short-term memory architecture for human re-identification
-
[135] Varior, R.R., Shuai, B., Lu, J., Xu, D., Wang, G., A Siamese long short-term memory architecture for human re-identification. Proc. European Conference on Computer Vision (ECCV), 2016, 135–153.
-
(2016)
Proc. European Conference on Computer Vision (ECCV)
, pp. 135-153
-
-
Varior, R.R.1
Shuai, B.2
Lu, J.3
Xu, D.4
Wang, G.5
-
136
-
-
84990037381
-
Long-term temporal convolutions for action recognition
-
arXiv:1604.04494
-
[136] Varol, G., Laptev, I., Schmid, C., Long-term temporal convolutions for action recognition. 2016 arXiv:1604.04494.
-
(2016)
-
-
Varol, G.1
Laptev, I.2
Schmid, C.3
-
137
-
-
84911376484
-
Human action recognition by representing 3D skeletons as points in a lie group
-
June
-
[137] Vemulapalli, R., Arrate, F., Chellappa, R., Human action recognition by representing 3D skeletons as points in a lie group. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2014, 588–595.
-
(2014)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 588-595
-
-
Vemulapalli, R.1
Arrate, F.2
Chellappa, R.3
-
138
-
-
56449089103
-
Extracting and composing robust features with denoising autoencoders
-
[138] Vincent, P., Larochelle, H., Bengio, Y., Manzagol, P.-A., Extracting and composing robust features with denoising autoencoders. Proc. Int. Conference on Machine Learning (ICML), 2008, 1096–1103.
-
(2008)
Proc. Int. Conference on Machine Learning (ICML)
, pp. 1096-1103
-
-
Vincent, P.1
Larochelle, H.2
Bengio, Y.3
Manzagol, P.-A.4
-
139
-
-
84890553932
-
A survey on activity recognition and behavior understanding in video surveillance
-
[139] Vishwakarma, S., Agrawal, A., A survey on activity recognition and behavior understanding in video surveillance. Vis. Comput. 29:10 (2013), 983–1009.
-
(2013)
Vis. Comput.
, vol.29
, Issue.10
, pp. 983-1009
-
-
Vishwakarma, S.1
Agrawal, A.2
-
140
-
-
80052877143
-
Action recognition by dense trajectories
-
[140] Wang, H., Klaser, A., Schmid, C., Liu, C.-L., Action recognition by dense trajectories. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2011, 3169–3176.
-
(2011)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 3169-3176
-
-
Wang, H.1
Klaser, A.2
Schmid, C.3
Liu, C.-L.4
-
142
-
-
84898890371
-
Evaluation of local spatio-temporal features for action recognition
-
[142] Wang, H., Ullah, M.M., Kläser, A., Laptev, I., Schmid, C., Evaluation of local spatio-temporal features for action recognition. British Machine Vision Conference, Sep. 2009, 127.
-
(2009)
British Machine Vision Conference
, pp. 127
-
-
Wang, H.1
Ullah, M.M.2
Kläser, A.3
Laptev, I.4
Schmid, C.5
-
143
-
-
84955282488
-
Action recognition with trajectory-pooled deep-convolutional descriptors
-
[143] Wang, L., Qiao, Y., Tang, X., Action recognition with trajectory-pooled deep-convolutional descriptors. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015, 4305–4314.
-
(2015)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 4305-4314
-
-
Wang, L.1
Qiao, Y.2
Tang, X.3
-
144
-
-
84955300999
-
Towards Good Practices for Very Deep Two-Stream ConvNets
-
CoRR abs/1507.02159
-
[144] Wang, L., Xiong, Y., Wang, Z., Qiao, Y., Towards Good Practices for Very Deep Two-Stream ConvNets. 2015 CoRR abs/1507.02159.
-
(2015)
-
-
Wang, L.1
Xiong, Y.2
Wang, Z.3
Qiao, Y.4
-
147
-
-
79957467077
-
Hidden part models for human action recognition: probabilistic versus max margin
-
[147] Wang, Y., Mori, G., Hidden part models for human action recognition: probabilistic versus max margin. IEEE Trans. Pattern Anal. Mach. Intell. 33:7 (2011), 1310–1323.
-
(2011)
IEEE Trans. Pattern Anal. Mach. Intell.
, vol.33
, Issue.7
, pp. 1310-1323
-
-
Wang, Y.1
Mori, G.2
-
148
-
-
33750025833
-
Free viewpoint action recognition using motion history volumes
-
[148] Weinland, D., Ronfard, R., Boyer, E., Free viewpoint action recognition using motion history volumes. Comput. Vis. Image Underst. 104:23 (2006), 249–257.
-
(2006)
Comput. Vis. Image Underst.
, vol.104
, Issue.23
, pp. 249-257
-
-
Weinland, D.1
Ronfard, R.2
Boyer, E.3
-
149
-
-
78751648503
-
A survey of vision-based methods for action representation, segmentation and recognition
-
[149] Weinland, D., Ronfard, R., Boyer, E., A survey of vision-based methods for action representation, segmentation and recognition. Comput. Vis. Image Underst. 115:2 (2011), 224–241.
-
(2011)
Comput. Vis. Image Underst.
, vol.115
, Issue.2
, pp. 224-241
-
-
Weinland, D.1
Ronfard, R.2
Boyer, E.3
-
150
-
-
56749155587
-
An efficient dense and scale-invariant spatio-temporal interest point detector
-
[150] Willems, G., Tuytelaars, T., Gool, L., An efficient dense and scale-invariant spatio-temporal interest point detector. Proc. European Conference on Computer Vision (ECCV), 2008, 650–663.
-
(2008)
Proc. European Conference on Computer Vision (ECCV)
, pp. 650-663
-
-
Willems, G.1
Tuytelaars, T.2
Gool, L.3
-
151
-
-
61549128441
-
Robust face recognition via sparse representation
-
[151] Wright, J., Yang, A.Y., Ganesh, A., Sastry, S.S., Ma, Y., Robust face recognition via sparse representation. IEEE Trans. Pattern Anal. Mach. Intell. 31:2 (2009), 210–227.
-
(2009)
IEEE Trans. Pattern Anal. Mach. Intell.
, vol.31
, Issue.2
, pp. 210-227
-
-
Wright, J.1
Yang, A.Y.2
Ganesh, A.3
Sastry, S.S.4
Ma, Y.5
-
152
-
-
84911433150
-
Towards good practices for action video encoding
-
[152] Wu, J., Zhang, Y., Lin, W., Towards good practices for action video encoding. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014, 2577–2584.
-
(2014)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 2577-2584
-
-
Wu, J.1
Zhang, Y.2
Lin, W.3
-
153
-
-
84990020516
-
Fusing Multi-Stream Deep Networks for Video Classification
-
CoRR
-
[153] Wu, Z., Jiang, Y., Wang, X., Ye, H., Xue, X., Wang, J., Fusing Multi-Stream Deep Networks for Video Classification. 2015, CoRR.
-
(2015)
-
-
Wu, Z.1
Jiang, Y.2
Wang, X.3
Ye, H.4
Xue, X.5
Wang, J.6
-
154
-
-
84941212449
-
Action recognition using hybrid feature descriptor and VLAD video encoding
-
[154] Xing, D., Wang, X., Lu, H., Action recognition using hybrid feature descriptor and VLAD video encoding. Computer Vision — ACCV 2014 Workshops: Singapore, Singapore, November 1–2, 2014, Revised Selected Papers, Part I, 2015, 99–112.
-
(2015)
Computer Vision — ACCV 2014 Workshops: Singapore, Singapore, November 1–2, 2014, Revised Selected Papers, Part I
, pp. 99-112
-
-
Xing, D.1
Wang, X.2
Lu, H.3
-
155
-
-
84906494714
-
Modeling video dynamics with deep dynencoder
-
[155] Yan, X., Chang, H., Shan, S., Chen, X., Modeling video dynamics with deep dynencoder. Proc. European Conference on Computer Vision (ECCV), 2014, 215–230.
-
(2014)
Proc. European Conference on Computer Vision (ECCV)
, pp. 215-230
-
-
Yan, X.1
Chang, H.2
Shan, S.3
Chen, X.4
-
156
-
-
33846013241
-
Object tracking: a survey
-
[156] Yilmaz, A., Javed, O., Shah, M., Object tracking: a survey. ACM Comput. Surv., 38(4), 2006.
-
(2006)
ACM Comput. Surv.
, vol.38
, Issue.4
-
-
Yilmaz, A.1
Javed, O.2
Shah, M.3
-
157
-
-
33745142597
-
Actions sketch: a novel action representation
-
[157] Yilmaz, A., Shah, M., Actions sketch: a novel action representation. Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol. 1, 2005, 984–989.
-
(2005)
Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
, vol.1
, pp. 984-989
-
-
Yilmaz, A.1
Shah, M.2
-
159
-
-
54849197900
-
Crowd analysis: a survey
-
[159] Zhan, B., Monekosso, D.N., Remagnino, P., Velastin, S.A., Xu, L.-Q., Crowd analysis: a survey. Mach. Vis. Appl. 19:5 (2008), 345–357.
-
(2008)
Mach. Vis. Appl.
, vol.19
, Issue.5
, pp. 345-357
-
-
Zhan, B.1
Monekosso, D.N.2
Remagnino, P.3
Velastin, S.A.4
Xu, L.-Q.5
-
160
-
-
34247557079
-
Dynamic texture recognition using local binary patterns with an application to facial expressions
-
[160] Zhao, G., Pietikainen, M., Dynamic texture recognition using local binary patterns with an application to facial expressions. IEEE Trans. Pattern Anal. Mach. Intell. 29:6 (2007), 915–928.
-
(2007)
IEEE Trans. Pattern Anal. Mach. Intell.
, vol.29
, Issue.6
, pp. 915-928
-
-
Zhao, G.1
Pietikainen, M.2
-
161
-
-
79952503594
-
Sparse coding on local spatial–temporal volumes for human action recognition
-
Springer
-
[161] Zhu, Y., Zhao, X., Fu, Y., Liu, Y., Sparse coding on local spatial–temporal volumes for human action recognition. Proc. Asian Conference on Computer Vision (ACCV), 2011, Springer, 660–671.
-
(2011)
Proc. Asian Conference on Computer Vision (ACCV)
, pp. 660-671
-
-
Zhu, Y.1
Zhao, X.2
Fu, Y.3
Liu, Y.4
|