-
1
-
-
84874768452
-
RAVEL: An annotated corpus for training robots with audiovisual abilities
-
Alameda-Pineda, X., Sanchez-Riera, J., Wienke, J., Franc, V., Cech, J., Kulkarni, K., et al. (2013). RAVEL: An annotated corpus for training robots with audiovisual abilities. Journal on Multimodal User Interfaces, 7(1–2), 79–91.
-
(2013)
Journal on Multimodal User Interfaces
, vol.7
, Issue.1-2
, pp. 79-91
-
-
Alameda-Pineda, X.1
Sanchez-Riera, J.2
Wienke, J.3
Franc, V.4
Cech, J.5
Kulkarni, K.6
Deleforge, A.7
Horaud, R.P.8
-
2
-
-
67650957596
-
A unified framework for gesture recognition and spatiotemporal gesture segmentation
-
Alon, J., Athitsos, V., Yuan, Q., & Sclaroff, S. (2009). A unified framework for gesture recognition and spatiotemporal gesture segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 31(9), 1685–1699.
-
(2009)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.31
, Issue.9
, pp. 1685-1699
-
-
Alon, J.1
Athitsos, V.2
Yuan, Q.3
Sclaroff, S.4
-
3
-
-
38349001348
-
Human motion recognition using isomap and dynamic time warping
-
Springer, Berlin
-
Blackburn, J., & Ribeiro, E. (2007). Human motion recognition using isomap and dynamic time warping. Human motion-understanding, modeling, capture and animation (pp. 285–298). Berlin: Springer.
-
(2007)
Human motion-understanding, modeling, capture and animation
, pp. 285-298
-
-
Blackburn, J.1
Ribeiro, E.2
-
5
-
-
78149346980
-
Activities as time series of human postures
-
Paragios N, (ed), Springer, Berlin
-
Brendel, W., & Todorovic, S. (2010). Activities as time series of human postures. In N. Paragios (Ed.), Computer Vision-ECCV 2010 (pp. 721–734). Berlin: Springer.
-
(2010)
Computer Vision-ECCV 2010
, pp. 721-734
-
-
Brendel, W.1
Todorovic, S.2
-
6
-
-
0035273106
-
Atomic decomposition by basis pursuit
-
Chen, S. S., Donoho, D. L., & Saunders, M. A. (2001). Atomic decomposition by basis pursuit. SIAM Rev, 43(1), 129–159.
-
(2001)
SIAM Rev
, vol.43
, Issue.1
, pp. 129-159
-
-
Chen, S.S.1
Donoho, D.L.2
Saunders, M.A.3
-
7
-
-
85067717513
-
-
Csurka, G., Dance, C. R., Fan, L., Willamowski, J., & Bray, C. (2004). Visual categorization with bags of keypoints. In ECCV Workshop on Statistical Learning in Computer Vision
-
Csurka, G., Dance, C. R., Fan, L., Willamowski, J., & Bray, C. (2004). Visual categorization with bags of keypoints. In ECCV Workshop on Statistical Learning in Computer Vision.
-
-
-
-
8
-
-
84892583619
-
-
Escalera, S., Gonzàlez, J., Baró, X., Reyes, M., Lopes, O., Guyon, I., Athitsos, V., & Escalante, H. J. (2013). Multi-modal gesture recognition challenge 2013: Dataset and results. In ChaLearn Multi-modal Gesture Recognition Grand Challenge and Workshop, 15th ACM International Conference on Multimodal Interaction
-
Escalera, S., Gonzàlez, J., Baró, X., Reyes, M., Lopes, O., Guyon, I., Athitsos, V., & Escalante, H. J. (2013). Multi-modal gesture recognition challenge 2013: Dataset and results. In ChaLearn Multi-modal Gesture Recognition Grand Challenge and Workshop, 15th ACM International Conference on Multimodal Interaction.
-
-
-
-
9
-
-
50249122257
-
Parametric image alignment using enhanced correlation coefficient maximization
-
Evangelidis, G. D., & Psarakis, E. Z. (2008). Parametric image alignment using enhanced correlation coefficient maximization. IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(10), 1858–1865.
-
(2008)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.30
, Issue.10
, pp. 1858-1865
-
-
Evangelidis, G.D.1
Psarakis, E.Z.2
-
10
-
-
70349227947
-
The application of hidden Markov models in speech recognition
-
Gales, M., & Young, S. (2008). The application of hidden Markov models in speech recognition. Foundations and Trends in Signal Processing, 1(3), 195–304.
-
(2008)
Foundations and Trends in Signal Processing
, vol.1
, Issue.3
, pp. 195-304
-
-
Gales, M.1
Young, S.2
-
11
-
-
80052903155
-
The in-crowd algorithm for fast basis pursuit denoising
-
Gill, P. R., Wang, A., & Molnar, A. (2011). The in-crowd algorithm for fast basis pursuit denoising. IEEE Transactions on Signal Processing, 59(10), 4595–4605.
-
(2011)
IEEE Transactions on Signal Processing
, vol.59
, Issue.10
, pp. 4595-4605
-
-
Gill, P.R.1
Wang, A.2
Molnar, A.3
-
12
-
-
84856634625
-
-
Gong, D., & Medioni, G. (2011) Dynamic manifold warping for view invariant action recognition. In IEEE International Conference on Computer Vision, (pp. 571–578). IEEE
-
Gong, D., & Medioni, G. (2011) Dynamic manifold warping for view invariant action recognition. In IEEE International Conference on Computer Vision, (pp. 571–578). IEEE.
-
-
-
-
13
-
-
84958953848
-
HMM-based continuous sign language recognition using stochastic grammars
-
Braffort A, Gherbi R, Gibet S, Teil D, Richardson J, (eds), Lecture Notes in Computer Science, 1739, Springer, Berlin
-
Hienz, H., Bauer, B., & Kraiss, K. F. (1999). HMM-based continuous sign language recognition using stochastic grammars. In A. Braffort, R. Gherbi, S. Gibet, D. Teil, & J. Richardson (Eds.), Gesture-based communication in human-computer interaction (Vol. 1739, pp. 185–196)., Lecture Notes in Computer Science Berlin: Springer.
-
(1999)
Gesture-based communication in human-computer interaction
, pp. 185-196
-
-
Hienz, H.1
Bauer, B.2
Kraiss, K.F.3
-
14
-
-
80052873938
-
-
Hoai, M., Lan, Z. Z., & De la Torre, F. (2011). Joint segmentation and classification of human actions in video. In 2011 IEEE Conference on Computer Vision and Pattern Recognition CVPR. (pp. 3265–3272). IEEE
-
Hoai, M., Lan, Z. Z., & De la Torre, F. (2011). Joint segmentation and classification of human actions in video. In 2011 IEEE Conference on Computer Vision and Pattern Recognition CVPR. (pp. 3265–3272). IEEE.
-
-
-
-
15
-
-
67349195493
-
Histogram of oriented rectangles: A new pose descriptor for human action recognition
-
Ikizler, N., & Duygulu, P. (2009). Histogram of oriented rectangles: A new pose descriptor for human action recognition. Image and Vision Computing, 27(10), 1515–1526.
-
(2009)
Image and Vision Computing
, vol.27
, Issue.10
, pp. 1515-1526
-
-
Ikizler, N.1
Duygulu, P.2
-
16
-
-
84887398298
-
-
Jain, M., Jégou, H., & Bouthémy, P. (2013). Better exploiting motion for better action recognition. In Computer Vision and Pattern Recognition, (pp. 2555–2562). IEEE
-
Jain, M., Jégou, H., & Bouthémy, P. (2013). Better exploiting motion for better action recognition. In Computer Vision and Pattern Recognition, (pp. 2555–2562). IEEE.
-
-
-
-
17
-
-
84867849524
-
-
Jiang, Y. G., Dai, Q., Xue, X., Liu, W., & Ngo, C. W. (2012). Trajectory-based modeling of human actions with motion reference points. In European Conference on Computer Vision, (pp. 425–438). Berlin :Springer
-
Jiang, Y. G., Dai, Q., Xue, X., Liu, W., & Ngo, C. W. (2012). Trajectory-based modeling of human actions with motion reference points. In European Conference on Computer Vision, (pp. 425–438). Berlin:Springer.
-
-
-
-
18
-
-
85067719968
-
-
Kulkarni, K., Cherla, S., Kale, A., & Ramasubramanian, V. (2008). A framework for indexing human actions in video. In The 1st International Workshop on Machine Learning for Vision-based Motion Analysis-MLVMA’08
-
Kulkarni, K., Cherla, S., Kale, A., & Ramasubramanian, V. (2008). A framework for indexing human actions in video. In The 1st International Workshop on Machine Learning for Vision-based Motion Analysis-MLVMA’08.
-
-
-
-
19
-
-
51949083365
-
-
Laptev, I., Marszalek, M., Schmid, C., & Rozenfeld, B. (2008) Learning realistic human actions from movies. In IEEE Conference on Computer Vision and Pattern Recognition, 2008. CVPR 2008, (pp. 1–8). IEEE
-
Laptev, I., Marszalek, M., Schmid, C., & Rozenfeld, B. (2008) Learning realistic human actions from movies. In IEEE Conference on Computer Vision and Pattern Recognition, 2008. CVPR 2008, (pp. 1–8). IEEE.
-
-
-
-
20
-
-
0024769238
-
A frame-synchronous network search algorithm for connected word recognition
-
Lee, C., & Rabiner, L. (1989). A frame-synchronous network search algorithm for connected word recognition. IEEE Transactions on Acoustics, Speech and Signal Processing, 37(11), 1649–1658.
-
(1989)
IEEE Transactions on Acoustics, Speech and Signal Processing
, vol.37
, Issue.11
, pp. 1649-1658
-
-
Lee, C.1
Rabiner, L.2
-
21
-
-
84905385057
-
-
Liang, R., & Ouhyoung, M. (1998). A real-time continuous gesture recognition system for sign language. In Third IEEE International Conference on Automatic Face and Gesture Recognition, 1998, (pp. 558–567). IEEE
-
Liang, R., & Ouhyoung, M. (1998). A real-time continuous gesture recognition system for sign language. In Third IEEE International Conference on Automatic Face and Gesture Recognition, 1998, (pp. 558–567). IEEE.
-
-
-
-
22
-
-
33745834282
-
-
Lv, F., & Nevatia, R. (2006). Recognition and segmentation of 3-d human action using HMM and multi-class AdaBoost. In European Conference on Computer Vision, (pp. 359–372). Berlin: Springer
-
Lv, F., & Nevatia, R. (2006). Recognition and segmentation of 3-d human action using HMM and multi-class AdaBoost. In European Conference on Computer Vision, (pp. 359–372). Berlin: Springer.
-
-
-
-
23
-
-
34948833676
-
-
Lv, F., & Nevatia, R. (2007). Single view human action recognition using key pose matching and Viterbi path searching. In Computer Vision and Pattern Recognition, 2007. CVPR’07, (pp. 1–8). IEEE
-
Lv, F., & Nevatia, R. (2007). Single view human action recognition using key pose matching and Viterbi path searching. In Computer Vision and Pattern Recognition, 2007. CVPR’07, (pp. 1–8). IEEE.
-
-
-
-
25
-
-
85117427589
-
-
Marszalek, M., Laptev, I., & Schmid, C. (2009) Actions in context. In IEEE Conference on Computer Vision and Pattern Recognition, (pp. 2929–2936). IEEE
-
Marszalek, M., Laptev, I., & Schmid, C. (2009) Actions in context. In IEEE Conference on Computer Vision and Pattern Recognition, (pp. 2929–2936). IEEE.
-
-
-
-
26
-
-
34948881770
-
-
Morency, L., Quattoni, A., & Darrell, T. (2007). Latent-dynamic discriminative models for continuous gesture recognition. In Computer Vision and Pattern Recognition, (pp. 1–8). IEEE
-
Morency, L., Quattoni, A., & Darrell, T. (2007). Latent-dynamic discriminative models for continuous gesture recognition. In Computer Vision and Pattern Recognition, (pp. 1–8). IEEE.
-
-
-
-
28
-
-
0021406359
-
The use of a one-stage dynamic programming algorithm for connected word recognition
-
Ney, H. (1984). The use of a one-stage dynamic programming algorithm for connected word recognition. IEEE Transactions on Acoustics, Speech and Signal Processing, 32(2), 263–271.
-
(1984)
IEEE Transactions on Acoustics, Speech and Signal Processing
, vol.32
, Issue.2
, pp. 263-271
-
-
Ney, H.1
-
29
-
-
85032751521
-
Dynamic programming search for continuous speech recognition
-
Ney, H., & Ortmanns, S. (1999). Dynamic programming search for continuous speech recognition. IEEE Signal Processing Magazine, 16(5), 64–83.
-
(1999)
IEEE Signal Processing Magazine
, vol.16
, Issue.5
, pp. 64-83
-
-
Ney, H.1
Ortmanns, S.2
-
30
-
-
56749132994
-
-
Ning, H., Xu, W., Gong, Y., Huang, T. (2008). Latent pose estimator for continuous action recognition. In European Conference on Computer Vision, (pp. 419–433). Springer
-
Ning, H., Xu, W., Gong, Y., Huang, T. (2008). Latent pose estimator for continuous action recognition. In European Conference on Computer Vision, (pp. 419–433). Springer.
-
-
-
-
32
-
-
0018724280
-
Two-level DP-matching - a dynamic programming-based pattern matching algorithm for connected word recognition
-
Sakoe, H. (1979). Two-level DP-matching - a dynamic programming-based pattern matching algorithm for connected word recognition. IEEE Transactions on Acoustic, Speech, and Signal Processing, 27(6), 588–595.
-
(1979)
IEEE Transactions on Acoustic, Speech, and Signal Processing
, vol.27
, Issue.6
, pp. 588-595
-
-
Sakoe, H.1
-
33
-
-
85067714531
-
-
Sanchez-Riera, J., Cech, J., Horaud, R. P. (2012). Action recognition robust to background clutter by using stereo vision. In The Fourth International Workshop on Video Event Categorization, Tagging and Retrieval, LNCS: Springer
-
Sanchez-Riera, J., Cech, J., Horaud, R. P. (2012). Action recognition robust to background clutter by using stereo vision. In The Fourth International Workshop on Video Event Categorization, Tagging and Retrieval, LNCS: Springer.
-
-
-
-
34
-
-
79953179659
-
Discriminative human action segmentation and recognition using SMMs
-
Shi, Q., Wang, L., Cheng, L., & Smola, A. (2011). Discriminative human action segmentation and recognition using SMMs. IJCV, 93(1), 22–32.
-
(2011)
IJCV
, vol.93
, Issue.1
, pp. 22-32
-
-
Shi, Q.1
Wang, L.2
Cheng, L.3
Smola, A.4
-
35
-
-
75149150235
-
Humaneva: Synchronized video and motion capture dataset and baseline algorithm for evaluation of articulated human motion
-
Sigal, L., Balan, A., & Black, M. (2010). Humaneva: Synchronized video and motion capture dataset and baseline algorithm for evaluation of articulated human motion. International Journal of Computer Vision, 87(1), 4–27.
-
(2010)
International Journal of Computer Vision
, vol.87
, Issue.1
, pp. 4-27
-
-
Sigal, L.1
Balan, A.2
Black, M.3
-
36
-
-
62249222499
-
Efficient visual search of videos cast as text retrieval
-
Sivic, J., & Zisserman, A. (2009). Efficient visual search of videos cast as text retrieval. IEEE Transactions on PAMI, 31(4), 591–606.
-
(2009)
IEEE Transactions on PAMI
, vol.31
, Issue.4
, pp. 591-606
-
-
Sivic, J.1
Zisserman, A.2
-
37
-
-
33749993686
-
Conditional models for contextual human motion recognition
-
Sminchisescu, C., Kanaujia, A., & Metaxas, D. N. (2006). Conditional models for contextual human motion recognition. CVIU, 104(2–3), 210–220.
-
(2006)
CVIU
, vol.104
, Issue.2-3
, pp. 210-220
-
-
Sminchisescu, C.1
Kanaujia, A.2
Metaxas, D.N.3
-
38
-
-
84885330892
-
Classifying web videos using a global video descriptor
-
Solmaz, B., Assari, S. M., & Shah, M. (2013). Classifying web videos using a global video descriptor. Machine vision and applications, 24(7), 1473–1485.
-
(2013)
Machine vision and applications
, vol.24
, Issue.7
, pp. 1473-1485
-
-
Solmaz, B.1
Assari, S.M.2
Shah, M.3
-
39
-
-
0032304547
-
Real-time american sign language recognition using desk and wearable computer based video
-
Starner, T., Weaver, J., & Pentland, A. (1998). Real-time american sign language recognition using desk and wearable computer based video. IEEE Transactions on Pattern Analysis and Machine Intelligence, 20(12), 1371–1375.
-
(1998)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.20
, Issue.12
, pp. 1371-1375
-
-
Starner, T.1
Weaver, J.2
Pentland, A.3
-
40
-
-
64649083745
-
Signal recovery from random measurements via orthogonal matching pursuit
-
Tropp, J. A., & Gilbert, A. C. (2007). Signal recovery from random measurements via orthogonal matching pursuit. IEEE Transactions on Information Theory, 53(12), 4655–4666.
-
(2007)
IEEE Transactions on Information Theory
, vol.53
, Issue.12
, pp. 4655-4666
-
-
Tropp, J.A.1
Gilbert, A.C.2
-
41
-
-
84898428599
-
N,, Laptev, I
-
Ullah, M. M., Parizi, S. N,, Laptev, I. (2010). Improving bag-of-features action recognition with non-local cues. In British Machine Vision Conference. (Vol. 10, pp. 95–101).
-
(2010)
Improving bag-of-features action recognition with non-local cues. In British Machine Vision Conference
, vol.10
, pp. 95-101
-
-
Ullah, M.M.1
Parizi, S.2
-
42
-
-
57749118369
-
-
Vail, D., Veloso, M., & Lafferty, J. (2007). Conditional random fields for activity recognition. In Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems, (p. 235). ACM
-
Vail, D., Veloso, M., & Lafferty, J. (2007). Conditional random fields for activity recognition. In Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems, (p. 235). ACM.
-
-
-
-
43
-
-
34250411858
-
Element-wise recognition of continuous speech composed of words from a specified dictionary
-
Vintsyuk, T. (1971). Element-wise recognition of continuous speech composed of words from a specified dictionary. Cybernetics and Systems Analysis, 7(2), 361–372.
-
(1971)
Cybernetics and Systems Analysis
, vol.7
, Issue.2
, pp. 361-372
-
-
Vintsyuk, T.1
-
45
-
-
0035270479
-
A framework for recognizing the simultaneous aspects of american sign language
-
Vogler, C., & Metaxas, D. (2001). A framework for recognizing the simultaneous aspects of american sign language. Computer Vision and Image Understanding, 81(3), 358–384.
-
(2001)
Computer Vision and Image Understanding
, vol.81
, Issue.3
, pp. 358-384
-
-
Vogler, C.1
Metaxas, D.2
-
46
-
-
84898805910
-
-
Wang, H., & Schmid, C. (2013). Action recognition with improved trajectories. In International Conference on Computer Vision, (pp. 3551–3558). IEEE
-
Wang, H., & Schmid, C. (2013). Action recognition with improved trajectories. In International Conference on Computer Vision, (pp. 3551–3558). IEEE.
-
-
-
-
47
-
-
0003901486
-
Token passing: a simple conceptual model for connected speech recognition systems. Technical Report 38, University of Cambridge
-
Young, S., Russell, N. H., & Thornton, J. (1989). Token passing: a simple conceptual model for connected speech recognition systems. Technical Report 38, University of Cambridge, Department of Engineering.
-
(1989)
Department of Engineering
-
-
Young, S.1
Russell, N.H.2
Thornton, J.3
-
48
-
-
0003483593
-
HTK: Hidden Markov model toolkit v1. 5. Technical Report, University of Cambridge
-
Young, S., Woodland, P., & Byrne, W. (1993). HTK: Hidden Markov model toolkit v1. 5. Technical Report, University of Cambridge, Department of Engineering.
-
(1993)
Department of Engineering
-
-
Young, S.1
Woodland, P.2
Byrne, W.3
-
49
-
-
85067733602
-
-
The HTK book. Technical Report: University of Cambridge, Department of Engineerin
-
Young, S., Evermann, G., Kershaw, D., Moore, G., Odell, J., Ollason, D., et al. (2009). The HTK book. Technical Report: University of Cambridge, Department of Engineering.
-
(2009)
-
-
Young, S.1
Evermann, G.2
Kershaw, D.3
Moore, G.4
Odell, J.5
Ollason, D.6
|