-
1
-
-
84894907010
-
Semantic video retrieval using audio analysis
-
Lew, M.; Sebe, N.; Eakins, J. (eds.) Springer, International Conference on Image and Video Retrieval, London
-
Bakker, E.; Lew, M.: Semantic video retrieval using audio analysis. In: Lew, M.; Sebe, N.; Eakins, J. (eds.) Image and video retrieval. Lecture Notes in Computer Science, vol. 2383, pp. 201-218. Springer, International Conference on Image and Video Retrieval, London (2002)
-
(2002)
Image and Video Retrieval. Lecture Notes in Computer Science
, vol.2383
, pp. 201-218
-
-
Bakker, E.1
Lew, M.2
-
2
-
-
84905181679
-
Irit at trecvid 2010: Hidden markov models for context-aware late fusion of multiple audio classifiers
-
Bredin, H.; Koenig, L.; Farinas, J.: Irit at trecvid 2010: Hidden markov models for context-aware late fusion of multiple audio classifiers. In: TRECVID 2010 Notebook papers (2010)
-
(2010)
TRECVID 2010 Notebook Papers
-
-
Bredin, H.1
Koenig, L.2
Farinas, J.3
-
3
-
-
33645146449
-
Histograms of oriented gradients for human detection
-
IEEE Computer Society, Washington, DC
-
Dalal, N.; Triggs, B.: Histograms of oriented gradients for human detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 886-893. IEEE Computer Society, Washington, DC (2005)
-
(2005)
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
, vol.1
, pp. 886-893
-
-
Dalal, N.1
Triggs, B.2
-
5
-
-
0019053271
-
Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
-
10.1109/TASSP.1980.1163420
-
Davis, S.; Mermelstein, P.: Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans. Acoust. Speech Signal Process. 28(4), 357-366 (1980)
-
(1980)
IEEE Trans. Acoust. Speech Signal Process.
, vol.28
, Issue.4
, pp. 357-366
-
-
Davis, S.1
Mermelstein, P.2
-
7
-
-
36248957499
-
Actions as space-time shapes
-
10.1109/TPAMI.2007.70711
-
Gorelick, L.; Blank, M.; Shechtman, E.; Irani, M.; Basri, R.: Actions as space-time shapes. Trans. Pattern Anal. Mach. Intell. 29(12), 2247-2253 (2007)
-
(2007)
Trans. Pattern Anal. Mach. Intell.
, vol.29
, Issue.12
, pp. 2247-2253
-
-
Gorelick, L.1
Blank, M.2
Shechtman, E.3
Irani, M.4
Basri, R.5
-
8
-
-
84905233993
-
Tokyotech+canon at trecvid 2011
-
Inoue, N.; Wada, T.; Kamishima, Y.; Shinoda, K.; Sato, S.: Tokyotech+canon at trecvid 2011. In: TRECVID 2011 Notebook papers (2011)
-
(2011)
TRECVID 2011 Notebook Papers
-
-
Inoue, N.1
Wada, T.2
Kamishima, Y.3
Shinoda, K.4
Sato, S.5
-
9
-
-
79959766559
-
Consumer video understanding: A benchmark database and an evaluation of human and machine performance
-
Jiang, Y.G.; Ye, G.; Chang, S.F.; Ellis, D.; Loui, A.C.: Consumer video understanding: a benchmark database and an evaluation of human and machine performance. In: Proceedings of ACM International Conference on Multimedia Retrieval (ICMR), oral session (2011)
-
(2011)
Proceedings of ACM International Conference on Multimedia Retrieval (ICMR), Oral Session
-
-
Jiang, Y.G.1
Ye, G.2
Chang, S.F.3
Ellis, D.4
Loui, A.C.5
-
10
-
-
24944451092
-
On space-time interest points
-
10.1007/s11263-005-1838-7
-
Laptev, I.: On space-time interest points. Int. J. Comput. Vis. 64(2/3), 107-123 (2005)
-
(2005)
Int. J. Comput. Vis.
, vol.64
, Issue.23
, pp. 107-123
-
-
Laptev, I.1
-
13
-
-
84873572465
-
MIR in Matlab (ii): A toolbox for musical feature extraction from audio
-
Lartillot, O.; Toiviainen, P.: MIR in Matlab (ii): a toolbox for musical feature extraction from audio. In: ISMIR, pp. 127-130 (2007)
-
(2007)
ISMIR
, pp. 127-130
-
-
Lartillot, O.1
Toiviainen, P.2
-
14
-
-
84905158219
-
Pku-idm at trecvid 2010: Copy detection with visual-audio feature fusion and sequential pyramid matching
-
Li, Y.; Mou, L.; Jiang, M.; Su, C.; Fang, X.; Qian, M.; Tian, Y.; Wang, Y.; Huang, T.; Gao, W.: Pku-idm at trecvid 2010: copy detection with visual-audio feature fusion and sequential pyramid matching. In: TRECVID 2010 Notebook papers (2010)
-
(2010)
TRECVID 2010 Notebook Papers
-
-
Li, Y.1
Mou, L.2
Jiang, M.3
Su, C.4
Fang, X.5
Qian, M.6
Tian, Y.7
Wang, Y.8
Huang, T.9
Gao, W.10
-
15
-
-
0001457509
-
Some methods for classification and analysis of multivariate observations
-
Cam, L.M.L.; Neyman, J. (eds.) University of California Press
-
MacQueen, J.B.: Some methods for classification and analysis of multivariate observations. In: Cam, L.M.L.; Neyman, J. (eds.) Proceedings of the fifth Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, pp. 281-297. University of California Press (1967)
-
(1967)
Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability
, vol.1
, pp. 281-297
-
-
Macqueen, J.B.1
-
16
-
-
0002322469
-
On a test of whether one of two random variables is stochastically larger than the other
-
10.1214/aoms/1177730491 0041.26103 22058
-
Mann, H.B.; Whitney, D.R.: On a test of whether one of two random variables is stochastically larger than the other. Ann. Math. Stat. 18(1), 50-60 (1947)
-
(1947)
Ann. Math. Stat.
, vol.18
, Issue.1
, pp. 50-60
-
-
Mann, H.B.1
Whitney, D.R.2
-
19
-
-
15044354466
-
Automatic analysis of multimodal group actions in meetings
-
10.1109/TPAMI.2005.49
-
McCowan, I.; Gatica-Perez, D.; Bengio, S.; Lathoud, G.; Barnard, M.; Zhang, D.: Automatic analysis of multimodal group actions in meetings. IEEE Trans. Pattern Anal. Mach. Intell. 27(3), 305-317 (2005)
-
(2005)
IEEE Trans. Pattern Anal. Mach. Intell.
, vol.27
, Issue.3
, pp. 305-317
-
-
McCowan, I.1
Gatica-Perez, D.2
Bengio, S.3
Lathoud, G.4
Barnard, M.5
Zhang, D.6
-
20
-
-
84867753906
-
Structured learning of human interactions in TV shows
-
10.1109/TPAMI.2012.24
-
Patron-Perez, A.; Marszalek, M.; Reid, I.; Zisserman, A.: Structured learning of human interactions in TV shows. IEEE Trans. Pattern Anal. Mach. Intell. 34(12), 2441-2453 (2012)
-
(2012)
IEEE Trans. Pattern Anal. Mach. Intell.
, vol.34
, Issue.12
, pp. 2441-2453
-
-
Patron-Perez, A.1
Marszalek, M.2
Reid, I.3
Zisserman, A.4
-
21
-
-
84905269591
-
Genie trecvid2011 multimedia event detection: Late-fusion approaches to combine multiple audio-visual features
-
Perera, A.G.A.; Oh, S.; Leotta, M.; Kim, I.; Byun, B.; Lee, C.H.; McCloskey, S.; Liu, J.; Miller, B.; Huang, Z.F.; Vahdat, A.; Yang, W.; Mori, G.; Tang, K.; Koller, D.; Fei-Fei, L.; Li, K.; Chen, G.; Corso, J.; Fu, Y.; Srihari, R.: Genie trecvid2011 multimedia event detection: late-fusion approaches to combine multiple audio-visual features. In: TRECVID 2011 Notebook papers (2011)
-
(2011)
TRECVID 2011 Notebook Papers
-
-
Perera, A.G.A.1
Oh, S.2
Leotta, M.3
Kim, I.4
Byun, B.5
Lee, C.H.6
McCloskey, S.7
Liu, J.8
Miller, B.9
Huang, Z.F.10
Vahdat, A.11
Yang, W.12
Mori, G.13
Tang, K.14
Koller, D.15
Fei-Fei, L.16
Li, K.17
Chen, G.18
Corso, J.19
Fu, Y.20
Srihari, R.21
more..
-
22
-
-
0003474751
-
-
2 edn. Cambridge University Press, Cambridge
-
Press, W.H.; Teukolsky, S.A.; Vetterling, W.; Flannery, B.P.: Numerical recipes in C++: the art of scientific computing, 2 edn. Cambridge University Press, Cambridge (2002)
-
(2002)
Numerical Recipes in C++: The Art of Scientific Computing
-
-
Press, W.H.1
Teukolsky, S.A.2
Vetterling, W.3
Flannery, B.P.4
-
23
-
-
80053115991
-
Understanding interactions and guiding visual surveillance by tracking attention
-
Koch, R.; Huang, F. (eds.) Springer, Berlin/Heidelberg
-
Reid, I.; Benfold, B.; Patron, A.; Sommerlade, E.: Understanding interactions and guiding visual surveillance by tracking attention. In: Koch, R.; Huang, F. (eds.) Computer Vision - ACCV 2010 Workshops. Lecture Notes in Computer Science, vol. 6468, pp. 380-389. Springer, Berlin/Heidelberg (2011)
-
(2011)
Computer Vision - ACCV 2010 Workshops. Lecture Notes in Computer Science
, vol.6468
, pp. 380-389
-
-
Reid, I.1
Benfold, B.2
Patron, A.3
Sommerlade, E.4
-
24
-
-
77953187842
-
Spatio-temporal relationship match: Video structure comparison for recognition of complex human activities
-
Ryoo, M.; Aggarwal, J.: Spatio-temporal relationship match: Video structure comparison for recognition of complex human activities. In: Proceedings of the International Conference on Computer Vision, pp. 1593-1600 (2009)
-
(2009)
Proceedings of the International Conference on Computer Vision
, pp. 1593-1600
-
-
Ryoo, M.1
Aggarwal, J.2
-
25
-
-
10044233701
-
Recognizing human actions: A local SVM approach
-
Cambridge
-
Schüldt, C.; Laptev, I.; Caputo, B.: Recognizing human actions: a local SVM approach. In: Proceedings of the International Conference on Pattern Recognition, vol. 3, pp. 32-36, Cambridge (2004)
-
(2004)
Proceedings of the International Conference on Pattern Recognition
, vol.3
, pp. 32-36
-
-
Schüldt, C.1
Laptev, I.2
Caputo, B.3
-
26
-
-
78649922664
-
On the use of audio events for improving video scene segmentation
-
Sidiropoulos, P.; Mezaris, V.; Kompatsiaris, I.; Meinedo, H.; Bugalho, M.; Trancoso, I.: On the use of audio events for improving video scene segmentation. In: 2010 11th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS), pp. 1-4 (2010)
-
(2010)
2010 11th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS)
, pp. 1-4
-
-
Sidiropoulos, P.1
Mezaris, V.2
Kompatsiaris, I.3
Meinedo, H.4
Bugalho, M.5
Trancoso, I.6
-
28
-
-
34547401486
-
Evaluation campaigns and TRECVid
-
ACM Press, New York
-
Smeaton, A.F.; Over, P.; Kraaij, W.: Evaluation campaigns and TRECVid. In: MIR '06: Proceedings of the 8th ACM International Workshop on Multimedia Information Retrieval, pp. 321-330. ACM Press, New York (2006)
-
(2006)
MIR '06: Proceedings of the 8th ACM International Workshop on Multimedia Information Retrieval
, pp. 321-330
-
-
Smeaton, A.F.1
Over, P.2
Kraaij, W.3
-
29
-
-
84955035459
-
A scale for the measurement of the psychological magnitude of pitch
-
10.1121/1.1915893
-
Stevens, S.; Volkmann, J.; Newman, E.: A scale for the measurement of the psychological magnitude of pitch. J. Acoust. Soc. Am. 8, 185-190 (1937)
-
(1937)
J. Acoust. Soc. Am.
, vol.8
, pp. 185-190
-
-
Stevens, S.1
Volkmann, J.2
Newman, E.3
-
30
-
-
55149089260
-
Machine recognition of human activities: A survey
-
Turaga, P.; Chellappa, R.; Subrahmanian, V.S.; Udrea, O.: Machine recognition of human activities: a survey. IEEE Trans. Circ. Syst. Video Technol. 18(11), 1473-1488 (2008)
-
(2008)
IEEE Trans. Circ. Syst. Video Technol.
, vol.18
, Issue.11
, pp. 1473-1488
-
-
Turaga, P.1
Chellappa, R.2
Subrahmanian, V.S.3
Udrea, O.4
-
31
-
-
46149103415
-
Building audio classification for broadcast news retrieval
-
Tzanetakis, G.; Chen, M.: Building audio classification for broadcast news retrieval. In: Proceedings of WIAMIS (2004)
-
(2004)
Proceedings of WIAMIS
-
-
Tzanetakis, G.1
Chen, M.2
-
32
-
-
71149100224
-
More generality in efficient multiple kernel learning
-
Varma, M.; Babu, B.R.: More generality in efficient multiple kernel learning. In: ICML, p. 134 (2009)
-
(2009)
ICML
, pp. 134
-
-
Varma, M.1
Babu, B.R.2
-
34
-
-
84856194352
-
Efficient additive kernels via explicit feature maps
-
10.1109/TPAMI.2011.153
-
Vedaldi, A.; Zisserman, A.: Efficient additive kernels via explicit feature maps. IEEE Trans. Pattern Anal. Mach. Intell. 34(3), 480-492 (2012)
-
(2012)
IEEE Trans. Pattern Anal. Mach. Intell.
, vol.34
, Issue.3
, pp. 480-492
-
-
Vedaldi, A.1
Zisserman, A.2
-
36
-
-
84898890371
-
Evaluation of local spatio-temporal features for action recognition
-
Wang, H.; Ullah, M.; Kläser, A.; Laptev, I.; Schmid, C.: Evaluation of local spatio-temporal features for action recognition. In: Proceedings of the British Machine Vision Conference, p. 127 (2009)
-
(2009)
Proceedings of the British Machine Vision Conference
, pp. 127
-
-
Wang, H.1
Ullah, M.2
Kläser, A.3
Laptev, I.4
Schmid, C.5
-
37
-
-
33750025833
-
Free viewpoint action recognition using motion history volumes
-
Weinland, D.; Ronfard, R.; Boyer, E.: Free viewpoint action recognition using motion history volumes. Comput. Vis. Image Underst. 104(2-3), 249-257 (2006)
-
(2006)
Comput. Vis. Image Underst.
, vol.104
, Issue.2-3
, pp. 249-257
-
-
Weinland, D.1
Ronfard, R.2
Boyer, E.3
-
38
-
-
0001884644
-
Individual comparisons by ranking methods
-
10.2307/3001968
-
Wilcoxon, F.: Individual comparisons by ranking methods. Biom. Bull. 1(6), 80-83 (1945)
-
(1945)
Biom. Bull.
, vol.1
, Issue.6
, pp. 80-83
-
-
Wilcoxon, F.1
-
39
-
-
84856672971
-
Action recognition by learning bases of action attributes and parts
-
Barcelona, Spain
-
Yao, B.; Jiang, X.; Khosla, A.; Lin, A.; Guibas, L.; Fei-Fei, L.: Action recognition by learning bases of action attributes and parts. In: Proceedings of the International Conference on Computer Vision, Barcelona, Spain (2011)
-
(2011)
Proceedings of the International Conference on Computer Vision
-
-
Yao, B.1
Jiang, X.2
Khosla, A.3
Lin, A.4
Guibas, L.5
Fei-Fei, L.6
-
40
-
-
84864147835
-
Joint audio-visual bi-modal codewords for video event detection
-
Ye, G.; Jhuo, I.H.; Liu, D.; Jiang, Y.G.; Lee, D.T.; Chang, S.F.: Joint audio-visual bi-modal codewords for video event detection. In: ICMR, p. 39 (2012)
-
(2012)
ICMR
, pp. 39
-
-
Ye, G.1
Jhuo, I.H.2
Liu, D.3
Jiang, Y.G.4
Lee, D.T.5
Chang, S.F.6
-
41
-
-
84866712367
-
Robust late fusion with rank minimization
-
Ye, G.; Liu, D.; Jhuo, I.H.; Chang, S.F.: Robust late fusion with rank minimization. In: Proceedings of the IEEE Conference on Computer Vision and, Pattern Recognition, pp. 3021-3028 (2012)
-
(2012)
Proceedings of the IEEE Conference on Computer Vision And, Pattern Recognition
, pp. 3021-3028
-
-
Ye, G.1
Liu, D.2
Jhuo, I.H.3
Chang, S.F.4
|