SCOPUS Information Retrieval Platform

Machine Vision and Applications

Volume 25, Issue 1, 2014, Pages 71-84

Human interaction categorization by using audio-visual cues

(4) Marin Jimenez, M.J. a; Munoz Salinas, R. a; Yeguas Bolivar, E. a; Perez De La Blanca, N. b

a UNIVERSITY OF CÓRDOBA (Spain)

b UNIVERSITY OF GRANADA (Spain)

Author keywords

Audio; BOW; Human interactions; Video

Indexed keywords

EID: 84894902950 PISSN: 09328092 EISSN: 14321769 Source Type: Journal
DOI: 10.1007/s00138-013-0521-1 Document Type: Article

Times cited: (21)

References (41)

1
- 84894907010
- Semantic video retrieval using audio analysis
- Lew, M.; Sebe, N.; Eakins, J. (eds.) Springer, International Conference on Image and Video Retrieval, London
- Bakker, E.; Lew, M.: Semantic video retrieval using audio analysis. In: Lew, M.; Sebe, N.; Eakins, J. (eds.) Image and video retrieval. Lecture Notes in Computer Science, vol. 2383, pp. 201-218. Springer, International Conference on Image and Video Retrieval, London (2002)
- (2002) Image and Video Retrieval. Lecture Notes in Computer Science , vol.2383 , pp. 201-218
- Bakker, E.¹ Lew, M.²

2
- 84905181679
- Irit at trecvid 2010: Hidden markov models for context-aware late fusion of multiple audio classifiers
- Bredin, H.; Koenig, L.; Farinas, J.: Irit at trecvid 2010: Hidden markov models for context-aware late fusion of multiple audio classifiers. In: TRECVID 2010 Notebook papers (2010)
- (2010) TRECVID 2010 Notebook Papers
- Bredin, H.¹ Koenig, L.² Farinas, J.³

3
- 33645146449
- Histograms of oriented gradients for human detection
- IEEE Computer Society, Washington, DC
- Dalal, N.; Triggs, B.: Histograms of oriented gradients for human detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 886-893. IEEE Computer Society, Washington, DC (2005)
- (2005) Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , vol.1 , pp. 886-893
- Dalal, N.¹ Triggs, B.²

4
- 34948855444
- Human detection using oriented histograms of flow and appearance
- Dalal, N.; Triggs, B.; Schmid, C.: Human detection using oriented histograms of flow and appearance. In: Proceedings of the European Conference on Computer Vision (2006)
- (2006) Proceedings of the European Conference on Computer Vision
- Dalal, N.¹ Triggs, B.² Schmid, C.³

5
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- 10.1109/TASSP.1980.1163420
- Davis, S.; Mermelstein, P.: Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans. Acoust. Speech Signal Process. 28(4), 357-366 (1980)
- (1980) IEEE Trans. Acoust. Speech Signal Process. , vol.28 , Issue.4 , pp. 357-366
- Davis, S.¹ Mermelstein, P.²

6
- 85162332438
- Learning person-object interactions for action recognition in still images
- Delaitre, V.; Sivic, J.; Laptev, I.: Learning person-object interactions for action recognition in still images. In: Advances in Neural Information Processing Systems (2011)
- (2011) Advances in Neural Information Processing Systems
- Delaitre, V.¹ Sivic, J.² Laptev, I.³

7
- 36248957499
- Actions as space-time shapes
- 10.1109/TPAMI.2007.70711
- Gorelick, L.; Blank, M.; Shechtman, E.; Irani, M.; Basri, R.: Actions as space-time shapes. Trans. Pattern Anal. Mach. Intell. 29(12), 2247-2253 (2007)
- (2007) Trans. Pattern Anal. Mach. Intell. , vol.29 , Issue.12 , pp. 2247-2253
- Gorelick, L.¹ Blank, M.² Shechtman, E.³ Irani, M.⁴ Basri, R.⁵

8
- 84905233993
- Tokyotech+canon at trecvid 2011
- Inoue, N.; Wada, T.; Kamishima, Y.; Shinoda, K.; Sato, S.: Tokyotech+canon at trecvid 2011. In: TRECVID 2011 Notebook papers (2011)
- (2011) TRECVID 2011 Notebook Papers
- Inoue, N.¹ Wada, T.² Kamishima, Y.³ Shinoda, K.⁴ Sato, S.⁵

9
- 79959766559
- Consumer video understanding: A benchmark database and an evaluation of human and machine performance
- Jiang, Y.G.; Ye, G.; Chang, S.F.; Ellis, D.; Loui, A.C.: Consumer video understanding: a benchmark database and an evaluation of human and machine performance. In: Proceedings of ACM International Conference on Multimedia Retrieval (ICMR), oral session (2011)
- (2011) Proceedings of ACM International Conference on Multimedia Retrieval (ICMR), Oral Session
- Jiang, Y.G.¹ Ye, G.² Chang, S.F.³ Ellis, D.⁴ Loui, A.C.⁵

10
- 24944451092
- On space-time interest points
- 10.1007/s11263-005-1838-7
- Laptev, I.: On space-time interest points. Int. J. Comput. Vis. 64(2/3), 107-123 (2005)
- (2005) Int. J. Comput. Vis. , vol.64 , Issue.2/3 , pp. 107-123
- Laptev, I.¹

11
- 51949083365
- Learning realistic human actions from movies
- Laptev, I.; Marszalek, M.; Schmid, C.; Rozenfeld, B.: Learning realistic human actions from movies. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2008)
- (2008) Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
- Laptev, I.¹ Marszalek, M.² Schmid, C.³ Rozenfeld, B.⁴

12
- 50649122769
- Retrieving actions in movies
- Laptev, I.; Pérez, P.: Retrieving actions in movies. In: Proceedings of the International Conference on Computer Vision, pp. 1-8 (2007)
- (2007) Proceedings of the International Conference on Computer Vision , pp. 1-8
- Laptev, I.¹ Pérez, P.²

13
- 84873572465
- MIR in Matlab (ii): A toolbox for musical feature extraction from audio
- Lartillot, O.; Toiviainen, P.: MIR in Matlab (ii): a toolbox for musical feature extraction from audio. In: ISMIR, pp. 127-130 (2007)
- (2007) ISMIR , pp. 127-130
- Lartillot, O.¹ Toiviainen, P.²

14
- 84905158219
- Pku-idm at trecvid 2010: Copy detection with visual-audio feature fusion and sequential pyramid matching
- Li, Y.; Mou, L.; Jiang, M.; Su, C.; Fang, X.; Qian, M.; Tian, Y.; Wang, Y.; Huang, T.; Gao, W.: Pku-idm at trecvid 2010: copy detection with visual-audio feature fusion and sequential pyramid matching. In: TRECVID 2010 Notebook papers (2010)
- (2010) TRECVID 2010 Notebook Papers
- Li, Y.¹ Mou, L.² Jiang, M.³ Su, C.⁴ Fang, X.⁵ Qian, M.⁶ Tian, Y.⁷ Wang, Y.⁸ Huang, T.⁹ Gao, W.¹⁰

15
- 0001457509
- Some methods for classification and analysis of multivariate observations
- Cam, L.M.L.; Neyman, J. (eds.) University of California Press
- MacQueen, J.B.: Some methods for classification and analysis of multivariate observations. In: Cam, L.M.L.; Neyman, J. (eds.) Proceedings of the fifth Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, pp. 281-297. University of California Press (1967)
- (1967) Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability , vol.1 , pp. 281-297
- MacQueen, J.B.¹

16
- 0002322469
- On a test of whether one of two random variables is stochastically larger than the other
- 10.1214/aoms/1177730491 0041.26103 22058
- Mann, H.B.; Whitney, D.R.: On a test of whether one of two random variables is stochastically larger than the other. Ann. Math. Stat. 18(1), 50-60 (1947)
- (1947) Ann. Math. Stat. , vol.18 , Issue.1 , pp. 50-60
- Mann, H.B.¹ Whitney, D.R.²

17
- 84898462191
- "Here's looking at you, kid". Detecting people looking at each other in videos
- Marin-Jimenez, M.; Zisserman, A.; Ferrari, V.: "Here's looking at you, kid". Detecting people looking at each other in videos. In: Proceedings of the British Machine Vision Conference (2011)
- (2011) Proceedings of the British Machine Vision Conference
- Marin-Jimenez, M.¹ Zisserman, A.² Ferrari, V.³

18
- 70450177757
- Actions in context
- Marszałek, M.; Laptev, I.; Schmid, C.: Actions in context. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2009)
- (2009) Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
- Marszałek, M.¹ Laptev, I.² Schmid, C.³

19
- 15044354466
- Automatic analysis of multimodal group actions in meetings
- 10.1109/TPAMI.2005.49
- McCowan, I.; Gatica-Perez, D.; Bengio, S.; Lathoud, G.; Barnard, M.; Zhang, D.: Automatic analysis of multimodal group actions in meetings. IEEE Trans. Pattern Anal. Mach. Intell. 27(3), 305-317 (2005)
- (2005) IEEE Trans. Pattern Anal. Mach. Intell. , vol.27 , Issue.3 , pp. 305-317
- McCowan, I.¹ Gatica-Perez, D.² Bengio, S.³ Lathoud, G.⁴ Barnard, M.⁵ Zhang, D.⁶

20
- 84867753906
- Structured learning of human interactions in TV shows
- 10.1109/TPAMI.2012.24
- Patron-Perez, A.; Marszalek, M.; Reid, I.; Zisserman, A.: Structured learning of human interactions in TV shows. IEEE Trans. Pattern Anal. Mach. Intell. 34(12), 2441-2453 (2012)
- (2012) IEEE Trans. Pattern Anal. Mach. Intell. , vol.34 , Issue.12 , pp. 2441-2453
- Patron-Perez, A.¹ Marszalek, M.² Reid, I.³ Zisserman, A.⁴

21
- 84905269591
- Genie trecvid2011 multimedia event detection: Late-fusion approaches to combine multiple audio-visual features
- Perera, A.G.A.; Oh, S.; Leotta, M.; Kim, I.; Byun, B.; Lee, C.H.; McCloskey, S.; Liu, J.; Miller, B.; Huang, Z.F.; Vahdat, A.; Yang, W.; Mori, G.; Tang, K.; Koller, D.; Fei-Fei, L.; Li, K.; Chen, G.; Corso, J.; Fu, Y.; Srihari, R.: Genie trecvid2011 multimedia event detection: late-fusion approaches to combine multiple audio-visual features. In: TRECVID 2011 Notebook papers (2011)
- (2011) TRECVID 2011 Notebook Papers
- Perera, A.G.A.¹ Oh, S.² Leotta, M.³ Kim, I.⁴ Byun, B.⁵ Lee, C.H.⁶ McCloskey, S.⁷ Liu, J.⁸ Miller, B.⁹ Huang, Z.F.¹⁰ Vahdat, A.¹¹ Yang, W.¹² Mori, G.¹³ Tang, K.¹⁴ Koller, D.¹⁵ Fei-Fei, L.¹⁶ Li, K.¹⁷ Chen, G.¹⁸ Corso, J.¹⁹ Fu, Y.²⁰ Srihari, R.²¹

22
- 0003474751
- 2 edn. Cambridge University Press, Cambridge
- Press, W.H.; Teukolsky, S.A.; Vetterling, W.; Flannery, B.P.: Numerical recipes in C++: the art of scientific computing, 2 edn. Cambridge University Press, Cambridge (2002)
- (2002) Numerical Recipes in C++: The Art of Scientific Computing
- Press, W.H.¹ Teukolsky, S.A.² Vetterling, W.³ Flannery, B.P.⁴

23
- 80053115991
- Understanding interactions and guiding visual surveillance by tracking attention
- Koch, R.; Huang, F. (eds.) Springer, Berlin/Heidelberg
- Reid, I.; Benfold, B.; Patron, A.; Sommerlade, E.: Understanding interactions and guiding visual surveillance by tracking attention. In: Koch, R.; Huang, F. (eds.) Computer Vision - ACCV 2010 Workshops. Lecture Notes in Computer Science, vol. 6468, pp. 380-389. Springer, Berlin/Heidelberg (2011)
- (2011) Computer Vision - ACCV 2010 Workshops. Lecture Notes in Computer Science , vol.6468 , pp. 380-389
- Reid, I.¹ Benfold, B.² Patron, A.³ Sommerlade, E.⁴

24
- 77953187842
- Spatio-temporal relationship match: Video structure comparison for recognition of complex human activities
- Ryoo, M.; Aggarwal, J.: Spatio-temporal relationship match: Video structure comparison for recognition of complex human activities. In: Proceedings of the International Conference on Computer Vision, pp. 1593-1600 (2009)
- (2009) Proceedings of the International Conference on Computer Vision , pp. 1593-1600
- Ryoo, M.¹ Aggarwal, J.²

25
- 10044233701
- Recognizing human actions: A local SVM approach
- Cambridge
- Schüldt, C.; Laptev, I.; Caputo, B.: Recognizing human actions: a local SVM approach. In: Proceedings of the International Conference on Pattern Recognition, vol. 3, pp. 32-36, Cambridge (2004)
- (2004) Proceedings of the International Conference on Pattern Recognition , vol.3 , pp. 32-36
- Schüldt, C.¹ Laptev, I.² Caputo, B.³

26
- 78649922664
- On the use of audio events for improving video scene segmentation
- Sidiropoulos, P.; Mezaris, V.; Kompatsiaris, I.; Meinedo, H.; Bugalho, M.; Trancoso, I.: On the use of audio events for improving video scene segmentation. In: 2010 11th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS), pp. 1-4 (2010)
- (2010) 2010 11th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS) , pp. 1-4
- Sidiropoulos, P.¹ Mezaris, V.² Kompatsiaris, I.³ Meinedo, H.⁴ Bugalho, M.⁵ Trancoso, I.⁶

27
- 0345414182
- Video Google: A text retrieval approach to object matching in videos
- Sivic, J.; Zisserman, A.: Video Google: a text retrieval approach to object matching in videos. In: Proceedings of the International Conference on Computer Vision, vol. 2, pp. 1470-1477 (2003)
- (2003) Proceedings of the International Conference on Computer Vision , vol.2 , pp. 1470-1477
- Sivic, J.¹ Zisserman, A.²

28
- 34547401486
- Evaluation campaigns and TRECVid
- ACM Press, New York
- Smeaton, A.F.; Over, P.; Kraaij, W.: Evaluation campaigns and TRECVid. In: MIR '06: Proceedings of the 8th ACM International Workshop on Multimedia Information Retrieval, pp. 321-330. ACM Press, New York (2006)
- (2006) MIR '06: Proceedings of the 8th ACM International Workshop on Multimedia Information Retrieval , pp. 321-330
- Smeaton, A.F.¹ Over, P.² Kraaij, W.³

29
- 84955035459
- A scale for the measurement of the psychological magnitude of pitch
- 10.1121/1.1915893
- Stevens, S.; Volkmann, J.; Newman, E.: A scale for the measurement of the psychological magnitude of pitch. J. Acoust. Soc. Am. 8, 185-190 (1937)
- (1937) J. Acoust. Soc. Am. , vol.8 , pp. 185-190
- Stevens, S.¹ Volkmann, J.² Newman, E.³

30
- 55149089260
- Machine recognition of human activities: A survey
- Turaga, P.; Chellappa, R.; Subrahmanian, V.S.; Udrea, O.: Machine recognition of human activities: a survey. IEEE Trans. Circ. Syst. Video Technol. 18(11), 1473-1488 (2008)
- (2008) IEEE Trans. Circ. Syst. Video Technol. , vol.18 , Issue.11 , pp. 1473-1488
- Turaga, P.¹ Chellappa, R.² Subrahmanian, V.S.³ Udrea, O.⁴

31
- 46149103415
- Building audio classification for broadcast news retrieval
- Tzanetakis, G.; Chen, M.: Building audio classification for broadcast news retrieval. In: Proceedings of WIAMIS (2004)
- (2004) Proceedings of WIAMIS
- Tzanetakis, G.¹ Chen, M.²

32
- 71149100224
- More generality in efficient multiple kernel learning
- Varma, M.; Babu, B.R.: More generality in efficient multiple kernel learning. In: ICML, p. 134 (2009)
- (2009) ICML , p. 134
- Varma, M.¹ Babu, B.R.²

33
- 70349362313
- Vedaldi, A.; Fulkerson, B.: VLFeat: an open and portable library of computer vision algorithms. http://www.vlfeat.org/ (2008)
- (2008) VLFeat: An Open and Portable Library of Computer Vision Algorithms
- Vedaldi, A.¹ Fulkerson, B.²

34
- 84856194352
- Efficient additive kernels via explicit feature maps
- 10.1109/TPAMI.2011.153
- Vedaldi, A.; Zisserman, A.: Efficient additive kernels via explicit feature maps. IEEE Trans. Pattern Anal. Mach. Intell. 34(3), 480-492 (2012)
- (2012) IEEE Trans. Pattern Anal. Mach. Intell. , vol.34 , Issue.3 , pp. 480-492
- Vedaldi, A.¹ Zisserman, A.²

35
- 85162016686
- Multiple kernel learning and the SMO algorithm
- Vishwanathan, S.V.N.; Sun, Z.; Theera-Ampornpunt, N.; Varma, M.: Multiple kernel learning and the SMO algorithm. In: Advances in Neural Information Processing Systems (2010)
- (2010) Advances in Neural Information Processing Systems
- Vishwanathan, S.V.N.¹ Sun, Z.² Theera-Ampornpunt, N.³ Varma, M.⁴

36
- 84898890371
- Evaluation of local spatio-temporal features for action recognition
- Wang, H.; Ullah, M.; Kläser, A.; Laptev, I.; Schmid, C.: Evaluation of local spatio-temporal features for action recognition. In: Proceedings of the British Machine Vision Conference, p. 127 (2009)
- (2009) Proceedings of the British Machine Vision Conference , p. 127
- Wang, H.¹ Ullah, M.² Kläser, A.³ Laptev, I.⁴ Schmid, C.⁵

37
- 33750025833
- Free viewpoint action recognition using motion history volumes
- Weinland, D.; Ronfard, R.; Boyer, E.: Free viewpoint action recognition using motion history volumes. Comput. Vis. Image Underst. 104(2-3), 249-257 (2006)
- (2006) Comput. Vis. Image Underst. , vol.104 , Issue.2-3 , pp. 249-257
- Weinland, D.¹ Ronfard, R.² Boyer, E.³

38
- 0001884644
- Individual comparisons by ranking methods
- 10.2307/3001968
- Wilcoxon, F.: Individual comparisons by ranking methods. Biom. Bull. 1(6), 80-83 (1945)
- (1945) Biom. Bull. , vol.1 , Issue.6 , pp. 80-83
- Wilcoxon, F.¹

39
- 84856672971
- Action recognition by learning bases of action attributes and parts
- Barcelona, Spain
- Yao, B.; Jiang, X.; Khosla, A.; Lin, A.; Guibas, L.; Fei-Fei, L.: Action recognition by learning bases of action attributes and parts. In: Proceedings of the International Conference on Computer Vision, Barcelona, Spain (2011)
- (2011) Proceedings of the International Conference on Computer Vision
- Yao, B.¹ Jiang, X.² Khosla, A.³ Lin, A.⁴ Guibas, L.⁵ Fei-Fei, L.⁶

40
- 84864147835
- Joint audio-visual bi-modal codewords for video event detection
- Ye, G.; Jhuo, I.H.; Liu, D.; Jiang, Y.G.; Lee, D.T.; Chang, S.F.: Joint audio-visual bi-modal codewords for video event detection. In: ICMR, p. 39 (2012)
- (2012) ICMR , p. 39
- Ye, G.¹ Jhuo, I.H.² Liu, D.³ Jiang, Y.G.⁴ Lee, D.T.⁵ Chang, S.F.⁶

41
- 84866712367
- Robust late fusion with rank minimization
- Ye, G.; Liu, D.; Jhuo, I.H.; Chang, S.F.: Robust late fusion with rank minimization. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3021-3028 (2012)
- (2012) Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pp. 3021-3028
- Ye, G.¹ Liu, D.² Jhuo, I.H.³ Chang, S.F.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.