-
1
-
-
34547401486
-
Evaluation campaigns and TRECVID
-
ACM Press, New York
-
Smeaton, A.F.; Over, P.; Kraaij, W.: Evaluation campaigns and TRECVID. In: Proceedings of the 8th ACM international workshop on multimedia information retrieval, Santa Barbara, 26-27 October 2006 (MIR '06). ACM Press, New York, pp. 321-330 (2006)
-
(2006)
Proceedings of the 8th ACM International Workshop on Multimedia Information Retrieval, Santa Barbara, 26-27 October 2006 (MIR '06)
, pp. 321-330
-
-
Smeaton, A.F.1
Over, P.2
Kraaij, W.3
-
3
-
-
84866712341
-
Multimodal feature fusion for robust event detection in web videos
-
Natarajan, P.; Wu, S.; Vitaladevuni, S.; Zhuang, X.; Tsakalidis, S.; Paurk, U.; Prasad.; R.: Multimodal feature fusion for robust event detection in web videos. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition (CVPR), pp. 1298-1305 (2012)
-
(2012)
Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR)
, pp. 1298-1305
-
-
Natarajan, P.1
Wu, S.2
Vitaladevuni, S.3
Zhuang, X.4
Tsakalidis, S.5
Paurk, U.6
Prasad, R.7
-
4
-
-
84894902302
-
Evaluation of low-level features and their combinations for complex event detection in open source videos
-
Sawhney, H.; Cheng, H.; Divakaran, A.; Javed, O.; Liu, J.; Yu, Q.; Ali, S.; Tamrakar, A.: Evaluation of low-level features and their combinations for complex event detection in open source videos. CVPR, 2496-2499 (2012)
-
(2012)
CVPR
, pp. 2496-2499
-
-
Sawhney, H.1
Cheng, H.2
Divakaran, A.3
Javed, O.4
Liu, J.5
Yu, Q.6
Ali, S.7
Tamrakar, A.8
-
5
-
-
84864116485
-
Super: Towards real-time event recognition in internet videos
-
(article no. 33)
-
Jiang, Y.: Super: towards real-time event recognition in internet videos. ACM Int. Conf. Multimed. Retr. (ICMR) (2012) (article no. 33)
-
(2012)
ACM Int. Conf. Multimed. Retr. (ICMR)
-
-
Jiang, Y.1
-
6
-
-
78651388935
-
Event detection and recognition for semantic annotation of video
-
10.1007/s11042-010-0643-7
-
Ballan, L.; Bertini, M.; Del Bimbo, A.; Seidenari, L.; Serra, G.: Event detection and recognition for semantic annotation of video. Multimed. Tools Appl. 51(1), 279-302 (2011)
-
(2011)
Multimed. Tools Appl.
, vol.51
, Issue.1
, pp. 279-302
-
-
Ballan, L.1
Bertini, M.2
Del Bimbo, A.3
Seidenari, L.4
Serra, G.5
-
7
-
-
54749131961
-
Video event recognition using kernel methods with multilevel temporal alignment
-
10.1109/TPAMI.2008.129
-
Xu, D.; Chang, S.-F.: Video event recognition using kernel methods with multilevel temporal alignment. IEEE Trans. Pattern Anal. Mach. Intell. (IEEE TPAMI) 30(11), 1985-1997 (2008)
-
(2008)
IEEE Trans. Pattern Anal. Mach. Intell. (IEEE TPAMI)
, vol.30
, Issue.11
, pp. 1985-1997
-
-
Xu, D.1
Chang, S.-F.2
-
9
-
-
77955422240
-
Object detection with discriminatively trained part-based models
-
10.1109/TPAMI.2009.167
-
Felzenszwalb, P.; Girshick, R.; McAllester, D.; Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE TPAMI 32(9), 1627-1645 (2010)
-
(2010)
IEEE TPAMI
, vol.32
, Issue.9
, pp. 1627-1645
-
-
Felzenszwalb, P.1
Girshick, R.2
McAllester, D.3
Ramanan, D.4
-
10
-
-
85162513516
-
Object bank: A high-level image representation for scene classification and semantic feature sparsification
-
Li, L.; SU, H.; Xing, E.; Fei-Fei, L.: Object bank: a high-level image representation for scene classification and semantic feature sparsification. Adv. Neural Inf. Process. Syst.; 24 (2010)
-
(2010)
Adv. Neural Inf. Process. Syst.
, vol.24
-
-
Li, L.1
Su, H.2
Xing, E.3
Fei-Fei, L.4
-
11
-
-
84866718894
-
Action bank: A high-level representation of activity in video
-
Sadanand, S.; Corso, J.J.: Action bank: a high-level representation of activity in video. CVPR (2012)
-
(2012)
CVPR
-
-
Sadanand, S.1
Corso, J.J.2
-
12
-
-
77953353879
-
Visual-concept search solved?
-
10.1109/MC.2010.183
-
Snoek, C.G.M.; Smeulders, A.W.M.: Visual-concept search solved? IEEE Comput. 43(6), 76-78 (2010)
-
(2010)
IEEE Comput.
, vol.43
, Issue.6
, pp. 76-78
-
-
Snoek, C.G.M.1
Smeulders, A.W.M.2
-
13
-
-
84856141623
-
Semantic model vectors for complex video event recognition
-
10.1109/TMM.2011.2168948
-
Merler, M.; Huang, B.; Xie, L.; Hua, G.; Natsev, A.: Semantic model vectors for complex video event recognition. IEEE Trans. Multimed. (TMM) 14(1), 88-101 (2012)
-
(2012)
IEEE Trans. Multimed. (TMM)
, vol.14
, Issue.1
, pp. 88-101
-
-
Merler, M.1
Huang, B.2
Xie, L.3
Hua, G.4
Natsev, A.5
-
14
-
-
84871390957
-
Detection bank: An object detection based video representation for multimedia event recognition
-
Althoff, T.; Song, H.; Darrell, T.: Detection bank: an object detection based video representation for multimedia event recognition. ACM Multimed. (MM) (2012)
-
(2012)
ACM Multimed. (MM)
-
-
Althoff, T.1
Song, H.2
Darrell, T.3
-
15
-
-
79959761356
-
High-level event detection in video exploiting discriminant concepts
-
Tsampoulatidis, I.; Gkalelis, N.; Dimou, A.; Mezaris, V.; Kompatsiaris, I.: High-level event detection in video exploiting discriminant concepts. In: Proceedings of the 1st ACM international conference on multimedia retrieval, pp. 85-90 (2011)
-
(2011)
Proceedings of the 1st ACM International Conference on Multimedia Retrieval
, pp. 85-90
-
-
Tsampoulatidis, I.1
Gkalelis, N.2
Dimou, A.3
Mezaris, V.4
Kompatsiaris, I.5
-
17
-
-
84905269591
-
GENIE TRECVID 2011 multimedia event detection: Late-fusion approaches to combine multiple audio-visual features
-
Perera, A.G.A.; Oh, S.; Leotta, M.; Kim, I.; Byun, B.; Lee, C.-H.; McCloskey, S.; Liu, J.; Miller, B.; Huang, Z.F.; Vahdat, A.; Yang, W.; Mori, G.; Tang, K.; Koller, D.; Fei-Fei, L.; Li, K.; Chen, G.; Corso, J.; Fu, Y.; Srihari, R.: GENIE TRECVID 2011 multimedia event detection: late-fusion approaches to combine multiple audio-visual features. In: NIST TRECVID, workshop (2011)
-
(2011)
NIST TRECVID, Workshop
-
-
Perera, A.G.A.1
Oh, S.2
Leotta, M.3
Kim, I.4
Byun, B.5
Lee, C.-H.6
McCloskey, S.7
Liu, J.8
Miller, B.9
Huang, Z.F.10
Vahdat, A.11
Yang, W.12
Mori, G.13
Tang, K.14
Koller, D.15
Fei-Fei, L.16
Li, K.17
Chen, G.18
Corso, J.19
Fu, Y.20
Srihari, R.21
more..
-
18
-
-
84937454179
-
Creating HAVIC: Heterogeneous audio visual internet collection
-
Calzolari N.; Choukri K.; Declerck T.; Uǧur Doǧan M.; Maegaard B.; Mariani J.; Odijk J.; Piperidis S. (eds.) Istanbul
-
Strassel, S.; Morris, A.; Fiscus, J.; Caruso, C.; Lee, H.; Over, P.; Fiumara, J.; Shaw, B.; Antonishek, B.; Michel, M.: Creating HAVIC: heterogeneous audio visual internet collection. In: Calzolari N.; Choukri K.; Declerck T.; Uǧur Doǧan M.; Maegaard B.; Mariani J.; Odijk J.; Piperidis S. (eds.) Proceedings of the eighth international conference on language resources and evaluation, Istanbul (2012)
-
(2012)
Proceedings of the Eighth International Conference on Language Resources and Evaluation
-
-
Strassel, S.1
Morris, A.2
Fiscus, J.3
Caruso, C.4
Lee, H.5
Over, P.6
Fiumara, J.7
Shaw, B.8
Antonishek, B.9
Michel, M.10
-
20
-
-
84865584175
-
Aggregating local image descriptors into compact codes
-
10.1109/TPAMI.2011.235
-
Jégou, H.; Perronnin, F.; Douze, M.; Sanchez, J.; Pérez, P.; Schmid, C.: Aggregating local image descriptors into compact codes. IEEE TPAMI 34(9), 1704-1716 (2012)
-
(2012)
IEEE TPAMI
, vol.34
, Issue.9
, pp. 1704-1716
-
-
Jégou, H.1
Perronnin, F.2
Douze, M.3
Sanchez, J.4
Pérez, P.5
Schmid, C.6
-
21
-
-
34948815101
-
Fisher kernels on visual vocabularies for image categorization
-
Perronnin, F.; Dance, C.: Fisher kernels on visual vocabularies for image categorization. CVPR, (2007)
-
(2007)
CVPR
-
-
Perronnin, F.1
Dance, C.2
-
22
-
-
84905234239
-
The MediaMill TRECVID 2012 semantic video search engine
-
Gaithersburg
-
Snoek, C.G.M.; van de Sande, K.E.A.; Habibian, A.; Kordumova, S.; Li, Z.; Mazloom, M.; Pintea, S.L.; Tao, R.; Koelma, D.C.; Smeulders, A.W.M.: The MediaMill TRECVID 2012 semantic video search engine. In: Proceeding of the TRECVID workshop, Gaithersburg (2012)
-
(2012)
Proceeding of the TRECVID Workshop
-
-
Snoek, C.G.M.1
Van De Sande, K.E.A.2
Habibian, A.3
Kordumova, S.4
Li, Z.5
Mazloom, M.6
Pintea, S.L.7
Tao, R.8
Koelma, D.C.9
Smeulders, A.W.M.10
-
23
-
-
47549083305
-
Local invariant feature detectors: A survey. Found
-
Tuytelaars, T.; Mikolajczyk, K.: Local invariant feature detectors: a survey. Found. Trends. Comput. Graph. Vis. 3(3), 177-280 (2008)
-
(2008)
Trends. Comput. Graph. Vis.
, vol.3
, Issue.3
, pp. 177-280
-
-
Tuytelaars, T.1
Mikolajczyk, K.2
-
24
-
-
33750571107
-
On the surplus value of semantic video analysis beyond the key frame
-
Snoek, C.G.M.; Worring, M.; Geusebroek, J.-M.; Koelma, D.C.; Seinstra, F.J.: On the surplus value of semantic video analysis beyond the key frame.In: Proceedings of the IEEE international conference on multimedia and expo (2005)
-
(2005)
Proceedings of the IEEE International Conference on Multimedia and Expo
-
-
Snoek, C.G.M.1
Worring, M.2
Geusebroek, J.-M.3
Koelma, D.C.4
Seinstra, F.J.5
-
25
-
-
33845572523
-
Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories
-
(New York)
-
Lazebnik, S.; Schmid, C.; Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. CVPR 2, 2169-2178 (2006) (New York)
-
(2006)
CVPR
, vol.2
, pp. 2169-2178
-
-
Lazebnik, S.1
Schmid, C.2
Ponce, J.3
-
26
-
-
77955426203
-
Evaluating color descriptors for object and scene recognition
-
10.1109/TPAMI.2009.154
-
van de Sande, K.E.A.; Gevers, T.; Snoek, C.G.M.: Evaluating color descriptors for object and scene recognition. IEEE TPAMI 32(9), 1582-1596 (2010)
-
(2010)
IEEE TPAMI
, vol.32
, Issue.9
, pp. 1582-1596
-
-
Van De Sande, K.E.A.1
Gevers, T.2
Snoek, C.G.M.3
-
27
-
-
3042535216
-
Distinctive image features from scale-invariant keypoints
-
10.1023/B:VISI.0000029664.99615.94
-
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60, 91-110 (2004)
-
(2004)
Int. J. Comput. Vis.
, vol.60
, pp. 91-110
-
-
Lowe, D.G.1
-
28
-
-
0035670615
-
Color invariance
-
Geusebroek, J.-M.; Boomgaard, R.; Smeulders, A.W.M.; Geerts, H.: Color invariance. IEEE TPAMI 23(12), 1338-1350 (2001)
-
(2001)
IEEE TPAMI
, vol.23
, Issue.12
, pp. 1338-1350
-
-
Geusebroek, J.-M.1
Boomgaard, R.2
Smeulders, A.W.M.3
Geerts, H.4
-
29
-
-
77249101259
-
Comparing compact codebooks for visual categorization
-
van Gemert, J.C.; Snoek, C.G.M.; Veenman, C.J.; Smeulders, A.W.M.; Geusebroek, J.-M.: Comparing compact codebooks for visual categorization. Comput. Vis. Image Underst. 114(4), 450-462 (2010)
-
(2010)
Comput. Vis. Image Underst.
, vol.114
, Issue.4
, pp. 450-462
-
-
Van Gemert, J.C.1
Snoek, C.G.M.2
Veenman, C.J.3
Smeulders, A.W.M.4
Geusebroek, J.-M.5
-
31
-
-
80052877143
-
Action recognition by dense trajectories
-
Wang, H.; Kläser, A.; Schmid, C.; Cheng-Lin, L.: Action recognition by dense trajectories. CVPR, 3169-3176 (2011)
-
(2011)
CVPR
, pp. 3169-3176
-
-
Wang, H.1
Kläser, A.2
Schmid, C.3
Cheng-Lin, L.4
-
33
-
-
24944451092
-
On space-time interest points
-
10.1007/s11263-005-1838-7
-
Laptev, I.: On space-time interest points. Int. J. Comput. Vis. 64(2/3), 107-123 (2005)
-
(2005)
Int. J. Comput. Vis.
, vol.64
, Issue.23
, pp. 107-123
-
-
Laptev, I.1
-
35
-
-
70049084217
-
Large-scale content-based audio retrieval from text queries
-
New York
-
Chechik, G.; Ie, E.; Rehn, M.; Bengio, S.; Lyon, D.: Large-scale content-based audio retrieval from text queries. In: Proceedings of 1st ACM international conference on multimedia information retrieval (MIR '08), pp. 105-112, New York (2008)
-
(2008)
Proceedings of 1st ACM International Conference on Multimedia Information Retrieval (MIR '08)
, pp. 105-112
-
-
Chechik, G.1
Ie, E.2
Rehn, M.3
Bengio, S.4
Lyon, D.5
-
36
-
-
84905190512
-
KDDI labs and SRI international at TRECVID 2010: Content-based copy detection
-
Uchida, Y.; Sakazawa, S.; Argawal, M.; Akbacak, M.: KDDI labs and SRI international at TRECVID 2010: content-based copy detection. In: NIST TRECVID 2010 evaluation, workshop (2010)
-
(2010)
NIST TRECVID 2010 Evaluation, Workshop
-
-
Uchida, Y.1
Sakazawa, S.2
Argawal, M.3
Akbacak, M.4
-
37
-
-
84905161670
-
Columbia-UCF TRECVID 2010 multimedia event detection: Combining multiple modalities, contextual concepts, and temporal matching
-
Jiang, Y.; Zeng, X.; Ye, G.; Ellis, D.; Shah, M.; Chang, S.: Columbia-UCF TRECVID 2010 multimedia event detection: combining multiple modalities, contextual concepts, and temporal matching. In: NIST TRECVID, workshop (2010)
-
(2010)
NIST TRECVID, Workshop
-
-
Jiang, Y.1
Zeng, X.2
Ye, G.3
Ellis, D.4
Shah, M.5
Chang, S.6
-
38
-
-
84878606595
-
Bag-of-audio-words approach for multimedia event detection
-
Pancoast, S.; Akbacak, M.: Bag-of-audio-words approach for multimedia event detection. In: Proceedings of interspeech (2012)
-
(2012)
Proceedings of Interspeech
-
-
Pancoast, S.1
Akbacak, M.2
-
39
-
-
84856141623
-
Semantic model vectors for complex video event recognition
-
10.1109/TMM.2011.2168948
-
Merler, M.; Huang, B.; Xie, L.; Hua, G.; Natsev, A.: Semantic model vectors for complex video event recognition. IEEE Trans. Multimed. 14(1), 88-101 (2012)
-
(2012)
IEEE Trans. Multimed.
, vol.14
, Issue.1
, pp. 88-101
-
-
Merler, M.1
Huang, B.2
Xie, L.3
Hua, G.4
Natsev, A.5
-
40
-
-
84905274625
-
TRECVID 2012 - An overview of the goals, tasks, data, evaluation mechanisms, and metrics
-
Over, P.; Awad, G.; Michel, M.; Fiscus, J.; Sanders, G.; Shaw, B.; Kraaij, W.; Smeaton, A.F.; Quéenot, G.: TRECVID 2012 - an overview of the goals, tasks, data, evaluation mechanisms, and metrics. In: Proceedings of TRECVID (2012) http://www-nlpir.nist.gov/projects/tvpubs/tv12.papers/ tv12overview.pdf
-
(2012)
Proceedings of TRECVID
-
-
Over, P.1
Awad, G.2
Michel, M.3
Fiscus, J.4
Sanders, G.5
Shaw, B.6
Kraaij, W.7
Smeaton, A.F.8
Quéenot, G.9
-
41
-
-
80052915062
-
-
Berg, A.; Deng, J.; Satheesh, S.; Su, H.; Li, F.-F.: Imagenet large scale visual recognition challenge (2011) http://www.image-net.org/challenges/LSVRC/ 2011/
-
(2011)
Imagenet Large Scale Visual Recognition Challenge
-
-
Berg, A.1
Deng, J.2
Satheesh, S.3
Su, H.4
Li, F.-F.5
-
42
-
-
51449099706
-
The ICSI-SRI spring 2006 meeting recognition system, MLMI'06
-
Janin, A.; Stolcke, A.; Anguera, X.; Boakye, K.; Çetin, Ö.; Frankel, J.; Zheng, J.: The ICSI-SRI spring 2006 meeting recognition system, MLMI'06. In: Proceedings of the third international conference on machine learning for multimodal, interaction, pp. 444-456 (2006)
-
(2006)
Proceedings of the Third International Conference on Machine Learning for Multimodal, Interaction
, pp. 444-456
-
-
Janin, A.1
Stolcke, A.2
Anguera, X.3
Boakye, K.4
Çetin, O.5
Frankel, J.6
Zheng, J.7
-
43
-
-
84890454299
-
Extracting audio and spoken concepts for multimedia event detection
-
van Hout, J.; Akbacak, M.; Castaneda, D.; Yeh, E.; Sanchez, M.: Extracting audio and spoken concepts for multimedia event detection. In: International conference on acoustics, speech, and signal processing (ICASSP) (2013)
-
(2013)
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Van Hout, J.1
Akbacak, M.2
Castaneda, D.3
Yeh, E.4
Sanchez, M.5
-
46
-
-
34547469390
-
Can high-level concepts fill the semantic gap in video retrieval? A case study with broadcast retrieval
-
10.1109/TMM.2007.900150
-
Hauptmann, A.; Yan, R.; Lin, W.-H.; Christel, M.; Wactlar, H.: Can high-level concepts fill the semantic gap in video retrieval? A case study with broadcast retrieval. IEEE Trans Multimed. 9(5), 958-966 (2007)
-
(2007)
IEEE Trans Multimed.
, vol.9
, Issue.5
, pp. 958-966
-
-
Hauptmann, A.1
Yan, R.2
Lin, W.-H.3
Christel, M.4
Wactlar, H.5
|