-
1
-
-
72549102414
-
Biologically motivated audio-visual cue integration for object categorization
-
J. Anemueller and et al. Biologically motivated audio-visual cue integration for object categorization. In CogSys, 2008.
-
(2008)
CogSys
-
-
Anemueller, J.1
and et, al.2
-
3
-
-
0042349407
-
A graphical model for audiovisual object tracking
-
M.J. Beal and et al. A graphical model for audiovisual object tracking. IEEE Trans. PAMI, 25(7):828-836, 2003.
-
(2003)
IEEE Trans. PAMI
, vol.25
, Issue.7
, pp. 828-836
-
-
Beal, M.J.1
and et, al.2
-
5
-
-
84905193419
-
Columbia university TRECVID-2005 video search and high-level feature extraction
-
Gaithersburg, MD
-
S.F. Chang and et al. Columbia university TRECVID-2005 video search and high-level feature extraction. In NIST TRECVID workshop, Gaithersburg, MD, 2005.
-
(2005)
NIST TRECVID workshop
-
-
Chang, S.F.1
and et, al.2
-
6
-
-
72549095204
-
Large-scale multimodal semantic concept detection for consumer video
-
S.F. Chang and et al. Large-scale multimodal semantic concept detection for consumer video. In ACM MIR, 2007.
-
(2007)
ACM MIR
-
-
Chang, S.F.1
and et, al.2
-
7
-
-
84863161940
-
Image categorization by learning and reasoning with regions
-
Y.X. Chen and et al. Image categorization by learning and reasoning with regions. In JMLR, 5:913-939, 2004.
-
(2004)
JMLR
, vol.5
, pp. 913-939
-
-
Chen, Y.X.1
and et, al.2
-
8
-
-
33846623313
-
Audio-visual event recognition in surveillance video sequences
-
M. Cristani and et al. Audio-visual event recognition in surveillance video sequences. In IEEE Trans. Multimedia, 9(2):257-267, 2007.
-
(2007)
IEEE Trans. Multimedia
, vol.9
, Issue.2
, pp. 257-267
-
-
Cristani, M.1
and et, al.2
-
9
-
-
51449105193
-
Environmental sound recognition using MP-based features
-
S. Chu and et al. Environmental sound recognition using MP-based features. in Proc. ICASSP, pages 1-4, 2008.
-
(2008)
Proc. ICASSP
, pp. 1-4
-
-
Chu, S.1
and et, al.2
-
10
-
-
33645146449
-
Histograms of oriented gradients for human detection
-
N. Dalal and B. Triggs. Histograms of oriented gradients for human detection. In Proc. CVPR, pages 886-893, 2005.
-
(2005)
Proc. CVPR
, pp. 886-893
-
-
Dalal, N.1
Triggs, B.2
-
11
-
-
2342460956
-
Video retrieval using spatial-temporal descriptors
-
D. Dementhon and D. Doermann. Video retrieval using spatial-temporal descriptors. In ACM Multimedia, 2003.
-
(2003)
ACM Multimedia
-
-
Dementhon, D.1
Doermann, D.2
-
12
-
-
0035423154
-
Unsupervised segmentation of color-texture regions in images and video
-
Y. Deng and B.S. Manjunath. Unsupervised segmentation of color-texture regions in images and video. In IEEE Trans. PAMI, 23(8):800-810, 2001.
-
(2001)
IEEE Trans. PAMI
, vol.23
, Issue.8
, pp. 800-810
-
-
Deng, Y.1
Manjunath, B.S.2
-
13
-
-
0034164230
-
Additive logistic regression: A statistical view of boosting
-
J. Friedman and et al. Additive logistic regression: a statistical view of boosting. Ann. of Sta., 28(22):337-407, 2000.
-
(2000)
Ann. of Sta
, vol.28
, Issue.22
, pp. 337-407
-
-
Friedman, J.1
and et, al.2
-
14
-
-
33745855044
-
The pyramid match kernel: Discriminative classification with sets of image features
-
K. Grauman and T. Darrel. The pyramid match kernel: Discriminative classification with sets of image features. In Proc. ICCV, 2:1458-1465, 2005.
-
(2005)
Proc. ICCV
, vol.2
, pp. 1458-1465
-
-
Grauman, K.1
Darrel, T.2
-
15
-
-
5044226887
-
Incremental density approximation and kernel-based bayesian filtering for object tracking
-
B. Han and et al. Incremental density approximation and kernel-based bayesian filtering for object tracking. In Proc. CVPR, pages 638-644, 2004.
-
(2004)
Proc. CVPR
, pp. 638-644
-
-
Han, B.1
and et, al.2
-
16
-
-
0009622482
-
Audio-vision: Using audio-visual synchrony to locate sounds
-
J. Hershey and J. Movellan. Audio-vision: Using audio-visual synchrony to locate sounds. In NIPS, 1999.
-
(1999)
NIPS
-
-
Hershey, J.1
Movellan, J.2
-
17
-
-
34247257857
-
Audio-visual speech recognition using lip information extracted from side-face images
-
K. Iwano and et al. Audio-visual speech recognition using lip information extracted from side-face images. In EURASIP JASMP, 2007(1):4-4, 2007.
-
(2007)
EURASIP JASMP
, vol.2007
, Issue.1
, pp. 4-4
-
-
Iwano, K.1
and et, al.2
-
18
-
-
0142134976
-
Robust online appearence models for visual tracking
-
A. Jepson and et al. Robust online appearence models for visual tracking. IEEE Trans.PAMI, 25(10):1296-1311, 2003.
-
(2003)
IEEE Trans.PAMI
, vol.25
, Issue.10
, pp. 1296-1311
-
-
Jepson, A.1
and et, al.2
-
19
-
-
0001008498
-
Real-time lip tracking for audio-visual speech recognition applications
-
R. Kaucic, B. Dalton, and A. Blake. Real-time lip tracking for audio-visual speech recognition applications. In Proc. ECCV, vol.2, pages 376-387, 1996.
-
(1996)
Proc. ECCV
, vol.2
, pp. 376-387
-
-
Kaucic, R.1
Dalton, B.2
Blake, A.3
-
21
-
-
37849015208
-
Kodak's consumer video benchmark data set: Concept definition and annotation
-
A. Loui and et al. Kodak's consumer video benchmark data set: concept definition and annotation. In ACM SIGMM Int'l Workshop on MIR, pages 245-254, 2007.
-
(2007)
ACM SIGMM Int'l Workshop on MIR
, pp. 245-254
-
-
Loui, A.1
and et, al.2
-
22
-
-
3042535216
-
Distinctive image features from scale-invariant keypoints
-
D. Lowe. Distinctive image features from scale-invariant keypoints. In IJCV, 60(2):91-110, 2004.
-
(2004)
IJCV
, vol.60
, Issue.2
, pp. 91-110
-
-
Lowe, D.1
-
23
-
-
0002836012
-
An iterative image registration technique with an application to stereo vision
-
B.D. Lucas and T. Kanade. An iterative image registration technique with an application to stereo vision. In Proc. Imaging understanding workshop, pages 121-130, 1981.
-
(1981)
Proc. Imaging understanding workshop
, pp. 121-130
-
-
Lucas, B.D.1
Kanade, T.2
-
24
-
-
0027842081
-
Matching pursuits with time-frequency dictionaries
-
S. Mallat and Z. Zhang. Matching pursuits with time-frequency dictionaries. In IEEE Trans. Signal Processing, 41(12):3397-3415, 1993.
-
(1993)
IEEE Trans. Signal Processing
, vol.41
, Issue.12
, pp. 3397-3415
-
-
Mallat, S.1
Zhang, Z.2
-
25
-
-
84898935332
-
A framework for multiple-instance learning
-
O. Maron and et al. A framework for multiple-instance learning. In NIPS, 1998.
-
(1998)
NIPS
-
-
Maron, O.1
and et, al.2
-
26
-
-
57149147931
-
Extracting moving people from internet videos
-
J.C. Niebles and et al. Extracting moving people from internet videos. in Proc. ECCV, pages 527-540, 2008.
-
(2008)
Proc. ECCV
, pp. 527-540
-
-
Niebles, J.C.1
and et, al.2
-
28
-
-
34547532522
-
Fingerprinting to identify repeated sound events in long-duration personal audio recordings
-
J. Ogle and D. Ellis. Fingerprinting to identify repeated sound events in long-duration personal audio recordings. In Proc. ICASSP, pages I-233-236, 2007.
-
(2007)
Proc. ICASSP
-
-
Ogle, J.1
Ellis, D.2
-
30
-
-
0028112849
-
Good features to track
-
J. Shi and C. Tomasi. Good features to track. In Proc. CVPR, pages 593-600, 1994.
-
(1994)
Proc. CVPR
, pp. 593-600
-
-
Shi, J.1
Tomasi, C.2
-
31
-
-
0034244889
-
Learning patterns of activity using real-time tracking
-
C. Stauffer and W.E.L. Grimson. Learning patterns of activity using real-time tracking. In IEEE Trans. PAMI, 22(8):747-757, 2002.
-
(2002)
IEEE Trans. PAMI
, vol.22
, Issue.8
, pp. 747-757
-
-
Stauffer, C.1
Grimson, W.E.L.2
-
32
-
-
72549084034
-
Boosting image retrieval
-
K. Tieu and P. Viola. Boosting image retrieval. In IJCV, 56(1-2):228-235, 2000.
-
(2000)
IJCV
, vol.56
, Issue.1-2
, pp. 228-235
-
-
Tieu, K.1
Viola, P.2
-
34
-
-
33745844069
-
Learning Semantic Scene Models by Trajectory Analysis
-
X.G. Wang and et al. Learning Semantic Scene Models by Trajectory Analysis. In Proc. ECCV, pages 110-123, 2006.
-
(2006)
Proc. ECCV
, pp. 110-123
-
-
Wang, X.G.1
and et, al.2
-
35
-
-
20444437959
-
Multimodal information fusion for video concept detection
-
Y. Wu and et al. Multimodal information fusion for video concept detection. in Proc. ICIP, pages 2391-2394, 2004.
-
(2004)
Proc. ICIP
, pp. 2391-2394
-
-
Wu, Y.1
and et, al.2
-
36
-
-
33845562302
-
Region-based image annotation using asymmetrical support vector machine-based multiple-instance learning
-
C. Yang and et al. Region-based image annotation using asymmetrical support vector machine-based multiple-instance learning. In Proc. CVPR, pages 2057-2063, 2006.
-
(2006)
Proc. CVPR
, pp. 2057-2063
-
-
Yang, C.1
and et, al.2
-
37
-
-
37849051635
-
Large head movement tracking using SIFT-based registration
-
G.Q. Zhao and et al. Large head movement tracking using SIFT-based registration. In ACM Multimedia, 2007.
-
(2007)
ACM Multimedia
-
-
Zhao, G.Q.1
and et, al.2
-
38
-
-
59349094120
-
Object tracking using sift features and mean shift
-
H. Zhou and et al. Object tracking using sift features and mean shift. Com. Vis. & Ima. Und., 113(3):345-352, 2009.
-
(2009)
Com. Vis. & Ima. Und
, vol.113
, Issue.3
, pp. 345-352
-
-
Zhou, H.1
and et, al.2
|