



2009, Pages 5-14

Short-term audio-visual atoms for generic video concept classification

Author keywords

Audio visual codebook; Joint audio visual analysis; Semantic concept detection; Short term Audio Visual Atom

Indexed keywords

AUDIO-VISUAL; AUDIOVISUAL ANALYSIS; CODEBOOKS; SEMANTIC CONCEPT DETECTION

EID: 72549099611     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1631272.1631277     Document Type: Conference Paper
Times cited: 49

References (38)
  • 1
    • EID: 72549102414
    • J. Anemueller et al. Biologically motivated audio-visual cue integration for object categorization. In CogSys, 2008.
  • 3
    • EID: 0042349407
    • M.J. Beal et al. A graphical model for audiovisual object tracking. IEEE Trans. PAMI, 25(7):828-836, 2003.
  • 5
    • EID: 84905193419
    • S.F. Chang et al. Columbia University TRECVID-2005 video search and high-level feature extraction. In NIST TRECVID workshop, Gaithersburg, MD, 2005.
  • 6
    • EID: 72549095204
    • S.F. Chang et al. Large-scale multimodal semantic concept detection for consumer video. In ACM MIR, 2007.
  • 7
    • EID: 84863161940
    • Y.X. Chen et al. Image categorization by learning and reasoning with regions. In JMLR, 5:913-939, 2004.
  • 8
    • EID: 33846623313
    • M. Cristani et al. Audio-visual event recognition in surveillance video sequences. In IEEE Trans. Multimedia, 9(2):257-267, 2007.
  • 9
    • EID: 51449105193
    • S. Chu et al. Environmental sound recognition using MP-based features. In Proc. ICASSP, pages 1-4, 2008.
  • 10
    • EID: 33645146449
    • N. Dalal and B. Triggs. Histograms of oriented gradients for human detection. In Proc. CVPR, pages 886-893, 2005.
  • 11
    • EID: 2342460956
    • D. Dementhon and D. Doermann. Video retrieval using spatial-temporal descriptors. In ACM Multimedia, 2003.
  • 12
    • EID: 0035423154
    • Y. Deng and B.S. Manjunath. Unsupervised segmentation of color-texture regions in images and video. In IEEE Trans. PAMI, 23(8):800-810, 2001.
  • 13
    • EID: 0034164230
    • J. Friedman et al. Additive logistic regression: a statistical view of boosting. Ann. of Sta., 28(2):337-407, 2000.
  • 14
    • EID: 33745855044
    • K. Grauman and T. Darrell. The pyramid match kernel: Discriminative classification with sets of image features. In Proc. ICCV, 2:1458-1465, 2005.
  • 15
    • EID: 5044226887
    • B. Han et al. Incremental density approximation and kernel-based Bayesian filtering for object tracking. In Proc. CVPR, pages 638-644, 2004.
  • 16
    • EID: 0009622482
    • J. Hershey and J. Movellan. Audio-vision: Using audio-visual synchrony to locate sounds. In NIPS, 1999.
  • 17
    • EID: 34247257857
    • K. Iwano et al. Audio-visual speech recognition using lip information extracted from side-face images. In EURASIP JASMP, 2007(1):4-4, 2007.
  • 18
    • EID: 0142134976
    • A. Jepson et al. Robust online appearance models for visual tracking. IEEE Trans. PAMI, 25(10):1296-1311, 2003.
  • 19
    • EID: 0001008498
    • R. Kaucic, B. Dalton, and A. Blake. Real-time lip tracking for audio-visual speech recognition applications. In Proc. ECCV, vol. 2, pages 376-387, 1996.
  • 21
    • EID: 37849015208
    • A. Loui et al. Kodak's consumer video benchmark data set: concept definition and annotation. In ACM SIGMM Int'l Workshop on MIR, pages 245-254, 2007.
  • 22
    • EID: 3042535216
    • D. Lowe. Distinctive image features from scale-invariant keypoints. In IJCV, 60(2):91-110, 2004.
  • 23
    • EID: 0002836012
    • B.D. Lucas and T. Kanade. An iterative image registration technique with an application to stereo vision. In Proc. Image Understanding Workshop, pages 121-130, 1981.
  • 24
    • EID: 0027842081
    • S. Mallat and Z. Zhang. Matching pursuits with time-frequency dictionaries. In IEEE Trans. Signal Processing, 41(12):3397-3415, 1993.
  • 25
    • EID: 84898935332
    • O. Maron et al. A framework for multiple-instance learning. In NIPS, 1998.
  • 26
    • EID: 57149147931
    • J.C. Niebles et al. Extracting moving people from internet videos. In Proc. ECCV, pages 527-540, 2008.
  • 28
    • EID: 34547532522
    • J. Ogle and D. Ellis. Fingerprinting to identify repeated sound events in long-duration personal audio recordings. In Proc. ICASSP, pages I-233-236, 2007.
  • 30
    • EID: 0028112849
    • J. Shi and C. Tomasi. Good features to track. In Proc. CVPR, pages 593-600, 1994.
  • 31
    • EID: 0034244889
    • C. Stauffer and W.E.L. Grimson. Learning patterns of activity using real-time tracking. In IEEE Trans. PAMI, 22(8):747-757, 2000.
  • 32
    • EID: 72549084034
    • K. Tieu and P. Viola. Boosting image retrieval. In IJCV, 56(1-2):228-235, 2000.
  • 34
    • EID: 33745844069
    • X.G. Wang et al. Learning Semantic Scene Models by Trajectory Analysis. In Proc. ECCV, pages 110-123, 2006.
  • 35
    • EID: 20444437959
    • Y. Wu et al. Multimodal information fusion for video concept detection. In Proc. ICIP, pages 2391-2394, 2004.
  • 36
    • EID: 33845562302
    • C. Yang et al. Region-based image annotation using asymmetrical support vector machine-based multiple-instance learning. In Proc. CVPR, pages 2057-2063, 2006.
  • 37
    • EID: 37849051635
    • G.Q. Zhao et al. Large head movement tracking using SIFT-based registration. In ACM Multimedia, 2007.
  • 38
    • EID: 59349094120
    • H. Zhou et al. Object tracking using SIFT features and mean shift. Com. Vis. & Ima. Und., 113(3):345-352, 2009.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.