메뉴 건너뛰기




Volumn 25, Issue 1, 2014, Pages 49-69

Multimedia event detection with multimodal feature fusion and temporal concept localization

Author keywords

Classification; Fusion; Machine learning; Multimedia

Indexed keywords


EID: 84894902895     PISSN: 09328092     EISSN: 14321769     Source Type: Journal    
DOI: 10.1007/s00138-013-0525-x     Document Type: Article
Times cited : (41)

References (60)
  • 1
    • 84894900796 scopus 로고    scopus 로고
    • http://www.lscom.org/
  • 3
    • 14344252374 scopus 로고    scopus 로고
    • Multiple kernel learning, conic duality, and the smo algorithm
    • Bach, F.R.; Lanckriet, G.R.G.; Jordan, M.I.: Multiple kernel learning, conic duality, and the smo algorithm. In: ICML (2004)
    • (2004) ICML
    • Bach, F.R.1    Lanckriet, G.R.G.2    Jordan, M.I.3
  • 4
    • 78650994996 scopus 로고    scopus 로고
    • Explicit and implicit concept-based video retrieval with bipartite graph propagation model
    • Bao, L.; Cao, J.; Zhang, Y.; Li, J.; yu Chen, M.; Hauptmann, A.G.: Explicit and implicit concept-based video retrieval with bipartite graph propagation model. In: ACM Multimedia (2010)
    • (2010) ACM Multimedia
    • Bao, L.1    Cao, J.2    Zhang, Y.3    Li, J.4    Yu Chen, M.5    Hauptmann, A.G.6
  • 5
    • 1542287501 scopus 로고    scopus 로고
    • Modeling annotated data
    • Blei, D.M.; Jordan, M.I.: Modeling annotated data. In: ACM SIGIR, pp. 127-134 (2003)
    • (2003) ACM SIGIR , pp. 127-134
    • Blei, D.M.1    Jordan, M.I.2
  • 6
    • 84878582006 scopus 로고    scopus 로고
    • Consumer-level multimedia event detection through unsupervised audio signal modeling
    • Byun, B.; Kim, I.; Siniscalchi, S.M.; Lee, C.H.: Consumer-level multimedia event detection through unsupervised audio signal modeling. In: InterSpeech (2012)
    • (2012) InterSpeech
    • Byun, B.1    Kim, I.2    Siniscalchi, S.M.3    Lee, C.H.4
  • 8
    • 50649087214 scopus 로고    scopus 로고
    • Spatially coherent latent topic model for concurrent segmentation and classification of objects and scenes
    • Cao, L.; Fei-Fei, L.: Spatially coherent latent topic model for concurrent segmentation and classification of objects and scenes. In: ICCV (2007)
    • (2007) ICCV
    • Cao, L.1    Fei-Fei, L.2
  • 9
    • 79955702502 scopus 로고    scopus 로고
    • Libsvm: A library for support vector machines
    • 10.1145/1961189.1961199
    • Chang, C.C.; Lin, C.J.: Libsvm: a library for support vector machines. ACM Trans. Intell. Syst. Technol. 2(3), 27:1-27:27 (2011)
    • (2011) ACM Trans. Intell. Syst. Technol. , vol.2 , Issue.3 , pp. 271-2727
    • Chang, C.C.1    Lin, C.J.2
  • 10
    • 33645146449 scopus 로고    scopus 로고
    • Histograms of oriented gradients for human detection
    • Dalal, N.; Triggs, B.: Histograms of oriented gradients for human detection. In: CVPR (2005)
    • (2005) CVPR
    • Dalal, N.1    Triggs, B.2
  • 11
    • 80052876786 scopus 로고    scopus 로고
    • What does classifying more than 10,000 image categories tell us?
    • Deng, J.; Berg, A.C.; Li, K.; Fei-Fei, L.: What does classifying more than 10,000 image categories tell us? In: ECCV (2010)
    • (2010) ECCV
    • Deng, J.1    Berg, A.C.2    Li, K.3    Fei-Fei, L.4
  • 14
    • 78650976705 scopus 로고    scopus 로고
    • Towards a universal detector by mining concepts with small semantic gaps
    • Feng, J.; Zheng, Y.; Yan, S.: Towards a universal detector by mining concepts with small semantic gaps. In: ACM Multimedia (2010)
    • (2010) ACM Multimedia
    • Feng, J.1    Zheng, Y.2    Yan, S.3
  • 15
    • 80053231413 scopus 로고    scopus 로고
    • Topic models for image annotation and text illustration
    • Feng, Y.; Lapata, M.: Topic models for image annotation and text illustration. In: NAACL HLT (2010)
    • (2010) NAACL HLT
    • Feng, Y.1    Lapata, M.2
  • 16
    • 14344255188 scopus 로고    scopus 로고
    • A mfom learning approach to robust multiclass multi-label text categorization
    • Gao, S.; Wu, W.; Lee, C.H.; Chua, T.S.: A mfom learning approach to robust multiclass multi-label text categorization. In: ICML (2004)
    • (2004) ICML
    • Gao, S.1    Wu, W.2    Lee, C.H.3    Chua, T.S.4
  • 17
    • 77953202699 scopus 로고    scopus 로고
    • TagProp: Discriminative metric learning in nearest neighbor models for image auto-annotation
    • Guillaumin, M.; Mensink, T.; Verbeek, J.; Schmid, C.: TagProp: discriminative metric learning in nearest neighbor models for image auto-annotation. In: ICCV (2009)
    • (2009) ICCV
    • Guillaumin, M.1    Mensink, T.2    Verbeek, J.3    Schmid, C.4
  • 18
    • 64549150985 scopus 로고    scopus 로고
    • Video retrieval based on semantic concepts
    • 10.1109/JPROC.2008.916355
    • Hauptmann, A.G.; Christel, M.G.; Yan, R.: Video retrieval based on semantic concepts. Proc. IEEE 96(4), 602-622 (2008)
    • (2008) Proc. IEEE , vol.96 , Issue.4 , pp. 602-622
    • Hauptmann, A.G.1    Christel, M.G.2    Yan, R.3
  • 19
    • 80054815184 scopus 로고    scopus 로고
    • A survey on visual content-based video indexing and retrieval
    • Hu, W.; Xie, N.; Li, L.; Zeng, X.; Maybank, S.J.: A survey on visual content-based video indexing and retrieval. IEEE Trans. Syst. Man Cybern. Part C 41(6), 797-819 (2011). URL: http://dx.doi.org/10.1109/TSMCC.2011.2109710
    • (2011) IEEE Trans. Syst. Man Cybern. Part C , vol.41 , Issue.6 , pp. 797-819
    • Hu, W.1    Xie, N.2    Li, L.3    Zeng, X.4    Maybank, S.J.5
  • 20
    • 25144471298 scopus 로고    scopus 로고
    • Score normalization in multimodal biometric systems
    • 10.1016/j.patcog.2005.01.012
    • Jain, A.; Nandakumar, K.; Ross, A.: Score normalization in multimodal biometric systems. Pattern Recogn. 38(12), 2270-2285 (2005)
    • (2005) Pattern Recogn. , vol.38 , Issue.12 , pp. 2270-2285
    • Jain, A.1    Nandakumar, K.2    Ross, A.3
  • 21
    • 84871359352 scopus 로고    scopus 로고
    • Leveraging high-level and low-level features for multimedia event detection
    • Jiang, L.; Hauptmann, A.G.; Xiang, G.: Leveraging high-level and low-level features for multimedia event detection. In: ACM-MM (2012)
    • (2012) ACM-MM
    • Jiang, L.1    Hauptmann, A.G.2    Xiang, G.3
  • 22
    • 84455170074 scopus 로고    scopus 로고
    • Audio-visual grouplet: Temporal audio-visual interactions for general video concept classification
    • Jiang, W.; Loui, A.C.: Audio-visual grouplet: temporal audio-visual interactions for general video concept classification. In: ACM Multimedia (2011)
    • (2011) ACM Multimedia
    • Jiang, W.1    Loui, A.C.2
  • 24
    • 0032203256 scopus 로고    scopus 로고
    • Pattern recognition using a family of design algorithm based upon the generalized probabilistic descent method
    • 10.1109/5.726793
    • Katagiri, S.; Juang, B.H.; Lee, C.H.: Pattern recognition using a family of design algorithm based upon the generalized probabilistic descent method. Proc. IEEE 86, 2345-2373 (1998)
    • (1998) Proc. IEEE , vol.86 , pp. 2345-2373
    • Katagiri, S.1    Juang, B.H.2    Lee, C.H.3
  • 25
    • 82455163885 scopus 로고    scopus 로고
    • Optimization of average precision with maximal figure-of-merit learning
    • Kim, I.; Lee, C.H.: Optimization of average precision with maximal figure-of-merit learning. In: MLSP (2011)
    • (2011) MLSP
    • Kim, I.1    Lee, C.H.2
  • 26
    • 84894904810 scopus 로고    scopus 로고
    • Explicit performance metric optimization for fusion-based video retrieval
    • Kim, I.; Oh, S.; Byun, B.; Perera, A.G.A.; Lee, C.H.: Explicit performance metric optimization for fusion-based video retrieval. In: ECCV Workshops, no. 3 (2012)
    • (2012) ECCV Workshops , Issue.3
    • Kim, I.1    Oh, S.2    Byun, B.3    Perera, A.G.A.4    Lee, C.H.5
  • 27
    • 84894904810 scopus 로고    scopus 로고
    • Explicit performance metric optimization for fusion-based video retrieval
    • Kim, I.; Oh, S.; Byun, B.; Perera, A.G.A.; Lee, C.H.: Explicit performance metric optimization for fusion-based video retrieval. In: ECCV Workshop (2012)
    • (2012) ECCV Workshop
    • Kim, I.1    Oh, S.2    Byun, B.3    Perera, A.G.A.4    Lee, C.H.5
  • 28
    • 0032021555 scopus 로고    scopus 로고
    • On combining classifiers
    • 10.1109/34.667881
    • Kittler, J.; Hatef, M.; Duin, R.P.W.; Matas, J.: On combining classifiers. PAMI 20, 226-239 (1998)
    • (1998) PAMI , vol.20 , pp. 226-239
    • Kittler, J.1    Hatef, M.2    Duin, R.P.W.3    Matas, J.4
  • 29
    • 84898426452 scopus 로고    scopus 로고
    • A spatio-temporal descriptor based on 3d-gradients
    • Klaser, A.; Marszalek, M.; Schmid, C.: A spatio-temporal descriptor based on 3d-gradients. In: BMVC (2008)
    • (2008) BMVC
    • Klaser, A.1    Marszalek, M.2    Schmid, C.3
  • 31
    • 80052874098 scopus 로고    scopus 로고
    • Learning hierarchical spatio-temporal features for action recognition with independent subspace analysis
    • Le, Q.; Zou, W.; Yeung, S.; Ng, A.: Learning hierarchical spatio-temporal features for action recognition with independent subspace analysis. In: CVPR (2011)
    • (2011) CVPR
    • Le, Q.1    Zou, W.2    Yeung, S.3    Ng, A.4
  • 32
    • 0023800699 scopus 로고
    • A segment model based approach to speech recognition
    • Lee, C.H.; Soong, F.K.; Juang, B.H.: A segment model based approach to speech recognition. In: ICASSP (1988)
    • (1988) ICASSP
    • Lee, C.H.1    Soong, F.K.2    Juang, B.H.3
  • 33
    • 77955746721 scopus 로고    scopus 로고
    • Audio-based semantic concept classification for consumer video
    • Lee, K.; Ellis, D.P.W.: Audio-based semantic concept classification for consumer video. IEEE Trans. Audio Speech Lang. Process. 18(6), 1406-1416 (2010)
    • (2010) IEEE Trans. Audio Speech Lang. Process. , vol.18 , Issue.6 , pp. 1406-1416
    • Lee, K.1    Ellis, D.P.W.2
  • 34
    • 85162513516 scopus 로고    scopus 로고
    • Object bank: A high-level image representation for scene classification & semantic feature sparsification
    • Li, L.J.; Su, H.; Xing, E.P.; Li, F.F.: Object bank: A high-level image representation for scene classification & semantic feature sparsification. In: NIPS (2010)
    • (2010) NIPS
    • Li, L.J.1    Su, H.2    Xing, E.P.3    Li, F.F.4
  • 35
    • 84887358015 scopus 로고    scopus 로고
    • Local expert forest of score fusion for video event classification
    • Liu, J.; McCloskey, S.; Liu, Y.: Local expert forest of score fusion for video event classification. In: ECCV (2012)
    • (2012) ECCV
    • Liu, J.1    McCloskey, S.2    Liu, Y.3
  • 36
    • 84856667921 scopus 로고    scopus 로고
    • Linear dependency modeling for feature fusion
    • Ma, A.J.; Yuen, P.C.: Linear dependency modeling for feature fusion. In: ICCV, pp. 2041-2048 (2011)
    • (2011) ICCV , pp. 2041-2048
    • Ma, A.J.1    Yuen, P.C.2
  • 37
    • 51949098112 scopus 로고    scopus 로고
    • Classification using intersection kernel support vector machines is efficient
    • Maji, S.; Berg, A.C.; Malik, J.: Classification using intersection kernel support vector machines is efficient. In: CVPR (2008)
    • (2008) CVPR
    • Maji, S.1    Berg, A.C.2    Malik, J.3
  • 38
    • 70449580491 scopus 로고    scopus 로고
    • A new baseline for image annotation
    • Makadia, A.; Pavlovic, V.; Kumar, S.: A new baseline for image annotation. In: ECCV (2008)
    • (2008) ECCV
    • Makadia, A.1    Pavlovic, V.2    Kumar, S.3
  • 40
    • 31844433358 scopus 로고    scopus 로고
    • Predicting good probabilities with supervised learning
    • Niculescu-Mizil, A.; Caruana, R.: Predicting good probabilities with supervised learning. In: ICML (2005)
    • (2005) ICML
    • Niculescu-Mizil, A.1    Caruana, R.2
  • 41
    • 0035328421 scopus 로고    scopus 로고
    • Modeling the shape of the scene: A holistic representation of the spatial envelope
    • 10.1023/A:1011139631724 0990.68601
    • Oliva, A.; Torralba, A.: Modeling the shape of the scene: a holistic representation of the spatial envelope. Int. J. Comput. Vis. 42(3), 145-175 (2001)
    • (2001) Int. J. Comput. Vis. , vol.42 , Issue.3 , pp. 145-175
    • Oliva, A.1    Torralba, A.2
  • 44
    • 77955999239 scopus 로고    scopus 로고
    • Topic regression multi-model latent dirichlet allocation for image annotation
    • Putthividhya, D.; Attias, H.T.; Nagarajan, S.S.: Topic regression multi-model latent dirichlet allocation for image annotation. In: CVPR (2010)
    • (2010) CVPR
    • Putthividhya, D.1    Attias, H.T.2    Nagarajan, S.S.3
  • 45
    • 70349195978 scopus 로고    scopus 로고
    • On the importance of modeling temporal information in music tag annotation
    • Reed, J.; Lee, C.H.: On the importance of modeling temporal information in music tag annotation. In: ICASSP (2009)
    • (2009) ICASSP
    • Reed, J.1    Lee, C.H.2
  • 46
    • 77955426203 scopus 로고    scopus 로고
    • Evaluating color descriptors for object and scene recognition
    • 10.1109/TPAMI.2009.154
    • van de Sande, K.E.A.; Gevers, T.; Snoek, C.G.M.: Evaluating color descriptors for object and scene recognition. PAMI 32(9), 1582-1596 (2010)
    • (2010) PAMI , vol.32 , Issue.9 , pp. 1582-1596
    • Van De Sande, K.E.A.1    Gevers, T.2    Snoek, C.G.M.3
  • 47
    • 78149315356 scopus 로고    scopus 로고
    • Robust fusion: Extreme value theory for recognition score normalization
    • Scheirer, W.; Rocha, A.; Micheals, R.; Boult, T.: Robust fusion: extreme value theory for recognition score normalization. In: ECCV, pp. 481-495 (2010)
    • (2010) ECCV , pp. 481-495
    • Scheirer, W.1    Rocha, A.2    Micheals, R.3    Boult, T.4
  • 48
    • 84908571977 scopus 로고    scopus 로고
    • Multimedia semantic indexing using model vectors
    • Smith, J.; Naphade, M.; Natsev, A.: Multimedia semantic indexing using model vectors. In: ICME (2003)
    • (2003) ICME
    • Smith, J.1    Naphade, M.2    Natsev, A.3
  • 50
    • 84866707906 scopus 로고    scopus 로고
    • Evaluation of low-level features and their combinations for complex event detection in open source videos
    • Tamrakar, A.; Ali, S.; Yu, Q.; Liu, J.; Javed, O.; Divakaran, A.; Cheng, H.; Sawhney, H.S.: Evaluation of low-level features and their combinations for complex event detection in open source videos. In: CVPR (2012)
    • (2012) CVPR
    • Tamrakar, A.1    Ali, S.2    Yu, Q.3    Liu, J.4    Javed, O.5    Divakaran, A.6    Cheng, H.7    Sawhney, H.S.8
  • 51
    • 67650999671 scopus 로고    scopus 로고
    • Optimal classifier fusion in a non-bayesian probabilistic framework
    • 10.1109/TPAMI.2008.224
    • Terrades, O.R.; Valveny, E.; Tabbone, S.: Optimal classifier fusion in a non-bayesian probabilistic framework. PAMI 31(9), 1630-1644 (2009)
    • (2009) PAMI , vol.31 , Issue.9 , pp. 1630-1644
    • Terrades, O.R.1    Valveny, E.2    Tabbone, S.3
  • 52
    • 78049411640 scopus 로고    scopus 로고
    • An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition
    • Tsao, Y.; Sun, H.; Li, H.; Lee, C.H.: An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition. In: ICASSP (2010)
    • (2010) ICASSP
    • Tsao, Y.1    Sun, H.2    Li, H.3    Lee, C.H.4
  • 55
    • 70450178502 scopus 로고    scopus 로고
    • Simultaneous image classification and annotation
    • Wang, C.; Blei, D.M.; Fei-Fei, L.: Simultaneous image classification and annotation. In: CVPR (2009)
    • (2009) CVPR
    • Wang, C.1    Blei, D.M.2    Fei-Fei, L.3
  • 56
    • 70450216856 scopus 로고    scopus 로고
    • Max-margin hidden conditional random fields for human action recognition
    • Wang, Y.; Mori, G.: Max-margin hidden conditional random fields for human action recognition. In: CVPR (2009)
    • (2009) CVPR
    • Wang, Y.1    Mori, G.2
  • 57
    • 77955988947 scopus 로고    scopus 로고
    • SUN database: Large-scale scene recognition from abbey to zoo
    • Xiao, J.; Hays, J.; Ehinger, K.; Oliva, A.; Torralba, A.: SUN database: large-scale scene recognition from abbey to zoo. In: CVPR (2010)
    • (2010) CVPR
    • Xiao, J.1    Hays, J.2    Ehinger, K.3    Oliva, A.4    Torralba, A.5
  • 59
    • 84866712367 scopus 로고    scopus 로고
    • Robust late fusion with rank minimization
    • Ye, G.; Liu, D.; Jhuo, I.H.; Chang, S.F.: Robust late fusion with rank minimization. In: CVPR (2012)
    • (2012) CVPR
    • Ye, G.1    Liu, D.2    Jhuo, I.H.3    Chang, S.F.4
  • 60
    • 84885614497 scopus 로고    scopus 로고
    • Text classification with kernels on the multinomial manifold
    • Zhang, D.; Chen, X.; Lee, W.S.: Text classification with kernels on the multinomial manifold. In: SIGIR (2005)
    • (2005) SIGIR
    • Zhang, D.1    Chen, X.2    Lee, W.S.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.