메뉴 건너뛰기




Volumn 16, Issue 6, 2010, Pages 345-379

Multimodal fusion for multimedia analysis: A survey

Author keywords

Multimedia analysis; Multimodal information fusion

Indexed keywords

BASIC CONCEPTS; CONFIDENCE LEVELS; CONTEXTUAL INFORMATION; FUSION METHODOLOGY; FUSION METHODS; FUSION STRATEGIES; MULTI-MEDIA ANALYSIS; MULTI-MODAL FUSION; MULTIMODAL INFORMATION FUSION; MULTIPLE MODALITIES;

EID: 78049469733     PISSN: 09424962     EISSN: None     Source Type: Journal    
DOI: 10.1007/s00530-010-0182-0     Document Type: Article
Times cited : (996)

References (158)
  • 2
    • 78049454011 scopus 로고    scopus 로고
    • Last access date 02 September 2009
    • TRECVID data availability (Last access date 02 September 2009). http://www-nlpir.nist.gov/projects/trecvid/trecvid.data.html
    • TRECVID Data Availability
  • 3
  • 5
    • 33947384963 scopus 로고    scopus 로고
    • Audio-visual biometrics
    • DOI 10.1109/JPROC.2006.886017
    • P.S. Aleksic A.K. Katsaggelos 2006 Audio-visual biometrics Proc. IEEE 94 11 2025 2044 10.1109/JPROC.2006.886017 (Pubitemid 46445568)
    • (2006) Proceedings of the IEEE , vol.94 , Issue.11 , pp. 2025-2044
    • Aleksic, P.S.1    Katsaggelos, A.K.2
  • 6
    • 10944251332 scopus 로고    scopus 로고
    • Particle methods for change detection, system identification, and control
    • Andrieu, C., Doucet, A., Singh, S., Tadic, V.: Particle methods for change detection, system identification, and control. Proc. IEEE 92(3), 423-438 (2004)
    • (2004) Proc. IEEE , vol.92 , Issue.3 , pp. 423-438
    • Andrieu, C.1    Doucet, A.2    Singh, S.3    Tadic, V.4
  • 8
    • 33845300572 scopus 로고    scopus 로고
    • Information assimilation framework for event detection in multimedia surveillance systems
    • DOI 10.1007/s00530-006-0063-8
    • P.K. Atrey M.S. Kankanhalli R. Jain 2006 Information assimilation framework for event detection in multimedia surveillance systems Springer/ACM Multimedia Syst. J. 12 3 239 253 10.1007/s00530-006-0063-8 (Pubitemid 44876288)
    • (2006) Multimedia Systems , vol.12 , Issue.3 , pp. 239-253
    • Atrey, P.K.1    Kankanhalli, M.S.2    Jain, R.3
  • 11
  • 12
    • 0036502392 scopus 로고    scopus 로고
    • Event based indexing of broadcasted sports video by intermodal collaboration
    • DOI 10.1109/6046.985555, PII S1520921002013974
    • N. Babaguchi Y. Kawai T. Kitahashi 2002 Event based indexing of broadcasted sports video by intermodal collaboration IEEE Trans. Multimed. 4 68 75 10.1109/6046.985555 (Pubitemid 34291529)
    • (2002) IEEE Transactions on Multimedia , vol.4 , Issue.1 , pp. 68-75
    • Babaguchi, N.1    Kawai, Y.2    Kitahashi, T.3
  • 13
    • 3242780133 scopus 로고    scopus 로고
    • Personalized abstraction of broadcasted american football video by highlight selection
    • 10.1109/TMM.2004.830811
    • N. Babaguchi Y. Kawai T. Ogura T. Kitahashi 2004 Personalized abstraction of broadcasted american football video by highlight selection IEEE Trans. Multimed. 6 4 575 586 10.1109/TMM.2004.830811
    • (2004) IEEE Trans. Multimed. , vol.6 , Issue.4 , pp. 575-586
    • Babaguchi, N.1    Kawai, Y.2    Ogura, T.3    Kitahashi, T.4
  • 15
    • 0042349407 scopus 로고    scopus 로고
    • A graphical model for audio-visual object tracking
    • 10.1109/TPAMI.2003.1206512
    • M.J. Beal N. Jojic H. Attias 2003 A graphical model for audio-visual object tracking IEEE Trans. Pattern Anal. Mach. Intell. 25 828 836 10.1109/TPAMI.2003.1206512
    • (2003) IEEE Trans. Pattern Anal. Mach. Intell. , vol.25 , pp. 828-836
    • Beal, M.J.1    Jojic, N.2    Attias, H.3
  • 16
    • 0035442720 scopus 로고    scopus 로고
    • Multisensor image segmentation using Dempster-Shafer fusion in Markov fields context
    • DOI 10.1109/36.942557, PII S0196289201054742, Large Scale Passive Microwave Remote Sensing of Soil Moisture
    • A. Bendjebbour Y. Delignon L. Fouque V. Samson W. Pieczynski 2001 Multisensor image segmentation using Dempster-Shafer fusion in markov fields context IEEE Trans. Geosci. Remote Sens. 39 8 1789 1798 10.1109/36.942557 (Pubitemid 32935693)
    • (2001) IEEE Transactions on Geoscience and Remote Sensing , vol.39 , Issue.8 , pp. 1789-1798
    • Bendjebbour, A.1    Delignon, Y.2    Fouque, L.3    Samson, V.4    Pieczynski, W.5
  • 18
    • 0036893996 scopus 로고    scopus 로고
    • Confidence measures for multimodal identity verification
    • 10.1016/S1566-2535(02)00089-1
    • S. Bengio C. Marcel S. Marcel J. Mariethoz 2002 Confidence measures for multimodal identity verification Inf. Fusion 3 4 267 276 10.1016/S1566-2535(02) 00089-1
    • (2002) Inf. Fusion , vol.3 , Issue.4 , pp. 267-276
    • Bengio, S.1    Marcel, C.2    Marcel, S.3    Mariethoz, J.4
  • 20
    • 34347337657 scopus 로고    scopus 로고
    • Audiovisual speech synchrony measure: Application to biometrics
    • Article ID 70186
    • Bredin, H., Chollet, G.: Audiovisual speech synchrony measure: application to biometrics. EURASIP J. Adv. Signal Process. 11 p. (2007). Article ID 70186
    • (2007) EURASIP J. Adv. Signal Process , vol.11
    • Bredin, H.1    Chollet, G.2
  • 23
    • 27144489164 scopus 로고    scopus 로고
    • A Tutorial on Support Vector Machines for Pattern Recognition
    • DOI 10.1023/A:1009715923555
    • C.J.C. Burges 1998 A tutorial on support vector machines for pattern recognition Data Min. Knowl. Discov. 2 2 121 167 10.1023/A:1009715923555 (Pubitemid 128126769)
    • (1998) Data Mining and Knowledge Discovery , vol.2 , Issue.2 , pp. 121-168
    • Burges, C.J.C.1
  • 25
    • 3543148439 scopus 로고    scopus 로고
    • A multi-modal approach to story segmentation for news video
    • DOI 10.1023/A:1023622605600
    • L. Chaisorn T.S. Chua C.H. Lee Y. Zhao H. Xu H. Feng Q. Tian 2003 A multi-modal approach to story segmentation for news video World Wide Web 6 187 208 10.1023/A:1023622605600 (Pubitemid 39020666)
    • (2003) World Wide Web , vol.6 , Issue.2 , pp. 187-208
    • Chaisorn, L.1    Chua, T.-S.2    Lee, C.-H.3
  • 27
    • 78049486291 scopus 로고    scopus 로고
    • Anomaly detection using the dempster-shafer method
    • Las Vegas
    • Chen, Q., Aickelin, U.: Anomaly detection using the dempster-shafer method. In: International Conference on Data Mining, pp. 232-240. Las Vegas (2006)
    • (2006) International Conference on Data Mining , pp. 232-240
    • Chen, Q.1    Aickelin, U.2
  • 28
    • 36248980117 scopus 로고    scopus 로고
    • Audio-visual multimodal fusion for biometric person authentication and liveness verification
    • Sydney
    • Chetty, G., Wagner, M.: Audio-visual multimodal fusion for biometric person authentication and liveness verification. In: NICTA-HCSNet Multimodal User Interaction Workshop, pp. 17-24. Sydney (2006)
    • (2006) NICTA-HCSNet Multimodal User Interaction Workshop , pp. 17-24
    • Chetty, G.1    Wagner, M.2
  • 31
    • 13444310530 scopus 로고    scopus 로고
    • Story boundary detection in large broadcast news video archives: Techniques, experience and trends
    • New York, USA
    • Chua, T.S., Chang, S.F., Chaisorn, L., Hsu, W.: Story boundary detection in large broadcast news video archives: techniques, experience and trends. In: ACM International Conference on Multimedia, pp. 656-659. New York, USA (2004)
    • (2004) ACM International Conference on Multimedia , pp. 656-659
    • Chua, T.S.1    Chang, S.F.2    Chaisorn, L.3    Hsu, W.4
  • 33
    • 0036504051 scopus 로고    scopus 로고
    • A survey of convergence results on particle filtering methods for practitioners
    • DOI 10.1109/78.984773, PII S1053587X02013284
    • D. Crisan A. Doucet 2002 A survey of convergence results on particle filtering methods for practitioners IEEE Trans. Signal Process. 50 3 736 746 10.1109/78.984773 1895071 (Pubitemid 34295113)
    • (2002) IEEE Transactions on Signal Processing , vol.50 , Issue.3 , pp. 736-746
    • Crisan, D.1    Doucet, A.2
  • 34
    • 0034507915 scopus 로고    scopus 로고
    • Look who's talking: Speaker detection using video and audio correlation
    • New York City
    • Cutler, R., Davis, L.: Look who's talking: Speaker detection using video and audio correlation. In: IEEE International Conference on Multimedia and Expo, pp. 1589-1592. New York City (2000)
    • (2000) IEEE International Conference on Multimedia and Expo , pp. 1589-1592
    • Cutler, R.1    Davis, L.2
  • 36
    • 33750573692 scopus 로고    scopus 로고
    • Facial expression recognition with relevance vector machines
    • Amsterdam, The Netherlands
    • Datcu, D., Rothkrantz, L.J.M.: Facial expression recognition with relevance vector machines. In: IEEE International Conference on Multimedia and Expo, pp. 193-196. Amsterdam, The Netherlands (2005)
    • (2005) IEEE International Conference on Multimedia and Expo , pp. 193-196
    • Datcu, D.1    Rothkrantz, L.J.M.2
  • 48
    • 33745805403 scopus 로고    scopus 로고
    • A fast learning algorithm for deep belief nets
    • DOI 10.1162/neco.2006.18.7.1527
    • G.E. Hinton S. Osindero Y. Teh 2006 A fast learning algorithm for deep belief nets Neural Comput. 18 7 1527 1554 1106.68094 10.1162/neco.2006.18.7.1527 2224485 (Pubitemid 44024729)
    • (2006) Neural Computation , vol.18 , Issue.7 , pp. 1527-1554
    • Hinton, G.E.1    Osindero, S.2    Teh, Y.-W.3
  • 49
    • 14944340532 scopus 로고    scopus 로고
    • Implementation and evaluation of a constraint-based multimodal fusion system for speech and 3d pointing gestures
    • State College, PA
    • Holzapfel, H., Nickel, K., Stiefelhagen, R.: Implementation and evaluation of a constraint-based multimodal fusion system for speech and 3d pointing gestures. In: ACM International Conference on Multimodal Interfaces, pp. 175-182. State College, PA (2004)
    • (2004) ACM International Conference on Multimodal Interfaces , pp. 175-182
    • Holzapfel, H.1    Nickel, K.2    Stiefelhagen, R.3
  • 51
    • 78049462391 scopus 로고    scopus 로고
    • Modeling and assessing quality of information in multi-sensor multimedia monitoring systems
    • Hossain, M.A., Atrey, P.K., El Saddik, A.: Modeling and assessing quality of information in multi-sensor multimedia monitoring systems. ACM Trans. Multimed. Comput. Commun. Appl. 7(1) (2011)
    • (2011) ACM Trans. Multimed. Comput. Commun. Appl. , vol.7 , Issue.1
    • Hossain, M.A.1    Atrey, P.K.2    El Saddik, A.3
  • 53
    • 11244288275 scopus 로고    scopus 로고
    • Generative, discriminative, and ensemble learning on multi-modal perceputal fusion toward news stroy segmentation
    • Taipei
    • Hsu, W.H.M., Chang, S.F.: Generative, discriminative, and ensemble learning on multi-modal perceputal fusion toward news stroy segmentation. In: IEEE International Conference on Multimedia and Expos, pp. 1091-1094. Taipei (2004)
    • (2004) IEEE International Conference on Multimedia and Expos , pp. 1091-1094
    • Hsu, W.H.M.1    Chang, S.F.2
  • 55
    • 54049105977 scopus 로고    scopus 로고
    • An attention-based decision fusion scheme for multimedia information retrieval
    • Tokyo, Japan
    • Hua, X.S., Zhang, H.J.: An attention-based decision fusion scheme for multimedia information retrieval. In: The 5th Pacific-Rim Conference on Multimedia. Tokyo, Japan (2004)
    • (2004) The 5th Pacific-Rim Conference on Multimedia
    • Hua, X.S.1    Zhang, H.J.2
  • 58
    • 2342527770 scopus 로고    scopus 로고
    • Discriminative model fusion for semantic concept detection and annotation in video
    • Berkeley
    • Iyengar, G., Nock, H.J., Neti, C.: Discriminative model fusion for semantic concept detection and annotation in video. In: ACM International Conference on Multimedia, pp. 255-258. Berkeley (2003)
    • (2003) ACM International Conference on Multimedia , pp. 255-258
    • Iyengar, G.1    Nock, H.J.2    Neti, C.3
  • 61
    • 25144471298 scopus 로고    scopus 로고
    • Score normalization in multimodal biometric systems
    • DOI 10.1016/j.patcog.2005.01.012, PII S0031320305000592
    • A. Jain K. Nandakumar A. Ross 2005 Score normalization in multimodal biometric systems Pattern Recognit. 38 12 2270 2285 10.1016/j.patcog.2005.01.012 (Pubitemid 41336698)
    • (2005) Pattern Recognition , vol.38 , Issue.12 , pp. 2270-2285
    • Jain, A.1    Nandakumar, K.2    Ross, A.3
  • 64
    • 0037350942 scopus 로고    scopus 로고
    • Optimal sensor selection for discrete event systems with partial observation
    • 10.1109/TAC.2003.809144 1962246
    • S. Jiang R. Kumar H.E. Garcia 2003 Optimal sensor selection for discrete event systems with partial observation IEEE Trans. Automat. Contr. 48 369 381 10.1109/TAC.2003.809144 1962246
    • (2003) IEEE Trans. Automat. Contr. , vol.48 , pp. 369-381
    • Jiang, S.1    Kumar, R.2    Garcia, H.E.3
  • 66
    • 85024429815 scopus 로고
    • A new approach to linear filtering and prediction problems
    • R.E. Kalman 1960 A new approach to linear filtering and prediction problems Trans. ASME J. Basic Eng. 82 Series D 35 45
    • (1960) Trans. ASME J. Basic Eng. , vol.82 , Issue.SERIES D , pp. 35-45
    • Kalman, R.E.1
  • 67
    • 33749527339 scopus 로고    scopus 로고
    • Experiential sampling in multimedia systems
    • DOI 10.1109/TMM.2006.879876, 1703508
    • M.S. Kankanhalli J. Wang R. Jain 2006 Experiential sampling in multimedia systems IEEE Trans. Multimed. 8 5 937 946 10.1109/TMM.2006.879876 (Pubitemid 44523108)
    • (2006) IEEE Transactions on Multimedia , vol.8 , Issue.5 , pp. 937-946
    • Kankanhalli, M.S.1    Wang, J.2    Jain, R.3
  • 68
    • 33749523537 scopus 로고    scopus 로고
    • Experiential sampling on multiple data streams
    • DOI 10.1109/TMM.2006.879875, 1703509
    • M.S. Kankanhalli J. Wang R. Jain 2006 Experiential sampling on multiple data streams IEEE Trans. Multimed. 8 5 947 955 10.1109/TMM.2006.879875 (Pubitemid 44523109)
    • (2006) IEEE Transactions on Multimedia , vol.8 , Issue.5 , pp. 947-955
    • Kankanhalli, M.S.1    Wang, J.2    Jain, R.3
  • 71
    • 34249025710 scopus 로고    scopus 로고
    • Applying logistic regression to relevance feedback in image retrieval systems
    • DOI 10.1016/j.patcog.2007.02.002, PII S0031320307000854
    • T. León P. Zuccarello G. Ayala E. de Ves J. Domingo 2007 Applying logistic regression to relevance feedback in image retrieval systems Pattern Recognit. 40 10 2621 2632 1132.68642 10.1016/j.patcog.2007.02.002 (Pubitemid 46782861)
    • (2007) Pattern Recognition , vol.40 , Issue.10 , pp. 2621-2632
    • Leon, T.1    Zuccarello, P.2    Ayala, G.3    De Ves, E.4    Domingo, J.5
  • 75
    • 14644440462 scopus 로고    scopus 로고
    • Boosting image classification with LDA-based feature combination for digital photograph management
    • DOI 10.1016/j.patcog.2004.11.008, PII S0031320304004066, Image Understanding for Photographs
    • X. Liu L. Zhang M. Li H. Zhang D. Wang 2005 Boosting image classification with lda-based feature combination for digital photograph management Pattern Recognit. 38 6 887 901 10.1016/j.patcog.2004.11.008 (Pubitemid 40308637)
    • (2005) Pattern Recognition , vol.38 , Issue.6 , pp. 887-901
    • Liu, X.1    Zhang, L.2    Li, M.3    Zhang, H.4    Wang, D.5
  • 79
    • 0142163128 scopus 로고    scopus 로고
    • Multisensor fusion and integration: Approaches, applications, and future research directions
    • DOI 10.1109/JSEN.2002.1000251, PII S1530437X02039416
    • R.C. Luo C.C. Yih K.L. Su 2002 Multisensor fusion and integration: Approaches, applications, and future research directions IEEE Sens. J. 2 2 107 119 10.1109/JSEN.2002.1000251 (Pubitemid 44357916)
    • (2002) IEEE Sensors Journal , vol.2 , Issue.2 , pp. 107-119
    • Luo, R.C.1    Yih, C.-C.2    Su, K.L.3
  • 80
    • 36849070411 scopus 로고    scopus 로고
    • Information-theoretic semantic multimedia indexing
    • Amsterdam, The Netherlands
    • Magalhães, J., Rüger, S.: Information-theoretic semantic multimedia indexing. In: International Conference on Image and Video Retrieval, pp. 619-626. Amsterdam, The Netherlands (2007)
    • (2007) International Conference on Image and Video Retrieval , pp. 619-626
    • Magalhães, J.1
  • 83
    • 26444504617 scopus 로고    scopus 로고
    • A comparison of score, rank and probability-based fusion methods for video shot retrieval
    • Singapore
    • McDonald, K., Smeaton, A.F.: A comparison of score, rank and probability-based fusion methods for video shot retrieval. In: International Conference on Image and Video Retrieval, pp. 61-70. Singapore (2005)
    • (2005) International Conference on Image and Video Retrieval , pp. 61-70
    • McDonald, K.1    Smeaton, A.F.2
  • 84
    • 77949507915 scopus 로고    scopus 로고
    • Color image segmentation using the dempster-shafer theory of evidence for the fusion of texture
    • Munich, Germany
    • Mena, J.B., Malpica, J.: Color image segmentation using the dempster-shafer theory of evidence for the fusion of texture. In: International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences, vol. XXXIV, Part 3/W8, pp. 139-144. Munich, Germany (2003)
    • (2003) International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences , vol.34 , Issue.PART 3-W8 , pp. 139-144
    • Mena, J.B.1    Malpica, J.2
  • 85
    • 1842854571 scopus 로고    scopus 로고
    • Continuous audio-visual digit recognition using N -best decision fusion
    • DOI 10.1016/j.inffus.2003.07.001, PII S1566253503000915
    • G.F. Meyer J.B. Mulligan S.M. Wuerger 2004 Continuous audio-visual digit recognition using N-best decision fusion J. Inf. Fusion 5 91 101 10.1016/j.inffus.2003.07.001 (Pubitemid 38488057)
    • (2004) Information Fusion , vol.5 , Issue.2 , pp. 91-101
    • Meyer, G.F.1    Mulligan, J.B.2    Wuerger, S.M.3
  • 87
    • 84978830303 scopus 로고    scopus 로고
    • Joint processing of audio and visual information for multimedia indexing and human-computer interaction
    • Paris, France
    • Neti, C., Maison, B., Senior, A., Iyengar, G., Cuetos, P., Basu, S., Verma, A.: Joint processing of audio and visual information for multimedia indexing and human-computer interaction. In: International Conference RIAO. Paris, France (2000)
    • (2000) International Conference RIAO
    • Neti, C.1    Maison, B.2    Senior, A.3    Iyengar, G.4    Cuetos, P.5    Basu, S.6    Verma, A.7
  • 90
    • 0037700834 scopus 로고    scopus 로고
    • Assessing face and speech consistency for monologue detection in video
    • French Riviera, France
    • Nock, H.J., Iyengar, G., Neti, C.: Assessing face and speech consistency for monologue detection in video. In: ACM International Conference on Multimedia. French Riviera, France (2002)
    • (2002) ACM International Conference on Multimedia
    • Nock, H.J.1    Iyengar, G.2    Neti, C.3
  • 94
    • 0028407685 scopus 로고
    • Optimal sensor selection strategy for discrete-time state estimators
    • 10.1109/7.272256
    • Y. Oshman 1994 Optimal sensor selection strategy for discrete-time state estimators IEEE Trans. Aerosp. Electron. Syst. 30 307 314 10.1109/7.272256
    • (1994) IEEE Trans. Aerosp. Electron. Syst. , vol.30 , pp. 307-314
    • Oshman, Y.1
  • 95
    • 0002126112 scopus 로고    scopus 로고
    • Ten myths of multimodal interaction
    • 10.1145/319382.319398
    • S. Oviatt 1999 Ten myths of multimodal interaction Commun. ACM 42 11 74 81 10.1145/319382.319398
    • (1999) Commun. ACM , vol.42 , Issue.11 , pp. 74-81
    • Oviatt, S.1
  • 96
    • 0002798273 scopus 로고    scopus 로고
    • Taming speech recognition errors within a multimodal interface
    • 10.1145/348941.348979
    • S. Oviatt 2000 Taming speech recognition errors within a multimodal interface Commun. ACM 43 9 45 51 10.1145/348941.348979
    • (2000) Commun. ACM , vol.43 , Issue.9 , pp. 45-51
    • Oviatt, S.1
  • 101
    • 33744914910 scopus 로고    scopus 로고
    • Fade - An integrated approach to multimodal fusion and discourse processing
    • Trento, Italy
    • Pfleger, N.: Fade - an integrated approach to multimodal fusion and discourse processing. In: Dotoral Spotlight at ICMI 2005. Trento, Italy (2005)
    • (2005) : Dotoral Spotlight at ICMI 2005
    • Pfleger, N.1
  • 103
    • 27744533915 scopus 로고    scopus 로고
    • How do correlation and variance of base-experts affect fusion in biometric authentication tasks?
    • Poh, N., Bengio, S.: How do correlation and variance of base-experts affect fusion in biometric authentication tasks? IEEE Trans. Signal Process. 53, 4384-4396 (2005)
    • (2005) IEEE Trans. Signal Process , vol.53 , pp. 4384-4396
    • Poh, N.1    Bengio, S.2
  • 104
    • 27744526744 scopus 로고    scopus 로고
    • Database, protocols and tools for evaluating score-level fusion algorithms in biometric authentication
    • DOI 10.1016/j.patcog.2005.06.011, PII S0031320305002347, Complexity Reduction
    • N. Poh S. Bengio 2006 Database, protocols and tools for evaluating score-level fusion algorithms in biometric authentication Pattern Recognit. 39 2 223 233 10.1016/j.patcog.2005.06.011 (Part Special Issue: Complexity Reduction) (Pubitemid 41586091)
    • (2006) Pattern Recognition , vol.39 , Issue.2 , pp. 223-233
    • Poh, N.1    Bengio, S.2
  • 106
    • 4544290191 scopus 로고    scopus 로고
    • Recent advances in the automatic recognition of audiovisual speech
    • DOI 10.1109/JPROC.2003.817150, Human-Computer Multimodal Interface
    • G. Potamianos C. Neti G. Gravier A. Garg A. Senior 2003 Recent advances in the automatic recognition of audiovisual speech Proc. IEEE 91 9 1306 1326 10.1109/JPROC.2003.817150 (Pubitemid 40890816)
    • (2003) Proceedings of the IEEE , vol.91 , Issue.9 , pp. 1306-1325
    • Potamianos, G.1    Neti, C.2    Gravier, G.3    Garg, A.4    Senior, A.W.5
  • 107
    • 4344680537 scopus 로고    scopus 로고
    • Tracking of multiple moving speakers with multiple microphone arrays
    • 10.1109/TSA.2004.833004
    • I. Potamitis H. Chen G. Tremoulis 2004 Tracking of multiple moving speakers with multiple microphone arrays IEEE Trans. Speech Audio Process. 12 5 520 529 10.1109/TSA.2004.833004
    • (2004) IEEE Trans. Speech Audio Process. , vol.12 , Issue.5 , pp. 520-529
    • Potamitis, I.1    Chen, H.2    Tremoulis, G.3
  • 112
    • 0033556862 scopus 로고    scopus 로고
    • A unifying review of linear gaussian models
    • 10.1162/089976699300016674
    • S. Roweis Z. Ghahramani 1999 A unifying review of linear gaussian models Neural Comput. 11 2 305 345 10.1162/089976699300016674
    • (1999) Neural Comput. , vol.11 , Issue.2 , pp. 305-345
    • Roweis, S.1    Ghahramani, Z.2
  • 113
    • 4544228318 scopus 로고    scopus 로고
    • Identity verification using speech and face information
    • 10.1016/j.dsp.2004.05.001
    • C. Sanderson K.K. Paliwal 2004 Identity verification using speech and face information Digit. Signal Process. 14 5 449 480 10.1016/j.dsp.2004.05.001
    • (2004) Digit. Signal Process. , vol.14 , Issue.5 , pp. 449-480
    • Sanderson, C.1    Paliwal, K.K.2
  • 114
    • 0032660827 scopus 로고    scopus 로고
    • Name-It: Naming and detecting faces in news video
    • 10.1109/93.752960
    • S. Satoh Y. Nakamura T. Kanade 1999 Name-It: naming and detecting faces in news video IEEE Multimed. 6 1 22 35 10.1109/93.752960
    • (1999) IEEE Multimed. , vol.6 , Issue.1 , pp. 22-35
    • Satoh, S.1    Nakamura, Y.2    Kanade, T.3
  • 116
    • 33947354959 scopus 로고    scopus 로고
    • Dempster-shafer theory based finger print classifier fusion with update rule to minimize training time
    • 10.1587/elex.3.429
    • R. Singh M. Vatsa A. Noore S.K. Singh 2006 Dempster-shafer theory based finger print classifier fusion with update rule to minimize training time IEICE Electron. Express 3 20 429 435 10.1587/elex.3.429
    • (2006) IEICE Electron. Express , vol.3 , Issue.20 , pp. 429-435
    • Singh, R.1    Vatsa, M.2    Noore, A.3    Singh, S.K.4
  • 117
    • 2642557514 scopus 로고    scopus 로고
    • Facesync: A linear operator for measuring synchronization of video facial images and audio tracks
    • Slaney, M., Covell, M.: Facesync: A linear operator for measuring synchronization of video facial images and audio tracks. In: Neural Information Processing Society, vol. 13 (2000)
    • (2000) Neural Information Processing Society , vol.13
    • Slaney, M.1    Covell, M.2
  • 118
    • 70349341623 scopus 로고    scopus 로고
    • High-level feature detection from video in TRECVid: A 5-year retrospective of achievements
    • A. Divakaran (eds). Springer Berlin
    • Smeaton, A.F., Over, P., Kraaij, W.: High-level feature detection from video in TRECVid: a 5-year retrospective of achievements. In: Divakaran, A. (ed.) Multimedia Content Analysis, Theory and Applications, pp. 151-174. Springer, Berlin (2009)
    • (2009) Multimedia Content Analysis, Theory and Applications. , pp. 151-174
    • Smeaton, A.F.1    Over, P.2    Kraaij, W.3
  • 120
    • 10044236762 scopus 로고    scopus 로고
    • Multimodal video indexing: A review of the state-of-the-art
    • 10.1023/B:MTAP.0000046380.27575.a5
    • C.G.M. Snoek M. Worring 2005 Multimodal video indexing: a review of the state-of-the-art Multimed. Tools Appl. 25 1 5 35 10.1023/B:MTAP.0000046380. 27575.a5
    • (2005) Multimed. Tools Appl. , vol.25 , Issue.1 , pp. 5-35
    • Snoek, C.G.M.1    Worring, M.2
  • 123
    • 54049157068 scopus 로고    scopus 로고
    • Tech. rep., MIT-CSAIL-TR-2005-057, Massachusetts Institute of Technology, Cambridge, MA
    • Stauffer, C.: Automated audio-visual activity analysis. Tech. rep., MIT-CSAIL-TR-2005-057, Massachusetts Institute of Technology, Cambridge, MA (2005)
    • (2005) Automated Audio-visual Activity Analysis
    • Stauffer, C.1
  • 124
    • 85032766888 scopus 로고    scopus 로고
    • Joint audio-video object localization and tracking: A presentation general methodology
    • DOI 10.1109/79.911196
    • N. Strobel S. Spors R. Rabenstein 2001 Joint audio-video object localization and tracking IEEE Signal Process. Mag. 18 1 22 31 10.1109/79.911196 (Pubitemid 32287668)
    • (2001) IEEE Signal Processing Magazine , vol.18 , Issue.1 , pp. 22-31
    • Strobel, N.1    Spors, S.2    Rabenstein, R.3
  • 128
    • 0032179738 scopus 로고    scopus 로고
    • Models for audiovisual fusion in a noisy-vowel recognition task
    • 10.1023/A:1008014206206
    • P. Teissier A. Guerin-Dugue J.L. Schwartz 1998 Models for audiovisual fusion in a noisy-vowel recognition task J. VLSI Signal Process. 20 25 44 10.1023/A:1008014206206
    • (1998) J. VLSI Signal Process. , vol.20 , pp. 25-44
    • Teissier, P.1    Guerin-Dugue, A.2    Schwartz, J.L.3
  • 131
    • 33748494648 scopus 로고    scopus 로고
    • Multi-sensory and multi-modal fusion for sentient computing
    • DOI 10.1007/s11263-006-7834-8
    • C. Town 2007 Multi-sensory and multi-modal fusion for sentient computing Int. J. Comput. Vis. 71 235 253 10.1007/s11263-006-7834-8 (Pubitemid 44359123)
    • (2007) International Journal of Computer Vision , vol.71 , Issue.2 , pp. 235-253
    • Town, C.1
  • 135
    • 2342648773 scopus 로고    scopus 로고
    • Experience-based sampling technique for multimedia analysis
    • Berkeley, CA
    • Wang, J., Kankanhalli, M.S.: Experience-based sampling technique for multimedia analysis. In: ACM International Conference on Multimedia, pp. 319-322. Berkeley, CA (2003)
    • (2003) ACM International Conference on Multimedia , pp. 319-322
    • Wang, J.1    Kankanhalli, M.S.2
  • 138
    • 85032751556 scopus 로고    scopus 로고
    • Multimedia content analysis: Using both audio and visual clues
    • Wang, Y., Liu, Z., Huang, J.C.: Multimedia content analysis: using both audio and visual clues. In: IEEE Signal Processing Magazine, pp. 12-36 (2000)
    • (2000) IEEE Signal Processing Magazine , pp. 12-36
    • Wang, Y.1    Liu, Z.2    Huang, J.C.3
  • 146
    • 0036609291 scopus 로고    scopus 로고
    • Multi-sensor management for information fusion: Issues and approaches
    • 10.1016/S1566-2535(02)00055-6
    • N. Xiong P. Svensson 2002 Multi-sensor management for information fusion: issues and approaches Inf. Fusion 3 163 186(24) 10.1016/S1566-2535(02)00055-6
    • (2002) Inf. Fusion , vol.3 , pp. 163-18624
    • Xiong, N.1    Svensson, P.2
  • 147
    • 41549084805 scopus 로고    scopus 로고
    • A novel framework for semantic annotation and personalized retrieval of sports video
    • DOI 10.1109/TMM.2008.917346, 4469885
    • C. Xu J. Wang H. Lu Y. Zhang 2008 A novel framework for semantic annotation and personalized retrieval of sports video IEEE Trans. Multimed. 10 3 421 436 10.1109/TMM.2008.917346 (Pubitemid 351459505)
    • (2008) IEEE Transactions on Multimedia , vol.10 , Issue.3 , pp. 421-436
    • Xu, C.1    Wang, J.2    Lu, H.3    Zhang, Y.4
  • 148
    • 56549121057 scopus 로고    scopus 로고
    • Using webcast text for semantic event detection in broadcast sports video
    • 10.1109/TMM.2008.2004912
    • C. Xu Y.F. Zhang G. Zhu Y. Rui H. Lu Q. Huang 2008 Using webcast text for semantic event detection in broadcast sports video IEEE Trans. Multimed. 10 7 1342 1355 10.1109/TMM.2008.2004912
    • (2008) IEEE Trans. Multimed. , vol.10 , Issue.7 , pp. 1342-1355
    • Xu, C.1    Zhang, Y.F.2    Zhu, G.3    Rui, Y.4    Lu, H.5    Huang, Q.6
  • 149
    • 33745151973 scopus 로고    scopus 로고
    • Fusion of AV features and external information sources for event detection in team sports video
    • 10.1145/1126004.1126007
    • H. Xu T.S. Chua 2006 Fusion of AV features and external information sources for event detection in team sports video ACM Trans. Multimed. Comput. Commun. Appl. 2 1 44 67 10.1145/1126004.1126007
    • (2006) ACM Trans. Multimed. Comput. Commun. Appl. , vol.2 , Issue.1 , pp. 44-67
    • Xu, H.1    Chua, T.S.2
  • 151
    • 13444278599 scopus 로고    scopus 로고
    • Learning query-class dependent weights in automatic video retrieval
    • New York, USA
    • Yan, R., Yang, J., Hauptmann, A.: Learning query-class dependent weights in automatic video retrieval. In: ACM International Conference on Multimedia, pp. 548-555. New York, USA (2004)
    • (2004) ACM International Conference on Multimedia , pp. 548-555
    • Yan, R.1    Yang, J.2    Hauptmann, A.3
  • 153
  • 154
    • 33749262426 scopus 로고    scopus 로고
    • Object tracking in an outdoor environment using fusion of features and cameras
    • DOI 10.1016/j.imavis.2005.06.008, PII S0262885605000843
    • Q. Zhou J. Aggarwal 2006 Object tracking in an outdoor environment using fusion of features and cameras Image Vis. Comput. 24 11 1244 1255 10.1016/j.imavis.2005.06.008 (Pubitemid 44485264)
    • (2006) Image and Vision Computing , vol.24 , Issue.11 , pp. 1244-1255
    • Zhou, Q.1    Aggarwal, J.K.2
  • 156
    • 34547210642 scopus 로고    scopus 로고
    • Multimodal fusion using learned text concepts for image categorization
    • Santa Barbara
    • Zhu, Q., Yeh, M.C., Cheng, K.T.: Multimodal fusion using learned text concepts for image categorization. In: ACM International Conference on Multimedia, pp. 211-220. Santa Barbara (2006)
    • (2006) ACM International Conference on Multimedia , pp. 211-220
    • Zhu, Q.1    Yeh, M.C.2    Cheng, K.T.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.