SCOPUS 정보 검색 플랫폼

Multimedia Systems

Volumn 16, Issue 6, 2010, Pages 345-379

Multimodal fusion for multimedia analysis: A survey

(4) Atrey, Pradeep K a Hossain, M Anwar b El Saddik, Abdulmotaleb b Kankanhalli, Mohan S c

a UNIVERSITY OF WINNIPEG (Canada)

b UNIVERSITY OF OTTAWA (Canada)

c NATIONAL UNIVERSITY OF SINGAPORE (Singapore)

Author keywords

Multimedia analysis; Multimodal information fusion

Indexed keywords

BASIC CONCEPTS; CONFIDENCE LEVELS; CONTEXTUAL INFORMATION; FUSION METHODOLOGY; FUSION METHODS; FUSION STRATEGIES; MULTI-MEDIA ANALYSIS; MULTI-MODAL FUSION; MULTIMODAL INFORMATION FUSION; MULTIPLE MODALITIES;

INFORMATION FUSION; SURVEYS;

RESEARCH;

EID: 78049469733 PISSN: 09424962 EISSN: None Source Type: Journal
DOI: 10.1007/s00530-010-0182-0 Document Type: Article

Times cited : (996)

References (158)

1
- 84858664433
- (Last access date 31 August 2009)
- PETS: Performance evaluation of tracking and surveillance (Last access date 31 August 2009). http://www.cvg.rdg.ac.uk/slides/pets.html
- PETS: Performance Evaluation of Tracking and Surveillance

2
- 78049454011
- Last access date 02 September 2009
- TRECVID data availability (Last access date 02 September 2009). http://www-nlpir.nist.gov/projects/trecvid/trecvid.data.html
- TRECVID Data Availability

3
- 0037299467
- Semantic indexing of multimedia content using visual, audio, and text cues
- 10.1155/S1110865703211173
- W. Adams G. Iyengar C. Lin M. Naphade C. Neti H. Nock J. Smith 2003 Semantic indexing of multimedia content using visual, audio, and text cues EURASIP J. Appl. Signal Process. 2003 2 170 185 10.1155/S1110865703211173
- (2003) EURASIP J. Appl. Signal Process. , vol.2003 , Issue.2 , pp. 170-185
- Adams, W.¹ Iyengar, G.² Lin, C.³ Naphade, M.⁴ Neti, C.⁵ Nock, H.⁶ Smith, J.⁷

4
- 78049484206
- A comparative evaluation of fusion strategies for multimodal biometric verification
- Guildford
- Aguilar, J.F., Garcia, J.O., Romero, D.G., Rodriguez, J.G.: A comparative evaluation of fusion strategies for multimodal biometric verification. In: International Conference on Video-Based Biometrie Person Authentication, pp. 830-837. Guildford (2003)
- (2003) International Conference on Video-Based Biometrie Person Authentication , pp. 830-837
- Aguilar, J.F.¹ Garcia, J.O.² Romero, D.G.³ Rodriguez, J.G.⁴

5
- 33947384963
- Audio-visual biometrics
- DOI 10.1109/JPROC.2006.886017
- P.S. Aleksic A.K. Katsaggelos 2006 Audio-visual biometrics Proc. IEEE 94 11 2025 2044 10.1109/JPROC.2006.886017 (Pubitemid 46445568)
- (2006) Proceedings of the IEEE , vol.94 , Issue.11 , pp. 2025-2044
- Aleksic, P.S.¹ Katsaggelos, A.K.²

6
- 10944251332
- Particle methods for change detection, system identification, and control
- Andrieu, C., Doucet, A., Singh, S., Tadic, V.: Particle methods for change detection, system identification, and control. Proc. IEEE 92(3), 423-438 (2004)
- (2004) Proc. IEEE , vol.92 , Issue.3 , pp. 423-438
- Andrieu, C.¹ Doucet, A.² Singh, S.³ Tadic, V.⁴

7
- 33646764222
- Semantic annotation of multimedia using maximum entropy models
- Philadelphia
- Argillander, J., Iyengar, G., Nock, H.: Semantic annotation of multimedia using maximum entropy models. In: International Conference on Accoustic, Speech and Signal Processing, pp. II-153-156. Philadelphia (2005)
- (2005) International Conference on Accoustic, Speech and Signal Processing
- Argillander, J.¹ Iyengar, G.² Nock, H.³

8
- 33845300572
- Information assimilation framework for event detection in multimedia surveillance systems
- DOI 10.1007/s00530-006-0063-8
- P.K. Atrey M.S. Kankanhalli R. Jain 2006 Information assimilation framework for event detection in multimedia surveillance systems Springer/ACM Multimedia Syst. J. 12 3 239 253 10.1007/s00530-006-0063-8 (Pubitemid 44876288)
- (2006) Multimedia Systems , vol.12 , Issue.3 , pp. 239-253
- Atrey, P.K.¹ Kankanhalli, M.S.² Jain, R.³

9
- 33847757153
- Goal-oriented optimal subset selection of correlated multimedia streams
- Atrey, P.K., Kankanhalli, M.S., Oommen, J.B.: Goal-oriented optimal subset selection of correlated multimedia streams. ACM Trans. Multimedia Comput. Commun. Appl. 3(1), 2 (2007)
- (2007) ACM Trans. Multimedia Comput. Commun. Appl. , vol.3 , Issue.1-2
- Atrey, P.K.¹ Kankanhalli, M.S.² Oommen, J.B.³

10
- 84886439155
- Confidence building among correlated streams in multimedia surveillance systems
- Singapore
- Atrey, P.K., Kankanhalli, M.S., El Saddik, A.: Confidence building among correlated streams in multimedia surveillance systems. In: International Conference on Multimedia Modeling, pp. 155-164. Singapore (2007)
- (2007) International Conference on Multimedia Modeling , pp. 155-164
- Atrey, P.K.¹ Kankanhalli, M.S.² El Saddik, A.³

11
- 37149009641
- Classifier fusion for svm-based multimedia semantic indexing
- Rome
- Ayache, S., Quénot, G., Gensel, J.: Classifier fusion for svm-based multimedia semantic indexing. In: The 29th European Conference on Information Retrieval Research, pp. 494-504. Rome (2007)
- (2007) The 29th European Conference on Information Retrieval Research , pp. 494-504
- Ayache, S.¹

12
- 0036502392
- Event based indexing of broadcasted sports video by intermodal collaboration
- DOI 10.1109/6046.985555, PII S1520921002013974
- N. Babaguchi Y. Kawai T. Kitahashi 2002 Event based indexing of broadcasted sports video by intermodal collaboration IEEE Trans. Multimed. 4 68 75 10.1109/6046.985555 (Pubitemid 34291529)
- (2002) IEEE Transactions on Multimedia , vol.4 , Issue.1 , pp. 68-75
- Babaguchi, N.¹ Kawai, Y.² Kitahashi, T.³

13
- 3242780133
- Personalized abstraction of broadcasted american football video by highlight selection
- 10.1109/TMM.2004.830811
- N. Babaguchi Y. Kawai T. Ogura T. Kitahashi 2004 Personalized abstraction of broadcasted american football video by highlight selection IEEE Trans. Multimed. 6 4 575 586 10.1109/TMM.2004.830811
- (2004) IEEE Trans. Multimed. , vol.6 , Issue.4 , pp. 575-586
- Babaguchi, N.¹ Kawai, Y.² Ogura, T.³ Kitahashi, T.⁴

14
- 35248819751
- The BANCA database and evaluation protocol
- Guildford
- Bailly-Bailliére, E., Bengio, S., Bimbot, F., Hamouz, M., Kittler, J., Mariéthoz, J., Matas, J., Messer, K., Popovici, V., Porée, F., Ruíz, B., Thiran, J.P.: The BANCA database and evaluation protocol. In: International Conference on Audio-and Video-Based Biometrie Person Authentication, pp. 625-638. Guildford (2003)
- (2003) International Conference on Audio-and Video-Based Biometrie Person Authentication , pp. 625-638
- Bailly-Bailliére, E.¹

15
- 0042349407
- A graphical model for audio-visual object tracking
- 10.1109/TPAMI.2003.1206512
- M.J. Beal N. Jojic H. Attias 2003 A graphical model for audio-visual object tracking IEEE Trans. Pattern Anal. Mach. Intell. 25 828 836 10.1109/TPAMI.2003.1206512
- (2003) IEEE Trans. Pattern Anal. Mach. Intell. , vol.25 , pp. 828-836
- Beal, M.J.¹ Jojic, N.² Attias, H.³

16
- 0035442720
- Multisensor image segmentation using Dempster-Shafer fusion in Markov fields context
- DOI 10.1109/36.942557, PII S0196289201054742, Large Scale Passive Microwave Remote Sensing of Soil Moisture
- A. Bendjebbour Y. Delignon L. Fouque V. Samson W. Pieczynski 2001 Multisensor image segmentation using Dempster-Shafer fusion in markov fields context IEEE Trans. Geosci. Remote Sens. 39 8 1789 1798 10.1109/36.942557 (Pubitemid 32935693)
- (2001) IEEE Transactions on Geoscience and Remote Sensing , vol.39 , Issue.8 , pp. 1789-1798
- Bendjebbour, A.¹ Delignon, Y.² Fouque, L.³ Samson, V.⁴ Pieczynski, W.⁵

17
- 33745546675
- Multimodal authentication using asynchronous hmms
- Guildford
- Bengio, S.: Multimodal authentication using asynchronous hmms. In: The 4th International Conference Audio and Video Based Biometric Person Authentication, pp. 770-777. Guildford (2003)
- (2003) The 4th International Conference Audio and Video Based Biometric Person Authentication , pp. 770-777
- Bengio, S.¹

18
- 0036893996
- Confidence measures for multimodal identity verification
- 10.1016/S1566-2535(02)00089-1
- S. Bengio C. Marcel S. Marcel J. Mariethoz 2002 Confidence measures for multimodal identity verification Inf. Fusion 3 4 267 276 10.1016/S1566-2535(02) 00089-1
- (2002) Inf. Fusion , vol.3 , Issue.4 , pp. 267-276
- Bengio, S.¹ Marcel, C.² Marcel, S.³ Mariethoz, J.⁴

19
- 34547523367
- Audio-visual speech synchrony measure for talking-face identity verification
- Paris
- Bredin, H., Chollet, G.: Audio-visual speech synchrony measure for talking-face identity verification. In: IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 2, pp. 233-236. Paris (2007)
- (2007) IEEE International Conference on Acoustics, Speech and Signal Processing , vol.2 , pp. 233-236
- Bredin, H.¹ Chollet, G.²

20
- 34347337657
- Audiovisual speech synchrony measure: Application to biometrics
- Article ID 70186
- Bredin, H., Chollet, G.: Audiovisual speech synchrony measure: application to biometrics. EURASIP J. Adv. Signal Process. 11 p. (2007). Article ID 70186
- (2007) EURASIP J. Adv. Signal Process , vol.11
- Bredin, H.¹ Chollet, G.²

21
- 78049474085
- A context representation of surveillance systems
- Brémond, F., Thonnat, M.: A context representation of surveillance systems. In: European Conference on Computer Vision. Orlando (1996)
- (1996) European Conference on Computer Vision. Orlando
- Brémond, F.¹

22
- 0004157072
- Prentice Hall PTR Upper Saddle River, NJ
- Brooks, R.R., Iyengar, S.S.: Multi-sensor Fusion: Fundamentals and Applications with Software. Prentice Hall PTR, Upper Saddle River, NJ (1998)
- (1998) Multi-sensor Fusion: Fundamentals and Applications with Software
- Brooks, R.R.¹ Iyengar, S.S.²

23
- 27144489164
- A Tutorial on Support Vector Machines for Pattern Recognition
- DOI 10.1023/A:1009715923555
- C.J.C. Burges 1998 A tutorial on support vector machines for pattern recognition Data Min. Knowl. Discov. 2 2 121 167 10.1023/A:1009715923555 (Pubitemid 128126769)
- (1998) Data Mining and Knowledge Discovery , vol.2 , Issue.2 , pp. 121-168
- Burges, C.J.C.¹

24
- 48749096852
- Getting the most out of ensemble selection
- Maryland
- Caruana, R., Munson, A., Niculescu-Mizil, A.: Getting the most out of ensemble selection. In: ACM International Conference on on Data Mining, pp. 828-833. Maryland (2006)
- (2006) ACM International Conference on on Data Mining , pp. 828-833
- Caruana, R.¹ Munson, A.² Niculescu-Mizil, A.³

25
- 3543148439
- A multi-modal approach to story segmentation for news video
- DOI 10.1023/A:1023622605600
- L. Chaisorn T.S. Chua C.H. Lee Y. Zhao H. Xu H. Feng Q. Tian 2003 A multi-modal approach to story segmentation for news video World Wide Web 6 187 208 10.1023/A:1023622605600 (Pubitemid 39020666)
- (2003) World Wide Web , vol.6 , Issue.2 , pp. 187-208
- Chaisorn, L.¹ Chua, T.-S.² Lee, C.-H.³

26
- 33646801960
- Combining text and audio-visual features in video indexing
- IEEE Computer Society, Philadelphia
- Chang, S.F., Manmatha, R., Chua, T.S.: Combining text and audio-visual features in video indexing. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 5, pp. 1005-1008. IEEE Computer Society, Philadelphia (2005)
- (2005) IEEE International Conference on Acoustics, Speech, and Signal Processing , vol.5 , pp. 1005-1008
- Chang, S.F.¹ Manmatha, R.² Chua, T.S.³

27
- 78049486291
- Anomaly detection using the dempster-shafer method
- Las Vegas
- Chen, Q., Aickelin, U.: Anomaly detection using the dempster-shafer method. In: International Conference on Data Mining, pp. 232-240. Las Vegas (2006)
- (2006) International Conference on Data Mining , pp. 232-240
- Chen, Q.¹ Aickelin, U.²

28
- 36248980117
- Audio-visual multimodal fusion for biometric person authentication and liveness verification
- Sydney
- Chetty, G., Wagner, M.: Audio-visual multimodal fusion for biometric person authentication and liveness verification. In: NICTA-HCSNet Multimodal User Interaction Workshop, pp. 17-24. Sydney (2006)
- (2006) NICTA-HCSNet Multimodal User Interaction Workshop , pp. 17-24
- Chetty, G.¹ Wagner, M.²

29
- 8644219914
- Query based event extraction along a timeline
- Sheffield
- Chieu, H.L., Lee, Y.K.: Query based event extraction along a timeline. In: International ACM Conference on Research and Development in Information Retrieval, pp. 425-432. Sheffield (2004)
- (2004) International ACM Conference on Research and Development in Information Retrieval , pp. 425-432
- Chieu, H.L.¹ Lee, Y.K.²

30
- 4544386970
- Boosting and structure learning in dynamic bayesian networks for audio-visual speaker detection
- Quebec
- Choudhury, T., Rehg, J.M., Pavlovic, V., Pentland, A.: Boosting and structure learning in dynamic bayesian networks for audio-visual speaker detection. In: The 16th International Conference on Pattern Recognition, vol. 3, pp. 789-794. Quebec (2002)
- (2002) The 16th International Conference on Pattern Recognition , vol.3 , pp. 789-794
- Choudhury, T.¹ Rehg, J.M.² Pavlovic, V.³ Pentland, A.⁴

31
- 13444310530
- Story boundary detection in large broadcast news video archives: Techniques, experience and trends
- New York, USA
- Chua, T.S., Chang, S.F., Chaisorn, L., Hsu, W.: Story boundary detection in large broadcast news video archives: techniques, experience and trends. In: ACM International Conference on Multimedia, pp. 656-659. New York, USA (2004)
- (2004) ACM International Conference on Multimedia , pp. 656-659
- Chua, T.S.¹ Chang, S.F.² Chaisorn, L.³ Hsu, W.⁴

32
- 34548239285
- Multimodal input fusion in human-computer interaction
- Karlsruhe University, Germany
- Corradini, A., Mehta, M., Bernsen, N., Martin, J., Abrilian, S.: Multimodal input fusion in human-computer interaction. In: NATO-ASI Conference on Data Fusion for Situation Monitoring, Incident Detection, Alert and Response Management. Karlsruhe University, Germany (2003)
- (2003) NATO-ASI Conference on Data Fusion for Situation Monitoring, Incident Detection, Alert and Response Management
- Corradini, A.¹ Mehta, M.² Bernsen, N.³ Martin, J.⁴ Abrilian, S.⁵

33
- 0036504051
- A survey of convergence results on particle filtering methods for practitioners
- DOI 10.1109/78.984773, PII S1053587X02013284
- D. Crisan A. Doucet 2002 A survey of convergence results on particle filtering methods for practitioners IEEE Trans. Signal Process. 50 3 736 746 10.1109/78.984773 1895071 (Pubitemid 34295113)
- (2002) IEEE Transactions on Signal Processing , vol.50 , Issue.3 , pp. 736-746
- Crisan, D.¹ Doucet, A.²

34
- 0034507915
- Look who's talking: Speaker detection using video and audio correlation
- New York City
- Cutler, R., Davis, L.: Look who's talking: Speaker detection using video and audio correlation. In: IEEE International Conference on Multimedia and Expo, pp. 1589-1592. New York City (2000)
- (2000) IEEE International Conference on Multimedia and Expo , pp. 1589-1592
- Cutler, R.¹ Davis, L.²

35
- 1842830672
- Audio-visual segmentation and "the cocktail party effect"
- Bejing
- Darrell, T., Fisher III, J.W., Viola, P., Freeman, W.: Audio-visual segmentation and "the cocktail party effect". In: International Conference on Multimodal Interfaces. Bejing (2000)
- (2000) International Conference on Multimodal Interfaces
- Darrell, T.¹ Fisher Iii, J.W.² Viola, P.³ Freeman, W.⁴

36
- 33750573692
- Facial expression recognition with relevance vector machines
- Amsterdam, The Netherlands
- Datcu, D., Rothkrantz, L.J.M.: Facial expression recognition with relevance vector machines. In: IEEE International Conference on Multimedia and Expo, pp. 193-196. Amsterdam, The Netherlands (2005)
- (2005) IEEE International Conference on Multimedia and Expo , pp. 193-196
- Datcu, D.¹ Rothkrantz, L.J.M.²

37
- 0036786158
- On an optimal problem in sensor selection
- 1087.93059 10.1023/A:1019770124060 1926147
- R. Debouk S. Lafortune D. Teneketzis 2002 On an optimal problem in sensor selection J. Discret. Event Dyn. Syst. Theory Appl. 12 417 445 1087.93059 10.1023/A:1019770124060 1926147
- (2002) J. Discret. Event Dyn. Syst. Theory Appl. , vol.12 , pp. 417-445
- Debouk, R.¹ Lafortune, S.² Teneketzis, D.³

38
- 78049476639
- Segmental hidden markov models for view-based sport video analysis
- Minneapolis
- Ding, Y., Fan, G.: Segmental hidden markov models for view-based sport video analysis. In: International Workshop on Semantic Learning Applications in Multimedia. Minneapolis (2007)
- (2007) International Workshop on Semantic Learning Applications in Multimedia
- Ding, Y.¹ Fan, G.²

39
- 0009622481
- Learning joint statistical models for audio-visual fusion and segregation
- Denver
- Fisher-III, J., Darrell, T., Freeman, W., Viola, P.: Learning joint statistical models for audio-visual fusion and segregation. In: Advances in Neural Information Processing Systems, pp. 772-778. Denver (2000)
- (2000) Advances in Neural Information Processing Systems , pp. 772-778
- Fisher-Iii, J.¹ Darrell, T.² Freeman, W.³ Viola, P.⁴

40
- 18844472771
- A distributed sensor network for video surveillance of outdoor environments
- Rochester
- Foresti, G.L., Snidaro, L.: A distributed sensor network for video surveillance of outdoor environments. In: IEEE International Conference on Image Processing. Rochester (2002)
- (2002) IEEE International Conference on Image Processing
- Foresti, G.L.¹ Snidaro, L.²

41
- 78049454364
- From multi-sensor surveillance towards smart interactive spaces
- Baltimore
- Gandetto, M., Marchesotti, L., Sciutto, S., Negroni, D., Regazzoni, C.S.: From multi-sensor surveillance towards smart interactive spaces. In: IEEE International Conference on Multimedia and Expo, pp. I:641-644. Baltimore (2003)
- (2003) IEEE International Conference on Multimedia and Expo , vol.1 , pp. 641-644
- Gandetto, M.¹ Marchesotti, L.² Sciutto, S.³ Negroni, D.⁴ Regazzoni, C.S.⁵

42
- 33749528634
- BIOMET: A multimodal person authentication database including face, voice, fingerprint, hand and signature modalities
- Guildford, UK
- Garcia Salicetti, S., Beumier, C., Chollet, G., Dorizzi, B., les Jardins, J., Lunter, J., Ni, Y., Petrovska Delacretaz, D.: BIOMET: A multimodal person authentication database including face, voice, fingerprint, hand and signature modalities. In: International Conference on Audio-and Video-Based Biometrie Person Authentication, pp. 845-853. Guildford, UK (2003)
- (2003) International Conference on Audio-and Video-Based Biometrie Person Authentication , pp. 845-853
- Garcia Salicetti, S.¹ Beumier, C.² Chollet, G.³ Dorizzi, B.⁴ Les Jardins, J.⁵ Lunter, J.⁶ Ni, Y.⁷ Petrovska Delacretaz, D.⁸

43
- 33645672078
- Kalman filters for audio-video source localization
- Karlsruhe University, Germany
- Gehrig, T., Nickel, K., Ekenel, H., Klee, U., McDonough, J.: Kalman filters for audio-video source localization. In: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp. 118- 121. Karlsruhe University, Germany (2005)
- (2005) IEEE Workshop on Applications of Signal Processing to Audio and Acoustics , pp. 118-121
- Gehrig, T.¹ Nickel, K.² Ekenel, H.³ Klee, U.⁴ McDonough, J.⁵

44
- 84863649145
- Video classification based on low-level feature fusion model
- Antalya, Turkey
- Guironnet, M., Pellerin, D., Rombaut, M.: Video classification based on low-level feature fusion model. In: The 13th European Signal Processing Conference. Antalya, Turkey (2005)
- (2005) The 13th European Signal Processing Conference
- Guironnet, M.¹ Pellerin, D.² Rombaut, M.³

45
- 0030735959
- An introduction to multisensor fusion
- Hall, D.L., Llinas, J.: An introduction to multisensor fusion. In: Proceedings of the IEEE: Special Issues on Data Fusion, vol. 85, no. 1, pp. 6-23 (1997)
- (1997) Proceedings of the IEEE: Special Issues on Data Fusion , vol.85 , Issue.1 , pp. 6-23
- Hall, D.L.¹ Llinas, J.²

46
- 4544300302
- Audio visual graphical models for speech processing
- Montreal
- Hershey, J., Attias, H., Jojic, N., Krisjianson, T.: Audio visual graphical models for speech processing. In: IEEE International Conference on Speech, Acoustics, and Signal Processing, pp. 649-652. Montreal (2004)
- (2004) IEEE International Conference on Speech, Acoustics, and Signal Processing , pp. 649-652
- Hershey, J.¹ Attias, H.² Jojic, N.³ Krisjianson, T.⁴

47
- 84899028297
- Audio-vision: Using audio-visual synchrony to locate sounds
- MIT Press, USA
- Hershey, J., Movellan, J.: Audio-vision: using audio-visual synchrony to locate sounds. In: Advances in Neural Information Processing Systems, pp. 813-819. MIT Press, USA (2000)
- (2000) Advances in Neural Information Processing Systems , pp. 813-819
- Hershey, J.¹ Movellan, J.²

48
- 33745805403
- A fast learning algorithm for deep belief nets
- DOI 10.1162/neco.2006.18.7.1527
- G.E. Hinton S. Osindero Y. Teh 2006 A fast learning algorithm for deep belief nets Neural Comput. 18 7 1527 1554 1106.68094 10.1162/neco.2006.18.7.1527 2224485 (Pubitemid 44024729)
- (2006) Neural Computation , vol.18 , Issue.7 , pp. 1527-1554
- Hinton, G.E.¹ Osindero, S.² Teh, Y.-W.³

49
- 14944340532
- Implementation and evaluation of a constraint-based multimodal fusion system for speech and 3d pointing gestures
- State College, PA
- Holzapfel, H., Nickel, K., Stiefelhagen, R.: Implementation and evaluation of a constraint-based multimodal fusion system for speech and 3d pointing gestures. In: ACM International Conference on Multimodal Interfaces, pp. 175-182. State College, PA (2004)
- (2004) ACM International Conference on Multimodal Interfaces , pp. 175-182
- Holzapfel, H.¹ Nickel, K.² Stiefelhagen, R.³

50
- 67649827397
- Smart mirror for ambient home environment
- Ulm
- Hossain, M.A., Atrey, P.K., El Saddik, A.: Smart mirror for ambient home environment. In: The 3rd IET International Conference on Intelligent Environments, pp. 589-596. Ulm (2007)
- (2007) The 3rd IET International Conference on Intelligent Environments , pp. 589-596
- Hossain, M.A.¹ Atrey, P.K.² El Saddik, A.³

51
- 78049462391
- Modeling and assessing quality of information in multi-sensor multimedia monitoring systems
- Hossain, M.A., Atrey, P.K., El Saddik, A.: Modeling and assessing quality of information in multi-sensor multimedia monitoring systems. ACM Trans. Multimed. Comput. Commun. Appl. 7(1) (2011)
- (2011) ACM Trans. Multimed. Comput. Commun. Appl. , vol.7 , Issue.1
- Hossain, M.A.¹ Atrey, P.K.² El Saddik, A.³

52
- 4544242800
- News video story segmentation using fusion of multi-level multi-modal features in TRECVID 2003
- Montreal, QC
- Hsu, W., Kennedy, L., Huang, C.W., Chang, S.F., Lin, C.Y.: News video story segmentation using fusion of multi-level multi-modal features in TRECVID 2003. In: International Conference on Acoustics Speech and Signal Processing. Montreal, QC (2004)
- (2004) International Conference on Acoustics Speech and Signal Processing
- Hsu, W.¹ Kennedy, L.² Huang, C.W.³ Chang, S.F.⁴ Lin, C.Y.⁵

53
- 11244288275
- Generative, discriminative, and ensemble learning on multi-modal perceputal fusion toward news stroy segmentation
- Taipei
- Hsu, W.H.M., Chang, S.F.: Generative, discriminative, and ensemble learning on multi-modal perceputal fusion toward news stroy segmentation. In: IEEE International Conference on Multimedia and Expos, pp. 1091-1094. Taipei (2004)
- (2004) IEEE International Conference on Multimedia and Expos , pp. 1091-1094
- Hsu, W.H.M.¹ Chang, S.F.²

54
- 50249137600
- Department of Computer Science, University of Essex, UK
- Hu, H., Gan, J.Q.: Sensors and data fusion algorithms in mobile robotics. Technical report, CSM-422, Department of Computer Science, University of Essex, UK (2005)
- (2005) Sensors and Data Fusion Algorithms in Mobile Robotics. Technical Report, CSM-422
- Hu, H.¹ Gan, J.Q.²

55
- 54049105977
- An attention-based decision fusion scheme for multimedia information retrieval
- Tokyo, Japan
- Hua, X.S., Zhang, H.J.: An attention-based decision fusion scheme for multimedia information retrieval. In: The 5th Pacific-Rim Conference on Multimedia. Tokyo, Japan (2004)
- (2004) The 5th Pacific-Rim Conference on Multimedia
- Hua, X.S.¹ Zhang, H.J.²

56
- 33744904962
- The sensor selection problem for bounded uncertainty sensing models
- Los Angeles
- Isler, V., Bajcsy, R.: The sensor selection problem for bounded uncertainty sensing models. In: International Symposium on Information Processing in Sensor Networks, pp. 151-158. Los Angeles (2005)
- (2005) International Symposium on Information Processing in Sensor Networks , pp. 151-158
- Isler, V.¹ Bajcsy, R.²

57
- 0141631499
- Audio-visual synchrony for detection of monologue in video archives
- Hong Kong
- Iyengar, G., Nock, H.J., Neti, C.: Audio-visual synchrony for detection of monologue in video archives. In: IEEE International Conference on Acoustics, Speech, and Signal Processing. Hong Kong (2003)
- (2003) IEEE International Conference on Acoustics, Speech, and Signal Processing
- Iyengar, G.¹ Nock, H.J.² Neti, C.³

58
- 2342527770
- Discriminative model fusion for semantic concept detection and annotation in video
- Berkeley
- Iyengar, G., Nock, H.J., Neti, C.: Discriminative model fusion for semantic concept detection and annotation in video. In: ACM International Conference on Multimedia, pp. 255-258. Berkeley (2003)
- (2003) ACM International Conference on Multimedia , pp. 255-258
- Iyengar, G.¹ Nock, H.J.² Neti, C.³

59
- 84940408597
- Audio/video fusion: A preprocessing step for multimodal person identification
- France
- Jaffre, G., Pinquier, J.: Audio/video fusion: a preprocessing step for multimodal person identification. In: International Workshop on MultiModal User Authentification. Toulouse, France (2006)
- (2006) International Workshop on MultiModal User Authentification. Toulouse
- Jaffre, G.¹ Pinquier, J.²

60
- 37649019645
- Multimodal human computer interaction: A survey
- Beijing
- Jaimes, A., Sebe, N.: Multimodal human computer interaction: a survey. In: IEEE International Workshop on Human Computer Interaction. Beijing (2005)
- (2005) IEEE International Workshop on Human Computer Interaction
- Jaimes, A.¹ Sebe, N.²

61
- 25144471298
- Score normalization in multimodal biometric systems
- DOI 10.1016/j.patcog.2005.01.012, PII S0031320305000592
- A. Jain K. Nandakumar A. Ross 2005 Score normalization in multimodal biometric systems Pattern Recognit. 38 12 2270 2285 10.1016/j.patcog.2005.01.012 (Pubitemid 41336698)
- (2005) Pattern Recognition , vol.38 , Issue.12 , pp. 2270-2285
- Jain, A.¹ Nandakumar, K.² Ross, A.³

62
- 2542465652
- A probabilistic layered framework for integrating multimedia content and context information
- Orlando
- Jasinschi, R.S., Dimitrova, N., McGee, T., Agnihotri, L., Zimmerman, J., Li, D., Louie, J.: A probabilistic layered framework for integrating multimedia content and context information. In: International Conference on Acoustics, Speech and Signal Processing, vol. II, pp. 2057-2060. Orlando (2002)
- (2002) International Conference on Acoustics, Speech and Signal Processing , vol.2 , pp. 2057-2060
- Jasinschi, R.S.¹ Dimitrova, N.² McGee, T.³ Agnihotri, L.⁴ Zimmerman, J.⁵ Li, D.⁶ Louie, J.⁷

63
- 35048876369
- Using maximum entropy for automatic image annotation
- Dublin
- Jeon, J., Manmatha, R.: Using maximum entropy for automatic image annotation. In: International Conference on Image and Video Retrieval, vol. 3115, pp. 24-32. Dublin (2004)
- (2004) International Conference on Image and Video Retrieval , vol.3115 , pp. 24-32
- Jeon, J.¹ Manmatha, R.²

64
- 0037350942
- Optimal sensor selection for discrete event systems with partial observation
- 10.1109/TAC.2003.809144 1962246
- S. Jiang R. Kumar H.E. Garcia 2003 Optimal sensor selection for discrete event systems with partial observation IEEE Trans. Automat. Contr. 48 369 381 10.1109/TAC.2003.809144 1962246
- (2003) IEEE Trans. Automat. Contr. , vol.48 , pp. 369-381
- Jiang, S.¹ Kumar, R.² Garcia, H.E.³

65
- 0031347068
- New extension of the Kalman filter to nonlinear systems
- San Diego
- Julier, S.J., Uhlmann, J.K.: New extension of the Kalman filter to nonlinear systems. In: Signal Processing, Sensor Fusion, and Target Recognition VI, vol. 3068 SPIE, pp. 182-193. San Diego (1997)
- (1997) Signal Processing, Sensor Fusion, and Target Recognition VI, Vol. 3068 SPIE , pp. 182-193
- Julier, S.J.¹ Uhlmann, J.K.²

66
- 85024429815
- A new approach to linear filtering and prediction problems
- R.E. Kalman 1960 A new approach to linear filtering and prediction problems Trans. ASME J. Basic Eng. 82 Series D 35 45
- (1960) Trans. ASME J. Basic Eng. , vol.82 , Issue.SERIES D , pp. 35-45
- Kalman, R.E.¹

67
- 33749527339
- Experiential sampling in multimedia systems
- DOI 10.1109/TMM.2006.879876, 1703508
- M.S. Kankanhalli J. Wang R. Jain 2006 Experiential sampling in multimedia systems IEEE Trans. Multimed. 8 5 937 946 10.1109/TMM.2006.879876 (Pubitemid 44523108)
- (2006) IEEE Transactions on Multimedia , vol.8 , Issue.5 , pp. 937-946
- Kankanhalli, M.S.¹ Wang, J.² Jain, R.³

68
- 33749523537
- Experiential sampling on multiple data streams
- DOI 10.1109/TMM.2006.879875, 1703509
- M.S. Kankanhalli J. Wang R. Jain 2006 Experiential sampling on multiple data streams IEEE Trans. Multimed. 8 5 947 955 10.1109/TMM.2006.879875 (Pubitemid 44523109)
- (2006) IEEE Transactions on Multimedia , vol.8 , Issue.5 , pp. 947-955
- Kankanhalli, M.S.¹ Wang, J.² Jain, R.³

69
- 0032021555
- On combining classifiers
- 10.1109/34.667881
- J. Kittler M. Hatef R.P. Duin J. Matas 1998 On combining classifiers IEEE Trans. Pattern Anal. Mach. Intell. 20 3 226 239 10.1109/34.667881
- (1998) IEEE Trans. Pattern Anal. Mach. Intell. , vol.20 , Issue.3 , pp. 226-239
- Kittler, J.¹ Hatef, M.² Duin, R.P.³ Matas, J.⁴

70
- 14944376431
- Sensor node selection for execution of continuous probabilistic queries in wireless sensor networks
- NY, USA
- Lam, K.Y., Cheng, R., Liang, B.Y., Chau, J.: Sensor node selection for execution of continuous probabilistic queries in wireless sensor networks. In: ACM International Workshop on Video Surveillance and Sensor Networks, pp. 63-71. NY, USA (2004)
- (2004) ACM International Workshop on Video Surveillance and Sensor Networks , pp. 63-71
- Lam, K.Y.¹ Cheng, R.² Liang, B.Y.³ Chau, J.⁴

71
- 34249025710
- Applying logistic regression to relevance feedback in image retrieval systems
- DOI 10.1016/j.patcog.2007.02.002, PII S0031320307000854
- T. León P. Zuccarello G. Ayala E. de Ves J. Domingo 2007 Applying logistic regression to relevance feedback in image retrieval systems Pattern Recognit. 40 10 2621 2632 1132.68642 10.1016/j.patcog.2007.02.002 (Pubitemid 46782861)
- (2007) Pattern Recognition , vol.40 , Issue.10 , pp. 2621-2632
- Leon, T.¹ Zuccarello, P.² Ayala, G.³ De Ves, E.⁴ Domingo, J.⁵

72
- 2342451199
- Multimedia content processing through cross-modal association
- Li, D., Dimitrova, N., Li, M., Sethi, I.K.: Multimedia content processing through cross-modal association. In: ACM International Conference on Multimedia (2003)
- (2003) ACM International Conference on Multimedia
- Li, D.¹ Dimitrova, N.² Li, M.³ Sethi, I.K.⁴

73
- 33745155436
- A bayesian hierarchical model for learning natural scene categories
- Washington
- Li, F.F., Perona, P.: A bayesian hierarchical model for learning natural scene categories. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 524-531. Washington (2005)
- (2005) IEEE Computer Society Conference on Computer Vision and Pattern Recognition , vol.2 , pp. 524-531
- Li, F.F.¹ Perona, P.²

74
- 2342473246
- Audio-visual talking face detection
- Baltimore, MD
- Li, M., Li, D., Dimitrove, N., Sethi, I.K.: Audio-visual talking face detection. In: International Conference on Multimedia and Expo, pp. 473-476. Baltimore, MD (2003)
- (2003) International Conference on Multimedia and Expo , pp. 473-476
- Li, M.¹ Li, D.² Dimitrove, N.³ Sethi, I.K.⁴

75
- 14644440462
- Boosting image classification with LDA-based feature combination for digital photograph management
- DOI 10.1016/j.patcog.2004.11.008, PII S0031320304004066, Image Understanding for Photographs
- X. Liu L. Zhang M. Li H. Zhang D. Wang 2005 Boosting image classification with lda-based feature combination for digital photograph management Pattern Recognit. 38 6 887 901 10.1016/j.patcog.2004.11.008 (Pubitemid 40308637)
- (2005) Pattern Recognition , vol.38 , Issue.6 , pp. 887-901
- Liu, X.¹ Zhang, L.² Li, M.³ Zhang, H.⁴ Wang, D.⁵

76
- 84886438774
- Integrating semantic templates with decision tree for image semantic learning
- Singapore
- Liu, Y., Zhang, D., Lu, G., Tan, A.H.: Integrating semantic templates with decision tree for image semantic learning. In: The 13th International Multimedia Modeling Conference, pp. 185-195. Singapore (2007)
- (2007) The 13th International Multimedia Modeling Conference , pp. 185-195
- Liu, Y.¹ Zhang, D.² Lu, G.³ Tan, A.H.⁴

77
- 21244487870
- Motion estimation using audio and video fusion
- Loh, A., Guan, F., Ge, S.S.: Motion estimation using audio and video fusion. In: International Conference on Control, Automation, Robotics and Vision, vol. 3, pp. 1569-1574 (2004)
- (2004) International Conference on Control, Automation, Robotics and Vision , vol.3 , pp. 1569-1574
- Loh, A.¹ Guan, F.² Ge, S.S.³

78
- 1842830692
- Improved speech recognition using adaptive audio-visual fusion via a stochastic secondary classifier
- Hong Kong
- Lucey, S., Sridharan, S., Chandran, V.: Improved speech recognition using adaptive audio-visual fusion via a stochastic secondary classifier. In: International Symposium on Intelligent Multimedia, Video and Speech Processing, pp. 551-554. Hong Kong (2001)
- (2001) International Symposium on Intelligent Multimedia, Video and Speech Processing , pp. 551-554
- Lucey, S.¹ Sridharan, S.² Chandran, V.³

79
- 0142163128
- Multisensor fusion and integration: Approaches, applications, and future research directions
- DOI 10.1109/JSEN.2002.1000251, PII S1530437X02039416
- R.C. Luo C.C. Yih K.L. Su 2002 Multisensor fusion and integration: Approaches, applications, and future research directions IEEE Sens. J. 2 2 107 119 10.1109/JSEN.2002.1000251 (Pubitemid 44357916)
- (2002) IEEE Sensors Journal , vol.2 , Issue.2 , pp. 107-119
- Luo, R.C.¹ Yih, C.-C.² Su, K.L.³

80
- 36849070411
- Information-theoretic semantic multimedia indexing
- Amsterdam, The Netherlands
- Magalhães, J., Rüger, S.: Information-theoretic semantic multimedia indexing. In: International Conference on Image and Video Retrieval, pp. 619-626. Amsterdam, The Netherlands (2007)
- (2007) International Conference on Image and Video Retrieval , pp. 619-626
- Magalhães, J.¹

81
- 78049456017
- MS Thesis, University of Waterloo, Canada
- Makkook, M.A.: A multimodal sensor fusion architecture for audio-visual speech recognition. MS Thesis, University of Waterloo, Canada (2007)
- (2007) A Multimodal Sensor Fusion Architecture for Audio-visual Speech Recognition
- Makkook, M.A.¹

82
- 0001690761
- Los Alamitos, CA, USA
- Matas, J., Hamouz, M., Jonsson, K., Kittler, J., Li, Y., Kotropoulos, C., Tefas, A., Pitas, I., Tan, T., Yan, H., Smeraldi, F., Capdevielle, N., Gerstner, W., Abdeljaoued, Y., Bigun, J., Ben-Yacoub, S., Mayoraz, E.: Comparison of face verification results on the XM2VTS database. p. 4858. Los Alamitos, CA, USA (2000)
- (2000) Comparison of Face Verification Results on the XM2VTS Database , pp. 4858
- Matas, J.¹ Hamouz, M.² Jonsson, K.³ Kittler, J.⁴ Li, Y.⁵ Kotropoulos, C.⁶ Tefas, A.⁷ Pitas, I.⁸ Tan, T.⁹ Yan, H.¹⁰ Smeraldi, F.¹¹ Capdevielle, N.¹² Gerstner, W.¹³ Abdeljaoued, Y.¹⁴ Bigun, J.¹⁵ Ben-Yacoub, S.¹⁶ Mayoraz, E.¹⁷

83
- 26444504617
- A comparison of score, rank and probability-based fusion methods for video shot retrieval
- Singapore
- McDonald, K., Smeaton, A.F.: A comparison of score, rank and probability-based fusion methods for video shot retrieval. In: International Conference on Image and Video Retrieval, pp. 61-70. Singapore (2005)
- (2005) International Conference on Image and Video Retrieval , pp. 61-70
- McDonald, K.¹ Smeaton, A.F.²

84
- 77949507915
- Color image segmentation using the dempster-shafer theory of evidence for the fusion of texture
- Munich, Germany
- Mena, J.B., Malpica, J.: Color image segmentation using the dempster-shafer theory of evidence for the fusion of texture. In: International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences, vol. XXXIV, Part 3/W8, pp. 139-144. Munich, Germany (2003)
- (2003) International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences , vol.34 , Issue.PART 3-W8 , pp. 139-144
- Mena, J.B.¹ Malpica, J.²

85
- 1842854571
- Continuous audio-visual digit recognition using N -best decision fusion
- DOI 10.1016/j.inffus.2003.07.001, PII S1566253503000915
- G.F. Meyer J.B. Mulligan S.M. Wuerger 2004 Continuous audio-visual digit recognition using N-best decision fusion J. Inf. Fusion 5 91 101 10.1016/j.inffus.2003.07.001 (Pubitemid 38488057)
- (2004) Information Fusion , vol.5 , Issue.2 , pp. 91-101
- Meyer, G.F.¹ Mulligan, J.B.² Wuerger, S.M.³

86
- 0036874999
- Dynamic bayesian networks for audio-visual speech recognition
- A.V. Nefian L. Liang X. Pi X. Liu K. Murphye 2002 Dynamic bayesian networks for audio-visual speech recognition EURASIP J. Appl. Signal Process. 11 1 15
- (2002) EURASIP J. Appl. Signal Process. , vol.11 , pp. 1-15
- Nefian, A.V.¹ Liang, L.² Pi, X.³ Liu, X.⁴ Murphye, K.⁵

87
- 84978830303
- Joint processing of audio and visual information for multimedia indexing and human-computer interaction
- Paris, France
- Neti, C., Maison, B., Senior, A., Iyengar, G., Cuetos, P., Basu, S., Verma, A.: Joint processing of audio and visual information for multimedia indexing and human-computer interaction. In: International Conference RIAO. Paris, France (2000)
- (2000) International Conference RIAO
- Neti, C.¹ Maison, B.² Senior, A.³ Iyengar, G.⁴ Cuetos, P.⁵ Basu, S.⁶ Verma, A.⁷

88
- 21444436897
- An image recognition method based on multiple bp neural networks fusion
- Ni, J.,, Ma, X., Xu, L., Wang, J.: An image recognition method based on multiple bp neural networks fusion. In: IEEE International Conference on Information Acquisition (2004)
- (2004) IEEE International Conference on Information Acquisition
- Ni, J.¹ Ma, X.² Xu, L.³ Wang, J.⁴

89
- 32344434992
- A joint particle filter for audio-visual speaker tracking
- Torento, Italy
- Nickel, K., Gehrig, T., Stiefelhagen, R., McDonough, J.: A joint particle filter for audio-visual speaker tracking. In: The 7th International Conference on Multimodal Interfaces, pp. 61-68. Torento, Italy (2005)
- (2005) The 7th International Conference on Multimodal Interfaces , pp. 61-68
- Nickel, K.¹ Gehrig, T.² Stiefelhagen, R.³ McDonough, J.⁴

90
- 0037700834
- Assessing face and speech consistency for monologue detection in video
- French Riviera, France
- Nock, H.J., Iyengar, G., Neti, C.: Assessing face and speech consistency for monologue detection in video. In: ACM International Conference on Multimedia. French Riviera, France (2002)
- (2002) ACM International Conference on Multimedia
- Nock, H.J.¹ Iyengar, G.² Neti, C.³

91
- 20444478554
- Speaker localisation using audio-visual synchrony: An empirical study
- Urbana, USA
- Nock, H.J., Iyengar, G., Neti, C.: Speaker localisation using audio-visual synchrony: an empirical study. In: International Conference on Image and Video Retrieval. Urbana, USA (2003)
- (2003) International Conference on Image and Video Retrieval
- Nock, H.J.¹ Iyengar, G.² Neti, C.³

92
- 34547231084
- Em detection of common origin of multi-modal cues
- Banff
- Noulas, A.K., Krose, B.J.A.: Em detection of common origin of multi-modal cues. In: International Conference on Multimodal Interfaces, pp. 201-208. Banff (2006)
- (2006) International Conference on Multimodal Interfaces , pp. 201-208
- Noulas, A.K.¹ Krose, B.J.A.²

93
- 10744222590
- Biometric on the internet MCYT baseline corpus: A bimodal biometric database
- 10.1049/ip-vis:20031078
- J. Ortega-Garcia J. Fierrez-Aguilar D. Simon J. Gonzalez M. Faundez-Zanuy V. Espinosa A. Satue I. Hernaez J.J. Igarza C. Vivaracho D. Escudero Q.I. Moro 2003 Biometric on the internet MCYT baseline corpus: a bimodal biometric database IEE Proc. Vis. Image Signal Process. 150 6 395 401 10.1049/ip-vis: 20031078
- (2003) IEE Proc. Vis. Image Signal Process. , vol.150 , Issue.6 , pp. 395-401
- Ortega-Garcia, J.¹ Fierrez-Aguilar, J.² Simon, D.³ Gonzalez, J.⁴ Faundez-Zanuy, M.⁵ Espinosa, V.⁶ Satue, A.⁷ Hernaez, I.⁸ Igarza, J.J.⁹ Vivaracho, C.¹⁰ Escudero, D.¹¹ Moro, Q.I.¹²

94
- 0028407685
- Optimal sensor selection strategy for discrete-time state estimators
- 10.1109/7.272256
- Y. Oshman 1994 Optimal sensor selection strategy for discrete-time state estimators IEEE Trans. Aerosp. Electron. Syst. 30 307 314 10.1109/7.272256
- (1994) IEEE Trans. Aerosp. Electron. Syst. , vol.30 , pp. 307-314
- Oshman, Y.¹

95
- 0002126112
- Ten myths of multimodal interaction
- 10.1145/319382.319398
- S. Oviatt 1999 Ten myths of multimodal interaction Commun. ACM 42 11 74 81 10.1145/319382.319398
- (1999) Commun. ACM , vol.42 , Issue.11 , pp. 74-81
- Oviatt, S.¹

96
- 0002798273
- Taming speech recognition errors within a multimodal interface
- 10.1145/348941.348979
- S. Oviatt 2000 Taming speech recognition errors within a multimodal interface Commun. ACM 43 9 45 51 10.1145/348941.348979
- (2000) Commun. ACM , vol.43 , Issue.9 , pp. 45-51
- Oviatt, S.¹

97
- 0042161151
- Multimodal interfaces
- Jacko, J., Sears, A. (eds.) Lawrence Erlbaum Assoc., NJ
- Oviatt, S.L.: Multimodal interfaces. In: Jacko, J., Sears, A. (eds.) The Human-Computer Interaction Handbook: Fundamentals, Evolving Technologies and Emerging Applications. Lawrence Erlbaum Assoc., NJ (2003)
- (2003) The Human-Computer Interaction Handbook: Fundamentals, Evolving Technologies and Emerging Applications
- Oviatt, S.L.¹

98
- 20444500490
- Optimal sensor selection for video-based target tracking in a wireless sensor network
- Singapore
- Pahalawatta, P., Pappas, T.N., Katsaggelos, A.K.: Optimal sensor selection for video-based target tracking in a wireless sensor network. In: IEEE International Conference on Image Processing, pp. V:3073-3076. Singapore (2004)
- (2004) IEEE International Conference on Image Processing , vol.5 , pp. 3073-3076
- Pahalawatta, P.¹ Pappas, T.N.² Katsaggelos, A.K.³

99
- 0345565782
- Audio-visual speaker tracking with importance particle filter
- Perez, D.G., Lathoud, G., McCowan, I., Odobez, J.M., Moore, D.: Audio-visual speaker tracking with importance particle filter. In: IEEE International Conference on Image Processing (2003)
- (2003) IEEE International Conference on Image Processing
- Perez, D.G.¹ Lathoud, G.² McCowan, I.³ Odobez, J.M.⁴ Moore, D.⁵

100
- 14944375000
- Context based multimodal fusion
- State College
- Pfleger, N.: Context based multimodal fusion. In: ACM International Conference on Multimodal Interfaces, pp. 265-272. State College (2004)
- (2004) ACM International Conference on Multimodal Interfaces , pp. 265-272
- Pfleger, N.¹

101
- 33744914910
- Fade - An integrated approach to multimodal fusion and discourse processing
- Trento, Italy
- Pfleger, N.: Fade - an integrated approach to multimodal fusion and discourse processing. In: Dotoral Spotlight at ICMI 2005. Trento, Italy (2005)
- (2005) : Dotoral Spotlight at ICMI 2005
- Pfleger, N.¹

102
- 44949227080
- Adaptive multimodal fusion by uncertainty compensation
- Pittsburgh
- Pitsikalis, V., Katsamanis, A., Papandreou, G., Maragos, P.: Adaptive multimodal fusion by uncertainty compensation. In: Ninth International Conference on Spoken Language Processing. Pittsburgh (2006)
- (2006) Ninth International Conference on Spoken Language Processing
- Pitsikalis, V.¹ Katsamanis, A.² Papandreou, G.³ Maragos, P.⁴

103
- 27744533915
- How do correlation and variance of base-experts affect fusion in biometric authentication tasks?
- Poh, N., Bengio, S.: How do correlation and variance of base-experts affect fusion in biometric authentication tasks? IEEE Trans. Signal Process. 53, 4384-4396 (2005)
- (2005) IEEE Trans. Signal Process , vol.53 , pp. 4384-4396
- Poh, N.¹ Bengio, S.²

104
- 27744526744
- Database, protocols and tools for evaluating score-level fusion algorithms in biometric authentication
- DOI 10.1016/j.patcog.2005.06.011, PII S0031320305002347, Complexity Reduction
- N. Poh S. Bengio 2006 Database, protocols and tools for evaluating score-level fusion algorithms in biometric authentication Pattern Recognit. 39 2 223 233 10.1016/j.patcog.2005.06.011 (Part Special Issue: Complexity Reduction) (Pubitemid 41586091)
- (2006) Pattern Recognition , vol.39 , Issue.2 , pp. 223-233
- Poh, N.¹ Bengio, S.²

105
- 0034853041
- Hierarchical discriminant features for audio-visual LVSCR
- Salt Lake City
- Potamianos, G., Luettin, J., Neti, C.: Hierarchical discriminant features for audio-visual LVSCR. In: IEEE International Conference on Acoustic Speech and Signal Processing, pp. 165-168. Salt Lake City (2001)
- (2001) IEEE International Conference on Acoustic Speech and Signal Processing , pp. 165-168
- Potamianos, G.¹ Luettin, J.² Neti, C.³

106
- 4544290191
- Recent advances in the automatic recognition of audiovisual speech
- DOI 10.1109/JPROC.2003.817150, Human-Computer Multimodal Interface
- G. Potamianos C. Neti G. Gravier A. Garg A. Senior 2003 Recent advances in the automatic recognition of audiovisual speech Proc. IEEE 91 9 1306 1326 10.1109/JPROC.2003.817150 (Pubitemid 40890816)
- (2003) Proceedings of the IEEE , vol.91 , Issue.9 , pp. 1306-1325
- Potamianos, G.¹ Neti, C.² Gravier, G.³ Garg, A.⁴ Senior, A.W.⁵

107
- 4344680537
- Tracking of multiple moving speakers with multiple microphone arrays
- 10.1109/TSA.2004.833004
- I. Potamitis H. Chen G. Tremoulis 2004 Tracking of multiple moving speakers with multiple microphone arrays IEEE Trans. Speech Audio Process. 12 5 520 529 10.1109/TSA.2004.833004
- (2004) IEEE Trans. Speech Audio Process. , vol.12 , Issue.5 , pp. 520-529
- Potamitis, I.¹ Chen, H.² Tremoulis, G.³

108
- 0030647922
- An approach to speaker identification using multiple classifiers
- Munich, Germany
- Radova, V., Psutka, J.: An approach to speaker identification using multiple classifiers. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2, 1135-1138. Munich, Germany (1997)
- (1997) IEEE International Conference on Acoustics, Speech, and Signal Processing , vol.2 , pp. 1135-1138
- Radova, V.¹ Psutka, J.²

109
- 84986207144
- Extended dempster-shafer theory for multi-system/sensor decision fusion
- Germany
- Rashidi, A., Ghassemian, H.: Extended dempster-shafer theory for multi-system/sensor decision fusion. In: Commission IV Joint Workshop on Challenges in Geospatial Analysis, Integration and Visualization II, pp. 31-37. Germany (2003)
- (2003) Commission IV Joint Workshop on Challenges in Geospatial Analysis, Integration and Visualization , vol.2 , pp. 31-37
- Rashidi, A.¹ Ghassemian, H.²

110
- 78049473905
- MS Thesis, University of Waterloo, Canada
- Reddy, B.S.: Evidential reasoning for multimodal fusion in human computer interaction (2007). MS Thesis, University of Waterloo, Canada
- (2007) Evidential Reasoning for Multimodal Fusion in Human Computer Interaction
- Reddy, B.S.¹

111
- 27644496488
- Technical report., Institute for Systems and Robotics, Lisboa
- Ribeiro, M.I.: Kalman and extended Kalman filters: concept, derivation and properties. Technical report., Institute for Systems and Robotics, Lisboa (2004)
- (2004) Kalman and Extended Kalman Filters: Concept, Derivation and Properties
- Ribeiro, M.I.¹

112
- 0033556862
- A unifying review of linear gaussian models
- 10.1162/089976699300016674
- S. Roweis Z. Ghahramani 1999 A unifying review of linear gaussian models Neural Comput. 11 2 305 345 10.1162/089976699300016674
- (1999) Neural Comput. , vol.11 , Issue.2 , pp. 305-345
- Roweis, S.¹ Ghahramani, Z.²

113
- 4544228318
- Identity verification using speech and face information
- 10.1016/j.dsp.2004.05.001
- C. Sanderson K.K. Paliwal 2004 Identity verification using speech and face information Digit. Signal Process. 14 5 449 480 10.1016/j.dsp.2004.05.001
- (2004) Digit. Signal Process. , vol.14 , Issue.5 , pp. 449-480
- Sanderson, C.¹ Paliwal, K.K.²

114
- 0032660827
- Name-It: Naming and detecting faces in news video
- 10.1109/93.752960
- S. Satoh Y. Nakamura T. Kanade 1999 Name-It: naming and detecting faces in news video IEEE Multimed. 6 1 22 35 10.1109/93.752960
- (1999) IEEE Multimed. , vol.6 , Issue.1 , pp. 22-35
- Satoh, S.¹ Nakamura, Y.² Kanade, T.³

115
- 6344253001
- Confidence fusion
- Siegel, M., Wu, H.: Confidence fusion. In: IEEE International Workshop on Robot Sensing, pp. 96-99 (2004)
- (2004) IEEE International Workshop on Robot Sensing , pp. 96-99
- Siegel, M.¹ Wu, H.²

116
- 33947354959
- Dempster-shafer theory based finger print classifier fusion with update rule to minimize training time
- 10.1587/elex.3.429
- R. Singh M. Vatsa A. Noore S.K. Singh 2006 Dempster-shafer theory based finger print classifier fusion with update rule to minimize training time IEICE Electron. Express 3 20 429 435 10.1587/elex.3.429
- (2006) IEICE Electron. Express , vol.3 , Issue.20 , pp. 429-435
- Singh, R.¹ Vatsa, M.² Noore, A.³ Singh, S.K.⁴

117
- 2642557514
- Facesync: A linear operator for measuring synchronization of video facial images and audio tracks
- Slaney, M., Covell, M.: Facesync: A linear operator for measuring synchronization of video facial images and audio tracks. In: Neural Information Processing Society, vol. 13 (2000)
- (2000) Neural Information Processing Society , vol.13
- Slaney, M.¹ Covell, M.²

118
- 70349341623
- High-level feature detection from video in TRECVid: A 5-year retrospective of achievements
- A. Divakaran (eds). Springer Berlin
- Smeaton, A.F., Over, P., Kraaij, W.: High-level feature detection from video in TRECVid: a 5-year retrospective of achievements. In: Divakaran, A. (ed.) Multimedia Content Analysis, Theory and Applications, pp. 151-174. Springer, Berlin (2009)
- (2009) Multimedia Content Analysis, Theory and Applications. , pp. 151-174
- Smeaton, A.F.¹ Over, P.² Kraaij, W.³

119
- 38049144522
- A review on multimodal video indexing
- Lusanne, Switzerland
- Snoek, C.G.M., Worring, M.: A review on multimodal video indexing. In: IEEE International Conference on Multimedia and Expo, pp. 21-24. Lusanne, Switzerland (2002)
- (2002) IEEE International Conference on Multimedia and Expo , pp. 21-24
- Snoek, C.G.M.¹ Worring, M.²

120
- 10044236762
- Multimodal video indexing: A review of the state-of-the-art
- 10.1023/B:MTAP.0000046380.27575.a5
- C.G.M. Snoek M. Worring 2005 Multimodal video indexing: a review of the state-of-the-art Multimed. Tools Appl. 25 1 5 35 10.1023/B:MTAP.0000046380. 27575.a5
- (2005) Multimed. Tools Appl. , vol.25 , Issue.1 , pp. 5-35
- Snoek, C.G.M.¹ Worring, M.²

121
- 84883126733
- Early versus late fusion in semantic video analysis
- Singapore
- Snoek, C.G.M., Worring, M., Smeulders, A.W.M.: Early versus late fusion in semantic video analysis. In: ACM International Conference on Multimedia, pp. 399-402. Singapore (2005)
- (2005) ACM International Conference on Multimedia , pp. 399-402
- Snoek, C.G.M.¹ Worring, M.² Smeulders, A.W.M.³

122
- 4344686854
- Computational models for experiences in the arts and multimedia
- Berkeley, CA
- Sridharan, H., Sundaram, H., Rikakis, T.: Computational models for experiences in the arts and multimedia. In: The ACM Workshop on Experiential Telepresence. Berkeley, CA (2003)
- (2003) The ACM Workshop on Experiential Telepresence
- Sridharan, H.¹ Sundaram, H.² Rikakis, T.³

123
- 54049157068
- Tech. rep., MIT-CSAIL-TR-2005-057, Massachusetts Institute of Technology, Cambridge, MA
- Stauffer, C.: Automated audio-visual activity analysis. Tech. rep., MIT-CSAIL-TR-2005-057, Massachusetts Institute of Technology, Cambridge, MA (2005)
- (2005) Automated Audio-visual Activity Analysis
- Stauffer, C.¹

124
- 85032766888
- Joint audio-video object localization and tracking: A presentation general methodology
- DOI 10.1109/79.911196
- N. Strobel S. Spors R. Rabenstein 2001 Joint audio-video object localization and tracking IEEE Signal Process. Mag. 18 1 22 31 10.1109/79.911196 (Pubitemid 32287668)
- (2001) IEEE Signal Processing Magazine , vol.18 , Issue.1 , pp. 22-31
- Strobel, N.¹ Spors, S.² Rabenstein, R.³

125
- 34250755977
- Real time audio-visual person tracking
- IEEE Computer Society, Victoria, BC
- Talantzis, F., Pnevmatikakis, A., Polymenakos, L.C.: Real time audio-visual person tracking. In: IEEE 8th Workshop on Multimedia Signal Processing, pp. 243-247. IEEE Computer Society, Victoria, BC (2006)
- (2006) IEEE 8th Workshop on Multimedia Signal Processing , pp. 243-247
- Talantzis, F.¹ Pnevmatikakis, A.² Polymenakos, L.C.³

126
- 33947180907
- Confidence-based data management for personal area sensor networks
- Tatbul, N., Buller, M., Hoyt, R., Mullen, S., Zdonik, S.: Confidence-based data management for personal area sensor networks. In: The Workshop on Data Management for Sensor Networks (2004)
- (2004) The Workshop on Data Management for Sensor Networks
- Tatbul, N.¹ Buller, M.² Hoyt, R.³ Mullen, S.⁴ Zdonik, S.⁵

127
- 47149113260
- Group-based event detection in undersea sensor networks
- San Diego, CA
- Tavakoli, A., Zhang, J., Son, S.H.: Group-based event detection in undersea sensor networks. In: Second International Workshop on Networked Sensing Systems. San Diego, CA (2005)
- (2005) Second International Workshop on Networked Sensing Systems
- Tavakoli, A.¹ Zhang, J.² Son, S.H.³

128
- 0032179738
- Models for audiovisual fusion in a noisy-vowel recognition task
- 10.1023/A:1008014206206
- P. Teissier A. Guerin-Dugue J.L. Schwartz 1998 Models for audiovisual fusion in a noisy-vowel recognition task J. VLSI Signal Process. 20 25 44 10.1023/A:1008014206206
- (1998) J. VLSI Signal Process. , vol.20 , pp. 25-44
- Teissier, P.¹ Guerin-Dugue, A.² Schwartz, J.L.³

129
- 78049458056
- Multilevel context representation using semantic metanetwork
- Rio de Janeiro, Brazil
- Teriyan, V.Y., Puuronen, S.: Multilevel context representation using semantic metanetwork. In: International and Interdisciplinary Conference on Modeling and Using Context, pp. 21-32. Rio de Janeiro, Brazil (1997)
- (1997) International and Interdisciplinary Conference on Modeling and Using Context , pp. 21-32
- Teriyan, V.Y.¹ Puuronen, S.²

130
- 46449095507
- Data modeling strategies for imbalanced learning in visual search
- Beijing
- Tesic, J., Natsev, A., Lexing, X., Smith, J.R.: Data modeling strategies for imbalanced learning in visual search. In: IEEE International Conference on Multimedia and Expo, pp. 1990-1993. Beijing (2007)
- (2007) IEEE International Conference on Multimedia and Expo , pp. 1990-1993
- Tesic, J.¹ Natsev, A.² Lexing, X.³ Smith, J.R.⁴

131
- 33748494648
- Multi-sensory and multi-modal fusion for sentient computing
- DOI 10.1007/s11263-006-7834-8
- C. Town 2007 Multi-sensory and multi-modal fusion for sentient computing Int. J. Comput. Vis. 71 235 253 10.1007/s11263-006-7834-8 (Pubitemid 44359123)
- (2007) International Journal of Computer Vision , vol.71 , Issue.2 , pp. 235-253
- Town, C.¹

132
- 0034844366
- Sequential monte carlo fusion of sound and vision for speaker tracking
- Paris, France
- Vermaak, J., Gangnet, M., Blake, A., Perez, P.: Sequential monte carlo fusion of sound and vision for speaker tracking. In: The 8th IEEE International Conference on Computer Vision, vol. 1, pp. 741-746. Paris, France (2001)
- (2001) The 8th IEEE International Conference on Computer Vision , vol.1 , pp. 741-746
- Vermaak, J.¹ Gangnet, M.² Blake, A.³ Perez, P.⁴

133
- 0029180720
- Learning collection fusion strategies
- Seattle, WA
- Voorhees, E.M., Gupta, N.K., Johnson-Laird, B.: Learning collection fusion strategies. In: ACM International Conference on Research and Development in Information Retrieval, pp. 172-179. Seattle, WA (1995)
- (1995) ACM International Conference on Research and Development in Information Retrieval , pp. 172-179
- Voorhees, E.M.¹ Gupta, N.K.² Johnson-Laird, B.³

134
- 2542430932
- Kluwel Norwell
- Wall, M.E., Rechtsteiner, A., Rocha, L.M.: Singular Value Decomposition and Principal Component Analysis, Chap. 5, pp. 91-109. Kluwel, Norwell, MA (2003)
- (2003) Singular Value Decomposition and Principal Component Analysis, Chap. 5 , pp. 91-109
- Wall, M.E.¹ Rechtsteiner, A.² Rocha, L.M.³

135
- 2342648773
- Experience-based sampling technique for multimedia analysis
- Berkeley, CA
- Wang, J., Kankanhalli, M.S.: Experience-based sampling technique for multimedia analysis. In: ACM International Conference on Multimedia, pp. 319-322. Berkeley, CA (2003)
- (2003) ACM International Conference on Multimedia , pp. 319-322
- Wang, J.¹ Kankanhalli, M.S.²

136
- 84945921158
- Experiential sampling for video surveillance
- Wang, J., Kankanhalli, M.S., Yan, W.Q., Jain, R.: Experiential sampling for video surveillance. In: ACM Workshop on Video Surveillance. Berkeley (2003)
- (2003) ACM Workshop on Video Surveillance. Berkeley
- Wang, J.¹ Kankanhalli, M.S.² Yan, W.Q.³ Jain, R.⁴

137
- 34548749801
- Efficient sampling of training set in large and noisy multimedia data
- DOI 10.1145/1236471.1236473
- S. Wang M. Dash L.T. Chia M. Xu 2007 Efficient sampling of training set in large and noisy multimedia data ACM Trans. Multimed. Comput. Commun. Appl. 3 3 14 10.1145/1236471.1236473 (Pubitemid 47425936)
- (2007) ACM Transactions on Multimedia Computing, Communications and Applications , vol.3 , Issue.3 , pp. 1236473
- Wang, S.¹ Dash, M.² Chia, L.-T.³ Xu, M.⁴

138
- 85032751556
- Multimedia content analysis: Using both audio and visual clues
- Wang, Y., Liu, Z., Huang, J.C.: Multimedia content analysis: using both audio and visual clues. In: IEEE Signal Processing Magazine, pp. 12-36 (2000)
- (2000) IEEE Signal Processing Magazine , pp. 12-36
- Wang, Y.¹ Liu, Z.² Huang, J.C.³

139
- 0013184624
- Image retrieval: Content versus context
- Paris, France
- Westerveld, T.: Image retrieval: content versus context. In: RIAO Content-Based Multimedia Information Access. Paris, France (2000)
- (2000) RIAO Content-Based Multimedia Information Access
- Westerveld, T.¹

140
- 3843072321
- Ph.D. thesis, The Robotics Institute, Carnegie Mellon University, Pittsburgh, PA
- Wu, H.: Sensor data fusion for context-aware computing using dempster-shafer theory. Ph.D. thesis, The Robotics Institute, Carnegie Mellon University, Pittsburgh, PA (2003)
- (2003) Sensor Data Fusion for Context-aware Computing Using Dempster-shafer Theory
- Wu, H.¹

141
- 20444437959
- Multimodal information fusion for video concept detection
- Singapore
- Wu, K., Lin, C.K., Chang, E., Smith, J.R.: Multimodal information fusion for video concept detection. In: IEEE International Conference on Image Processing, pp. 2391-2394. Singapore (2004)
- (2004) IEEE International Conference on Image Processing , pp. 2391-2394
- Wu, K.¹ Lin, C.K.² Chang, E.³ Smith, J.R.⁴

142
- 78651400215
- Multimodal metadata fusion using causal strength
- Singapore
- Wu, Y., Chang, E., Tsengh, B.L.: Multimodal metadata fusion using causal strength. In: ACM International Conference on Multimedia, pp. 872-881. Singapore (2005)
- (2005) ACM International Conference on Multimedia , pp. 872-881
- Wu, Y.¹ Chang, E.² Tsengh, B.L.³

143
- 13444263342
- Optimal multimodal fusion for multimedia data analysis
- ACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia
- Wu, Y., Chang, E.Y., Chang, K.C.C., Smith, J.R.: Optimal multimodal fusion for multimedia data analysis. In: ACM International Conference on Multimedia, pp. 572-579. New York City, NY (2004) (Pubitemid 40211831)
- (2004) ACM Multimedia 2004 - proceedings of the 12th ACM International Conference on Multimedia , pp. 572-579
- Wu, Y.¹ Chang, K.C.-C.² Chang, E.Y.³ Smith, J.R.⁴

144
- 33744958808
- Multi-level fusion of audio and visual features for speaker identification
- Advances in Biometrics - International Conference, ICB 2006, Proceedings
- Wu, Z., Cai, L., Meng, H.: Multi-level fusion of audio and visual features for speaker identification. In: International Conference on Advances in Biometrics, pp. 493-499 (2006) (Pubitemid 43856410)
- (2006) Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , vol.3832 LNCS , pp. 493-499
- Wu, Z.¹ Cai, L.² Meng, H.³

145
- 33646820043
- Layered dynamic mixture model for pattern discovery in asynchronous multi-modal streams
- Philadelphia, USA
- Xie, L., Kennedy, L., Chang, S.F., Divakaran, A., Sun, H., Lin, C.Y.: Layered dynamic mixture model for pattern discovery in asynchronous multi-modal streams. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 2, pp. 1053-1056. Philadelphia, USA (2005)
- (2005) IEEE International Conference on Acoustics, Speech, and Signal Processing , vol.2 , pp. 1053-1056
- Xie, L.¹ Kennedy, L.² Chang, S.F.³ Divakaran, A.⁴ Sun, H.⁵ Lin, C.Y.⁶

146
- 0036609291
- Multi-sensor management for information fusion: Issues and approaches
- 10.1016/S1566-2535(02)00055-6
- N. Xiong P. Svensson 2002 Multi-sensor management for information fusion: issues and approaches Inf. Fusion 3 163 186(24) 10.1016/S1566-2535(02)00055-6
- (2002) Inf. Fusion , vol.3 , pp. 163-18624
- Xiong, N.¹ Svensson, P.²

147
- 41549084805
- A novel framework for semantic annotation and personalized retrieval of sports video
- DOI 10.1109/TMM.2008.917346, 4469885
- C. Xu J. Wang H. Lu Y. Zhang 2008 A novel framework for semantic annotation and personalized retrieval of sports video IEEE Trans. Multimed. 10 3 421 436 10.1109/TMM.2008.917346 (Pubitemid 351459505)
- (2008) IEEE Transactions on Multimedia , vol.10 , Issue.3 , pp. 421-436
- Xu, C.¹ Wang, J.² Lu, H.³ Zhang, Y.⁴

148
- 56549121057
- Using webcast text for semantic event detection in broadcast sports video
- 10.1109/TMM.2008.2004912
- C. Xu Y.F. Zhang G. Zhu Y. Rui H. Lu Q. Huang 2008 Using webcast text for semantic event detection in broadcast sports video IEEE Trans. Multimed. 10 7 1342 1355 10.1109/TMM.2008.2004912
- (2008) IEEE Trans. Multimed. , vol.10 , Issue.7 , pp. 1342-1355
- Xu, C.¹ Zhang, Y.F.² Zhu, G.³ Rui, Y.⁴ Lu, H.⁵ Huang, Q.⁶

149
- 33745151973
- Fusion of AV features and external information sources for event detection in team sports video
- 10.1145/1126004.1126007
- H. Xu T.S. Chua 2006 Fusion of AV features and external information sources for event detection in team sports video ACM Trans. Multimed. Comput. Commun. Appl. 2 1 44 67 10.1145/1126004.1126007
- (2006) ACM Trans. Multimed. Comput. Commun. Appl. , vol.2 , Issue.1 , pp. 44-67
- Xu, H.¹ Chua, T.S.²

150
- 34547487143
- Ph.D. thesis. Carnegie Mellon University
- Yan, R.: Probabilistic models for combining diverse knowledge sources in multimedia retrieval. Ph.D. thesis. Carnegie Mellon University (2006)
- (2006) Probabilistic Models for Combining Diverse Knowledge Sources in Multimedia Retrieval
- Yan, R.¹

151
- 13444278599
- Learning query-class dependent weights in automatic video retrieval
- New York, USA
- Yan, R., Yang, J., Hauptmann, A.: Learning query-class dependent weights in automatic video retrieval. In: ACM International Conference on Multimedia, pp. 548-555. New York, USA (2004)
- (2004) ACM International Conference on Multimedia , pp. 548-555
- Yan, R.¹ Yang, J.² Hauptmann, A.³

152
- 27744496398
- A multimodal fusion system for people detection and tracking
- DOI 10.1002/ima.20046
- M.T. Yang S.C. Wang Y.Y. Lin 2005 A multimodal fusion system for people detection and tracking International Journal of Imaging Systems and Technology 15 131 142 10.1002/ima.20046 (Pubitemid 41633027)
- (2005) International Journal of Imaging Systems and Technology , vol.15 , Issue.2 , pp. 131-142
- Yang, M.-T.¹ Wang, S.-C.² Lin, Y.-Y.³

153
- 1842499650
- Face recognition: A literature survey
- 10.1145/954339.954342
- W. Zhao R. Chellappa P.J. Phillips A. Rosenfeld 2003 Face recognition: a literature survey ACM Comput. Surv. 35 4 399 458 10.1145/954339.954342
- (2003) ACM Comput. Surv. , vol.35 , Issue.4 , pp. 399-458
- Zhao, W.¹ Chellappa, R.² Phillips, P.J.³ Rosenfeld, A.⁴

154
- 33749262426
- Object tracking in an outdoor environment using fusion of features and cameras
- DOI 10.1016/j.imavis.2005.06.008, PII S0262885605000843
- Q. Zhou J. Aggarwal 2006 Object tracking in an outdoor environment using fusion of features and cameras Image Vis. Comput. 24 11 1244 1255 10.1016/j.imavis.2005.06.008 (Pubitemid 44485264)
- (2006) Image and Vision Computing , vol.24 , Issue.11 , pp. 1244-1255
- Zhou, Q.¹ Aggarwal, J.K.²

155
- 33749557228
- Learning with unlabeled data and its application to image retrieval
- Guilin
- Zhou, Z.H.: Learning with unlabeled data and its application to image retrieval. In: The 9th Pacific Rim International Conference on Artificial Intelligence, pp. 5-10. Guilin (2006)
- (2006) The 9th Pacific Rim International Conference on Artificial Intelligence , pp. 5-10
- Zhou, Z.H.¹

156
- 34547210642
- Multimodal fusion using learned text concepts for image categorization
- Santa Barbara
- Zhu, Q., Yeh, M.C., Cheng, K.T.: Multimodal fusion using learned text concepts for image categorization. In: ACM International Conference on Multimedia, pp. 211-220. Santa Barbara (2006)
- (2006) ACM International Conference on Multimedia , pp. 211-220
- Zhu, Q.¹ Yeh, M.C.² Cheng, K.T.³

157
- 0036874485
- Joint audio-visual tracking using particle filters
- Zotkin, D.N., Duraiswami, R., Davis, L.S.: Joint audio-visual tracking using particle filters. EURASIP J. Appl. Signal Process. (11), 1154-1164 (2002)
- (2002) EURASIP J. Appl. Signal Process. , Issue.11 , pp. 1154-1164
- Zotkin, D.N.¹ Duraiswami, R.² Davis, L.S.³

158
- 38849120236
- Tracking humans using multimodal fusion
- Washington
- Zou, X., Bhanu, B.: Tracking humans using multimodal fusion. In: IEEE Conference on Computer Vision and Pattern Recognition, p. 4. Washington (2005)
- (2005) IEEE Conference on Computer Vision and Pattern Recognition , pp. 4
- Zou, X.¹ Bhanu, B.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.