메뉴 건너뛰기




Volumn 2003, Issue 2, 2003, Pages 170-185

Semantic indexing of multimedia content using visual, audio, and text cues

Author keywords

GMM; HMM; Multimodal information fusion; Query by keywords; Spoken document retrieval; Statistical modeling of multimedia; SVM; Video event detection; Video indexing and retrieval; Video TREC

Indexed keywords

INFORMATION ANALYSIS; LEARNING SYSTEMS; MARKOV PROCESSES; SEMANTICS; STATISTICAL METHODS;

EID: 0037299467     PISSN: 11108657     EISSN: None     Source Type: Journal    
DOI: 10.1155/S1110865703211173     Document Type: Review
Times cited : (117)

References (33)
  • 1
    • 0030389403 scopus 로고    scopus 로고
    • VisualSEEk: A fully automated content-based image query system
    • ACM, Boston, Mass, USA, November
    • J. R. Smith and S.-F. Chang, "VisualSEEk: a fully automated content-based image query system," in Proc. 4th ACM International Conference on Multimedia, pp. 87-98, ACM, Boston, Mass, USA, November 1996.
    • (1996) Proc. 4th ACM International Conference on Multimedia , pp. 87-98
    • Smith, J.R.1    Chang, S.-F.2
  • 2
    • 0032318109 scopus 로고    scopus 로고
    • Probabilistic multimedia objects (multijects): A novel approach to video indexing and retrieval in multimedia systems
    • IEEE, Chicago, 111, USA, October
    • M. Naphade, T. Kristjansson, B. Frey, and T. S. Huang, "Probabilistic multimedia objects (multijects): a novel approach to video indexing and retrieval in multimedia systems," in Proc. IEEE International Conference on Image Processing, vol. 3, pp. 536-540, IEEE, Chicago, 111, USA, October 1998.
    • (1998) Proc. IEEE International Conference on Image Processing , vol.3 , pp. 536-540
    • Naphade, M.1    Kristjansson, T.2    Frey, B.3    Huang, T.S.4
  • 3
    • 0032314489 scopus 로고    scopus 로고
    • Semantic visual templates - Linking features to semantics
    • IEEE, Chicago, 111, USA, October
    • S. F. Chang, W. Chen, and H. Sundaram, "Semantic visual templates - linking features to semantics," in Proc, IEEE International Conference on Image Processing, vol. 3, pp. 531-535, IEEE, Chicago, 111, USA, October 1998.
    • (1998) Proc, IEEE International Conference on Image Processing , vol.3 , pp. 531-535
    • Chang, S.F.1    Chen, W.2    Sundaram, H.3
  • 4
  • 5
    • 0033897763 scopus 로고    scopus 로고
    • An integrated approach to multimodal media content analysis
    • Storage and Retrieval for Media Databases 2000, SPIE, San Jose, Calif, USA, January
    • T. Zhang and C. Kuo, "An integrated approach to multimodal media content analysis," in Storage and Retrieval for Media Databases 2000, vol. 3972 of SPIE Proceedings, pp. 506-517, SPIE, San Jose, Calif, USA, January 2000.
    • (2000) SPIE Proceedings , vol.3972 , pp. 506-517
    • Zhang, T.1    Kuo, C.2
  • 6
    • 0003794341 scopus 로고    scopus 로고
    • Ph.D. thesis, MIT Department of Electrical Engineering and Computer Science, Cambridge, Mass, USA
    • D. Ellis, Prediction-driven computational auditory scene analysis, Ph.D. thesis, MIT Department of Electrical Engineering and Computer Science, Cambridge, Mass, USA, 1996.
    • (1996) Prediction-Driven Computational Auditory Scene Analysis
    • Ellis, D.1
  • 7
    • 0034857154 scopus 로고    scopus 로고
    • Learning the semantics of words and pictures
    • IEEE, Vancouver, Canada, July
    • K. Barnard and D. Forsyth, "Learning the semantics of words and pictures," in Proc. International Conf. on Computer Vision, vol. 2, pp. 408-415, IEEE, Vancouver, Canada, July 2001.
    • (2001) Proc. International Conf. on Computer Vision , vol.2 , pp. 408-415
    • Barnard, K.1    Forsyth, D.2
  • 8
    • 0012532765 scopus 로고    scopus 로고
    • Reduced-rank spectra and minimum-entropy priors as consistent and reliable cues for generalized sound recognition
    • Aalborg, Denmark, September
    • M. A. Casey, "Reduced-rank spectra and minimum-entropy priors as consistent and reliable cues for generalized sound recognition," in Proc. Eurospeech, Aalborg, Denmark, September 2001.
    • (2001) Proc. Eurospeech
    • Casey, M.A.1
  • 9
    • 0030705367 scopus 로고    scopus 로고
    • Hidden Markov model parsing of video programs
    • IEEE, Munich, Germany, April
    • W. Wolf, "Hidden Markov model parsing of video programs," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, vol. 4, pp. 2609-2611, IEEE, Munich, Germany, April 1997.
    • (1997) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , vol.4 , pp. 2609-2611
    • Wolf, W.1
  • 10
    • 0032223813 scopus 로고    scopus 로고
    • Models for automatic classification of video sequences
    • Storage and Retrieval for Image and Video Databases VI, SPIE, San Jose, Calif, USA, January
    • G. Iyengar and A. B. Lippman, "Models for automatic classification of video sequences," in Storage and Retrieval for Image and Video Databases VI, vol. 3312 of SPIE Proceedings, pp. 216-227, SPIE, San Jose, Calif, USA, January 1998.
    • (1998) SPIE Proceedings , vol.3312 , pp. 216-227
    • Iyengar, G.1    Lippman, A.B.2
  • 11
    • 0033316687 scopus 로고    scopus 로고
    • Probabilistic analysis and extraction of video content
    • IEEE, Kobe, Japan, October
    • A. M. Ferman and A. M. Tekalp, "Probabilistic analysis and extraction of video content," in Proc. IEEE International Conference on Image Processing, vol. 2, pp. 91-95, IEEE, Kobe, Japan, October 1999.
    • (1999) Proc. IEEE International Conference on Image Processing , vol.2 , pp. 91-95
    • Ferman, A.M.1    Tekalp, A.M.2
  • 12
    • 0032311673 scopus 로고    scopus 로고
    • Bayesian modeling of video editing and structure: Semantic features for video summarization and browsing
    • IEEE, Chicago, Ill, USA
    • N. Vasconcelos and A. Lippman, "Bayesian modeling of video editing and structure: semantic features for video summarization and browsing," in Proc. IEEE International Conference on Image Processing, vol. 3, pp. 153-157, IEEE, Chicago, Ill, USA, 1998.
    • (1998) Proc. IEEE International Conference on Image Processing , vol.3 , pp. 153-157
    • Vasconcelos, N.1    Lippman, A.2
  • 13
    • 0030648077 scopus 로고    scopus 로고
    • Construction and evaluation of a robust multifeature speech/music discriminator
    • IEEE, Munich, Germany, April
    • E. Scheirer and M. Slaney, "Construction and evaluation of a robust multifeature speech/music discriminator," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, vol. 2, pp. 1331-1334, IEEE, Munich, Germany, April 1997.
    • (1997) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , vol.2 , pp. 1331-1334
    • Scheirer, E.1    Slaney, M.2
  • 14
    • 0034509230 scopus 로고    scopus 로고
    • Towards automatic extraction of expressive elements from motion pictures: Tempo
    • IEEE, New York, NY, USA, July
    • B. Adams, C. Dorai, and S. Venkatesh, "Towards automatic extraction of expressive elements from motion pictures: Tempo," in Proc. IEEE International Conference on Multimedia and Expo, vol. II, pp. 641-645, IEEE, New York, NY, USA, July 2000.
    • (2000) Proc. IEEE International Conference on Multimedia and Expo , vol.2 , pp. 641-645
    • Adams, B.1    Dorai, C.2    Venkatesh, S.3
  • 15
    • 0012583743 scopus 로고    scopus 로고
    • The 10th text retrieval conference (TREC 2001)
    • NIST, Gaithersburg, Md, USA
    • E. M. Voorhees and D. K. Harman, Eds., The 10th Text REtrieval Conference (TREC 2001), vol. 500-250 of NIST Special Publication, NIST, Gaithersburg, Md, USA, 2001.
    • (2001) NIST Special Publication , vol.500 , Issue.250
    • Voorhees, E.M.1    Harman, D.K.2
  • 16
    • 0001839555 scopus 로고
    • Media Streams: An iconic visual language for video annotation
    • M. Davis, "Media Streams: an iconic visual language for video annotation," Telektronikk, vol. 89, no. 4, pp. 59-71, 1993.
    • (1993) Telektronikk , vol.89 , Issue.4 , pp. 59-71
    • Davis, M.1
  • 18
    • 84931854786 scopus 로고    scopus 로고
    • Speaker change detection using joint audio-visual statistics
    • Paris, France, April
    • G. Iyengar and C. Neti, "Speaker change detection using joint audio-visual statistics," in Proc. RIAO, Paris, France, April 2000.
    • (2000) Proc. RIAO
    • Iyengar, G.1    Neti, C.2
  • 24
    • 0012611082 scopus 로고    scopus 로고
    • TREC-6 Ad-hoc retrieval
    • Proc. 6th Text REtrieval Conference (TREC-6), NIST, Gaithersburg, Md, USA
    • M. Franz and S. Roukos, "TREC-6 Ad-hoc retrieval," in Proc. 6th Text REtrieval Conference (TREC-6), vol. 500-240 of NIST Special Publication, pp. 511-516, NIST, Gaithersburg, Md, USA, 1998.
    • (1998) NIST Special Publication , vol.500 , Issue.240 , pp. 511-516
    • Franz, M.1    Roukos, S.2
  • 26
    • 0001319911 scopus 로고
    • Okapi at TREC-3
    • The 3rd Text REtrieval Conference (TREC-3), NIST, Gaithersburg, Md, USA
    • S. E. Robertson, S. Walker, S. Jones, M. M. Hancock-Beaulieu, and M. Gatford, "Okapi at TREC-3," in The 3rd Text REtrieval Conference (TREC-3), vol. 500-225 of NIST Special Publication, pp. 109-126, NIST, Gaithersburg, Md, USA, 1995.
    • (1995) NIST Special Publication , vol.500 , Issue.225 , pp. 109-126
    • Robertson, S.E.1    Walker, S.2    Jones, S.3    Hancock-Beaulieu, M.M.4    Gatford, M.5
  • 27
    • 0012577594 scopus 로고    scopus 로고
    • IBM Almaden Research Center, "The IBM cuevideo project," 1997, www.almaden.ibm.com/projects/cuevideo.shtml.
    • (1997) The IBM Cuevideo Project
  • 28
    • 0012529167 scopus 로고    scopus 로고
    • Integrating features, models, and semantics for TREC video retrieval
    • Proc. 10th Text REtrieval Conference (TREC 2001), NIST, Gaithersburg, Md, USA
    • J. R. Smith, S. Srinivasan, A. Amir, et al., "Integrating features, models, and semantics for TREC video retrieval," in Proc. 10th Text REtrieval Conference (TREC 2001), vol. 500-250 of NIST Special Publication, pp. 240-249, NIST, Gaithersburg, Md, USA, 2001.
    • (2001) NIST Special Publication , vol.500 , Issue.250 , pp. 240-249
    • Smith, J.R.1    Srinivasan, S.2    Amir, A.3
  • 29
    • 0032164964 scopus 로고    scopus 로고
    • Shape-based retrieval: A case study with trademark image databases
    • A. K. Jain and A. Vailaya, "Shape-based retrieval: A case study with trademark image databases," Pattern Recognition, vol. 31, no. 9, pp. 1369-1390, 1998.
    • (1998) Pattern Recognition , vol.31 , Issue.9 , pp. 1369-1390
    • Jain, A.K.1    Vailaya, A.2
  • 31
    • 0017442627 scopus 로고
    • Aircraft identification by moment invariants
    • S. Dudani, K. Breeding, and R. McGhee, "Aircraft identification by moment invariants," IEEE Trans. on Computers, vol. 26, no. 1, pp. 39-45, 1977.
    • (1977) IEEE Trans. on Computers , vol.26 , Issue.1 , pp. 39-45
    • Dudani, S.1    Breeding, K.2    McGhee, R.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.