SCOPUS 정보 검색 플랫폼

Eurasip Journal on Applied Signal Processing

Volumn 2003, Issue 2, 2003, Pages 170-185

Semantic indexing of multimedia content using visual, audio, and text cues

(7) Adams, W H a Iyengar, Giridharan a Lin, Ching Yung a Naphade, Milind Ramesh a Neti, Chalapathy a Nock, Harriet J a Smith, John R a

a IBM T J WATSON RESEARCH CENTER (United States)

Author keywords

GMM; HMM; Multimodal information fusion; Query by keywords; Spoken document retrieval; Statistical modeling of multimedia; SVM; Video event detection; Video indexing and retrieval; Video TREC

Indexed keywords

INFORMATION ANALYSIS; LEARNING SYSTEMS; MARKOV PROCESSES; SEMANTICS; STATISTICAL METHODS;

MULTIMODAL INFORMATION FUSION;

MULTIMEDIA SYSTEMS;

EID: 0037299467 PISSN: 11108657 EISSN: None Source Type: Journal
DOI: 10.1155/S1110865703211173 Document Type: Review

Times cited : (117)

References (33)

1
- 0030389403
- VisualSEEk: A fully automated content-based image query system
- ACM, Boston, Mass, USA, November
- J. R. Smith and S.-F. Chang, "VisualSEEk: a fully automated content-based image query system," in Proc. 4th ACM International Conference on Multimedia, pp. 87-98, ACM, Boston, Mass, USA, November 1996.
- (1996) Proc. 4th ACM International Conference on Multimedia , pp. 87-98
- Smith, J.R.¹ Chang, S.-F.²

2
- 0032318109
- Probabilistic multimedia objects (multijects): A novel approach to video indexing and retrieval in multimedia systems
- IEEE, Chicago, 111, USA, October
- M. Naphade, T. Kristjansson, B. Frey, and T. S. Huang, "Probabilistic multimedia objects (multijects): a novel approach to video indexing and retrieval in multimedia systems," in Proc. IEEE International Conference on Image Processing, vol. 3, pp. 536-540, IEEE, Chicago, 111, USA, October 1998.
- (1998) Proc. IEEE International Conference on Image Processing , vol.3 , pp. 536-540
- Naphade, M.¹ Kristjansson, T.² Frey, B.³ Huang, T.S.⁴

3
- 0032314489
- Semantic visual templates - Linking features to semantics
- IEEE, Chicago, 111, USA, October
- S. F. Chang, W. Chen, and H. Sundaram, "Semantic visual templates - linking features to semantics," in Proc, IEEE International Conference on Image Processing, vol. 3, pp. 531-535, IEEE, Chicago, 111, USA, October 1998.
- (1998) Proc, IEEE International Conference on Image Processing , vol.3 , pp. 531-535
- Chang, S.F.¹ Chen, W.² Sundaram, H.³

4
- 0032666227
- A computational approach to semantic event detection
- IEEE, Fort Collins, Colo, USA, June
- R. Qian, N. Hearing, and I. Sezan, "A computational approach to semantic event detection," in Proc. Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 200-206, IEEE, Fort Collins, Colo, USA, June 1999.
- (1999) Proc. Conference on Computer Vision and Pattern Recognition , vol.1 , pp. 200-206
- Qian, R.¹ Hearing, N.² Sezan, I.³

5
- 0033897763
- An integrated approach to multimodal media content analysis
- Storage and Retrieval for Media Databases 2000, SPIE, San Jose, Calif, USA, January
- T. Zhang and C. Kuo, "An integrated approach to multimodal media content analysis," in Storage and Retrieval for Media Databases 2000, vol. 3972 of SPIE Proceedings, pp. 506-517, SPIE, San Jose, Calif, USA, January 2000.
- (2000) SPIE Proceedings , vol.3972 , pp. 506-517
- Zhang, T.¹ Kuo, C.²

6
- 0003794341
- Ph.D. thesis, MIT Department of Electrical Engineering and Computer Science, Cambridge, Mass, USA
- D. Ellis, Prediction-driven computational auditory scene analysis, Ph.D. thesis, MIT Department of Electrical Engineering and Computer Science, Cambridge, Mass, USA, 1996.
- (1996) Prediction-Driven Computational Auditory Scene Analysis
- Ellis, D.¹

7
- 0034857154
- Learning the semantics of words and pictures
- IEEE, Vancouver, Canada, July
- K. Barnard and D. Forsyth, "Learning the semantics of words and pictures," in Proc. International Conf. on Computer Vision, vol. 2, pp. 408-415, IEEE, Vancouver, Canada, July 2001.
- (2001) Proc. International Conf. on Computer Vision , vol.2 , pp. 408-415
- Barnard, K.¹ Forsyth, D.²

8
- 0012532765
- Reduced-rank spectra and minimum-entropy priors as consistent and reliable cues for generalized sound recognition
- Aalborg, Denmark, September
- M. A. Casey, "Reduced-rank spectra and minimum-entropy priors as consistent and reliable cues for generalized sound recognition," in Proc. Eurospeech, Aalborg, Denmark, September 2001.
- (2001) Proc. Eurospeech
- Casey, M.A.¹

9
- 0030705367
- Hidden Markov model parsing of video programs
- IEEE, Munich, Germany, April
- W. Wolf, "Hidden Markov model parsing of video programs," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, vol. 4, pp. 2609-2611, IEEE, Munich, Germany, April 1997.
- (1997) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , vol.4 , pp. 2609-2611
- Wolf, W.¹

10
- 0032223813
- Models for automatic classification of video sequences
- Storage and Retrieval for Image and Video Databases VI, SPIE, San Jose, Calif, USA, January
- G. Iyengar and A. B. Lippman, "Models for automatic classification of video sequences," in Storage and Retrieval for Image and Video Databases VI, vol. 3312 of SPIE Proceedings, pp. 216-227, SPIE, San Jose, Calif, USA, January 1998.
- (1998) SPIE Proceedings , vol.3312 , pp. 216-227
- Iyengar, G.¹ Lippman, A.B.²

11
- 0033316687
- Probabilistic analysis and extraction of video content
- IEEE, Kobe, Japan, October
- A. M. Ferman and A. M. Tekalp, "Probabilistic analysis and extraction of video content," in Proc. IEEE International Conference on Image Processing, vol. 2, pp. 91-95, IEEE, Kobe, Japan, October 1999.
- (1999) Proc. IEEE International Conference on Image Processing , vol.2 , pp. 91-95
- Ferman, A.M.¹ Tekalp, A.M.²

12
- 0032311673
- Bayesian modeling of video editing and structure: Semantic features for video summarization and browsing
- IEEE, Chicago, Ill, USA
- N. Vasconcelos and A. Lippman, "Bayesian modeling of video editing and structure: semantic features for video summarization and browsing," in Proc. IEEE International Conference on Image Processing, vol. 3, pp. 153-157, IEEE, Chicago, Ill, USA, 1998.
- (1998) Proc. IEEE International Conference on Image Processing , vol.3 , pp. 153-157
- Vasconcelos, N.¹ Lippman, A.²

13
- 0030648077
- Construction and evaluation of a robust multifeature speech/music discriminator
- IEEE, Munich, Germany, April
- E. Scheirer and M. Slaney, "Construction and evaluation of a robust multifeature speech/music discriminator," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, vol. 2, pp. 1331-1334, IEEE, Munich, Germany, April 1997.
- (1997) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , vol.2 , pp. 1331-1334
- Scheirer, E.¹ Slaney, M.²

14
- 0034509230
- Towards automatic extraction of expressive elements from motion pictures: Tempo
- IEEE, New York, NY, USA, July
- B. Adams, C. Dorai, and S. Venkatesh, "Towards automatic extraction of expressive elements from motion pictures: Tempo," in Proc. IEEE International Conference on Multimedia and Expo, vol. II, pp. 641-645, IEEE, New York, NY, USA, July 2000.
- (2000) Proc. IEEE International Conference on Multimedia and Expo , vol.2 , pp. 641-645
- Adams, B.¹ Dorai, C.² Venkatesh, S.³

15
- 0012583743
- The 10th text retrieval conference (TREC 2001)
- NIST, Gaithersburg, Md, USA
- E. M. Voorhees and D. K. Harman, Eds., The 10th Text REtrieval Conference (TREC 2001), vol. 500-250 of NIST Special Publication, NIST, Gaithersburg, Md, USA, 2001.
- (2001) NIST Special Publication , vol.500 , Issue.250
- Voorhees, E.M.¹ Harman, D.K.²

16
- 0001839555
- Media Streams: An iconic visual language for video annotation
- M. Davis, "Media Streams: an iconic visual language for video annotation," Telektronikk, vol. 89, no. 4, pp. 59-71, 1993.
- (1993) Telektronikk , vol.89 , Issue.4 , pp. 59-71
- Davis, M.¹

17
- 0003322357
- Audio-visual speech recognition
- The Johns Hopkins University, Baltimore, Md, USA, October
- C. Neti, G. Potamianos, J. Luettin, et al., "Audio-visual speech recognition," Final workshop 2000 report, Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, Md, USA, October 2000.
- (2000) Final Workshop 2000 Report, Center for Language and Speech Processing
- Neti, C.¹ Potamianos, G.² Luettin, J.³

18
- 84931854786
- Speaker change detection using joint audio-visual statistics
- Paris, France, April
- G. Iyengar and C. Neti, "Speaker change detection using joint audio-visual statistics," in Proc. RIAO, Paris, France, April 2000.
- (2000) Proc. RIAO
- Iyengar, G.¹ Neti, C.²

19
- 0003487601
- Oxford University Press, Oxford, UK
- C. M. Bishop, Neural Networks for Pattern Recognition, Oxford University Press, Oxford, UK, 1995.
- (1995) Neural Networks for Pattern Recognition
- Bishop, C.M.¹

20
- 0004244302
- Prentice-Hall, Englewood Cliffs, NJ, USA, 1st edition
- L. R. Rabiner and B.-H. Juang, Fundamentals of Speech Recognition, Prentice-Hall, Englewood Cliffs, NJ, USA, 1st edition, 1993.
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.R.¹ Juang, B.-H.²

21
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," Journal of Royal Statistical Society, Series B, vol. 39, no. 1, pp. 1-38, 1977.
- (1977) Journal of Royal Statistical Society, Series B , vol.39 , Issue.1 , pp. 1-38
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

22
- 0003450542
- Springer-Verlag, New York, NY, USA
- V. Vapnik, The Nature of Statistical Learning Theory, Springer-Verlag, New York, NY, USA, 1995.
- (1995) The Nature of Statistical Learning Theory
- Vapnik, V.¹

23
- 17344389852
- Robust speech recognition in noisy environments: The IBM SPINE-2 evaluation system
- Orlando, Fla, USA, May
- B. Kingsbury, G. Saon, L. Mangu, M. Padmanabhan, and R. Sarikaya, "Robust speech recognition in noisy environments: The IBM SPINE-2 evaluation system," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, Orlando, Fla, USA, May 2002.
- (2002) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing
- Kingsbury, B.¹ Saon, G.² Mangu, L.³ Padmanabhan, M.⁴ Sarikaya, R.⁵

24
- 0012611082
- TREC-6 Ad-hoc retrieval
- Proc. 6th Text REtrieval Conference (TREC-6), NIST, Gaithersburg, Md, USA
- M. Franz and S. Roukos, "TREC-6 Ad-hoc retrieval," in Proc. 6th Text REtrieval Conference (TREC-6), vol. 500-240 of NIST Special Publication, pp. 511-516, NIST, Gaithersburg, Md, USA, 1998.
- (1998) NIST Special Publication , vol.500 , Issue.240 , pp. 511-516
- Franz, M.¹ Roukos, S.²

25
- 0004289791
- MIT Press, Cambridge, Mass, USA
- C. Fellbaum, Ed., WordNet: An Electronic Lexical Database, MIT Press, Cambridge, Mass, USA, 1998.
- (1998) WordNet: An Electronic Lexical Database
- Fellbaum, C.¹

26
- 0001319911
- Okapi at TREC-3
- The 3rd Text REtrieval Conference (TREC-3), NIST, Gaithersburg, Md, USA
- S. E. Robertson, S. Walker, S. Jones, M. M. Hancock-Beaulieu, and M. Gatford, "Okapi at TREC-3," in The 3rd Text REtrieval Conference (TREC-3), vol. 500-225 of NIST Special Publication, pp. 109-126, NIST, Gaithersburg, Md, USA, 1995.
- (1995) NIST Special Publication , vol.500 , Issue.225 , pp. 109-126
- Robertson, S.E.¹ Walker, S.² Jones, S.³ Hancock-Beaulieu, M.M.⁴ Gatford, M.⁵

27
- 0012577594
- IBM Almaden Research Center, "The IBM cuevideo project," 1997, www.almaden.ibm.com/projects/cuevideo.shtml.
- (1997) The IBM Cuevideo Project

28
- 0012529167
- Integrating features, models, and semantics for TREC video retrieval
- Proc. 10th Text REtrieval Conference (TREC 2001), NIST, Gaithersburg, Md, USA
- J. R. Smith, S. Srinivasan, A. Amir, et al., "Integrating features, models, and semantics for TREC video retrieval," in Proc. 10th Text REtrieval Conference (TREC 2001), vol. 500-250 of NIST Special Publication, pp. 240-249, NIST, Gaithersburg, Md, USA, 2001.
- (2001) NIST Special Publication , vol.500 , Issue.250 , pp. 240-249
- Smith, J.R.¹ Srinivasan, S.² Amir, A.³

29
- 0032164964
- Shape-based retrieval: A case study with trademark image databases
- A. K. Jain and A. Vailaya, "Shape-based retrieval: A case study with trademark image databases," Pattern Recognition, vol. 31, no. 9, pp. 1369-1390, 1998.
- (1998) Pattern Recognition , vol.31 , Issue.9 , pp. 1369-1390
- Jain, A.K.¹ Vailaya, A.²

30
- 0032598856
- Query by video clip
- A. K. Jain, A. Vailaya, and W. Xiong, "Query by video clip," Multimedia Systems: Special Issue on Video Libraries, vol. 7, no, 5, pp. 369-384, 1999.
- (1999) Multimedia Systems: Special Issue on Video Libraries , vol.7 , Issue.5 , pp. 369-384
- Jain, A.K.¹ Vailaya, A.² Xiong, W.³

31
- 0017442627
- Aircraft identification by moment invariants
- S. Dudani, K. Breeding, and R. McGhee, "Aircraft identification by moment invariants," IEEE Trans. on Computers, vol. 26, no. 1, pp. 39-45, 1977.
- (1977) IEEE Trans. on Computers , vol.26 , Issue.1 , pp. 39-45
- Dudani, S.¹ Breeding, K.² McGhee, R.³

32
- 0030706661
- Transcription of broadcast news - System robustness issues and adaptation techniques
- IEEE, Munich, Germany, April
- R. Bakis, S. Sehen, P. Gopalakrishnan, R. Gopinath, S. Maes, and L. Polymenakos, "Transcription of broadcast news - system robustness issues and adaptation techniques," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, vol. 2, pp. 711-714, IEEE, Munich, Germany, April 1997.
- (1997) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , vol.2 , pp. 711-714
- Bakis, R.¹ Sehen, S.² Gopalakrishnan, P.³ Gopinath, R.⁴ Maes, S.⁵ Polymenakos, L.⁶

33
- 0003694781
- K. Murphy, "Bayes net toolbox for matlab," 2001, http://www.ai.mit.edu/̃murphyk/Software/BNT/bnt.html.
- (2001) Bayes Net Toolbox for Matlab
- Murphy, K.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.