SCOPUS Information Retrieval Platform

Multimedia Tools and Applications

Volume 25, Issue 1, 2005, Pages 5-35

Multimodal video indexing: A review of the state-of-the-art

(2) Snoek, Cees G M (a); Worring, Marcel (a)

(a) UNIVERSITY OF AMSTERDAM (Netherlands)

Author keywords

Analysis framework; Multimodal integration; Multimodal video indexing; Review; Video segmentation

Indexed keywords

ALGORITHMS; INTEGRATION; MULTIMEDIA SYSTEMS; PROJECT MANAGEMENT; STATISTICAL METHODS; TEXT PROCESSING;

ANALYSIS FRAMEWORK; MULTIMODAL INTEGRATION; MULTIMODAL VIDEO INDEXING; REVIEW; VIDEO SEGMENTATION;

INDEXING (OF INFORMATION);

EID: 10044236762 PISSN: 13807501 EISSN: None Source Type: Journal
DOI: 10.1023/B:MTAP.0000046380.27575.a5 Document Type: Review

Times cited : (351)

References (103)

1
- 0002121497
- Part-of-speech tagging and partial parsing
- S. Young and G. Bloothooft (Eds.), Kluwer Academic Publishers, Dordrecht
- S. Abney, "Part-of-speech tagging and partial parsing," in Corpus-Based Methods in Language and Speech Processing, S. Young and G. Bloothooft (Eds.), Kluwer Academic Publishers, Dordrecht, 1997, pp. 118-136.
- (1997) Corpus-based Methods in Language and Speech Processing , pp. 118-136
- Abney, S.¹

2
- 0000718946
- The advanced video information system: Data structures and query processing
- S. Adali, K.S. Candan, S.S. Chen, K. Erol, and V.S. Subrahmanian, "The advanced video information system: Data structures and query processing," Multimedia Systems, Vol. 4, No. 4, pp. 172-186, 1996.
- (1996) Multimedia Systems , vol.4 , Issue.4 , pp. 172-186
- Adali, S.¹ Candan, K.S.² Chen, S.S.³ Erol, K.⁴ Subrahmanian, V.S.⁵

3
- 0035368101
- Multi-modal dialogue scene detection using hidden markov models for content-based multimedia indexing
- A.A. Alatan, A.N. Akansu, and W. Wolf, "Multi-modal dialogue scene detection using hidden markov models for content-based multimedia indexing," Multimedia Tools and Applications, Vol. 14, No. 2, pp. 137-151, 2001.
- (2001) Multimedia Tools and Applications , vol.14 , Issue.2 , pp. 137-151
- Alatan, A.A.¹ Akansu, A.N.² Wolf, W.³

4
- 0031611061
- Region-based parametric motion segmentation using color information
- Y. Altunbasak, P.E. Eren, and A.M. Tekalp, "Region-based parametric motion segmentation using color information," Graphical Models and Image Processing, Vol. 60, No. 1, pp. 13-23, 1998.
- (1998) Graphical Models and Image Processing , vol.60 , Issue.1 , pp. 13-23
- Altunbasak, Y.¹ Eren, P.E.² Tekalp, A.M.³

5
- 0036502392
- Event based indexing of broadcasted sports video by intermodal collaboration
- N. Babaguchi, Y. Kawai, and T. Kitahashi, "Event based indexing of broadcasted sports video by intermodal collaboration," IEEE Transactions on Multimedia, Vol. 4, No. 1, pp. 68-75, 2002.
- (2002) IEEE Transactions on Multimedia , vol.4 , Issue.1 , pp. 68-75
- Babaguchi, N.¹ Kawai, Y.² Kitahashi, T.³

6
- 0031185845
- Eigenfaces vs. fisherfaces: Recognition using class specific linear projection
- P.N. Belhumeur, J.P. Hespanha, and D.J. Kriegman, "Eigenfaces vs. fisherfaces: Recognition using class specific linear projection," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 19, No. 7, pp. 711-720, 1997.
- (1997) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.19 , Issue.7 , pp. 711-720
- Belhumeur, P.N.¹ Hespanha, J.P.² Kriegman, D.J.³

7
- 0035309512
- Content-based indexing and retrieval of TV news
- M. Bertini, A. Del Bimbo, and P. Pala, "Content-based indexing and retrieval of TV news," Pattern Recognition Letters, Vol. 22, No. 5, pp. 503-516, 2001.
- (2001) Pattern Recognition Letters , vol.22 , Issue.5 , pp. 503-516
- Bertini, M.¹ Del Bimbo, A.² Pala, P.³

8
- 0032632354
- An algorithm that learns what's in a name
- D. Bikel, R. Schwartz, and R.M. Weischedel, "An algorithm that learns what's in a name," Machine Learning, Vol. 34, Nos. 1-3, pp. 211-231, 1999.
- (1999) Machine Learning , vol.34 , Issue.1-3 , pp. 211-231
- Bikel, D.¹ Schwartz, R.² Weischedel, R.M.³

9
- 0013356412
- Mayfield Publishing Company: Mountain View, USA
- J.M. Boggs and D.W. Petrie, The Art of Watching Films, 5th edition, Mayfield Publishing Company: Mountain View, USA, 2000.
- (2000) The Art of Watching Films, 5th Edition
- Boggs, J.M.¹ Petrie, D.W.²

10
- 0032028721
- Video query: Research directions
- R.M. Bolle, B.-L. Yeo, and M.M. Yeung, "Video query: Research directions," IBM Journal of Research and Development, Vol. 42, No. 2, pp. 233-252, 1998.
- (1998) IBM Journal of Research and Development , vol.42 , Issue.2 , pp. 233-252
- Bolle, R.M.¹ Yeo, B.-L.² Yeung, M.M.³

11
- 84908265994
- Event recognition in sport programs using low-level motion indices
- Tokyo, Japan
- A. Bonzanini, R. Leonardi, and P. Migliorati, "Event recognition in sport programs using low-level motion indices," in IEEE International Conference on Multimedia & Expo, Tokyo, Japan, 2001, pp. 1208-1211.
- (2001) IEEE International Conference on Multimedia & Expo , pp. 1208-1211
- Bonzanini, A.¹ Leonardi, R.² Migliorati, P.³

12
- 0029451866
- Automatic content-based retrieval of broadcast news
- San Francisco, USA
- M. Brown, J. Foote, G. Jones, K. Sparck-Jones, and S. Young, "Automatic content-based retrieval of broadcast news," in ACM Multimedia 1995, San Francisco, USA, 1995.
- (1995) ACM Multimedia 1995
- Brown, M.¹ Foote, J.² Jones, G.³ Sparck-Jones, K.⁴ Young, S.⁵

13
- 0032657349
- A survey on the automatic indexing of video data
- R. Brunelli, O. Mich, and C.M. Modena, "A survey on the automatic indexing of video data," Journal of Visual Communication and Image Representation, Vol. 10, No. 2, pp. 78-112, 1999.
- (1999) Journal of Visual Communication and Image Representation , vol.10 , Issue.2 , pp. 78-112
- Brunelli, R.¹ Mich, O.² Modena, C.M.³

14
- 85019650666
- Combining textual and visual cues for content-based image retrieval on the world wide web
- M. La Cascia, S. Sethi, and S. Sclaroff, "Combining textual and visual cues for content-based image retrieval on the world wide web," in IEEE Workshop on Content-Based Access of Image and Video Libraries, 1998.
- (1998) IEEE Workshop on Content-Based Access of Image and Video Libraries
- Cascia, M.L.¹ Sethi, S.² Sclaroff, S.³

15
- 0033892811
- Interactive maps for a digital video library
- M. Christel, A. Olligschlaeger, and C. Huang, "Interactive maps for a digital video library," IEEE Multimedia, Vol. 7, No. 1, pp. 60-67, 2000.
- (2000) IEEE Multimedia , vol.7 , Issue.1 , pp. 60-67
- Christel, M.¹ Olligschlaeger, A.² Huang, C.³

16
- 0032595005
- Semantics in visual information retrieval
- C. Colombo, A. Del Bimbo, and P. Pala, "Semantics in visual information retrieval," IEEE Multimedia, Vol. 6, No. 3, pp. 38-53, 1999.
- (1999) IEEE Multimedia , vol.6 , Issue.3 , pp. 38-53
- Colombo, C.¹ Del Bimbo, A.² Pala, P.³

17
- 10044262950
- Convera. http://www.convera.com.

18
- 0026187121
- Cinematic principles for multimedia
- G. Davenport, T. Aguierre Smith, and N. Pincever, "Cinematic principles for multimedia," in IEEE Computer Graphics & Applications, Vol. 11, No. 4, pp. 67-74, 1991.
- (1991) IEEE Computer Graphics & Applications , vol.11 , Issue.4 , pp. 67-74
- Davenport, G.¹ Smith, T.A.² Pincever, N.³

19
- 84989525001
- Indexing by latent semantic analysis
- S. Deerwester, S.T. Dumais, G.W. Furnas, T.K. Landauer, and R. Harshman, "Indexing by latent semantic analysis," Journal of the American Society for Information Science, Vol. 41, No. 6, pp. 391-407, 1990.
- (1990) Journal of the American Society for Information Science , vol.41 , Issue.6 , pp. 391-407
- Deerwester, S.¹ Dumais, S.T.² Furnas, G.W.³ Landauer, T.K.⁴ Harshman, R.⁵

20
- 84937046785
- Video classification based on HMM using text and faces
- Tampere, Finland
- N. Dimitrova, L. Agnihotri, and G. Wei, "Video classification based on HMM using text and faces," in European Signal Processing Conference, Tampere, Finland, 2000.
- (2000) European Signal Processing Conference
- Dimitrova, N.¹ Agnihotri, L.² Wei, G.³

21
- 0032629746
- Content-based video indexing of TV broadcast news using hidden markov models
- Phoenix, USA
- S. Eickeler and S. Müller, "Content-based video indexing of TV broadcast news using hidden markov models," in IEEE International Conference on Acoustics, Speech, and Signal Processing, Phoenix, USA, 1999, pp. 2997-3000.
- (1999) IEEE International Conference on Acoustics, Speech, and Signal Processing , pp. 2997-3000
- Eickeler, S.¹ Müller, S.²

22
- 0033705976
- Speech/music discrimination for multimedia applications
- Istanbul, Turkey
- K. El-Maleh, M. Klein, G. Petrucci, and P. Kabal, "Speech/music discrimination for multimedia applications," in IEEE International Conference on Acoustics, Speech, and Signal Processing, Istanbul, Turkey, 2000, pp. 2445-2448.
- (2000) IEEE International Conference on Acoustics, Speech, and Signal Processing , pp. 2445-2448
- El-Maleh, K.¹ Klein, M.² Petrucci, G.³ Kabal, P.⁴

23
- 0029458263
- Automatic recognition of film genres
- San Francisco, USA
- S. Fischer, R. Lienhart, and W. Effelsberg, "Automatic recognition of film genres," in ACM Multimedia 1995, San Francisco, USA, 1995, pp. 295-304.
- (1995) ACM Multimedia 1995 , pp. 295-304
- Fischer, S.¹ Lienhart, R.² Effelsberg, W.³

24
- 0001176213
- Finding naked people
- Cambridge, UK
- M.M. Fleck, D.A. Forsyth, and C. Bregler, "Finding naked people," in European Conference on Computer Vision, Cambridge, UK, 1996, Vol. 2, pp. 593-602.
- (1996) European Conference on Computer Vision , vol.2 , pp. 593-602
- Fleck, M.M.¹ Forsyth, D.A.² Bregler, C.³

25
- 0003799897
- Kluwer Academic Publishers: Norwell, USA
- B. Furht, S.W. Smoliar, and H.J. Zhang, Video and Image Processing in Multimedia Systems, 2nd edition, Kluwer Academic Publishers: Norwell, USA, 1996.
- (1996) Video and Image Processing in Multimedia Systems, 2nd Edition
- Furht, B.¹ Smoliar, S.W.² Zhang, H.J.³

26
- 0029456574
- Query by humming - Musical information retrieval in an audio database
- San Francisco, USA
- A. Ghias, J. Logan, D. Chamberlin, and B.C. Smith, "Query by humming - musical information retrieval in an audio database," in ACM Multimedia 1995, San Francisco, USA, 1995.
- (1995) ACM Multimedia 1995
- Ghias, A.¹ Logan, J.² Chamberlin, D.³ Smith, B.C.⁴

27
- 0029226035
- Automatic parsing of TV soccer programs
- Y. Gong, L.T. Sin, and C.H. Chuan, "Automatic parsing of TV soccer programs," in IEEE International Conference on Multimedia Computing and Systems, 1995, pp. 167-174.
- (1995) IEEE International Conference on Multimedia Computing and Systems , pp. 167-174
- Gong, Y.¹ Sin, L.T.² Chuan, C.H.³

28
- 0030387565
- Video indexing through integration of syntactic and semantic features
- Sarasota, USA
- B. Günsel, A.M. Ferman, and A.M. Tekalp, "Video indexing through integration of syntactic and semantic features," in Third IEEE Workshop on Applications of Computer Vision, Sarasota, USA, 1996.
- (1996) Third IEEE Workshop on Applications of Computer Vision
- Günsel, B.¹ Ferman, A.M.² Tekalp, A.M.³

29
- 0034269926
- A semantic event-detection approach and its application to detecting hunts in wildlife video
- N. Haering, R. Qian, and I. Sezan, "A semantic event-detection approach and its application to detecting hunts in wildlife video," IEEE Transactions on Circuits and Systems for Video Technology, Vol. 10, No. 6, pp. 857-868, 2000.
- (2000) IEEE Transactions on Circuits and Systems for Video Technology , vol.10 , Issue.6 , pp. 857-868
- Haering, N.¹ Qian, R.² Sezan, I.³

30
- 85031624134
- Feature based digital video indexing
- Lausanne, Switzerland
- A. Hampapur, R. Jain, and T. Weymouth, "Feature based digital video indexing," in IFIP 2.6 Third Working Conference on Visual Database Systems, Lausanne, Switzerland, 1995.
- (1995) IFIP 2.6 Third Working Conference on Visual Database Systems
- Hampapur, A.¹ Jain, R.² Weymouth, T.³

31
- 0035054149
- Dancers: Delft advanced news retrieval system
- San Jose, USA
- A. Hanjalic, G. Kakes, R.L. Lagendijk, and J. Biemond, "Dancers: Delft advanced news retrieval system," in IS&T/SPIE Electronic Imaging 2001: Storage and Retrieval for Media Databases 2001, San Jose, USA, 2001.
- (2001) IS&T/SPIE Electronic Imaging 2001: Storage and Retrieval for Media Databases 2001
- Hanjalic, A.¹ Kakes, G.² Lagendijk, R.L.³ Biemond, J.⁴

32
- 0004073653
- Amsterdam, The Netherlands
- A. Hanjalic, G.C. Langelaar, P.M.B. van Roosmalen, J. Biemond, and R.L. Lagendijk, Image and Video Databases: Restoration, Watermarking and Retrieval, Elsevier Science: Amsterdam, The Netherlands, 2000.
- (2000) Image and Video Databases: Restoration, Watermarking and Retrieval, Elsevier Science
- Hanjalic, A.¹ Langelaar, G.C.² Van Roosmalen, P.M.B.³ Biemond, J.⁴ Lagendijk, R.L.⁵

33
- 10044239072
- Topic labeling of multilingual broadcast news in the informedia digital video library
- Berkeley, USA
- A.G. Hauptmann, D. Lee, and P.E. Kennedy, "Topic labeling of multilingual broadcast news in the informedia digital video library," in ACM DL/SIGIR MIDAS Workshop, Berkeley, USA, 1999.
- (1999) ACM DL/SIGIR MIDAS Workshop
- Hauptmann, A.G.¹ Lee, D.² Kennedy, P.E.³

34
- 0031673913
- Story segmentation and detection of commercials in broadcast news video
- Santa Barbara, USA
- A.G. Hauptmann and M.J. Witbrock, "Story segmentation and detection of commercials in broadcast news video," in ADL-98 Advances in Digital Libraries, Santa Barbara, USA, 1998, pp. 168-179.
- (1998) ADL-98 Advances in Digital Libraries , pp. 168-179
- Hauptmann, A.G.¹ Witbrock, M.J.²

35
- 0003342953
- Integration of multimodal features for video scene classification based on HMM
- Copenhagen, Denmark
- J. Huang, Z. Liu, Y. Wang, Y. Chen, and E.K. Wong, "Integration of multimodal features for video scene classification based on HMM," in IEEE Workshop on Multimedia Signal Processing, Copenhagen, Denmark, 1999.
- (1999) IEEE Workshop on Multimedia Signal Processing
- Huang, J.¹ Liu, Z.² Wang, Y.³ Chen, Y.⁴ Wong, E.K.⁵

36
- 34247627935
- Automatic video indexing based on shot classification
- of Lecture Notes in Computer Science, Springer-Verlag: Osaka, Japan
- I. Ide, K. Yamamoto, and H. Tanaka, "Automatic video indexing based on shot classification," in First International Conference on Advanced Multimedia Content Processing, Vol. 1554 of Lecture Notes in Computer Science, Springer-Verlag: Osaka, Japan, 1999.
- (1999) First International Conference on Advanced Multimedia Content Processing , vol.1554
- Ide, I.¹ Yamamoto, K.² Tanaka, H.³

37
- 0033640646
- Statistical pattern recognition: A review
- A.K. Jain, R.P.W. Duin, and J. Mao, "Statistical pattern recognition: A review," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 22, No. 1, pp. 4-37, 2000.
- (2000) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.22 , Issue.1 , pp. 4-37
- Jain, A.K.¹ Duin, R.P.W.² Mao, J.³

38
- 84976799884
- Metadata in video databases
- R. Jain and A. Hampapur, "Metadata in video databases," ACM SIGMOD, Vol. 23, No. 4, pp. 27-33, 1994.
- (1994) ACM SIGMOD , vol.23 , Issue.4 , pp. 27-33
- Jain, R.¹ Hampapur, A.²

39
- 0033327198
- Learning to recognize speech by watching television
- P.J. Jang and A.G. Hauptmann, "Learning to recognize speech by watching television," IEEE Intelligent Systems, Vol. 14, No. 5, pp. 51-58, 1999.
- (1999) IEEE Intelligent Systems , vol.14 , Issue.5 , pp. 51-58
- Jang, P.J.¹ Hauptmann, A.G.²

40
- 0035164232
- Integrated multimedia processing for topic segmentation and classification
- Thessaloniki, Greece
- R.S. Jasinschi, N. Dimitrova, T. McGee, L. Agnihotri, J. Zimmerman, and D. Li, "Integrated multimedia processing for topic segmentation and classification," in IEEE International Conference on Image Processing, Thessaloniki, Greece, 2001, pp. 366-369.
- (2001) IEEE International Conference on Image Processing , pp. 366-369
- Jasinschi, R.S.¹ Dimitrova, N.² McGee, T.³ Agnihotri, L.⁴ Zimmerman, J.⁵ Li, D.⁶

41
- 0034850017
- A framework for segmentation of talk & game shows
- Vancouver, Canada
- O. Javed, Z. Rasheed, and M. Shah, "A framework for segmentation of talk & game shows," in IEEE International Conference on Computer Vision, Vancouver, Canada, 2001.
- (2001) IEEE International Conference on Computer Vision
- Javed, O.¹ Rasheed, Z.² Shah, M.³

42
- 0033898334
- Identification of sports videos using replay, text, and camera motion features
- V. Kobla, D. DeMenthon, and D. Doermann, "Identification of sports videos using replay, text, and camera motion features," in SPIE Conference on Storage and Retrieval for Media Databases, Vol. 3972, pp. 332-343, 2000.
- (2000) SPIE Conference on Storage and Retrieval for Media Databases , vol.3972 , pp. 332-343
- Kobla, V.¹ Dementhon, D.² Doermann, D.³

43
- 0035308233
- Classification of general audio data for content-based retrieval
- D. Li, I.K. Sethi, N. Dimitrova, and T. McGee, "Classification of general audio data for content-based retrieval," Pattern Recognition Letters, Vol. 22, No. 5, pp. 533-544, 2001.
- (2001) Pattern Recognition Letters , vol.22 , Issue.5 , pp. 533-544
- Li, D.¹ Sethi, I.K.² Dimitrova, N.³ McGee, T.⁴

44
- 0033909041
- Automatic text detection and tracking in digital video
- H. Li, D. Doermann, and O. Kia, "Automatic text detection and tracking in digital video," IEEE Transactions on Image Processing, Vol. 9, No. 1, pp. 147-156, 2000.
- (2000) IEEE Transactions on Image Processing , vol.9 , Issue.1 , pp. 147-156
- Li, H.¹ Doermann, D.² Kia, O.³

45
- 0030711994
- On the detection and recognition of television commercials
- Ottawa, Canada
- R. Lienhart, C. Kuhmünch, and W. Effelsberg, "On the detection and recognition of television commercials," in IEEE Conference on Multimedia Computing and Systems, Ottawa, Canada, 1997, pp. 509-516.
- (1997) IEEE Conference on Multimedia Computing and Systems , pp. 509-516
- Lienhart, R.¹ Kuhmünch, C.² Effelsberg, W.³

46
- 0003612818
- The MIT Press, Cambridge, USA
- C.D. Manning and H. Schütze, Foundations of Statistical Natural Language Processing. The MIT Press, Cambridge, USA, 1999.
- (1999) Foundations of Statistical Natural Language Processing.
- Manning, C.D.¹ Schütze, H.²

47
- 0032115209
- Video handling with music and speech detection
- K. Minami, A. Akutsu, H. Hamada, and Y. Tonomura, "Video handling with music and speech detection," IEEE Multimedia, Vol. 5, No. 3, pp. 17-25, 1998.
- (1998) IEEE Multimedia , vol.5 , Issue.3 , pp. 17-25
- Minami, K.¹ Akutsu, A.² Hamada, H.³ Tonomura, Y.⁴

48
- 84905368120
- Video annotation for content-based retrieval using human behavior analysis and domain knowledge
- Grenoble, France
- H. Miyamori and S. Iisaku, "Video annotation for content-based retrieval using human behavior analysis and domain knowledge," in IEEE International Conference on Automatic Face and Gesture Recognition, Grenoble, France, 2000, pp. 26-30.
- (2000) IEEE International Conference on Automatic Face and Gesture Recognition , pp. 26-30
- Miyamori, H.¹ Iisaku, S.²

49
- 0035305653
- Example-based object detection in images by components
- A. Mohan, C. Papageorgiou, and T. Poggio, "Example-based object detection in images by components," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 23, No. 4, pp. 349-361, 2001.
- (2001) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.23 , Issue.4 , pp. 349-361
- Mohan, A.¹ Papageorgiou, C.² Poggio, T.³

50
- 84908302157
- Detecting indexical signs in film audio for scene interpretation
- Tokyo, Japan
- S. Moncrieff, C. Dorai, and S. Venkatesh, "Detecting indexical signs in film audio for scene interpretation," in IEEE International Conference on Multimedia & Expo, Tokyo, Japan, 2001, pp. 1192-1195.
- (2001) IEEE International Conference on Multimedia & Expo , pp. 1192-1195
- Moncrieff, S.¹ Dorai, C.² Venkatesh, S.³

51
- 0032595006
- Everything you always wanted to know about MPEG-7: Part 1
- F. Nack and A.T. Lindsay, "Everything you always wanted to know about MPEG-7: Part 1," IEEE Multimedia, Vol. 6, No. 3, pp. 65-77, 1999.
- (1999) IEEE Multimedia , vol.6 , Issue.3 , pp. 65-77
- Nack, F.¹ Lindsay, A.T.²

52
- 0033203799
- Everything you always wanted to know about MPEG-7: Part 2
- F. Nack and A.T. Lindsay, "Everything you always wanted to know about MPEG-7: Part 2," IEEE Multimedia, Vol. 6, No. 4, pp. 64-73, 1999.
- (1999) IEEE Multimedia , vol.6 , Issue.4 , pp. 64-73
- Nack, F.¹ Lindsay, A.T.²

53
- 0032312615
- Audio-visual content-based violent scene characterization
- Chicago, USA
- J. Nam, M. Alghoniemy, and A.H. Tewfik, "Audio-visual content-based violent scene characterization," in IEEE International Conference on Image Processing, Chicago, USA, 1998, Vol. 1, pp. 353-357.
- (1998) IEEE International Conference on Image Processing , vol.1 , pp. 353-357
- Nam, J.¹ Alghoniemy, M.² Tewfik, A.H.³

54
- 0031374433
- Speaker identification and video analysis for hierarchical video shot classification
- Washington DC, USA
- J. Nam, A. Enis Cetin, and A.H. Tewfik, "Speaker identification and video analysis for hierarchical video shot classification," in IEEE International Conference on Image Processing, Washington DC, USA, 1997, Vol. 2.
- (1997) IEEE International Conference on Image Processing , vol.2
- Nam, J.¹ Cetin, A.E.² Tewfik, A.H.³

55
- 0035281949
- A probabilistic framework for semantic video indexing, filtering, and retrieval
- M.R. Naphade and T.S. Huang, "A probabilistic framework for semantic video indexing, filtering, and retrieval," IEEE Transactions on Multimedia, Vol. 3, No. 1, pp. 141-151, 2001.
- (2001) IEEE Transactions on Multimedia , vol.3 , Issue.1 , pp. 141-151
- Naphade, M.R.¹ Huang, T.S.²

56
- 0033896326
- Detection of moving objects in video using a robust motion similarity measure
- H.T. Nguyen, M. Worring, and A. Dev, "Detection of moving objects in video using a robust motion similarity measure," IEEE Transactions on Image Processing, Vol. 9, No. 1, pp. 137-141, 2000.
- (2000) IEEE Transactions on Image Processing , vol.9 , Issue.1 , pp. 137-141
- Nguyen, H.T.¹ Worring, M.² Dev, A.³

57
- 0027887659
- A design space for multimodal systems: Concurrent processing and data fusion
- Amsterdam, the Netherlands
- L. Nigay and J. Coutaz, "A design space for multimodal systems: concurrent processing and data fusion," in INTERCHI'93 Proceedings, Amsterdam, the Netherlands, 1993, pp. 172-178.
- (1993) INTERCHI'93 Proceedings , pp. 172-178
- Nigay, L.¹ Coutaz, J.²

58
- 0031344674
- The state of the art in text filtering
- D.W. Oard, "The state of the art in text filtering," User Modeling and User-Adapted Interaction, Vol. 7, No. 3, pp. 141-178, 1997.
- (1997) User Modeling and User-Adapted Interaction , vol.7 , Issue.3 , pp. 141-178
- Oard, D.W.¹

59
- 0034842539
- Detection of slow-motion replay segments in sports video for highlights generation
- H. Pan, P. Van Beek, and M.I. Sezan, "Detection of slow-motion replay segments in sports video for highlights generation," in IEEE International Conference on Acoustics, Speech, and Signal Processing, 2001.
- (2001) IEEE International Conference on Acoustics, Speech, and Signal Processing
- Pan, H.¹ Van Beek, P.² Sezan, M.I.³

60
- 0029778685
- Audio characterization for video indexing
- San Jose, USA
- N.V. Patel and I.K. Sethi, "Audio characterization for video indexing," in Proceedings SPIE on Storage and Retrieval for Still Image and Video Databases, San Jose, USA, 1996, Vol. 2670, pp. 373-384.
- (1996) Proceedings SPIE on Storage and Retrieval for Still Image and Video Databases , vol.2670 , pp. 373-384
- Patel, N.V.¹ Sethi, I.K.²

61
- 77954021581
- Video classification using speaker identification
- San Jose, USA
- N.V. Patel and I.K. Sethi, "Video classification using speaker identification," in IS&T SPIE, Proceedings: Storage and Retrieval for Image and Video Databases IV, San Jose, USA, 1997.
- (1997) IS&T SPIE, Proceedings: Storage and Retrieval for Image and Video Databases IV
- Patel, N.V.¹ Sethi, I.K.²

62
- 0003391330
- Morgan Kaufmann: San Mateo, USA
- J. Pearl, Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann: San Mateo, USA, 1988.
- (1988) Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference
- Pearl, J.¹

63
- 0034517157
- Low-level motion activity features for semantic characterization of video
- New York City, USA
- A.K. Peker, A.A. Alatan, and A.N. Akansu, "Low-level motion activity features for semantic characterization of video," in IEEE International Conference on Multimedia & Expo, New York City, USA, 2000.
- (2000) IEEE International Conference on Multimedia & Expo
- Peker, A.K.¹ Alatan, A.A.² Akansu, A.N.³

64
- 0027928121
- View-based and modular eigenspaces for face recognition
- Seattle, USA
- A. Pentland, B. Moghaddam, and T. Starner, "View-based and modular eigenspaces for face recognition," in IEEE International Conference on Computer Vision and Pattern Recognition, Seattle, USA, 1994.
- (1994) IEEE International Conference on Computer Vision and Pattern Recognition
- Pentland, A.¹ Moghaddam, B.² Starner, T.³

65
- 0030396150
- Automatic audio content analysis
- Boston, USA
- S. Pfeiffer, S. Fischer, and W. Effelsberg, "Automatic audio content analysis," in ACM Multimedia 1996, Boston, USA, 1996, pp. 21-30.
- (1996) ACM Multimedia 1996 , pp. 21-30
- Pfeiffer, S.¹ Fischer, S.² Effelsberg, W.³

66
- 0035442477
- Scene determination based on video and audio features
- S. Pfeiffer, R. Lienhart, and W. Effelsberg, "Scene determination based on video and audio features," Multimedia Tools and Applications, Vol. 15, No. 1, pp. 59-81, 2001.
- (2001) Multimedia Tools and Applications , vol.15 , Issue.1 , pp. 59-81
- Pfeiffer, S.¹ Lienhart, R.² Effelsberg, W.³

67
- 10044260677
- Face detection methods: A critical evaluation
- Intelligent Sensory Information Systems, University of Amsterdam, 2000
- T.V. Pham and M. Worring, "Face detection methods: A critical evaluation," Technical Report 2000-11, Intelligent Sensory Information Systems, University of Amsterdam, 2000.
- Technical Report 2000-11
- Pham, T.V.¹ Worring, M.²

68
- 10044230195
- Praja. http://www.praja.com.

69
- 0024610919
- A tutorial on hidden markov models and selected applications in speech recognition
- L.R. Rabiner, "A tutorial on hidden markov models and selected applications in speech recognition," Proceedings of the IEEE, Vol. 77, No. 2, pp. 257-286, 1989.
- (1989) Proceedings of the IEEE , vol.77 , Issue.2 , pp. 257-286
- Rabiner, L.R.¹

70
- 0031672526
- Neural network-based face detection
- H.A. Rowley, S. Baluja, and T. Kanade, "Neural network-based face detection," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 20, No. 1, pp. 23-38, 1998.
- (1998) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.20 , Issue.1 , pp. 23-38
- Rowley, H.A.¹ Baluja, S.² Kanade, T.³

71
- 0034440695
- Automatically extracting highlights for TV baseball programs
- Los Angeles, USA
- Y. Rui, A. Gupta, and A. Acero, "Automatically extracting highlights for TV baseball programs," in ACM Multimedia 2000, Los Angeles, USA, 2000, pp. 105-115.
- (2000) ACM Multimedia 2000 , pp. 105-115
- Rui, Y.¹ Gupta, A.² Acero, A.³

72
- 0033279866
- Content analysis of video using principal components
- E. Sahouria and A. Zakhor, "Content analysis of video using principal components," IEEE Transactions on Circuits and Systems for Video Technology, Vol. 9, No. 8, pp. 1290-1298, 1999.
- (1999) IEEE Transactions on Circuits and Systems for Video Technology , vol.9 , Issue.8 , pp. 1290-1298
- Sahouria, E.¹ Zakhor, A.²

73
- 0032306091
- Identification of story units in audio-visual sequences by joint audio and video processing
- Chicago, USA
- C. Saraceno and R. Leonardi, "Identification of story units in audio-visual sequences by joint audio and video processing," in IEEE International Conference on Image Processing, Chicago, USA, 1998.
- (1998) IEEE International Conference on Image Processing
- Saraceno, C.¹ Leonardi, R.²

74
- 0032660827
- Name-It: Naming and detecting faces in news videos
- S. Satoh, Y. Nakamura, and T. Kanade, "Name-It: Naming and detecting faces in news videos," IEEE Multimedia, Vol. 6, No. 1, pp. 22-35, 1999.
- (1999) IEEE Multimedia , vol.6 , Issue.1 , pp. 22-35
- Satoh, S.¹ Nakamura, Y.² Kanade, T.³

75
- 85076098160
- Automated analysis and annotation of basketball video
- San Jose, USA
- D.D. Saur, Y.-P. Tan, S.R. Kulkarni, and P.J. Ramadge, "Automated analysis and annotation of basketball video," in SPIE's Electronic Imaging conference on Storage and Retrieval for Image and Video Databases V, San Jose, USA, 1997, Vol. 3022, pp. 176-187.
- (1997) SPIE's Electronic Imaging Conference on Storage and Retrieval for Image and Video Databases V , vol.3022 , pp. 176-187
- Saur, D.D.¹ Tan, Y.-P.² Kulkarni, S.R.³ Ramadge, P.J.⁴

76
- 0033682228
- A statistical method for 3D object detection applied to faces and cars
- Hilton Head, USA
- H. Schneiderman and T. Kanade, "A statistical method for 3D object detection applied to faces and cars," in IEEE Computer Vision and Pattern Recognition, Hilton Head, USA, 2000.
- (2000) IEEE Computer Vision and Pattern Recognition
- Schneiderman, H.¹ Kanade, T.²

77
- 10044239073
- Incorporating domain knowledge with video and voice data analysis in news broadcasts
- Boston, USA
- K. Shearer, C. Dorai, and S. Venkatesh, "Incorporating domain knowledge with video and voice data analysis in news broadcasts," in ACM International Conference on Knowledge Discovery and Data Mining, Boston, USA, 2000, pp. 46-53.
- (2000) ACM International Conference on Knowledge Discovery and Data Mining , pp. 46-53
- Shearer, K.¹ Dorai, C.² Venkatesh, S.³

78
- 0000088247
- Automatic text extraction from video for content-based annotation and retrieval
- J. Shim, C. Dorai, and R. Bolle, "Automatic text extraction from video for content-based annotation and retrieval," in IEEE International Conference on Pattern Recognition, 1998, pp. 618-620.
- (1998) IEEE International Conference on Pattern Recognition , pp. 618-620
- Shim, J.¹ Dorai, C.² Bolle, R.³

79
- 0034498523
- Content based image retrieval at the end of the early years
- A.W.M. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain, "Content based image retrieval at the end of the early years," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 22, No. 12, pp. 1349-1380, 2000.
- (2000) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.22 , Issue.12 , pp. 1349-1380
- Smeulders, A.W.M.¹ Worring, M.² Santini, S.³ Gupta, A.⁴ Jain, R.⁵

80
- 0029373840
- Automatic indexing and content-based retrieval of captioned images
- R.K. Srihari, "Automatic indexing and content-based retrieval of captioned images," IEEE Computer, Vol. 28, No. 9, pp. 49-56, 1995.
- (1995) IEEE Computer , vol.28 , Issue.9 , pp. 49-56
- Srihari, R.K.¹

81
- 85029855944
- Automatic classification of tennis video for high-level content-based retrieval
- Bombay, India
- G. Sudhir, J.C.M. Lee, and A.K. Jain, "Automatic classification of tennis video for high-level content-based retrieval," in IEEE International Workshop on Content-Based Access of Image and Video Databases, in conjunction with ICCV'98, Bombay, India, 1998.
- (1998) IEEE International Workshop on Content-Based Access of Image and Video Databases, in Conjunction with ICCV'98
- Sudhir, G.¹ Lee, J.C.M.² Jain, A.K.³

82
- 85047467917
- Indoor-outdoor image classification
- Bombay, India
- M. Szummer and R.W. Picard, "Indoor-outdoor image classification," in IEEE International Workshop on Content-based Access of Image and Video Databases, in conjunction with ICCV'98, Bombay, India, 1998.
- (1998) IEEE International Workshop on Content-based Access of Image and Video Databases, in Conjunction with ICCV'98
- Szummer, M.¹ Picard, R.W.²

83
- 84908306296
- Determining dramatic intensification via flashing lights in movies
- Tokyo, Japan
- B.T. Truong and S. Venkatesh, "Determining dramatic intensification via flashing lights in movies," in IEEE International Conference on Multimedia & Expo, Tokyo, Japan, 2001, pp. 61-64.
- (2001) IEEE International Conference on Multimedia & Expo , pp. 61-64
- Truong, B.T.¹ Venkatesh, S.²

84
- 0345164806
- Automatic genre identification for content-based video categorization
- Barcelona, Spain
- B.T. Truong, S. Venkatesh, and C. Dorai, "Automatic genre identification for content-based video categorization," in IEEE International Conference on Pattern Recognition, Barcelona, Spain, 2000.
- (2000) IEEE International Conference on Pattern Recognition
- Truong, B.T.¹ Venkatesh, S.² Dorai, C.³

85
- 0035308821
- Content-based video parsing and indexing based on audio-visual interaction
- S. Tsekeridou and I. Pitas, "Content-based video parsing and indexing based on audio-visual interaction," IEEE Transactions on Circuits and Systems for Video Technology, Vol. 11, No. 4, pp. 522-535, 2001.
- (2001) IEEE Transactions on Circuits and Systems for Video Technology , vol.11 , Issue.4 , pp. 522-535
- Tsekeridou, S.¹ Pitas, I.²

86
- 0033883143
- Detecting sky and vegetation in outdoor images
- San Jose, USA
- A. Vailaya and A.K. Jain, "Detecting sky and vegetation in outdoor images," in Proceedings of SPIE: Storage and Retrieval for Image and Video Databases VIII, San Jose, USA, 2000, Vol. 3972.
- (2000) Proceedings of SPIE: Storage and Retrieval for Image and Video Databases VIII , vol.3972
- Vailaya, A.¹ Jain, A.K.²

87
- 0032455864
- On image classification: City images vs. landscapes
- A. Vailaya, A.K. Jain, and H.-J. Zhang, "On image classification: City images vs. landscapes," Pattern Recognition, Vol. 31, pp. 1921-1936, 1998.
- (1998) Pattern Recognition , vol.31 , pp. 1921-1936
- Vailaya, A.¹ Jain, A.K.² Zhang, H.-J.³

88
- 0036999134
- Systematic evaluation of logical story unit segmentation
- J. Vendrig and M. Worring, "Systematic evaluation of logical story unit segmentation," IEEE Transactions on Multimedia, Vol. 4, No. 4, pp. 492-499, 2002.
- (2002) IEEE Transactions on Multimedia , vol.4 , Issue.4 , pp. 492-499
- Vendrig, J.¹ Worring, M.²

89
- 10044257375
- Virage, http://www.virage.com.

90
- 85032751556
- Multimedia content analysis using both audio and visual clues
- Y. Wang, Z. Liu, and J. Huang, "Multimedia content analysis using both audio and visual clues," IEEE Signal Processing Magazine, Vol. 17, No. 6, pp. 12-36, 2000.
- (2000) IEEE Signal Processing Magazine , vol.17 , Issue.6 , pp. 12-36
- Wang, Y.¹ Liu, Z.² Huang, J.³

91
- 0013184624
- Image retrieval: Content versus context
- Paris, France
- T. Westerveld, "Image retrieval: Content versus context," in Content-Based Multimedia Information Access, RIAO 2000 Conference, Paris, France, 2000, pp. 276-284.
- (2000) Content-Based Multimedia Information Access, RIAO 2000 Conference , pp. 276-284
- Westerveld, T.¹

92
- 0030242072
- Content-based classification, search, and retrieval of audio
- E. Wold, T. Blum, D. Keislar, and J. Wheaton, "Content-based classification, search, and retrieval of audio," IEEE Multimedia, Vol. 3, No. 3, pp. 27-36, 1996.
- (1996) IEEE Multimedia , vol.3 , Issue.3 , pp. 27-36
- Wold, E.¹ Blum, T.² Keislar, D.³ Wheaton, J.⁴

93
- 0030241856
- Spatio-temporal segmentation of image sequences for object-oriented low bit-rate image coding
- L. Wu, J. Benois-Pineau, and D. Barba, "Spatio-temporal segmentation of image sequences for object-oriented low bit-rate image coding," Image Communication, Vol. 8, No. 6, pp. 513-544, 1996.
- (1996) Image Communication , vol.8 , Issue.6 , pp. 513-544
- Wu, L.¹ Benois-Pineau, J.² Barba, D.³

94
- 84908269466
- Algorithms and systems for segmentation and structure analysis in soccer video
- Tokyo, Japan
- P. Xu, L. Xie, S.-F. Chang, A. Divakaran, A. Vetro, and H. Sun, "Algorithms and systems for segmentation and structure analysis in soccer video," in IEEE International Conference on Multimedia & Expo, Tokyo, Japan, 2001, pp. 928-931.
- (2001) IEEE International Conference on Multimedia & Expo , pp. 928-931
- Xu, P.¹ Xie, L.² Chang, S.-F.³ Divakaran, A.⁴ Vetro, A.⁵ Sun, H.⁶

95
- 0036223025
- Detecting faces in images: A survey
- M.-H. Yang, D. Kriegman, and N. Ahuja, "Detecting faces in images: A survey," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 24, No. 1, pp. 34-58, 2002.
- (2002) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.24 , Issue.1 , pp. 34-58
- Yang, M.-H.¹ Kriegman, D.² Ahuja, N.³

96
- 85076107083
- Video content characterization and compaction for digital library applications
- M.M. Yeung and B.-L. Yeo, "Video content characterization and compaction for digital library applications," in IS&T/SPIE Storage and Retrieval of Image and Video Databases V, 1997, Vol. 3022, pp. 45-58.
- (1997) IS&T/SPIE Storage and Retrieval of Image and Video Databases V , vol.3022 , pp. 45-58
- Yeung, M.M.¹ Yeo, B.-L.²

97
- 34250082473
- Automatic partitioning of full-motion video
- H.-J. Zhang, A. Kankanhalli, and S.W. Smoliar, "Automatic partitioning of full-motion video," Multimedia Systems, Vol. 1, No. 1, pp. 10-28, 1993.
- (1993) Multimedia Systems , vol.1 , Issue.1 , pp. 10-28
- Zhang, H.-J.¹ Kankanhalli, A.² Smoliar, S.W.³

98
- 0000464746
- Automatic parsing and indexing of news video
- H.-J. Zhang, S.Y. Tan, S.W. Smoliar, and G. Yihong, "Automatic parsing and indexing of news video," Multimedia Systems, Vol. 2, No. 6, pp. 256-266, 1995.
- (1995) Multimedia Systems , vol.2 , Issue.6 , pp. 256-266
- Zhang, H.-J.¹ Tan, S.Y.² Smoliar, S.W.³ Yihong, G.⁴

99
- 0032629748
- Hierarchical classification of audio data for archiving and retrieving
- Phoenix, USA
- T. Zhang and C.-C.J. Kuo, "Hierarchical classification of audio data for archiving and retrieving," in IEEE International Conference on Acoustics, Speech, and Signal Processing, Phoenix, USA, 1999, Vol. 6, pp. 3001-3004.
- (1999) IEEE International Conference on Acoustics, Speech, and Signal Processing , vol.6 , pp. 3001-3004
- Zhang, T.¹ Kuo, C.-C.J.²

100
- 84908281148
- Structure analysis of sports video using domain models
- Tokyo, Japan
- D. Zhong and S.-F. Chang, "Structure analysis of sports video using domain models," in IEEE International Conference on Multimedia & Expo, Tokyo, Japan, 2001, pp. 920-923.
- (2001) IEEE International Conference on Multimedia & Expo , pp. 920-923
- Zhong, D.¹ Chang, S.-F.²

101
- 0033689853
- Automatic caption localization in compressed video
- Y. Zhong, H.-J. Zhang, and A.K. Jain, "Automatic caption localization in compressed video," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 22, No. 4, pp. 385-392, 2000.
- (2000) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.22 , Issue.4 , pp. 385-392
- Zhong, Y.¹ Zhang, H.-J.² Jain, A.K.³

102
- 84925836212
- Rule-based video classification system for basketball video indexing
- Los Angeles, USA
- W. Zhou, A. Vellaikal, and C.-C.J. Kuo, "Rule-based video classification system for basketball video indexing," in ACM Multimedia 2000, Los Angeles, USA, 2000.
- (2000) ACM Multimedia 2000
- Zhou, W.¹ Vellaikal, A.² Kuo, C.-C.J.³

103
- 79952051658
- Automatic news video segmentation and categorization based on closed-captioned text
- Tokyo, Japan
- W. Zhu, C. Toklu, and S.-P. Liou, "Automatic news video segmentation and categorization based on closed-captioned text," in IEEE International Conference on Multimedia & Expo, Tokyo, Japan, 2001, pp. 1036-1039.
- (2001) IEEE International Conference on Multimedia & Expo , pp. 1036-1039
- Zhu, W.¹ Toklu, C.² Liou, S.-P.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.