SCOPUS Information Search Platform

IEEE Transactions on Multimedia

Volume 9, Issue 1, 2007, Pages 89-101

Major cast detection in video using both speaker and face information

(2) Liu, Zhu a,b Wang, Yao a,c

a IEEE (United States)

b AT AND T LABS RESEARCH (United States)

c POLYTECHNIC UNIVERSITY (United States)

Author keywords

Content based multimedia indexing; Face detection; Major cast detection; Media integration; Speaker segmentation; Video browsing; Video summary

Indexed keywords

CONTENT-BASED MULTIMEDIA INDEXING; FACE DETECTION; MEDIA INTEGRATION; SPEAKER SEGMENTATION; VIDEO BROWSING; VIDEO SUMMARY;

ALGORITHMS; IMAGE ANALYSIS; IMAGE SEGMENTATION; INFORMATION ANALYSIS; MOTION PICTURES;

FACE RECOGNITION;

EID: 33846216333 PISSN: 15209210 EISSN: None Source Type: Journal
DOI: 10.1109/TMM.2006.886360 Document Type: Article

Times cited : (20)

References (29)

1
- 0031619139
- A hidden Markov model framework for video segmentation using audio and image features
- Seattle, WA, May 12-15
- J. S. Boreczky and L. D. Wilcox, "A hidden Markov model framework for video segmentation using audio and image features," in Proc. ICASSP'1998, Seattle, WA, May 12-15, 1998, vol. 6, pp. 3741-3744.
- (1998) Proc. ICASSP'1998 , vol.6 , pp. 3741-3744
- Boreczky, J.S.¹ Wilcox, L.D.²

2
- 0029304865
- Human and machine recognition of faces: A survey
- May
- R. Chellappa, C. L. Wilson, and S. Sirohey, "Human and machine recognition of faces: A survey," Proc. IEEE, vol. 83, no. 5, pp. 705-741, May 1995.
- (1995) Proc. IEEE , vol.83 , Issue.5 , pp. 705-741
- Chellappa, R.¹ Wilson, C.L.² Sirohey, S.³

3
- 84889281816
- New York: Wiley
- T. M. Cover and J. A. Thomas, Elements of Information Theory. New York: Wiley, 1991.
- (1991) Elements of Information Theory
- Cover, T.M.¹ Thomas, J.A.²

4
- 0037860595
- Look who's talking: Speaker detection using video and audio correlation
- New York, Nov
- R. Cutler and L. Davis, "Look who's talking: Speaker detection using video and audio correlation," in Proc. ICME, New York, Nov. 2000.
- (2000) Proc. ICME
- Cutler, R.¹ Davis, L.²

5
- 0029375609
- Query by image and video content: The QBIC system
- Sep
- M. Flickner et al., "Query by image and video content: The QBIC system," IEEE Comput., vol. 28, no. 9, pp. 23-32, Sep. 1995.
- (1995) IEEE Comput , vol.28 , Issue.9 , pp. 23-32
- Flickner, M.¹

6
- 0032296995
- Efficient filtering and clustering methods for temporal video segmentation and visual summarization
- Dec
- A. Ferman and A. Tekalp, "Efficient filtering and clustering methods for temporal video segmentation and visual summarization," J. Vis. Commun. Image Repres., vol. 9, no. 4, pp. 336-351, Dec. 1998.
- (1998) J. Vis. Commun. Image Repres , vol.9 , Issue.4 , pp. 336-351
- Ferman, A.¹ Tekalp, A.²

7
- 67649123507
- Semantic indexing of multimedia using audio, text and visual cues
- Lausanne, Switzerland
- C. N. G. Iyengar, H. Nock, and M. Franz, "Semantic indexing of multimedia using audio, text and visual cues," in Proc. ICME, Lausanne, Switzerland, 2002, pp. 369-372.
- (2002) Proc. ICME , pp. 369-372
- Iyengar, C.N.G.¹ Nock, H.² Franz, M.³

8
- 0032320287
- Integration of audio and visual information for content-based video segmentation
- Chicago, IL, Oct. 4-7
- J. Huang, Z. Liu, and Y. Wang, "Integration of audio and visual information for content-based video segmentation," in Proc. ICIP'1998, Chicago, IL, Oct. 4-7, 1998, vol. 3, pp. 526-530.
- (1998) Proc. ICIP'1998 , vol.3 , pp. 526-530
- Huang, J.¹ Liu, Z.² Wang, Y.³

9
- 0009622481
- Learning joint statistical models for audio-visual fusion and segregation
- Denver, CO
- W. F. J. Fisher, T. Darrell, and P. Viola, "Learning joint statistical models for audio-visual fusion and segregation," in Advances in Neural Information Processing Systems., Denver, CO, 2000.
- (2000) Advances in Neural Information Processing Systems
- Fisher, W.F.J.¹ Darrell, T.² Viola, P.³

10
- 0004161991
- Upper Saddle River, NJ: Prentice-Hall
- A. K. Jain and R. C. Dubes, Algorithms for Clustering Data. Upper Saddle River, NJ: Prentice-Hall, 1998.
- (1998) Algorithms for Clustering Data
- Jain, A.K.¹ Dubes, R.C.²

11
- 0032692096
- Scene determination based on video and audio features
- Jun. 7-11
- R. Lienhart, S. Pfeiffer, and W. Effelsberg, "Scene determination based on video and audio features," in Proc. IEEE Int. Conf. Multimedia Computing and Systems, Jun. 7-11, 1999, vol. 1, pp. 685-690.
- (1999) Proc. IEEE Int. Conf. Multimedia Computing and Systems , vol.1 , pp. 685-690
- Lienhart, R.¹ Pfeiffer, S.² Effelsberg, W.³

12
- 0033690894
- A new distance measure for probability distribution function of mixture type
- Istanbul, Turkey, Jun. 5-9
- Z. Liu and Q. Huang, "A new distance measure for probability distribution function of mixture type," in Proc. ICASSP '2000, Istanbul, Turkey, Jun. 5-9, 2000.
- (2000) Proc. ICASSP '2000
- Liu, Z.¹ Huang, Q.²

13
- 0034445531
- Face detection and tracking in video using dynamic programming
- Vancouver, BC, Canada, Sep. 10-13
- Z. Liu and Y. Wang, "Face detection and tracking in video using dynamic programming," in Proc. ICIP'2000, Vancouver, BC, Canada, Sep. 10-13, 2000.
- (2000) Proc. ICIP'2000
- Liu, Z.¹ Wang, Y.²

14
- 0034841928
- Major cast detection in video using both audio and visual information
- Salt Lake City, UT, May 7-11
- _, "Major cast detection in video using both audio and visual information," in Proc. ICASSP'2001, Salt Lake City, UT, May 7-11, 2001.
- (2001) Proc. ICASSP'2001
- Liu, Z.¹ Wang, Y.²

15
- 0032181880
- Audio feature extraction and analysis for scene segmentation and classification
- Oct
- Z. Liu, Y. Wang, and T. Chen, "Audio feature extraction and analysis for scene segmentation and classification," J. VLSI Signal Process. Syst. for Signal, Image, and Video Technol., vol. 20, no. 1/2, pp. 61-79, Oct. 1998.
- (1998) J. VLSI Signal Process. Syst. for Signal, Image, and Video Technol , vol.20 , Issue.1-2 , pp. 61-79
- Liu, Z.¹ Wang, Y.² Chen, T.³

16
- 0031650491
- Scene break detection: A comparison
- Feb
- G. Lupatini, C. Saraceno, and R. Leonardi, "Scene break detection: A comparison," in Proc. 8th Int. Workshop on Research Issues In Data Engineering, Feb. 1998, pp. 34-41.
- (1998) Proc. 8th Int. Workshop on Research Issues In Data Engineering , pp. 34-41
- Lupatini, G.¹ Saraceno, C.² Leonardi, R.³

17
- 0004171986
- A. Martinez and R. Benavente, The AR Face Database 1998 [Online]. Available: http://rvl1.ecn.purdue.edu/~aleix/aleix_face_DB.html
- (1998)
- Martinez, A.¹ Benavente, R.²

18
- 0036223025
- Detecting faces in images: A survey
- Jan
- M. Y. N. Ahuja and D. Kriegman, "Detecting faces in images: A survey," IEEE Trans. Pattern Anal. Mach. Intell., vol. 24, no. 1, pp. 34-58, Jan. 2002.
- (2002) IEEE Trans. Pattern Anal. Mach. Intell , vol.24 , Issue.1 , pp. 34-58
- Ahuja, M.Y.N.¹ Kriegman, D.²

19
- 0029209272
- Robust text-independent speaker identification using Gaussian mixture speaker models
- Jan
- D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. Speech Audio Process., vol. 3, no. 1, pp. 72-83, Jan. 1995.
- (1995) IEEE Trans. Speech Audio Process , vol.3 , Issue.1 , pp. 72-83
- Reynolds, D.A.¹ Rose, R.C.²

20
- 0031672526
- Neural network-based face detection
- Jan
- H. A. Rowley, S. Baluja, and T. Kanade, "Neural network-based face detection," IEEE Trans. Pattern Anal. Mach. Intell., vol. 20, no. 1, pp. 22-38, Jan. 1998.
- (1998) IEEE Trans. Pattern Anal. Mach. Intell , vol.20 , Issue.1 , pp. 22-38
- Rowley, H.A.¹ Baluja, S.² Kanade, T.³

21
- 0032663356
- Image retrieval: Current technologies, promising directions, and open issues
- Mar
- Y. Rui, T. S. Huang, and S.-F. Chang, "Image retrieval: Current technologies, promising directions, and open issues," J. Vis. Commun. Image Represen., vol. 10, no. 1, pp. 39-62, Mar. 1999.
- (1999) J. Vis. Commun. Image Represen , vol.10 , Issue.1 , pp. 39-62
- Rui, Y.¹ Huang, T.S.² Chang, S.-F.³

22
- 0026755044
- Automatic recognition and analysis of human faces and facial expressions: A survey
- A. Samal and P. A. Iyengar, "Automatic recognition and analysis of human faces and facial expressions: A survey," Pattern Recognit., vol. 25, no. 1, pp. 65-77, 1992.
- (1992) Pattern Recognit , vol.25 , Issue.1 , pp. 65-77
- Samal, A.¹ Iyengar, P.A.²

23
- 0032306091
- Identification of story units in audio-visual sequences by joint audio and video processing
- Chicago, IL, Oct. 4-7
- C. Saraceno and R. Leonardi, "Identification of story units in audio-visual sequences by joint audio and video processing," in Proc. ICIP'1998, Chicago, IL, Oct. 4-7, 1998, vol. 1, pp. 363-367.
- (1998) Proc. ICIP'1998 , vol.1 , pp. 363-367
- Saraceno, C.¹ Leonardi, R.²

24
- 0032660827
- Name-it: Naming and detecting faces in news videos
- Jan.-Mar
- S. Satoh, Y. Nakamura, and T. Kanade, "Name-it: Naming and detecting faces in news videos," IEEE Multimedia Mag., vol. 6, no. 1, pp. 22-35, Jan.-Mar. 1999.
- (1999) IEEE Multimedia Mag , vol.6 , Issue.1 , pp. 22-35
- Satoh, S.¹ Nakamura, Y.² Kanade, T.³

25
- 0029765670
- Real-time discrimination of broadcast speech/music
- Atlanta, GA, May 7-10
- J. Saunders, "Real-time discrimination of broadcast speech/music," in Proc. ICASSP'1996, Atlanta, GA, May 7-10, 1996, vol. 2, pp. 993-996.
- (1996) Proc. ICASSP'1996 , vol.2 , pp. 993-996
- Saunders, J.¹

26
- 0003450542
- New York: Springer
- V. N. Vapnik, The Nature of Statistical Learning Theory. New York: Springer, 1998.
- (1998) The Nature of Statistical Learning Theory
- Vapnik, V.N.¹

27
- 0030242072
- Content-based classification, search, and retrieval of audio
- Sep
- E. Wold, T. Blum, D. Keislar, and J. Wheaton, "Content-based classification, search, and retrieval of audio," IEEE Multimedia Mag., vol. 3, no. 3, pp. 27-36, Sep. 1996.
- (1996) IEEE Multimedia Mag , vol.3 , Issue.3 , pp. 27-36
- Wold, E.¹ Blum, T.² Keislar, D.³ Wheaton, J.⁴

28
- 34250082473
- Automatic partitioning of video
- H. J. Zhang, A. Kankanhalli, and S. W. Smoliar, "Automatic partitioning of video," Multimedia Syst., vol. 1, no. 1, pp. 10-28, 1993.
- (1993) Multimedia Syst , vol.1 , Issue.1 , pp. 10-28
- Zhang, H.J.¹ Kankanhalli, A.² Smoliar, S.W.³

29
- 0032629748
- Hierarchical classification of audio data for archiving and retrieving
- Phoenix, AZ, Mar. 15-19
- T. Zhang and C.-C. J. Kuo, "Hierarchical classification of audio data for archiving and retrieving," in Proc. ICASSP'1999, Phoenix, AZ, Mar. 15-19, 1999, vol. 6, pp. 3001-3004.
- (1999) Proc. ICASSP'1999 , vol.6 , pp. 3001-3004
- Zhang, T.¹ Kuo, C.-C.J.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.