Volume 9, Issue 1, 2007, Pages 89-101

Major cast detection in video using both speaker and face information

Author keywords

Content based multimedia indexing; Face detection; Major cast detection; Media integration; Speaker segmentation; Video browsing; Video summary

Indexed keywords

CONTENT-BASED MULTIMEDIA INDEXING; FACE DETECTION; MEDIA INTEGRATION; SPEAKER SEGMENTATION; VIDEO BROWSING; VIDEO SUMMARY;

EID: 33846216333     PISSN: 15209210     EISSN: None     Source Type: Journal    
DOI: 10.1109/TMM.2006.886360     Document Type: Article
Times cited : (20)

References (29)
  • 1
    • 0031619139
    • A hidden Markov model framework for video segmentation using audio and image features
    • Seattle, WA, May 12-15
    • J. S. Boreczky and L. D. Wilcox, "A hidden Markov model framework for video segmentation using audio and image features," in Proc. ICASSP'1998, Seattle, WA, May 12-15, 1998, vol. 6, pp. 3741-3744.
    • (1998) Proc. ICASSP'1998 , vol.6 , pp. 3741-3744
    • Boreczky, J.S.1    Wilcox, L.D.2
  • 2
    • 0029304865
    • Human and machine recognition of faces: A survey
    • May
    • R. Chellappa, C. L. Wilson, and S. Sirohey, "Human and machine recognition of faces: A survey," Proc. IEEE, vol. 83, no. 5, pp. 705-741, May 1995.
    • (1995) Proc. IEEE , vol.83 , Issue.5 , pp. 705-741
    • Chellappa, R.1    Wilson, C.L.2    Sirohey, S.3
  • 4
    • 0037860595
    • Look who's talking: Speaker detection using video and audio correlation
    • New York, Nov
    • R. Cutler and L. Davis, "Look who's talking: Speaker detection using video and audio correlation," in Proc. ICME, New York, Nov. 2000.
    • (2000) Proc. ICME
    • Cutler, R.1    Davis, L.2
  • 5
    • 0029375609
    • Query by image and video content: The QBIC system
    • Sep
    • M. Flickner et al., "Query by image and video content: The QBIC system," IEEE Comput., vol. 28, no. 9, pp. 23-32, Sep. 1995.
    • (1995) IEEE Comput , vol.28 , Issue.9 , pp. 23-32
    • Flickner, M.1
  • 6
    • 0032296995
    • Efficient filtering and clustering methods for temporal video segmentation and visual summarization
    • Dec
    • A. Ferman and A. Tekalp, "Efficient filtering and clustering methods for temporal video segmentation and visual summarization," J. Vis. Commun. Image Represen., vol. 9, no. 4, pp. 336-351, Dec. 1998.
    • (1998) J. Vis. Commun. Image Represen , vol.9 , Issue.4 , pp. 336-351
    • Ferman, A.1    Tekalp, A.2
  • 7
    • 67649123507
    • Semantic indexing of multimedia using audio, text and visual cues
    • Lausanne, Switzerland
    • C. N. G. Iyengar, H. Nock, and M. Franz, "Semantic indexing of multimedia using audio, text and visual cues," in Proc. ICME, Lausanne, Switzerland, 2002, pp. 369-372.
    • (2002) Proc. ICME , pp. 369-372
    • Iyengar, C.N.G.1    Nock, H.2    Franz, M.3
  • 8
    • 0032320287
    • Integration of audio and visual information for content-based video segmentation
    • Chicago, IL, Oct. 4-7
    • J. Huang, Z. Liu, and Y. Wang, "Integration of audio and visual information for content-based video segmentation," in Proc. ICIP'1998, Chicago, IL, Oct. 4-7, 1998, vol. 3, pp. 526-530.
    • (1998) Proc. ICIP'1998 , vol.3 , pp. 526-530
    • Huang, J.1    Liu, Z.2    Wang, Y.3
  • 12
    • 0033690894
    • A new distance measure for probability distribution function of mixture type
    • Istanbul, Turkey, Jun. 5-9
    • Z. Liu and Q. Huang, "A new distance measure for probability distribution function of mixture type," in Proc. ICASSP '2000, Istanbul, Turkey, Jun. 5-9, 2000.
    • (2000) Proc. ICASSP '2000
    • Liu, Z.1    Huang, Q.2
  • 13
    • 0034445531
    • Face detection and tracking in video using dynamic programming
    • Vancouver, BC, Canada, Sep. 10-13
    • Z. Liu and Y. Wang, "Face detection and tracking in video using dynamic programming," in Proc. ICIP'2000, Vancouver, BC, Canada, Sep. 10-13, 2000.
    • (2000) Proc. ICIP'2000
    • Liu, Z.1    Wang, Y.2
  • 14
    • 0034841928
    • Major cast detection in video using both audio and visual information
    • Salt Lake City, UT, May 7-11
    • Z. Liu and Y. Wang, "Major cast detection in video using both audio and visual information," in Proc. ICASSP'2001, Salt Lake City, UT, May 7-11, 2001.
    • (2001) Proc. ICASSP'2001
    • Liu, Z.1    Wang, Y.2
  • 15
    • 0032181880
    • Audio feature extraction and analysis for scene segmentation and classification
    • Oct
    • Z. Liu, Y. Wang, and T. Chen, "Audio feature extraction and analysis for scene segmentation and classification," J. VLSI Signal Process. Syst. for Signal, Image, and Video Technol., vol. 20, no. 1/2, pp. 61-79, Oct. 1998.
    • (1998) J. VLSI Signal Process. Syst. for Signal, Image, and Video Technol , vol.20 , Issue.1-2 , pp. 61-79
    • Liu, Z.1    Wang, Y.2    Chen, T.3
  • 17
    • 0004171986
    • Available
    • Online
    • A. Martinez and R. Benavente, The AR Face Database 1998 [Online]. Available: http://rvl1.ecn.purdue.edu/~aleix/aleix_face_DB.html
    • (1998)
    • Martinez, A.1    Benavente, R.2
  • 19
    • 0029209272
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • Jan
    • D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. Speech Audio Process., vol. 3, no. 1, pp. 72-83, Jan. 1995.
    • (1995) IEEE Trans. Speech Audio Process , vol.3 , Issue.1 , pp. 72-83
    • Reynolds, D.A.1    Rose, R.C.2
  • 21
    • 0032663356
    • Image retrieval: Current technologies, promising directions, and open issues
    • Mar
    • Y. Rui, T. S. Huang, and S.-F. Chang, "Image retrieval: Current technologies, promising directions, and open issues," J. Vis. Commun. Image Represen., vol. 10, no. 1, pp. 39-62, Mar. 1999.
    • (1999) J. Vis. Commun. Image Represen , vol.10 , Issue.1 , pp. 39-62
    • Rui, Y.1    Huang, T.S.2    Chang, S.-F.3
  • 22
    • 0026755044
    • Automatic recognition and analysis of human faces and facial expressions: A survey
    • A. Samal and P. A. Iyengar, "Automatic recognition and analysis of human faces and facial expressions: A survey," Pattern Recognit., vol. 25, no. 1, pp. 65-77, 1992.
    • (1992) Pattern Recognit , vol.25 , Issue.1 , pp. 65-77
    • Samal, A.1    Iyengar, P.A.2
  • 23
    • 0032306091
    • Identification of story units in audio-visual sequences by joint audio and video processing
    • Chicago, IL, Oct. 4-7
    • C. Saraceno and R. Leonardi, "Identification of story units in audio-visual sequences by joint audio and video processing," in Proc. ICIP'1998, Chicago, IL, Oct. 4-7, 1998, vol. 1, pp. 363-367.
    • (1998) Proc. ICIP'1998 , vol.1 , pp. 363-367
    • Saraceno, C.1    Leonardi, R.2
  • 24
    • 0032660827
    • Name-it: Naming and detecting faces in news videos
    • Jan.-Mar
    • S. Satoh, Y. Nakamura, and T. Kanade, "Name-it: Naming and detecting faces in news videos," IEEE Multimedia Mag., vol. 6, no. 1, pp. 22-35, Jan.-Mar. 1999.
    • (1999) IEEE Multimedia Mag , vol.6 , Issue.1 , pp. 22-35
    • Satoh, S.1    Nakamura, Y.2    Kanade, T.3
  • 25
    • 0029765670
    • Real-time discrimination of broadcast speech/music
    • Atlanta, GA, May 7-10
    • J. Saunders, "Real-time discrimination of broadcast speech/music," in Proc. ICASSP'1996, Atlanta, GA, May 7-10, 1996, vol. 2, pp. 993-996.
    • (1996) Proc. ICASSP'1996 , vol.2 , pp. 993-996
    • Saunders, J.1
  • 27
    • 0030242072
    • Content-based classification, search, and retrieval of audio
    • Sep
    • E. Wold, T. Blum, D. Keislar, and J. Wheaton, "Content-based classification, search, and retrieval of audio," IEEE Multimedia Mag., vol. 3, no. 3, pp. 27-36, Sep. 1996.
    • (1996) IEEE Multimedia Mag , vol.3 , Issue.3 , pp. 27-36
    • Wold, E.1    Blum, T.2    Keislar, D.3    Wheaton, J.4
  • 29
    • 0032629748
    • Hierarchical classification of audio data for archiving and retrieving
    • Phoenix, AZ, Mar. 15-19
    • T. Zhang and C.-C. J. Kuo, "Hierarchical classification of audio data for archiving and retrieving," in Proc. ICASSP'1999, Phoenix, AZ, Mar. 15-19, 1999, vol. 6, pp. 3001-3004.
    • (1999) Proc. ICASSP'1999 , vol.6 , pp. 3001-3004
    • Zhang, T.1    Kuo, C.-C.J.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.