메뉴 건너뛰기




Volumn , Issue , 2012, Pages 2377-2380

Segmentation of TV shows into scenes using speaker diarization and speech recognition

Author keywords

multimodal fusion; scene boundary detection; scene transition graph; speaker diarization; speech recognition

Indexed keywords

AUDIO-VISUAL DOCUMENT; AUTOMATIC SPEECH RECOGNITION; BOUNDARY DETECTION; COLOR HISTOGRAM; MULTI-MODAL APPROACH; MULTI-MODAL FUSION; SEMANTIC INFORMATION; SPEAKER DIARIZATION; STATE-OF-THE-ART ALGORITHMS; TRANSITION GRAPHS; VISUAL INFORMATION;

EID: 84867605264     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2012.6288393     Document Type: Conference Paper
Times cited : (18)

References (9)
  • 1
    • 3242764570 scopus 로고    scopus 로고
    • Shot Clustering Techniques for Story Browsing
    • August
    • Wallapak Tavanapong and Junyu Zhou, "Shot Clustering Techniques for Story Browsing," IEEE Transactions on Multimedia, vol. 6, no. 4, pp. 517-527, August 2004.
    • (2004) IEEE Transactions on Multimedia , vol.6 , Issue.4 , pp. 517-527
    • Tavanapong, W.1    Zhou, J.2
  • 2
    • 2142771243 scopus 로고    scopus 로고
    • Structure Analysis of Soccer Video with Domain Knowledge and Hidden Markov Models
    • May
    • Lexing Xie, Peng Xu, Shih-Fu Chang, Ajay Divakaran, and Huifang Sun, "Structure Analysis of Soccer Video with Domain Knowledge and Hidden Markov Models," Pattern Recognition Letters - Video computing, vol. 25, pp. 767-775, May 2004.
    • (2004) Pattern Recognition Letters - Video Computing , vol.25 , pp. 767-775
    • Xie, L.1    Xu, P.2    Chang, S.-F.3    Divakaran, A.4    Sun, H.5
  • 4
    • 0037001728 scopus 로고    scopus 로고
    • Computable Scenes and Structures in Films
    • December
    • Hari Sundaram and Shih-Fu Chang, "Computable Scenes and Structures in Films," IEEE Transactions on Multimedia, vol. 4, no. 4, pp. 482-491, December 2002.
    • (2002) IEEE Transactions on Multimedia , vol.4 , Issue.4 , pp. 482-491
    • Sundaram, H.1    Chang, S.-F.2
  • 5
    • 62249087788 scopus 로고    scopus 로고
    • Video Scene Segmentation and Semantic Representation Using a Novel Scheme
    • April
    • Songhao Zhu and Yuncai Liu, "Video Scene Segmentation and Semantic Representation Using a Novel Scheme," Multimedia Tools and Applications, vol. 42, no. 2, pp. 183-205, April 2009.
    • (2009) Multimedia Tools and Applications , vol.42 , Issue.2 , pp. 183-205
    • Zhu, S.1    Liu, Y.2
  • 7
    • 0032121957 scopus 로고    scopus 로고
    • Segmentation of Video by Clustering and Graph Analysis
    • July
    • Minerva Yeung, Boon-Lock Yeo, and Bede Liu, "Segmentation of Video by Clustering and Graph Analysis," Computer Vision and Image Understanding, vol. 71, no. 1, pp. 94-109, July 1998.
    • (1998) Computer Vision and Image Understanding , vol.71 , Issue.1 , pp. 94-109
    • Yeung, M.1    Yeo, B.-L.2    Liu, B.3
  • 9
    • 0036567851 scopus 로고    scopus 로고
    • The LIMSI Broadcast News Transcription System
    • Jean-Luc Gauvain, Lori Lamel, and Gilles Adda, "The LIMSI Broadcast News Transcription System," Speech Communication, vol. 37, no. 1-2, pp. 89-109, 2002.
    • (2002) Speech Communication , vol.37 , Issue.1-2 , pp. 89-109
    • Gauvain, J.-L.1    Lamel, L.2    Adda, G.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.