메뉴 건너뛰기




Volumn 20, Issue 2-3 SPEC. ISS., 2006, Pages 303-330

Step-by-step and integrated approaches in broadcast news speaker diarization

Author keywords

E HMM; Integrated approach; Speaker diarization; Speaker indexing; Speaker segmentation and clustering; Step by step approach

Indexed keywords

IMAGE SEGMENTATION; PATTERN RECOGNITION SYSTEMS; TELEVISION BROADCASTING;

EID: 29044442235     PISSN: 08852308     EISSN: 10958363     Source Type: Journal    
DOI: 10.1016/j.csl.2005.08.002     Document Type: Conference Paper
Times cited : (108)

References (37)
  • 2
    • 84946742526 scopus 로고    scopus 로고
    • A robust speaker clustering algorithm
    • IEEE, ASRU 2003, St. Thomas, US Virgin Islands
    • Ajmera, J., Wooters, C., 2003. A robust speaker clustering algorithm. In: Automatic Speech Recognition and Understanding, IEEE, ASRU 2003, St. Thomas, US Virgin Islands, pp. 411-416.
    • (2003) Automatic Speech Recognition and Understanding , pp. 411-416
    • Ajmera, J.1    Wooters, C.2
  • 3
    • 29044446864 scopus 로고    scopus 로고
    • Speaker, environment and channel change detection and clustering via the bayesian information criterion
    • Landsdowne, VA
    • Chen, S., Gopalakrishnan, P., 1998. Speaker, environment and channel change detection and clustering via the bayesian information criterion. In: DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne, VA.
    • (1998) DARPA Broadcast News Transcription and Understanding Workshop
    • Chen, S.1    Gopalakrishnan, P.2
  • 5
    • 0034273195 scopus 로고    scopus 로고
    • DISTBIC: A speaker based segmentation for audio data indexing
    • P. Delacourt, and C.J. Welkens DISTBIC: a speaker based segmentation for audio data indexing Speech Communication 32 2000 111 126
    • (2000) Speech Communication , vol.32 , pp. 111-126
    • Delacourt, P.1    Welkens, C.J.2
  • 7
    • 29044436483 scopus 로고    scopus 로고
    • The NIST 2004 spring rich transcription evaluation: Two-axis merging strategy in the context of multiple distance microphone based meeting speaker segmentation
    • Fredouille, C., Moraru, D., Meignier, S., Besacier, L., Bonastre, J.-F., 2004. The NIST 2004 spring rich transcription evaluation: two-axis merging strategy in the context of multiple distance microphone based meeting speaker segmentation, In: RT2004 Spring Meeting Recognition Workshop, p. 5.
    • (2004) RT2004 Spring Meeting Recognition Workshop , pp. 5
    • Fredouille, C.1    Moraru, D.2    Meignier, S.3    Besacier, L.4    Bonastre, J.-F.5
  • 8
    • 0028419019 scopus 로고
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • J.-L. Gauvain, and C.H. Lee Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains IEEE Transactions on Speech and Audio Processing 22 1994 291 298
    • (1994) IEEE Transactions on Speech and Audio Processing , vol.22 , pp. 291-298
    • Gauvain, J.-L.1    Lee, C.H.2
  • 11
    • 0036567851 scopus 로고    scopus 로고
    • The LIMSI broadcast news transcription system
    • J.-L. Gauvain, L. Lamel, and G. Adda The LIMSI broadcast news transcription system Speech Communication 37 1-2 2002 89 108
    • (2002) Speech Communication , vol.37 , Issue.1-2 , pp. 89-108
    • Gauvain, J.-L.1    Lamel, L.2    Adda, G.3
  • 23
  • 25
    • 29044441444 scopus 로고    scopus 로고
    • (Version 4, Updated 02/25/2003) (February)
    • NIST, The rich transcription spring 2003 (RT-03S) evaluation plan. Available from: 〈http://www.nist.gov/speech/tests/rt/rt2003/spring/docs/ rt03-spring-eval-plan-v4.pdf〉, (Version 4, Updated 02/25/2003) (February 2003).
    • (2003) The Rich Transcription Spring 2003 (RT-03S) Evaluation Plan
  • 27
    • 33750543737 scopus 로고    scopus 로고
    • Clips-imag at trec-11: Experiments in video retrieval
    • Gaithersburg, MD, USA
    • Quénot, G., Moraru, D., Besacier, L., Mulhem, P., 2002. Clips-imag at trec-11: Experiments in video retrieval. In: TREC 2002, Gaithersburg, MD, USA.
    • (2002) TREC 2002
    • Quénot, G.1    Moraru, D.2    Besacier, L.3    Mulhem, P.4
  • 28
    • 29044439703 scopus 로고    scopus 로고
    • Clips at trecvid: Shot boundary detection and feature detection
    • Gaithersburg, MD, USA
    • Quénot, G., Moraru, D., Besacier, L., 2003. Clips at trecvid: Shot boundary detection and feature detection. In: TREC 2003, Gaithersburg, MD, USA.
    • (2003) TREC 2003
    • Quénot, G.1    Moraru, D.2    Besacier, L.3
  • 31
    • 0000120766 scopus 로고
    • Estimating the dimension of a model
    • G. Schwarz Estimating the dimension of a model The Annals of Statistics 6 2 1978 461 464
    • (1978) The Annals of Statistics , vol.6 , Issue.2 , pp. 461-464
    • Schwarz, G.1
  • 32
    • 0002782496 scopus 로고    scopus 로고
    • Automatic segmentation and clustering of broadcast news audio
    • Westfields, Chantilly, Virginia
    • Siegler, M., Jain, U., Raj, B., Stern, R., 1997. Automatic segmentation and clustering of broadcast news audio. In: the DARPA Speech Recognition Workshop, Westfields, Chantilly, Virginia.
    • (1997) The DARPA Speech Recognition Workshop
    • Siegler, M.1    Jain, U.2    Raj, B.3    Stern, R.4
  • 37
    • 0036567794 scopus 로고    scopus 로고
    • The development of the HTK broadcast news transcription system: An overview
    • P. Woodland The development of the HTK broadcast news transcription system: an overview Speech Communication 37 1-2 2002 291 299
    • (2002) Speech Communication , vol.37 , Issue.1-2 , pp. 291-299
    • Woodland, P.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.