메뉴 건너뛰기




Volumn 16, Issue 8, 2008, Pages 1590-1601

Strategies to improve the robustness of agglomerative hierarchical clustering under data source variation for speaker diarization

Author keywords

Agglomerative hierarchical clustering (ahc); Bayesian information criterion (bic); Generalized likelihood ratio (glr); Information change rate (icr); Selective agglomerative hierarchical clustering (sahc); Speaker diarization

Indexed keywords

AGGLOMERATIVE HIERARCHICAL CLUSTERING (AHC); BAYESIAN INFORMATION CRITERION (BIC); GENERALIZED LIKELIHOOD RATIO (GLR); INFORMATION CHANGE RATE (ICR); SELECTIVE AGGLOMERATIVE HIERARCHICAL CLUSTERING (SAHC); SPEAKER DIARIZATION;

EID: 70350572462     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2008.2002085     Document Type: Article
Times cited : (65)

References (24)
  • 1
    • 34047261805 scopus 로고    scopus 로고
    • "An overview of automatic speaker diarization systems, ", vol, no. Sep
    • S. E. Tranter and D. A. Reynolds, "An overview of automatic speaker diarization systems, " IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1557-1565, Sep. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.5 , pp. 1557-1565
    • Tranter, S.E.1    Reynolds, D.A.2
  • 2
    • 33748523736 scopus 로고    scopus 로고
    • National Institute of Standards and Technology (NIST). [Online]. Available
    • Benchmark Tests: Rich Transcription, National Institute of Standards and Technology (NIST). [Online]. Available: http://www.nist.gov/speech/tests/rt/.
    • Benchmark Tests: Rich Transcription
  • 5
    • 34047264090 scopus 로고    scopus 로고
    • The MIT Lincoln Laboratory RT-04F diarization systems: Applications to broadcast news and telephone conversations
    • Nov., CD-ROM
    • D. A. Reynolds and P. A. Torres-Carrasquillo, "The MIT Lincoln Laboratory RT-04F diarization systems: Applications to broadcast news and telephone conversations, " in Proc. Fall 2004 Rich Transcription Workshop (RT-04), Nov. 2004, CD-ROM.
    • (2004) Proc. Fall 2004 Rich Transcription Workshop (RT-04)
    • Reynolds, D.A.1    Torres-Carrasquillo, P.A.2
  • 9
    • 29044442235 scopus 로고    scopus 로고
    • Step-by-step and integrated approaches in broadcast news speaker diarization
    • Jul
    • S. Meignier, D. Moraru, C. Fredouille, J.-F. Bonastre, and L. Besacier, "Step-by-step and integrated approaches in broadcast news speaker diarization, " Comput. Speech Lang., vol. 20, no. 2-3, pp. 303-330, Jul. 2006.
    • (2006) Comput. Speech Lang. , vol.20 , Issue.2-3 , pp. 303-330
    • Meignier, S.1    Moraru, D.2    Fredouille, C.3    Bonastre, J.-F.4    Besacier, L.5
  • 11
    • 0000120766 scopus 로고
    • Estimating the dimension of a model
    • Mar.
    • G. Schwarz, "Estimating the dimension of a model, " Ann. Statist., vol. 6, no. 2, pp. 461-464, Mar. 1978.
    • (1978) Ann. Statist. , vol.6 , Issue.2 , pp. 461-464
    • Schwarz, G.1
  • 15
    • 44849109123 scopus 로고    scopus 로고
    • Arobust stopping criterion for agglomerative hierarchical clustering in a speaker diarization system
    • Aug
    • K. J. Han and S. S. Narayanan, "Arobust stopping criterion for agglomerative hierarchical clustering in a speaker diarization system, " Proc. Interspeech 2007-Eurospeech, pp. 1853-1856, Aug. 2007.
    • (2007) Proc. Interspeech 2007-Eurospeech , pp. 1853-1856
    • Han, K.J.1    Narayanan, S.S.2
  • 16
    • 85119434191 scopus 로고    scopus 로고
    • Fast speaker change detection for broadcast news transcription and indexing
    • Sep
    • D. Liu and F. Kubala, "Fast speaker change detection for broadcast news transcription and indexing, " in Proc. 6th Eur. Conf. Speech Commun. Technol., Sep. 1999, pp. 1031-1034.
    • (1999) Proc. 6th Eur. Conf. Speech Commun. Technol. , pp. 1031-1034
    • Liu, D.1    Kubala, F.2
  • 17
    • 85009109772 scopus 로고    scopus 로고
    • A fast, accurate and stream-based speaker segmentation and clustering algorithm
    • Sep.
    • A. Vandecatseye and J.-P. Martens, "A fast, accurate and stream-based speaker segmentation and clustering algorithm, " Proc. Interspeech 2003-Eurospeech, pp. 941-944, Sep. 2003.
    • (2003) Proc. Interspeech 2003-Eurospeech , pp. 941-944
    • Vandecatseye, A.1    Martens, J.-P.2
  • 21
    • 0029209272 scopus 로고
    • Robust text-independent speaker identification using Gaussian mixture models
    • Jan.
    • D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture models, " IEEE Trans. Speech Audio Process., vol. 3, no. 1, pp. 72-83, Jan. 1995.
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.1 , pp. 72-83
    • Reynolds, D.A.1    Rose, R.C.2
  • 22
    • 0029355999 scopus 로고
    • Speaker identification and verification using Gaussian mixture speaker models
    • Aug
    • D. A. Reynolds, "Speaker identification and verification using Gaussian mixture speaker models, " Speech Commun., vol. 17, no. 1-2, pp. 91-108, Aug. 1995.
    • (1995) Speech Commun. , vol.17 , Issue.1-2 , pp. 91-108
    • Reynolds, D.A.1
  • 23
    • 0033884858 scopus 로고    scopus 로고
    • Speaker verification using adapted Gaussian mixture models
    • July
    • D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, "Speaker verification using adapted Gaussian mixture models, " Digital Signal Process., vol. 10, no. 1-3, pp. 19-41, July 2000.
    • (2000) Digital Signal Process. , vol.10 , Issue.1-3 , pp. 19-41
    • Reynolds, D.A.1    Quatieri, T.F.2    Dunn, R.B.3
  • 24
    • 34547535369 scopus 로고    scopus 로고
    • Real-time monitoring of participants interaction in a meeting using audio-visual sensors
    • Apr. vol
    • C. Busso, P. G. Georgiou, and S. S. Narayanan, "Real-time monitoring of participants interaction in a meeting using audio-visual sensors, " in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Apr. 2007, vol. 2, pp. 685-688.
    • (2007) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , vol.2 , pp. 685-688
    • Busso, C.1    Georgiou, P.G.2    Narayanan, S.S.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.