



Volume , Issue , 2007, Pages 262-267

Robust speaker clustering strategies to data source variation for improved speaker diarization

Author keywords

Agglomerative hierarchical clustering (AHC); Clustering error rate (CER); Data source variation; Speaker diarization

Indexed keywords

HIERARCHICAL SYSTEMS; SPEECH ANALYSIS;

EID: 44849129300     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/asru.2007.4430121     Document Type: Conference Paper
Times cited : (11)

References (12)
  • 1
    • 33646380923
    • Approaches and applications of audio diarization
    • March
    • D. A. Reynolds and P. A. Torres-Carrasquillo, "Approaches and applications of audio diarization," Proc. ICASSP 2005, vol. 5, pp. 953-956, March 2005.
    • (2005) Proc. ICASSP 2005 , vol.5 , pp. 953-956
    • Reynolds, D.A.1    Torres-Carrasquillo, P.A.2
  • 2
    • 34047261805
    • An overview of automatic speaker diarization systems
    • S. E. Tranter and D. A. Reynolds, "An overview of automatic speaker diarization systems," IEEE Trans. Audio, Speech, and Language Processing, vol. 14(5), pp. 1557-1565, Sept. 2006.
  • 5
  • 6
    • 33745560829
    • Robust speaker segmentation for meetings: The ICSI-SRI spring 2005 diarization system
    • July
    • X. Anguera, C. Wooters, B. Peskin, and M. Aguilo, "Robust speaker segmentation for meetings: The ICSI-SRI spring 2005 diarization system," Proc. MLMI2005, pp. 402-414, July 2005.
    • (2005) Proc. MLMI2005 , pp. 402-414
    • Anguera, X.1    Wooters, C.2    Peskin, B.3    Aguilo, M.4
  • 8
    • 34047264090
    • The MIT Lincoln laboratory RT-04F diarization systems: Applications to broadcast news and telephone conversations
    • Nov
    • D. A. Reynolds and P. A. Torres-Carrasquillo, "The MIT Lincoln laboratory RT-04F diarization systems: Applications to broadcast news and telephone conversations," Proc. Fall 2004 Rich Transcription Workshop, Nov. 2004.
    • (2004) Proc. Fall 2004 Rich Transcription Workshop
    • Reynolds, D.A.1    Torres-Carrasquillo, P.A.2
  • 9
    • 0000120766
    • Estimating the dimension of a model
    • March
    • G. Schwarz, "Estimating the dimension of a model," The Annals of Statistics, vol. 6(2), pp. 461-464, March 1978.
    • (1978) The Annals of Statistics , vol.6 , Issue.2 , pp. 461-464
    • Schwarz, G.1
  • 10
    • 0002595416
    • Speaker, environment and channel change detection and clustering via the Bayesian information criterion
    • Feb
    • S. S. Chen and P. S. Gopalakrishnan, "Speaker, environment and channel change detection and clustering via the Bayesian information criterion," Proc. DARPA BNTU Workshop, pp. 127-132, Feb. 1998.
    • (1998) Proc. DARPA BNTU Workshop , pp. 127-132
    • Chen, S.S.1    Gopalakrishnan, P.S.2
  • 11
    • 0026400244
    • Segregation of speakers for speech recognition and speaker identification
    • May
    • H. Gish, M. Siu, and R. Rohlicek, "Segregation of speakers for speech recognition and speaker identification," Proc. ICASSP 1991, pp. 873-876, May 1991.
    • (1991) Proc. ICASSP 1991 , pp. 873-876
    • Gish, H.1    Siu, M.2    Rohlicek, R.3
  • 12
    • 44849109123
    • A robust stopping criterion for agglomerative hierarchical clustering in a speaker diarization system
    • Aug
    • K. J. Han and S. S. Narayanan, "A robust stopping criterion for agglomerative hierarchical clustering in a speaker diarization system," Proc. INTERSPEECH 2007, pp. 1853-1856, Aug. 2007.
    • (2007) Proc. INTERSPEECH 2007 , pp. 1853-1856
    • Han, K.J.1    Narayanan, S.S.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.