SCOPUS Information Retrieval Platform

IEEE Transactions on Audio, Speech and Language Processing

Volume 16, Issue 8, 2008, Pages 1590-1601

Strategies to improve the robustness of agglomerative hierarchical clustering under data source variation for speaker diarization

(3) Han, Kyu J. a; Kim, Samuel a; Narayanan, Shrikanth S. a

a University of Southern California ^* (United States)

Author keywords

Agglomerative hierarchical clustering (ahc); Bayesian information criterion (bic); Generalized likelihood ratio (glr); Information change rate (icr); Selective agglomerative hierarchical clustering (sahc); Speaker diarization

Indexed keywords

AGGLOMERATIVE HIERARCHICAL CLUSTERING (AHC); BAYESIAN INFORMATION CRITERION (BIC); GENERALIZED LIKELIHOOD RATIO (GLR); INFORMATION CHANGE RATE (ICR); SELECTIVE AGGLOMERATIVE HIERARCHICAL CLUSTERING (SAHC); SPEAKER DIARIZATION;

BAYESIAN NETWORKS; HIERARCHICAL SYSTEMS; MAXIMUM LIKELIHOOD ESTIMATION; MERGING; SPEECH ANALYSIS; VIBRATION CONTROL;

CLUSTER ANALYSIS;

EID: 70350572462 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2008.2002085 Document Type: Article

Times cited : (65)

References (24)

1
- 34047261805
- An overview of automatic speaker diarization systems
- S. E. Tranter and D. A. Reynolds, "An overview of automatic speaker diarization systems," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1557-1565, Sep. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.5 , pp. 1557-1565
- Tranter, S.E.¹ Reynolds, D.A.²

2
- 33748523736
- Benchmark Tests: Rich Transcription, National Institute of Standards and Technology (NIST). [Online]. Available: http://www.nist.gov/speech/tests/rt/.
- Benchmark Tests: Rich Transcription

3
- 33646380923
- Approaches and applications of audio diarization
- D. A. Reynolds and P. A. Torres-Carrasquillo, "Approaches and applications of audio diarization," in Proc. 2005 IEEE Int. Conf. Acoust., Speech, Signal Process., Mar. 2005, vol. 5, pp. 953-956.
- (2005) Proc. 2005 IEEE Int. Conf. Acoust., Speech, Signal Process. , vol.5 , pp. 953-956
- Reynolds, D.A.¹ Torres-Carrasquillo, P.A.²

4
- 0003922190
- R. O. Duda, P. E. Hart, and D. G. Stork, Pattern Classification, 2nd ed. New York: Wiley, 2001.
- (2001) Pattern Classification
- Duda, R.O.¹ Hart, P.E.² Stork, D.G.³

5
- 34047264090
- The MIT Lincoln Laboratory RT-04F diarization systems: Applications to broadcast news and telephone conversations
- D. A. Reynolds and P. A. Torres-Carrasquillo, "The MIT Lincoln Laboratory RT-04F diarization systems: Applications to broadcast news and telephone conversations," in Proc. Fall 2004 Rich Transcription Workshop (RT-04), Nov. 2004, CD-ROM.
- (2004) Proc. Fall 2004 Rich Transcription Workshop (RT-04)
- Reynolds, D.A.¹ Torres-Carrasquillo, P.A.²

6
- 33745200276
- The Cambridge University March 2005 speaker diarisation system
- R. Sinha, S. E. Tranter, M. J. F. Gales, and P. C. Woodland, "The Cambridge University March 2005 speaker diarisation system," in Proc. 9th Eur. Conf. Speech Commun. Technol., Mar. 2005, pp. 2437-2440.
- (2005) Proc. 9th Eur. Conf. Speech Commun. Technol. , pp. 2437-2440
- Sinha, R.¹ Tranter, S.E.² Gales, M.J.F.³ Woodland, P.C.⁴

7
- 34548370664
- Robust speaker diarization for meetings: ICSI RT06S meetings evaluation system
- X. Anguera, C. Wooters, B. Peskin, and M. Aguilo, "Robust speaker diarization for meetings: ICSI RT06S meetings evaluation system," in Proc. 3rd Joint Workshop Multimodal Interaction and Rel. Mach. Learn. Algorithms, May 2006, pp. 346-358.
- (2006) Proc. 3rd Joint Workshop Multimodal Interaction and Rel. Mach. Learn. Algorithms , pp. 346-358
- Anguera, X.¹ Wooters, C.² Peskin, B.³ Aguilo, M.⁴

8
- 77249176190
- The AMI speaker diarization system for NIST RT06S meeting data
- D. A. van Leeuwen and M. Huijbregts, "The AMI speaker diarization system for NIST RT06S meeting data," in Proc. 3rd Joint Workshop Multimodal Interaction and Rel. Mach. Learn. Algorithms, May 2006, pp. 371-384.
- (2006) Proc. 3rd Joint Workshop Multimodal Interaction and Rel. Mach. Learn. Algorithms , pp. 371-384
- Leeuwen, D.A.V.¹ Huijbregts, M.²

9
- 29044442235
- Step-by-step and integrated approaches in broadcast news speaker diarization
- S. Meignier, D. Moraru, C. Fredouille, J.-F. Bonastre, and L. Besacier, "Step-by-step and integrated approaches in broadcast news speaker diarization," Comput. Speech Lang., vol. 20, no. 2-3, pp. 303-330, Jul. 2006.
- (2006) Comput. Speech Lang. , vol.20 , Issue.2-3 , pp. 303-330
- Meignier, S.¹ Moraru, D.² Fredouille, C.³ Bonastre, J.-F.⁴ Besacier, L.⁵

10
- 34047266609
- Multistage speaker diarization of broadcast news
- C. Barras, X. Zhu, S. Meignier, and J.-L. Gauvain, "Multistage speaker diarization of broadcast news," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1505-1512, Sep. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.5 , pp. 1505-1512
- Barras, C.¹ Zhu, X.² Meignier, S.³ Gauvain, J.-L.⁴

11
- 0000120766
- Estimating the dimension of a model
- G. Schwarz, "Estimating the dimension of a model," Ann. Statist., vol. 6, no. 2, pp. 461-464, Mar. 1978.
- (1978) Ann. Statist. , vol.6 , Issue.2 , pp. 461-464
- Schwarz, G.¹

12
- 0002595416
- Speaker, environment and channel change detection and clustering via the Bayesian information criterion
- S. S. Chen and P. S. Gopalakrishnan, "Speaker, environment and channel change detection and clustering via the Bayesian information criterion," in Proc. DARPA Broadcast News Transcription and Understanding Workshop, Feb. 1998, pp. 127-132.
- (1998) Proc. DARPA Broadcast News Transcription and Understanding Workshop , pp. 127-132
- Chen, S.S.¹ Gopalakrishnan, P.S.²

13
- 0026400244
- Segregation of speakers for speech recognition and speaker identification
- H. Gish, M.-H. Siu, and R. Rohlicek, "Segregation of speakers for speech recognition and speaker identification," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., May 1991, vol. 2, pp. 873-876.
- (1991) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , vol.2 , pp. 873-876
- Gish, H.¹ Siu, M.-H.² Rohlicek, R.³

14
- 44849129300
- Robust speaker clustering strategies to data source variation for improved speaker diarization
- K. J. Han, S. Kim, and S. S. Narayanan, "Robust speaker clustering strategies to data source variation for improved speaker diarization," in Proc. IEEE Autom. Speech Recognition and Understanding Workshop, Dec. 2007, pp. 262-267.
- (2007) Proc. IEEE Autom. Speech Recognition and Understanding Workshop , pp. 262-267
- Han, K.J.¹ Kim, S.² Narayanan, S.S.³

15
- 44849109123
- A robust stopping criterion for agglomerative hierarchical clustering in a speaker diarization system
- K. J. Han and S. S. Narayanan, "A robust stopping criterion for agglomerative hierarchical clustering in a speaker diarization system," Proc. Interspeech 2007-Eurospeech, pp. 1853-1856, Aug. 2007.
- (2007) Proc. Interspeech 2007-Eurospeech , pp. 1853-1856
- Han, K.J.¹ Narayanan, S.S.²

16
- 85119434191
- Fast speaker change detection for broadcast news transcription and indexing
- D. Liu and F. Kubala, "Fast speaker change detection for broadcast news transcription and indexing," in Proc. 6th Eur. Conf. Speech Commun. Technol., Sep. 1999, pp. 1031-1034.
- (1999) Proc. 6th Eur. Conf. Speech Commun. Technol. , pp. 1031-1034
- Liu, D.¹ Kubala, F.²

17
- 85009109772
- A fast, accurate and stream-based speaker segmentation and clustering algorithm
- A. Vandecatseye and J.-P. Martens, "A fast, accurate and stream-based speaker segmentation and clustering algorithm," Proc. Interspeech 2003-Eurospeech, pp. 941-944, Sep. 2003.
- (2003) Proc. Interspeech 2003-Eurospeech , pp. 941-944
- Vandecatseye, A.¹ Martens, J.-P.²

18
- 84889281816
- T. M. Cover and J. A. Thomas, Elements of Information Theory. New York: Wiley, 1991.
- (1991) Elements of Information Theory
- Cover, T.M.¹ Thomas, J.A.²

19
- 84946742526
- A robust speaker clustering algorithm
- J. Ajmera and C. Wooters, "A robust speaker clustering algorithm," in Proc. IEEE Automatic Speech Recognition and Understanding Workshop, Nov. 2003, pp. 411-416.
- (2003) Proc. IEEE Automatic Speech Recognition and Understanding Workshop , pp. 411-416
- Ajmera, J.¹ Wooters, C.²

20
- 3543144948
- Robust speaker change detection
- J. Ajmera, I. McCowan, and H. Bourlard, "Robust speaker change detection," IEEE Signal Process. Lett., vol. 11, no. 8, pp. 649-651, Aug. 2004.
- (2004) IEEE Signal Process. Lett. , vol.11 , Issue.8 , pp. 649-651
- Ajmera, J.¹ McCowan, I.² Bourlard, H.³

21
- 0029209272
- Robust text-independent speaker identification using Gaussian mixture models
- D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture models," IEEE Trans. Speech Audio Process., vol. 3, no. 1, pp. 72-83, Jan. 1995.
- (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.1 , pp. 72-83
- Reynolds, D.A.¹ Rose, R.C.²

22
- 0029355999
- Speaker identification and verification using Gaussian mixture speaker models
- D. A. Reynolds, "Speaker identification and verification using Gaussian mixture speaker models," Speech Commun., vol. 17, no. 1-2, pp. 91-108, Aug. 1995.
- (1995) Speech Commun. , vol.17 , Issue.1-2 , pp. 91-108
- Reynolds, D.A.¹

23
- 0033884858
- Speaker verification using adapted Gaussian mixture models
- D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, "Speaker verification using adapted Gaussian mixture models," Digital Signal Process., vol. 10, no. 1-3, pp. 19-41, July 2000.
- (2000) Digital Signal Process. , vol.10 , Issue.1-3 , pp. 19-41
- Reynolds, D.A.¹ Quatieri, T.F.² Dunn, R.B.³

24
- 34547535369
- Real-time monitoring of participants interaction in a meeting using audio-visual sensors
- C. Busso, P. G. Georgiou, and S. S. Narayanan, "Real-time monitoring of participants interaction in a meeting using audio-visual sensors," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Apr. 2007, vol. 2, pp. 685-688.
- (2007) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , vol.2 , pp. 685-688
- Busso, C.¹ Georgiou, P.G.² Narayanan, S.S.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.