SCOPUS 정보 검색 플랫폼

Odyssey 2014: Speaker and Language Recognition Workshop

Volumn , Issue , 2014, Pages 254-259

Hierarchical speaker clustering methods for the NIST i-vector challenge

(4) Khoury, Elie a Shafey, Laurent El a Ferras, Marc a Marcel, Sebastien a

a IDIAP RESEARCH INSTITUTE (Switzerland)

Author keywords

[No Author keywords available]

Indexed keywords

CLUSTERING ALGORITHMS; COST FUNCTIONS; DISCRIMINANT ANALYSIS; VECTORS;

CASCADE CLUSTERING; COMPENSATION TECHNIQUES; LOGISTIC REGRESSIONS; OBJECTIVE FUNCTIONS; PRIVACY AND SECURITY; PROBABILISTIC LINEAR DISCRIMINANT ANALYSIS; SPEAKER CLUSTERING; SPEAKER RECOGNITION;

SPEECH RECOGNITION;

EID: 84921375866 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (31)

References (32)

1
- 79951609039
- Front-end factor analysis for speaker verification
- N. Dehak, P. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet, "Front-end factor analysis for speaker verification," IEEE Trans. on Audio, Speech, and Language Processing, vol. 19, no. 4, pp. 788-798, 2011.
- (2011) IEEE Trans. On Audio, Speech, and Language Processing , vol.19 , Issue.4 , pp. 788-798
- Dehak, N.¹ Kenny, P.² Dehak, R.³ Dumouchel, P.⁴ Ouellet, P.⁵

2
- 84898068800
- I4U submission to NIST SRE 2012: A large-scale collaborative effort for noise-robust speaker verification
- R. Saedi, K. A. Lee, T. Kinnunen, et al., "I4U submission to NIST SRE 2012: a large-scale collaborative effort for noise-robust speaker verification," in INTERSPEECH, 2013.
- (2013) Interspeech
- Saedi, R.¹ Lee, K.A.² Kinnunen, T.³

3
- 50649094277
- Probabilistic linear discriminant analysis for inferences about identity
- S. J. D. Prince and J. H. Elder, "Probabilistic linear discriminant analysis for inferences about identity," in IEEE Intl. Conf. on Computer Vision (ICCV), 2007, pp. 1-8.
- (2007) IEEE Intl. Conf. On Computer Vision (ICCV) , pp. 1-8
- Prince, S.J.D.¹ Elder, J.H.²

4
- 82955196715
- Speaker diarization using PLDA-based speaker clustering
- J. Prazak and J. Silovsky, "Speaker diarization using PLDA-based speaker clustering," in IEEE 6th Intl. Conf. on Intelligent Data Acquisition and Advanced Computing Systems (IDAACS), 2011, vol. 1, pp. 347-350.
- (2011) IEEE 6th Intl. Conf. On Intelligent Data Acquisition and Advanced Computing Systems (IDAACS) , vol.1 , pp. 347-350
- Prazak, J.¹ Silovsky, J.²

5
- 85073194318
- A global optimization framework for speaker diarization
- M. Rouvier and S. Meignier, "A global optimization framework for speaker diarization," in Odyssey: The Speaker and Language Recognition Workshop, 2012.
- (2012) Odyssey: The Speaker and Language Recognition Workshop
- Rouvier, M.¹ Meignier, S.²

6
- 84944178665
- Hierarchical grouping to optimize an objective function
- J. H. Ward, "Hierarchical grouping to optimize an objective function," Journal of the American Statistical Association, vol. 58, no. 301, pp. 236-244, 1963.
- (1963) Journal of the American Statistical Association , vol.58 , Issue.301 , pp. 236-244
- Ward, J.H.¹

7
- 84874227906
- Speaker diarization and linking of large corpora
- Dec
- M. Ferras and H. Bourlard, "Speaker diarization and linking of large corpora," in IEEE Spoken Language Technology Workshop (SLT), Dec 2012, pp. 280-285.
- (2012) IEEE Spoken Language Technology Workshop (SLT) , pp. 280-285
- Ferras, M.¹ Bourlard, H.²

8
- 0033884858
- Speaker verification using adapted Gaussian mixture models
- D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, "Speaker verification using adapted Gaussian mixture models," Digital Signal Processing, vol. 10, pp. 19-41, 2000.
- (2000) Digital Signal Processing , vol.10 , pp. 19-41
- Reynolds, D.A.¹ Quatieri, T.F.² Dunn, R.B.³

9
- 84858973723
- Bayesian speaker verification with heavytailed priors
- P. Kenny, "Bayesian speaker verification with heavytailed priors," in Odyssey: The Speaker and Language Recognition Workshop, 2010.
- (2010) Odyssey: The Speaker and Language Recognition Workshop
- Kenny, P.¹

10
- 84865749202
- Towards fully Bayesian speaker recognition: Integrating out the between-speaker covariance
- J. A. Villalba and N. Brümmer, "Towards fully Bayesian speaker recognition: Integrating out the between-speaker covariance," in INTERSPEECH, 2011, pp. 505-508.
- (2011) Interspeech , pp. 505-508
- Villalba, J.A.¹ Brümmer, N.²

11
- 80051621424
- Fullcovariance UBM and heavy-tailed PLDA in i-vector speaker verification
- P. Matejka, O. Glembek, F. Castaldo, M.J. Alam, O. Plchot, P. Kenny, L. Burget, and J. Cernocky, "Fullcovariance UBM and heavy-tailed PLDA in i-vector speaker verification," in IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2011.
- (2011) IEEE Intl. Conf. On Acoustics, Speech and Signal Processing (ICASSP)
- Matejka, P.¹ Glembek, O.² Castaldo, F.³ Alam, M.J.⁴ Plchot, O.⁵ Kenny, P.⁶ Burget, L.⁷ Cernocky, J.⁸

12
- 84865733857
- Analysis of i-vector length normalization in speaker recognition systems
- D. Garcia-Romero and C.Y. Espy-Wilson, "Analysis of i-vector length normalization in speaker recognition systems," in INTERSPEECH, 2011, pp. 249-252.
- (2011) Interspeech , pp. 249-252
- Garcia-Romero, D.¹ Espy-Wilson, C.Y.²

13
- 84889921364
- Tech. Rep., Agnitio Labs
- N. Brümmer, "Bayesian PLDA," Tech. Rep., Agnitio Labs, 2010.
- (2010) Bayesian PLDA
- Brümmer, N.¹

14
- 84878119221
- A scalable formulation of probabilistic linear discriminant analysis
- L. El Shafey, C. McCool, R. Wallace, and S. Marcel, "A scalable formulation of probabilistic linear discriminant analysis," IEEE Trans. in Pattern Analysis and Machine Intelligence (TPAMI), vol. 35, no. 7, pp. 1788-1794, 2013.
- (2013) IEEE Trans. In Pattern Analysis and Machine Intelligence (TPAMI) , vol.35 , Issue.7 , pp. 1788-1794
- El Shafey, L.¹ McCool, C.² Wallace, R.³ Marcel, S.⁴

15
- 84878413073
- PLDA modeling in i-vector and supervector space for speaker verification
- Y. Jiang, K.-A. Lee, Z. Tang, B. Ma, A. Larcher, and H. Li, "PLDA modeling in i-vector and supervector space for speaker verification.," in INTERSPEECH, 2012.
- (2012) Interspeech
- Jiang, Y.¹ Lee, K.-A.² Tang, Z.³ Ma, B.⁴ Larcher, A.⁵ Li, H.⁶

16
- 0033902487
- Applying logistic regression to the fusion of the NIST'99 1-speaker submissions
- S. Pigeon, P. Druyts, and P. Verlinde, "Applying logistic regression to the fusion of the NIST'99 1-speaker submissions," Digital Signal Processing, vol. 10, no. 1-3, pp. 237-248, 2000.
- (2000) Digital Signal Processing , vol.10 , Issue.1-3 , pp. 237-248
- Pigeon, S.¹ Druyts, P.² Verlinde, P.³

17
- 33646796027
- Clustering speech utterances by speaker using eigenvoice-motivated vector space models
- W.-H. Tsai, S.-S. Cheng, Y.-H. Chao, and Wang H.- M., "Clustering speech utterances by speaker using eigenvoice-motivated vector space models," in IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2005, vol. 1, pp. 725-728.
- (2005) IEEE Intl. Conf. On Acoustics, Speech, and Signal Processing (ICASSP) , vol.1 , pp. 725-728
- Tsai, W.-H.¹ Cheng, S.-S.² Chao, Y.-H.³ Wang, H.-M.⁴

18
- 34547516256
- Speaker diarization: Towards a more robust and portable system
- E. Khoury, C. Sénac, and R. Andre-Obrecht, "Speaker diarization: Towards a more robust and portable system," in IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2007, vol. 4, pp. IV-489-IV-492.
- (2007) IEEE Intl. Conf. On Acoustics, Speech and Signal Processing (ICASSP) , vol.4 , pp. IV489-IV492
- Khoury, E.¹ Sénac, C.² Andre-Obrecht, R.³

19
- 84865752702
- Exploiting intra-conversation variability for speaker diarization
- S. Shum, N. Dehak, E. Chuangsuwanich, D. A. Reynolds, and J. R. Glass, "Exploiting intra-conversation variability for speaker diarization," in INTERSPEECH, 2011.
- (2011) Interspeech
- Shum, S.¹ Dehak, N.² Chuangsuwanich, E.³ Reynolds, D.A.⁴ Glass, J.R.⁵

20
- 84878381961
- On the use of spectral and iterative methods for speaker diarization
- S. Shum, N. Dehak, and J. Glass, "On the use of spectral and iterative methods for speaker diarization," in INTERSPEECH, 2012.
- (2012) Interspeech
- Shum, S.¹ Dehak, N.² Glass, J.³

21
- 84890467239
- Efficient iterative mean shift based cosine dissimilarity for multi-recording speaker clustering
- M. Senoussaoui, P. Kenny, P. Dumouchel, and T. Stafylakis, "Efficient iterative mean shift based cosine dissimilarity for multi-recording speaker clustering," in IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2013, pp. 7712-7715.
- (2013) IEEE Intl. Conf. On Acoustics, Speech and Signal Processing (ICASSP) , pp. 7712-7715
- Senoussaoui, M.¹ Kenny, P.² Dumouchel, P.³ Stafylakis, T.⁴

22
- 84897931702
- A study of the cosine distance-based mean shift for telephone speech diarization
- M. Senoussaoui, P. Kenny, T. Stafylakis, and P. Dumouchel, "A study of the cosine distance-based mean shift for telephone speech diarization," IEEE Trans. on Audio, Speech, and Language Processing (TASLP), vol. 22, no. 1, pp. 217-227, 2014.
- (2014) IEEE Trans. On Audio, Speech, and Language Processing (TASLP) , vol.22 , Issue.1 , pp. 217-227
- Senoussaoui, M.¹ Kenny, P.² Stafylakis, T.³ Dumouchel, P.⁴

23
- 0036565814
- Mean shift: A robust approach toward feature space analysis
- D. Comaniciu and P. Meer, "Mean shift: a robust approach toward feature space analysis," Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 24, no. 5, pp. 603-619, 2002.
- (2002) Pattern Analysis and Machine Intelligence, IEEE Transactions on , vol.24 , Issue.5 , pp. 603-619
- Comaniciu, D.¹ Meer, P.²

24
- 84881068970
- Unsupervised methods for speaker diarization: An integrated and iterative approach
- S.H. Shum, N. Dehak, R. Dehak, and J.R. Glass, "Unsupervised methods for speaker diarization: An integrated and iterative approach," IEEE Trans. on Audio, Speech, and Language Processing (TASLP), vol. 21, no. 10, pp. 2015-2028, 2013.
- (2013) IEEE Trans. On Audio, Speech, and Language Processing (TASLP) , vol.21 , Issue.10 , pp. 2015-2028
- Shum, S.H.¹ Dehak, N.² Dehak, R.³ Glass, J.R.⁴

25
- 0002595416
- Speaker, environment and channel change detection and clustering via the Bayesian Information Criterion
- S. Chen and P. Gopalakrishnan, "Speaker, environment and channel change detection and clustering via the Bayesian Information Criterion," in DARPA Broadcast News Transcription and Understanding Workshop, 1998.
- (1998) DARPA Broadcast News Transcription and Understanding Workshop
- Chen, S.¹ Gopalakrishnan, P.²

26
- 85073158100
- Effect of multicondition training on i-vector PLDA configurations for speaker recognition
- P. Rajan, T. Kinnunen, and V. Hautamaki, "Effect of multicondition training on i-vector PLDA configurations for speaker recognition," in INTERSPEECH, 2012.
- (2012) Interspeech
- Rajan, P.¹ Kinnunen, T.² Hautamaki, V.³

27
- 0003126317
- A general theory of classificatory sorting strategies. 1. hierarchical systems
- G. N. Lance and W. T. Williams, "A general theory of classificatory sorting strategies. 1. hierarchical systems," Computer Journal, vol. 9, pp. 373-380, 1967.
- (1967) Computer Journal , vol.9 , pp. 373-380
- Lance, G.N.¹ Williams, W.T.²

28
- 77958525204
- A similarity measure for clustering and its applications
- G. J. Torres, R. B. Basnet, A. H. Sung, S. Mukkamala, and B. Ribeiro, "A similarity measure for clustering and its applications," Intl. Journ. of Electrical, Computer, and Systems Engineering, vol. 3, no. 3, 2009.
- (2009) Intl. Journ. Of Electrical, Computer, and Systems Engineering , vol.3 , Issue.3
- Torres, G.J.¹ Basnet, R.B.² Sung, A.H.³ Mukkamala, S.⁴ Ribeiro, B.⁵

29
- 0002719797
- The Hungarian method for the assignment problem
- H. W. Kuhn, "The Hungarian method for the assignment problem," Naval Research Logistics Quarterly, 1955.
- (1955) Naval Research Logistics Quarterly
- Kuhn, H.W.¹

30
- 84895063162
- Audiovisual diarization of people in video content
- E. Khoury, C. Sénac, and P. Joly, "Audiovisual diarization of people in video content," Multimedia Tools and Applications, vol. 68, no. 3, pp. 747-775, 2012.
- (2012) Multimedia Tools and Applications , vol.68 , Issue.3 , pp. 747-775
- Khoury, E.¹ Sénac, C.² Joly, P.³

31
- 84905215309
- Spear: An open source toolbox for speaker recognition based on Bob
- E. Khoury, L. El Shafey, and S. Marcel, "Spear: An open source toolbox for speaker recognition based on Bob," in IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2014.
- (2014) IEEE Intl. Conf. On Acoustics, Speech and Signal Processing (ICASSP)
- Khoury, E.¹ El Shafey, L.² Marcel, S.³

32
- 84871393354
- Bob: A free signal processing and machine learning toolbox for researchers
- A. Anjos, L. El Shafey, R. Wallace, M. Günther, C. Mc- Cool, and S. Marcel, "Bob: a free signal processing and machine learning toolbox for researchers," in 20th ACM Intl. Conf. on Multimedia (ACMMM), 2012.
- (2012) 20th ACM Intl. Conf. On Multimedia (ACMMM)
- Anjos, A.¹ El Shafey, L.² Wallace, R.³ Günther, M.⁴ Mc-Cool, C.⁵ Marcel, S.⁶

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.