-
1
-
-
79951609039
-
Front-end factor analysis for speaker verification
-
N. Dehak, P. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet, "Front-end factor analysis for speaker verification," IEEE Trans. on Audio, Speech, and Language Processing, vol. 19, no. 4, pp. 788-798, 2011.
-
(2011)
IEEE Trans. on Audio, Speech, and Language Processing
, vol.19
, Issue.4
, pp. 788-798
-
-
Dehak, N.1
Kenny, P.2
Dehak, R.3
Dumouchel, P.4
Ouellet, P.5
-
2
-
-
84898068800
-
I4U submission to NIST SRE 2012: A large-scale collaborative effort for noise-robust speaker verification
-
R. Saeidi, K. A. Lee, T. Kinnunen, et al., "I4U submission to NIST SRE 2012: a large-scale collaborative effort for noise-robust speaker verification," in INTERSPEECH, 2013.
-
(2013)
Interspeech
-
-
Saeidi, R.1
Lee, K.A.2
Kinnunen, T.3
-
6
-
-
84944178665
-
Hierarchical grouping to optimize an objective function
-
J. H. Ward, "Hierarchical grouping to optimize an objective function," Journal of the American Statistical Association, vol. 58, no. 301, pp. 236-244, 1963.
-
(1963)
Journal of the American Statistical Association
, vol.58
, Issue.301
, pp. 236-244
-
-
Ward, J.H.1
-
8
-
-
0033884858
-
Speaker verification using adapted Gaussian mixture models
-
D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, "Speaker verification using adapted Gaussian mixture models," Digital Signal Processing, vol. 10, pp. 19-41, 2000.
-
(2000)
Digital Signal Processing
, vol.10
, pp. 19-41
-
-
Reynolds, D.A.1
Quatieri, T.F.2
Dunn, R.B.3
-
10
-
-
84865749202
-
Towards fully Bayesian speaker recognition: Integrating out the between-speaker covariance
-
J. A. Villalba and N. Brümmer, "Towards fully Bayesian speaker recognition: Integrating out the between-speaker covariance," in INTERSPEECH, 2011, pp. 505-508.
-
(2011)
Interspeech
, pp. 505-508
-
-
Villalba, J.A.1
Brümmer, N.2
-
11
-
-
80051621424
-
Full-covariance UBM and heavy-tailed PLDA in i-vector speaker verification
-
P. Matejka, O. Glembek, F. Castaldo, M.J. Alam, O. Plchot, P. Kenny, L. Burget, and J. Cernocky, "Full-covariance UBM and heavy-tailed PLDA in i-vector speaker verification," in IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2011.
-
(2011)
IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP)
-
-
Matejka, P.1
Glembek, O.2
Castaldo, F.3
Alam, M.J.4
Plchot, O.5
Kenny, P.6
Burget, L.7
Cernocky, J.8
-
12
-
-
84865733857
-
Analysis of i-vector length normalization in speaker recognition systems
-
D. Garcia-Romero and C.Y. Espy-Wilson, "Analysis of i-vector length normalization in speaker recognition systems," in INTERSPEECH, 2011, pp. 249-252.
-
(2011)
Interspeech
, pp. 249-252
-
-
Garcia-Romero, D.1
Espy-Wilson, C.Y.2
-
13
-
-
84889921364
-
-
Bayesian PLDA
-
N. Brümmer, "Bayesian PLDA," Tech. Rep., Agnitio Labs, 2010.
-
(2010)
Tech. Rep., Agnitio Labs
-
-
Brümmer, N.1
-
14
-
-
84878119221
-
A scalable formulation of probabilistic linear discriminant analysis
-
L. El Shafey, C. McCool, R. Wallace, and S. Marcel, "A scalable formulation of probabilistic linear discriminant analysis," IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), vol. 35, no. 7, pp. 1788-1794, 2013.
-
(2013)
IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI)
, vol.35
, Issue.7
, pp. 1788-1794
-
-
El Shafey, L.1
McCool, C.2
Wallace, R.3
Marcel, S.4
-
15
-
-
84878413073
-
PLDA modeling in i-vector and supervector space for speaker verification
-
Y. Jiang, K.-A. Lee, Z. Tang, B. Ma, A. Larcher, and H. Li, "PLDA modeling in i-vector and supervector space for speaker verification," in INTERSPEECH, 2012.
-
(2012)
Interspeech
-
-
Jiang, Y.1
Lee, K.-A.2
Tang, Z.3
Ma, B.4
Larcher, A.5
Li, H.6
-
16
-
-
0033902487
-
Applying logistic regression to the fusion of the NIST'99 1-speaker submissions
-
S. Pigeon, P. Druyts, and P. Verlinde, "Applying logistic regression to the fusion of the NIST'99 1-speaker submissions," Digital Signal Processing, vol. 10, no. 1-3, pp. 237-248, 2000.
-
(2000)
Digital Signal Processing
, vol.10
, Issue.1-3
, pp. 237-248
-
-
Pigeon, S.1
Druyts, P.2
Verlinde, P.3
-
17
-
-
33646796027
-
Clustering speech utterances by speaker using eigenvoice-motivated vector space models
-
W.-H. Tsai, S.-S. Cheng, Y.-H. Chao, and H.-M. Wang, "Clustering speech utterances by speaker using eigenvoice-motivated vector space models," in IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2005, vol. 1, pp. 725-728.
-
(2005)
IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
, vol.1
, pp. 725-728
-
-
Tsai, W.-H.1
Cheng, S.-S.2
Chao, Y.-H.3
Wang, H.-M.4
-
18
-
-
34547516256
-
Speaker diarization: Towards a more robust and portable system
-
E. Khoury, C. Sénac, and R. Andre-Obrecht, "Speaker diarization: Towards a more robust and portable system," in IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2007, vol. 4, pp. IV-489-IV-492.
-
(2007)
IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP)
, vol.4
, pp. IV-489-IV-492
-
-
Khoury, E.1
Sénac, C.2
Andre-Obrecht, R.3
-
19
-
-
84865752702
-
Exploiting intra-conversation variability for speaker diarization
-
S. Shum, N. Dehak, E. Chuangsuwanich, D. A. Reynolds, and J. R. Glass, "Exploiting intra-conversation variability for speaker diarization," in INTERSPEECH, 2011.
-
(2011)
Interspeech
-
-
Shum, S.1
Dehak, N.2
Chuangsuwanich, E.3
Reynolds, D.A.4
Glass, J.R.5
-
20
-
-
84878381961
-
On the use of spectral and iterative methods for speaker diarization
-
S. Shum, N. Dehak, and J. Glass, "On the use of spectral and iterative methods for speaker diarization," in INTERSPEECH, 2012.
-
(2012)
Interspeech
-
-
Shum, S.1
Dehak, N.2
Glass, J.3
-
21
-
-
84890467239
-
Efficient iterative mean shift based cosine dissimilarity for multi-recording speaker clustering
-
M. Senoussaoui, P. Kenny, P. Dumouchel, and T. Stafylakis, "Efficient iterative mean shift based cosine dissimilarity for multi-recording speaker clustering," in IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2013, pp. 7712-7715.
-
(2013)
IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP)
, pp. 7712-7715
-
-
Senoussaoui, M.1
Kenny, P.2
Dumouchel, P.3
Stafylakis, T.4
-
22
-
-
84897931702
-
A study of the cosine distance-based mean shift for telephone speech diarization
-
M. Senoussaoui, P. Kenny, T. Stafylakis, and P. Dumouchel, "A study of the cosine distance-based mean shift for telephone speech diarization," IEEE Trans. on Audio, Speech, and Language Processing (TASLP), vol. 22, no. 1, pp. 217-227, 2014.
-
(2014)
IEEE Trans. on Audio, Speech, and Language Processing (TASLP)
, vol.22
, Issue.1
, pp. 217-227
-
-
Senoussaoui, M.1
Kenny, P.2
Stafylakis, T.3
Dumouchel, P.4
-
23
-
-
0036565814
-
Mean shift: A robust approach toward feature space analysis
-
D. Comaniciu and P. Meer, "Mean shift: a robust approach toward feature space analysis," Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 24, no. 5, pp. 603-619, 2002.
-
(2002)
Pattern Analysis and Machine Intelligence, IEEE Transactions on
, vol.24
, Issue.5
, pp. 603-619
-
-
Comaniciu, D.1
Meer, P.2
-
24
-
-
84881068970
-
Unsupervised methods for speaker diarization: An integrated and iterative approach
-
S.H. Shum, N. Dehak, R. Dehak, and J.R. Glass, "Unsupervised methods for speaker diarization: An integrated and iterative approach," IEEE Trans. on Audio, Speech, and Language Processing (TASLP), vol. 21, no. 10, pp. 2015-2028, 2013.
-
(2013)
IEEE Trans. on Audio, Speech, and Language Processing (TASLP)
, vol.21
, Issue.10
, pp. 2015-2028
-
-
Shum, S.H.1
Dehak, N.2
Dehak, R.3
Glass, J.R.4
-
26
-
-
85073158100
-
Effect of multicondition training on i-vector PLDA configurations for speaker recognition
-
P. Rajan, T. Kinnunen, and V. Hautamaki, "Effect of multicondition training on i-vector PLDA configurations for speaker recognition," in INTERSPEECH, 2012.
-
(2012)
Interspeech
-
-
Rajan, P.1
Kinnunen, T.2
Hautamaki, V.3
-
27
-
-
0003126317
-
A general theory of classificatory sorting strategies. 1. hierarchical systems
-
G. N. Lance and W. T. Williams, "A general theory of classificatory sorting strategies. 1. hierarchical systems," Computer Journal, vol. 9, pp. 373-380, 1967.
-
(1967)
Computer Journal
, vol.9
, pp. 373-380
-
-
Lance, G.N.1
Williams, W.T.2
-
28
-
-
77958525204
-
A similarity measure for clustering and its applications
-
G. J. Torres, R. B. Basnet, A. H. Sung, S. Mukkamala, and B. Ribeiro, "A similarity measure for clustering and its applications," Intl. Journ. of Electrical, Computer, and Systems Engineering, vol. 3, no. 3, 2009.
-
(2009)
Intl. Journ. of Electrical, Computer, and Systems Engineering
, vol.3
, Issue.3
-
-
Torres, G.J.1
Basnet, R.B.2
Sung, A.H.3
Mukkamala, S.4
Ribeiro, B.5
-
30
-
-
84895063162
-
Audiovisual diarization of people in video content
-
E. Khoury, C. Sénac, and P. Joly, "Audiovisual diarization of people in video content," Multimedia Tools and Applications, vol. 68, no. 3, pp. 747-775, 2012.
-
(2012)
Multimedia Tools and Applications
, vol.68
, Issue.3
, pp. 747-775
-
-
Khoury, E.1
Sénac, C.2
Joly, P.3
-
31
-
-
84905215309
-
Spear: An open source toolbox for speaker recognition based on Bob
-
E. Khoury, L. El Shafey, and S. Marcel, "Spear: An open source toolbox for speaker recognition based on Bob," in IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2014.
-
(2014)
IEEE Intl. Conf. on Acoustics, Speech and Signal Processing (ICASSP)
-
-
Khoury, E.1
El Shafey, L.2
Marcel, S.3
-
32
-
-
84871393354
-
Bob: A free signal processing and machine learning toolbox for researchers
-
A. Anjos, L. El Shafey, R. Wallace, M. Günther, C. McCool, and S. Marcel, "Bob: a free signal processing and machine learning toolbox for researchers," in 20th ACM Intl. Conf. on Multimedia (ACMMM), 2012.
-
(2012)
20th ACM Intl. Conf. on Multimedia (ACMMM)
-
-
Anjos, A.1
El Shafey, L.2
Wallace, R.3
Günther, M.4
McCool, C.5
Marcel, S.6