-
2
-
-
44949251671
-
Data-driven design of front-end filter bank for Lombard speech recognition
-
H. Boril, P. Fousek, and P. Pollak. Data-driven design of front-end filter bank for Lombard speech recognition. In Proc. Interspeech 2006 (ICSLP), pages 381-384, 2006.
-
(2006)
Proc. Interspeech 2006 (ICSLP)
, pp. 381-384
-
-
Boril, H.1
Fousek, P.2
Pollak, P.3
-
4
-
-
84886695822
-
A simple and effective speech activity detection algorithm for telephone and microphone speech
-
Atlanta, US
-
M. McLaren and D. A. van Leeuwen. A simple and effective speech activity detection algorithm for telephone and microphone speech. In Proc. NIST SRE 2011 workshop, Atlanta, US, 2011.
-
(2011)
Proc. NIST SRE 2011 Workshop
-
-
McLaren, M.1
Van Leeuwen, D.A.2
-
5
-
-
0041360463
-
Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging
-
I. Cohen. Noise spectrum estimation in adverse environments: improved minima controlled recursive averaging. IEEE Trans. on Speech and Audio Processing, 11(5):466 - 475, 2003.
-
(2003)
IEEE Trans. on Speech and Audio Processing
, vol.11
, Issue.5
, pp. 466-475
-
-
Cohen, I.1
-
6
-
-
85073258179
-
Feature warping for robust speaker verification
-
J. Pelecanos and S. Sridharan. Feature warping for robust speaker verification. In Odyssey, 2001.
-
(2001)
Odyssey
-
-
Pelecanos, J.1
Sridharan, S.2
-
7
-
-
84873315510
-
Unsupervised speech activity detection using voicing measures and perceptual spectral flux
-
Mar
-
S. O. Sadjadi and J. H. L. Hansen. Unsupervised speech activity detection using voicing measures and perceptual spectral flux. IEEE Signal Processing Letters, pages 197-200, Mar. 2013.
-
(2013)
IEEE Signal Processing Letters
, pp. 197-200
-
-
Sadjadi, S.O.1
Hansen, J.H.L.2
-
8
-
-
84865772156
-
Front-end compensation methods for LVCSR under Lombard effect
-
Florence, Italy
-
H. Bořil, F. Grézl, and J. H. L. Hansen. Front-end compensation methods for LVCSR under Lombard effect. In INTERSPEECH 2011, pages 1257-1260, Florence, Italy, 2011.
-
(2011)
Interspeech 2011
, pp. 1257-1260
-
-
Bořil, H.1
Grézl, F.2
Hansen, J.H.L.3
-
9
-
-
0031636164
-
A voice activity detector employing soft decision based noise spectrum adaptation
-
vol.1
-
J. Sohn and W. Sung. A voice activity detector employing soft decision based noise spectrum adaptation. In Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 1998), volume 1, pages 365 -368 vol.1, 1998.
-
(1998)
Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 1998)
, vol.1
, pp. 365-368
-
-
Sohn, J.1
Sung, W.2
-
11
-
-
79951609039
-
Front-end factor analysis for speaker verification
-
N. Dehak, P. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet. Front-end factor analysis for speaker verification. IEEE Trans. Audio, Speech and Language Processing, 19(4):788 -798, 2011.
-
(2011)
IEEE Trans. Audio, Speech and Language Processing
, vol.19
, Issue.4
, pp. 788-798
-
-
Dehak, N.1
Kenny, P.2
Dehak, R.3
Dumouchel, P.4
Ouellet, P.5
-
13
-
-
58349106697
-
A study of inter-speaker variability in speaker verification
-
July
-
P. Kenny, P. Ouellet, N. Dehak, V. Gupta, and P. Dumouchel. A study of inter-speaker variability in speaker verification. IEEE Trans. Audio, Speech and Language Processing, 16(5):980-988, July 2008.
-
(2008)
IEEE Trans. Audio, Speech and Language Processing
, vol.16
, Issue.5
, pp. 980-988
-
-
Kenny, P.1
Ouellet, P.2
Dehak, N.3
Gupta, V.4
Dumouchel, P.5
-
14
-
-
43249091937
-
Speaker and session variability in GMM-based speaker verification
-
May
-
P. Kenny, G. Boulianne, P. Ouellet, and P. Dumouchel. Speaker and session variability in GMM-based speaker verification. IEEE Trans. Audio, Speech and Language Processing, 15(4):1448- 1460, May 2007.
-
(2007)
IEEE Trans. Audio, Speech and Language Processing
, vol.15
, Issue.4
, pp. 1448-1460
-
-
Kenny, P.1
Boulianne, G.2
Ouellet, P.3
Dumouchel, P.4
-
15
-
-
85073229756
-
Variance-spectra based normalization for i-vector standard and probabilistic linear discriminant analysis
-
P. M. Bousquet, A. Larcher, D. Matrouf, J. F. Bonastre, and O. Plchot. Variance-spectra based normalization for i-vector standard and probabilistic linear discriminant analysis. In Odyssey, 2012, 2012.
-
(2012)
Odyssey, 2012
-
-
Bousquet, P.M.1
Larcher, A.2
Matrouf, D.3
Bonastre, J.F.4
Plchot, O.5
-
17
-
-
84865733857
-
Analysis of i-vector length normalization in speaker recognition systems
-
D. Garcia-Romero and C. Y. Espy-Wilson. Analysis of i-vector length normalization in speaker recognition systems. In Proc. Interspeech 2011, pages 249-252, 2011.
-
(2011)
Proc. Interspeech 2011
, pp. 249-252
-
-
Garcia-Romero, D.1
Espy-Wilson, C.Y.2
-
18
-
-
50949133669
-
LIBLINEAR: A library for large linear classification
-
R. E. Fan, K. W. Chang, C. J. Hsieh, X. R. Wang, and C. J. Lin. LIBLINEAR: A library for large linear classification. J. Mach. Learn. Res., 9:1871-1874, 2008.
-
(2008)
J. Mach. Learn. Res
, vol.9
, pp. 1871-1874
-
-
Fan, R.E.1
Chang, K.W.2
Hsieh, C.J.3
Wang, X.R.4
Lin, C.J.5
-
19
-
-
0035396555
-
Noise power spectral density estimation based on optimal smoothing and minimum statistics
-
R. Martin. Noise power spectral density estimation based on optimal smoothing and minimum statistics. IEEE Trans. on Speech and Audio Processing, 9(5):504 -512, 2001.
-
(2001)
IEEE Trans. on Speech and Audio Processing
, vol.9
, Issue.5
, pp. 504-512
-
-
Martin, R.1
-
20
-
-
84890535969
-
An investigation on back-end for speaker recognition in multi-session enrollment
-
G. Liu, T. Hasan, H. Bořil, and J.H.L. Hansen. An investigation on back-end for speaker recognition in multi-session enrollment. In Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2013), 2013.
-
(2013)
Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2013)
-
-
Liu, G.1
Hasan, T.2
Bořil, H.3
Hansen, J.H.L.4
-
21
-
-
84890540706
-
CRSS systems for 2012 NIST speaker recognition evaluation
-
T. Hasan, S. O. Sadjadi, G. Liu, N. Shokouhi, H. Bořil, and J.H.L. Hansen. CRSS Systems for 2012 NIST Speaker Recognition Evaluation. In Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2013), 2013.
-
(2013)
Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2013)
-
-
Hasan, T.1
Sadjadi, S.O.2
Liu, G.3
Shokouhi, N.4
Bořil, H.5
Hansen, J.H.L.6
-
22
-
-
85073247582
-
Variational Bayes logistic regression as regularized fusion for nist sre 2010
-
V. Hautamäki, K. A. Lee, A. Larcher, T. Kinnunen, B. Ma, and H. Li. Variational Bayes logistic regression as regularized fusion for nist sre 2010. In Odyssey, 2012, 2012.
-
(2012)
Odyssey, 2012
-
-
Hautamäki, V.1
Lee, K.A.2
Larcher, A.3
Kinnunen, T.4
Ma, B.5
Li, H.6
-
23
-
-
84877743396
-
Sparse classifier fusion for speaker verification
-
Accepted for publication
-
V. Hautamaki, T. Kinnunen, F. Sedlak, K.-A. Lee, B. Ma, and H. Li. Sparse classifier fusion for speaker verification. IEEE Trans. Audio, Speech and Language Processing, 2013, Accepted for publication.
-
(2013)
IEEE Trans. Audio, Speech and Language Processing
-
-
Hautamaki, V.1
Kinnunen, T.2
Sedlak, F.3
Lee, K.-A.4
Ma, B.5
Li, H.6
-
24
-
-
85073232294
-
A small foot-print i-vector extractor
-
P. Kenny. A small foot-print i-vector extractor. In Odyssey, 2012, pages 1-6, 2012.
-
(2012)
Odyssey, 2012
, pp. 1-6
-
-
Kenny, P.1
-
25
-
-
85073109470
-
An i-vector extractor suitable for speaker recognition with both microphone and telephone speech
-
M. Senoussaoui, P. Kenny, N. Dehak, and P. Dumouchel. An i-vector extractor suitable for speaker recognition with both microphone and telephone speech. In Odyssey, 2010, pages 28-33, 2010.
-
(2010)
Odyssey, 2010
, pp. 28-33
-
-
Senoussaoui, M.1
Kenny, P.2
Dehak, N.3
Dumouchel, P.4
-
26
-
-
84878413073
-
PLDA modeling in i-vector and supervector space for speaker verification
-
Y. Jiang, K. A. Lee, Z. Tang, B. Ma, A. Larcher, and H. Li. PLDA modeling in i-vector and supervector space for speaker verification. In Proc. Interspeech 2012, 2012.
-
(2012)
Proc. Interspeech 2012
-
-
Jiang, Y.1
Lee, K.A.2
Tang, Z.3
Ma, B.4
Larcher, A.5
Li, H.6
-
27
-
-
33645887246
-
Support vector machines using GMM supervectors for speaker verification
-
W. M. Campbell, D. E. Sturim, and D. A. Reynolds. Support vector machines using GMM supervectors for speaker verification. IEEE Signal Processing Letters, 13(5):308 - 311, 2006.
-
(2006)
IEEE Signal Processing Letters
, vol.13
, Issue.5
, pp. 308-311
-
-
Campbell, W.M.1
Sturim, D.E.2
Reynolds, D.A.3
-
28
-
-
33947696754
-
SVM based speaker verification using a GMM supervector kernel and NAP variability compensation
-
Toulouse, France
-
W. M. Campbell, D. E. Sturim, D. A. Reynolds, and A. Solomonoff. SVM based speaker verification using a GMM supervector kernel and NAP variability compensation. In Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2006), volume 1, pages 97-100, Toulouse, France, 2006.
-
(2006)
Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2006)
, vol.1
, pp. 97-100
-
-
Campbell, W.M.1
Sturim, D.E.2
Reynolds, D.A.3
Solomonoff, A.4
-
29
-
-
33947630848
-
Use of antimodels to further improve state-of-the-art PRLM language recognition system
-
P. Matejka, P. Schwarz, L. Burget, and J. Cernocky. Use of antimodels to further improve state-of-the-art PRLM language recognition system. In Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2006), volume 1, pages 197-200, 2006.
-
(2006)
Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2006)
, vol.1
, pp. 197-200
-
-
Matejka, P.1
Schwarz, P.2
Burget, L.3
Cernocky, J.4
-
30
-
-
77955790894
-
GMM-SVM kernel with a bhattacharyya-based distance for speaker recognition
-
C. H. You, K. A. Lee, and H. Li. GMM-SVM kernel with a bhattacharyya-based distance for speaker recognition. IEEE Trans. Audio, Speech and Language Processing, 18(6):1300 - 1312, 2010.
-
(2010)
IEEE Trans. Audio, Speech and Language Processing
, vol.18
, Issue.6
, pp. 1300-1312
-
-
You, C.H.1
Lee, K.A.2
Li, H.3
-
32
-
-
84906283218
-
IIR system description for the 2010 nist speaker recognition evaluation submission
-
B. Ma, H. Sun, K. A. Lee, C. H. You, D. Zhu, E. Wang, R. Tong, C. L. Huang, C. C. Leung, V. Hautamäki, and H. Li. IIR system description for the 2010 nist speaker recognition evaluation submission. In Proc. NIST SRE 2010 workshop, 2011.
-
(2011)
Proc. NIST SRE 2010 Workshop
-
-
Ma, B.1
Sun, H.2
Lee, K.A.3
You, C.H.4
Zhu, D.5
Wang, E.6
Tong, R.7
Huang, C.L.8
Leung, C.C.9
Hautamäki, V.10
Li, H.11
-
33
-
-
70349199116
-
Comparison of scoring methods used in speaker recognition with joint factor analysis
-
O. Glembek, L. Burget, N. Dehak, N. Brummer, and P. Kenny. Comparison of scoring methods used in speaker recognition with joint factor analysis. In Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2009), 2009.
-
(2009)
Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2009)
-
-
Glembek, O.1
Burget, L.2
Dehak, N.3
Brummer, N.4
Kenny, P.5
-
35
-
-
71249110320
-
A majorization minimization algorithm for (multiple) hyperparameter learning
-
C. S. Foo, C. B. Do, and A. Y. Ng. A majorization minimization algorithm for (multiple) hyperparameter learning. In Int. Conf. Mach. Learning, pages 321-328, 2009.
-
(2009)
Int. Conf. Mach. Learning
, pp. 321-328
-
-
Foo, C.S.1
Do, C.B.2
Ng, A.Y.3
|