-
1
-
-
50249182472
-
Discriminative in-set/out-of-set speaker recognition
-
10.1109/TASL.2006.881689
-
Angkititrakul, P., and Hansen, J. H. L. (2007). Discriminative in-set/out-of-set speaker recognition., IEEE Trans. Audio, Speech, Lang. Process. 15, 498-508. 10.1109/TASL.2006.881689
-
(2007)
IEEE Trans. Audio, Speech, Lang. Process
, vol.15
, pp. 498-508
-
-
Angkititrakul, P.1
Hansen, J.H.L.2
-
2
-
-
27144489164
-
A tutorial on support vector machines for pattern recognition
-
10.1023/A:1009715923555
-
Burges, C. (1998). A tutorial on support vector machines for pattern recognition., Data Min. Knowl. Discov. 2, 121-167. 10.1023/A:1009715923555
-
(1998)
Data Min. Knowl. Discov
, vol.2
, pp. 121-167
-
-
Burges, C.1
-
3
-
-
84959118000
-
The fisher corpus: A resource for the next generations of speech-to-text
-
Lisbon, Portugal
-
Cieri, C., Miller, D., and Walker, K. (2004). The fisher corpus: A resource for the next generations of speech-to-text., in Fourth International Conference on Language Resources and Evaluation May 2004, Lisbon, Portugal, Vol. 1, pp. 1-3.
-
(2004)
Fourth International Conference on Language Resources and Evaluation May 2004
, vol.1
, pp. 1-3
-
-
Cieri, C.1
Miller, D.2
Walker, K.3
-
5
-
-
0028419019
-
Maximum a posteriori estimation for multivariate Gaussian mixture observations of markov chains
-
10.1109/89.279278
-
Gauvain, J., and Lee, C. (1994). Maximum a posteriori estimation for multivariate Gaussian mixture observations of markov chains., IEEE Trans. Speech Audio Process. 2, 291-298. 10.1109/89.279278
-
(1994)
IEEE Trans. Speech Audio Process
, vol.2
, pp. 291-298
-
-
Gauvain, J.1
Lee, C.2
-
6
-
-
0028516097
-
Text-independent speaker identification
-
10.1109/79.317924
-
Gish, H., and Schmidt, M. (1994). Text-independent speaker identification., IEEE Signal Process. Mag. 11, 18-32. 10.1109/79.317924
-
(1994)
IEEE Signal Process. Mag
, vol.11
, pp. 18-32
-
-
Gish, H.1
Schmidt, M.2
-
7
-
-
85073095401
-
Human assisted speaker recognition in NIST SRE10
-
Brno, Czech Republic
-
Greenberg, G., Martin, A., Brandschain, L., Campbell, J., Cieri, C., Doddington, G., and Godfrey, J. (2010). Human assisted speaker recognition in NIST SRE10., in Odyssey 2010, Brno, Czech Republic, pp. 180-185.
-
(2010)
Odyssey 2010
, pp. 180-185
-
-
Greenberg, G.1
Martin, A.2
Brandschain, L.3
Campbell, J.4
Cieri, C.5
Doddington, G.6
Godfrey, J.7
-
8
-
-
85008020310
-
Speechfind: Advances in spoken document retrieval for a national gallery of the spoken word
-
10.1109/TSA.2005.852088
-
Hansen, J. H. L., Huang, R., Zhou, B., Seadle, M., Deller, J., Gurijala, A., Kurimo, M., and Angkititrakul, P. (2005). Speechfind: advances in spoken document retrieval for a national gallery of the spoken word., IEEE Trans. Speech Audio Process. 13, 712-730. 10.1109/TSA.2005.852088
-
(2005)
IEEE Trans. Speech Audio Process
, vol.13
, pp. 712-730
-
-
Hansen, J.H.L.1
Huang, R.2
Zhou, B.3
Seadle, M.4
Deller, J.5
Gurijala, A.6
Kurimo, M.7
Angkititrakul, P.8
-
9
-
-
85009152939
-
CU-move: Robust speech processing for in-vehicle speech systems
-
Beijing, China
-
Hansen, J. H. L., Plucienkowski, J., Gallant, S., Pellom, B., and Ward, W. (2000). CU-move: Robust speech processing for in-vehicle speech systems., in ICSLP 2000, Beijing, China, pp. 524-527.
-
(2000)
ICSLP 2000
, pp. 524-527
-
-
Hansen, J.H.L.1
Plucienkowski, J.2
Gallant, S.3
Pellom, B.4
Ward, W.5
-
10
-
-
0000375621
-
A robust version of the probability ratio test
-
10.1214/aoms/1177699803
-
Huber, P. (1965). A robust version of the probability ratio test., Ann Math. Stat. 36, 1753-1758. 10.1214/aoms/1177699803
-
(1965)
Ann Math. Stat
, vol.36
, pp. 1753-1758
-
-
Huber, P.1
-
11
-
-
18744386134
-
Eigenvoice modeling with sparse training data
-
10.1109/TSA.2004.840940
-
Kenny, P., Boulianne, G., and Dumouchel, P. (2005). Eigenvoice modeling with sparse training data., IEEE Trans. Audio, Speech, Lang. Proc. 13, 345-354. 10.1109/TSA.2004.840940
-
(2005)
IEEE Trans. Audio, Speech, Lang. Proc
, vol.13
, pp. 345-354
-
-
Kenny, P.1
Boulianne, G.2
Dumouchel, P.3
-
12
-
-
43249091937
-
Speaker and session variability in GMM-based speaker verification
-
10.1109/TASL.2007.894527
-
Kenny, P., Boulianne, G., Ouellet, P., and Dumouchel, P. (2007). Speaker and session variability in GMM-based speaker verification., IEEE Trans. Audio, Speech, Lang. Process. 15, 1448-1460. 10.1109/TASL.2007.894527
-
(2007)
IEEE Trans. Audio, Speech, Lang. Process
, vol.15
, pp. 1448-1460
-
-
Kenny, P.1
Boulianne, G.2
Ouellet, P.3
Dumouchel, P.4
-
13
-
-
0002044263
-
-
(World Scientific Publishing, MA)
-
Kressel, U., and Schurmann, J. (1997). Pattern Classification Techniques Based on Function Approximation, (World Scientific Publishing, MA), pp. 49-78.
-
(1997)
Pattern Classification Techniques Based on Function Approximation
, pp. 49-78
-
-
Kressel, U.1
Schurmann, J.2
-
14
-
-
0034320005
-
Rapid speaker adaptation in Eigenvoice space
-
10.1109/89.876308
-
Kuhn, R., Junqua, J., Nguyen, P., and Niedzielski, N. (2000). Rapid speaker adaptation in Eigenvoice space., IEEE Trans. Speech Audio Process. 8, 695-707. 10.1109/89.876308
-
(2000)
IEEE Trans. Speech Audio Process
, vol.8
, pp. 695-707
-
-
Kuhn, R.1
Junqua, J.2
Nguyen, P.3
Niedzielski, N.4
-
15
-
-
0001927585
-
On information and sufficiency
-
10.1214/aoms/1177729694
-
Kullback, S., and Leibler, R. (1951). On information and sufficiency., Ann. Math. Stat. 22, 79-86. 10.1214/aoms/1177729694
-
(1951)
Ann. Math. Stat
, vol.22
, pp. 79-86
-
-
Kullback, S.1
Leibler, R.2
-
16
-
-
85135371588
-
High performance speaker-independent phone recognition using CDHMM
-
Berlin, Germany
-
Lamel, L., and Gauvain, J. (1993). High performance speaker-independent phone recognition using CDHMM., in EUROSPEECH 1993, Berlin, Germany, pp. 121-124.
-
(1993)
EUROSPEECH 1993
, pp. 121-124
-
-
Lamel, L.1
Gauvain, J.2
-
17
-
-
0024768209
-
Speaker-independent phone recognition using hidden Markov models
-
10.1109/29.46546
-
Lee, K., and Hon, H. (1989). Speaker-independent phone recognition using hidden Markov models., IEEE Trans. Acoust., Speech, Signal Process. 37, 1641-1648. 10.1109/29.46546
-
(1989)
IEEE Trans. Acoust., Speech, Signal Process
, vol.37
, pp. 1641-1648
-
-
Lee, K.1
Hon, H.2
-
18
-
-
84857387950
-
-
(CRC Press, New York)
-
Li, Q., Juang, B., Lee, C., Zhou, Q., and Soong, F. (2003). Speaker Authentication (CRC Press, New York), pp. 229-259.
-
(2003)
Speaker Authentication
, pp. 229-259
-
-
Li, Q.1
Juang, B.2
Lee, C.3
Zhou, Q.4
Soong, F.5
-
19
-
-
44949143337
-
Speaker cluster based GMM tokenization for speaker recognition
-
Pittsburgh, PA
-
Ma, B., Zhu, D., Tong, R., and Li, H. (2006). Speaker cluster based GMM tokenization for speaker recognition., in INTERSPEECH 2006, Pittsburgh, PA, Vol. 1, pp. 505-508.
-
(2006)
INTERSPEECH 2006
, vol.1
, pp. 505-508
-
-
Ma, B.1
Zhu, D.2
Tong, R.3
Li, H.4
-
20
-
-
33947670488
-
A comparison of various adaptation methods for speaker verification with limited enrollment data
-
Toulouse, France
-
Mak, M., Hsiao, R., and Mak, B. (2006). A comparison of various adaptation methods for speaker verification with limited enrollment data., in ICASSP 2006, Toulouse, France, Vol. 1, pp. 929-932.
-
(2006)
ICASSP 2006
, vol.1
, pp. 929-932
-
-
Mak, M.1
Hsiao, R.2
Mak, B.3
-
21
-
-
84857407484
-
Structural linear model-space transformations for speaker adaptation
-
Geneva, Switzerland
-
Matrouf, D., Bellot, O., Nocera, P., Linares, G., and Bonastre, J. (2003). Structural linear model-space transformations for speaker adaptation., in EUROSPEECH 2003, Geneva, Switzerland, pp. 1625-1628.
-
(2003)
EUROSPEECH 2003
, pp. 1625-1628
-
-
Matrouf, D.1
Bellot, O.2
Nocera, P.3
Linares, G.4
Bonastre, J.5
-
22
-
-
0017992187
-
Frequency of occurrence of phonemes in conversational English
-
Mines, M., Hanson, B., and Shoup, J. (1978). Frequency of occurrence of phonemes in conversational English. Lang. Speech 21, 221-241.
-
(1978)
Lang. Speech
, vol.21
, pp. 221-241
-
-
Mines, M.1
Hanson, B.2
Shoup, J.3
-
23
-
-
84867213267
-
Language and genre detection in audio content analysis
-
Brisbane, Australia
-
Mitra, V., Garcia-Romero, D., and Espy-Wilson, C. (2008). Language and genre detection in audio content analysis., INTERSPEECH-2008, Brisbane, Australia, pp. 2506-2509.
-
(2008)
INTERSPEECH-2008
, pp. 2506-2509
-
-
Mitra, V.1
Garcia-Romero, D.2
Espy-Wilson, C.3
-
24
-
-
32644450332
-
Feature warping for robust speaker verification
-
Pelecanos, J., and Sridharan, S. (2001). Feature warping for robust speaker verification., Proc. Speaker Odyssey 13, 1-5.
-
(2001)
Proc. Speaker Odyssey
, vol.13
, pp. 1-5
-
-
Pelecanos, J.1
Sridharan, S.2
-
25
-
-
63049100830
-
In-set/out-of-set speaker recognition under sparse enrollment
-
10.1109/TASL.2007.902058
-
Prakash, V., and Hansen, J. H. L. (2007). In-set/out-of-set speaker recognition under sparse enrollment., IEEE Trans. Audio, Speech, Lang. Process. 15, 2044-2052. 10.1109/TASL.2007.902058
-
(2007)
IEEE Trans. Audio, Speech, Lang. Process
, vol.15
, pp. 2044-2052
-
-
Prakash, V.1
Hansen, J.H.L.2
-
26
-
-
0033884858
-
Speaker verification using adapted gaussian mixture models
-
10.1006/dspr.1999.0361
-
Reynolds, D., Quatieri, T., and Dunn, R. (2000). Speaker verification using adapted gaussian mixture models., Digit. Signal Process. 10, 19-41. 10.1006/dspr.1999.0361
-
(2000)
Digit. Signal Process
, vol.10
, pp. 19-41
-
-
Reynolds, D.1
Quatieri, T.2
Dunn, R.3
-
27
-
-
44949231596
-
A multiclass framework for speaker verification within an acoustic event sequence system
-
Pittsburgh, PA
-
Scheffer, N., and Bonastre, J. (2006). A multiclass framework for speaker verification within an acoustic event sequence system., in INTERSPEECH 2006, Pittsburgh, PA, pp. 501-504.
-
(2006)
INTERSPEECH 2006
, pp. 501-504
-
-
Scheffer, N.1
Bonastre, J.2
-
28
-
-
0033889739
-
Speaker verification by human listeners: Experiments comparing human and machine performance using the NIST 1998 Speaker Evaluation Data 1
-
10.1006/dspr.1999.0356
-
Schmidt-Nielsen, A., and Crystal, T. (2000). Speaker verification by human listeners: Experiments comparing human and machine performance using the NIST 1998 Speaker Evaluation Data 1., Digit. Signal Process. 10, 249-266. 10.1006/dspr.1999.0356
-
(2000)
Digit. Signal Process
, vol.10
, pp. 249-266
-
-
Schmidt-Nielsen, A.1
Crystal, T.2
-
29
-
-
84857407659
-
USSS-MITLL 2010 Human Assisted Speaker Recognition Evaluation System
-
Brno, Czech Republic
-
Schwartz, R., Campbell, J., Shen, W., D., S., Campbell, W., Richardson, F., Dunn, R., and Granville, R. (2010). USSS-MITLL 2010 Human Assisted Speaker Recognition Evaluation System., in NIST SRE Workshop 2010, Brno, Czech Republic, pp. 1-7.
-
(2010)
NIST SRE Workshop 2010
, pp. 1-7
-
-
Schwartz, R.1
Campbell, J.2
Shen, W.D.S.3
Campbell, W.4
Richardson, F.5
Dunn, R.6
Granville, R.7
-
30
-
-
85009207995
-
Score normalisation applied to open-set, text-independent speaker identification
-
Geneva, Switzerland
-
Sivakumaran, P., Fortuna, J., and Ariyaeeinia, A. (2003). Score normalisation applied to open-set, text-independent speaker identification., in EUROSPEECH, Geneva, Switzerland, pp. 2669-2672.
-
(2003)
EUROSPEECH
, pp. 2669-2672
-
-
Sivakumaran, P.1
Fortuna, J.2
Ariyaeeinia, A.3
-
31
-
-
0022229052
-
A vector quantization approach to speaker recognition
-
FL
-
Soong, F., Rosenberg, A., Rabiner, L., and Juang, L. (1985). A vector quantization approach to speaker recognition., in ICASSP 1985, FL, pp. 387-390.
-
(1985)
ICASSP 1985
, pp. 387-390
-
-
Soong, F.1
Rosenberg, A.2
Rabiner, L.3
Juang, L.4
-
32
-
-
85009275225
-
Approaches to language identification using Gaussian mixture models and shifted delta cepstral features
-
Tampa, FL
-
Torres-Carrasquillo, P., Singer, E., Kohler, M., Greene, R., Reynolds, D., and Deller, Jr., J. (2002). Approaches to language identification using Gaussian mixture models and shifted delta cepstral features., in ICSLP 2002, Tampa, FL, pp. 89-92.
-
(2002)
ICSLP 2002
, pp. 89-92
-
-
Torres-Carrasquillo, P.1
Singer, E.2
Kohler, M.3
Greene, R.4
Reynolds, D.5
Deller Jr., J.6
-
33
-
-
64549092742
-
A cohort-based speaker model synthesis for mismatched channels in speaker verification
-
10.1109/TASL.2007.899297
-
Wu, W., Zheng, T., Xu, M., and Soong, F. (2007). A cohort-based speaker model synthesis for mismatched channels in speaker verification., IEEE Trans. Audio, Speech, Lang. Process. 15, 1893-1903. 10.1109/TASL.2007.899297
-
(2007)
IEEE Trans. Audio, Speech, Lang. Process
, vol.15
, pp. 1893-1903
-
-
Wu, W.1
Zheng, T.2
Xu, M.3
Soong, F.4
-
34
-
-
0041360472
-
Efficient text-independent speaker verification with structural gaussian mixture models and neural network
-
10.1109/TSA.2003.815822
-
Xiang, B., and Berger, T. (2003). Efficient text-independent speaker verification with structural gaussian mixture models and neural network., IEEE Trans. Audio, Speech, Lang. Process. 11, 447-456. 10.1109/TSA.2003.815822
-
(2003)
IEEE Trans. Audio, Speech, Lang. Process
, vol.11
, pp. 447-456
-
-
Xiang, B.1
Berger, T.2
|