-
1
-
-
70350125882
-
An overview of text-independent speaker recognition: From features to supervectors
-
January
-
T. Kinnunen and H. Li, "An overview of text-independent speaker recognition: from features to supervectors," Speech Communication, vol. 52, no. 1, pp. 12-40, January 2010.
-
(2010)
Speech Communication
, vol.52
, Issue.1
, pp. 12-40
-
-
Kinnunen, T.1
Li, H.2
-
2
-
-
0033884858
-
Speaker verification using adapted Gaussian mixture models
-
January
-
D. Reynolds, T. Quatieri, and R. Dunn, "Speaker verification using adapted Gaussian mixture models," Digital Signal Processing, vol. 10, no. 1, pp. 19-41, January 2000.
-
(2000)
Digital Signal Processing
, vol.10
, Issue.1
, pp. 19-41
-
-
Reynolds, D.1
Quatieri, T.2
Dunn, R.3
-
3
-
-
58349106697
-
A study of inter-speaker variability in speaker verification
-
July
-
P. Kenny, P. Ouellet, N. Dehak, V. Gupta, and P. Dumouchel, "A study of inter-speaker variability in speaker verification," IEEE Trans. Audio, Speech and Language Processing, vol. 16, no. 5, pp. 980-988, July 2008.
-
(2008)
IEEE Trans. Audio, Speech and Language Processing
, vol.16
, Issue.5
, pp. 980-988
-
-
Kenny, P.1
Ouellet, P.2
Dehak, N.3
Gupta, V.4
Dumouchel, P.5
-
4
-
-
77952192470
-
Temporally weighted linear prediction features for tackling additive noise in speaker verification
-
R. Saeidi, J. Pohjalainen, T. Kinnunen, and P. Alku, "Temporally weighted linear prediction features for tackling additive noise in speaker verification," IEEE Signal Processing Letters, vol. 17, no. 6, pp. 599-602, 2010.
-
(2010)
IEEE Signal Processing Letters
, vol.17
, Issue.6
, pp. 599-602
-
-
Saeidi, R.1
Pohjalainen, J.2
Kinnunen, T.3
Alku, P.4
-
5
-
-
79960368127
-
The Prosody of Speech: Timing and Rhythm
-
Birkhauser, ch.
-
J. Fletcher, The Handbook of Phonetic Sciences. Birkhauser, 2010, ch. The Prosody of Speech: Timing and Rhythm.
-
(2010)
The Handbook of Phonetic Sciences
-
-
Fletcher, J.1
-
6
-
-
36249002034
-
Long-term F0 modeling for text-independent speaker recognition
-
T. Kinnunen and R. González-Hautamäki, "Long-term F0 modeling for text-independent speaker recognition," in Proc. 10th International Conf. Speech and Computer (SPECOM'2005), Patras, Greece, October 2005, pp. 567-570.
-
Proc. 10th International Conf. Speech and Computer (SPECOM'2005), Patras, Greece, October 2005
, pp. 567-570
-
-
Kinnunen, T.1
González-Hautamäki, R.2
-
7
-
-
34047164452
-
Modeling prosodic differences for speaker recognition
-
April
-
A. Adami, "Modeling prosodic differences for speaker recognition," Speech Communication, vol. 49, no. 4, pp. 277-291, April 2007.
-
(2007)
Speech Communication
, vol.49
, Issue.4
, pp. 277-291
-
-
Adami, A.1
-
8
-
-
21844454996
-
Modeling prosodic feature sequences for speaker recognition
-
July
-
E. Shriberg, L. Ferrer, S. Kajarekar, A. Venkataraman, and A. Stolcke, "Modeling prosodic feature sequences for speaker recognition," Speech Communication, vol. 46, no. 3-4, pp. 455-472, July 2005.
-
(2005)
Speech Communication
, vol.46
, Issue.3-4
, pp. 455-472
-
-
Shriberg, E.1
Ferrer, L.2
Kajarekar, S.3
Venkataraman, A.4
Stolcke, A.5
-
9
-
-
64249101047
-
Modeling prosodic features with joint factor analysis for speaker verification
-
September
-
N. Dehak, P. Kenny, and P. Dumouchel, "Modeling prosodic features with joint factor analysis for speaker verification," IEEE Trans. Audio, Speech and Language Processing, vol. 15, no. 7, pp. 2095-2103, September 2007.
-
(2007)
IEEE Trans. Audio, Speech and Language Processing
, vol.15
, Issue.7
, pp. 2095-2103
-
-
Dehak, N.1
Kenny, P.2
Dumouchel, P.3
-
10
-
-
78049354761
-
Investigations into prosodic syllable contour features for speaker recognition
-
M. Kockmann, L. Burget, and J. Černocký, "Investigations into prosodic syllable contour features for speaker recognition," in Proc. ICASSP 2010, 2010, pp. 4418-4421.
-
(2010)
Proc. ICASSP 2010
, pp. 4418-4421
-
-
Kockmann, M.1
Burget, L.2
Černocký, J.3
-
11
-
-
0015476226
-
Automatic speaker recognition based on pitch contours
-
B. Atal, "Automatic speaker recognition based on pitch contours," Journal of the Acoustical Society of America, vol. 52, no. 6, pp. 1687-1697, 1972.
-
(1972)
Journal of the Acoustical Society of America
, vol.52
, Issue.6
, pp. 1687-1697
-
-
Atal, B.1
-
12
-
-
85009080322
-
Noise-robust speaker verification using F0 features
-
K. Iwano, T. Asami, and S. Furui, "Noise-robust speaker verification using F0 features," in Proc. Interspeech 2004, Jeju Island, Korea, October 2004, pp. 1417-1420.
-
Proc. Interspeech 2004, Jeju Island, Korea, October 2004
, pp. 1417-1420
-
-
Iwano, K.1
Asami, T.2
Furui, S.3
-
13
-
-
33947676771
-
-
April [Online]. Available: http://www.speech.kth.se/snack
-
"The Snack Sound Toolkit," April 2010. [Online]. Available: http://www.speech.kth.se/snack/
-
(2010)
The Snack Sound Toolkit
-
-
-
15
-
-
34547520011
-
A novel method for prosody prediction in voice conversion
-
Honolulu, Hawaii, USA, April
-
E. Helander and J. Nurminen, "A novel method for prosody prediction in voice conversion," in Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2007), vol. 4, Honolulu, Hawaii, USA, April 2007, pp. 509-512.
-
(2007)
Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2007)
, vol.4
, pp. 509-512
-
-
Helander, E.1
Nurminen, J.2
-
16
-
-
0016495091
-
Linear prediction: A tutorial review
-
April
-
J. Makhoul, "Linear prediction: a tutorial review," Proceedings of the IEEE, vol. 63, no. 4, pp. 561-580, April 1975.
-
(1975)
Proceedings of the IEEE
, vol.63
, Issue.4
, pp. 561-580
-
-
Makhoul, J.1
-
17
-
-
0004056285
-
-
New Jersey: Prentice-Hall
-
X. Huang, A. Acero, and H.-W. Hon, Spoken Language Processing: a Guide to Theory, Algorithm, and System Development. New Jersey: Prentice-Hall, 2001.
-
(2001)
Spoken Language Processing: A Guide to Theory, Algorithm, and System Development
-
-
Huang, X.1
Acero, A.2
Hon, H.-W.3
-
18
-
-
85135139722
-
A lognormal tied mixture model of pitch for prosody-based speaker recognition
-
M. Sönmez, L. Heck, M. Weintraub, and E. Shriberg, "A lognormal tied mixture model of pitch for prosody-based speaker recognition," in Proc. 5th European Conf. on Speech Communication and Technology (Eurospeech 1997), Rhodos, Greece, September 1997, pp. 1391-1394.
-
Proc. 5th European Conf. on Speech Communication and Technology (Eurospeech 1997), Rhodos, Greece, September 1997
, pp. 1391-1394
-
-
Sönmez, M.1
Heck, L.2
Weintraub, M.3
Shriberg, E.4