SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2012, Pages 4777-4780

Intonational speaker verification: A study on parameters and performance under noisy conditions

(4) Siddiq, Sadjad a Kinnunen, Tomi a Vainio, Martti b Werner, Stefan a

a UNIVERSITY OF EASTERN FINLAND (Finland)

b UNIVERSITY OF HELSINKI (Finland)

Author keywords

fundamental frequency; prosodic features; speaker recognition

Indexed keywords

BASELINE SYSTEMS; FUNDAMENTAL FREQUENCIES; HERMITE; LEGENDRE POLYNOMIALS; LINEAR PREDICTION; NOISE CONTAMINATION; NOISE DEGRADATIONS; OPTIMIZATION OF PARAMETERS; OPTIMIZED PARAMETER; PROSODIC FEATURES; PROSODY FEATURES; SPEAKER RECOGNITION; SPEAKER VERIFICATION; TRANSFORMATION COEFFICIENTS;

ACOUSTIC NOISE; DISCRETE COSINE TRANSFORMS; DISCRETE FOURIER TRANSFORMS; NATURAL FREQUENCIES; OPTIMIZATION; SIGNAL PROCESSING; SIGNAL TO NOISE RATIO; WHITE NOISE;

SPEECH RECOGNITION;

EID: 84867600823 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2012.6288987 Document Type: Conference Paper

Times cited : (5)

References (19)

1
- 70350125882
- An overview of text-independent speaker recognition: From features to supervectors
- January
- T. Kinnunen and H. Li, "An overview of text-independent speaker recognition: from features to supervectors," Speech Communication, vol. 52, no. 1, pp. 12-40, January 2010.
- (2010) Speech Communication , vol.52 , Issue.1 , pp. 12-40
- Kinnunen, T.¹ Li, H.²

2
- 0033884858
- Speaker verification using adapted gaussian mixture models
- January
- D. Reynolds, T. Quatieri, and R. Dunn, "Speaker verification using adapted gaussian mixture models," Digital Signal Processing, vol. 10, no. 1, pp. 19-41, January 2000.
- (2000) Digital Signal Processing , vol.10 , Issue.1 , pp. 19-41
- Reynolds, D.¹ Quatieri, T.² Dunn, R.³

3
- 58349106697
- A study of inter-speaker variability in speaker verification
- July
- P. Kenny, P. Ouellet, N. Dehak, V. Gupta, and P. Dumouchel, "A study of inter-speaker variability in speaker verification," IEEE Trans. Audio, Speech and Language Processing, vol. 16, no. 5, pp. 980-988, July 2008.
- (2008) IEEE Trans. Audio, Speech and Language Processing , vol.16 , Issue.5 , pp. 980-988
- Kenny, P.¹ Ouellet, P.² Dehak, N.³ Gupta, V.⁴ Dumouchel, P.⁵

4
- 77952192470
- Temporally weighted linear prediction features for tackling additive noise in speaker verification
- R. Saeidi, J. Pohjalainen, T. Kinnunen, and P. Alku, "Temporally weighted linear prediction features for tackling additive noise in speaker verification," IEEE Signal Processing Letters, vol. 17, no. 6, pp. 599-602, 2010.
- (2010) IEEE Signal Processing Letters , vol.17 , Issue.6 , pp. 599-602
- Saeidi, R.¹ Pohjalainen, J.² Kinnunen, T.³ Alku, P.⁴

5
- 79960368127
- The Prosody of Speech: Timing and Rhythm
- Birkhauser, ch.
- J. Fletcher, The Handbook of Phonetic Sciences. Birkhauser, 2010, ch. The Prosody of Speech: Timing and Rhythm.
- (2010) The Handbook of Phonetic Sciences
- Fletcher, J.¹

6
- 36249002034
- Long-term F0 modeling for text-independent speaker recognition
- T. Kinnunen and R. González-Hautamäki, "Long-term F0 modeling for text-independent speaker recognition," in Proc. 10th International Conf. Speech and Computer (SPECOM'2005), Patras, Greece, October 2005, pp. 567-570.
- Proc. 10th International Conf. Speech and Computer (SPECOM'2005), Patras, Greece, October 2005 , pp. 567-570
- Kinnunen, T.¹ González-Hautamäki, R.²

7
- 34047164452
- Modeling prosodic differences for speaker recognition
- April
- A. Adami, "Modeling prosodic differences for speaker recognition," Speech Communication, vol. 49, no. 4, pp. 277-291, April 2007.
- (2007) Speech Communication , vol.49 , Issue.4 , pp. 277-291
- Adami, A.¹

8
- 21844454996
- Modeling prosodic feature sequences for speaker recognition
- July
- E. Shriberg, L. Ferrer, S. Kajarekar, A. Venkataraman, and A. Stolcke, "Modeling prosodic feature sequences for speaker recognition," Speech Communication, vol. 46, no. 3-4, pp. 455-472, July 2005.
- (2005) Speech Communication , vol.46 , Issue.3-4 , pp. 455-472
- Shriberg, E.¹ Ferrer, L.² Kajarekar, S.³ Venkataraman, A.⁴ Stolcke, A.⁵

9
- 64249101047
- Modeling prosodic features with joint factor analysis for speaker verification
- September
- N. Dehak, P. Kenny, and P. Dumouchel, "Modeling prosodic features with joint factor analysis for speaker verification," IEEE Trans. Audio, Speech and Language Processing, vol. 15, no. 7, pp. 2095-2103, September 2007.
- (2007) IEEE Trans. Audio, Speech and Language Processing , vol.15 , Issue.7 , pp. 2095-2103
- Dehak, N.¹ Kenny, P.² Dumouchel, P.³

10
- 78049354761
- Investigations into prosodic syllable contour features for speaker recognition
- M. Kockmann, L. Burget, and J. Černocký, "Investigations into prosodic syllable contour features for speaker recognition," in Proc. ICASSP 2010, 2010, pp. 4418-4421.
- (2010) Proc. ICASSP 2010 , pp. 4418-4421
- Kockmann, M.¹ Burget, L.² Černocký, J.³

11
- 0015476226
- Automatic speaker recognition based on pitch contours
- B. Atal, "Automatic speaker recognition based on pitch contours," Journal of the Acoustic Society of America, vol. 52, no. 6, pp. 1687-1697, 1972.
- (1972) Journal of the Acoustic Society of America , vol.52 , Issue.6 , pp. 1687-1697
- Atal, B.¹

12
- 85009080322
- Noise-robust speaker verification using f0 features
- K. Iwano, T. Asami, and S. Sadaoki, "Noise-robust speaker verification using f0 features," in Proc. Interspeech 2004, Jeju Island, Korea, October 2004, pp. 1417-1420.
- Proc. Interspeech 2004, Jeju Island, Korea, October 2004 , pp. 1417-1420
- Iwano, K.¹ Asami, T.² Sadaoki, S.³

13
- 33947676771
- April [Online]. Available: http://www.speech.kth.se/snack
- "The snack sound toolkit," April 2010, http://www.speech.kth. se/snack/. [Online]. Available: http://www.speech.kth.se/snack/
- (2010) The Snack Sound Toolkit

14
- 77953352057
- WWW page, February
- P. Boersma and D. Weenink, "Praat: doing phonetics by computer [computer program]," WWW page, February 2011, http://www.praat.org/.
- (2011) Praat: Doing Phonetics by Computer [Computer Program]
- Boersma, P.¹ Weenink, D.²

15
- 34547520011
- A novel method for prosody prediction in voice conversion
- Honolulu, Hawaii, USA, April
- E. Helander and J. Nurminen, "A novel method for prosody prediction in voice conversion," in Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2007), vol. 4, Honolulu, Hawaii, USA, April 2007, pp. 509-512.
- (2007) Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2007) , vol.4 , pp. 509-512
- Helander, E.¹ Nurminen, J.²

16
- 0016495091
- Linear prediction: A tutorial review
- April
- J. Makhoul, "Linear prediction: a tutorial review," Proceedings of the IEEE, vol. 64, no. 4, pp. 561-580, April 1975.
- (1975) Proceedings of the IEEE , vol.64 , Issue.4 , pp. 561-580
- Makhoul, J.¹

17
- 0004056285
- New Jersey: Prentice-Hall
- X. Huang, A. Acero, and H.-W. Hon, Spoken Language Processing: a Guide to Theory, Algorithm, and System Development. New Jersey: Prentice-Hall, 2001.
- (2001) Spoken Language Processing: A Guide to Theory, Algorithm, and System Development
- Huang, X.¹ Acero, A.² Hon, H.-W.³

18
- 85135139722
- A lognormal tied mixture model of pitch for prosody-based speaker recognition
- M. Sönmez, L. Heck, M. Weintraub, and E. Shriberg, "A lognormal tied mixture model of pitch for prosody-based speaker recognition," in Proc. 5th European Conf. on Speech Communication and Technology (Eurospeech 1997), Rhodos, Greece, September 1997, pp. 1391-1394.
- Proc. 5th European Conf. on Speech Communication and Technology (Eurospeech 1997), Rhodos, Greece, September 1997 , pp. 1391-1394
- Sönmez, M.¹ Heck, L.² Weintraub, M.³ Shriberg, E.⁴

19
- 34447100796
- CRC Press
- P. Loizou, Speech Enhancement: Theory and Practice. CRC Press, 2007.
- (2007) Speech Enhancement: Theory and Practice
- Loizou, P.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.