SCOPUS 정보 검색 플랫폼

Odyssey 2010: Speaker and Language Recognition Workshop

Volumn , Issue , 2010, Pages 12-19

Modeling prosody for speaker recognition: Why estimating pitch may be a red herring

(2) Laskowski, Kornel a Jin, Qin a

a Carnegie Mellon University (United States)

Author keywords

[No Author keywords available]

Indexed keywords

HARMONIC ANALYSIS; SPEECH PROCESSING;

ACOUSTIC CORRELATES; DISCRETE TRANSFORMS; HARMONIC ENERGY; HARMONIC STRUCTURES; PITCH ESTIMATION; PROSODIC FEATURES; SPEAKER RECOGNITION; SPECTRAL ENVELOPES;

SPEECH RECOGNITION;

EID: 85073112188 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (2)

References (27)

1
- 0029209272
- Robust text-independent speaker identification using Gaussian mixture speaker models
- Reynolds, D. and Rose, R., “Robust text-independent speaker identification using Gaussian mixture speaker models”, IEEE Trans. Speech and Audio Processing, 3:72-83, 1995.
- (1995) IEEE Trans. Speech and Audio Processing , vol.3 , pp. 72-83
- Reynolds, D.¹ Rose, R.²

2
- 0033884858
- Speaker verification using adapted Gaussian mixture models
- Reynolds, D., Quatieri, T., Dunn, R., “Speaker verification using adapted Gaussian mixture models”, Digital Signal Processing, 10(1-3):19-41, 2000.
- (2000) Digital Signal Processing , vol.10 , Issue.1-3 , pp. 19-41
- Reynolds, D.¹ Quatieri, T.² Dunn, R.³

3
- 85009124414
- Speaker recognition based on idiolectal differences between speakers
- Doddington, G., “Speaker recognition based on idiolectal differences between speakers”, Proc. EUROSPEECH, 2001.
- (2001) Proc. EUROSPEECH
- Doddington, G.¹

4
- 0141521592
- Modeling prosodic dynamics for speaker recognition
- Adami, A., Mihaescu, R., Reynolds, D., and Godfrey, J., “Modeling prosodic dynamics for speaker recognition”, Proc. ICASSP, 788-791, 2003.
- (2003) Proc. ICASSP , pp. 788-791
- Adami, A.¹ Mihaescu, R.² Reynolds, D.³ Godfrey, J.⁴

5
- 0036299157
- Using prosodic and lexical information for speaker identification
- Weber, F., Manganaro, L., Peskin, B., Shriberg, E., “Using prosodic and lexical information for speaker identification,” Proc. ICASSP, 1:141-144, 2002.
- (2002) Proc. ICASSP , vol.1 , pp. 141-144
- Weber, F.¹ Manganaro, L.² Peskin, B.³ Shriberg, E.⁴

6
- 21844454996
- Modeling prosodic feature sequences for speaker recognition
- Shriberg, E., Ferrer, L., Kajarekar, S., Venkataraman, A., Stolcke, A., “Modeling prosodic feature sequences for speaker recognition”, Speech Communication, 46(3-4):455-472, 2005.
- (2005) Speech Communication , vol.46 , Issue.3-4 , pp. 455-472
- Shriberg, E.¹ Ferrer, L.² Kajarekar, S.³ Venkataraman, A.⁴ Stolcke, A.⁵

7
- 84939358489
- Measurements of the fundamental period of speech using a delay line
- Miller, R. L. and Weibel, E. S., “Measurements of the fundamental period of speech using a delay line”, J. Acoustical Society of America, 28:761, 1956.
- (1956) J. Acoustical Society of America , vol.28 , pp. 761
- Miller, R.L.¹ Weibel, E.S.²

8
- 0016113915
- The optimum comb method for pitch period analysis of continuous digitized speech
- Moorer, J. A., “The optimum comb method for pitch period analysis of continuous digitized speech”, IEEE Trans. Acoustics, Speech, and Signal Proc., 22(5):330-338, 1974.
- (1974) IEEE Trans. Acoustics, Speech, and Signal Proc. , vol.22 , Issue.5 , pp. 330-338
- Moorer, J.A.¹

9
- 85073106212
- Speech fundamental frequency estimation using the alternate comb
- Antwerpen, Belgium
- Liénard, J.-S., Signol, F., and Barras, C., “Speech fundamental frequency estimation using the alternate comb”, Proc. INTERSPEECH, Antwerpen, Belgium.
- Proc. INTERSPEECH
- Liénard, J.-S.¹ Signol, F.² Barras, C.³

10
- 0004088662
- Prentice Hall, Englewood Cliffs NJ, USA
- Titze, I. R., Principles of Voice Production, Prentice Hall, Englewood Cliffs NJ, USA, 1994.
- (1994) Principles of Voice Production
- Titze, I.R.¹

11
- 0014270962
- Period histogram and product spectrum: New methods for fundamental-frequency measurement
- Shroeder, M. R., “Period histogram and product spectrum: New methods for fundamental-frequency measurement”, J. Acoustical Society of America, 43(4):829-834, 1968.
- (1968) J. Acoustical Society of America , vol.43 , Issue.4 , pp. 829-834
- Shroeder, M.R.¹

12
- 0001835850
- Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound
- Boersma, P., “Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound”, Proc. Institute of Phonetic Sciences, 17:97-110, 1993.
- (1993) Proc. Institute of Phonetic Sciences , vol.17 , pp. 97-110
- Boersma, P.¹

13
- 85009075244
- A pitch determination algorithm based on subharmonic-to-harmonic ratio
- Beijing, China
- Sun, X., “A pitch determination algorithm based on subharmonic-to-harmonic ratio”, Proc. ICSLP, Beijing, China, 2000.
- (2000) Proc. ICSLP
- Sun, X.¹

14
- 84865960393
- Using sets of combs to control pitch estimation errors
- Paris, France
- Liénard, J.-S., Barras, C., and Signol, F., “Using sets of combs to control pitch estimation errors”, Proc. 155th Meeting Acoustical Society of America, Paris, France, 2008.
- (2008) Proc. 155th Meeting Acoustical Society of America
- Liénard, J.-S.¹ Barras, C.² Signol, F.³

15
- 0020319209
- Harmonics-to-noise ratio as an index of the degree of hoarseness
- Yumoto, E. and Gould, W. J., “Harmonics-to-noise ratio as an index of the degree of hoarseness”, J. Acoustical Society of America, 71(6):1544-1550,1982.
- (1982) J. Acoustical Society of America , vol.71 , Issue.6 , pp. 1544-1550
- Yumoto, E.¹ Gould, W.J.²

16
- 0027285715
- A cepstrum-based technique for determining a harmonics-to-noise ratio in speech signals
- de Krom, G., “A cepstrum-based technique for determining a harmonics-to-noise ratio in speech signals”, J. Speech and Hearing Research, 36(2):254-264, 1993.
- (1993) J. Speech and Hearing Research , vol.36 , Issue.2 , pp. 254-264
- De Krom, G.¹

17
- 0030793150
- Temporal and spectral estimations of harmonics-to-noise ratio in human voice signals
- Qi, Y. and Hillman, R. E., “Temporal and spectral estimations of harmonics-to-noise ratio in human voice signals”, J. Acoustical Society of America, 102(1):537-543,1997.
- (1997) J. Acoustical Society of America , vol.102 , Issue.1 , pp. 537-543
- Qi, Y.¹ Hillman, R.E.²

18
- 51449093800
- An instantaneous vector representation of delta pitch for speaker-change prediction in conversational dialogue systems
- Las Vegas NV, USA
- Laskowski, K., Edlund, J., and Heldner, M., “An instantaneous vector representation of delta pitch for speaker-change prediction in conversational dialogue systems”, Proc. ICASSP, Las Vegas NV, USA, pp. 5041-5044, 2008.
- (2008) Proc. ICASSP , pp. 5041-5044
- Laskowski, K.¹ Edlund, J.² Heldner, M.³

19
- 70349209406
- Modeling instantaneous intonation for speaker identification using the fundamental frequency variation spectrum
- Taipei, Taiwan
- Laskowski, K. and Jin, Q., “Modeling instantaneous intonation for speaker identification using the fundamental frequency variation spectrum”, Proc. ICASSP, Taipei, Taiwan, pp. 4541-4544, 2009.
- (2009) Proc. ICASSP , pp. 4541-4544
- Laskowski, K.¹ Jin, Q.²

20
- 78049356203
- Comparing the contributions of context and prosody in text-independent dialog act recognition
- Dallas TX, USA
- Laskowski, K., and Shriberg, E., “Comparing the contributions of context and prosody in text-independent dialog act recognition”, Proc. ICASSP, Dallas TX, USA, pp. 5374-5377, 2010.
- (2010) Proc. ICASSP , pp. 5374-5377
- Laskowski, K.¹ Shriberg, E.²

21
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- Davis, S. B. and Mermelstein, P., “Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences”, IEEE Trans. on Acoustics, Speech, and Signal Processing, 24(8):357-366, 1980.
- (1980) IEEE Trans. On Acoustics, Speech, and Signal Processing , vol.24 , Issue.8 , pp. 357-366
- Davis, S.B.¹ Mermelstein, P.²

22
- 84955035459
- A scale for the measurement of the psychological magnitude of pitch
- Stevens, S. S., Volkmann, J., and Newman, E. B., “A scale for the measurement of the psychological magnitude of pitch”, J. Acoustical Society of America, 8(3):185-190, 1937.
- (1937) J. Acoustical Society of America , vol.8 , Issue.3 , pp. 185-190
- Stevens, S.S.¹ Volkmann, J.² Newman, E.B.³

23
- 0031036983
- The mel scale's disqualifying bias and a consistency of pitch-difference equisections in 1956 with equal cochlear distances and equal frequency ratios
- Greenwood, D. D., “The Mel Scale's disqualifying bias and a consistency of pitch-difference equisections in 1956 with equal cochlear distances and equal frequency ratios”, J. Hearing Research, 103(1-2):199-224, 1997.
- (1997) J. Hearing Research , vol.103 , Issue.1-2 , pp. 199-224
- Greenwood, D.D.¹

24
- 33644642389
- Effects of coiling on the micromechanics of the mammalian cochlea
- Cai, H., Manoussaki, D., and Chadwick, R., “Effects of coiling on the micromechanics of the mammalian cochlea”, J. Royal Society Interface, 2:341-348, 2005.
- (2005) J. Royal Society Interface , vol.2 , pp. 341-348
- Cai, H.¹ Manoussaki, D.² Chadwick, R.³

25
- 0027925808
- Some general comments on the evolution and design of animal communication systems
- Endler, J. A., “Some general comments on the evolution and design of animal communication systems”, Philosophical Transactions, 340(1292):215-225, 1993.
- (1993) Philosophical Transactions , vol.340 , Issue.1292 , pp. 215-225
- Endler, J.A.¹

26
- 84893564226
- CSR-I (WSJ0) complete
- LDC93S6A
- Garofolo, J., Graff, D., Paul, D., and Pallett, D. “CSR-I (WSJ0) Complete”, Linguistic Data Consortium, vol. LDC93S6A, 2007.
- (2007) Linguistic Data Consortium
- Garofolo, J.¹ Graff, D.² Paul, D.³ Pallett, D.⁴

27
- 80051632172
- CSR-II (WSJ1) complete
- LDC94S13A
- “CSR-II (WSJ1) Complete”, Linguistic Data Consortium, vol. LDC94S13A, 1994.
- (1994) Linguistic Data Consortium

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.