메뉴 건너뛰기




Volumn , Issue , 2010, Pages 12-19

Modeling prosody for speaker recognition: Why estimating pitch may be a red herring

Author keywords

[No Author keywords available]

Indexed keywords

HARMONIC ANALYSIS; SPEECH PROCESSING;

EID: 85073112188     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (2)

References (27)
  • 1
    • 0029209272 scopus 로고
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • Reynolds, D. and Rose, R., “Robust text-independent speaker identification using Gaussian mixture speaker models”, IEEE Trans. Speech and Audio Processing, 3:72-83, 1995.
    • (1995) IEEE Trans. Speech and Audio Processing , vol.3 , pp. 72-83
    • Reynolds, D.1    Rose, R.2
  • 2
    • 0033884858 scopus 로고    scopus 로고
    • Speaker verification using adapted Gaussian mixture models
    • Reynolds, D., Quatieri, T., Dunn, R., “Speaker verification using adapted Gaussian mixture models”, Digital Signal Processing, 10(1-3):19-41, 2000.
    • (2000) Digital Signal Processing , vol.10 , Issue.1-3 , pp. 19-41
    • Reynolds, D.1    Quatieri, T.2    Dunn, R.3
  • 3
    • 85009124414 scopus 로고    scopus 로고
    • Speaker recognition based on idiolectal differences between speakers
    • Doddington, G., “Speaker recognition based on idiolectal differences between speakers”, Proc. EUROSPEECH, 2001.
    • (2001) Proc. EUROSPEECH
    • Doddington, G.1
  • 4
    • 0141521592 scopus 로고    scopus 로고
    • Modeling prosodic dynamics for speaker recognition
    • Adami, A., Mihaescu, R., Reynolds, D., and Godfrey, J., “Modeling prosodic dynamics for speaker recognition”, Proc. ICASSP, 788-791, 2003.
    • (2003) Proc. ICASSP , pp. 788-791
    • Adami, A.1    Mihaescu, R.2    Reynolds, D.3    Godfrey, J.4
  • 5
    • 0036299157 scopus 로고    scopus 로고
    • Using prosodic and lexical information for speaker identification
    • Weber, F., Manganaro, L., Peskin, B., Shriberg, E., “Using prosodic and lexical information for speaker identification,” Proc. ICASSP, 1:141-144, 2002.
    • (2002) Proc. ICASSP , vol.1 , pp. 141-144
    • Weber, F.1    Manganaro, L.2    Peskin, B.3    Shriberg, E.4
  • 7
    • 84939358489 scopus 로고
    • Measurements of the fundamental period of speech using a delay line
    • Miller, R. L. and Weibel, E. S., “Measurements of the fundamental period of speech using a delay line”, J. Acoustical Society of America, 28:761, 1956.
    • (1956) J. Acoustical Society of America , vol.28 , pp. 761
    • Miller, R.L.1    Weibel, E.S.2
  • 8
    • 0016113915 scopus 로고
    • The optimum comb method for pitch period analysis of continuous digitized speech
    • Moorer, J. A., “The optimum comb method for pitch period analysis of continuous digitized speech”, IEEE Trans. Acoustics, Speech, and Signal Proc., 22(5):330-338, 1974.
    • (1974) IEEE Trans. Acoustics, Speech, and Signal Proc. , vol.22 , Issue.5 , pp. 330-338
    • Moorer, J.A.1
  • 9
    • 85073106212 scopus 로고    scopus 로고
    • Speech fundamental frequency estimation using the alternate comb
    • Antwerpen, Belgium
    • Liénard, J.-S., Signol, F., and Barras, C., “Speech fundamental frequency estimation using the alternate comb”, Proc. INTERSPEECH, Antwerpen, Belgium.
    • Proc. INTERSPEECH
    • Liénard, J.-S.1    Signol, F.2    Barras, C.3
  • 11
    • 0014270962 scopus 로고
    • Period histogram and product spectrum: New methods for fundamental-frequency measurement
    • Shroeder, M. R., “Period histogram and product spectrum: New methods for fundamental-frequency measurement”, J. Acoustical Society of America, 43(4):829-834, 1968.
    • (1968) J. Acoustical Society of America , vol.43 , Issue.4 , pp. 829-834
    • Shroeder, M.R.1
  • 12
    • 0001835850 scopus 로고
    • Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound
    • Boersma, P., “Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound”, Proc. Institute of Phonetic Sciences, 17:97-110, 1993.
    • (1993) Proc. Institute of Phonetic Sciences , vol.17 , pp. 97-110
    • Boersma, P.1
  • 13
    • 85009075244 scopus 로고    scopus 로고
    • A pitch determination algorithm based on subharmonic-to-harmonic ratio
    • Beijing, China
    • Sun, X., “A pitch determination algorithm based on subharmonic-to-harmonic ratio”, Proc. ICSLP, Beijing, China, 2000.
    • (2000) Proc. ICSLP
    • Sun, X.1
  • 15
    • 0020319209 scopus 로고
    • Harmonics-to-noise ratio as an index of the degree of hoarseness
    • Yumoto, E. and Gould, W. J., “Harmonics-to-noise ratio as an index of the degree of hoarseness”, J. Acoustical Society of America, 71(6):1544-1550,1982.
    • (1982) J. Acoustical Society of America , vol.71 , Issue.6 , pp. 1544-1550
    • Yumoto, E.1    Gould, W.J.2
  • 16
    • 0027285715 scopus 로고
    • A cepstrum-based technique for determining a harmonics-to-noise ratio in speech signals
    • de Krom, G., “A cepstrum-based technique for determining a harmonics-to-noise ratio in speech signals”, J. Speech and Hearing Research, 36(2):254-264, 1993.
    • (1993) J. Speech and Hearing Research , vol.36 , Issue.2 , pp. 254-264
    • De Krom, G.1
  • 17
    • 0030793150 scopus 로고    scopus 로고
    • Temporal and spectral estimations of harmonics-to-noise ratio in human voice signals
    • Qi, Y. and Hillman, R. E., “Temporal and spectral estimations of harmonics-to-noise ratio in human voice signals”, J. Acoustical Society of America, 102(1):537-543,1997.
    • (1997) J. Acoustical Society of America , vol.102 , Issue.1 , pp. 537-543
    • Qi, Y.1    Hillman, R.E.2
  • 18
    • 51449093800 scopus 로고    scopus 로고
    • An instantaneous vector representation of delta pitch for speaker-change prediction in conversational dialogue systems
    • Las Vegas NV, USA
    • Laskowski, K., Edlund, J., and Heldner, M., “An instantaneous vector representation of delta pitch for speaker-change prediction in conversational dialogue systems”, Proc. ICASSP, Las Vegas NV, USA, pp. 5041-5044, 2008.
    • (2008) Proc. ICASSP , pp. 5041-5044
    • Laskowski, K.1    Edlund, J.2    Heldner, M.3
  • 19
    • 70349209406 scopus 로고    scopus 로고
    • Modeling instantaneous intonation for speaker identification using the fundamental frequency variation spectrum
    • Taipei, Taiwan
    • Laskowski, K. and Jin, Q., “Modeling instantaneous intonation for speaker identification using the fundamental frequency variation spectrum”, Proc. ICASSP, Taipei, Taiwan, pp. 4541-4544, 2009.
    • (2009) Proc. ICASSP , pp. 4541-4544
    • Laskowski, K.1    Jin, Q.2
  • 20
    • 78049356203 scopus 로고    scopus 로고
    • Comparing the contributions of context and prosody in text-independent dialog act recognition
    • Dallas TX, USA
    • Laskowski, K., and Shriberg, E., “Comparing the contributions of context and prosody in text-independent dialog act recognition”, Proc. ICASSP, Dallas TX, USA, pp. 5374-5377, 2010.
    • (2010) Proc. ICASSP , pp. 5374-5377
    • Laskowski, K.1    Shriberg, E.2
  • 21
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Davis, S. B. and Mermelstein, P., “Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences”, IEEE Trans. on Acoustics, Speech, and Signal Processing, 24(8):357-366, 1980.
    • (1980) IEEE Trans. On Acoustics, Speech, and Signal Processing , vol.24 , Issue.8 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 22
    • 84955035459 scopus 로고
    • A scale for the measurement of the psychological magnitude of pitch
    • Stevens, S. S., Volkmann, J., and Newman, E. B., “A scale for the measurement of the psychological magnitude of pitch”, J. Acoustical Society of America, 8(3):185-190, 1937.
    • (1937) J. Acoustical Society of America , vol.8 , Issue.3 , pp. 185-190
    • Stevens, S.S.1    Volkmann, J.2    Newman, E.B.3
  • 23
    • 0031036983 scopus 로고    scopus 로고
    • The mel scale's disqualifying bias and a consistency of pitch-difference equisections in 1956 with equal cochlear distances and equal frequency ratios
    • Greenwood, D. D., “The Mel Scale's disqualifying bias and a consistency of pitch-difference equisections in 1956 with equal cochlear distances and equal frequency ratios”, J. Hearing Research, 103(1-2):199-224, 1997.
    • (1997) J. Hearing Research , vol.103 , Issue.1-2 , pp. 199-224
    • Greenwood, D.D.1
  • 24
    • 33644642389 scopus 로고    scopus 로고
    • Effects of coiling on the micromechanics of the mammalian cochlea
    • Cai, H., Manoussaki, D., and Chadwick, R., “Effects of coiling on the micromechanics of the mammalian cochlea”, J. Royal Society Interface, 2:341-348, 2005.
    • (2005) J. Royal Society Interface , vol.2 , pp. 341-348
    • Cai, H.1    Manoussaki, D.2    Chadwick, R.3
  • 25
    • 0027925808 scopus 로고
    • Some general comments on the evolution and design of animal communication systems
    • Endler, J. A., “Some general comments on the evolution and design of animal communication systems”, Philosophical Transactions, 340(1292):215-225, 1993.
    • (1993) Philosophical Transactions , vol.340 , Issue.1292 , pp. 215-225
    • Endler, J.A.1
  • 27
    • 80051632172 scopus 로고
    • CSR-II (WSJ1) complete
    • LDC94S13A
    • “CSR-II (WSJ1) Complete”, Linguistic Data Consortium, vol. LDC94S13A, 1994.
    • (1994) Linguistic Data Consortium


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.