메뉴 건너뛰기




Volumn 120, Issue 1, 2006, Pages 443-452

Speech feature extraction method using subband-based periodicity and nonperiodicity decomposition

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTIC NOISE; DECOMPOSITION; SENSORY PERCEPTION; SPEECH PROCESSING; SPEECH RECOGNITION; SPEECH SYNTHESIS;

EID: 33745738849     PISSN: 00014966     EISSN: None     Source Type: Journal    
DOI: 10.1121/1.2205131     Document Type: Article
Times cited : (9)

References (48)
  • 1
    • 0030037151 scopus 로고    scopus 로고
    • Cepstral representation of speech motivated by time-frequency masking: An application to speech recognition
    • Aikawa, K., Singer, H., Kawahara, H., and Tohkura, Y. (1996). "Cepstral representation of speech motivated by time-frequency masking: An application to speech recognition," J. Acoust. Soc. Am. 100, 603-614.
    • (1996) J. Acoust. Soc. Am. , vol.100 , pp. 603-614
    • Aikawa, K.1    Singer, H.2    Kawahara, H.3    Tohkura, Y.4
  • 2
    • 0036649309 scopus 로고    scopus 로고
    • Robust auditory-based speech processing using the average localized synchrony detection
    • Ali, A. M., Spiegel, J. V., and Mueller, P. (2002). "Robust auditory-based speech processing using the average localized synchrony detection," IEEE Trans. Speech Audio Process. 10, 279-292.
    • (2002) IEEE Trans. Speech Audio Process. , vol.10 , pp. 279-292
    • Ali, A.M.1    Spiegel, J.V.2    Mueller, P.3
  • 3
    • 0016067897 scopus 로고
    • Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
    • Atal, B. S. (1974). "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," J. Acoust. Soc. Am. 55, 1304-1312.
    • (1974) J. Acoust. Soc. Am. , vol.55 , pp. 1304-1312
    • Atal, B.S.1
  • 5
    • 0002161311 scopus 로고
    • The quefrency analysis of time series for echoes
    • edited by M. Rosenblatt (Wiley, New York)
    • Bogert, B., Healy, M., and Tukey, J. (1963). "The quefrency analysis of time series for echoes," in Proc. Symp. on Time Series Analysis, edited by M. Rosenblatt (Wiley, New York), pp. 209-243.
    • (1963) Proc. Symp. on Time Series Analysis , pp. 209-243
    • Bogert, B.1    Healy, M.2    Tukey, J.3
  • 6
    • 0018455310 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • Boll, S. (1979). "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Process. ASSP-27, 113-210.
    • (1979) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-27 , pp. 113-210
    • Boll, S.1
  • 7
    • 0001698589 scopus 로고
    • Auditory grouping
    • edited by B. C. J. Moore (Academic, San Diego)
    • Darwin, C. J., and Carlyon, R.P. (1995). "Auditory grouping," in Hearing, edited by B. C. J. Moore (Academic, San Diego), pp. 387-424.
    • (1995) Hearing , pp. 387-424
    • Darwin, C.J.1    Carlyon, R.P.2
  • 8
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Davis, S. B., and Mermelstein, P. (1980). "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Process. ASSP-28, 357-366.
    • (1980) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-28 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 9
    • 0027298253 scopus 로고
    • Separation of concurrent harmonic sounds: Fundamental frequency estimation and a time-domain cancellation model of auditory processing
    • de Cheveigné, A. (1993). "Separation of concurrent harmonic sounds: Fundamental frequency estimation and a time-domain cancellation model of auditory processing," J. Acoust. Soc. Am. 93, 3271-3290.
    • (1993) J. Acoust. Soc. Am. , vol.93 , pp. 3271-3290
    • De Cheveigné, A.1
  • 10
    • 0031009840 scopus 로고    scopus 로고
    • Concurrent vowel identification. III. A neural model of harmonic interference cancellation
    • de Cheveigné, A. (1997). "Concurrent vowel identification. III. A neural model of harmonic interference cancellation," J. Acoust. Soc. Am. 101, 2857-2865.
    • (1997) J. Acoust. Soc. Am. , vol.101 , pp. 2857-2865
    • De Cheveigné, A.1
  • 11
    • 0011902657 scopus 로고    scopus 로고
    • Acoustic features and distance measure to reduce vulnerability of ASR performance due to the presence of a communication channel and/or background noise
    • edited by J.-C. Junqua and G. van Noord (Kluwer Academic, Dordrecht, Netherlands)
    • de Veth, J., Cranen, B., and Boves, L. (2001). "Acoustic features and distance measure to reduce vulnerability of ASR performance due to the presence of a communication channel and/or background noise," in Robustness in Language and Speech Technology, edited by J.-C. Junqua and G. van Noord (Kluwer Academic, Dordrecht, Netherlands), pp. 9-45.
    • (2001) Robustness in Language and Speech Technology , pp. 9-45
    • De Veth, J.1    Cranen, B.2    Boves, L.3
  • 14
    • 84928838192 scopus 로고
    • Temporal non-place information in the auditory nerve firing patterns as a front-end for speech recognition in a noisy environment
    • Ghitza, O. (1988). "Temporal non-place information in the auditory nerve firing patterns as a front-end for speech recognition in a noisy environment," J. Phonetics 16, 109-124.
    • (1988) J. Phonetics , vol.16 , pp. 109-124
    • Ghitza, O.1
  • 15
    • 0028312802 scopus 로고
    • Auditory models and human performance in tasks related to speech coding and speech recognition
    • Ghitza, O. (1994). "Auditory models and human performance in tasks related to speech coding and speech recognition," IEEE Trans. Speech Audio Process. 2, 115-132.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , pp. 115-132
    • Ghitza, O.1
  • 16
    • 0029288202 scopus 로고
    • Speech recognition in noisy environments: A survey
    • Gong, Y. (1995). "Speech recognition in noisy environments: A survey," Speech Commun. 16, 261-291.
    • (1995) Speech Commun. , vol.16 , pp. 261-291
    • Gong, Y.1
  • 17
    • 33745742672 scopus 로고    scopus 로고
    • Speech processing in the auditory system: An overview
    • edited by S. Greenberg, W. A. Ainsworth, A. N. Popper, and R. R. Fay (Springer-Verlag, New York)
    • Greenberg, S. (2004). "Speech processing in the auditory system: An overview," in Speech Processing in the Auditory System, edited by S. Greenberg, W. A. Ainsworth, A. N. Popper, and R. R. Fay (Springer-Verlag, New York).
    • (2004) Speech Processing in the Auditory System
    • Greenberg, S.1
  • 18
    • 0025041264 scopus 로고
    • Perceptual Linear Predictive (PLP) analysis of speech
    • Hermansky, H. (1990). "Perceptual Linear Predictive (PLP) analysis of speech," J. Acoust. Soc. Am. 87, 1738-1752.
    • (1990) J. Acoust. Soc. Am. , vol.87 , pp. 1738-1752
    • Hermansky, H.1
  • 19
    • 0022112505 scopus 로고
    • Low-dimensional representation of vowels based on all-pole modeling in the psychophysical domain
    • Hermansky, H., Hanson, B., and Wakita, H. (1985). "Low-dimensional representation of vowels based on all-pole modeling in the psychophysical domain," Speech Commun. 4, 181-187.
    • (1985) Speech Commun. , vol.4 , pp. 181-187
    • Hermansky, H.1    Hanson, B.2    Wakita, H.3
  • 20
    • 33745741500 scopus 로고
    • Short-term analysis pitch determination
    • Springer-Verlag, New York
    • Hess, W. (1983). "Short-term analysis pitch determination," in Pitch Determination of Speech Signals (Springer-Verlag, New York).
    • (1983) Pitch Determination of Speech Signals
    • Hess, W.1
  • 22
    • 84870238333 scopus 로고    scopus 로고
    • Speech signal representation
    • Prentice-Hall, Englewood Cliffs, NJ
    • Huang, X., Acero, A., and Hon, H. (2001). "Speech signal representation," in Spoken Language Processing (Prentice-Hall, Englewood Cliffs, NJ).
    • (2001) Spoken Language Processing
    • Huang, X.1    Acero, A.2    Hon, H.3
  • 24
    • 0016467604 scopus 로고
    • Minimum prediction residual principle applied to speech recognition
    • Itakura, F. (1975). "Minimum prediction residual principle applied to speech recognition," IEEE Trans. Acoust., Speech, Signal Process. ASSP-23, 67-72.
    • (1975) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-23 , pp. 67-72
    • Itakura, F.1
  • 25
    • 0035472456 scopus 로고    scopus 로고
    • Pitch-scaled estimation of simultaneous voiced and turbulence-noise components in speech
    • Jackson, P. J. B., and Shadle, C. H. (2001). "Pitch-scaled estimation of simultaneous voiced and turbulence-noise components in speech," IEEE Trans. Speech Audio Process. 9, 713-726.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , pp. 713-726
    • Jackson, P.J.B.1    Shadle, C.H.2
  • 27
    • 0019142263 scopus 로고
    • The relationship between spike rate and synchrony in responses of auditory-nerve fibers to single tones
    • Johnson, D. H. (1980). "The relationship between spike rate and synchrony in responses of auditory-nerve fibers to single tones," J. Acoust. Soc. Am. 68, 1115-1122.
    • (1980) J. Acoust. Soc. Am. , vol.68 , pp. 1115-1122
    • Johnson, D.H.1
  • 29
    • 0032785783 scopus 로고    scopus 로고
    • Auditory processing of speech signals for robust speech recognition in real-world noisy environments
    • Kim, D. S., Lee, S. Y., and Kil, R. M. (1999). "Auditory processing of speech signals for robust speech recognition in real-world noisy environments," IEEE Trans. Speech Audio Process. 7, 55-69.
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , pp. 55-69
    • Kim, D.S.1    Lee, S.Y.2    Kil, R.M.3
  • 31
    • 0026142334 scopus 로고
    • A study on speaker adaptation of the parameters of continuous density hidden Markov models
    • Lee, C.-H., Lin, C.-H., and Juang, B.-H. (1991). "A study on speaker adaptation of the parameters of continuous density hidden Markov models," IEEE Trans. Signal Process. 39, 806-814.
    • (1991) IEEE Trans. Signal Process. , vol.39 , pp. 806-814
    • Lee, C.-H.1    Lin, C.-H.2    Juang, B.-H.3
  • 34
    • 0026882842 scopus 로고
    • Experiments with a Nonlinear Spectral Subtractor (NSS), hidden Markov models and the projection, for robust speech recognition in cars
    • Lockwood, P., and Boudy, J. (1992). "Experiments with a Nonlinear Spectral Subtractor (NSS), hidden Markov models and the projection, for robust speech recognition in cars," Speech Commun. 11, 215-228.
    • (1992) Speech Commun. , vol.11 , pp. 215-228
    • Lockwood, P.1    Boudy, J.2
  • 36
    • 0020816083 scopus 로고
    • Suggested formula for calculating auditory-filter bandwidths and excitation patterns
    • Moore, B. C. J., and Glasberg, B. R. (1983). "Suggested formula for calculating auditory-filter bandwidths and excitation patterns," J. Acoust. Soc. Am. 74, 750-753.
    • (1983) J. Acoust. Soc. Am. , vol.74 , pp. 750-753
    • Moore, B.C.J.1    Glasberg, B.R.2
  • 39
    • 0016938506 scopus 로고
    • Auditory filter shapes derived with noise stimuli
    • Patterson, R. D. (1976). "Auditory filter shapes derived with noise stimuli," J. Acoust. Soc. Am. 59, 640-654.
    • (1976) J. Acoust. Soc. Am. , vol.59 , pp. 640-654
    • Patterson, R.D.1
  • 40
    • 0001050571 scopus 로고
    • Auditory filters and excitation patterns as representations of frequency resolution
    • edited by B. C. J. Moore (Academic, London)
    • Patterson, R. D., and Moore, B. C. J. (1986). "Auditory filters and excitation patterns as representations of frequency resolution," in Frequency Selectivity in Hearing, edited by B. C. J. Moore (Academic, London), pp. 123-177.
    • (1986) Frequency Selectivity in Hearing , pp. 123-177
    • Patterson, R.D.1    Moore, B.C.J.2
  • 41
    • 84987702417 scopus 로고    scopus 로고
    • The AURORA experimental framework for the performance evaluation of speech recognition systems under noise conditions
    • Pearce, D., and Hirsh, H. G. (2000). "The AURORA experimental framework for the performance evaluation of speech recognition systems under noise conditions," Proc. of the 6th International Conference on Spoken Language Processing (ICSLP), Vol. 4, pp. 29-32.
    • (2000) Proc. of the 6th International Conference on Spoken Language Processing (ICSLP) , vol.4 , pp. 29-32
    • Pearce, D.1    Hirsh, H.G.2
  • 42
    • 0017367712 scopus 로고
    • On the use of autocorrelation analysis for pitch detection
    • Rabiner, L. R. (1977). "On the use of autocorrelation analysis for pitch detection," IEEE Trans. Acoust., Speech, Signal Process. 25, 24-33.
    • (1977) IEEE Trans. Acoust., Speech, Signal Process. , vol.25 , pp. 24-33
    • Rabiner, L.R.1
  • 43
    • 0015084215 scopus 로고
    • Some effects of the stimulus intensity on response of auditory nerve fibers in the squirrel monkey
    • Rose, J. E., Hind, J. E., Anderson, D. J., and Brugge, J. F. (1971). "Some effects of the stimulus intensity on response of auditory nerve fibers in the squirrel monkey," J. Neurophysiol. 34, 685-699.
    • (1971) J. Neurophysiol. , vol.34 , pp. 685-699
    • Rose, J.E.1    Hind, J.E.2    Anderson, D.J.3    Brugge, J.F.4
  • 44
    • 84928837806 scopus 로고
    • A joint synchrony/mean-rate model of auditory speech processing
    • Seneff, S. (1988). "A joint synchrony/mean-rate model of auditory speech processing," J. Phonetics 16, 55-76.
    • (1988) J. Phonetics , vol.16 , pp. 55-76
    • Seneff, S.1
  • 45
    • 0032123832 scopus 로고    scopus 로고
    • A parametric formulation of the generalized spectral subtraction method
    • Sim, B. L., Tong, Y. C., Chang, J. S., and Tan, C. T. (1998). "A parametric formulation of the generalized spectral subtraction method," IEEE Trans. Speech Audio Process. 6, 328-337.
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , pp. 328-337
    • Sim, B.L.1    Tong, Y.C.2    Chang, J.S.3    Tan, C.T.4
  • 46
    • 85009217371 scopus 로고    scopus 로고
    • Signal and feature compensation methods for robust speech recognition
    • edited by G. M. Davis (CRC, Boca Raton, FL)
    • Singh, R., Stern, R. M., and Raj, B. (2002). "Signal and feature compensation methods for robust speech recognition," in Noise Reduction in Speech Applications, edited by G. M. Davis (CRC, Boca Raton, FL), pp. 219-244.
    • (2002) Noise Reduction in Speech Applications , pp. 219-244
    • Singh, R.1    Stern, R.M.2    Raj, B.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.