메뉴 건너뛰기




Volumn 41, Issue 2-3, 2003, Pages 469-484

Cepstrum derived from differentiated power spectrum for robust speech recognition

Author keywords

Cepstral mean normalization; Differential power spectrum; Hidden Markov model; Linear liftering; Robust speech recognition; Spectral subtraction

Indexed keywords

MATHEMATICAL TRANSFORMATIONS; NONLINEAR EQUATIONS; ROBUSTNESS (CONTROL SYSTEMS); SPEECH RECOGNITION;

EID: 0038373389     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/S0167-6393(03)00016-5     Document Type: Article
Times cited : (44)

References (36)
  • 1
    • 85009152845 scopus 로고    scopus 로고
    • Recognition performance of the Siemens front-end with and without frame dropping on the AURORA 2 database
    • Scandinavia
    • Andrassy, B., Vlaj, D., Beaugeant, C., 2001. Recognition performance of the Siemens front-end with and without frame dropping on the AURORA 2 database. Proc. EUROSPEECH, Scandinavia, pp. 193-196.
    • (2001) Proc. EUROSPEECH , pp. 193-196
    • Andrassy, B.1    Vlaj, D.2    Beaugeant, C.3
  • 2
    • 0018455310 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • Boll S.F. Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans. Acoustics, Speech Signal Process. 27(2):1979;113-120.
    • (1979) IEEE Trans. Acoustics, Speech Signal Process. , vol.27 , Issue.2 , pp. 113-120
    • Boll, S.F.1
  • 3
    • 0030355935 scopus 로고    scopus 로고
    • A new ASR approach based on independent processing and recombination of partial frequency bands
    • Philadelphia
    • Bourlard, H., Dupont, S., 1996. A new ASR approach based on independent processing and recombination of partial frequency bands. Proc. ICSLP, Philadelphia, pp. 426-429.
    • (1996) Proc. ICSLP , pp. 426-429
    • Bourlard, H.1    Dupont, S.2
  • 4
    • 85009106589 scopus 로고    scopus 로고
    • Sub-band based additive noise removal for robust speech recognition
    • Scandinavia
    • Chen, J., Paliwal, K.K., Nakamura, S., 2001. Sub-band based additive noise removal for robust speech recognition. Proc. EUROSPEECH, Scandinavia, pp. 571-574.
    • (2001) Proc. EUROSPEECH , pp. 571-574
    • Chen, J.1    Paliwal, K.K.2    Nakamura, S.3
  • 5
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Davis S.B., Mermelstein P. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans. Acoustics, Speech Signal Process. 28:1980;357-366.
    • (1980) IEEE Trans. Acoustics, Speech Signal Process. , vol.28 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 6
    • 85006734596 scopus 로고    scopus 로고
    • Evaluation of the SPLICE algorithm on the Aurora2 database
    • Scandinavia
    • Droppo, J., Deng, L., Acero, A., 2001. Evaluation of the SPLICE algorithm on the Aurora2 database. Proc. EUROSPEECH, Scandinavia, pp. 217-220.
    • (2001) Proc. EUROSPEECH , pp. 217-220
    • Droppo, J.1    Deng, L.2    Acero, A.3
  • 7
    • 0022667694 scopus 로고
    • Speaker-independent isolated work recognition using dynamic features of speech spectrum
    • Furui S. Speaker-independent isolated work recognition using dynamic features of speech spectrum. IEEE Trans. Acoustics, Speech Signal Process. 34(1):1986;52-89.
    • (1986) IEEE Trans. Acoustics, Speech Signal Process. , vol.34 , Issue.1 , pp. 52-89
    • Furui, S.1
  • 8
    • 0030245128 scopus 로고    scopus 로고
    • Robust speech recognition using parallel model combination
    • Gales M.J.F., Young S.J. Robust speech recognition using parallel model combination. IEEE Trans. Speech Audio Process. 4(5):1996;352-359.
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.5 , pp. 352-359
    • Gales, M.J.F.1    Young, S.J.2
  • 10
    • 0025041264 scopus 로고
    • Perceptual linear predictive (PLP) analysis of speech
    • Hermansky H. Perceptual linear predictive (PLP) analysis of speech. J. Acoustic. Soc. Am. 87(4):1990;1738-1752.
    • (1990) J. Acoustic. Soc. Am. , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 12
    • 85135377175 scopus 로고
    • Compensation for the effect of the communication channel in auditory-like analysis of speech (RASTA-PLP)
    • Genova
    • Hermansky, H., Morgan, N., Bayya, A., Kohn, P. 1991. Compensation for the effect of the communication channel in auditory-like analysis of speech (RASTA-PLP). Proc. EUROSPEECH, Genova, pp. 1367-1370.
    • (1991) Proc. EUROSPEECH , pp. 1367-1370
    • Hermansky, H.1    Morgan, N.2    Bayya, A.3    Kohn, P.4
  • 13
    • 0011823639 scopus 로고
    • Improved speech recognition using high-pass filtering of subband envelopes
    • Genova
    • Hirsch, H., Meyer, P., Ruehl, H., 1991. Improved speech recognition using high-pass filtering of subband envelopes. Proc. EUROSPEECH, Genova, pp. 413-416.
    • (1991) Proc. EUROSPEECH , pp. 413-416
    • Hirsch, H.1    Meyer, P.2    Ruehl, H.3
  • 14
    • 0038669544 scopus 로고    scopus 로고
    • The AURORA experimental framework for the performance evaluation of speech recognition systems under noisy conditions
    • Paris, France
    • Hirsch, H.G., 2000. The AURORA experimental framework for the performance evaluation of speech recognition systems under noisy conditions. Proc. ISCA ASR2000, Paris, France.
    • (2000) Proc. ISCA ASR2000
    • Hirsch, H.G.1
  • 17
    • 0032785783 scopus 로고    scopus 로고
    • Auditory processing of speech signals for robustness speech recognition in real-world noisy environments
    • Kim D.-S., Lee S.-Y., Kil R.M. Auditory processing of speech signals for robustness speech recognition in real-world noisy environments. IEEE Trans. Speech Audio Process. 7(1):1999;55-69.
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.1 , pp. 55-69
    • Kim, D.-S.1    Lee, S.-Y.2    Kil, R.M.3
  • 18
    • 85009085054 scopus 로고    scopus 로고
    • A multiconditional robust front-end feature extraction with a noise reduction procedure based on improved spectral subtraction algorithm
    • Scandinavia
    • Kotnik, B., Kacic, Z., Horvat, B., 2001. A multiconditional robust front-end feature extraction with a noise reduction procedure based on improved spectral subtraction algorithm. Proc. EUROSPEECH, Scandinavia, pp. 197-200.
    • (2001) Proc. EUROSPEECH , pp. 197-200
    • Kotnik, B.1    Kacic, Z.2    Horvat, B.3
  • 19
    • 0002583871 scopus 로고
    • Speech database development: Design and analysis of the acoustic-phonetic corpus
    • Palo Alto
    • Lamel, L.F., Kassel, H.K., Seneft, S., 1986. Speech database development: Design and analysis of the acoustic-phonetic corpus. Proc. DARPA Speech Recognition Workshop, Palo Alto, pp. 100-109.
    • (1986) Proc. DARPA Speech Recognition Workshop , pp. 100-109
    • Lamel, L.F.1    Kassel, H.K.2    Seneft, S.3
  • 20
    • 0024768209 scopus 로고
    • Speaker-independent phone recognition using hidden Markov models
    • Lee K.-F., Hon H.-W. Speaker-independent phone recognition using hidden Markov models. IEEE Trans. Acoustics, Speech Signal Process. 37(11):1989;1641-1648.
    • (1989) IEEE Trans. Acoustics, Speech Signal Process. , vol.37 , Issue.11 , pp. 1641-1648
    • Lee, K.-F.1    Hon, H.-W.2
  • 21
    • 0029725301 scopus 로고    scopus 로고
    • A vector Taylor series approach for environment independent speech recognition
    • Philadelphia, PA
    • Moreno, P.J., Raj, B., Stern, R.M., 1996. A vector Taylor series approach for environment independent speech recognition. Proc. ICSLP, Philadelphia, PA, pp. 733-736.
    • (1996) Proc. ICSLP , pp. 733-736
    • Moreno, P.J.1    Raj, B.2    Stern, R.M.3
  • 22
    • 84893207073 scopus 로고
    • Continuous speech recognition in noise using spectral subtraction and HMM adaptation
    • Adelaide, Australia
    • Nolazco Flores, J.A., Young, S.J., 1994. Continuous speech recognition in noise using spectral subtraction and HMM adaptation. Proc. ICASSP, Adelaide, Australia, pp. 409-412.
    • (1994) Proc. ICASSP , pp. 409-412
    • Nolazco Flores, J.A.1    Young, S.J.2
  • 23
    • 85135109228 scopus 로고
    • Speaker adaptation based on transfer vector field smoothing technique
    • Banff, Canada
    • Ohkura, K., Sugiyama, M., Sagayama, S., 1992. Speaker adaptation based on transfer vector field smoothing technique. Proc. ICSLP, Banff, Canada, pp. 369-372.
    • (1992) Proc. ICSLP , pp. 369-372
    • Ohkura, K.1    Sugiyama, M.2    Sagayama, S.3
  • 24
    • 0020165569 scopus 로고
    • On the performance of the frequency-weighted cepstral coefficients in vowel recognition
    • Paliwal K.K. On the performance of the frequency-weighted cepstral coefficients in vowel recognition. Speech Commun. 18:1992;151-154.
    • (1992) Speech Commun. , vol.18 , pp. 151-154
    • Paliwal, K.K.1
  • 25
    • 0038338247 scopus 로고    scopus 로고
    • Decorrelated and liftered filter-bank energies for robust speech recognition
    • Budapest
    • Paliwal, K.K., 1999. Decorrelated and liftered filter-bank energies for robust speech recognition. Proc. EUROPSEECH, Budapest, pp. 85-88.
    • (1999) Proc. EUROPSEECH , pp. 85-88
    • Paliwal, K.K.1
  • 26
    • 0027659197 scopus 로고
    • Signal modeling techniques in speech recognition
    • Picone J.W. Signal modeling techniques in speech recognition. Proc. IEEE. 81(9):1993;1215-1247.
    • (1993) Proc. IEEE , vol.81 , Issue.9 , pp. 1215-1247
    • Picone, J.W.1
  • 27
    • 0001656188 scopus 로고    scopus 로고
    • Kalman filtering of colored noise for speech enhancement
    • Seattle
    • Popescu, D.C., Zeljkovic, I., 1998. Kalman filtering of colored noise for speech enhancement. Proc. ICASSP, Seattle, pp. 997-1000.
    • (1998) Proc. ICASSP , pp. 997-1000
    • Popescu, D.C.1    Zeljkovic, I.2
  • 28
    • 0029769867 scopus 로고    scopus 로고
    • Signal bias removal by maximum likelihood estimation for robust telephone speech recognition
    • Rahim M., Juang B.-H. Signal bias removal by maximum likelihood estimation for robust telephone speech recognition. IEEE Trans. Speech Audio Process. 4(1):1996;19-30.
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.1 , pp. 19-30
    • Rahim, M.1    Juang, B.-H.2
  • 29
    • 0030649027 scopus 로고    scopus 로고
    • Jacobian approach to fast acoustic model adaptation
    • Munich, Germany
    • Sagayama, S., Yamaguchi, Y., Tahahashi, S., Takahashi, J.-I., 1997. Jacobian approach to fast acoustic model adaptation, Proc. ICASSP, Munich, Germany, pp. 835-838.
    • (1997) Proc. ICASSP , pp. 835-838
    • Sagayama, S.1    Yamaguchi, Y.2    Tahahashi, S.3    Takahashi, J.-I.4
  • 30
    • 0022859652 scopus 로고
    • On the use of instantaneous and transitional spectral information in speaker recognition
    • Tokyo, Japan
    • Soong, F.K., Rosenberg, A.E., 1986. On the use of instantaneous and transitional spectral information in speaker recognition. Proc. ICASSP, Tokyo, Japan, pp. 877-880.
    • (1986) Proc. ICASSP , pp. 877-880
    • Soong, F.K.1    Rosenberg, A.E.2
  • 31
    • 0000090514 scopus 로고
    • A weighted cepstral distance measure for speech recognition
    • Tohkura Y. A weighted cepstral distance measure for speech recognition. IEEE Trans. Acoust., Speech Signal Process. 35(10):1987;1414-1422.
    • (1987) IEEE Trans. Acoust., Speech Signal Process. , vol.35 , Issue.10 , pp. 1414-1422
    • Tohkura, Y.1
  • 33
    • 0030779363 scopus 로고    scopus 로고
    • Noise compensation methods for hidden Markov model speech recognition in adverse environments
    • Vaseghi S.V., Milner B.P. Noise compensation methods for hidden Markov model speech recognition in adverse environments. IEEE Trans. Speech Audio Process. 5(1):1997;11-21.
    • (1997) IEEE Trans. Speech Audio Process. , vol.5 , Issue.1 , pp. 11-21
    • Vaseghi, S.V.1    Milner, B.P.2
  • 34
    • 0029726509 scopus 로고    scopus 로고
    • Improving environmental robustness in large vocabulary speech recognition
    • Atlanta, GA
    • Woodland, P.C., Gales, M.J.E., Pye, D., 1996. Improving environmental robustness in large vocabulary speech recognition. Proc. ICASSP, Atlanta, GA, pp. 65-68.
    • (1996) Proc. ICASSP , pp. 65-68
    • Woodland, P.C.1    Gales, M.J.E.2    Pye, D.3
  • 35
    • 85009101128 scopus 로고    scopus 로고
    • Noise robust feature extraction for ASR using Aurora 2 database
    • Scandinavia
    • Zhu, Q., Iseli, M., Cui, X., Alwan, A., 2001. Noise robust feature extraction for ASR using Aurora 2 database. Proc. EUROSPEECH, Scandinavia, pp. 185-188.
    • (2001) Proc. EUROSPEECH , pp. 185-188
    • Zhu, Q.1    Iseli, M.2    Cui, X.3    Alwan, A.4
  • 36
    • 0025477640 scopus 로고
    • Speech database development at MIT: TIMIT and beyond
    • Zue V., Seneff S., Glass J. Speech database development at MIT: TIMIT and beyond. Speech Commun. 9:1990;351-356.
    • (1990) Speech Commun. , vol.9 , pp. 351-356
    • Zue, V.1    Seneff, S.2    Glass, J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.