SCOPUS 정보 검색 플랫폼

Speech Communication

Volumn 41, Issue 2-3, 2003, Pages 469-484

Cepstrum derived from differentiated power spectrum for robust speech recognition

(3) Chen, Jingdong a Paliwal, Kuldip K b Nakamura, Satoshi c

a LUCENT TECHNOLOGIES (United States)

b GRIFFITH UNIVERSITY (Australia)

c ADVANCED TELECOMMUNICATIONS RESEARCH INSTITUTE INTERNATIONAL (Japan)

Author keywords

Cepstral mean normalization; Differential power spectrum; Hidden Markov model; Linear liftering; Robust speech recognition; Spectral subtraction

Indexed keywords

MATHEMATICAL TRANSFORMATIONS; NONLINEAR EQUATIONS; ROBUSTNESS (CONTROL SYSTEMS); SPEECH RECOGNITION;

ROBUST SPEECH RECOGNITION;

SPEECH COMMUNICATION;

EID: 0038373389 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/S0167-6393(03)00016-5 Document Type: Article

Times cited : (44)

References (36)

1
- 85009152845
- Recognition performance of the Siemens front-end with and without frame dropping on the AURORA 2 database
- Scandinavia
- Andrassy, B., Vlaj, D., Beaugeant, C., 2001. Recognition performance of the Siemens front-end with and without frame dropping on the AURORA 2 database. Proc. EUROSPEECH, Scandinavia, pp. 193-196.
- (2001) Proc. EUROSPEECH , pp. 193-196
- Andrassy, B.¹ Vlaj, D.² Beaugeant, C.³

2
- 0018455310
- Suppression of acoustic noise in speech using spectral subtraction
- Boll S.F. Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans. Acoustics, Speech Signal Process. 27(2):1979;113-120.
- (1979) IEEE Trans. Acoustics, Speech Signal Process. , vol.27 , Issue.2 , pp. 113-120
- Boll, S.F.¹

3
- 0030355935
- A new ASR approach based on independent processing and recombination of partial frequency bands
- Philadelphia
- Bourlard, H., Dupont, S., 1996. A new ASR approach based on independent processing and recombination of partial frequency bands. Proc. ICSLP, Philadelphia, pp. 426-429.
- (1996) Proc. ICSLP , pp. 426-429
- Bourlard, H.¹ Dupont, S.²

4
- 85009106589
- Sub-band based additive noise removal for robust speech recognition
- Scandinavia
- Chen, J., Paliwal, K.K., Nakamura, S., 2001. Sub-band based additive noise removal for robust speech recognition. Proc. EUROSPEECH, Scandinavia, pp. 571-574.
- (2001) Proc. EUROSPEECH , pp. 571-574
- Chen, J.¹ Paliwal, K.K.² Nakamura, S.³

5
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- Davis S.B., Mermelstein P. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans. Acoustics, Speech Signal Process. 28:1980;357-366.
- (1980) IEEE Trans. Acoustics, Speech Signal Process. , vol.28 , pp. 357-366
- Davis, S.B.¹ Mermelstein, P.²

6
- 85006734596
- Evaluation of the SPLICE algorithm on the Aurora2 database
- Scandinavia
- Droppo, J., Deng, L., Acero, A., 2001. Evaluation of the SPLICE algorithm on the Aurora2 database. Proc. EUROSPEECH, Scandinavia, pp. 217-220.
- (2001) Proc. EUROSPEECH , pp. 217-220
- Droppo, J.¹ Deng, L.² Acero, A.³

7
- 0022667694
- Speaker-independent isolated work recognition using dynamic features of speech spectrum
- Furui S. Speaker-independent isolated work recognition using dynamic features of speech spectrum. IEEE Trans. Acoustics, Speech Signal Process. 34(1):1986;52-89.
- (1986) IEEE Trans. Acoustics, Speech Signal Process. , vol.34 , Issue.1 , pp. 52-89
- Furui, S.¹

8
- 0030245128
- Robust speech recognition using parallel model combination
- Gales M.J.F., Young S.J. Robust speech recognition using parallel model combination. IEEE Trans. Speech Audio Process. 4(5):1996;352-359.
- (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.5 , pp. 352-359
- Gales, M.J.F.¹ Young, S.J.²

9
- 0037855812
- Improvements in speech recognition for voice dialing in the car environment
- Mandelieu
- Geller, D., Haeb-Umbach, R., Ney, H., 1992. Improvements in speech recognition for voice dialing in the car environment. Proc. ESCA Workshop on Speech Processing in Adverse Conditions, Mandelieu, pp. 203-206.
- (1992) Proc. ESCA Workshop on Speech Processing in Adverse Conditions , pp. 203-206
- Geller, D.¹ Haeb-Umbach, R.² Ney, H.³

10
- 0025041264
- Perceptual linear predictive (PLP) analysis of speech
- Hermansky H. Perceptual linear predictive (PLP) analysis of speech. J. Acoustic. Soc. Am. 87(4):1990;1738-1752.
- (1990) J. Acoustic. Soc. Am. , vol.87 , Issue.4 , pp. 1738-1752
- Hermansky, H.¹

11
- 0028517164
- RASTRA of processing of speech
- Hermansky H., Morgan N. RASTRA of processing of speech. IEEE Trans. Speech Audio Process. 2(4):1994;578-589.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

12
- 85135377175
- Compensation for the effect of the communication channel in auditory-like analysis of speech (RASTA-PLP)
- Genova
- Hermansky, H., Morgan, N., Bayya, A., Kohn, P. 1991. Compensation for the effect of the communication channel in auditory-like analysis of speech (RASTA-PLP). Proc. EUROSPEECH, Genova, pp. 1367-1370.
- (1991) Proc. EUROSPEECH , pp. 1367-1370
- Hermansky, H.¹ Morgan, N.² Bayya, A.³ Kohn, P.⁴

13
- 0011823639
- Improved speech recognition using high-pass filtering of subband envelopes
- Genova
- Hirsch, H., Meyer, P., Ruehl, H., 1991. Improved speech recognition using high-pass filtering of subband envelopes. Proc. EUROSPEECH, Genova, pp. 413-416.
- (1991) Proc. EUROSPEECH , pp. 413-416
- Hirsch, H.¹ Meyer, P.² Ruehl, H.³

14
- 0038669544
- The AURORA experimental framework for the performance evaluation of speech recognition systems under noisy conditions
- Paris, France
- Hirsch, H.G., 2000. The AURORA experimental framework for the performance evaluation of speech recognition systems under noisy conditions. Proc. ISCA ASR2000, Paris, France.
- (2000) Proc. ISCA ASR2000
- Hirsch, H.G.¹

15
- 0023165215
- On the use of bandpass liftering in speech recognition
- Juang B.H., Rabiner L.R., Wilpon J.G. On the use of bandpass liftering in speech recognition. IEEE Trans. Acoust., Speech Signal Process. 35(7):1987;947-954.
- (1987) IEEE Trans. Acoust., Speech Signal Process. , vol.35 , Issue.7 , pp. 947-954
- Juang, B.H.¹ Rabiner, L.R.² Wilpon, J.G.³

16
- 0037518178
- Environment-adaptive algorithms for robust speech recognition
- Kyoto, Japan
- Junqua, J.-C., Cerisara, C., Rigazio, L., Kryze, D., 2001. Environment-adaptive algorithms for robust speech recognition. Proc. Internat. Workshop Handsfree Speech Communication, Kyoto, Japan, pp. 31-34.
- (2001) Proc. Internat. Workshop Handsfree Speech Communication , pp. 31-34
- Junqua, J.-C.¹ Cerisara, C.² Rigazio, L.³ Kryze, D.⁴

17
- 0032785783
- Auditory processing of speech signals for robustness speech recognition in real-world noisy environments
- Kim D.-S., Lee S.-Y., Kil R.M. Auditory processing of speech signals for robustness speech recognition in real-world noisy environments. IEEE Trans. Speech Audio Process. 7(1):1999;55-69.
- (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.1 , pp. 55-69
- Kim, D.-S.¹ Lee, S.-Y.² Kil, R.M.³

18
- 85009085054
- A multiconditional robust front-end feature extraction with a noise reduction procedure based on improved spectral subtraction algorithm
- Scandinavia
- Kotnik, B., Kacic, Z., Horvat, B., 2001. A multiconditional robust front-end feature extraction with a noise reduction procedure based on improved spectral subtraction algorithm. Proc. EUROSPEECH, Scandinavia, pp. 197-200.
- (2001) Proc. EUROSPEECH , pp. 197-200
- Kotnik, B.¹ Kacic, Z.² Horvat, B.³

19
- 0002583871
- Speech database development: Design and analysis of the acoustic-phonetic corpus
- Palo Alto
- Lamel, L.F., Kassel, H.K., Seneft, S., 1986. Speech database development: Design and analysis of the acoustic-phonetic corpus. Proc. DARPA Speech Recognition Workshop, Palo Alto, pp. 100-109.
- (1986) Proc. DARPA Speech Recognition Workshop , pp. 100-109
- Lamel, L.F.¹ Kassel, H.K.² Seneft, S.³

20
- 0024768209
- Speaker-independent phone recognition using hidden Markov models
- Lee K.-F., Hon H.-W. Speaker-independent phone recognition using hidden Markov models. IEEE Trans. Acoustics, Speech Signal Process. 37(11):1989;1641-1648.
- (1989) IEEE Trans. Acoustics, Speech Signal Process. , vol.37 , Issue.11 , pp. 1641-1648
- Lee, K.-F.¹ Hon, H.-W.²

21
- 0029725301
- A vector Taylor series approach for environment independent speech recognition
- Philadelphia, PA
- Moreno, P.J., Raj, B., Stern, R.M., 1996. A vector Taylor series approach for environment independent speech recognition. Proc. ICSLP, Philadelphia, PA, pp. 733-736.
- (1996) Proc. ICSLP , pp. 733-736
- Moreno, P.J.¹ Raj, B.² Stern, R.M.³

22
- 84893207073
- Continuous speech recognition in noise using spectral subtraction and HMM adaptation
- Adelaide, Australia
- Nolazco Flores, J.A., Young, S.J., 1994. Continuous speech recognition in noise using spectral subtraction and HMM adaptation. Proc. ICASSP, Adelaide, Australia, pp. 409-412.
- (1994) Proc. ICASSP , pp. 409-412
- Nolazco Flores, J.A.¹ Young, S.J.²

23
- 85135109228
- Speaker adaptation based on transfer vector field smoothing technique
- Banff, Canada
- Ohkura, K., Sugiyama, M., Sagayama, S., 1992. Speaker adaptation based on transfer vector field smoothing technique. Proc. ICSLP, Banff, Canada, pp. 369-372.
- (1992) Proc. ICSLP , pp. 369-372
- Ohkura, K.¹ Sugiyama, M.² Sagayama, S.³

24
- 0020165569
- On the performance of the frequency-weighted cepstral coefficients in vowel recognition
- Paliwal K.K. On the performance of the frequency-weighted cepstral coefficients in vowel recognition. Speech Commun. 18:1992;151-154.
- (1992) Speech Commun. , vol.18 , pp. 151-154
- Paliwal, K.K.¹

25
- 0038338247
- Decorrelated and liftered filter-bank energies for robust speech recognition
- Budapest
- Paliwal, K.K., 1999. Decorrelated and liftered filter-bank energies for robust speech recognition. Proc. EUROPSEECH, Budapest, pp. 85-88.
- (1999) Proc. EUROPSEECH , pp. 85-88
- Paliwal, K.K.¹

26
- 0027659197
- Signal modeling techniques in speech recognition
- Picone J.W. Signal modeling techniques in speech recognition. Proc. IEEE. 81(9):1993;1215-1247.
- (1993) Proc. IEEE , vol.81 , Issue.9 , pp. 1215-1247
- Picone, J.W.¹

27
- 0001656188
- Kalman filtering of colored noise for speech enhancement
- Seattle
- Popescu, D.C., Zeljkovic, I., 1998. Kalman filtering of colored noise for speech enhancement. Proc. ICASSP, Seattle, pp. 997-1000.
- (1998) Proc. ICASSP , pp. 997-1000
- Popescu, D.C.¹ Zeljkovic, I.²

28
- 0029769867
- Signal bias removal by maximum likelihood estimation for robust telephone speech recognition
- Rahim M., Juang B.-H. Signal bias removal by maximum likelihood estimation for robust telephone speech recognition. IEEE Trans. Speech Audio Process. 4(1):1996;19-30.
- (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.1 , pp. 19-30
- Rahim, M.¹ Juang, B.-H.²

29
- 0030649027
- Jacobian approach to fast acoustic model adaptation
- Munich, Germany
- Sagayama, S., Yamaguchi, Y., Tahahashi, S., Takahashi, J.-I., 1997. Jacobian approach to fast acoustic model adaptation, Proc. ICASSP, Munich, Germany, pp. 835-838.
- (1997) Proc. ICASSP , pp. 835-838
- Sagayama, S.¹ Yamaguchi, Y.² Tahahashi, S.³ Takahashi, J.-I.⁴

30
- 0022859652
- On the use of instantaneous and transitional spectral information in speaker recognition
- Tokyo, Japan
- Soong, F.K., Rosenberg, A.E., 1986. On the use of instantaneous and transitional spectral information in speaker recognition. Proc. ICASSP, Tokyo, Japan, pp. 877-880.
- (1986) Proc. ICASSP , pp. 877-880
- Soong, F.K.¹ Rosenberg, A.E.²

31
- 0000090514
- A weighted cepstral distance measure for speech recognition
- Tohkura Y. A weighted cepstral distance measure for speech recognition. IEEE Trans. Acoust., Speech Signal Process. 35(10):1987;1414-1422.
- (1987) IEEE Trans. Acoust., Speech Signal Process. , vol.35 , Issue.10 , pp. 1414-1422
- Tohkura, Y.¹

32
- 0004319968
- DRA Speech Research Unit, St. Andrew's Rd., Malvern, Worcestershire, WR14 3PS UK
- Varga, A., Steeneken, H.J.M., Tomlinson, M., Jones D., 1992. The NOISEX-92 study on the effect of additive noise on automatic speech recognition. DRA Speech Research Unit, St. Andrew's Rd., Malvern, Worcestershire, WR14 3PS UK.
- (1992) The NOISEX-92 Study on the Effect of Additive Noise on Automatic Speech Recognition
- Varga, A.¹ Steeneken, H.J.M.² Tomlinson, M.³ Jones, D.⁴

33
- 0030779363
- Noise compensation methods for hidden Markov model speech recognition in adverse environments
- Vaseghi S.V., Milner B.P. Noise compensation methods for hidden Markov model speech recognition in adverse environments. IEEE Trans. Speech Audio Process. 5(1):1997;11-21.
- (1997) IEEE Trans. Speech Audio Process. , vol.5 , Issue.1 , pp. 11-21
- Vaseghi, S.V.¹ Milner, B.P.²

34
- 0029726509
- Improving environmental robustness in large vocabulary speech recognition
- Atlanta, GA
- Woodland, P.C., Gales, M.J.E., Pye, D., 1996. Improving environmental robustness in large vocabulary speech recognition. Proc. ICASSP, Atlanta, GA, pp. 65-68.
- (1996) Proc. ICASSP , pp. 65-68
- Woodland, P.C.¹ Gales, M.J.E.² Pye, D.³

35
- 85009101128
- Noise robust feature extraction for ASR using Aurora 2 database
- Scandinavia
- Zhu, Q., Iseli, M., Cui, X., Alwan, A., 2001. Noise robust feature extraction for ASR using Aurora 2 database. Proc. EUROSPEECH, Scandinavia, pp. 185-188.
- (2001) Proc. EUROSPEECH , pp. 185-188
- Zhu, Q.¹ Iseli, M.² Cui, X.³ Alwan, A.⁴

36
- 0025477640
- Speech database development at MIT: TIMIT and beyond
- Zue V., Seneff S., Glass J. Speech database development at MIT: TIMIT and beyond. Speech Commun. 9:1990;351-356.
- (1990) Speech Commun. , vol.9 , pp. 351-356
- Zue, V.¹ Seneff, S.² Glass, J.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.