SCOPUS 정보 검색 플랫폼

Volumn 87, Issue 6, 2007, Pages 1202-1223

A noise robust feature extraction algorithm using joint wavelet packet subband decomposition and AR modeling of speech signals

(2) Kotnik, Bojan a Kačič, Zdravko a

a UNIVERSITY OF MARIBOR (Slovenia)

Author keywords

Automatic speech recognition; Autoregressive modeling; Modified soft thresholding; Noise robust speech parameterization; Wavelet packet decomposition

Indexed keywords

ALGORITHMS; COMPUTATION THEORY; COMPUTER SIMULATION; FEATURE EXTRACTION; FOURIER TRANSFORMS; PACKET NETWORKS; SIGNAL PROCESSING; WAVELET TRANSFORMS;

AUTOMATIC SPEECH RECOGNITION; AUTOREGRESSIVE MODELING; MODIFIED SOFT THRESHOLDING; NOISE ROBUST SPEECH PARAMETERIZATION; WAVELET PACKET DECOMPOSITION;

SPEECH RECOGNITION;

EID: 33847103626 PISSN: 01651684 EISSN: None Source Type: Journal
DOI: 10.1016/j.sigpro.2006.10.009 Document Type: Article

Times cited : (30)

References (29)

1
- 0003770709
- Kluwer, Norwell, MA, USA
- Junqua J.-C., and Haton J.-P. Robustness in Automatic Speech Recognition (1996), Kluwer, Norwell, MA, USA
- (1996) Robustness in Automatic Speech Recognition
- Junqua, J.-C.¹ Haton, J.-P.²

2
- 0037939793
- Efficient Noise robust feature extraction algorithms for distributed speech recognition systems
- Kotnik B., Vlaj D., and Horvat B. Efficient Noise robust feature extraction algorithms for distributed speech recognition systems. Int. J. Speech Technol. 6 3 (2003) 205-219
- (2003) Int. J. Speech Technol. , vol.6 , Issue.3 , pp. 205-219
- Kotnik, B.¹ Vlaj, D.² Horvat, B.³

3
- 33847133678
- ETSI standard document-ETSI ES 201 108 v1.1.1, Speech Processing, Transmission and Quality aspects (STQ), Distributed speech recognition, Front-end feature extraction algorithm, Compression Algorithm, 2000.

4
- 33847092730
- ETSI standard document-ETSI ES 202 050 v1.1.1, Speech Processing, Transmission and Quality aspects (STQ), Distributed speech recognition, Advanced front-end feature extraction algorithm, Compression Algorithm, 2002.

5
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- Davis S.B., and Mermelstein P. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans. Acoust. Speech Signal Process. 28 (1980) 357-366
- (1980) IEEE Trans. Acoust. Speech Signal Process. , vol.28 , pp. 357-366
- Davis, S.B.¹ Mermelstein, P.²

6
- 33847150755
- D. Pearce, An overview of the ETSI standards activities for distributed speech recognition front-ends, in: Proceedings of the AVIOS 2000, San Jose, CA, USA, 2000.

7
- 33847113720
- AU/225/00, AU/271/00, AU/273/00, AU/378/00. Finnish, Spanish, German, Danish databases for ETSI STQ Aurora WI008 advanced DSR front-end evaluation: description and baseline results, 2000.

8
- 85009124169
- R. Sarikaya, J.H.L. Hansen, Analysis of the Root-Cepstrum for acoustic modeling and fast decoding in speech recognition, in: Proceedings of the Eurospeech 2001, Aalborg, Denmark, 2001, pp. 687-690.

9
- 0003424145
- Macmillan Publishing Company, New York, USA
- Deller J.R., Proakis J.G., and Hansen J.H.L. Discrete-Time Processing of Speech Signals (1993), Macmillan Publishing Company, New York, USA
- (1993) Discrete-Time Processing of Speech Signals
- Deller, J.R.¹ Proakis, J.G.² Hansen, J.H.L.³

10
- 85009230362
- R. Gemello, F. Mana, D. Albesano, R. De Mori, Robust multiple resolution analysis for automatic speech recognition, in: Proceedings of the Eurospeech 2003, Geneva, Switzerland, 2003.

11
- 33847146090
- R. Sarikaya, B.L. Pellom, J.H.L. Hansen, Wavelet packet transform feature with application to speaker identification, in: Proceedings of the IEEE Nordic Signal Processing Symposium, Vigsø, Denmark, June, 1998, pp. 81-84.

12
- 0033705977
- J.N. Gowdy, Z. Tufekci, Mel-scaled discrete wavelet coefficients for speech recognition, in: Proceedings of the 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing, Istanbul, Turkey, 2000.

13
- 0035125193
- Wavelet speech enhancement based on the teager energy operator
- Bahoura M., and Rouat J. Wavelet speech enhancement based on the teager energy operator. Signal Process. Lett. IEEE 8 1 (2001)
- (2001) Signal Process. Lett. IEEE , vol.8 , Issue.1
- Bahoura, M.¹ Rouat, J.²

14
- 85009074815
- H. Sheikhzadeh, H.R. Abutalebi, An improved wavelet-based speech enhancement system, in: Proceedings of the Eurospeech 2001, Aalborg, Denmark, 2001, pp. 1855-1858.

15
- 0004252361
- Prentice Hall-PTR, Upper Saddle River, NJ, USA
- Vetterli M., and Kovačević J. Wavelets and Subband Coding (1995), Prentice Hall-PTR, Upper Saddle River, NJ, USA
- (1995) Wavelets and Subband Coding
- Vetterli, M.¹ Kovačević, J.²

16
- 0004266795
- Academic Press, San Diego, CA, USA
- Chui C.K. An Introduction to Wavelets (1992), Academic Press, San Diego, CA, USA
- (1992) An Introduction to Wavelets
- Chui, C.K.¹

17
- 79959907654
- B. Kotnik, Z. Kacic, B. Horvat, Noise robust speech parameterization based on joint wavelet packet decomposition and autoregressive modeling, in: Proceedings of the Eurospeech 2003, Geneva, Switzerland, 2003.

18
- 0003833285
- SIAM, Philadelphia, USA
- Daubechies I. Ten Lectures on Wavelets (1997), SIAM, Philadelphia, USA
- (1997) Ten Lectures on Wavelets
- Daubechies, I.¹

19
- 0004206760
- Wellesley-Cambridge Press, MA, USA
- Strang G., and Nguyen T. Wavelets and Filter Banks. Wellesley, (1997), Wellesley-Cambridge Press, MA, USA
- (1997) Wavelets and Filter Banks. Wellesley
- Strang, G.¹ Nguyen, T.²

20
- 33847143008
- I.W. Selesnick, Explicit Formulas for Orthogonal IIR Wavelets, Preprint, November 1997. Available: 〈http://citeseer.nj.nec.com/selesnick97explicit.html〉.

21
- 33847115977
- R. Martin, Spectral subtraction based on minimum statistics, in: Proceedings of the European Signal Processing Conference (EUSIPCO), Edinburgh, UK, September 1994, pp. 1182-1185.

22
- 0029307534
- Denoising by soft thresholding
- Donoho D.L. Denoising by soft thresholding. IEEE Trans. Inform. Theory 41 (1995) 613-627
- (1995) IEEE Trans. Inform. Theory , vol.41 , pp. 613-627
- Donoho, D.L.¹

23
- 0035274536
- Robust voice activity detection using higher-order statistics in the LPC residual domain
- Nemer E., Goubran E.R., and Mahmoud S. Robust voice activity detection using higher-order statistics in the LPC residual domain. IEEE Trans. Speech Audio Process. 9 (2001) 217-231
- (2001) IEEE Trans. Speech Audio Process. , vol.9 , pp. 217-231
- Nemer, E.¹ Goubran, E.R.² Mahmoud, S.³

24
- 0015488387
- The SIFT algorithm for fundamental frequency estimation
- Markel J.D. The SIFT algorithm for fundamental frequency estimation. IEEE Trans. Audio Electroacoust. 20 (1972) 367-377
- (1972) IEEE Trans. Audio Electroacoust. , vol.20 , pp. 367-377
- Markel, J.D.¹

25
- 33847173139
- L. Welling, Merkmalsextraction in Spracherkennungssystemen für großen Wortschatz. Ph.D. Thesis, RWTH, Aachen, Germany, 1999.

26
- 84857498381
- B. Kotnik, Z. Kacic, B. Horvat, Development & integration of the LDA-toolkit into the COST249 SpeechDat (II) SIG reference recognizer, in: Proceedings of the LREC 2004, Lisbon, Portugal, 2004, pp. 2083-2086.

27
- 33847128612
- H.-G. Hirsch, D. Pearce, The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions, in: Proceedings of the ISCA ITRW ASR 2000, Paris, France, Sept. 2000, pp. 181-188.

28
- 85009278999
- B. Kotnik, D. Vlaj, Z. Kačič, B. Horvat, Robust MFCC feature extraction algorithm using efficient additive and convolutional noise reduction procedures, in: Proceedings of the ICSLP 2002, Denver, CO, USA, 2002, pp. 445-448.

29
- 0003571977
- Microsoft Corporation, USA
- Young S., Kershaw D., Odell J., Ollason D., Valtchev V., and Woodland P. The HTK Book-Version 3.0 (2000), Microsoft Corporation, USA
- (2000) The HTK Book-Version 3.0
- Young, S.¹ Kershaw, D.² Odell, J.³ Ollason, D.⁴ Valtchev, V.⁵ Woodland, P.⁶

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.