메뉴 건너뛰기




Volumn 87, Issue 6, 2007, Pages 1202-1223

A noise robust feature extraction algorithm using joint wavelet packet subband decomposition and AR modeling of speech signals

Author keywords

Automatic speech recognition; Autoregressive modeling; Modified soft thresholding; Noise robust speech parameterization; Wavelet packet decomposition

Indexed keywords

ALGORITHMS; COMPUTATION THEORY; COMPUTER SIMULATION; FEATURE EXTRACTION; FOURIER TRANSFORMS; PACKET NETWORKS; SIGNAL PROCESSING; WAVELET TRANSFORMS;

EID: 33847103626     PISSN: 01651684     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.sigpro.2006.10.009     Document Type: Article
Times cited : (30)

References (29)
  • 2
    • 0037939793 scopus 로고    scopus 로고
    • Efficient Noise robust feature extraction algorithms for distributed speech recognition systems
    • Kotnik B., Vlaj D., and Horvat B. Efficient Noise robust feature extraction algorithms for distributed speech recognition systems. Int. J. Speech Technol. 6 3 (2003) 205-219
    • (2003) Int. J. Speech Technol. , vol.6 , Issue.3 , pp. 205-219
    • Kotnik, B.1    Vlaj, D.2    Horvat, B.3
  • 3
    • 33847133678 scopus 로고    scopus 로고
    • ETSI standard document-ETSI ES 201 108 v1.1.1, Speech Processing, Transmission and Quality aspects (STQ), Distributed speech recognition, Front-end feature extraction algorithm, Compression Algorithm, 2000.
  • 4
    • 33847092730 scopus 로고    scopus 로고
    • ETSI standard document-ETSI ES 202 050 v1.1.1, Speech Processing, Transmission and Quality aspects (STQ), Distributed speech recognition, Advanced front-end feature extraction algorithm, Compression Algorithm, 2002.
  • 5
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Davis S.B., and Mermelstein P. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans. Acoust. Speech Signal Process. 28 (1980) 357-366
    • (1980) IEEE Trans. Acoust. Speech Signal Process. , vol.28 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 6
    • 33847150755 scopus 로고    scopus 로고
    • D. Pearce, An overview of the ETSI standards activities for distributed speech recognition front-ends, in: Proceedings of the AVIOS 2000, San Jose, CA, USA, 2000.
  • 7
    • 33847113720 scopus 로고    scopus 로고
    • AU/225/00, AU/271/00, AU/273/00, AU/378/00. Finnish, Spanish, German, Danish databases for ETSI STQ Aurora WI008 advanced DSR front-end evaluation: description and baseline results, 2000.
  • 8
    • 85009124169 scopus 로고    scopus 로고
    • R. Sarikaya, J.H.L. Hansen, Analysis of the Root-Cepstrum for acoustic modeling and fast decoding in speech recognition, in: Proceedings of the Eurospeech 2001, Aalborg, Denmark, 2001, pp. 687-690.
  • 10
    • 85009230362 scopus 로고    scopus 로고
    • R. Gemello, F. Mana, D. Albesano, R. De Mori, Robust multiple resolution analysis for automatic speech recognition, in: Proceedings of the Eurospeech 2003, Geneva, Switzerland, 2003.
  • 11
    • 33847146090 scopus 로고    scopus 로고
    • R. Sarikaya, B.L. Pellom, J.H.L. Hansen, Wavelet packet transform feature with application to speaker identification, in: Proceedings of the IEEE Nordic Signal Processing Symposium, Vigsø, Denmark, June, 1998, pp. 81-84.
  • 12
    • 0033705977 scopus 로고    scopus 로고
    • J.N. Gowdy, Z. Tufekci, Mel-scaled discrete wavelet coefficients for speech recognition, in: Proceedings of the 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing, Istanbul, Turkey, 2000.
  • 13
    • 0035125193 scopus 로고    scopus 로고
    • Wavelet speech enhancement based on the teager energy operator
    • Bahoura M., and Rouat J. Wavelet speech enhancement based on the teager energy operator. Signal Process. Lett. IEEE 8 1 (2001)
    • (2001) Signal Process. Lett. IEEE , vol.8 , Issue.1
    • Bahoura, M.1    Rouat, J.2
  • 14
    • 85009074815 scopus 로고    scopus 로고
    • H. Sheikhzadeh, H.R. Abutalebi, An improved wavelet-based speech enhancement system, in: Proceedings of the Eurospeech 2001, Aalborg, Denmark, 2001, pp. 1855-1858.
  • 17
    • 79959907654 scopus 로고    scopus 로고
    • B. Kotnik, Z. Kacic, B. Horvat, Noise robust speech parameterization based on joint wavelet packet decomposition and autoregressive modeling, in: Proceedings of the Eurospeech 2003, Geneva, Switzerland, 2003.
  • 20
    • 33847143008 scopus 로고    scopus 로고
    • I.W. Selesnick, Explicit Formulas for Orthogonal IIR Wavelets, Preprint, November 1997. Available: 〈http://citeseer.nj.nec.com/selesnick97explicit.html〉.
  • 21
    • 33847115977 scopus 로고    scopus 로고
    • R. Martin, Spectral subtraction based on minimum statistics, in: Proceedings of the European Signal Processing Conference (EUSIPCO), Edinburgh, UK, September 1994, pp. 1182-1185.
  • 22
    • 0029307534 scopus 로고
    • Denoising by soft thresholding
    • Donoho D.L. Denoising by soft thresholding. IEEE Trans. Inform. Theory 41 (1995) 613-627
    • (1995) IEEE Trans. Inform. Theory , vol.41 , pp. 613-627
    • Donoho, D.L.1
  • 23
    • 0035274536 scopus 로고    scopus 로고
    • Robust voice activity detection using higher-order statistics in the LPC residual domain
    • Nemer E., Goubran E.R., and Mahmoud S. Robust voice activity detection using higher-order statistics in the LPC residual domain. IEEE Trans. Speech Audio Process. 9 (2001) 217-231
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , pp. 217-231
    • Nemer, E.1    Goubran, E.R.2    Mahmoud, S.3
  • 24
    • 0015488387 scopus 로고
    • The SIFT algorithm for fundamental frequency estimation
    • Markel J.D. The SIFT algorithm for fundamental frequency estimation. IEEE Trans. Audio Electroacoust. 20 (1972) 367-377
    • (1972) IEEE Trans. Audio Electroacoust. , vol.20 , pp. 367-377
    • Markel, J.D.1
  • 25
    • 33847173139 scopus 로고    scopus 로고
    • L. Welling, Merkmalsextraction in Spracherkennungssystemen für großen Wortschatz. Ph.D. Thesis, RWTH, Aachen, Germany, 1999.
  • 26
    • 84857498381 scopus 로고    scopus 로고
    • B. Kotnik, Z. Kacic, B. Horvat, Development & integration of the LDA-toolkit into the COST249 SpeechDat (II) SIG reference recognizer, in: Proceedings of the LREC 2004, Lisbon, Portugal, 2004, pp. 2083-2086.
  • 27
    • 33847128612 scopus 로고    scopus 로고
    • H.-G. Hirsch, D. Pearce, The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions, in: Proceedings of the ISCA ITRW ASR 2000, Paris, France, Sept. 2000, pp. 181-188.
  • 28
    • 85009278999 scopus 로고    scopus 로고
    • B. Kotnik, D. Vlaj, Z. Kačič, B. Horvat, Robust MFCC feature extraction algorithm using efficient additive and convolutional noise reduction procedures, in: Proceedings of the ICSLP 2002, Denver, CO, USA, 2002, pp. 445-448.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.