메뉴 건너뛰기




Volumn , Issue , 2009, Pages

Speech analysis for automatic speech recongnition: A review

Author keywords

ASR parameters; Auditory models; Speech parameterization; Speech recognizer

Indexed keywords

ASR PARAMETERS; AUDITORY MODELS; PARAMETERIZATION METHOD; SPEECH RECOGNIZER; SPEECH SIGNALS;

EID: 70350370968     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/SPED.2009.5156170     Document Type: Conference Paper
Times cited : (2)

References (22)
  • 2
    • 0029306339 scopus 로고
    • Improving the readability of Time-Frequency and Time-Scale Representations by the Reassignment Method
    • F. Auger and P. Flandrin, "Improving the readability of Time-Frequency and Time-Scale Representations by the Reassignment Method," IEEE Trans. on Signal Processing, 43, pp. 1068-1089, 1995.
    • (1995) IEEE Trans. on Signal Processing , vol.43 , pp. 1068-1089
    • Auger, F.1    Flandrin, P.2
  • 3
    • 0000293183 scopus 로고
    • Theory of communication
    • D. Gabor, "Theory of communication." J. IEEE, 93, 429-457, 1946.
    • (1946) J. IEEE , vol.93 , pp. 429-457
    • Gabor, D.1
  • 4
    • 0024705330 scopus 로고
    • Time - Frequency Distributions - A Review
    • L. Cohen, "Time - Frequency Distributions - A Review," Proc. IEEE, 77, pp. 941-981, 1989.
    • (1989) Proc. IEEE , vol.77 , pp. 941-981
    • Cohen, L.1
  • 5
    • 0024929841 scopus 로고
    • Transient Analysis of Speech Signals Using the Wigner Time-Frequency Representation
    • Glasgow, pp
    • E.F. Velez and R.G. Absher, "Transient Analysis of Speech Signals Using the Wigner Time-Frequency Representation," Proc. ICASSP, Glasgow, pp.2242-2245, 1989.
    • (1989) Proc. ICASSP , pp. 2242-2245
    • Velez, E.F.1    Absher, R.G.2
  • 6
    • 4944257240 scopus 로고    scopus 로고
    • Auditory Based Feature Vectors for Speech Recognition Systems
    • N.E.Mastorakis et V.V. Kluev, Editors, WSEAS Press, pp
    • W.H. Abdulla, "Auditory Based Feature Vectors for Speech Recognition Systems," in Advances in Communications and Software Technologies, N.E.Mastorakis et V.V. Kluev, Editors, WSEAS Press, pp 231-236, 2002.
    • (2002) Advances in Communications and Software Technologies , pp. 231-236
    • Abdulla, W.H.1
  • 7
    • 84869656847 scopus 로고    scopus 로고
    • Sur ĺvaluation du second formant F'2 par une technique d'estimation spectrale basée sur une modélisation du filtrage auditif
    • Nancy
    • K. Ouni and N. Ellouze, "Sur ĺvaluation du second formant F'2 par une technique d'estimation spectrale basée sur une modélisation du filtrage auditif," Proc. XXIV JEP, Nancy, 2002.
    • (2002) Proc. XXIV JEP
    • Ouni, K.1    Ellouze, N.2
  • 9
    • 2442551863 scopus 로고    scopus 로고
    • Estimating cepstrum of speech under the presence of noise using a joint prior of static and dynamic features
    • L. Deng, et al., "Estimating cepstrum of speech under the presence of noise using a joint prior of static and dynamic features," IEEE Trans. SAP, 12, pp. 218-233, 2004.
    • (2004) IEEE Trans. SAP , vol.12 , pp. 218-233
    • Deng, L.1
  • 10
    • 70350410327 scopus 로고    scopus 로고
    • An improved version of the SPACE algorithm for noise robust speech recognition
    • Marrakech
    • K. Daoudi and C. Cerisara, "An improved version of the SPACE algorithm for noise robust speech recognition," Proc. IEEE-EURASIP ISCCSP, Marrakech, 2006.
    • (2006) Proc. IEEE-EURASIP ISCCSP
    • Daoudi, K.1    Cerisara, C.2
  • 11
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. ASSP, 28, pp. 357-366, 1980.
    • (1980) IEEE Trans. ASSP , vol.28 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 12
    • 0022667694 scopus 로고
    • Speaker independent isolated word recognition using dynamic features of speech spectrum
    • S. Furui, "Speaker independent isolated word recognition using dynamic features of speech spectrum," IEEE Trans. ASSP, 34, pp. 52-59, 1986.
    • (1986) IEEE Trans. ASSP , vol.34 , pp. 52-59
    • Furui, S.1
  • 13
    • 0034817674 scopus 로고    scopus 로고
    • Time and frequency filtering of filterband energies for robust HMM speech recognition
    • C. Nadeu, D. Macho, and J. Hernando, "Time and frequency filtering of filterband energies for robust HMM speech recognition," Speech Communication, 34, pp. 93-114, 2001.
    • (2001) Speech Communication , vol.34 , pp. 93-114
    • Nadeu, C.1    Macho, D.2    Hernando, J.3
  • 15
    • 33745387383 scopus 로고    scopus 로고
    • A new approach for speech enhancement based on the adaptive thresholding of the wavelet packets
    • Y. Ghanbaria and M.R. Karimi-Mollaei, "A new approach for speech enhancement based on the adaptive thresholding of the wavelet packets," Speech Communication, 48, pp. 927-940, 2006.
    • (2006) Speech Communication , vol.48 , pp. 927-940
    • Ghanbaria, Y.1    Karimi-Mollaei, M.R.2
  • 16
    • 34347335498 scopus 로고    scopus 로고
    • A comprehensive noise robust speech parameterization algorithm using wavelet packet decomposition-based denoising and speech feature representation techniques
    • B. Kotnik and Z. Kacik, "A comprehensive noise robust speech parameterization algorithm using wavelet packet decomposition-based denoising and speech feature representation techniques," EURASIP Journal on Advances in Signal Processing, 2007.
    • (2007) EURASIP Journal on Advances in Signal Processing
    • Kotnik, B.1    Kacik, Z.2
  • 17
    • 0026400245 scopus 로고
    • An investigation of PLP and IMELDA acoustic representations and of their potential for combination
    • Toronto, pp
    • M. J. Hunt et al., "An investigation of PLP and IMELDA acoustic representations and of their potential for combination," Proc. ICASSP, Toronto, pp. 881-884, 1991.
    • (1991) Proc. ICASSP , pp. 881-884
    • Hunt, M.J.1
  • 18
    • 0025041264 scopus 로고
    • Perceptual linear predictive (PLP) analysis of speech
    • H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech," J. Acoust. Soc. Amer., 87, pp. 1738-1752, 1990.
    • (1990) J. Acoust. Soc. Amer , vol.87 , pp. 1738-1752
    • Hermansky, H.1
  • 19
    • 85135377175 scopus 로고
    • Compensation for the effect of communication channel in auditory-like analysis of speech (RASTA-PLP)
    • H. Hermansky et al., "Compensation for the effect of communication channel in auditory-like analysis of speech (RASTA-PLP)," Proc. EUROSPEECH, Gênes, pp. 1367-1370, 1991.
    • (1991) Proc. EUROSPEECH, Gênes , pp. 1367-1370
    • Hermansky, H.1
  • 20
    • 84944816135 scopus 로고
    • A Digital Filter Bank for Spectral Matching
    • Philadelphia, pp
    • D.H. Klatt, "A Digital Filter Bank for Spectral Matching," Proc. ICASSP, Philadelphia, pp. 573-576, 1976.
    • (1976) Proc. ICASSP , pp. 573-576
    • Klatt, D.H.1
  • 21
    • 53149126814 scopus 로고    scopus 로고
    • Robust Feature Extraction for Continuous Speech recognition Using the mvdr Spectrum Estimation Method
    • S. Dharanipragada et al., "Robust Feature Extraction for Continuous Speech recognition Using the mvdr Spectrum Estimation Method," IEEE Trans. ASSP, 15, pp. 224-234, 2007.
    • (2007) IEEE Trans. ASSP , vol.15 , pp. 224-234
    • Dharanipragada, S.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.