메뉴 건너뛰기




Volumn , Issue , 2009, Pages 32-35

Towards fusion of feature extraction and acoustic model training: A top down process for robust speech recognition

Author keywords

Auditory model; Automatic speech recognition; Data analysis; Discriminative training

Indexed keywords

A-POSTERIORI PROBABILITIES; ACOUSTIC MODEL; AUDITORY MODELS; AUTOMATIC SPEECH RECOGNITION; BACKGROUND NOISE; DATA ANALYSIS; DISCRIMINATIVE TRAINING; ENVIRONMENTAL NOISE; FEATURE COMPUTATION; FUSION OF FEATURES; HEARING SYSTEM; HUMAN AUDITORY SYSTEM; LOGISTIC FUNCTIONS; MODEL TRAINING; NON-LINEARITY; RESOURCE MANAGEMENT; ROBUST SPEECH RECOGNITION; TOP-DOWN PROCESS; TRAINING DATA;

EID: 70450191552     PISSN: None     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (3)

References (11)
  • 1
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • S.B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences", IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-28, pp. 357-366, 1980.
    • (1980) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-28 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 2
    • 0025041264 scopus 로고
    • Perceptual linear predictive (plp) analysis of speech
    • H. Hermansky,"Perceptual linear predictive (plp) analysis of speech", J. Acoust. Soc. Am., vol. 87, pp. 1738-1752, 1990.
    • (1990) J. Acoust. Soc. Am , vol.87 , pp. 1738-1752
    • Hermansky, H.1
  • 3
    • 0000929862 scopus 로고    scopus 로고
    • Physiology of olivocochlear efferents
    • Springer, NewYork
    • J.J. Guinan Jr, "Physiology of olivocochlear efferents",The Cochlea, vol. 8, pp. 435-502, Springer, NewYork, 1996.
    • (1996) The Cochlea , vol.8 , pp. 435-502
    • Guinan Jr, J.J.1
  • 4
    • 0035250280 scopus 로고    scopus 로고
    • An Application of Discriminative Feature Extraction to Filter-Bank-Based Speech Recognition
    • A. Biem, S. Katagiri, E. McDermott and B.-H. Juang, "An Application of Discriminative Feature Extraction to Filter-Bank-Based Speech Recognition", IEEE Trans. Acoust., Speech, Signal Processing, vol. 9, no. 2, pp. 96-110, 2001.
    • (2001) IEEE Trans. Acoust., Speech, Signal Processing , vol.9 , Issue.2 , pp. 96-110
    • Biem, A.1    Katagiri, S.2    McDermott, E.3    Juang, B.-H.4
  • 5
    • 85009240235 scopus 로고    scopus 로고
    • Design a Speaker-Discriminative Adaptive Filter Bank for Speaker Recognition
    • Denver, Colorado, USA, September
    • T. Kinnunen, "Design a Speaker-Discriminative Adaptive Filter Bank for Speaker Recognition",Proc. ICSLP, Denver, Colorado, USA, September, 2002.
    • (2002) Proc. ICSLP
    • Kinnunen, T.1
  • 6
    • 85009064363 scopus 로고    scopus 로고
    • Pallel Feature Generation Based on Maximizing Normalized Acoustic Likelihood
    • Jeju Island, Korea, October
    • X. Li and R. Stern, "Pallel Feature Generation Based on Maximizing Normalized Acoustic Likelihood", Proc. ICSLP, Jeju Island, Korea, October 2004.
    • (2004) Proc. ICSLP
    • Li, X.1    Stern, R.2
  • 7
    • 33646788786 scopus 로고    scopus 로고
    • D. Povey, B. Kingsbury, L. Mangu, G. Saon, H. Soltau and G. Zweig, fMPE: Discriminatively Trained Features for Speech Recognition, Proc. ICASSP, Philadelphia, USA, March 2005.
    • D. Povey, B. Kingsbury, L. Mangu, G. Saon, H. Soltau and G. Zweig, "fMPE: Discriminatively Trained Features for Speech Recognition", Proc. ICASSP, Philadelphia, USA, March 2005.
  • 8
    • 70349209497 scopus 로고    scopus 로고
    • Analysis of Physiologically-Motivated Signal Processing for Robust Speech Recognition
    • Brisbane, Australia, September
    • Y.-H. Chiu and R. Stern, "Analysis of Physiologically-Motivated Signal Processing for Robust Speech Recognition", Proc. ICSLP, Brisbane, Australia, September 2008.
    • (2008) Proc. ICSLP
    • Chiu, Y.-H.1    Stern, R.2
  • 9
    • 0017930132 scopus 로고
    • Auditory nerve response from cats raised in a low noise chamber
    • M.C. Liberman, "Auditory nerve response from cats raised in a low noise chamber", J. Acoust. Soc. Am., vol. 63, pp. 442-455, 1978.
    • (1978) J. Acoust. Soc. Am , vol.63 , pp. 442-455
    • Liberman, M.C.1
  • 10
    • 70349208652 scopus 로고    scopus 로고
    • Minimum variance modulation filter for robust speech recognition
    • Taipei, Taiwan, April
    • Y.-H. Chiu and R. Stern, "Minimum variance modulation filter for robust speech recognition", Proc. ICASSP, Taipei, Taiwan, April 2009.
    • (2009) Proc. ICASSP
    • Chiu, Y.-H.1    Stern, R.2
  • 11
    • 0018367986 scopus 로고
    • Calculating virtual pitch
    • E. Terhardt, "Calculating virtual pitch", Hearing Research, vol. 1, pp. 155-182, 1979.
    • (1979) Hearing Research , vol.1 , pp. 155-182
    • Terhardt, E.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.