Volume 48, Issue 11, 2006, Pages 1556-1572

MAP prediction of formant frequencies and voicing class from MFCC vectors in noise

Author keywords

DSR; Formant estimation; Formant prediction; GMM; HMM; MAP prediction

Indexed keywords

FREQUENCIES; MARKOV PROCESSES; MATHEMATICAL MODELS; SIGNAL TO NOISE RATIO; SPEECH COMMUNICATION; SPEECH RECOGNITION

EID: 33750293417     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2006.06.001     Document Type: Article
Times cited: 22

References (24)
  • 1
    • 0015112070
    • Atal, B.S., Hanauer, S.L., 1971. Speech analysis and synthesis by linear prediction of the speech wave. J. Acoust. Soc. Amer. 50 (2), 637-655.
  • 2
    • 17344378368
    • Bruce, I.C., Karkhanis, N.V., Young, E.D., Sachs, M.B., 2002. Robust formant tracking in noise. In: ICASSP, Orlando, FL, May, Vol. 1, pp. 281-284.
  • 3
    • 4544315994
    • Chen, B., Loizou, P.C., 2004. Formant frequency estimation in noise. In: ICASSP, Montreal, Canada, May, Vol. 1, pp. 581-584.
  • 4
    • 33646763209
    • Darch, J., Milner, B., Shao, X., Vaseghi, S., Yan, Q., 2005a. Predicting formant frequencies from MFCC vectors. In: ICASSP, Philadelphia, PA, March, Vol. 1, pp. 941-944.
  • 5
    • 33745198685
    • Darch, J., Milner, B., Vaseghi, S., 2005b. Formant frequency prediction from MFCC vectors in noisy environments. In: Eurospeech, Lisbon, Portugal, September, pp. 1129-1132.
  • 6
    • 33750335159
    • Fransen, J., Pye, D., Robinson, T., Woodland, P., Young, S., 1994. WSJCAM0 corpus and recording description. Tech. Rep. CUED/F-INFENG/TR.192, Cambridge University Engineering Department, September.
  • 8
    • 0018986665
    • Klatt, D.H., 1980. Software for a cascade/parallel formant synthesizer. J. Acoust. Soc. Amer. 67 (3), 971-995.
  • 9
    • 0016049328
    • McCandless, S., 1974. An algorithm for automatic formant extraction using linear prediction spectra. IEEE Trans. Acoust. Speech Signal Process. 22 (2), 135-141.
  • 10
    • 33745203982
    • Milner, B., Shao, X., Darch, J., 2005. Fundamental frequency and voicing prediction from MFCCs for speech reconstruction from unconstrained speech. In: Eurospeech, Lisbon, Portugal, September, pp. 321-324.
  • 11
    • 33750327908
    • Niederjohn, R.J., Svoren, T.J., Heinen, J.A., 1992. Intelligibility enhancement of noise-corrupted speech based on formant tracking involving prefiltering. In: IEEE Industrial Electronics, Control, Instrumentation, and Automation, San Diego, CA, November, Vol. 3, pp. 1336-1341.
  • 12
    • 84987702417
    • Pearce, D., Hirsch, H.-G., 2000. The AURORA experimental framework for the performance evaluation of speech recognition systems under noisy conditions. In: ICSLP, Beijing, China, October, Vol. 4, pp. 29-32.
  • 14
    • 4644336054
    • Raj, B., Seltzer, M.L., Stern, R.M., 2004. Reconstruction of missing features for robust speech recognition. Speech Comm. 43 (4), 275-296.
  • 15
    • 0014730929
    • Schafer, R.W., Rabiner, L.R., 1970. System for automatic formant analysis of voiced speech. J. Acoust. Soc. Amer. 47 (2), 634-648.
  • 16
    • 23744446244
    • Shao, X., Milner, B., 2005. Predicting fundamental frequency from mel-frequency cepstral coefficients to enable speech reconstruction. J. Acoust. Soc. Amer. 118 (2), 1134-1143.
  • 18
    • 33750371975
    • Sorin, A., Ramabadran, T., 2003. Extended advanced front end algorithm description, Version 1.1. Tech. Rep. ES 202 212, ETSI STQ-Aurora DSR Working Group, April.
  • 20
    • 0029746535
    • Welling, L., Ney, H., 1996. A model for efficient formant estimation. In: ICASSP, Atlanta, GA, May, Vol. 2, pp. 797-800.
  • 21
    • 0031647965
    • Welling, L., Ney, H., 1998. Formant estimation for speech recognition. IEEE Trans. Speech Audio Process. 6 (1), 36-48.
  • 22
    • 84864724283
    • Wilkinson, N., Russell, M.J., 2002. Improved phone recognition on TIMIT using formant frequency data and confidence measures. In: ICSLP, Denver, CO, September, pp. 2121-2124.
  • 23
    • 33745193696
    • Yan, Q., Vaseghi, S., Zavarehei, E., Milner, B., 2005. Formant-tracking linear prediction models for speech processing in noisy environments. In: Eurospeech, Lisbon, Portugal, September, pp. 2081-2084.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.