Volume 17, Issue 6, 2010, Pages 551-554

Spectral moment features augmented by low order cepstral coefficients for robust ASR

Author keywords

First spectral moment; Low order cepstral coefficients; Robust speech recognition; SMAC

Indexed keywords

AUTOMATIC SPEECH RECOGNITION; CENTRAL FREQUENCY; CEPSTRAL COEFFICIENTS; FREQUENCY DOMAINS; LOW ORDER; ROBUST ASR; ROBUST SPEECH RECOGNITION; SPECTRAL MOMENTS; SPECTRAL TILT; SPEECH SPECTRA; TIME-FREQUENCY DISTRIBUTIONS

EID: 77951729327     PISSN: 10709908     EISSN: None     Source Type: Journal    
DOI: 10.1109/LSP.2010.2046349     Document Type: Article
Times cited: 26

References (12)
  • 1
    • 0035278964
    • Time-frequency distributions for automatic speech recognition
    • DOI 10.1109/89.905994, PII S1063667601016674
    • A. Potamianos and P. Maragos, "Time-frequency distributions for automatic speech recognition," IEEE Trans. Speech Audio Process., vol. 9, no. 3, pp. 196-200, Mar. 2001. (Pubitemid 32286593)
    • (2001) IEEE Transactions on Speech and Audio Processing, vol. 9, no. 3, pp. 196-200
    • Potamianos, A.; Maragos, P.
  • 2
    • 84937035392
    • Estimating and interpreting the instantaneous frequency of a signal-Part 1: Fundamentals
    • B. Boashash, "Estimating and interpreting the instantaneous frequency of a signal-Part 1: Fundamentals," Proc. IEEE, vol. 80, pp. 520-538, 1992.
    • (1992) Proc. IEEE, vol. 80, pp. 520-538
    • Boashash, B.
  • 3
    • 0030008906
    • Speech formant frequency and bandwidth tracking using multiband energy demodulation
    • DOI 10.1121/1.414997
    • A. Potamianos and P. Maragos, "Speech formant frequency and bandwidth tracking using multiband energy demodulation," J. Acoust. Soc. Amer., vol. 99, pp. 3795-3806, Jun. 1996. (Pubitemid 26190269)
    • (1996) Journal of the Acoustical Society of America, vol. 99, no. 6, pp. 3795-3806
    • Potamianos, A.; Maragos, P.
  • 4
    • 66149120614
    • Speaker identification using instantaneous frequencies
    • M. Grimaldi and F. Cummins, "Speaker identification using instantaneous frequencies," IEEE Trans. Audio, Speech Lang. Process., vol. 16, no. 6, pp. 1097-1111, Aug. 2008.
    • (2008) IEEE Trans. Audio, Speech Lang. Process., vol. 16, no. 6, pp. 1097-1111
    • Grimaldi, M.; Cummins, F.
  • 5
    • 0442326756
    • Recognition in noisy speech using dynamic spectral subband centroids
    • J. Chen, Y. A. Huang, Q. Li, and K. K. Paliwal, "Recognition in noisy speech using dynamic spectral subband centroids," IEEE Signal Process. Lett., vol. 11, no. 2, pp. 258-261, Feb. 2004.
    • (2004) IEEE Signal Process. Lett., vol. 11, no. 2, pp. 258-261
    • Chen, J.; Huang, Y.A.; Li, Q.; Paliwal, K.K.
  • 6
    • 27644455860
    • Robust AM-FM features for speech recognition
    • DOI 10.1109/LSP.2005.853050
    • D. Dimitriadis, P. Maragos, and A. Potamianos, "Robust AM-FM features for speech recognition," IEEE Signal Process. Lett., vol. 12, no. 9, pp. 621-624, Sep. 2005. (Pubitemid 41554399)
    • (2005) IEEE Signal Processing Letters, vol. 12, no. 9, pp. 621-624
    • Dimitriadis, D.; Maragos, P.; Potamianos, A.
  • 7
    • 85009074671
    • A mixture of Gaussians front end for speech recognition
    • M. N. Stuttle and M. J. F. Gales, "A mixture of Gaussians front end for speech recognition," in EUROSPEECH, 2001.
    • (2001) EUROSPEECH
    • Stuttle, M.N.; Gales, M.J.F.
  • 8
    • 85009192384
    • Frequency-related representation of speech
    • K. K. Paliwal and B. S. Atal, "Frequency-related representation of speech," in EUROSPEECH, 2003.
    • (2003) EUROSPEECH
    • Paliwal, K.K.; Atal, B.S.
  • 9
    • 77949410020
    • Short-time instantaneous frequency and bandwidth features for speech recognition
    • P. Tsiakoulis, A. Potamianos, and D. Dimitriadis, "Short-time instantaneous frequency and bandwidth features for speech recognition," in ASRU, 2009.
    • (2009) ASRU
    • Tsiakoulis, P.; Potamianos, A.; Dimitriadis, D.
  • 10
    • 0025041264
    • Perceptual linear predictive (PLP) analysis of speech
    • DOI 10.1121/1.399423
    • H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech," J. Acoust. Soc. Amer., vol. 87, no. 4, pp. 1738-1752, 1990. (Pubitemid 20256470)
    • (1990) Journal of the Acoustical Society of America, vol. 87, no. 4, pp. 1738-1752
    • Hermansky, H.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.