Volume , Issue , 2003, Pages 2573-2576

Localized spectro-temporal features for automatic speech recognition

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTICS; FEATURE EXTRACTION; MODULATION; SPEECH COMMUNICATION

EID: 85009227802     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 57

References (27)
  • 1
    • 0032139768
    • Should recognizers have ears?
    • H. Hermansky, "Should recognizers have ears?," Speech Communication, vol. 25, pp. 3-24, 1998.
    • (1998) Speech Communication , vol.25 , pp. 3-24
    • Hermansky, H.1
  • 2
    • 0031187171
    • Speech recognition by machines and humans
    • R.P. Lippmann, "Speech recognition by machines and humans," Speech Communication, vol. 22, pp. 1-15, 1997.
    • (1997) Speech Communication , vol.22 , pp. 1-15
    • Lippmann, R.P.1
  • 3
    • 0028517164
    • RASTA processing of speech
    • H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Trans. SAP, vol. 2, no. 4, pp. 578-589, 1994.
    • (1994) IEEE Trans. SAP , vol.2 , Issue.4 , pp. 578-589
    • Hermansky, H.1    Morgan, N.2
  • 4
    • 85009254284
    • TRAPS-Classifiers of temporal patterns
    • H. Hermansky and S. Sharma, "TRAPS-Classifiers of temporal patterns," in ICSLP, 1998, vol. 3, pp. 1003-1006.
    • (1998) ICSLP , vol.3 , pp. 1003-1006
    • Hermansky, H.1    Sharma, S.2
  • 5
    • 0003045511
    • Spectral envelope coding in cat primary auditory cortex: Properties of ripple transfer functions
    • C.E. Schreiner and B.M. Calhoun, "Spectral envelope coding in cat primary auditory cortex: properties of ripple transfer functions," Auditory Neuroscience, vol. 1, pp. 39-61, 1994.
    • (1994) Auditory Neuroscience , vol.1 , pp. 39-61
    • Schreiner, C.E.1    Calhoun, B.M.2
  • 6
    • 0035097825
    • Spectro-temporal response field characterization with dynamic ripples in ferret primary auditory cortex
    • D.A. Depireux, J.Z. Simon, D.J. Klein, and S.A. Shamma, "Spectro-temporal response field characterization with dynamic ripples in ferret primary auditory cortex," J. Neurophysiol., vol. 85, pp. 1220-1234, 2001.
    • (2001) J. Neurophysiol. , vol.85 , pp. 1220-1234
    • Depireux, D.A.1    Simon, J.Z.2    Klein, D.J.3    Shamma, S.A.4
  • 7
    • 0036082510
    • Spectrotemporal receptive fields in the lemniscal auditory cortex
    • L.M. Miller, M.A. Escabi, H.L. Read, and C.E. Schreiner, "Spectrotemporal receptive fields in the lemniscal auditory cortex," J. Neurophysiol., vol. 87, pp. 516-527, 2002.
    • (2002) J. Neurophysiol. , vol.87 , pp. 516-527
    • Miller, L.M.1    Escabi, M.A.2    Read, H.L.3    Schreiner, C.E.4
  • 10
    • 0030699329
    • Modeling auditory processing of amplitude modulation: II. Spectral and temporal integration
    • T. Dau, B. Kollmeier, and A. Kohlrausch, "Modeling auditory processing of amplitude modulation: II. Spectral and temporal integration," JASA, vol. 102, pp. 2906-2919, 1997.
    • (1997) JASA , vol.102 , pp. 2906-2919
    • Dau, T.1    Kollmeier, B.2    Kohlrausch, A.3
  • 11
    • 0040290402
    • Spectrotemporal modulation transfer functions and speech intelligibility
    • T. Chi, Y. Gao, M. C. Guyton, P. Ru, and S. Shamma, "Spectrotemporal modulation transfer functions and speech intelligibility," JASA, vol. 106, no. 5, pp. 2719-2732, 1999.
    • (1999) JASA , vol.106 , Issue.5 , pp. 2719-2732
    • Chi, T.1    Gao, Y.2    Guyton, M.C.3    Ru, P.4    Shamma, S.5
  • 13
    • 85128367018
    • Speech intelligibility derived from exceedingly sparse spectral information
    • S. Greenberg, T. Arai, and R. Silipo, "Speech intelligibility derived from exceedingly sparse spectral information," in ICSLP, 1998.
    • (1998) ICSLP
    • Greenberg, S.1    Arai, T.2    Silipo, R.3
  • 14
    • 85009113629
    • The relation between speech intelligibility and the complex modulation spectrum
    • S. Greenberg and T. Arai, "The relation between speech intelligibility and the complex modulation spectrum," in Eurospeech, 2001.
    • (2001) Eurospeech
    • Greenberg, S.1    Arai, T.2
  • 15
    • 0032136330
    • Robust speech recognition using the modulation spectrogram
    • B. Kingsbury, N. Morgan, and S. Greenberg, "Robust speech recognition using the modulation spectrogram," Speech Communication, vol. 25, no. 1, pp. 117-132, 1998.
    • (1998) Speech Communication , vol.25 , Issue.1 , pp. 117-132
    • Kingsbury, B.1    Morgan, N.2    Greenberg, S.3
  • 16
    • 0032828464
    • A model of auditory perception as front end for automatic speech recognition
    • J. Tchorz and B. Kollmeier, "A model of auditory perception as front end for automatic speech recognition," J. Acoust. Soc. Am., vol. 106, no. 4, pp. 2040-2050, 1999.
    • (1999) J. Acoust. Soc. Am. , vol.106 , Issue.4 , pp. 2040-2050
    • Tchorz, J.1    Kollmeier, B.2
  • 17
    • 0032676337
    • On the relative importance of various components of the modulation spectrum for automatic speech recognition
    • N. Kanedera, T. Arai, H. Hermansky, and M. Pavel, "On the relative importance of various components of the modulation spectrum for automatic speech recognition," Speech Communication, vol. 28, pp. 43-55, 1999.
    • (1999) Speech Communication , vol.28 , pp. 43-55
    • Kanedera, N.1    Arai, T.2    Hermansky, H.3    Pavel, M.4
  • 18
    • 0034817674
    • Time and frequency filtering of filter-bank energies for robust HMM speech recognition
    • C. Nadeu, D. Macho, and J. Hernando, "Time and frequency filtering of filter-bank energies for robust HMM speech recognition," Speech Communication, vol. 34, no. 1-2, pp. 93-114, 2001.
    • (2001) Speech Communication , vol.34 , Issue.1-2 , pp. 93-114
    • Nadeu, C.1    Macho, D.2    Hernando, J.3
  • 19
    • 0141703362
    • Experiments with linear and nonlinear feature transformations in HMM-based phone recognition
    • P. Somervuo, "Experiments with linear and nonlinear feature transformations in HMM-based phone recognition," in ICASSP, 2003.
    • (2003) ICASSP
    • Somervuo, P.1
  • 20
    • 85009181008
    • Beyond a single critical band in TRAP-based ASR
    • P. Jain and H. Hermansky, "Beyond a single critical band in TRAP-based ASR," in Eurospeech, 2003, submitted.
    • (2003) Eurospeech
    • Jain, P.1    Hermansky, H.2
  • 21
    • 3142695111
    • Hybrid HMM/ANN systems for speech recognition: Overview and new research directions
    • Lect. Notes in AI, vol. 1387, Giles, C.L. and Gori, M.
    • H. Bourlard and N. Morgan, "Hybrid HMM/ANN systems for speech recognition: Overview and new research directions," in Adaptive Processing of Sequences and Data Structures, vol. 1387 of Lect. Notes in AI, pp. 389-417. Giles, C.L. and Gori, M., 1998.
    • (1998) Adaptive Processing of Sequences and Data Structures , pp. 389-417
    • Bourlard, H.1    Morgan, N.2
  • 22
    • 0033709098
    • Tandem connectionist feature extraction for conventional HMM systems
    • H. Hermansky, D.P.W. Ellis, and S. Sharma, "Tandem connectionist feature extraction for conventional HMM systems," in ICASSP, 2000.
    • (2000) ICASSP
    • Hermansky, H.1    Ellis, D.P.W.2    Sharma, S.3
  • 23
    • 0025383284
    • Recognition of isolated words based on psychoacoustics and neurobiology
    • T. Gramß and H.W. Strube, "Recognition of isolated words based on psychoacoustics and neurobiology," Speech Communication, vol. 9, pp. 35-40, 1990.
    • (1990) Speech Communication , vol.9 , pp. 35-40
    • Gramß, T.1    Strube, H.W.2
  • 24
    • 14244272507
    • Methods for capturing spectro-temporal modulations in automatic speech recognition
    • M. Kleinschmidt, "Methods for capturing spectro-temporal modulations in automatic speech recognition," Acustica united with Acta Acustica, vol. 88, pp. 416-422, 2002.
    • (2002) Acustica United with Acta Acustica , vol.88 , pp. 416-422
    • Kleinschmidt, M.1
  • 26
    • 85009233038
    • Improving word accuracy with Gabor feature extraction
    • M. Kleinschmidt and D. Gelbart, "Improving word accuracy with Gabor feature extraction," in ICSLP, 2002.
    • (2002) ICSLP
    • Kleinschmidt, M.1    Gelbart, D.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.