메뉴 건너뛰기




Volumn 4, Issue , 2007, Pages

Gammatone features and feature combination for large vocabulary speech recognition

Author keywords

Acoustic feature combination; Auditory systems; Feature extraction; Gammatone filterbank; Speech recognition

Indexed keywords

ACOUSTIC SIGNAL PROCESSING; BIT ERROR RATE; FEATURE EXTRACTION; LINEAR SYSTEMS; WORD PROCESSING;

EID: 34547539413     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2007.366996     Document Type: Conference Paper
Times cited : (185)

References (13)
  • 1
    • 0019200387 scopus 로고
    • Spectro-Temporal Receptive Fields of Auditory Neurons in the Grassfrog,
    • Nov
    • A. M. H. J. Aertsen, P. I. M. Johannesma, D. J. Hermes: "Spectro-Temporal Receptive Fields of Auditory Neurons in the Grassfrog,'" Biological Cybernetics, Vol. 38, No. 4, pp. 235-248, Nov. 1980.
    • (1980) Biological Cybernetics , vol.38 , Issue.4 , pp. 235-248
    • Aertsen, A.M.H.J.1    Johannesma, P.I.M.2    Hermes, D.J.3
  • 2
    • 0017804799 scopus 로고
    • On Cochlear Encoding: Potentialities and Limitations of the Reverse-Correlation Technique
    • Jan
    • E. de Boer, H. R. de Jongh: "On Cochlear Encoding: Potentialities and Limitations of the Reverse-Correlation Technique," The Journal of the Acoustical Society of America, Vol. 63, No. 1, pp. 115-135, Jan. 1978.
    • (1978) The Journal of the Acoustical Society of America , vol.63 , Issue.1 , pp. 115-135
    • de Boer, E.1    de Jongh, H.R.2
  • 3
    • 0030638031 scopus 로고    scopus 로고
    • A Post-Processing System to Yield Reduced Word Error Rates: Recognizer Output Voting Error Reduction
    • ASRU, pp, Santa Barbara, CA, USA, Dec
    • J. G. Fiscus: "A Post-Processing System to Yield Reduced Word Error Rates: Recognizer Output Voting Error Reduction," in Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pp. 347-352, Santa Barbara, CA, USA, Dec. 1997.
    • (1997) Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop , pp. 347-352
    • Fiscus, J.G.1
  • 5
    • 0025126556 scopus 로고
    • A Cochlear Frequency-Position Function for Several Species - 29 Years Later
    • D. D. Greenwood: "A Cochlear Frequency-Position Function for Several Species - 29 Years Later," The Journal of the Acoustical Society of America, Vol. 87, No. 6, pp. 2592-2605, 1990.
    • (1990) The Journal of the Acoustical Society of America , vol.87 , Issue.6 , pp. 2592-2605
    • Greenwood, D.D.1
  • 8
    • 44949265179 scopus 로고    scopus 로고
    • J. Lööf, M. Bisani, C. Gollan, G. Heigold, B. Hoffmeister, C. Plahl, R. Schlüter, H. Ney: The 2006 RWTH Parliamentary Speeches Transcription System, in Proceedings of the International Conference on Spoken Language Processing (IC-SLP/Interspeech), pp. 105-108, Pittsburgh, PA, September 2006.
    • J. Lööf, M. Bisani, C. Gollan, G. Heigold, B. Hoffmeister, C. Plahl, R. Schlüter, H. Ney: "The 2006 RWTH Parliamentary Speeches Transcription System," in Proceedings of the International Conference on Spoken Language Processing (IC-SLP/Interspeech), pp. 105-108, Pittsburgh, PA, September 2006.
  • 11
    • 0003913694 scopus 로고
    • An Efficient Implementation of the Patterson-Holdsworth Auditory Filterbank
    • Technical Report 35, Apple Computer Co
    • M. Slaney: "An Efficient Implementation of the Patterson-Holdsworth Auditory Filterbank," Technical Report 35, Apple Computer Co., 1993.
    • (1993)
    • Slaney, M.1
  • 12
    • 44849084626 scopus 로고    scopus 로고
    • Integrated Project funded by the European Commission, Project No
    • Technology and Corpora for Speech to Speech Translation (TC-STAR), Integrated Project funded by the European Commission, Project No." FP6-506738, "2004-2007. http://www.tc-star.org.
    • (2004) Technology and Corpora for Speech to Speech Translation (TC-STAR)


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.