메뉴 건너뛰기




Volumn 15, Issue 6, 2007, Pages 1802-1817

Speech analysis in a model of the central auditory system

Author keywords

Auditory model; Central auditory system; Cortex; Dimension expansion; Noise robust; Speech

Indexed keywords

AUDITORY MODEL; CENTRAL AUDITORY SYSTEM; CORTEX; DIMENSION EXPANSION; NOISE ROBUST;

EID: 45549100188     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2007.900102     Document Type: Article
Times cited : (30)

References (51)
  • 3
    • 33744994972 scopus 로고    scopus 로고
    • Automatic speech recognition with an adaptation model motivated by auditory processing
    • Jan
    • M. Holmberg, D. Gelbart, and W. Hemmert, "Automatic speech recognition with an adaptation model motivated by auditory processing," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 1, pp. 43-49, Jan. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process , vol.14 , Issue.1 , pp. 43-49
    • Holmberg, M.1    Gelbart, D.2    Hemmert, W.3
  • 4
    • 0025041264 scopus 로고
    • Perceptual linear predictive (PLP) analysis of speech
    • H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech," J. Acoust. Soc. Amer., vol. 87, pp. 1738-1752, 1990.
    • (1990) J. Acoust. Soc. Amer , vol.87 , pp. 1738-1752
    • Hermansky, H.1
  • 5
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Aug
    • S. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 4, pp. 357-366, Aug. 1980.
    • (1980) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-28 , Issue.4 , pp. 357-366
    • Davis, S.1    Mermelstein, P.2
  • 6
    • 0031187171 scopus 로고    scopus 로고
    • Speech recognition by machines and humans
    • Mar
    • R. Lippmann, "Speech recognition by machines and humans," Speech Commun., vol. 22, no. 1, pp. 1-15, Mar. 1997.
    • (1997) Speech Commun , vol.22 , Issue.1 , pp. 1-15
    • Lippmann, R.1
  • 7
    • 0016067897 scopus 로고
    • Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
    • B. S. Atal, "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," J. Acoust. Soc. Amer., vol. 55, pp. 1304-1312, 1974.
    • (1974) J. Acoust. Soc. Amer , vol.55 , pp. 1304-1312
    • Atal, B.S.1
  • 8
    • 0018455310 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • Apr
    • S. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Process., vol. 27, no. 2, pp. 113-120, Apr. 1979.
    • (1979) IEEE Trans. Acoust., Speech, Signal Process , vol.27 , Issue.2 , pp. 113-120
    • Boll, S.1
  • 9
    • 0033099548 scopus 로고    scopus 로고
    • On second-order statistics and linear estimation of cepstral coefficients
    • Mar
    • Y. Ephraim and M. Rahim, "On second-order statistics and linear estimation of cepstral coefficients," IEEE Trans. Speech Audio Process., vol. 7, no. 2, pp. 162-176, Mar. 1999.
    • (1999) IEEE Trans. Speech Audio Process , vol.7 , Issue.2 , pp. 162-176
    • Ephraim, Y.1    Rahim, M.2
  • 10
    • 0001459635 scopus 로고    scopus 로고
    • Frequency-domain maximum likelihood estimation for automatic speech recognition in additive and convolutive noises
    • May
    • Y. Zhao, "Frequency-domain maximum likelihood estimation for automatic speech recognition in additive and convolutive noises," IEEE Trans. Speech Audio Process., vol. 8, no. 3, pp. 255-266, May 2000.
    • (2000) IEEE Trans. Speech Audio Process , vol.8 , Issue.3 , pp. 255-266
    • Zhao, Y.1
  • 11
    • 0026881830 scopus 로고
    • Gain-adapted hidden markov models for recognition of clean and noisy speech
    • Jun
    • Y. Ephraim, "Gain-adapted hidden markov models for recognition of clean and noisy speech," IEEE Trans. Signal Process., vol. 40, no. 6, pp. 1303-1316, Jun. 1992.
    • (1992) IEEE Trans. Signal Process , vol.40 , Issue.6 , pp. 1303-1316
    • Ephraim, Y.1
  • 12
    • 0002671953 scopus 로고
    • A minimax classification approach with application to robust speech recognition
    • Jan
    • N. Merhav and C.-H. Lee, "A minimax classification approach with application to robust speech recognition," IEEE Trans. Speech Audio Process., vol. 1, no. 1, pp. 90-100, Jan. 1993.
    • (1993) IEEE Trans. Speech Audio Process , vol.1 , Issue.1 , pp. 90-100
    • Merhav, N.1    Lee, C.-H.2
  • 13
    • 0018437122 scopus 로고
    • Automatic speech recognition using psychoacoustic models
    • E. Zwicker, E. Terhardt, and E. Paulus, "Automatic speech recognition using psychoacoustic models," J. Acoust. Soc. Amer., vol. 65, pp. 487-498, 1979.
    • (1979) J. Acoust. Soc. Amer , vol.65 , pp. 487-498
    • Zwicker, E.1    Terhardt, E.2    Paulus, E.3
  • 15
    • 0024392496 scopus 로고
    • Application of an auditory model to speech recognition
    • J. R. Cohen, "Application of an auditory model to speech recognition," J. Acoust. Soc. Amer., vol. 85, pp. 2623-2629, 1989.
    • (1989) J. Acoust. Soc. Amer , vol.85 , pp. 2623-2629
    • Cohen, J.R.1
  • 16
    • 0032828464 scopus 로고    scopus 로고
    • A model of auditory perception as front end for automatic speech recognition
    • Oct
    • J. Tchorz and B. Kollmeier, "A model of auditory perception as front end for automatic speech recognition," J. Acoust. Soc. Amer., vol. 106, no. 4, pp. 2040-2050, Oct. 1999.
    • (1999) J. Acoust. Soc. Amer , vol.106 , Issue.4 , pp. 2040-2050
    • Tchorz, J.1    Kollmeier, B.2
  • 17
    • 0029345416 scopus 로고
    • A comparison of signal processing front ends for automatic word recognition
    • Jul
    • C. R. Jankowski, H.-D. H. Vo, and R. P. Lippmann, "A comparison of signal processing front ends for automatic word recognition," IEEE Trans. Speech Audio Process., vol. 3, no. 4, pp. 286-293, Jul. 1995.
    • (1995) IEEE Trans. Speech Audio Process , vol.3 , Issue.4 , pp. 286-293
    • Jankowski, C.R.1    Vo, H.-D.H.2    Lippmann, R.P.3
  • 18
    • 0031647650 scopus 로고    scopus 로고
    • Speech analysis and recognition using interval statistics generated from a composite auditory model
    • Jan
    • H. Sheikhzadeh and L. Deng, "Speech analysis and recognition using interval statistics generated from a composite auditory model," IEEE Trans. Speech Audio Process., vol. 6, no. 1, pp. 90-94, Jan. 1998.
    • (1998) IEEE Trans. Speech Audio Process , vol.6 , Issue.1 , pp. 90-94
    • Sheikhzadeh, H.1    Deng, L.2
  • 19
    • 0031238095 scopus 로고    scopus 로고
    • A model of dynamic auditory perception and its application to robust word recognition
    • Sep
    • B. Strope and A. Alwan, "A model of dynamic auditory perception and its application to robust word recognition," IEEE Trans. Speech Audio Process., vol. 5, no. 5, pp. 451-464, Sep. 1997.
    • (1997) IEEE Trans. Speech Audio Process , vol.5 , Issue.5 , pp. 451-464
    • Strope, B.1    Alwan, A.2
  • 21
    • 85009227802 scopus 로고    scopus 로고
    • Localized spectro-temporal features for automatic speech recognition
    • M. Kleinschmidt, "Localized spectro-temporal features for automatic speech recognition," in Proc. Interspeech'02, 2002, pp. 2573-2576.
    • (2002) Proc. Interspeech'02 , pp. 2573-2576
    • Kleinschmidt, M.1
  • 23
    • 0026626445 scopus 로고
    • Auditory representations of acoustic signals
    • Mar
    • X. Yang, K. Wang, and S. A. Shamma, "Auditory representations of acoustic signals," IEEE Trans. Inf. Theory, vol. 38, no. 2, pp. 824-839, Mar. 1992.
    • (1992) IEEE Trans. Inf. Theory , vol.38 , Issue.2 , pp. 824-839
    • Yang, X.1    Wang, K.2    Shamma, S.A.3
  • 24
    • 0029378080 scopus 로고
    • Spectral shape analysis in the central auditory system
    • Sep
    • K. Wang and S. A. Shamma, "Spectral shape analysis in the central auditory system," IEEE Trans. Speech Audio Process., vol. 3, no. 5, pp. 382-395, Sep. 1995.
    • (1995) IEEE Trans. Speech Audio Process , vol.3 , Issue.5 , pp. 382-395
    • Wang, K.1    Shamma, S.A.2
  • 26
    • 0034710863 scopus 로고    scopus 로고
    • Auditory neuroscience: Development, transduction, and integration
    • A. J. Hudspeth and M. Konishi, "Auditory neuroscience: Development, transduction, and integration," Proc. National Academy Sci., pp. 11690-11691, 2000.
    • (2000) Proc. National Academy Sci , pp. 11690-11691
    • Hudspeth, A.J.1    Konishi, M.2
  • 27
    • 23744508888 scopus 로고    scopus 로고
    • Multiresolution spectrotemporal analysis of complex sounds
    • Aug
    • T. Chi, P. Ru, and S. A. Shamma, "Multiresolution spectrotemporal analysis of complex sounds," J. Acoust. Soc. Amer., vol. 118, no. 2, pp. 887-906, Aug. 2005.
    • (2005) J. Acoust. Soc. Amer , vol.118 , Issue.2 , pp. 887-906
    • Chi, T.1    Ru, P.2    Shamma, S.A.3
  • 28
    • 0028462212 scopus 로고
    • Self-normalization and noise-robustness in early auditory representations
    • Jul
    • K. Wang and S. Shamma, "Self-normalization and noise-robustness in early auditory representations," IEEE Trans. Speech Audio Process., vol. 2, no. 3, pp. 421-435, Jul. 1994.
    • (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.3 , pp. 421-435
    • Wang, K.1    Shamma, S.2
  • 32
    • 79251542316 scopus 로고
    • A computational model of filtering, detection, and compression in the cochlea
    • May
    • R. Lyon, "A computational model of filtering, detection, and compression in the cochlea," in Proc. IEEE Int. Conf. Acoust., Speech. Signal Process., May 1982, vol. 7, pp. 1282-1285.
    • (1982) Proc. IEEE Int. Conf. Acoust., Speech. Signal Process , vol.7 , pp. 1282-1285
    • Lyon, R.1
  • 33
    • 0021794508 scopus 로고
    • Cochlear modeling, IEEE Acoust., Speech
    • Jan
    • J. Allen, "Cochlear modeling," IEEE Acoust., Speech, Signal Process. Mag., vol. 2, no. 1, pp. 3-29, Jan. 1985.
    • (1985) Signal Process. Mag , vol.2 , Issue.1 , pp. 3-29
    • Allen, J.1
  • 34
    • 33750418033 scopus 로고    scopus 로고
    • Properties of auditory model representations
    • F. S. Perdigao and L. V. Sa, "Properties of auditory model representations," in Proc. Eurospeech'97, 1997, pp. 2499-2502.
    • (1997) Proc. Eurospeech'97 , pp. 2499-2502
    • Perdigao, F.S.1    Sa, L.V.2
  • 35
    • 0022873930 scopus 로고
    • A computational model for the peripheral auditory system: Application of speech recognition research
    • Apr
    • S. Seneff, "A computational model for the peripheral auditory system: Application of speech recognition research," in Proc. IEEE Int. Conf. Acoust., Speech. Signal Process., Apr. 1986, pp. 1983-1986.
    • (1986) Proc. IEEE Int. Conf. Acoust., Speech. Signal Process , pp. 1983-1986
    • Seneff, S.1
  • 37
    • 64549088551 scopus 로고    scopus 로고
    • The Institute for Systems Research, Online, Available
    • The Institute for Systems Research. [Online]. Available: http://www.isr.umd.edu/CAAR/
  • 38
    • 0030740959 scopus 로고    scopus 로고
    • Laminar fine structure of frequency organization in auditory midbrain
    • Jul
    • C. E. Schreiner and G. Langner, "Laminar fine structure of frequency organization in auditory midbrain," Nature, vol. 388, pp. 383-386, Jul. 1997.
    • (1997) Nature , vol.388 , pp. 383-386
    • Schreiner, C.E.1    Langner, G.2
  • 39
    • 0034037502 scopus 로고    scopus 로고
    • Modular organization of frequency integration in primary auditory cortex
    • Mar
    • C. E. Schreiner, H. L. Read, and M. L. Sutter, "Modular organization of frequency integration in primary auditory cortex," Annu. Rev. Neurosci., vol. 23, pp. 501-529, Mar. 2000.
    • (2000) Annu. Rev. Neurosci , vol.23 , pp. 501-529
    • Schreiner, C.E.1    Read, H.L.2    Sutter, M.L.3
  • 44
    • 84962871227 scopus 로고    scopus 로고
    • Robust speech recognition using wavelet coefficient features
    • Dec
    • M. Gupta and A. Gilbert, "Robust speech recognition using wavelet coefficient features," in Proc. IEEE Workshop ASRU 2001, Dec. 2001, pp. 445-448.
    • (2001) Proc. IEEE Workshop ASRU 2001 , pp. 445-448
    • Gupta, M.1    Gilbert, A.2
  • 45
    • 0037340693 scopus 로고    scopus 로고
    • Distinct brain regions associated with syllable and phoneme
    • W. T. Siok, Z. Jin, P. Fletcher, and L. H. Tan, "Distinct brain regions associated with syllable and phoneme," Human Brain Mapping, vol. 18, pp. 201-207, 2003.
    • (2003) Human Brain Mapping , vol.18 , pp. 201-207
    • Siok, W.T.1    Jin, Z.2    Fletcher, P.3    Tan, L.H.4
  • 46
    • 0030960693 scopus 로고    scopus 로고
    • Lefthemisphere specialization for the processing of acoustic transients
    • I. S. Johnsrude, R. J. Zatorre, B. A. Milner, and A. C. Evans, "Lefthemisphere specialization for the processing of acoustic transients," NeuroReport, vol. 8, pp. 1761-1765, 1997.
    • (1997) NeuroReport , vol.8 , pp. 1761-1765
    • Johnsrude, I.S.1    Zatorre, R.J.2    Milner, B.A.3    Evans, A.C.4
  • 47
    • 64549112338 scopus 로고    scopus 로고
    • R. O. Duda, P. E. Hart, and D. G. Stork, Pattern Classification. New York: Wiley, 2001, pp. 117-170.
    • R. O. Duda, P. E. Hart, and D. G. Stork, Pattern Classification. New York: Wiley, 2001, pp. 117-170.
  • 48
    • 33745190989 scopus 로고    scopus 로고
    • A category-dependent feature selection method for speech signals
    • Lisbon, Portugal, Sep
    • W. Jeon and B. -H. Juang, "A category-dependent feature selection method for speech signals," in Proc. Interspeech'05, Lisbon, Portugal, Sep. 2005, pp. 365-368.
    • (2005) Proc. Interspeech'05 , pp. 365-368
    • Jeon, W.1    Juang, B.-H.2
  • 49
    • 0035145191 scopus 로고    scopus 로고
    • Hierarchical organization of the human auditory cortex revealed by functional magnetic resonance imaging
    • C. M. Wessinger, J. VanMeter, B. Tian, J. V. Lare, J. Pekar, and J. P. Rauschecker, "Hierarchical organization of the human auditory cortex revealed by functional magnetic resonance imaging," J. Cognitive Neurosci., vol. 13, no. 1, pp. 1-7, 2001.
    • (2001) J. Cognitive Neurosci , vol.13 , Issue.1 , pp. 1-7
    • Wessinger, C.M.1    VanMeter, J.2    Tian, B.3    Lare, J.V.4    Pekar, J.5    Rauschecker, J.P.6
  • 50
    • 0024768209 scopus 로고
    • Speaker-independent phone recognition using hidden markov models
    • Nov
    • K.-F. Lee and H.-W. Hon, "Speaker-independent phone recognition using hidden markov models," IEEE Trans. Acoust., Speech, Signal Process., vol. 37, no. 11, pp. 1641-1648, Nov. 1989.
    • (1989) IEEE Trans. Acoust., Speech, Signal Process , vol.37 , Issue.11 , pp. 1641-1648
    • Lee, K.-F.1    Hon, H.-W.2
  • 51
    • 33745185781 scopus 로고    scopus 로고
    • Hidden conditional random fields for phone classification
    • Lisbon, Portugal, Sep
    • A. Gunawardana, M. Mahajan, A. Acero, and J. C. Platt, "Hidden conditional random fields for phone classification," in Interspeech'05, Lisbon, Portugal, Sep. 2005, pp. 1117-1120.
    • (2005) Interspeech'05 , pp. 1117-1120
    • Gunawardana, A.1    Mahajan, M.2    Acero, A.3    Platt, J.C.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.