SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 15, Issue 6, 2007, Pages 1802-1817

Speech analysis in a model of the central auditory system

(2) Jeon, Woojay a Juang, B H b

a MOTOROLA INC (United States)

b GEORGIA INSTITUTE OF TECHNOLOGY (United States)

Author keywords

Auditory model; Central auditory system; Cortex; Dimension expansion; Noise robust; Speech

Indexed keywords

AUDITORY MODEL; CENTRAL AUDITORY SYSTEM; CORTEX; DIMENSION EXPANSION; NOISE ROBUST;

MAMMALS; SPEECH COMMUNICATION; SPEECH RECOGNITION;

CONTROL THEORY;

EID: 45549100188 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2007.900102 Document Type: Article

Times cited : (30)

References (51)

1
- 4544303183
- Speech discrimination based on multiscale spectro-temporal modulations
- May
- N. Mesgarani, S. Shamma, and M. Slaney, "Speech discrimination based on multiscale spectro-temporal modulations," in Proc. IEEE Int. Conf. Acoust., Speech. Signal Process.,May 2004, vol. 1, pp. 601-604.
- (2004) Proc. IEEE Int. Conf. Acoust., Speech. Signal Process , vol.1 , pp. 601-604
- Mesgarani, N.¹ Shamma, S.² Slaney, M.³

2
- 33947657978
- A biologically inspired approach to the cocktail party problem
- May
- M. Elhilali and S. Shamma, "A biologically inspired approach to the cocktail party problem," in Proc. IEEE Int. Conf. Acoust., Speech. Signal Process., May 2006, pp. 637-640.
- (2006) Proc. IEEE Int. Conf. Acoust., Speech. Signal Process , pp. 637-640
- Elhilali, M.¹ Shamma, S.²

3
- 33744994972
- Automatic speech recognition with an adaptation model motivated by auditory processing
- Jan
- M. Holmberg, D. Gelbart, and W. Hemmert, "Automatic speech recognition with an adaptation model motivated by auditory processing," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 1, pp. 43-49, Jan. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process , vol.14 , Issue.1 , pp. 43-49
- Holmberg, M.¹ Gelbart, D.² Hemmert, W.³

4
- 0025041264
- Perceptual linear predictive (PLP) analysis of speech
- H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech," J. Acoust. Soc. Amer., vol. 87, pp. 1738-1752, 1990.
- (1990) J. Acoust. Soc. Amer , vol.87 , pp. 1738-1752
- Hermansky, H.¹

5
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- Aug
- S. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 4, pp. 357-366, Aug. 1980.
- (1980) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-28 , Issue.4 , pp. 357-366
- Davis, S.¹ Mermelstein, P.²

6
- 0031187171
- Speech recognition by machines and humans
- Mar
- R. Lippmann, "Speech recognition by machines and humans," Speech Commun., vol. 22, no. 1, pp. 1-15, Mar. 1997.
- (1997) Speech Commun , vol.22 , Issue.1 , pp. 1-15
- Lippmann, R.¹

7
- 0016067897
- Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
- B. S. Atal, "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," J. Acoust. Soc. Amer., vol. 55, pp. 1304-1312, 1974.
- (1974) J. Acoust. Soc. Amer , vol.55 , pp. 1304-1312
- Atal, B.S.¹

8
- 0018455310
- Suppression of acoustic noise in speech using spectral subtraction
- Apr
- S. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Process., vol. 27, no. 2, pp. 113-120, Apr. 1979.
- (1979) IEEE Trans. Acoust., Speech, Signal Process , vol.27 , Issue.2 , pp. 113-120
- Boll, S.¹

9
- 0033099548
- On second-order statistics and linear estimation of cepstral coefficients
- Mar
- Y. Ephraim and M. Rahim, "On second-order statistics and linear estimation of cepstral coefficients," IEEE Trans. Speech Audio Process., vol. 7, no. 2, pp. 162-176, Mar. 1999.
- (1999) IEEE Trans. Speech Audio Process , vol.7 , Issue.2 , pp. 162-176
- Ephraim, Y.¹ Rahim, M.²

10
- 0001459635
- Frequency-domain maximum likelihood estimation for automatic speech recognition in additive and convolutive noises
- May
- Y. Zhao, "Frequency-domain maximum likelihood estimation for automatic speech recognition in additive and convolutive noises," IEEE Trans. Speech Audio Process., vol. 8, no. 3, pp. 255-266, May 2000.
- (2000) IEEE Trans. Speech Audio Process , vol.8 , Issue.3 , pp. 255-266
- Zhao, Y.¹

11
- 0026881830
- Gain-adapted hidden markov models for recognition of clean and noisy speech
- Jun
- Y. Ephraim, "Gain-adapted hidden markov models for recognition of clean and noisy speech," IEEE Trans. Signal Process., vol. 40, no. 6, pp. 1303-1316, Jun. 1992.
- (1992) IEEE Trans. Signal Process , vol.40 , Issue.6 , pp. 1303-1316
- Ephraim, Y.¹

12
- 0002671953
- A minimax classification approach with application to robust speech recognition
- Jan
- N. Merhav and C.-H. Lee, "A minimax classification approach with application to robust speech recognition," IEEE Trans. Speech Audio Process., vol. 1, no. 1, pp. 90-100, Jan. 1993.
- (1993) IEEE Trans. Speech Audio Process , vol.1 , Issue.1 , pp. 90-100
- Merhav, N.¹ Lee, C.-H.²

13
- 0018437122
- Automatic speech recognition using psychoacoustic models
- E. Zwicker, E. Terhardt, and E. Paulus, "Automatic speech recognition using psychoacoustic models," J. Acoust. Soc. Amer., vol. 65, pp. 487-498, 1979.
- (1979) J. Acoust. Soc. Amer , vol.65 , pp. 487-498
- Zwicker, E.¹ Terhardt, E.² Paulus, E.³

14
- 0023859986
- Auditory neural feedback as a basis for speech processing
- O. Ghitza, "Auditory neural feedback as a basis for speech processing," in Proc. IEEE Int. Conf. Acoust., Speech. Signal Process., 1988, vol. 1, pp. 91-94.
- (1988) Proc. IEEE Int. Conf. Acoust., Speech. Signal Process , vol.1 , pp. 91-94
- Ghitza, O.¹

15
- 0024392496
- Application of an auditory model to speech recognition
- J. R. Cohen, "Application of an auditory model to speech recognition," J. Acoust. Soc. Amer., vol. 85, pp. 2623-2629, 1989.
- (1989) J. Acoust. Soc. Amer , vol.85 , pp. 2623-2629
- Cohen, J.R.¹

16
- 0032828464
- A model of auditory perception as front end for automatic speech recognition
- Oct
- J. Tchorz and B. Kollmeier, "A model of auditory perception as front end for automatic speech recognition," J. Acoust. Soc. Amer., vol. 106, no. 4, pp. 2040-2050, Oct. 1999.
- (1999) J. Acoust. Soc. Amer , vol.106 , Issue.4 , pp. 2040-2050
- Tchorz, J.¹ Kollmeier, B.²

17
- 0029345416
- A comparison of signal processing front ends for automatic word recognition
- Jul
- C. R. Jankowski, H.-D. H. Vo, and R. P. Lippmann, "A comparison of signal processing front ends for automatic word recognition," IEEE Trans. Speech Audio Process., vol. 3, no. 4, pp. 286-293, Jul. 1995.
- (1995) IEEE Trans. Speech Audio Process , vol.3 , Issue.4 , pp. 286-293
- Jankowski, C.R.¹ Vo, H.-D.H.² Lippmann, R.P.³

18
- 0031647650
- Speech analysis and recognition using interval statistics generated from a composite auditory model
- Jan
- H. Sheikhzadeh and L. Deng, "Speech analysis and recognition using interval statistics generated from a composite auditory model," IEEE Trans. Speech Audio Process., vol. 6, no. 1, pp. 90-94, Jan. 1998.
- (1998) IEEE Trans. Speech Audio Process , vol.6 , Issue.1 , pp. 90-94
- Sheikhzadeh, H.¹ Deng, L.²

19
- 0031238095
- A model of dynamic auditory perception and its application to robust word recognition
- Sep
- B. Strope and A. Alwan, "A model of dynamic auditory perception and its application to robust word recognition," IEEE Trans. Speech Audio Process., vol. 5, no. 5, pp. 451-464, Sep. 1997.
- (1997) IEEE Trans. Speech Audio Process , vol.5 , Issue.5 , pp. 451-464
- Strope, B.¹ Alwan, A.²

20
- 0003760813
- Central auditory model for spectral processing
- Apr
- Y. Gao, T. Huang, and J.-P. Haton, "Central auditory model for spectral processing," in Proc. IEEE Int. Conf. Acoust., Speech. Signal Process., Apr. 1993, pp. 704-707.
- (1993) Proc. IEEE Int. Conf. Acoust., Speech. Signal Process , pp. 704-707
- Gao, Y.¹ Huang, T.² Haton, J.-P.³

21
- 85009227802
- Localized spectro-temporal features for automatic speech recognition
- M. Kleinschmidt, "Localized spectro-temporal features for automatic speech recognition," in Proc. Interspeech'02, 2002, pp. 2573-2576.
- (2002) Proc. Interspeech'02 , pp. 2573-2576
- Kleinschmidt, M.¹

22
- 17044376280
- Cascade classifiers for audio classification
- Aug
- S. Ravindran and D. V. Anderson, "Cascade classifiers for audio classification," in Proc. IEEE Digital Signal Process. Workshop, Aug. 2004, pp. 366-370.
- (2004) Proc. IEEE Digital Signal Process. Workshop , pp. 366-370
- Ravindran, S.¹ Anderson, D.V.²

23
- 0026626445
- Auditory representations of acoustic signals
- Mar
- X. Yang, K. Wang, and S. A. Shamma, "Auditory representations of acoustic signals," IEEE Trans. Inf. Theory, vol. 38, no. 2, pp. 824-839, Mar. 1992.
- (1992) IEEE Trans. Inf. Theory , vol.38 , Issue.2 , pp. 824-839
- Yang, X.¹ Wang, K.² Shamma, S.A.³

24
- 0029378080
- Spectral shape analysis in the central auditory system
- Sep
- K. Wang and S. A. Shamma, "Spectral shape analysis in the central auditory system," IEEE Trans. Speech Audio Process., vol. 3, no. 5, pp. 382-395, Sep. 1995.
- (1995) IEEE Trans. Speech Audio Process , vol.3 , Issue.5 , pp. 382-395
- Wang, K.¹ Shamma, S.A.²

25
- 64549116937
- J. Syka and M. M. Merzenich, Eds. New York: Springer
- Plasticity and Signal Representation in the Auditory SystemJ. Syka and M. M. Merzenich, Eds. New York: Springer, 2005.
- (2005) Plasticity and Signal Representation in the Auditory System

26
- 0034710863
- Auditory neuroscience: Development, transduction, and integration
- A. J. Hudspeth and M. Konishi, "Auditory neuroscience: Development, transduction, and integration," Proc. National Academy Sci., pp. 11690-11691, 2000.
- (2000) Proc. National Academy Sci , pp. 11690-11691
- Hudspeth, A.J.¹ Konishi, M.²

27
- 23744508888
- Multiresolution spectrotemporal analysis of complex sounds
- Aug
- T. Chi, P. Ru, and S. A. Shamma, "Multiresolution spectrotemporal analysis of complex sounds," J. Acoust. Soc. Amer., vol. 118, no. 2, pp. 887-906, Aug. 2005.
- (2005) J. Acoust. Soc. Amer , vol.118 , Issue.2 , pp. 887-906
- Chi, T.¹ Ru, P.² Shamma, S.A.³

28
- 0028462212
- Self-normalization and noise-robustness in early auditory representations
- Jul
- K. Wang and S. Shamma, "Self-normalization and noise-robustness in early auditory representations," IEEE Trans. Speech Audio Process., vol. 2, no. 3, pp. 421-435, Jul. 1994.
- (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.3 , pp. 421-435
- Wang, K.¹ Shamma, S.²

29
- 0003513556
- 2nd ed. Englewood Cliffs, NJ: Prentice-Hall
- A. V. Oppenheim and R. W. Schafer, Discrete-Time Signal Processing, 2nd ed. Englewood Cliffs, NJ: Prentice-Hall, 1999.
- (1999) Discrete-Time Signal Processing
- Oppenheim, A.V.¹ Schafer, R.W.²

30
- 0003424145
- Piscataway, NJ: IEEE Press
- J. R. Deller Jr., J. H. L. Hansen, and J. G. Proakis, Discrete-Time Processing of Speech Signals. Piscataway, NJ: IEEE Press, 2000.
- (2000) Discrete-Time Processing of Speech Signals
- Deller Jr., J.R.¹ Hansen, J.H.L.² Proakis, J.G.³

31
- 0003789815
- New York: Academic
- B. C. J. Moore, An Introduction to the Psychology of Hearing. New York: Academic, 2003.
- (2003) An Introduction to the Psychology of Hearing
- Moore, B.C.J.¹

32
- 79251542316
- A computational model of filtering, detection, and compression in the cochlea
- May
- R. Lyon, "A computational model of filtering, detection, and compression in the cochlea," in Proc. IEEE Int. Conf. Acoust., Speech. Signal Process., May 1982, vol. 7, pp. 1282-1285.
- (1982) Proc. IEEE Int. Conf. Acoust., Speech. Signal Process , vol.7 , pp. 1282-1285
- Lyon, R.¹

33
- 0021794508
- Cochlear modeling, IEEE Acoust., Speech
- Jan
- J. Allen, "Cochlear modeling," IEEE Acoust., Speech, Signal Process. Mag., vol. 2, no. 1, pp. 3-29, Jan. 1985.
- (1985) Signal Process. Mag , vol.2 , Issue.1 , pp. 3-29
- Allen, J.¹

34
- 33750418033
- Properties of auditory model representations
- F. S. Perdigao and L. V. Sa, "Properties of auditory model representations," in Proc. Eurospeech'97, 1997, pp. 2499-2502.
- (1997) Proc. Eurospeech'97 , pp. 2499-2502
- Perdigao, F.S.¹ Sa, L.V.²

35
- 0022873930
- A computational model for the peripheral auditory system: Application of speech recognition research
- Apr
- S. Seneff, "A computational model for the peripheral auditory system: Application of speech recognition research," in Proc. IEEE Int. Conf. Acoust., Speech. Signal Process., Apr. 1986, pp. 1983-1986.
- (1986) Proc. IEEE Int. Conf. Acoust., Speech. Signal Process , pp. 1983-1986
- Seneff, S.¹

36
- 84873975005
- Online, Available
- HTK Speech Recognition Toolkit, [Online]. Available: http://htk.eng.cam.ac.uk/
- HTK Speech Recognition Toolkit

37
- 64549088551
- The Institute for Systems Research, Online, Available
- The Institute for Systems Research. [Online]. Available: http://www.isr.umd.edu/CAAR/

38
- 0030740959
- Laminar fine structure of frequency organization in auditory midbrain
- Jul
- C. E. Schreiner and G. Langner, "Laminar fine structure of frequency organization in auditory midbrain," Nature, vol. 388, pp. 383-386, Jul. 1997.
- (1997) Nature , vol.388 , pp. 383-386
- Schreiner, C.E.¹ Langner, G.²

39
- 0034037502
- Modular organization of frequency integration in primary auditory cortex
- Mar
- C. E. Schreiner, H. L. Read, and M. L. Sutter, "Modular organization of frequency integration in primary auditory cortex," Annu. Rev. Neurosci., vol. 23, pp. 501-529, Mar. 2000.
- (2000) Annu. Rev. Neurosci , vol.23 , pp. 501-529
- Schreiner, C.E.¹ Read, H.L.² Sutter, M.L.³

40
- 0003684441
- Cambridge, MA: MIT Press
- A. S. Bregman, Auditory Scene Analysis: The Perceptual Organization of Sound. Cambridge, MA: MIT Press, 1994.
- (1994) Auditory Scene Analysis: The Perceptual Organization of Sound
- Bregman, A.S.¹

41
- 0003837293
- Englewood Cliffs, NJ: Prentice-Hall
- S. M. Kay, Fundamentals of Statistical Signal Processing: Estimation Theory. Englewood Cliffs, NJ: Prentice-Hall, 1993.
- (1993) Fundamentals of Statistical Signal Processing: Estimation Theory
- Kay, S.M.¹

42
- 0004199580
- 2nd ed. Englewood Cliffs, NJ: Prentice-Hall
- A. V. Oppenheim and A. S. Willsky, Signals and Systems, 2nd ed. Englewood Cliffs, NJ: Prentice-Hall, 1997.
- (1997) Signals and Systems
- Oppenheim, A.V.¹ Willsky, A.S.²

43
- 0029238302
- Subband analysis for robust speech recognition in the presence of car noise
- May
- E. Erzin, A. E. Cetin, and Y. Yardimci, "Subband analysis for robust speech recognition in the presence of car noise," in Proc. IEEE Int. Conf. Acoust., Speech. Signal Process., May 1995, pp. 417-420.
- (1995) Proc. IEEE Int. Conf. Acoust., Speech. Signal Process , pp. 417-420
- Erzin, E.¹ Cetin, A.E.² Yardimci, Y.³

44
- 84962871227
- Robust speech recognition using wavelet coefficient features
- Dec
- M. Gupta and A. Gilbert, "Robust speech recognition using wavelet coefficient features," in Proc. IEEE Workshop ASRU 2001, Dec. 2001, pp. 445-448.
- (2001) Proc. IEEE Workshop ASRU 2001 , pp. 445-448
- Gupta, M.¹ Gilbert, A.²

45
- 0037340693
- Distinct brain regions associated with syllable and phoneme
- W. T. Siok, Z. Jin, P. Fletcher, and L. H. Tan, "Distinct brain regions associated with syllable and phoneme," Human Brain Mapping, vol. 18, pp. 201-207, 2003.
- (2003) Human Brain Mapping , vol.18 , pp. 201-207
- Siok, W.T.¹ Jin, Z.² Fletcher, P.³ Tan, L.H.⁴

46
- 0030960693
- Lefthemisphere specialization for the processing of acoustic transients
- I. S. Johnsrude, R. J. Zatorre, B. A. Milner, and A. C. Evans, "Lefthemisphere specialization for the processing of acoustic transients," NeuroReport, vol. 8, pp. 1761-1765, 1997.
- (1997) NeuroReport , vol.8 , pp. 1761-1765
- Johnsrude, I.S.¹ Zatorre, R.J.² Milner, B.A.³ Evans, A.C.⁴

47
- 64549112338
- R. O. Duda, P. E. Hart, and D. G. Stork, Pattern Classification. New York: Wiley, 2001, pp. 117-170.
- R. O. Duda, P. E. Hart, and D. G. Stork, Pattern Classification. New York: Wiley, 2001, pp. 117-170.

48
- 33745190989
- A category-dependent feature selection method for speech signals
- Lisbon, Portugal, Sep
- W. Jeon and B. -H. Juang, "A category-dependent feature selection method for speech signals," in Proc. Interspeech'05, Lisbon, Portugal, Sep. 2005, pp. 365-368.
- (2005) Proc. Interspeech'05 , pp. 365-368
- Jeon, W.¹ Juang, B.-H.²

49
- 0035145191
- Hierarchical organization of the human auditory cortex revealed by functional magnetic resonance imaging
- C. M. Wessinger, J. VanMeter, B. Tian, J. V. Lare, J. Pekar, and J. P. Rauschecker, "Hierarchical organization of the human auditory cortex revealed by functional magnetic resonance imaging," J. Cognitive Neurosci., vol. 13, no. 1, pp. 1-7, 2001.
- (2001) J. Cognitive Neurosci , vol.13 , Issue.1 , pp. 1-7
- Wessinger, C.M.¹ VanMeter, J.² Tian, B.³ Lare, J.V.⁴ Pekar, J.⁵ Rauschecker, J.P.⁶

50
- 0024768209
- Speaker-independent phone recognition using hidden markov models
- Nov
- K.-F. Lee and H.-W. Hon, "Speaker-independent phone recognition using hidden markov models," IEEE Trans. Acoust., Speech, Signal Process., vol. 37, no. 11, pp. 1641-1648, Nov. 1989.
- (1989) IEEE Trans. Acoust., Speech, Signal Process , vol.37 , Issue.11 , pp. 1641-1648
- Lee, K.-F.¹ Hon, H.-W.²

51
- 33745185781
- Hidden conditional random fields for phone classification
- Lisbon, Portugal, Sep
- A. Gunawardana, M. Mahajan, A. Acero, and J. C. Platt, "Hidden conditional random fields for phone classification," in Interspeech'05, Lisbon, Portugal, Sep. 2005, pp. 1117-1120.
- (2005) Interspeech'05 , pp. 1117-1120
- Gunawardana, A.¹ Mahajan, M.² Acero, A.³ Platt, J.C.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.