SCOPUS 정보 검색 플랫폼

Proceedings of the IEEE

Volumn 84, Issue 9, 1996, Pages 1199-1215

Time-Frequency Analysis and Auditory Modeling for Automatic Recognition of Speech

(3) Pitton, James W a,c Wang, Kuansan b Juang, Biing Hwang a

a Parametric Technology Corporation (United States)

b VERIZON (United States)

c LUCENT TECHNOLOGIES (United States)

Author keywords

[No Author keywords available]

Indexed keywords

FREQUENCY DOMAIN ANALYSIS; PHYSIOLOGICAL MODELS; SPEECH CODING; SPEECH INTELLIGIBILITY; SPEECH PROCESSING; SPEECH TRANSMISSION; SPURIOUS SIGNAL NOISE; STATISTICAL METHODS; TIME DOMAIN ANALYSIS;

AUDITORY MODELS; SPEECH WAVEFORM; TIME FREQUENCY ANALYSIS;

SPEECH RECOGNITION;

EID: 0030244273 PISSN: 00189219 EISSN: None Source Type: Journal
DOI: 10.1109/5.535241 Document Type: Article

Times cited : (52)

References (115)

1
- 0021794508
- Cochlear modeling
- J. Allen, Cochlear modeling, IEEE ASSP Mag., Jan. 1985.
- (1985) IEEE ASSP Mag., Jan.
- Allen, J.¹

2
- 33646912649
- Perceptually-based dynamic spectrograms
- T. Applebaum and B. Hanson, "Perceptually-based dynamic spectrograms," in Visual Representations of Speech Signals, M. Cooke and S. Beet, Eds. London: Wiley, 1993, pp. 153-160.
- Visual Representations of Speech Signals, M. Cooke and S. Beet, Eds. London: Wiley, 1993 , pp. 153-160
- Applebaum, T.¹ Hanson, B.²

3
- 84942219496
- Autocorrelogram models of the segregation of competing voices
- P. Assmann and D. Paschall, "Autocorrelogram models of the segregation of competing voices," in Proc. 15th Winter ARO Meet., St. Petersburg, FL, Feb. 1992.
- (1992) Proc. 15th Winter ARO Meet., St. Petersburg, FL, Feb.
- Assmann, P.¹ Paschall, D.²

4
- 0024592122
- Modeling the perception of concurrent vowels: Vowels with the same fundamental frequency
- P. Assmann and Q. Summerfield, "Modeling the perception of concurrent vowels: vowels with the same fundamental frequency," J. Acoust. Soc. Amer., vol. 85, no. 1, pp. 327-338, 1989.
- (1989) J. Acoust. Soc. Amer. , vol.85 , Issue.1 , pp. 327-338
- Assmann, P.¹ Summerfield, Q.²

5
- 0025003184
- Modeling the perception of concurrent vowels: Vowels with different fundamental frequencies
- "Modeling the perception of concurrent vowels: vowels with different fundamental frequencies," J. Acoust. Soc. Amer., vol. 88, no. 2, pp. 680-697, 1990.
- (1990) J. Acoust. Soc. Amer. , vol.88 , Issue.2 , pp. 680-697

6
- 0025682331
- New nonstationary techniques for the analysis and display of speech transients
- L. Atlas, W. Kooiman, P. Loughlin, and R. Cole, "New nonstationary techniques for the analysis and display of speech transients," in Proc. ICASSP'90, pp. 385-388.
- Proc. ICASSP'90 , pp. 385-388
- Atlas, L.¹ Kooiman, W.² Loughlin, P.³ Cole, R.⁴

7
- 0026388701
- Truly nonstationary techniques for the analysis and display of voiced speech
- L. Atlas, P. Loughlin, and J. Pitton, 'Truly nonstationary techniques for the analysis and display of voiced speech," in Proc. ICASSP'91, pp. 433-436.
- Proc. ICASSP'91 , pp. 433-436
- Atlas, L.¹ Loughlin, P.² Pitton, J.³

8
- 0027373113
- Objective analysis versus subjective assessment of vowels pronounced by native, nonnative and deaf male speakers of Dutch
- B. Bakkum, R. Plomp, and L. Pols, "Objective analysis versus subjective assessment of vowels pronounced by native, nonnative and deaf male speakers of Dutch," J. Acoust. Soc. Amer., vol. 94, no. 10, pp. 1989-2004, 1993.
- (1993) J. Acoust. Soc. Amer. , vol.94 , Issue.10 , pp. 1989-2004
- Bakkum, B.¹ Plomp, R.² Pols, L.³

9
- 0027577041
- A signal-dependent time-frequency representation: Optimal kernel design
- R. Baraniuk and D. Jones, "A signal-dependent time-frequency representation: optimal kernel design," IEEE Trans. Signal Process., vol. 41, no. 4, pp. 1589-1602, 1993.
- (1993) IEEE Trans. Signal Process. , vol.41 , Issue.4 , pp. 1589-1602
- Baraniuk, R.¹ Jones, D.²

10
- 0026185556
- Zero-crossing rates of functions of gaussian processes
- J. Barnett and B. Kedem, "Zero-crossing rates of functions of gaussian processes," IEEE Trans. Inform. Theory, vol. 37, pp. 1188-1194, Apr. 1991.
- (1991) IEEE Trans. Inform. Theory , vol.37 , pp. 1188-1194
- Barnett, J.¹ Kedem, B.²

11
- 33646940920
- Optimal real-time signal processing in the nervous system
- W. Bialek, "Optimal real-time signal processing in the nervous system," in Neural Systems: Analysis and Modeling, F. H. Eeckman, Ed. Amsterdam: Kluwer, 1993, pp. 5-28.
- Neural Systems: Analysis and Modeling, F. H. Eeckman, Ed. Amsterdam: Kluwer, 1993 , pp. 5-28
- Bialek, W.¹

12
- 0018664543
- Acoustic invariance in speech production: Evidence from measurements of the spectral characteristics of stop consonants
- S. Blumstein and K. Stevens, "Acoustic invariance in speech production: evidence from measurements of the spectral characteristics of stop consonants," J. Acoust. Soc. Amer., vol. 66, no. 4, pp. 1001-1017, 1979.
- (1979) J. Acoust. Soc. Amer. , vol.66 , Issue.4 , pp. 1001-1017
- Blumstein, S.¹ Stevens, K.²

13
- 0003684441
- A. Bregman, Auditory Scene Analysis: The Perceptual Organization of Sound. Cambridge, MA: MIT Press, 1990.
- (1990) Auditory Scene Analysis: the Perceptual Organization of Sound. Cambridge, MA: MIT Press
- Bregman, A.¹

14
- 84941442230
- The auditory processing and recognition of speech
- W. Byrne, J. Robinson, and S. Shamma, "The auditory processing and recognition of speech," in Proc. Speech and Natural Lang. Workshop, 1989, pp. 325-331.
- Proc. Speech and Natural Lang. Workshop, 1989 , pp. 325-331
- Byrne, W.¹ Robinson, J.² Shamma, S.³

15
- 0027528776
- A model for the responses of low-frequency auditory nerve fibers in cats
- L. Camey, "A model for the responses of low-frequency auditory nerve fibers in cats," J. Acoust. Soc. Amer., vol. 93, no. 1, pp. 401-4117, 1993.
- (1993) J. Acoust. Soc. Amer. , vol.93 , Issue.1 , pp. 401-4117
- Camey, L.¹

16
- 0023294678
- A nonstationary model for the analysis of transient speech signals
- F. Casacuberta and E. Vidai, "A nonstationary model for the analysis of transient speech signals," IEEE Trans. Acoust. Speech Signal Process., vol. 35, pp. 226-228, Feb. 1987.
- (1987) IEEE Trans. Acoust. Speech Signal Process. , vol.35 , pp. 226-228
- Casacuberta, F.¹ Vidai, E.²

17
- 0026372224
- Combined multi-resolution (wideband/narrowband) spectrogram
- S. Cheung and J. Lim, "Combined multi-resolution (wideband/narrowband) spectrogram," in IEEE Proc. ICASSP'91, pp. 457-460.
- IEEE Proc. ICASSP'91 , pp. 457-460
- Cheung, S.¹ Lim, J.²

18
- 0024681555
- Improved time-frequency representation of multi-component signals using exponential kernels
- H. Choi and W. Williams, "Improved time-frequency representation of multi-component signals using exponential kernels," IEEE Trans. Acoust. Speech Signal Process., vol. 37, pp. 862-871, June 1989.
- (1989) IEEE Trans. Acoust. Speech Signal Process. , vol.37 , pp. 862-871
- Choi, H.¹ Williams, W.²

19
- 0141582037
- Generalized phase-space distribution functions
- L. Cohen, "Generalized phase-space distribution functions," J. Math. Phys., vol. 7, no. 5, pp. 781-786, 1966.
- (1966) J. Math. Phys. , vol.7 , Issue.5 , pp. 781-786
- Cohen, L.¹

20
- 0003733873
- New York: Prentice-Hall
- Time-Frequency Analysis. New York: Prentice-Hall, 1995.
- (1995) Time-Frequency Analysis

21
- 84957503277
- Instantaneous frequency, its standard deviation and multicomponent signals
- L. Cohen and C. Lee, "Instantaneous frequency, its standard deviation and multicomponent signals," SPIE Advanced Algs. Archs. Sig. Proc. Ill, vol. 975, pp. 186-208, 1988.
- (1988) SPIE Advanced Algs. Archs. Sig. Proc. Ill , vol.975 , pp. 186-208
- Cohen, L.¹ Lee, C.²

22
- 0009634522
- M. Cooke, S. Beet, and M. Crawford, Visual Representations of Speech Signais. New York: Wiley, 1993.
- (1993) Visual Representations of Speech Signais. New York: Wiley
- Cooke, M.¹ Beet, S.² Crawford, M.³

23
- 0019053271
- Comparison of parametric representation for monosyllable word recognition in continuously spoken sentences
- S. Davis and P. Mermelstein, "Comparison of parametric representation for monosyllable word recognition in continuously spoken sentences," IEEE Trans. Acoust. Speech Signal Process., vol. ASSP-28, pp. 357-366, Apr. 1980.
- (1980) IEEE Trans. Acoust. Speech Signal Process. , vol.28 , pp. 357-366
- Davis, S.¹ Mermelstein, P.²

24
- 84953656135
- Acoustic loci and transitional cues for consonants
- P. Delattre, A. Liberman, and F. Cooper, "Acoustic loci and transitional cues for consonants," J. Acoust. Soc. Amer., vol. 27, no. 4, pp. 769-773, 1955.
- (1955) J. Acoust. Soc. Amer. , vol.27 , Issue.4 , pp. 769-773
- Delattre, P.¹ Liberman, A.² Cooper, F.³

25
- 0026854213
- A generalized hidden Markov model with stateconditioned trend functions of time for the speech signal
- L. Deng, "A generalized hidden Markov model with stateconditioned trend functions of time for the speech signal," Signal Process., vol. 27, pp. 65-78, 1992.
- (1992) Signal Process. , vol.27 , pp. 65-78
- Deng, L.¹

26
- 0028516022
- Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states
- L. Deng, M. Aksmanovic, X. Sun, and C. Wu, "Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 507-520, 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 507-520
- Deng, L.¹ Aksmanovic, M.² Sun, X.³ Wu, C.⁴

27
- 84928839596
- A composite model of the auditory periphery for the processing of speech
- L. Deng, C. Geisler, and S. Greenberg, "A composite model of the auditory periphery for the processing of speech," J. Phonetics, vol. 16, no. 1, p. 93, 1988.
- (1988) J. Phonetics , vol.16 , Issue.1 , pp. 93
- Deng, L.¹ Geisler, C.² Greenberg, S.³

28
- 0027681974
- ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition
- V. Digilakis, J. Rohlicek, and M. Ostendorf, "ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition," IEEE Trans. Speech. Audio Process., vol. 1, no. 4, pp. 431-442, 1993.
- (1993) IEEE Trans. Speech. Audio Process. , vol.1 , Issue.4 , pp. 431-442
- Digilakis, V.¹ Rohlicek, J.² Ostendorf, M.³

29
- 33646922512
- Wigner distribution analysis of stop consonant release transients: Labials, velars, and labio-velars
- G. Dogil and W. Wokurek, "Wigner distribution analysis of stop consonant release transients: labials, velars, and labio-velars," in Proc. Int. Conf. Speech Res. '89, Budapest, pp. 1-4.
- Proc. Int. Conf. Speech Res. '89, Budapest , pp. 1-4
- Dogil, G.¹ Wokurek, W.²

30
- 33646936237
- Wigner time-frequency analysis for major places of articulation in stop consonants
- "Wigner time-frequency analysis for major places of articulation in stop consonants," in Proc. 12th Int. Cong. Phon. Sci., 1991, vol. 3, pp. 390-393.
- Proc. 12th Int. Cong. Phon. Sci., 1991 , vol.3 , pp. 390-393

31
- 0023904763
- Frequency importance function for a feature recognition test material
- V. Duggirala, G. Studebaker, C. Pavlovic, and R. Sherbecoe, "Frequency importance function for a feature recognition test material," J. Acoust. Soc. Amer., vol. 83, no. 6, pp. 2372-2382, 1988.
- (1988) J. Acoust. Soc. Amer. , vol.83 , Issue.6 , pp. 2372-2382
- Duggirala, V.¹ Studebaker, G.² Pavlovic, C.³ Sherbecoe, R.⁴

32
- 0038676741
- Methods of measuring vowel formant bandwidths
- H. Dünn, "Methods of measuring vowel formant bandwidths," J. Acoust. Soc. Amer., vol. 33, no. 12, pp. 1737-1746, 1961.
- (1961) J. Acoust. Soc. Amer. , vol.33 , Issue.12 , pp. 1737-1746
- Dünn, H.¹

33
- 0021881648
- Peripheral auditory adaptation and fatigue
- J. Eggermont, "Peripheral auditory adaptation and fatigue," Hearing Res., vol. 18, pp. 57-71, 1985.
- (1985) Hearing Res. , vol.18 , pp. 57-71
- Eggermont, J.¹

34
- 0003418124
- New York: Mouton
- G. Fant, Acoustic Theory of Speech Production, 2nd ed. New York: Mouton, 1970.
- (1970) Acoustic Theory of Speech Production, 2nd Ed.
- Fant, G.¹

35
- 0003757962
- J. Flanagan, Speech Analysis, Synthesis, and Perception. New York: Springer-Verlag, 1965.
- (1965) Speech Analysis, Synthesis, and Perception. New York: Springer-Verlag
- Flanagan, J.¹

36
- 0022667694
- Speaker-independent isolated word recognition using dynamic features of speech spectrum
- S. Furui, "Speaker-independent isolated word recognition using dynamic features of speech spectrum," IEEE Trans. Acoust. Speech Signal Process., vol. ASSP-34, pp. 52-59, Jan. 1986.
- (1986) IEEE Trans. Acoust. Speech Signal Process. , vol.34 , pp. 52-59
- Furui, S.¹

37
- 0022548705
- On the role of spectral transition for speech perception
- "On the role of spectral transition for speech perception," J. Acoust. Soc. Amer., vol. 80, no. 4, pp. 1016-1025, 1986.
- (1986) J. Acoust. Soc. Amer. , vol.80 , Issue.4 , pp. 1016-1025

38
- 33646934291
- Invariant acoustic cues in stop consonants: A cross-language study using the Wigner distribution
- H. Garudadri, J, Gilbert, A. Benguerel, and M. Beddoes, "Invariant acoustic cues in stop consonants: a cross-language study using the Wigner distribution," J. Acoust. Soc. Amer., vol. 82, no. S55, 1987.
- (1987) J. Acoust. Soc. Amer. , vol.82 , Issue.S55
- Garudadri, H.¹ Gilbert, J.² Benguerel, A.³ Beddoes, M.⁴

39
- 84991416125
- Auditory nerve representation as a front end for speech recognition in a noisy environment
- O. Ghitza, "Auditory nerve representation as a front end for speech recognition in a noisy environment," Computer Speech and Lang., vol. 1, pp. 109-130, 1986.
- (1986) Computer Speech and Lang. , vol.1 , pp. 109-130
- Ghitza, O.¹

40
- 0027578207
- Hidden Markov models with templates as nonstationary states: An application to speech recognition
- O. Ghitza and M. M. Sondhi, "Hidden Markov models with templates as nonstationary states: An application to speech recognition," Computer Speech and Lang., vol. 7, no. 2, pp. 101-119, 1993.
- (1993) Computer Speech and Lang. , vol.7 , Issue.2 , pp. 101-119
- Ghitza, O.¹ Sondhi, M.M.²

41
- 0004047397
- D. Green, Profile Analysis: Auditory Intensity Discrimination. New York: Oxford, 1988.
- (1988) Profile Analysis: Auditory Intensity Discrimination. New York: Oxford
- Green, D.¹

42
- 0020798029
- Time-dependent ARMA modeling of nonstationary signals
- Y. Grenier, "Time-dependent ARMA modeling of nonstationary signals," IEEE Trans. Acoust. Speech Signal Process., vol. 31, pp. 899-911, Apr. 1983.
- (1983) IEEE Trans. Acoust. Speech Signal Process. , vol.31 , pp. 899-911
- Grenier, Y.¹

43
- 0028206226
- The contribution of the murmur and vowel to the place of articulation distinction in nasal consonants
- J. Harrington, "The contribution of the murmur and vowel to the place of articulation distinction in nasal consonants," J. Acoust. Soc. Amer., vol. 96, no. 1, pp. 19-32, 1994.
- (1994) J. Acoust. Soc. Amer. , vol.96 , Issue.1 , pp. 19-32
- Harrington, J.¹

44
- 0027491495
- Effect of relative amplitude of frication on perception of place of articulations
- M. Hedrick and R. Ohde, "Effect of relative amplitude of frication on perception of place of articulations," J. Acoust. Soc. Amer., vol. 94, no. 4, pp. 2005-2026, 1993.
- (1993) J. Acoust. Soc. Amer. , vol.94 , Issue.4 , pp. 2005-2026
- Hedrick, M.¹ Ohde, R.²

45
- 0025041264
- Perceptual linear predictive (PLP) analysis for speech
- H. Hermansky, "Perceptual linear predictive (PLP) analysis for speech," J. Acoust. Soc. Amer., vol. 87, no. 4, pp. 1738-1752, 1990.
- (1990) J. Acoust. Soc. Amer. , vol.87 , Issue.4 , pp. 1738-1752
- Hermansky, H.¹

46
- 0028517164
- RASTA processing of speech
- H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 578-589, 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

47
- 0010423564
- Diphthong formants and their movements
- A. Holbrook and G. Fairbanks, "Diphthong formants and their movements," J. Speech Hearing Res., vol. 5, no. 1, pp. 38-58, 1962.
- (1962) J. Speech Hearing Res. , vol.5 , Issue.1 , pp. 38-58
- Holbrook, A.¹ Fairbanks, G.²

48
- 0028996918
- Measuring fine structure in speech: Application to speaker identification
- C. Jankowski, T. Quatieri, and D. Reynolds, "Measuring fine structure in speech: application to speaker identification," in Proc. ICASSP'95, pp. 325-328.
- Proc. ICASSP'95 , pp. 325-328
- Jankowski, C.¹ Quatieri, T.² Reynolds, D.³

49
- 0027684995
- Signal compression based on models of human perception
- N. Jayant, J. Johnston, and R. Safranek, "Signal compression based on models of human perception," Proc. IEEE, vol. 81, pp. 1385-1422, Oct. 1993.
- (1993) Proc. IEEE , vol.81 , pp. 1385-1422
- Jayant, N.¹ Johnston, J.² Safranek, R.³

50
- 0028040175
- Vowel identification in mixed-speaker silent-center syllables
- J. Jenkins, W. Strange, and S. Miranda, "Vowel identification in mixed-speaker silent-center syllables," J. Aconst. Soc. Amer., vol. 9, no. 2, pp. 1030-1041, 1994.
- (1994) J. Aconst. Soc. Amer. , vol.9 , Issue.2 , pp. 1030-1041
- Jenkins, J.¹ Strange, W.² Miranda, S.³

51
- 0022097649
- Maximum likelihood estimation for mixture multivariate stochastic observations of Markov chains
- B. H. Juang, "Maximum likelihood estimation for mixture multivariate stochastic observations of Markov chains," AT&T Tech. J., vol. 64, no. 6, pp. 1235-1249, 1985.
- (1985) AT&T Tech. J. , vol.64 , Issue.6 , pp. 1235-1249
- Juang, B.H.¹

52
- 33646944964
- Hierarchical AR model for time varying speech signals
- O. Kakusho and M. Yanagida, "Hierarchical AR model for time varying speech signals," in Proc. ICASSP'82, pp. 1295-1298.
- Proc. ICASSP'82 , pp. 1295-1298
- Kakusho, O.¹ Yanagida, M.²

53
- 0022806994
- Spectral analysis and discrimination by zerocrossings
- B. Kedem, "Spectral analysis and discrimination by zerocrossings," Proc. IEEE, vol. PROC-74, pp. 1477-1493, Nov. 1986.
- (1986) Proc. IEEE , vol.74 , pp. 1477-1493
- Kedem, B.¹

54
- 0020642039
- Time-varying features as correlates of place of articulation in stop consonants
- D. Kewley-Port, 'Time-varying features as correlates of place of articulation in stop consonants," J. Aconst. Soc. Amer., vol. 73, no. 1, pp. 322-335, 1983.
- (1983) J. Aconst. Soc. Amer. , vol.73 , Issue.1 , pp. 322-335
- Kewley-Port, D.¹

55
- 70349995741
- The sound spectrograph
- W. Koenig, H. Dunn, and L. Lacy, "The sound spectrograph," J. Aconst. Soc. Amer., vol. 18, no. 1, pp. 19-49, 1946.
- (1946) J. Aconst. Soc. Amer. , vol.18 , Issue.1 , pp. 19-49
- Koenig, W.¹ Dunn, H.² Lacy, L.³

56
- 0001463644
- A duplex theory of pitch perception
- J. Licklider, "A duplex theory of pitch perception," Experienlia, vol. 7, pp. 128-133, 1951.
- (1951) Experienlia , vol.7 , pp. 128-133
- Licklider, J.¹

57
- 84953653173
- Perturbations in vocal pitch
- P. Lieberman, "Perturbations in vocal pitch," J. Aconst. Soc. Amer., vol. 33, no. 5, pp. 597-603, 1961.
- (1961) J. Aconst. Soc. Amer. , vol.33 , Issue.5 , pp. 597-603
- Lieberman, P.¹

58
- 0016735638
- Linear estimation of nonstationary signals
- L. Liporace, "Linear estimation of nonstationary signals," J. Acoust. Soc. Amer., vol. 58, no. 6, pp. 1288-1295, 1975.
- (1975) J. Acoust. Soc. Amer. , vol.58 , Issue.6 , pp. 1288-1295
- Liporace, L.¹

59
- 84885564021
- Advanced time-frequency representations for speech processing
- P. Loughlin, L. Atlas, and J. Pitton, "Advanced time-frequency representations for speech processing," Visual Representations of Speech Signals, M. Cooke and S. Beet, Eds. London, U.K.: Wiley, 1993, pp. 27-53.
- Visual Representations of Speech Signals, M. Cooke and S. Beet, Eds. London, U.K.: Wiley, 1993 , pp. 27-53
- Loughlin, P.¹ Atlas, L.² Pitton, J.³

60
- 0027542642
- Bilinear time-frequency representations: New insights and properties
- "Bilinear time-frequency representations: new insights and properties," IEEE Trans. Signal Process., vol. 41, pp. 750-767, Feb. 1993.
- (1993) IEEE Trans. Signal Process. , vol.41 , pp. 750-767

61
- 0028517015
- Construction of positive time-frequency distributions
- "Construction of positive time-frequency distributions," IEEE Trans. Signal Process., vol. 42, pp. 2697-2705, Oct. 1994.
- (1994) IEEE Trans. Signal Process. , vol.42 , pp. 2697-2705

62
- 0028739812
- Approximating time-frequency density functions via optimal combinations of spectrograms
- P. Loughlin, J. Pitton, and B. Hannaford, "Approximating time-frequency density functions via optimal combinations of spectrograms," IEEE Signal Process. Lett., vol. 1, pp. 199-202, Dec. 1994.
- (1994) IEEE Signal Process. Lett. , vol.1 , pp. 199-202
- Loughlin, P.¹ Pitton, J.² Hannaford, B.³

63
- 0021204483
- Computational models of neural auditory processing
- R. Lyon, "Computational models of neural auditory processing," Proc. ICASSP'84, 1984.
- (1984) Proc. ICASSP'84
- Lyon, R.¹

64
- 0027676955
- Energy separation in signal modulations with application to speech analysis
- P. Maragos, J. Kaiser, and T. Quatieri, "Energy separation in signal modulations with application to speech analysis," IEEE Trans. Signal Process., vol. 41, pp. 3024-3051, Oct. 1993.
- (1993) IEEE Trans. Signal Process. , vol.41 , pp. 3024-3051
- Maragos, P.¹ Kaiser, J.² Quatieri, T.³

65
- 33646923066
- J. Markel and A. Gray, Linear Prediction of Speech. New York: Kluwer, 1989.
- (1989) Linear Prediction of Speech. New York: Kluwer
- Markel, J.¹ Gray, A.²

66
- 0026654967
- Modeling the identification of concurrent vowels with different fundamental frequencies
- R. Meddis and M. Hewitt, "Modeling the identification of concurrent vowels with different fundamental frequencies," J. Aconst. Soc. Amer., vol. 91, no. 1, pp. 233-245, 1992.
- (1992) J. Aconst. Soc. Amer. , vol.91 , Issue.1 , pp. 233-245
- Meddis, R.¹ Hewitt, M.²

67
- 0027409390
- Voice source model for continuous control of pitch period
- P. Milenkovic, "Voice source model for continuous control of pitch period," J. Aconst. Soc. Amer., vol. 93, no. 2, pp. 1087-1096, 1993.
- (1993) J. Aconst. Soc. Amer. , vol.93 , Issue.2 , pp. 1087-1096
- Milenkovic, P.¹

68
- 0022737369
- Adaptive identification of a time-varying ARMA speech model
- Y. Miyanaga, N. Miki, and N. Nagai, "Adaptive identification of a time-varying ARMA speech model," IEEE Trans. Aconst. Speech, Signal Process., vol. ASSP-34, pp. 423-433, Mar. 1986.
- (1986) IEEE Trans. Aconst. Speech, Signal Process. , vol.34 , pp. 423-433
- Miyanaga, Y.¹ Miki, N.² Nagai, N.³

69
- 0003789815
- B. Moore, An Introduction to the Psychology of Hearing, 3rd ed. New York: Academic, 1989.
- (1989) An Introduction to the Psychology of Hearing, 3rd Ed. New York: Academic
- Moore, B.¹

70
- 0028997032
- Co-channel speaker separation
- D. Morgan, E. George, L. Lee, and S. Kay, "Co-channel speaker separation," in Proc. ICASSP'95, vol. 1, pp. 828-831.
- Proc. ICASSP'95 , vol.1 , pp. 828-831
- Morgan, D.¹ George, E.² Lee, L.³ Kay, S.⁴

71
- 0028996926
- Stochastic perceptual models of speech, Proc
- N. Morgan et al., "Stochastic perceptual models of speech," Proc. ICASSP'95, vol. 1, pp. 397-400.
- ICASSP'95 , vol.1 , pp. 397-400
- Morgan, N.¹

72
- 33646946122
- Filtering the time sequence of spectral parameters for speaker-independent CDHMM word recognition
- C. Nadeau, P. Paches-Leal, and B. H. Juang, "Filtering the time sequence of spectral parameters for speaker-independent CDHMM word recognition," Eurospeech95, Sept. 1995.
- (1995) Eurospeech95, Sept.
- Nadeau, C.¹ Paches-Leal, P.² Juang, B.H.³

73
- 33749761107
- Speech enhancement based on a new set of auditory constrained parameters
- S. Nandkumar and J. Hansen, "Speech enhancement based on a new set of auditory constrained parameters," in Proc. ICASSP'94, pp. 1-4.
- Proc. ICASSP'94 , pp. 1-4
- Nandkumar, S.¹ Hansen, J.²

74
- 0026142442
- A time-varying analysis method for rapid transitions in speech
- K. Nathan, Y. Lee, and H. Silverman, "A time-varying analysis method for rapid transitions in speech," IEEE Trans. Signal Process., vol. 39, pp. 815-824, Apr. 1991.
- (1991) IEEE Trans. Signal Process. , vol.39 , pp. 815-824
- Nathan, K.¹ Lee, Y.² Silverman, H.³

75
- 0028460992
- Time-varying feature selection and classification of unvoiced stop consonants
- K. Nathan and H. Silverman, "Time-varying feature selection and classification of unvoiced stop consonants," IEEE Trans. Speech Audio Process., vol. 2, no. 3, pp. 395-405, 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.3 , pp. 395-405
- Nathan, K.¹ Silverman, H.²

76
- 0028710576
- Neuromorphic speech processing for noisy environments
- Orlando, FL, June
- C. Neti, "Neuromorphic speech processing for noisy environments," in Proc. ICNN-94, Orlando, FL, June 1994, pp. 4425-4430.
- (1994) Proc. ICNN-94 , pp. 4425-4430
- Neti, C.¹

77
- 0000460671
- R. D. Patterson et al., "Complex Sounds and auditory images," in Auditory Physiology and Perception, Y. Gazais, L. Demany, and K. Honer, Eds. London: Pergamon, 1992, pp. 429-446.
- Complex Sounds and auditory images, Auditory Physiology and Perception, Y. Gazais, L. Demany, and K. Honer, Eds. London: Pergamon, 1992 , pp. 429-446
- Patterson, R.D.¹

78
- 0003663467
- A. Papoulis, Probability, Random Variables, and Stochastic Processes. New York: McGraw-Hill, 1984.
- (1984) Probability, Random Variables, and Stochastic Processes. New York: McGraw-Hill
- Papoulis, A.¹

79
- 84941328385
- Control methods used in a study of vowels
- G. Peterson and H. Barney, "Control methods used in a study of vowels," J. Acoust. Soc. Amer., vol. 24, no. 2, pp. 175-184, 1952.
- (1952) J. Acoust. Soc. Amer. , vol.24 , Issue.2 , pp. 175-184
- Peterson, G.¹ Barney, H.²

80
- 0004106903
- J. Pickles, An Introduction to the Physiology of Hearing. New York: Academic, 1988.
- (1988) An Introduction to the Physiology of Hearing. New York: Academic
- Pickles, J.¹

81
- 0028516834
- Applications of positive time-frequency distributions to speech processing
- J. Pitton, L. Atlas, and P. Loughlin, "Applications of positive time-frequency distributions to speech processing," IEEE Trans. Speech Audio. Process, vol. 2, no. 4, pp. 554-566, 1994.
- (1994) IEEE Trans. Speech Audio. Process , vol.2 , Issue.4 , pp. 554-566
- Pitton, J.¹ Atlas, L.² Loughlin, P.³

82
- 0026078506
- A computational model of afferent neural activity from the cochlea to the dorsal acoustic stria
- M. Pont and R. Damper, "A computational model of afferent neural activity from the cochlea to the dorsal acoustic stria," J. Acoust. Soc. Amer., vol. 89, no. 3, pp. 1213-1228, 1991.
- (1991) J. Acoust. Soc. Amer. , vol.89 , Issue.3 , pp. 1213-1228
- Pont, M.¹ Damper, R.²

83
- 33646913483
- R. Potter, G. Kopp, and H. Green, Visible Speech. New York: Van Nostrand, 1947.
- (1947) Visible Speech. New York: Van Nostrand
- Potter, R.¹ Kopp, G.² Green, H.³

84
- 33646937252
- L. Rabiner and B. H. Juang, Fundamentals of Speech Recognition. Englewood Cliffs, NJ: Prentice-Hall, 1993.
- (1993) Fundamentals of Speech Recognition. Englewood Cliffs, NJ: Prentice-Hall
- Rabiner, L.¹ Juang, B.H.²

85
- 0028996925
- Robust utterance verification for connected digits recognition
- M. Rahim, C. H. Lee, and B. H. Juang, "Robust utterance verification for connected digits recognition," in Proc. ICASSP'95, vol. 1, pp. 285-288.
- Proc. ICASSP'95 , vol.1 , pp. 285-288
- Rahim, M.¹ Lee, C.H.² Juang, B.H.³

86
- 0024589234
- Acoustic properties and perception of consonant release transients
- B. Repp and H. Lin, "Acoustic properties and perception of consonant release transients," J. Acoust. Soc. Amer., vol. 85, no. 1, pp. 379-396, 1989.
- (1989) J. Acoust. Soc. Amer. , vol.85 , Issue.1 , pp. 379-396
- Repp, B.¹ Lin, H.²

87
- 0028912840
- Auditory-nerve encoding of pinna-based spectral cues: Rate representation of high-frequency stimuli
- J. Rice, E. Young, and G. Spirou, "Auditory-nerve encoding of pinna-based spectral cues: rate representation of high-frequency stimuli," J. Acoust. Soc. Amer., vol. 97, no. 3, pp. 1764-1776, 1995.
- (1995) J. Acoust. Soc. Amer. , vol.97 , Issue.3 , pp. 1764-1776
- Rice, J.¹ Young, E.² Spirou, G.³

88
- 0011795236
- M. Riley, Speech Tune-Frequency Representations. New York: Kluwer, 1989.
- (1989) Speech Tune-Frequency Representations. New York: Kluwer
- Riley, M.¹

89
- 0037795511
- Frequency selectivity and the perception of speech
- S. Rosen and A. Fourcin, "Frequency selectivity and the perception of speech," in Frequency Selectivity in Hearing, B. Moore, Ed. New York: Academic, 1986, pp. 373-487.
- Frequency Selectivity in Hearing, B. Moore, Ed. New York: Academic, 1986 , pp. 373-487
- Rosen, S.¹ Fourcin, A.²

90
- 0020579183
- Auditory nerve representation of vowels in background noise
- M. Sachs, H. Voigt, and E. Young, "Auditory nerve representation of vowels in background noise," J. Neurophys., vol. 50, pp. 27-45, 1983.
- (1983) J. Neurophys. , vol.50 , pp. 27-45
- Sachs, M.¹ Voigt, H.² Young, E.³

91
- 0018617277
- Encoding of steady-state vowels in the auditory nerve: Representation in terms of discharge rate
- M. Sachs and E. Young, "Encoding of steady-state vowels in the auditory nerve: representation in terms of discharge rate," J. Acoust. Soc. Amer., vol. 66, no. 1, pp. 470-479, 1979.
- (1979) J. Acoust. Soc. Amer. , vol.66 , Issue.1 , pp. 470-479
- Sachs, M.¹ Young, E.²

92
- 0029239090
- A comparative study of mel cepstra and EIH for phone classification under adverse conditions
- S. Sandhu and O. Ghitza, "A comparative study of mel cepstra and EIH for phone classification under adverse conditions," in Proc. ICASSP'95, vol. 1, pp. 409-412.
- Proc. ICASSP'95 , vol.1 , pp. 409-412
- Sandhu, S.¹ Ghitza, O.²

93
- 84928837806
- A joint synchrony/mean-rate model of auditory processing
- S. Seneff, "A joint synchrony/mean-rate model of auditory processing," J. Phonetics, vol. 85, no. 1, pp. 55-76, 1988.
- (1988) J. Phonetics , vol.85 , Issue.1 , pp. 55-76
- Seneff, S.¹

94
- 0022348981
- Speech processing in the auditory system II: Lateral inhibition and the central processing of speech evoked activity in the auditory nerve
- S. Shamma, "Speech processing in the auditory system II: lateral inhibition and the central processing of speech evoked activity in the auditory nerve," J. Acoust. Soc. Amer., vol. 78, no. 5, pp. 1622-1632, 1985.
- (1985) J. Acoust. Soc. Amer. , vol.78 , Issue.5 , pp. 1622-1632
- Shamma, S.¹

95
- 84928841878
- The acoustic features of speech sounds in a model of auditory processing: Vowels and voiceless fricatives
- "The acoustic features of speech sounds in a model of auditory processing: vowels and voiceless fricatives," J. Phonetics, vol. 16, pp. 77-91, 1988.
- (1988) J. Phonetics , vol.16 , pp. 77-91

96
- 0020707077
- Responses of auditory-nerve fibers to consonant-vowel syllables
- D. Sinex and C. Geisler, "Responses of auditory-nerve fibers to consonant-vowel syllables," J. Acoust. Soc. Amer., vol. 73, no. 2, pp. 602-615, 1983.
- (1983) J. Acoust. Soc. Amer. , vol.73 , Issue.2 , pp. 602-615
- Sinex, D.¹ Geisler, C.²

97
- 0021461483
- Comparison of the responses of auditory-nerve fibers to consonant-vowel syllables with predictions from linear models
- "Comparison of the responses of auditory-nerve fibers to consonant-vowel syllables with predictions from linear models," J. Acoust. Soc. Amer., vol. 76, no. 1, pp. 116-121, 1984.
- (1984) J. Acoust. Soc. Amer. , vol.76 , Issue.1 , pp. 116-121

98
- 0002296637
- On the importance of time-a temporal representation of sound
- M. Slaney and R. Lyon, "On the importance of time-a temporal representation of sound," in Visual Representations of Speech Signals, M. Cooke, S. Beet, and M. Crawford, Eds. New York: Wiley, 1993, pp. 95-116.
- Visual Representations of Speech Signals, M. Cooke, S. Beet, and M. Crawford, Eds. New York: Wiley, 1993 , pp. 95-116
- Slaney, M.¹ Lyon, R.²

99
- 0028657430
- Accuracy of quasistationary analysis of highly dynamic speech signals
- R. Smits, "Accuracy of quasistationary analysis of highly dynamic speech signals," J. Acoitst. Soc. Amer., vol. 96, no. 6, pp. 3401-3415, 1994.
- (1994) J. Acoitst. Soc. Amer. , vol.96 , Issue.6 , pp. 3401-3415
- Smits, R.¹

100
- 33646919328
- personal communication
- M. M. Sondhi, personal communication.
- Sondhi, M.M.¹

101
- 0018038036
- Invariant cues for place of articulation in stop consonants
- K. Stevens and S. Blumstein, "Invariant cues for place of articulation in stop consonants," J. Acoust. Soc. Amer., vol. 64, no. 5, pp. 1358-1368, 1978.
- (1978) J. Acoust. Soc. Amer. , vol.64 , Issue.5 , pp. 1358-1368
- Stevens, K.¹ Blumstein, S.²

102
- 0020816189
- Dynamic specification of coarticulated vowels
- W. Strange, J. Jenkins, and T. Johnson, "Dynamic specification of coarticulated vowels," J. Acoust. Soc. Amer., vol. 74, no. 3, pp. 695-705, 1983.
- (1983) J. Acoust. Soc. Amer. , vol.74 , Issue.3 , pp. 695-705
- Strange, W.¹ Jenkins, J.² Johnson, T.³

103
- 0026030074
- Perception of concurrent vowels: Effects of harmonic misalignment and pitch-period asynchrony
- Q. Summerfield and P. Assmann, "Perception of concurrent vowels: effects of harmonic misalignment and pitch-period asynchrony," J. Acoust. Soc. Amer., vol. 89, no. 3, pp. 1364-1377, 1991.
- (1991) J. Acoust. Soc. Amer. , vol.89 , Issue.3 , pp. 1364-1377
- Summerfield, Q.¹ Assmann, P.²

104
- 0024929841
- Transient analysis of speech signals using the Wigner time-frequency representation
- E. Velez and R. Abshcr, 'Transient analysis of speech signals using the Wigner time-frequency representation," in Proc. ICASSP'89, pp. 2242-2245.
- Proc. ICASSP'89 , pp. 2242-2245
- Velez, E.¹ Abshcr, R.²

105
- 0001843298
- Theorie et applications de la notion de signal analytique
- J. Ville, "Theorie et applications de la notion de signal analytique," Cables et Transmissions, vol. 2A, no. 1, pp. 61-74, 1948;
- (1948) Cables Et Transmissions , vol.2 A , Issue.1 , pp. 61-74
- Ville, J.¹

106
- 0346642106
- Theory and applications of the notion of complex signal
- RAND Corp., Santa Monica, CA
- I. Selin, transi., "Theory and applications of the notion of complex signal," Tech. Rep. T-92, RAND Corp., Santa Monica, CA, 1958.
- (1958) Tech. Rep.
- Selin Transi, I.¹

107
- 0028997028
- Speech enhancement based on masking properties of the auditory system
- N. Virag, "Speech enhancement based on masking properties of the auditory system," in Proc. ICASSP'95, vol. 1, pp. 796-799.
- Proc. ICASSP'95 , vol.1 , pp. 796-799
- Virag, N.¹

108
- 0028462212
- Self-normalization and noiserobustness in early auditory representations
- K. Wang and S. Shamma, "Self-normalization and noiserobustness in early auditory representations," IEEE Trans. Speech Audio Process., vol. 2, pp. 421-435, 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , pp. 421-435
- Wang, K.¹ Shamma, S.²

109
- 33646919723
- A diffusion model of the transient response of the cochlear inner hair cell synapse
- L. Westerman and R. Smith, "A diffusion model of the transient response of the cochlear inner hair cell synapse," J. Acoust. Soc. Amer., vol. 93, no. 1, pp. 401-417, 1993.
- (1993) J. Acoust. Soc. Amer. , vol.93 , Issue.1 , pp. 401-417
- Westerman, L.¹ Smith, R.²

110
- 0021207990
- Rapid and short term adaptation in auditory nerve responses
- "Rapid and short term adaptation in auditory nerve responses," Hearing Res., vol. 15, pp. 249-260, 1985.
- (1985) Hearing Res. , vol.15 , pp. 249-260

111
- 33745014742
- On the quantum correction for thermodynamic equilibrium
- E. Wigner, "On the quantum correction for thermodynamic equilibrium," Phys. Rev., vol. 40, pp. 749-759, 1932.
- (1932) Phys. Rev. , vol.40 , pp. 749-759
- Wigner, E.¹

112
- 0018653975
- Least squares glottal inverse filtering from the acoustic speech waveform
- D. Wong, J. Markel, and A. Gray, "Least squares glottal inverse filtering from the acoustic speech waveform," IEEE Trans. Acoust. Speech Signal Process., vol. 27, no. 4, pp. 350-355, 1979.
- (1979) IEEE Trans. Acoust. Speech Signal Process. , vol.27 , Issue.4 , pp. 350-355
- Wong, D.¹ Markel, J.² Gray, A.³

113
- 0018606571
- Representation of steady-state vowels in the temporal aspects of the discharge patterns of populations of auditory-nerve fibers, J
- E. Young and M. Sachs, "Representation of steady-state vowels in the temporal aspects of the discharge patterns of populations of auditory-nerve fibers," J. Acoust. Soc. Amer., vol. 66, no. 3, pp. 1381-1403, 1979.
- (1979) Acoust. Soc. Amer. , vol.66 , Issue.3 , pp. 1381-1403
- Young, E.¹ Sachs, M.²

114
- 0027368837
- Spectral-shape features versus formants as acoustic correlates for vowels
- S. Zahorian and A. Jagharghi, "Spectral-shape features versus formants as acoustic correlates for vowels," J. Acoust. Soc. Amer., vol. 94, no. 4, pp. 1966-1982, 1993.
- (1993) J. Acoust. Soc. Amer. , vol.94 , Issue.4 , pp. 1966-1982
- Zahorian, S.¹ Jagharghi, A.²

115
- 0025463449
- The use of cone-shaped kernels for generalized time-frequency representations of nonstationary signals
- Y. Zhao, L. Atlas, and R. Marks, "The use of cone-shaped kernels for generalized time-frequency representations of nonstationary signals," IEEE Trans. Acoust. Speech Signal Process., vol. 38, no. 7, pp. 1084-1091, 1990.
- (1990) IEEE Trans. Acoust. Speech Signal Process. , vol.38 , Issue.7 , pp. 1084-1091
- Zhao, Y.¹ Atlas, L.² Marks, R.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.