메뉴 건너뛰기




Volumn 84, Issue 9, 1996, Pages 1199-1215

Time-Frequency Analysis and Auditory Modeling for Automatic Recognition of Speech

Author keywords

[No Author keywords available]

Indexed keywords

FREQUENCY DOMAIN ANALYSIS; PHYSIOLOGICAL MODELS; SPEECH CODING; SPEECH INTELLIGIBILITY; SPEECH PROCESSING; SPEECH TRANSMISSION; SPURIOUS SIGNAL NOISE; STATISTICAL METHODS; TIME DOMAIN ANALYSIS;

EID: 0030244273     PISSN: 00189219     EISSN: None     Source Type: Journal    
DOI: 10.1109/5.535241     Document Type: Article
Times cited : (52)

References (115)
  • 4
    • 0024592122 scopus 로고
    • Modeling the perception of concurrent vowels: Vowels with the same fundamental frequency
    • P. Assmann and Q. Summerfield, "Modeling the perception of concurrent vowels: vowels with the same fundamental frequency," J. Acoust. Soc. Amer., vol. 85, no. 1, pp. 327-338, 1989.
    • (1989) J. Acoust. Soc. Amer. , vol.85 , Issue.1 , pp. 327-338
    • Assmann, P.1    Summerfield, Q.2
  • 5
    • 0025003184 scopus 로고
    • Modeling the perception of concurrent vowels: Vowels with different fundamental frequencies
    • "Modeling the perception of concurrent vowels: vowels with different fundamental frequencies," J. Acoust. Soc. Amer., vol. 88, no. 2, pp. 680-697, 1990.
    • (1990) J. Acoust. Soc. Amer. , vol.88 , Issue.2 , pp. 680-697
  • 6
    • 0025682331 scopus 로고    scopus 로고
    • New nonstationary techniques for the analysis and display of speech transients
    • L. Atlas, W. Kooiman, P. Loughlin, and R. Cole, "New nonstationary techniques for the analysis and display of speech transients," in Proc. ICASSP'90, pp. 385-388.
    • Proc. ICASSP'90 , pp. 385-388
    • Atlas, L.1    Kooiman, W.2    Loughlin, P.3    Cole, R.4
  • 7
    • 0026388701 scopus 로고    scopus 로고
    • Truly nonstationary techniques for the analysis and display of voiced speech
    • L. Atlas, P. Loughlin, and J. Pitton, 'Truly nonstationary techniques for the analysis and display of voiced speech," in Proc. ICASSP'91, pp. 433-436.
    • Proc. ICASSP'91 , pp. 433-436
    • Atlas, L.1    Loughlin, P.2    Pitton, J.3
  • 8
    • 0027373113 scopus 로고
    • Objective analysis versus subjective assessment of vowels pronounced by native, nonnative and deaf male speakers of Dutch
    • B. Bakkum, R. Plomp, and L. Pols, "Objective analysis versus subjective assessment of vowels pronounced by native, nonnative and deaf male speakers of Dutch," J. Acoust. Soc. Amer., vol. 94, no. 10, pp. 1989-2004, 1993.
    • (1993) J. Acoust. Soc. Amer. , vol.94 , Issue.10 , pp. 1989-2004
    • Bakkum, B.1    Plomp, R.2    Pols, L.3
  • 9
    • 0027577041 scopus 로고
    • A signal-dependent time-frequency representation: Optimal kernel design
    • R. Baraniuk and D. Jones, "A signal-dependent time-frequency representation: optimal kernel design," IEEE Trans. Signal Process., vol. 41, no. 4, pp. 1589-1602, 1993.
    • (1993) IEEE Trans. Signal Process. , vol.41 , Issue.4 , pp. 1589-1602
    • Baraniuk, R.1    Jones, D.2
  • 10
    • 0026185556 scopus 로고
    • Zero-crossing rates of functions of gaussian processes
    • J. Barnett and B. Kedem, "Zero-crossing rates of functions of gaussian processes," IEEE Trans. Inform. Theory, vol. 37, pp. 1188-1194, Apr. 1991.
    • (1991) IEEE Trans. Inform. Theory , vol.37 , pp. 1188-1194
    • Barnett, J.1    Kedem, B.2
  • 12
    • 0018664543 scopus 로고
    • Acoustic invariance in speech production: Evidence from measurements of the spectral characteristics of stop consonants
    • S. Blumstein and K. Stevens, "Acoustic invariance in speech production: evidence from measurements of the spectral characteristics of stop consonants," J. Acoust. Soc. Amer., vol. 66, no. 4, pp. 1001-1017, 1979.
    • (1979) J. Acoust. Soc. Amer. , vol.66 , Issue.4 , pp. 1001-1017
    • Blumstein, S.1    Stevens, K.2
  • 15
    • 0027528776 scopus 로고
    • A model for the responses of low-frequency auditory nerve fibers in cats
    • L. Camey, "A model for the responses of low-frequency auditory nerve fibers in cats," J. Acoust. Soc. Amer., vol. 93, no. 1, pp. 401-4117, 1993.
    • (1993) J. Acoust. Soc. Amer. , vol.93 , Issue.1 , pp. 401-4117
    • Camey, L.1
  • 16
    • 0023294678 scopus 로고
    • A nonstationary model for the analysis of transient speech signals
    • F. Casacuberta and E. Vidai, "A nonstationary model for the analysis of transient speech signals," IEEE Trans. Acoust. Speech Signal Process., vol. 35, pp. 226-228, Feb. 1987.
    • (1987) IEEE Trans. Acoust. Speech Signal Process. , vol.35 , pp. 226-228
    • Casacuberta, F.1    Vidai, E.2
  • 17
    • 0026372224 scopus 로고    scopus 로고
    • Combined multi-resolution (wideband/narrowband) spectrogram
    • S. Cheung and J. Lim, "Combined multi-resolution (wideband/narrowband) spectrogram," in IEEE Proc. ICASSP'91, pp. 457-460.
    • IEEE Proc. ICASSP'91 , pp. 457-460
    • Cheung, S.1    Lim, J.2
  • 18
    • 0024681555 scopus 로고
    • Improved time-frequency representation of multi-component signals using exponential kernels
    • H. Choi and W. Williams, "Improved time-frequency representation of multi-component signals using exponential kernels," IEEE Trans. Acoust. Speech Signal Process., vol. 37, pp. 862-871, June 1989.
    • (1989) IEEE Trans. Acoust. Speech Signal Process. , vol.37 , pp. 862-871
    • Choi, H.1    Williams, W.2
  • 19
    • 0141582037 scopus 로고
    • Generalized phase-space distribution functions
    • L. Cohen, "Generalized phase-space distribution functions," J. Math. Phys., vol. 7, no. 5, pp. 781-786, 1966.
    • (1966) J. Math. Phys. , vol.7 , Issue.5 , pp. 781-786
    • Cohen, L.1
  • 20
    • 0003733873 scopus 로고
    • New York: Prentice-Hall
    • Time-Frequency Analysis. New York: Prentice-Hall, 1995.
    • (1995) Time-Frequency Analysis
  • 21
    • 84957503277 scopus 로고
    • Instantaneous frequency, its standard deviation and multicomponent signals
    • L. Cohen and C. Lee, "Instantaneous frequency, its standard deviation and multicomponent signals," SPIE Advanced Algs. Archs. Sig. Proc. Ill, vol. 975, pp. 186-208, 1988.
    • (1988) SPIE Advanced Algs. Archs. Sig. Proc. Ill , vol.975 , pp. 186-208
    • Cohen, L.1    Lee, C.2
  • 23
    • 0019053271 scopus 로고
    • Comparison of parametric representation for monosyllable word recognition in continuously spoken sentences
    • S. Davis and P. Mermelstein, "Comparison of parametric representation for monosyllable word recognition in continuously spoken sentences," IEEE Trans. Acoust. Speech Signal Process., vol. ASSP-28, pp. 357-366, Apr. 1980.
    • (1980) IEEE Trans. Acoust. Speech Signal Process. , vol.28 , pp. 357-366
    • Davis, S.1    Mermelstein, P.2
  • 24
    • 84953656135 scopus 로고
    • Acoustic loci and transitional cues for consonants
    • P. Delattre, A. Liberman, and F. Cooper, "Acoustic loci and transitional cues for consonants," J. Acoust. Soc. Amer., vol. 27, no. 4, pp. 769-773, 1955.
    • (1955) J. Acoust. Soc. Amer. , vol.27 , Issue.4 , pp. 769-773
    • Delattre, P.1    Liberman, A.2    Cooper, F.3
  • 25
    • 0026854213 scopus 로고
    • A generalized hidden Markov model with stateconditioned trend functions of time for the speech signal
    • L. Deng, "A generalized hidden Markov model with stateconditioned trend functions of time for the speech signal," Signal Process., vol. 27, pp. 65-78, 1992.
    • (1992) Signal Process. , vol.27 , pp. 65-78
    • Deng, L.1
  • 26
    • 0028516022 scopus 로고
    • Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states
    • L. Deng, M. Aksmanovic, X. Sun, and C. Wu, "Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 507-520, 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 507-520
    • Deng, L.1    Aksmanovic, M.2    Sun, X.3    Wu, C.4
  • 27
    • 84928839596 scopus 로고
    • A composite model of the auditory periphery for the processing of speech
    • L. Deng, C. Geisler, and S. Greenberg, "A composite model of the auditory periphery for the processing of speech," J. Phonetics, vol. 16, no. 1, p. 93, 1988.
    • (1988) J. Phonetics , vol.16 , Issue.1 , pp. 93
    • Deng, L.1    Geisler, C.2    Greenberg, S.3
  • 28
    • 0027681974 scopus 로고
    • ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition
    • V. Digilakis, J. Rohlicek, and M. Ostendorf, "ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition," IEEE Trans. Speech. Audio Process., vol. 1, no. 4, pp. 431-442, 1993.
    • (1993) IEEE Trans. Speech. Audio Process. , vol.1 , Issue.4 , pp. 431-442
    • Digilakis, V.1    Rohlicek, J.2    Ostendorf, M.3
  • 29
    • 33646922512 scopus 로고    scopus 로고
    • Wigner distribution analysis of stop consonant release transients: Labials, velars, and labio-velars
    • G. Dogil and W. Wokurek, "Wigner distribution analysis of stop consonant release transients: labials, velars, and labio-velars," in Proc. Int. Conf. Speech Res. '89, Budapest, pp. 1-4.
    • Proc. Int. Conf. Speech Res. '89, Budapest , pp. 1-4
    • Dogil, G.1    Wokurek, W.2
  • 30
    • 33646936237 scopus 로고    scopus 로고
    • Wigner time-frequency analysis for major places of articulation in stop consonants
    • "Wigner time-frequency analysis for major places of articulation in stop consonants," in Proc. 12th Int. Cong. Phon. Sci., 1991, vol. 3, pp. 390-393.
    • Proc. 12th Int. Cong. Phon. Sci., 1991 , vol.3 , pp. 390-393
  • 31
    • 0023904763 scopus 로고
    • Frequency importance function for a feature recognition test material
    • V. Duggirala, G. Studebaker, C. Pavlovic, and R. Sherbecoe, "Frequency importance function for a feature recognition test material," J. Acoust. Soc. Amer., vol. 83, no. 6, pp. 2372-2382, 1988.
    • (1988) J. Acoust. Soc. Amer. , vol.83 , Issue.6 , pp. 2372-2382
    • Duggirala, V.1    Studebaker, G.2    Pavlovic, C.3    Sherbecoe, R.4
  • 32
    • 0038676741 scopus 로고
    • Methods of measuring vowel formant bandwidths
    • H. Dünn, "Methods of measuring vowel formant bandwidths," J. Acoust. Soc. Amer., vol. 33, no. 12, pp. 1737-1746, 1961.
    • (1961) J. Acoust. Soc. Amer. , vol.33 , Issue.12 , pp. 1737-1746
    • Dünn, H.1
  • 33
    • 0021881648 scopus 로고
    • Peripheral auditory adaptation and fatigue
    • J. Eggermont, "Peripheral auditory adaptation and fatigue," Hearing Res., vol. 18, pp. 57-71, 1985.
    • (1985) Hearing Res. , vol.18 , pp. 57-71
    • Eggermont, J.1
  • 36
    • 0022667694 scopus 로고
    • Speaker-independent isolated word recognition using dynamic features of speech spectrum
    • S. Furui, "Speaker-independent isolated word recognition using dynamic features of speech spectrum," IEEE Trans. Acoust. Speech Signal Process., vol. ASSP-34, pp. 52-59, Jan. 1986.
    • (1986) IEEE Trans. Acoust. Speech Signal Process. , vol.34 , pp. 52-59
    • Furui, S.1
  • 37
    • 0022548705 scopus 로고
    • On the role of spectral transition for speech perception
    • "On the role of spectral transition for speech perception," J. Acoust. Soc. Amer., vol. 80, no. 4, pp. 1016-1025, 1986.
    • (1986) J. Acoust. Soc. Amer. , vol.80 , Issue.4 , pp. 1016-1025
  • 38
    • 33646934291 scopus 로고
    • Invariant acoustic cues in stop consonants: A cross-language study using the Wigner distribution
    • H. Garudadri, J, Gilbert, A. Benguerel, and M. Beddoes, "Invariant acoustic cues in stop consonants: a cross-language study using the Wigner distribution," J. Acoust. Soc. Amer., vol. 82, no. S55, 1987.
    • (1987) J. Acoust. Soc. Amer. , vol.82 , Issue.S55
    • Garudadri, H.1    Gilbert, J.2    Benguerel, A.3    Beddoes, M.4
  • 39
    • 84991416125 scopus 로고
    • Auditory nerve representation as a front end for speech recognition in a noisy environment
    • O. Ghitza, "Auditory nerve representation as a front end for speech recognition in a noisy environment," Computer Speech and Lang., vol. 1, pp. 109-130, 1986.
    • (1986) Computer Speech and Lang. , vol.1 , pp. 109-130
    • Ghitza, O.1
  • 40
    • 0027578207 scopus 로고
    • Hidden Markov models with templates as nonstationary states: An application to speech recognition
    • O. Ghitza and M. M. Sondhi, "Hidden Markov models with templates as nonstationary states: An application to speech recognition," Computer Speech and Lang., vol. 7, no. 2, pp. 101-119, 1993.
    • (1993) Computer Speech and Lang. , vol.7 , Issue.2 , pp. 101-119
    • Ghitza, O.1    Sondhi, M.M.2
  • 42
    • 0020798029 scopus 로고
    • Time-dependent ARMA modeling of nonstationary signals
    • Y. Grenier, "Time-dependent ARMA modeling of nonstationary signals," IEEE Trans. Acoust. Speech Signal Process., vol. 31, pp. 899-911, Apr. 1983.
    • (1983) IEEE Trans. Acoust. Speech Signal Process. , vol.31 , pp. 899-911
    • Grenier, Y.1
  • 43
    • 0028206226 scopus 로고
    • The contribution of the murmur and vowel to the place of articulation distinction in nasal consonants
    • J. Harrington, "The contribution of the murmur and vowel to the place of articulation distinction in nasal consonants," J. Acoust. Soc. Amer., vol. 96, no. 1, pp. 19-32, 1994.
    • (1994) J. Acoust. Soc. Amer. , vol.96 , Issue.1 , pp. 19-32
    • Harrington, J.1
  • 44
    • 0027491495 scopus 로고
    • Effect of relative amplitude of frication on perception of place of articulations
    • M. Hedrick and R. Ohde, "Effect of relative amplitude of frication on perception of place of articulations," J. Acoust. Soc. Amer., vol. 94, no. 4, pp. 2005-2026, 1993.
    • (1993) J. Acoust. Soc. Amer. , vol.94 , Issue.4 , pp. 2005-2026
    • Hedrick, M.1    Ohde, R.2
  • 45
    • 0025041264 scopus 로고
    • Perceptual linear predictive (PLP) analysis for speech
    • H. Hermansky, "Perceptual linear predictive (PLP) analysis for speech," J. Acoust. Soc. Amer., vol. 87, no. 4, pp. 1738-1752, 1990.
    • (1990) J. Acoust. Soc. Amer. , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 47
    • 0010423564 scopus 로고
    • Diphthong formants and their movements
    • A. Holbrook and G. Fairbanks, "Diphthong formants and their movements," J. Speech Hearing Res., vol. 5, no. 1, pp. 38-58, 1962.
    • (1962) J. Speech Hearing Res. , vol.5 , Issue.1 , pp. 38-58
    • Holbrook, A.1    Fairbanks, G.2
  • 48
    • 0028996918 scopus 로고    scopus 로고
    • Measuring fine structure in speech: Application to speaker identification
    • C. Jankowski, T. Quatieri, and D. Reynolds, "Measuring fine structure in speech: application to speaker identification," in Proc. ICASSP'95, pp. 325-328.
    • Proc. ICASSP'95 , pp. 325-328
    • Jankowski, C.1    Quatieri, T.2    Reynolds, D.3
  • 49
    • 0027684995 scopus 로고
    • Signal compression based on models of human perception
    • N. Jayant, J. Johnston, and R. Safranek, "Signal compression based on models of human perception," Proc. IEEE, vol. 81, pp. 1385-1422, Oct. 1993.
    • (1993) Proc. IEEE , vol.81 , pp. 1385-1422
    • Jayant, N.1    Johnston, J.2    Safranek, R.3
  • 50
    • 0028040175 scopus 로고
    • Vowel identification in mixed-speaker silent-center syllables
    • J. Jenkins, W. Strange, and S. Miranda, "Vowel identification in mixed-speaker silent-center syllables," J. Aconst. Soc. Amer., vol. 9, no. 2, pp. 1030-1041, 1994.
    • (1994) J. Aconst. Soc. Amer. , vol.9 , Issue.2 , pp. 1030-1041
    • Jenkins, J.1    Strange, W.2    Miranda, S.3
  • 51
    • 0022097649 scopus 로고
    • Maximum likelihood estimation for mixture multivariate stochastic observations of Markov chains
    • B. H. Juang, "Maximum likelihood estimation for mixture multivariate stochastic observations of Markov chains," AT&T Tech. J., vol. 64, no. 6, pp. 1235-1249, 1985.
    • (1985) AT&T Tech. J. , vol.64 , Issue.6 , pp. 1235-1249
    • Juang, B.H.1
  • 52
    • 33646944964 scopus 로고    scopus 로고
    • Hierarchical AR model for time varying speech signals
    • O. Kakusho and M. Yanagida, "Hierarchical AR model for time varying speech signals," in Proc. ICASSP'82, pp. 1295-1298.
    • Proc. ICASSP'82 , pp. 1295-1298
    • Kakusho, O.1    Yanagida, M.2
  • 53
    • 0022806994 scopus 로고
    • Spectral analysis and discrimination by zerocrossings
    • B. Kedem, "Spectral analysis and discrimination by zerocrossings," Proc. IEEE, vol. PROC-74, pp. 1477-1493, Nov. 1986.
    • (1986) Proc. IEEE , vol.74 , pp. 1477-1493
    • Kedem, B.1
  • 54
    • 0020642039 scopus 로고
    • Time-varying features as correlates of place of articulation in stop consonants
    • D. Kewley-Port, 'Time-varying features as correlates of place of articulation in stop consonants," J. Aconst. Soc. Amer., vol. 73, no. 1, pp. 322-335, 1983.
    • (1983) J. Aconst. Soc. Amer. , vol.73 , Issue.1 , pp. 322-335
    • Kewley-Port, D.1
  • 55
  • 56
    • 0001463644 scopus 로고
    • A duplex theory of pitch perception
    • J. Licklider, "A duplex theory of pitch perception," Experienlia, vol. 7, pp. 128-133, 1951.
    • (1951) Experienlia , vol.7 , pp. 128-133
    • Licklider, J.1
  • 57
    • 84953653173 scopus 로고
    • Perturbations in vocal pitch
    • P. Lieberman, "Perturbations in vocal pitch," J. Aconst. Soc. Amer., vol. 33, no. 5, pp. 597-603, 1961.
    • (1961) J. Aconst. Soc. Amer. , vol.33 , Issue.5 , pp. 597-603
    • Lieberman, P.1
  • 58
    • 0016735638 scopus 로고
    • Linear estimation of nonstationary signals
    • L. Liporace, "Linear estimation of nonstationary signals," J. Acoust. Soc. Amer., vol. 58, no. 6, pp. 1288-1295, 1975.
    • (1975) J. Acoust. Soc. Amer. , vol.58 , Issue.6 , pp. 1288-1295
    • Liporace, L.1
  • 60
    • 0027542642 scopus 로고
    • Bilinear time-frequency representations: New insights and properties
    • "Bilinear time-frequency representations: new insights and properties," IEEE Trans. Signal Process., vol. 41, pp. 750-767, Feb. 1993.
    • (1993) IEEE Trans. Signal Process. , vol.41 , pp. 750-767
  • 61
    • 0028517015 scopus 로고
    • Construction of positive time-frequency distributions
    • "Construction of positive time-frequency distributions," IEEE Trans. Signal Process., vol. 42, pp. 2697-2705, Oct. 1994.
    • (1994) IEEE Trans. Signal Process. , vol.42 , pp. 2697-2705
  • 62
    • 0028739812 scopus 로고
    • Approximating time-frequency density functions via optimal combinations of spectrograms
    • P. Loughlin, J. Pitton, and B. Hannaford, "Approximating time-frequency density functions via optimal combinations of spectrograms," IEEE Signal Process. Lett., vol. 1, pp. 199-202, Dec. 1994.
    • (1994) IEEE Signal Process. Lett. , vol.1 , pp. 199-202
    • Loughlin, P.1    Pitton, J.2    Hannaford, B.3
  • 63
    • 0021204483 scopus 로고
    • Computational models of neural auditory processing
    • R. Lyon, "Computational models of neural auditory processing," Proc. ICASSP'84, 1984.
    • (1984) Proc. ICASSP'84
    • Lyon, R.1
  • 64
    • 0027676955 scopus 로고
    • Energy separation in signal modulations with application to speech analysis
    • P. Maragos, J. Kaiser, and T. Quatieri, "Energy separation in signal modulations with application to speech analysis," IEEE Trans. Signal Process., vol. 41, pp. 3024-3051, Oct. 1993.
    • (1993) IEEE Trans. Signal Process. , vol.41 , pp. 3024-3051
    • Maragos, P.1    Kaiser, J.2    Quatieri, T.3
  • 66
    • 0026654967 scopus 로고
    • Modeling the identification of concurrent vowels with different fundamental frequencies
    • R. Meddis and M. Hewitt, "Modeling the identification of concurrent vowels with different fundamental frequencies," J. Aconst. Soc. Amer., vol. 91, no. 1, pp. 233-245, 1992.
    • (1992) J. Aconst. Soc. Amer. , vol.91 , Issue.1 , pp. 233-245
    • Meddis, R.1    Hewitt, M.2
  • 67
    • 0027409390 scopus 로고
    • Voice source model for continuous control of pitch period
    • P. Milenkovic, "Voice source model for continuous control of pitch period," J. Aconst. Soc. Amer., vol. 93, no. 2, pp. 1087-1096, 1993.
    • (1993) J. Aconst. Soc. Amer. , vol.93 , Issue.2 , pp. 1087-1096
    • Milenkovic, P.1
  • 71
    • 0028996926 scopus 로고    scopus 로고
    • Stochastic perceptual models of speech, Proc
    • N. Morgan et al., "Stochastic perceptual models of speech," Proc. ICASSP'95, vol. 1, pp. 397-400.
    • ICASSP'95 , vol.1 , pp. 397-400
    • Morgan, N.1
  • 72
    • 33646946122 scopus 로고
    • Filtering the time sequence of spectral parameters for speaker-independent CDHMM word recognition
    • C. Nadeau, P. Paches-Leal, and B. H. Juang, "Filtering the time sequence of spectral parameters for speaker-independent CDHMM word recognition," Eurospeech95, Sept. 1995.
    • (1995) Eurospeech95, Sept.
    • Nadeau, C.1    Paches-Leal, P.2    Juang, B.H.3
  • 73
    • 33749761107 scopus 로고    scopus 로고
    • Speech enhancement based on a new set of auditory constrained parameters
    • S. Nandkumar and J. Hansen, "Speech enhancement based on a new set of auditory constrained parameters," in Proc. ICASSP'94, pp. 1-4.
    • Proc. ICASSP'94 , pp. 1-4
    • Nandkumar, S.1    Hansen, J.2
  • 74
    • 0026142442 scopus 로고
    • A time-varying analysis method for rapid transitions in speech
    • K. Nathan, Y. Lee, and H. Silverman, "A time-varying analysis method for rapid transitions in speech," IEEE Trans. Signal Process., vol. 39, pp. 815-824, Apr. 1991.
    • (1991) IEEE Trans. Signal Process. , vol.39 , pp. 815-824
    • Nathan, K.1    Lee, Y.2    Silverman, H.3
  • 75
    • 0028460992 scopus 로고
    • Time-varying feature selection and classification of unvoiced stop consonants
    • K. Nathan and H. Silverman, "Time-varying feature selection and classification of unvoiced stop consonants," IEEE Trans. Speech Audio Process., vol. 2, no. 3, pp. 395-405, 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.3 , pp. 395-405
    • Nathan, K.1    Silverman, H.2
  • 76
    • 0028710576 scopus 로고
    • Neuromorphic speech processing for noisy environments
    • Orlando, FL, June
    • C. Neti, "Neuromorphic speech processing for noisy environments," in Proc. ICNN-94, Orlando, FL, June 1994, pp. 4425-4430.
    • (1994) Proc. ICNN-94 , pp. 4425-4430
    • Neti, C.1
  • 79
    • 84941328385 scopus 로고
    • Control methods used in a study of vowels
    • G. Peterson and H. Barney, "Control methods used in a study of vowels," J. Acoust. Soc. Amer., vol. 24, no. 2, pp. 175-184, 1952.
    • (1952) J. Acoust. Soc. Amer. , vol.24 , Issue.2 , pp. 175-184
    • Peterson, G.1    Barney, H.2
  • 81
    • 0028516834 scopus 로고
    • Applications of positive time-frequency distributions to speech processing
    • J. Pitton, L. Atlas, and P. Loughlin, "Applications of positive time-frequency distributions to speech processing," IEEE Trans. Speech Audio. Process, vol. 2, no. 4, pp. 554-566, 1994.
    • (1994) IEEE Trans. Speech Audio. Process , vol.2 , Issue.4 , pp. 554-566
    • Pitton, J.1    Atlas, L.2    Loughlin, P.3
  • 82
    • 0026078506 scopus 로고
    • A computational model of afferent neural activity from the cochlea to the dorsal acoustic stria
    • M. Pont and R. Damper, "A computational model of afferent neural activity from the cochlea to the dorsal acoustic stria," J. Acoust. Soc. Amer., vol. 89, no. 3, pp. 1213-1228, 1991.
    • (1991) J. Acoust. Soc. Amer. , vol.89 , Issue.3 , pp. 1213-1228
    • Pont, M.1    Damper, R.2
  • 85
    • 0028996925 scopus 로고    scopus 로고
    • Robust utterance verification for connected digits recognition
    • M. Rahim, C. H. Lee, and B. H. Juang, "Robust utterance verification for connected digits recognition," in Proc. ICASSP'95, vol. 1, pp. 285-288.
    • Proc. ICASSP'95 , vol.1 , pp. 285-288
    • Rahim, M.1    Lee, C.H.2    Juang, B.H.3
  • 86
    • 0024589234 scopus 로고
    • Acoustic properties and perception of consonant release transients
    • B. Repp and H. Lin, "Acoustic properties and perception of consonant release transients," J. Acoust. Soc. Amer., vol. 85, no. 1, pp. 379-396, 1989.
    • (1989) J. Acoust. Soc. Amer. , vol.85 , Issue.1 , pp. 379-396
    • Repp, B.1    Lin, H.2
  • 87
    • 0028912840 scopus 로고
    • Auditory-nerve encoding of pinna-based spectral cues: Rate representation of high-frequency stimuli
    • J. Rice, E. Young, and G. Spirou, "Auditory-nerve encoding of pinna-based spectral cues: rate representation of high-frequency stimuli," J. Acoust. Soc. Amer., vol. 97, no. 3, pp. 1764-1776, 1995.
    • (1995) J. Acoust. Soc. Amer. , vol.97 , Issue.3 , pp. 1764-1776
    • Rice, J.1    Young, E.2    Spirou, G.3
  • 90
    • 0020579183 scopus 로고
    • Auditory nerve representation of vowels in background noise
    • M. Sachs, H. Voigt, and E. Young, "Auditory nerve representation of vowels in background noise," J. Neurophys., vol. 50, pp. 27-45, 1983.
    • (1983) J. Neurophys. , vol.50 , pp. 27-45
    • Sachs, M.1    Voigt, H.2    Young, E.3
  • 91
    • 0018617277 scopus 로고
    • Encoding of steady-state vowels in the auditory nerve: Representation in terms of discharge rate
    • M. Sachs and E. Young, "Encoding of steady-state vowels in the auditory nerve: representation in terms of discharge rate," J. Acoust. Soc. Amer., vol. 66, no. 1, pp. 470-479, 1979.
    • (1979) J. Acoust. Soc. Amer. , vol.66 , Issue.1 , pp. 470-479
    • Sachs, M.1    Young, E.2
  • 92
    • 0029239090 scopus 로고    scopus 로고
    • A comparative study of mel cepstra and EIH for phone classification under adverse conditions
    • S. Sandhu and O. Ghitza, "A comparative study of mel cepstra and EIH for phone classification under adverse conditions," in Proc. ICASSP'95, vol. 1, pp. 409-412.
    • Proc. ICASSP'95 , vol.1 , pp. 409-412
    • Sandhu, S.1    Ghitza, O.2
  • 93
    • 84928837806 scopus 로고
    • A joint synchrony/mean-rate model of auditory processing
    • S. Seneff, "A joint synchrony/mean-rate model of auditory processing," J. Phonetics, vol. 85, no. 1, pp. 55-76, 1988.
    • (1988) J. Phonetics , vol.85 , Issue.1 , pp. 55-76
    • Seneff, S.1
  • 94
    • 0022348981 scopus 로고
    • Speech processing in the auditory system II: Lateral inhibition and the central processing of speech evoked activity in the auditory nerve
    • S. Shamma, "Speech processing in the auditory system II: lateral inhibition and the central processing of speech evoked activity in the auditory nerve," J. Acoust. Soc. Amer., vol. 78, no. 5, pp. 1622-1632, 1985.
    • (1985) J. Acoust. Soc. Amer. , vol.78 , Issue.5 , pp. 1622-1632
    • Shamma, S.1
  • 95
    • 84928841878 scopus 로고
    • The acoustic features of speech sounds in a model of auditory processing: Vowels and voiceless fricatives
    • "The acoustic features of speech sounds in a model of auditory processing: vowels and voiceless fricatives," J. Phonetics, vol. 16, pp. 77-91, 1988.
    • (1988) J. Phonetics , vol.16 , pp. 77-91
  • 96
    • 0020707077 scopus 로고
    • Responses of auditory-nerve fibers to consonant-vowel syllables
    • D. Sinex and C. Geisler, "Responses of auditory-nerve fibers to consonant-vowel syllables," J. Acoust. Soc. Amer., vol. 73, no. 2, pp. 602-615, 1983.
    • (1983) J. Acoust. Soc. Amer. , vol.73 , Issue.2 , pp. 602-615
    • Sinex, D.1    Geisler, C.2
  • 97
    • 0021461483 scopus 로고
    • Comparison of the responses of auditory-nerve fibers to consonant-vowel syllables with predictions from linear models
    • "Comparison of the responses of auditory-nerve fibers to consonant-vowel syllables with predictions from linear models," J. Acoust. Soc. Amer., vol. 76, no. 1, pp. 116-121, 1984.
    • (1984) J. Acoust. Soc. Amer. , vol.76 , Issue.1 , pp. 116-121
  • 99
    • 0028657430 scopus 로고
    • Accuracy of quasistationary analysis of highly dynamic speech signals
    • R. Smits, "Accuracy of quasistationary analysis of highly dynamic speech signals," J. Acoitst. Soc. Amer., vol. 96, no. 6, pp. 3401-3415, 1994.
    • (1994) J. Acoitst. Soc. Amer. , vol.96 , Issue.6 , pp. 3401-3415
    • Smits, R.1
  • 100
    • 33646919328 scopus 로고    scopus 로고
    • personal communication
    • M. M. Sondhi, personal communication.
    • Sondhi, M.M.1
  • 101
    • 0018038036 scopus 로고
    • Invariant cues for place of articulation in stop consonants
    • K. Stevens and S. Blumstein, "Invariant cues for place of articulation in stop consonants," J. Acoust. Soc. Amer., vol. 64, no. 5, pp. 1358-1368, 1978.
    • (1978) J. Acoust. Soc. Amer. , vol.64 , Issue.5 , pp. 1358-1368
    • Stevens, K.1    Blumstein, S.2
  • 102
    • 0020816189 scopus 로고
    • Dynamic specification of coarticulated vowels
    • W. Strange, J. Jenkins, and T. Johnson, "Dynamic specification of coarticulated vowels," J. Acoust. Soc. Amer., vol. 74, no. 3, pp. 695-705, 1983.
    • (1983) J. Acoust. Soc. Amer. , vol.74 , Issue.3 , pp. 695-705
    • Strange, W.1    Jenkins, J.2    Johnson, T.3
  • 103
    • 0026030074 scopus 로고
    • Perception of concurrent vowels: Effects of harmonic misalignment and pitch-period asynchrony
    • Q. Summerfield and P. Assmann, "Perception of concurrent vowels: effects of harmonic misalignment and pitch-period asynchrony," J. Acoust. Soc. Amer., vol. 89, no. 3, pp. 1364-1377, 1991.
    • (1991) J. Acoust. Soc. Amer. , vol.89 , Issue.3 , pp. 1364-1377
    • Summerfield, Q.1    Assmann, P.2
  • 104
    • 0024929841 scopus 로고    scopus 로고
    • Transient analysis of speech signals using the Wigner time-frequency representation
    • E. Velez and R. Abshcr, 'Transient analysis of speech signals using the Wigner time-frequency representation," in Proc. ICASSP'89, pp. 2242-2245.
    • Proc. ICASSP'89 , pp. 2242-2245
    • Velez, E.1    Abshcr, R.2
  • 105
    • 0001843298 scopus 로고
    • Theorie et applications de la notion de signal analytique
    • J. Ville, "Theorie et applications de la notion de signal analytique," Cables et Transmissions, vol. 2A, no. 1, pp. 61-74, 1948;
    • (1948) Cables Et Transmissions , vol.2 A , Issue.1 , pp. 61-74
    • Ville, J.1
  • 106
    • 0346642106 scopus 로고
    • Theory and applications of the notion of complex signal
    • RAND Corp., Santa Monica, CA
    • I. Selin, transi., "Theory and applications of the notion of complex signal," Tech. Rep. T-92, RAND Corp., Santa Monica, CA, 1958.
    • (1958) Tech. Rep.
    • Selin Transi, I.1
  • 107
    • 0028997028 scopus 로고    scopus 로고
    • Speech enhancement based on masking properties of the auditory system
    • N. Virag, "Speech enhancement based on masking properties of the auditory system," in Proc. ICASSP'95, vol. 1, pp. 796-799.
    • Proc. ICASSP'95 , vol.1 , pp. 796-799
    • Virag, N.1
  • 108
    • 0028462212 scopus 로고
    • Self-normalization and noiserobustness in early auditory representations
    • K. Wang and S. Shamma, "Self-normalization and noiserobustness in early auditory representations," IEEE Trans. Speech Audio Process., vol. 2, pp. 421-435, 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , pp. 421-435
    • Wang, K.1    Shamma, S.2
  • 109
    • 33646919723 scopus 로고
    • A diffusion model of the transient response of the cochlear inner hair cell synapse
    • L. Westerman and R. Smith, "A diffusion model of the transient response of the cochlear inner hair cell synapse," J. Acoust. Soc. Amer., vol. 93, no. 1, pp. 401-417, 1993.
    • (1993) J. Acoust. Soc. Amer. , vol.93 , Issue.1 , pp. 401-417
    • Westerman, L.1    Smith, R.2
  • 110
    • 0021207990 scopus 로고
    • Rapid and short term adaptation in auditory nerve responses
    • "Rapid and short term adaptation in auditory nerve responses," Hearing Res., vol. 15, pp. 249-260, 1985.
    • (1985) Hearing Res. , vol.15 , pp. 249-260
  • 111
    • 33745014742 scopus 로고
    • On the quantum correction for thermodynamic equilibrium
    • E. Wigner, "On the quantum correction for thermodynamic equilibrium," Phys. Rev., vol. 40, pp. 749-759, 1932.
    • (1932) Phys. Rev. , vol.40 , pp. 749-759
    • Wigner, E.1
  • 112
    • 0018653975 scopus 로고
    • Least squares glottal inverse filtering from the acoustic speech waveform
    • D. Wong, J. Markel, and A. Gray, "Least squares glottal inverse filtering from the acoustic speech waveform," IEEE Trans. Acoust. Speech Signal Process., vol. 27, no. 4, pp. 350-355, 1979.
    • (1979) IEEE Trans. Acoust. Speech Signal Process. , vol.27 , Issue.4 , pp. 350-355
    • Wong, D.1    Markel, J.2    Gray, A.3
  • 113
    • 0018606571 scopus 로고
    • Representation of steady-state vowels in the temporal aspects of the discharge patterns of populations of auditory-nerve fibers, J
    • E. Young and M. Sachs, "Representation of steady-state vowels in the temporal aspects of the discharge patterns of populations of auditory-nerve fibers," J. Acoust. Soc. Amer., vol. 66, no. 3, pp. 1381-1403, 1979.
    • (1979) Acoust. Soc. Amer. , vol.66 , Issue.3 , pp. 1381-1403
    • Young, E.1    Sachs, M.2
  • 114
    • 0027368837 scopus 로고
    • Spectral-shape features versus formants as acoustic correlates for vowels
    • S. Zahorian and A. Jagharghi, "Spectral-shape features versus formants as acoustic correlates for vowels," J. Acoust. Soc. Amer., vol. 94, no. 4, pp. 1966-1982, 1993.
    • (1993) J. Acoust. Soc. Amer. , vol.94 , Issue.4 , pp. 1966-1982
    • Zahorian, S.1    Jagharghi, A.2
  • 115
    • 0025463449 scopus 로고
    • The use of cone-shaped kernels for generalized time-frequency representations of nonstationary signals
    • Y. Zhao, L. Atlas, and R. Marks, "The use of cone-shaped kernels for generalized time-frequency representations of nonstationary signals," IEEE Trans. Acoust. Speech Signal Process., vol. 38, no. 7, pp. 1084-1091, 1990.
    • (1990) IEEE Trans. Acoust. Speech Signal Process. , vol.38 , Issue.7 , pp. 1084-1091
    • Zhao, Y.1    Atlas, L.2    Marks, R.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.