-
2
-
-
33646912649
-
Perceptually-based dynamic spectrograms
-
T. Applebaum and B. Hanson, "Perceptually-based dynamic spectrograms," in Visual Representations of Speech Signals, M. Cooke and S. Beet, Eds. London: Wiley, 1993, pp. 153-160.
-
Visual Representations of Speech Signals, M. Cooke and S. Beet, Eds. London: Wiley, 1993
, pp. 153-160
-
-
Applebaum, T.1
Hanson, B.2
-
3
-
-
84942219496
-
Autocorrelogram models of the segregation of competing voices
-
P. Assmann and D. Paschall, "Autocorrelogram models of the segregation of competing voices," in Proc. 15th Winter ARO Meet., St. Petersburg, FL, Feb. 1992.
-
(1992)
Proc. 15th Winter ARO Meet., St. Petersburg, FL, Feb.
-
-
Assmann, P.1
Paschall, D.2
-
4
-
-
0024592122
-
Modeling the perception of concurrent vowels: Vowels with the same fundamental frequency
-
P. Assmann and Q. Summerfield, "Modeling the perception of concurrent vowels: vowels with the same fundamental frequency," J. Acoust. Soc. Amer., vol. 85, no. 1, pp. 327-338, 1989.
-
(1989)
J. Acoust. Soc. Amer.
, vol.85
, Issue.1
, pp. 327-338
-
-
Assmann, P.1
Summerfield, Q.2
-
5
-
-
0025003184
-
Modeling the perception of concurrent vowels: Vowels with different fundamental frequencies
-
"Modeling the perception of concurrent vowels: vowels with different fundamental frequencies," J. Acoust. Soc. Amer., vol. 88, no. 2, pp. 680-697, 1990.
-
(1990)
J. Acoust. Soc. Amer.
, vol.88
, Issue.2
, pp. 680-697
-
-
-
6
-
-
0025682331
-
New nonstationary techniques for the analysis and display of speech transients
-
L. Atlas, W. Kooiman, P. Loughlin, and R. Cole, "New nonstationary techniques for the analysis and display of speech transients," in Proc. ICASSP'90, pp. 385-388.
-
Proc. ICASSP'90
, pp. 385-388
-
-
Atlas, L.1
Kooiman, W.2
Loughlin, P.3
Cole, R.4
-
7
-
-
0026388701
-
Truly nonstationary techniques for the analysis and display of voiced speech
-
L. Atlas, P. Loughlin, and J. Pitton, "Truly nonstationary techniques for the analysis and display of voiced speech," in Proc. ICASSP'91, pp. 433-436.
-
Proc. ICASSP'91
, pp. 433-436
-
-
Atlas, L.1
Loughlin, P.2
Pitton, J.3
-
8
-
-
0027373113
-
Objective analysis versus subjective assessment of vowels pronounced by native, nonnative and deaf male speakers of Dutch
-
B. Bakkum, R. Plomp, and L. Pols, "Objective analysis versus subjective assessment of vowels pronounced by native, nonnative and deaf male speakers of Dutch," J. Acoust. Soc. Amer., vol. 94, no. 10, pp. 1989-2004, 1993.
-
(1993)
J. Acoust. Soc. Amer.
, vol.94
, Issue.10
, pp. 1989-2004
-
-
Bakkum, B.1
Plomp, R.2
Pols, L.3
-
9
-
-
0027577041
-
A signal-dependent time-frequency representation: Optimal kernel design
-
R. Baraniuk and D. Jones, "A signal-dependent time-frequency representation: optimal kernel design," IEEE Trans. Signal Process., vol. 41, no. 4, pp. 1589-1602, 1993.
-
(1993)
IEEE Trans. Signal Process.
, vol.41
, Issue.4
, pp. 1589-1602
-
-
Baraniuk, R.1
Jones, D.2
-
10
-
-
0026185556
-
Zero-crossing rates of functions of Gaussian processes
-
J. Barnett and B. Kedem, "Zero-crossing rates of functions of Gaussian processes," IEEE Trans. Inform. Theory, vol. 37, pp. 1188-1194, Apr. 1991.
-
(1991)
IEEE Trans. Inform. Theory
, vol.37
, pp. 1188-1194
-
-
Barnett, J.1
Kedem, B.2
-
11
-
-
33646940920
-
Optimal real-time signal processing in the nervous system
-
W. Bialek, "Optimal real-time signal processing in the nervous system," in Neural Systems: Analysis and Modeling, F. H. Eeckman, Ed. Amsterdam: Kluwer, 1993, pp. 5-28.
-
Neural Systems: Analysis and Modeling, F. H. Eeckman, Ed. Amsterdam: Kluwer, 1993
, pp. 5-28
-
-
Bialek, W.1
-
12
-
-
0018664543
-
Acoustic invariance in speech production: Evidence from measurements of the spectral characteristics of stop consonants
-
S. Blumstein and K. Stevens, "Acoustic invariance in speech production: evidence from measurements of the spectral characteristics of stop consonants," J. Acoust. Soc. Amer., vol. 66, no. 4, pp. 1001-1017, 1979.
-
(1979)
J. Acoust. Soc. Amer.
, vol.66
, Issue.4
, pp. 1001-1017
-
-
Blumstein, S.1
Stevens, K.2
-
14
-
-
84941442230
-
The auditory processing and recognition of speech
-
W. Byrne, J. Robinson, and S. Shamma, "The auditory processing and recognition of speech," in Proc. Speech and Natural Lang. Workshop, 1989, pp. 325-331.
-
Proc. Speech and Natural Lang. Workshop, 1989
, pp. 325-331
-
-
Byrne, W.1
Robinson, J.2
Shamma, S.3
-
15
-
-
0027528776
-
A model for the responses of low-frequency auditory nerve fibers in cats
-
L. Carney, "A model for the responses of low-frequency auditory nerve fibers in cats," J. Acoust. Soc. Amer., vol. 93, no. 1, pp. 401-417, 1993.
-
(1993)
J. Acoust. Soc. Amer.
, vol.93
, Issue.1
, pp. 401-417
-
-
Carney, L.1
-
16
-
-
0023294678
-
A nonstationary model for the analysis of transient speech signals
-
F. Casacuberta and E. Vidal, "A nonstationary model for the analysis of transient speech signals," IEEE Trans. Acoust. Speech Signal Process., vol. 35, pp. 226-228, Feb. 1987.
-
(1987)
IEEE Trans. Acoust. Speech Signal Process.
, vol.35
, pp. 226-228
-
-
Casacuberta, F.1
Vidal, E.2
-
17
-
-
0026372224
-
Combined multi-resolution (wideband/narrowband) spectrogram
-
S. Cheung and J. Lim, "Combined multi-resolution (wideband/narrowband) spectrogram," in IEEE Proc. ICASSP'91, pp. 457-460.
-
IEEE Proc. ICASSP'91
, pp. 457-460
-
-
Cheung, S.1
Lim, J.2
-
18
-
-
0024681555
-
Improved time-frequency representation of multi-component signals using exponential kernels
-
H. Choi and W. Williams, "Improved time-frequency representation of multi-component signals using exponential kernels," IEEE Trans. Acoust. Speech Signal Process., vol. 37, pp. 862-871, June 1989.
-
(1989)
IEEE Trans. Acoust. Speech Signal Process.
, vol.37
, pp. 862-871
-
-
Choi, H.1
Williams, W.2
-
19
-
-
0141582037
-
Generalized phase-space distribution functions
-
L. Cohen, "Generalized phase-space distribution functions," J. Math. Phys., vol. 7, no. 5, pp. 781-786, 1966.
-
(1966)
J. Math. Phys.
, vol.7
, Issue.5
, pp. 781-786
-
-
Cohen, L.1
-
20
-
-
0003733873
-
-
New York: Prentice-Hall
-
Time-Frequency Analysis. New York: Prentice-Hall, 1995.
-
(1995)
Time-Frequency Analysis
-
-
-
21
-
-
84957503277
-
Instantaneous frequency, its standard deviation and multicomponent signals
-
L. Cohen and C. Lee, "Instantaneous frequency, its standard deviation and multicomponent signals," SPIE Advanced Algs. Archs. Sig. Proc. III, vol. 975, pp. 186-208, 1988.
-
(1988)
SPIE Advanced Algs. Archs. Sig. Proc. III
, vol.975
, pp. 186-208
-
-
Cohen, L.1
Lee, C.2
-
23
-
-
0019053271
-
Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
-
S. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust. Speech Signal Process., vol. ASSP-28, pp. 357-366, Apr. 1980.
-
(1980)
IEEE Trans. Acoust. Speech Signal Process.
, vol.28
, pp. 357-366
-
-
Davis, S.1
Mermelstein, P.2
-
24
-
-
84953656135
-
Acoustic loci and transitional cues for consonants
-
P. Delattre, A. Liberman, and F. Cooper, "Acoustic loci and transitional cues for consonants," J. Acoust. Soc. Amer., vol. 27, no. 4, pp. 769-773, 1955.
-
(1955)
J. Acoust. Soc. Amer.
, vol.27
, Issue.4
, pp. 769-773
-
-
Delattre, P.1
Liberman, A.2
Cooper, F.3
-
25
-
-
0026854213
-
A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal
-
L. Deng, "A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal," Signal Process., vol. 27, pp. 65-78, 1992.
-
(1992)
Signal Process.
, vol.27
, pp. 65-78
-
-
Deng, L.1
-
26
-
-
0028516022
-
Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states
-
L. Deng, M. Aksmanovic, X. Sun, and C. Wu, "Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 507-520, 1994.
-
(1994)
IEEE Trans. Speech Audio Process.
, vol.2
, Issue.4
, pp. 507-520
-
-
Deng, L.1
Aksmanovic, M.2
Sun, X.3
Wu, C.4
-
27
-
-
84928839596
-
A composite model of the auditory periphery for the processing of speech
-
L. Deng, C. Geisler, and S. Greenberg, "A composite model of the auditory periphery for the processing of speech," J. Phonetics, vol. 16, no. 1, p. 93, 1988.
-
(1988)
J. Phonetics
, vol.16
, Issue.1
, pp. 93
-
-
Deng, L.1
Geisler, C.2
Greenberg, S.3
-
28
-
-
0027681974
-
ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition
-
V. Digalakis, J. Rohlicek, and M. Ostendorf, "ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition," IEEE Trans. Speech Audio Process., vol. 1, no. 4, pp. 431-442, 1993.
-
(1993)
IEEE Trans. Speech Audio Process.
, vol.1
, Issue.4
, pp. 431-442
-
-
Digalakis, V.1
Rohlicek, J.2
Ostendorf, M.3
-
29
-
-
33646922512
-
Wigner distribution analysis of stop consonant release transients: Labials, velars, and labio-velars
-
G. Dogil and W. Wokurek, "Wigner distribution analysis of stop consonant release transients: labials, velars, and labio-velars," in Proc. Int. Conf. Speech Res. '89, Budapest, pp. 1-4.
-
Proc. Int. Conf. Speech Res. '89, Budapest
, pp. 1-4
-
-
Dogil, G.1
Wokurek, W.2
-
30
-
-
33646936237
-
Wigner time-frequency analysis for major places of articulation in stop consonants
-
"Wigner time-frequency analysis for major places of articulation in stop consonants," in Proc. 12th Int. Cong. Phon. Sci., 1991, vol. 3, pp. 390-393.
-
Proc. 12th Int. Cong. Phon. Sci., 1991
, vol.3
, pp. 390-393
-
-
-
31
-
-
0023904763
-
Frequency importance function for a feature recognition test material
-
V. Duggirala, G. Studebaker, C. Pavlovic, and R. Sherbecoe, "Frequency importance function for a feature recognition test material," J. Acoust. Soc. Amer., vol. 83, no. 6, pp. 2372-2382, 1988.
-
(1988)
J. Acoust. Soc. Amer.
, vol.83
, Issue.6
, pp. 2372-2382
-
-
Duggirala, V.1
Studebaker, G.2
Pavlovic, C.3
Sherbecoe, R.4
-
32
-
-
0038676741
-
Methods of measuring vowel formant bandwidths
-
H. Dunn, "Methods of measuring vowel formant bandwidths," J. Acoust. Soc. Amer., vol. 33, no. 12, pp. 1737-1746, 1961.
-
(1961)
J. Acoust. Soc. Amer.
, vol.33
, Issue.12
, pp. 1737-1746
-
-
Dunn, H.1
-
33
-
-
0021881648
-
Peripheral auditory adaptation and fatigue
-
J. Eggermont, "Peripheral auditory adaptation and fatigue," Hearing Res., vol. 18, pp. 57-71, 1985.
-
(1985)
Hearing Res.
, vol.18
, pp. 57-71
-
-
Eggermont, J.1
-
36
-
-
0022667694
-
Speaker-independent isolated word recognition using dynamic features of speech spectrum
-
S. Furui, "Speaker-independent isolated word recognition using dynamic features of speech spectrum," IEEE Trans. Acoust. Speech Signal Process., vol. ASSP-34, pp. 52-59, Jan. 1986.
-
(1986)
IEEE Trans. Acoust. Speech Signal Process.
, vol.34
, pp. 52-59
-
-
Furui, S.1
-
37
-
-
0022548705
-
On the role of spectral transition for speech perception
-
"On the role of spectral transition for speech perception," J. Acoust. Soc. Amer., vol. 80, no. 4, pp. 1016-1025, 1986.
-
(1986)
J. Acoust. Soc. Amer.
, vol.80
, Issue.4
, pp. 1016-1025
-
-
-
38
-
-
33646934291
-
Invariant acoustic cues in stop consonants: A cross-language study using the Wigner distribution
-
H. Garudadri, J. Gilbert, A. Benguerel, and M. Beddoes, "Invariant acoustic cues in stop consonants: a cross-language study using the Wigner distribution," J. Acoust. Soc. Amer., vol. 82, no. S55, 1987.
-
(1987)
J. Acoust. Soc. Amer.
, vol.82
, Issue.S55
-
-
Garudadri, H.1
Gilbert, J.2
Benguerel, A.3
Beddoes, M.4
-
39
-
-
84991416125
-
Auditory nerve representation as a front end for speech recognition in a noisy environment
-
O. Ghitza, "Auditory nerve representation as a front end for speech recognition in a noisy environment," Computer Speech and Lang., vol. 1, pp. 109-130, 1986.
-
(1986)
Computer Speech and Lang.
, vol.1
, pp. 109-130
-
-
Ghitza, O.1
-
40
-
-
0027578207
-
Hidden Markov models with templates as nonstationary states: An application to speech recognition
-
O. Ghitza and M. M. Sondhi, "Hidden Markov models with templates as nonstationary states: An application to speech recognition," Computer Speech and Lang., vol. 7, no. 2, pp. 101-119, 1993.
-
(1993)
Computer Speech and Lang.
, vol.7
, Issue.2
, pp. 101-119
-
-
Ghitza, O.1
Sondhi, M.M.2
-
42
-
-
0020798029
-
Time-dependent ARMA modeling of nonstationary signals
-
Y. Grenier, "Time-dependent ARMA modeling of nonstationary signals," IEEE Trans. Acoust. Speech Signal Process., vol. 31, pp. 899-911, Apr. 1983.
-
(1983)
IEEE Trans. Acoust. Speech Signal Process.
, vol.31
, pp. 899-911
-
-
Grenier, Y.1
-
43
-
-
0028206226
-
The contribution of the murmur and vowel to the place of articulation distinction in nasal consonants
-
J. Harrington, "The contribution of the murmur and vowel to the place of articulation distinction in nasal consonants," J. Acoust. Soc. Amer., vol. 96, no. 1, pp. 19-32, 1994.
-
(1994)
J. Acoust. Soc. Amer.
, vol.96
, Issue.1
, pp. 19-32
-
-
Harrington, J.1
-
44
-
-
0027491495
-
Effect of relative amplitude of frication on perception of place of articulations
-
M. Hedrick and R. Ohde, "Effect of relative amplitude of frication on perception of place of articulations," J. Acoust. Soc. Amer., vol. 94, no. 4, pp. 2005-2026, 1993.
-
(1993)
J. Acoust. Soc. Amer.
, vol.94
, Issue.4
, pp. 2005-2026
-
-
Hedrick, M.1
Ohde, R.2
-
45
-
-
0025041264
-
Perceptual linear predictive (PLP) analysis for speech
-
H. Hermansky, "Perceptual linear predictive (PLP) analysis for speech," J. Acoust. Soc. Amer., vol. 87, no. 4, pp. 1738-1752, 1990.
-
(1990)
J. Acoust. Soc. Amer.
, vol.87
, Issue.4
, pp. 1738-1752
-
-
Hermansky, H.1
-
46
-
-
0028517164
-
RASTA processing of speech
-
H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 578-589, 1994.
-
(1994)
IEEE Trans. Speech Audio Process.
, vol.2
, Issue.4
, pp. 578-589
-
-
Hermansky, H.1
Morgan, N.2
-
47
-
-
0010423564
-
Diphthong formants and their movements
-
A. Holbrook and G. Fairbanks, "Diphthong formants and their movements," J. Speech Hearing Res., vol. 5, no. 1, pp. 38-58, 1962.
-
(1962)
J. Speech Hearing Res.
, vol.5
, Issue.1
, pp. 38-58
-
-
Holbrook, A.1
Fairbanks, G.2
-
48
-
-
0028996918
-
Measuring fine structure in speech: Application to speaker identification
-
C. Jankowski, T. Quatieri, and D. Reynolds, "Measuring fine structure in speech: application to speaker identification," in Proc. ICASSP'95, pp. 325-328.
-
Proc. ICASSP'95
, pp. 325-328
-
-
Jankowski, C.1
Quatieri, T.2
Reynolds, D.3
-
49
-
-
0027684995
-
Signal compression based on models of human perception
-
N. Jayant, J. Johnston, and R. Safranek, "Signal compression based on models of human perception," Proc. IEEE, vol. 81, pp. 1385-1422, Oct. 1993.
-
(1993)
Proc. IEEE
, vol.81
, pp. 1385-1422
-
-
Jayant, N.1
Johnston, J.2
Safranek, R.3
-
50
-
-
0028040175
-
Vowel identification in mixed-speaker silent-center syllables
-
J. Jenkins, W. Strange, and S. Miranda, "Vowel identification in mixed-speaker silent-center syllables," J. Acoust. Soc. Amer., vol. 95, no. 2, pp. 1030-1041, 1994.
-
(1994)
J. Acoust. Soc. Amer.
, vol.95
, Issue.2
, pp. 1030-1041
-
-
Jenkins, J.1
Strange, W.2
Miranda, S.3
-
51
-
-
0022097649
-
Maximum likelihood estimation for mixture multivariate stochastic observations of Markov chains
-
B. H. Juang, "Maximum likelihood estimation for mixture multivariate stochastic observations of Markov chains," AT&T Tech. J., vol. 64, no. 6, pp. 1235-1249, 1985.
-
(1985)
AT&T Tech. J.
, vol.64
, Issue.6
, pp. 1235-1249
-
-
Juang, B.H.1
-
52
-
-
33646944964
-
Hierarchical AR model for time varying speech signals
-
O. Kakusho and M. Yanagida, "Hierarchical AR model for time varying speech signals," in Proc. ICASSP'82, pp. 1295-1298.
-
Proc. ICASSP'82
, pp. 1295-1298
-
-
Kakusho, O.1
Yanagida, M.2
-
53
-
-
0022806994
-
Spectral analysis and discrimination by zero-crossings
-
B. Kedem, "Spectral analysis and discrimination by zero-crossings," Proc. IEEE, vol. 74, pp. 1477-1493, Nov. 1986.
-
(1986)
Proc. IEEE
, vol.74
, pp. 1477-1493
-
-
Kedem, B.1
-
54
-
-
0020642039
-
Time-varying features as correlates of place of articulation in stop consonants
-
D. Kewley-Port, "Time-varying features as correlates of place of articulation in stop consonants," J. Acoust. Soc. Amer., vol. 73, no. 1, pp. 322-335, 1983.
-
(1983)
J. Acoust. Soc. Amer.
, vol.73
, Issue.1
, pp. 322-335
-
-
Kewley-Port, D.1
-
55
-
-
70349995741
-
The sound spectrograph
-
W. Koenig, H. Dunn, and L. Lacy, "The sound spectrograph," J. Acoust. Soc. Amer., vol. 18, no. 1, pp. 19-49, 1946.
-
(1946)
J. Acoust. Soc. Amer.
, vol.18
, Issue.1
, pp. 19-49
-
-
Koenig, W.1
Dunn, H.2
Lacy, L.3
-
56
-
-
0001463644
-
A duplex theory of pitch perception
-
J. Licklider, "A duplex theory of pitch perception," Experientia, vol. 7, pp. 128-133, 1951.
-
(1951)
Experientia
, vol.7
, pp. 128-133
-
-
Licklider, J.1
-
57
-
-
84953653173
-
Perturbations in vocal pitch
-
P. Lieberman, "Perturbations in vocal pitch," J. Acoust. Soc. Amer., vol. 33, no. 5, pp. 597-603, 1961.
-
(1961)
J. Acoust. Soc. Amer.
, vol.33
, Issue.5
, pp. 597-603
-
-
Lieberman, P.1
-
58
-
-
0016735638
-
Linear estimation of nonstationary signals
-
L. Liporace, "Linear estimation of nonstationary signals," J. Acoust. Soc. Amer., vol. 58, no. 6, pp. 1288-1295, 1975.
-
(1975)
J. Acoust. Soc. Amer.
, vol.58
, Issue.6
, pp. 1288-1295
-
-
Liporace, L.1
-
59
-
-
84885564021
-
Advanced time-frequency representations for speech processing
-
P. Loughlin, L. Atlas, and J. Pitton, "Advanced time-frequency representations for speech processing," Visual Representations of Speech Signals, M. Cooke and S. Beet, Eds. London, U.K.: Wiley, 1993, pp. 27-53.
-
Visual Representations of Speech Signals, M. Cooke and S. Beet, Eds. London, U.K.: Wiley, 1993
, pp. 27-53
-
-
Loughlin, P.1
Atlas, L.2
Pitton, J.3
-
60
-
-
0027542642
-
Bilinear time-frequency representations: New insights and properties
-
"Bilinear time-frequency representations: new insights and properties," IEEE Trans. Signal Process., vol. 41, pp. 750-767, Feb. 1993.
-
(1993)
IEEE Trans. Signal Process.
, vol.41
, pp. 750-767
-
-
-
61
-
-
0028517015
-
Construction of positive time-frequency distributions
-
"Construction of positive time-frequency distributions," IEEE Trans. Signal Process., vol. 42, pp. 2697-2705, Oct. 1994.
-
(1994)
IEEE Trans. Signal Process.
, vol.42
, pp. 2697-2705
-
-
-
62
-
-
0028739812
-
Approximating time-frequency density functions via optimal combinations of spectrograms
-
P. Loughlin, J. Pitton, and B. Hannaford, "Approximating time-frequency density functions via optimal combinations of spectrograms," IEEE Signal Process. Lett., vol. 1, pp. 199-202, Dec. 1994.
-
(1994)
IEEE Signal Process. Lett.
, vol.1
, pp. 199-202
-
-
Loughlin, P.1
Pitton, J.2
Hannaford, B.3
-
63
-
-
0021204483
-
Computational models of neural auditory processing
-
R. Lyon, "Computational models of neural auditory processing," Proc. ICASSP'84, 1984.
-
(1984)
Proc. ICASSP'84
-
-
Lyon, R.1
-
64
-
-
0027676955
-
Energy separation in signal modulations with application to speech analysis
-
P. Maragos, J. Kaiser, and T. Quatieri, "Energy separation in signal modulations with application to speech analysis," IEEE Trans. Signal Process., vol. 41, pp. 3024-3051, Oct. 1993.
-
(1993)
IEEE Trans. Signal Process.
, vol.41
, pp. 3024-3051
-
-
Maragos, P.1
Kaiser, J.2
Quatieri, T.3
-
66
-
-
0026654967
-
Modeling the identification of concurrent vowels with different fundamental frequencies
-
R. Meddis and M. Hewitt, "Modeling the identification of concurrent vowels with different fundamental frequencies," J. Acoust. Soc. Amer., vol. 91, no. 1, pp. 233-245, 1992.
-
(1992)
J. Acoust. Soc. Amer.
, vol.91
, Issue.1
, pp. 233-245
-
-
Meddis, R.1
Hewitt, M.2
-
67
-
-
0027409390
-
Voice source model for continuous control of pitch period
-
P. Milenkovic, "Voice source model for continuous control of pitch period," J. Acoust. Soc. Amer., vol. 93, no. 2, pp. 1087-1096, 1993.
-
(1993)
J. Acoust. Soc. Amer.
, vol.93
, Issue.2
, pp. 1087-1096
-
-
Milenkovic, P.1
-
68
-
-
0022737369
-
Adaptive identification of a time-varying ARMA speech model
-
Y. Miyanaga, N. Miki, and N. Nagai, "Adaptive identification of a time-varying ARMA speech model," IEEE Trans. Acoust. Speech Signal Process., vol. ASSP-34, pp. 423-433, Mar. 1986.
-
(1986)
IEEE Trans. Acoust. Speech Signal Process.
, vol.34
, pp. 423-433
-
-
Miyanaga, Y.1
Miki, N.2
Nagai, N.3
-
70
-
-
0028997032
-
Co-channel speaker separation
-
D. Morgan, E. George, L. Lee, and S. Kay, "Co-channel speaker separation," in Proc. ICASSP'95, vol. 1, pp. 828-831.
-
Proc. ICASSP'95
, vol.1
, pp. 828-831
-
-
Morgan, D.1
George, E.2
Lee, L.3
Kay, S.4
-
71
-
-
0028996926
-
Stochastic perceptual models of speech
-
N. Morgan et al., "Stochastic perceptual models of speech," in Proc. ICASSP'95, vol. 1, pp. 397-400.
-
Proc. ICASSP'95
, vol.1
, pp. 397-400
-
-
Morgan, N.1
-
72
-
-
33646946122
-
Filtering the time sequence of spectral parameters for speaker-independent CDHMM word recognition
-
C. Nadeu, P. Pachès-Leal, and B. H. Juang, "Filtering the time sequence of spectral parameters for speaker-independent CDHMM word recognition," Eurospeech95, Sept. 1995.
-
(1995)
Eurospeech95, Sept.
-
-
Nadeu, C.1
Pachès-Leal, P.2
Juang, B.H.3
-
73
-
-
33749761107
-
Speech enhancement based on a new set of auditory constrained parameters
-
S. Nandkumar and J. Hansen, "Speech enhancement based on a new set of auditory constrained parameters," in Proc. ICASSP'94, pp. 1-4.
-
Proc. ICASSP'94
, pp. 1-4
-
-
Nandkumar, S.1
Hansen, J.2
-
74
-
-
0026142442
-
A time-varying analysis method for rapid transitions in speech
-
K. Nathan, Y. Lee, and H. Silverman, "A time-varying analysis method for rapid transitions in speech," IEEE Trans. Signal Process., vol. 39, pp. 815-824, Apr. 1991.
-
(1991)
IEEE Trans. Signal Process.
, vol.39
, pp. 815-824
-
-
Nathan, K.1
Lee, Y.2
Silverman, H.3
-
75
-
-
0028460992
-
Time-varying feature selection and classification of unvoiced stop consonants
-
K. Nathan and H. Silverman, "Time-varying feature selection and classification of unvoiced stop consonants," IEEE Trans. Speech Audio Process., vol. 2, no. 3, pp. 395-405, 1994.
-
(1994)
IEEE Trans. Speech Audio Process.
, vol.2
, Issue.3
, pp. 395-405
-
-
Nathan, K.1
Silverman, H.2
-
76
-
-
0028710576
-
Neuromorphic speech processing for noisy environments
-
Orlando, FL, June
-
C. Neti, "Neuromorphic speech processing for noisy environments," in Proc. ICNN-94, Orlando, FL, June 1994, pp. 4425-4430.
-
(1994)
Proc. ICNN-94
, pp. 4425-4430
-
-
Neti, C.1
-
77
-
-
0000460671
-
-
R. D. Patterson et al., "Complex sounds and auditory images," in Auditory Physiology and Perception, Y. Cazals, L. Demany, and K. Horner, Eds. London: Pergamon, 1992, pp. 429-446.
-
Complex sounds and auditory images, Auditory Physiology and Perception, Y. Cazals, L. Demany, and K. Horner, Eds. London: Pergamon, 1992
, pp. 429-446
-
-
Patterson, R.D.1
-
79
-
-
84941328385
-
Control methods used in a study of vowels
-
G. Peterson and H. Barney, "Control methods used in a study of vowels," J. Acoust. Soc. Amer., vol. 24, no. 2, pp. 175-184, 1952.
-
(1952)
J. Acoust. Soc. Amer.
, vol.24
, Issue.2
, pp. 175-184
-
-
Peterson, G.1
Barney, H.2
-
81
-
-
0028516834
-
Applications of positive time-frequency distributions to speech processing
-
J. Pitton, L. Atlas, and P. Loughlin, "Applications of positive time-frequency distributions to speech processing," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 554-566, 1994.
-
(1994)
IEEE Trans. Speech Audio Process.
, vol.2
, Issue.4
, pp. 554-566
-
-
Pitton, J.1
Atlas, L.2
Loughlin, P.3
-
82
-
-
0026078506
-
A computational model of afferent neural activity from the cochlea to the dorsal acoustic stria
-
M. Pont and R. Damper, "A computational model of afferent neural activity from the cochlea to the dorsal acoustic stria," J. Acoust. Soc. Amer., vol. 89, no. 3, pp. 1213-1228, 1991.
-
(1991)
J. Acoust. Soc. Amer.
, vol.89
, Issue.3
, pp. 1213-1228
-
-
Pont, M.1
Damper, R.2
-
85
-
-
0028996925
-
Robust utterance verification for connected digits recognition
-
M. Rahim, C. H. Lee, and B. H. Juang, "Robust utterance verification for connected digits recognition," in Proc. ICASSP'95, vol. 1, pp. 285-288.
-
Proc. ICASSP'95
, vol.1
, pp. 285-288
-
-
Rahim, M.1
Lee, C.H.2
Juang, B.H.3
-
86
-
-
0024589234
-
Acoustic properties and perception of consonant release transients
-
B. Repp and H. Lin, "Acoustic properties and perception of consonant release transients," J. Acoust. Soc. Amer., vol. 85, no. 1, pp. 379-396, 1989.
-
(1989)
J. Acoust. Soc. Amer.
, vol.85
, Issue.1
, pp. 379-396
-
-
Repp, B.1
Lin, H.2
-
87
-
-
0028912840
-
Auditory-nerve encoding of pinna-based spectral cues: Rate representation of high-frequency stimuli
-
J. Rice, E. Young, and G. Spirou, "Auditory-nerve encoding of pinna-based spectral cues: rate representation of high-frequency stimuli," J. Acoust. Soc. Amer., vol. 97, no. 3, pp. 1764-1776, 1995.
-
(1995)
J. Acoust. Soc. Amer.
, vol.97
, Issue.3
, pp. 1764-1776
-
-
Rice, J.1
Young, E.2
Spirou, G.3
-
89
-
-
0037795511
-
Frequency selectivity and the perception of speech
-
S. Rosen and A. Fourcin, "Frequency selectivity and the perception of speech," in Frequency Selectivity in Hearing, B. Moore, Ed. New York: Academic, 1986, pp. 373-487.
-
Frequency Selectivity in Hearing, B. Moore, Ed. New York: Academic, 1986
, pp. 373-487
-
-
Rosen, S.1
Fourcin, A.2
-
90
-
-
0020579183
-
Auditory nerve representation of vowels in background noise
-
M. Sachs, H. Voigt, and E. Young, "Auditory nerve representation of vowels in background noise," J. Neurophys., vol. 50, pp. 27-45, 1983.
-
(1983)
J. Neurophys.
, vol.50
, pp. 27-45
-
-
Sachs, M.1
Voigt, H.2
Young, E.3
-
91
-
-
0018617277
-
Encoding of steady-state vowels in the auditory nerve: Representation in terms of discharge rate
-
M. Sachs and E. Young, "Encoding of steady-state vowels in the auditory nerve: representation in terms of discharge rate," J. Acoust. Soc. Amer., vol. 66, no. 1, pp. 470-479, 1979.
-
(1979)
J. Acoust. Soc. Amer.
, vol.66
, Issue.1
, pp. 470-479
-
-
Sachs, M.1
Young, E.2
-
92
-
-
0029239090
-
A comparative study of mel cepstra and EIH for phone classification under adverse conditions
-
S. Sandhu and O. Ghitza, "A comparative study of mel cepstra and EIH for phone classification under adverse conditions," in Proc. ICASSP'95, vol. 1, pp. 409-412.
-
Proc. ICASSP'95
, vol.1
, pp. 409-412
-
-
Sandhu, S.1
Ghitza, O.2
-
93
-
-
84928837806
-
A joint synchrony/mean-rate model of auditory processing
-
S. Seneff, "A joint synchrony/mean-rate model of auditory processing," J. Phonetics, vol. 16, no. 1, pp. 55-76, 1988.
-
(1988)
J. Phonetics
, vol.16
, Issue.1
, pp. 55-76
-
-
Seneff, S.1
-
94
-
-
0022348981
-
Speech processing in the auditory system II: Lateral inhibition and the central processing of speech evoked activity in the auditory nerve
-
S. Shamma, "Speech processing in the auditory system II: lateral inhibition and the central processing of speech evoked activity in the auditory nerve," J. Acoust. Soc. Amer., vol. 78, no. 5, pp. 1622-1632, 1985.
-
(1985)
J. Acoust. Soc. Amer.
, vol.78
, Issue.5
, pp. 1622-1632
-
-
Shamma, S.1
-
95
-
-
84928841878
-
The acoustic features of speech sounds in a model of auditory processing: Vowels and voiceless fricatives
-
"The acoustic features of speech sounds in a model of auditory processing: vowels and voiceless fricatives," J. Phonetics, vol. 16, pp. 77-91, 1988.
-
(1988)
J. Phonetics
, vol.16
, pp. 77-91
-
-
-
96
-
-
0020707077
-
Responses of auditory-nerve fibers to consonant-vowel syllables
-
D. Sinex and C. Geisler, "Responses of auditory-nerve fibers to consonant-vowel syllables," J. Acoust. Soc. Amer., vol. 73, no. 2, pp. 602-615, 1983.
-
(1983)
J. Acoust. Soc. Amer.
, vol.73
, Issue.2
, pp. 602-615
-
-
Sinex, D.1
Geisler, C.2
-
97
-
-
0021461483
-
Comparison of the responses of auditory-nerve fibers to consonant-vowel syllables with predictions from linear models
-
"Comparison of the responses of auditory-nerve fibers to consonant-vowel syllables with predictions from linear models," J. Acoust. Soc. Amer., vol. 76, no. 1, pp. 116-121, 1984.
-
(1984)
J. Acoust. Soc. Amer.
, vol.76
, Issue.1
, pp. 116-121
-
-
-
98
-
-
0002296637
-
On the importance of time-a temporal representation of sound
-
M. Slaney and R. Lyon, "On the importance of time-a temporal representation of sound," in Visual Representations of Speech Signals, M. Cooke, S. Beet, and M. Crawford, Eds. New York: Wiley, 1993, pp. 95-116.
-
Visual Representations of Speech Signals, M. Cooke, S. Beet, and M. Crawford, Eds. New York: Wiley, 1993
, pp. 95-116
-
-
Slaney, M.1
Lyon, R.2
-
99
-
-
0028657430
-
Accuracy of quasistationary analysis of highly dynamic speech signals
-
R. Smits, "Accuracy of quasistationary analysis of highly dynamic speech signals," J. Acoust. Soc. Amer., vol. 96, no. 6, pp. 3401-3415, 1994.
-
(1994)
J. Acoust. Soc. Amer.
, vol.96
, Issue.6
, pp. 3401-3415
-
-
Smits, R.1
-
100
-
-
33646919328
-
-
personal communication
-
M. M. Sondhi, personal communication.
-
-
-
Sondhi, M.M.1
-
101
-
-
0018038036
-
Invariant cues for place of articulation in stop consonants
-
K. Stevens and S. Blumstein, "Invariant cues for place of articulation in stop consonants," J. Acoust. Soc. Amer., vol. 64, no. 5, pp. 1358-1368, 1978.
-
(1978)
J. Acoust. Soc. Amer.
, vol.64
, Issue.5
, pp. 1358-1368
-
-
Stevens, K.1
Blumstein, S.2
-
102
-
-
0020816189
-
Dynamic specification of coarticulated vowels
-
W. Strange, J. Jenkins, and T. Johnson, "Dynamic specification of coarticulated vowels," J. Acoust. Soc. Amer., vol. 74, no. 3, pp. 695-705, 1983.
-
(1983)
J. Acoust. Soc. Amer.
, vol.74
, Issue.3
, pp. 695-705
-
-
Strange, W.1
Jenkins, J.2
Johnson, T.3
-
103
-
-
0026030074
-
Perception of concurrent vowels: Effects of harmonic misalignment and pitch-period asynchrony
-
Q. Summerfield and P. Assmann, "Perception of concurrent vowels: effects of harmonic misalignment and pitch-period asynchrony," J. Acoust. Soc. Amer., vol. 89, no. 3, pp. 1364-1377, 1991.
-
(1991)
J. Acoust. Soc. Amer.
, vol.89
, Issue.3
, pp. 1364-1377
-
-
Summerfield, Q.1
Assmann, P.2
-
104
-
-
0024929841
-
Transient analysis of speech signals using the Wigner time-frequency representation
-
E. Velez and R. Absher, "Transient analysis of speech signals using the Wigner time-frequency representation," in Proc. ICASSP'89, pp. 2242-2245.
-
Proc. ICASSP'89
, pp. 2242-2245
-
-
Velez, E.1
Absher, R.2
-
105
-
-
0001843298
-
Théorie et applications de la notion de signal analytique
-
J. Ville, "Théorie et applications de la notion de signal analytique," Câbles et Transmissions, vol. 2A, no. 1, pp. 61-74, 1948.
-
(1948)
Câbles et Transmissions
, vol.2A
, Issue.1
, pp. 61-74
-
-
Ville, J.1
-
106
-
-
0346642106
-
Theory and applications of the notion of complex signal
-
RAND Corp., Santa Monica, CA
-
I. Selin, transl., "Theory and applications of the notion of complex signal," Tech. Rep. T-92, RAND Corp., Santa Monica, CA, 1958.
-
(1958)
Tech. Rep.
-
-
Selin, I.1
-
107
-
-
0028997028
-
Speech enhancement based on masking properties of the auditory system
-
N. Virag, "Speech enhancement based on masking properties of the auditory system," in Proc. ICASSP'95, vol. 1, pp. 796-799.
-
Proc. ICASSP'95
, vol.1
, pp. 796-799
-
-
Virag, N.1
-
108
-
-
0028462212
-
Self-normalization and noise-robustness in early auditory representations
-
K. Wang and S. Shamma, "Self-normalization and noise-robustness in early auditory representations," IEEE Trans. Speech Audio Process., vol. 2, pp. 421-435, 1994.
-
(1994)
IEEE Trans. Speech Audio Process.
, vol.2
, pp. 421-435
-
-
Wang, K.1
Shamma, S.2
-
109
-
-
33646919723
-
A diffusion model of the transient response of the cochlear inner hair cell synapse
-
L. Westerman and R. Smith, "A diffusion model of the transient response of the cochlear inner hair cell synapse," J. Acoust. Soc. Amer., vol. 93, no. 1, pp. 401-417, 1993.
-
(1993)
J. Acoust. Soc. Amer.
, vol.93
, Issue.1
, pp. 401-417
-
-
Westerman, L.1
Smith, R.2
-
110
-
-
0021207990
-
Rapid and short term adaptation in auditory nerve responses
-
"Rapid and short term adaptation in auditory nerve responses," Hearing Res., vol. 15, pp. 249-260, 1985.
-
(1985)
Hearing Res.
, vol.15
, pp. 249-260
-
-
-
111
-
-
33745014742
-
On the quantum correction for thermodynamic equilibrium
-
E. Wigner, "On the quantum correction for thermodynamic equilibrium," Phys. Rev., vol. 40, pp. 749-759, 1932.
-
(1932)
Phys. Rev.
, vol.40
, pp. 749-759
-
-
Wigner, E.1
-
112
-
-
0018653975
-
Least squares glottal inverse filtering from the acoustic speech waveform
-
D. Wong, J. Markel, and A. Gray, "Least squares glottal inverse filtering from the acoustic speech waveform," IEEE Trans. Acoust. Speech Signal Process., vol. 27, no. 4, pp. 350-355, 1979.
-
(1979)
IEEE Trans. Acoust. Speech Signal Process.
, vol.27
, Issue.4
, pp. 350-355
-
-
Wong, D.1
Markel, J.2
Gray, A.3
-
113
-
-
0018606571
-
Representation of steady-state vowels in the temporal aspects of the discharge patterns of populations of auditory-nerve fibers
-
E. Young and M. Sachs, "Representation of steady-state vowels in the temporal aspects of the discharge patterns of populations of auditory-nerve fibers," J. Acoust. Soc. Amer., vol. 66, no. 3, pp. 1381-1403, 1979.
-
(1979)
J. Acoust. Soc. Amer.
, vol.66
, Issue.3
, pp. 1381-1403
-
-
Young, E.1
Sachs, M.2
-
114
-
-
0027368837
-
Spectral-shape features versus formants as acoustic correlates for vowels
-
S. Zahorian and A. Jagharghi, "Spectral-shape features versus formants as acoustic correlates for vowels," J. Acoust. Soc. Amer., vol. 94, no. 4, pp. 1966-1982, 1993.
-
(1993)
J. Acoust. Soc. Amer.
, vol.94
, Issue.4
, pp. 1966-1982
-
-
Zahorian, S.1
Jagharghi, A.2
-
115
-
-
0025463449
-
The use of cone-shaped kernels for generalized time-frequency representations of nonstationary signals
-
Y. Zhao, L. Atlas, and R. Marks, "The use of cone-shaped kernels for generalized time-frequency representations of nonstationary signals," IEEE Trans. Acoust. Speech Signal Process., vol. 38, no. 7, pp. 1084-1091, 1990.
-
(1990)
IEEE Trans. Acoust. Speech Signal Process.
, vol.38
, Issue.7
, pp. 1084-1091
-
-
Zhao, Y.1
Atlas, L.2
Marks, R.3