-
1
-
-
4544303183
-
Speech discrimination based on multiscale spectro-temporal modulations
-
May
-
N. Mesgarani, S. Shamma, and M. Slaney, "Speech discrimination based on multiscale spectro-temporal modulations," in Proc. IEEE Int. Conf. Acoust., Speech. Signal Process.,May 2004, vol. 1, pp. 601-604.
-
(2004)
Proc. IEEE Int. Conf. Acoust., Speech. Signal Process
, vol.1
, pp. 601-604
-
-
Mesgarani, N.1
Shamma, S.2
Slaney, M.3
-
3
-
-
33744994972
-
Automatic speech recognition with an adaptation model motivated by auditory processing
-
Jan
-
M. Holmberg, D. Gelbart, and W. Hemmert, "Automatic speech recognition with an adaptation model motivated by auditory processing," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 1, pp. 43-49, Jan. 2006.
-
(2006)
IEEE Trans. Audio, Speech, Lang. Process
, vol.14
, Issue.1
, pp. 43-49
-
-
Holmberg, M.1
Gelbart, D.2
Hemmert, W.3
-
4
-
-
0025041264
-
Perceptual linear predictive (PLP) analysis of speech
-
H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech," J. Acoust. Soc. Amer., vol. 87, pp. 1738-1752, 1990.
-
(1990)
J. Acoust. Soc. Amer
, vol.87
, pp. 1738-1752
-
-
Hermansky, H.1
-
5
-
-
0019053271
-
Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
-
Aug
-
S. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 4, pp. 357-366, Aug. 1980.
-
(1980)
IEEE Trans. Acoust., Speech, Signal Process
, vol.ASSP-28
, Issue.4
, pp. 357-366
-
-
Davis, S.1
Mermelstein, P.2
-
6
-
-
0031187171
-
Speech recognition by machines and humans
-
Mar
-
R. Lippmann, "Speech recognition by machines and humans," Speech Commun., vol. 22, no. 1, pp. 1-15, Mar. 1997.
-
(1997)
Speech Commun
, vol.22
, Issue.1
, pp. 1-15
-
-
Lippmann, R.1
-
7
-
-
0016067897
-
Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
-
B. S. Atal, "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," J. Acoust. Soc. Amer., vol. 55, pp. 1304-1312, 1974.
-
(1974)
J. Acoust. Soc. Amer
, vol.55
, pp. 1304-1312
-
-
Atal, B.S.1
-
8
-
-
0018455310
-
Suppression of acoustic noise in speech using spectral subtraction
-
Apr
-
S. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Process., vol. 27, no. 2, pp. 113-120, Apr. 1979.
-
(1979)
IEEE Trans. Acoust., Speech, Signal Process
, vol.27
, Issue.2
, pp. 113-120
-
-
Boll, S.1
-
9
-
-
0033099548
-
On second-order statistics and linear estimation of cepstral coefficients
-
Mar
-
Y. Ephraim and M. Rahim, "On second-order statistics and linear estimation of cepstral coefficients," IEEE Trans. Speech Audio Process., vol. 7, no. 2, pp. 162-176, Mar. 1999.
-
(1999)
IEEE Trans. Speech Audio Process
, vol.7
, Issue.2
, pp. 162-176
-
-
Ephraim, Y.1
Rahim, M.2
-
10
-
-
0001459635
-
Frequency-domain maximum likelihood estimation for automatic speech recognition in additive and convolutive noises
-
May
-
Y. Zhao, "Frequency-domain maximum likelihood estimation for automatic speech recognition in additive and convolutive noises," IEEE Trans. Speech Audio Process., vol. 8, no. 3, pp. 255-266, May 2000.
-
(2000)
IEEE Trans. Speech Audio Process
, vol.8
, Issue.3
, pp. 255-266
-
-
Zhao, Y.1
-
11
-
-
0026881830
-
Gain-adapted hidden markov models for recognition of clean and noisy speech
-
Jun
-
Y. Ephraim, "Gain-adapted hidden markov models for recognition of clean and noisy speech," IEEE Trans. Signal Process., vol. 40, no. 6, pp. 1303-1316, Jun. 1992.
-
(1992)
IEEE Trans. Signal Process
, vol.40
, Issue.6
, pp. 1303-1316
-
-
Ephraim, Y.1
-
12
-
-
0002671953
-
A minimax classification approach with application to robust speech recognition
-
Jan
-
N. Merhav and C.-H. Lee, "A minimax classification approach with application to robust speech recognition," IEEE Trans. Speech Audio Process., vol. 1, no. 1, pp. 90-100, Jan. 1993.
-
(1993)
IEEE Trans. Speech Audio Process
, vol.1
, Issue.1
, pp. 90-100
-
-
Merhav, N.1
Lee, C.-H.2
-
13
-
-
0018437122
-
Automatic speech recognition using psychoacoustic models
-
E. Zwicker, E. Terhardt, and E. Paulus, "Automatic speech recognition using psychoacoustic models," J. Acoust. Soc. Amer., vol. 65, pp. 487-498, 1979.
-
(1979)
J. Acoust. Soc. Amer
, vol.65
, pp. 487-498
-
-
Zwicker, E.1
Terhardt, E.2
Paulus, E.3
-
14
-
-
0023859986
-
Auditory neural feedback as a basis for speech processing
-
O. Ghitza, "Auditory neural feedback as a basis for speech processing," in Proc. IEEE Int. Conf. Acoust., Speech. Signal Process., 1988, vol. 1, pp. 91-94.
-
(1988)
Proc. IEEE Int. Conf. Acoust., Speech. Signal Process
, vol.1
, pp. 91-94
-
-
Ghitza, O.1
-
15
-
-
0024392496
-
Application of an auditory model to speech recognition
-
J. R. Cohen, "Application of an auditory model to speech recognition," J. Acoust. Soc. Amer., vol. 85, pp. 2623-2629, 1989.
-
(1989)
J. Acoust. Soc. Amer
, vol.85
, pp. 2623-2629
-
-
Cohen, J.R.1
-
16
-
-
0032828464
-
A model of auditory perception as front end for automatic speech recognition
-
Oct
-
J. Tchorz and B. Kollmeier, "A model of auditory perception as front end for automatic speech recognition," J. Acoust. Soc. Amer., vol. 106, no. 4, pp. 2040-2050, Oct. 1999.
-
(1999)
J. Acoust. Soc. Amer
, vol.106
, Issue.4
, pp. 2040-2050
-
-
Tchorz, J.1
Kollmeier, B.2
-
17
-
-
0029345416
-
A comparison of signal processing front ends for automatic word recognition
-
Jul
-
C. R. Jankowski, H.-D. H. Vo, and R. P. Lippmann, "A comparison of signal processing front ends for automatic word recognition," IEEE Trans. Speech Audio Process., vol. 3, no. 4, pp. 286-293, Jul. 1995.
-
(1995)
IEEE Trans. Speech Audio Process
, vol.3
, Issue.4
, pp. 286-293
-
-
Jankowski, C.R.1
Vo, H.-D.H.2
Lippmann, R.P.3
-
18
-
-
0031647650
-
Speech analysis and recognition using interval statistics generated from a composite auditory model
-
Jan
-
H. Sheikhzadeh and L. Deng, "Speech analysis and recognition using interval statistics generated from a composite auditory model," IEEE Trans. Speech Audio Process., vol. 6, no. 1, pp. 90-94, Jan. 1998.
-
(1998)
IEEE Trans. Speech Audio Process
, vol.6
, Issue.1
, pp. 90-94
-
-
Sheikhzadeh, H.1
Deng, L.2
-
19
-
-
0031238095
-
A model of dynamic auditory perception and its application to robust word recognition
-
Sep
-
B. Strope and A. Alwan, "A model of dynamic auditory perception and its application to robust word recognition," IEEE Trans. Speech Audio Process., vol. 5, no. 5, pp. 451-464, Sep. 1997.
-
(1997)
IEEE Trans. Speech Audio Process
, vol.5
, Issue.5
, pp. 451-464
-
-
Strope, B.1
Alwan, A.2
-
20
-
-
0003760813
-
Central auditory model for spectral processing
-
Apr
-
Y. Gao, T. Huang, and J.-P. Haton, "Central auditory model for spectral processing," in Proc. IEEE Int. Conf. Acoust., Speech. Signal Process., Apr. 1993, pp. 704-707.
-
(1993)
Proc. IEEE Int. Conf. Acoust., Speech. Signal Process
, pp. 704-707
-
-
Gao, Y.1
Huang, T.2
Haton, J.-P.3
-
21
-
-
85009227802
-
Localized spectro-temporal features for automatic speech recognition
-
M. Kleinschmidt, "Localized spectro-temporal features for automatic speech recognition," in Proc. Interspeech'02, 2002, pp. 2573-2576.
-
(2002)
Proc. Interspeech'02
, pp. 2573-2576
-
-
Kleinschmidt, M.1
-
23
-
-
0026626445
-
Auditory representations of acoustic signals
-
Mar
-
X. Yang, K. Wang, and S. A. Shamma, "Auditory representations of acoustic signals," IEEE Trans. Inf. Theory, vol. 38, no. 2, pp. 824-839, Mar. 1992.
-
(1992)
IEEE Trans. Inf. Theory
, vol.38
, Issue.2
, pp. 824-839
-
-
Yang, X.1
Wang, K.2
Shamma, S.A.3
-
24
-
-
0029378080
-
Spectral shape analysis in the central auditory system
-
Sep
-
K. Wang and S. A. Shamma, "Spectral shape analysis in the central auditory system," IEEE Trans. Speech Audio Process., vol. 3, no. 5, pp. 382-395, Sep. 1995.
-
(1995)
IEEE Trans. Speech Audio Process
, vol.3
, Issue.5
, pp. 382-395
-
-
Wang, K.1
Shamma, S.A.2
-
26
-
-
0034710863
-
Auditory neuroscience: Development, transduction, and integration
-
A. J. Hudspeth and M. Konishi, "Auditory neuroscience: Development, transduction, and integration," Proc. National Academy Sci., pp. 11690-11691, 2000.
-
(2000)
Proc. National Academy Sci
, pp. 11690-11691
-
-
Hudspeth, A.J.1
Konishi, M.2
-
27
-
-
23744508888
-
Multiresolution spectrotemporal analysis of complex sounds
-
Aug
-
T. Chi, P. Ru, and S. A. Shamma, "Multiresolution spectrotemporal analysis of complex sounds," J. Acoust. Soc. Amer., vol. 118, no. 2, pp. 887-906, Aug. 2005.
-
(2005)
J. Acoust. Soc. Amer
, vol.118
, Issue.2
, pp. 887-906
-
-
Chi, T.1
Ru, P.2
Shamma, S.A.3
-
28
-
-
0028462212
-
Self-normalization and noise-robustness in early auditory representations
-
Jul
-
K. Wang and S. Shamma, "Self-normalization and noise-robustness in early auditory representations," IEEE Trans. Speech Audio Process., vol. 2, no. 3, pp. 421-435, Jul. 1994.
-
(1994)
IEEE Trans. Speech Audio Process
, vol.2
, Issue.3
, pp. 421-435
-
-
Wang, K.1
Shamma, S.2
-
32
-
-
79251542316
-
A computational model of filtering, detection, and compression in the cochlea
-
May
-
R. Lyon, "A computational model of filtering, detection, and compression in the cochlea," in Proc. IEEE Int. Conf. Acoust., Speech. Signal Process., May 1982, vol. 7, pp. 1282-1285.
-
(1982)
Proc. IEEE Int. Conf. Acoust., Speech. Signal Process
, vol.7
, pp. 1282-1285
-
-
Lyon, R.1
-
33
-
-
0021794508
-
Cochlear modeling, IEEE Acoust., Speech
-
Jan
-
J. Allen, "Cochlear modeling," IEEE Acoust., Speech, Signal Process. Mag., vol. 2, no. 1, pp. 3-29, Jan. 1985.
-
(1985)
Signal Process. Mag
, vol.2
, Issue.1
, pp. 3-29
-
-
Allen, J.1
-
34
-
-
33750418033
-
Properties of auditory model representations
-
F. S. Perdigao and L. V. Sa, "Properties of auditory model representations," in Proc. Eurospeech'97, 1997, pp. 2499-2502.
-
(1997)
Proc. Eurospeech'97
, pp. 2499-2502
-
-
Perdigao, F.S.1
Sa, L.V.2
-
35
-
-
0022873930
-
A computational model for the peripheral auditory system: Application of speech recognition research
-
Apr
-
S. Seneff, "A computational model for the peripheral auditory system: Application of speech recognition research," in Proc. IEEE Int. Conf. Acoust., Speech. Signal Process., Apr. 1986, pp. 1983-1986.
-
(1986)
Proc. IEEE Int. Conf. Acoust., Speech. Signal Process
, pp. 1983-1986
-
-
Seneff, S.1
-
37
-
-
64549088551
-
-
The Institute for Systems Research, Online, Available
-
The Institute for Systems Research. [Online]. Available: http://www.isr.umd.edu/CAAR/
-
-
-
-
38
-
-
0030740959
-
Laminar fine structure of frequency organization in auditory midbrain
-
Jul
-
C. E. Schreiner and G. Langner, "Laminar fine structure of frequency organization in auditory midbrain," Nature, vol. 388, pp. 383-386, Jul. 1997.
-
(1997)
Nature
, vol.388
, pp. 383-386
-
-
Schreiner, C.E.1
Langner, G.2
-
39
-
-
0034037502
-
Modular organization of frequency integration in primary auditory cortex
-
Mar
-
C. E. Schreiner, H. L. Read, and M. L. Sutter, "Modular organization of frequency integration in primary auditory cortex," Annu. Rev. Neurosci., vol. 23, pp. 501-529, Mar. 2000.
-
(2000)
Annu. Rev. Neurosci
, vol.23
, pp. 501-529
-
-
Schreiner, C.E.1
Read, H.L.2
Sutter, M.L.3
-
43
-
-
0029238302
-
Subband analysis for robust speech recognition in the presence of car noise
-
May
-
E. Erzin, A. E. Cetin, and Y. Yardimci, "Subband analysis for robust speech recognition in the presence of car noise," in Proc. IEEE Int. Conf. Acoust., Speech. Signal Process., May 1995, pp. 417-420.
-
(1995)
Proc. IEEE Int. Conf. Acoust., Speech. Signal Process
, pp. 417-420
-
-
Erzin, E.1
Cetin, A.E.2
Yardimci, Y.3
-
44
-
-
84962871227
-
Robust speech recognition using wavelet coefficient features
-
Dec
-
M. Gupta and A. Gilbert, "Robust speech recognition using wavelet coefficient features," in Proc. IEEE Workshop ASRU 2001, Dec. 2001, pp. 445-448.
-
(2001)
Proc. IEEE Workshop ASRU 2001
, pp. 445-448
-
-
Gupta, M.1
Gilbert, A.2
-
45
-
-
0037340693
-
Distinct brain regions associated with syllable and phoneme
-
W. T. Siok, Z. Jin, P. Fletcher, and L. H. Tan, "Distinct brain regions associated with syllable and phoneme," Human Brain Mapping, vol. 18, pp. 201-207, 2003.
-
(2003)
Human Brain Mapping
, vol.18
, pp. 201-207
-
-
Siok, W.T.1
Jin, Z.2
Fletcher, P.3
Tan, L.H.4
-
46
-
-
0030960693
-
Lefthemisphere specialization for the processing of acoustic transients
-
I. S. Johnsrude, R. J. Zatorre, B. A. Milner, and A. C. Evans, "Lefthemisphere specialization for the processing of acoustic transients," NeuroReport, vol. 8, pp. 1761-1765, 1997.
-
(1997)
NeuroReport
, vol.8
, pp. 1761-1765
-
-
Johnsrude, I.S.1
Zatorre, R.J.2
Milner, B.A.3
Evans, A.C.4
-
47
-
-
64549112338
-
-
R. O. Duda, P. E. Hart, and D. G. Stork, Pattern Classification. New York: Wiley, 2001, pp. 117-170.
-
R. O. Duda, P. E. Hart, and D. G. Stork, Pattern Classification. New York: Wiley, 2001, pp. 117-170.
-
-
-
-
48
-
-
33745190989
-
A category-dependent feature selection method for speech signals
-
Lisbon, Portugal, Sep
-
W. Jeon and B. -H. Juang, "A category-dependent feature selection method for speech signals," in Proc. Interspeech'05, Lisbon, Portugal, Sep. 2005, pp. 365-368.
-
(2005)
Proc. Interspeech'05
, pp. 365-368
-
-
Jeon, W.1
Juang, B.-H.2
-
49
-
-
0035145191
-
Hierarchical organization of the human auditory cortex revealed by functional magnetic resonance imaging
-
C. M. Wessinger, J. VanMeter, B. Tian, J. V. Lare, J. Pekar, and J. P. Rauschecker, "Hierarchical organization of the human auditory cortex revealed by functional magnetic resonance imaging," J. Cognitive Neurosci., vol. 13, no. 1, pp. 1-7, 2001.
-
(2001)
J. Cognitive Neurosci
, vol.13
, Issue.1
, pp. 1-7
-
-
Wessinger, C.M.1
VanMeter, J.2
Tian, B.3
Lare, J.V.4
Pekar, J.5
Rauschecker, J.P.6
-
50
-
-
0024768209
-
Speaker-independent phone recognition using hidden markov models
-
Nov
-
K.-F. Lee and H.-W. Hon, "Speaker-independent phone recognition using hidden markov models," IEEE Trans. Acoust., Speech, Signal Process., vol. 37, no. 11, pp. 1641-1648, Nov. 1989.
-
(1989)
IEEE Trans. Acoust., Speech, Signal Process
, vol.37
, Issue.11
, pp. 1641-1648
-
-
Lee, K.-F.1
Hon, H.-W.2
-
51
-
-
33745185781
-
Hidden conditional random fields for phone classification
-
Lisbon, Portugal, Sep
-
A. Gunawardana, M. Mahajan, A. Acero, and J. C. Platt, "Hidden conditional random fields for phone classification," in Interspeech'05, Lisbon, Portugal, Sep. 2005, pp. 1117-1120.
-
(2005)
Interspeech'05
, pp. 1117-1120
-
-
Gunawardana, A.1
Mahajan, M.2
Acero, A.3
Platt, J.C.4
|