-
1
-
-
85009233038
-
Improving word accuracy with gabor feature extraction
-
M. Kleinschmidt and D. Gelbart, "Improving word accuracy with gabor feature extraction, " in Proc. ICSLP, vol. 5, 2002, pp. 16-38.
-
(2002)
Proc. ICSLP
, vol.5
, pp. 16-38
-
-
Kleinschmidt, M.1
Gelbart, D.2
-
2
-
-
85009227802
-
Localized spectro-temporal features for automatic speech recognition
-
Citeseer
-
M. Kleinschmidt, "Localized spectro-temporal features for automatic speech recognition, " in Proc. Eurospeech, vol. 87. Citeseer, 2003.
-
(2003)
Proc. Eurospeech
, vol.87
-
-
Kleinschmidt, M.1
-
3
-
-
34547509128
-
Representation of phonemes in primary auditory cortex: How the brain analyzes speech
-
IV-765
-
N. Mesgarani, S. David, and S. Shamma, "Representation of phonemes in primary auditory cortex: how the brain analyzes speech, " in Proc. ICASSP, vol. 4, 2007, pp. IV-765.
-
(2007)
Proc. ICASSP
, vol.4
-
-
Mesgarani, N.1
David, S.2
Shamma, S.3
-
4
-
-
0038711696
-
A spectro-temporal modulation index (stmi) for assessment of speech intelligibility
-
M. Elhilali, T. Chi, and S. A. Shamma, "A spectro-temporal modulation index (stmi) for assessment of speech intelligibility, " Speech communication, vol. 41, no. 2, pp. 331-348, 2003.
-
(2003)
Speech Communication
, vol.41
, Issue.2
, pp. 331-348
-
-
Elhilali, M.1
Chi, T.2
Shamma, S.A.3
-
5
-
-
34047272330
-
Discrimination of speech from nonspeech based on multiscale spectro-temporal modulations
-
N. Mesgarani, M. Slaney, and S. Shamma, "Discrimination of speech from nonspeech based on multiscale spectro-temporal modulations, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 14, no. 3, pp. 920-930, 2006.
-
(2006)
Audio, Speech, and Language Processing, IEEE Transactions on
, vol.14
, Issue.3
, pp. 920-930
-
-
Mesgarani, N.1
Slaney, M.2
Shamma, S.3
-
6
-
-
84865769808
-
Comparing different flavors of spectro-temporal features for asr
-
B. Meyer, S. Ravuri, M. Schädler, and N. Morgan, "Comparing different flavors of spectro-temporal features for asr, " in Proc. of Inter Speech, 2011, pp. 1269-1272.
-
(2011)
Proc. of Inter Speech
, pp. 1269-1272
-
-
Meyer, B.1
Ravuri, S.2
Schädler, M.3
Morgan, N.4
-
7
-
-
84890497049
-
Hooking up spectro-temporal filters with auditory-inspired representations for robust automatic speech recognition
-
B. Meyer, C. Spille, B. Kollmeier, and N. Morgan, "Hooking up spectro-temporal filters with auditory-inspired representations for robust automatic speech recognition, " in Proc. Inter Speech, vol. 15, 2012, p. 20.
-
(2012)
Proc. Inter Speech
, vol.15
, pp. 20
-
-
Meyer, B.1
Spille, C.2
Kollmeier, B.3
Morgan, N.4
-
8
-
-
84878611488
-
Normalization of spectrotemporal gabor filter bank features for improved robust automatic speech recognition systems
-
M. R. Schädler and B. Kollmeier, "Normalization of spectrotemporal gabor filter bank features for improved robust automatic speech recognition systems, " in Proc. Inter Speech, 2012.
-
(2012)
Proc. Inter Speech
-
-
Schädler, M.R.1
Kollmeier, B.2
-
9
-
-
84878395103
-
Longer features: They do a speech detector good
-
T. Tsai and N. Morgan, "Longer features: They do a speech detector good, " in Proc. Inter Speech, 2012.
-
(2012)
Proc. Inter Speech
-
-
Tsai, T.1
Morgan, N.2
-
10
-
-
84863799482
-
Spectro-temporal modulation subspace-spanning filter bank features for robust automatic speech recognition
-
M. R. Schädler, B. T. Meyer, and B. Kollmeier, "Spectro- temporal modulation subspace-spanning filter bank features for robust automatic speech recognition, " The Journal of the Acoustical Society of America, vol. 131, p. 4134, 2012.
-
(2012)
The Journal of the Acoustical Society of America
, vol.131
, pp. 4134
-
-
Schädler, M.R.1
Meyer, B.T.2
Kollmeier, B.3
-
11
-
-
0141624530
-
An efficient auditory filterbank based on the gammatone function
-
R. Patterson, I. Nimmo-Smith, J. Holdsworth, and P. Rice, "An efficient auditory filterbank based on the gammatone function, " APU report, vol. 2341, 1988.
-
(1988)
APU Report
, vol.2341
-
-
Patterson, R.1
Nimmo-Smith, I.2
Holdsworth, J.3
Rice, P.4
-
12
-
-
0032136330
-
Robust speech recognition using the modulation spectrogram
-
B. E. Kingsbury, N. Morgan, and S. Greenberg, "Robust speech recognition using the modulation spectrogram, " Speech Communication, vol. 25, no. 1, pp. 117-132, 1998.
-
(1998)
Speech Communication
, vol.25
, Issue.1
, pp. 117-132
-
-
Kingsbury, B.E.1
Morgan, N.2
Greenberg, S.3
-
13
-
-
0026626445
-
Auditory representations of acoustic signals
-
X. Yang, K. Wang, and S. A. Shamma, "Auditory representations of acoustic signals, " Information Theory, IEEE Transactions on, vol. 38, no. 2, pp. 824-839, 1992.
-
(1992)
Information Theory, IEEE Transactions on
, vol.38
, Issue.2
, pp. 824-839
-
-
Yang, X.1
Wang, K.2
Shamma, S.A.3
-
14
-
-
0026735337
-
Shiftable multiscale transforms
-
E. P. Simoncelli, W. T. Freeman, E. H. Adelson, and D. J. Heeger, "Shiftable multiscale transforms, " Information Theory, IEEE Transactions on, vol. 38, no. 2, pp. 587-607, 1992.
-
(1992)
Information Theory, IEEE Transactions on
, vol.38
, Issue.2
, pp. 587-607
-
-
Simoncelli, E.P.1
Freeman, W.T.2
Adelson, E.H.3
Heeger, D.J.4
-
15
-
-
0029487233
-
The steerable pyramid: A flexible architecture for multi-scale derivative computation
-
E. Simoncelli and W. Freeman, "The steerable pyramid: A flexible architecture for multi-scale derivative computation, " in Proc. ICIP, vol. 3, 1995, pp. 444-447.
-
(1995)
Proc. ICIP
, vol.3
, pp. 444-447
-
-
Simoncelli, E.1
Freeman, W.2
-
19
-
-
0003822743
-
-
Cambridge University Engineering Department
-
S. Young, G. Evermann, D. Kershaw, G. Moore, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, "The htk book, " Cambridge University Engineering Department, vol. 3, 2002.
-
(2002)
The Htk Book
, vol.3
-
-
Young, S.1
Evermann, G.2
Kershaw, D.3
Moore, G.4
Odell, J.5
Ollason, D.6
Valtchev, V.7
Woodland, P.8
-
20
-
-
84883097102
-
On the importance of various modulation frequencies for speech recognition
-
N. Kanedera, T. Arai, H. Hermansky, and M. Pavel, "On the importance of various modulation frequencies for speech recognition, " in Proc. Eurospeech, vol. 97, 1997, pp. 1079-1082.
-
(1997)
Proc. Eurospeech
, vol.97
, pp. 1079-1082
-
-
Kanedera, N.1
Arai, T.2
Hermansky, H.3
Pavel, M.4
-
21
-
-
33646064275
-
Multi-resolution rasta filtering for tandem-based asr
-
H. Hermansky and P. Fousek, "Multi-resolution rasta filtering for tandem-based asr, " in Proc. Inter Speech, 2005.
-
(2005)
Proc. Inter Speech
-
-
Hermansky, H.1
Fousek, P.2
-
22
-
-
70450182191
-
Tandem representations of spectral envelope and modulation frequency features for asr
-
S. Thomas, S. Ganapathy, and H. Hermansky, "Tandem representations of spectral envelope and modulation frequency features for asr, " in Proc. Inter Speech, 2009.
-
(2009)
Proc. Inter Speech
-
-
Thomas, S.1
Ganapathy, S.2
Hermansky, H.3
-
23
-
-
0034427366
-
Curvelets, multi resolution representation, and scaling laws
-
E. Candes and D. Donoho, "Curvelets, multiresolution representation, and scaling laws, " in Proc. SPIE, vol. 4119, no. 1, 2000.
-
(2000)
Proc. SPIE
, vol.4119
, Issue.1
-
-
Candes, E.1
Donoho, D.2
-
24
-
-
28944432472
-
The contourlet transform: An efficient directional multi resolution image representation
-
M. Do and M. Vetterli, "The contourlet transform: An efficient directional multi resolution image representation, " Image Processing, IEEE Transactions on, vol. 14, no. 12, pp. 2091-2106, 2005.
-
(2005)
Image Processing, IEEE Transactions on
, vol.14
, Issue.12
, pp. 2091-2106
-
-
Do, M.1
Vetterli, M.2
-
25
-
-
0030369274
-
Inclusion of temporal information into features for speech recognition
-
B. Milner, "Inclusion of temporal information into features for speech recognition, " in Proc. ICSLP, vol. 1, 1996, pp. 256-259.
-
(1996)
Proc. ICSLP
, vol.1
, pp. 256-259
-
-
Milner, B.1
|