-
2
-
-
0032139768
-
Should recognizers have ears?
-
PII S0167639398000272
-
H. Hermansky, "Should recognizers have ears?," Speech Commun., vol. 25, pp. 3-27, 1998. (Pubitemid 128413632)
-
(1998)
Speech Communication
, vol.25
, Issue.1-3
, pp. 3-27
-
-
Hermansky, H.1
-
3
-
-
0018455310
-
Suppression of acoustic noise in speech using spectral subtraction
-
S. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, no. 2, pp. 113-120, Apr. 1979. (Pubitemid 9467471)
-
(1979)
IEEE Trans Acoust Speech Signal Process
, vol.ASSP-27
, Issue.2
, pp. 113-120
-
-
Boll Steven, F.1
-
4
-
-
0019555090
-
Cepstral analysis technique for automatic speaker verification
-
S. Furui, "Cepstral analysis technique for automatic speaker verification," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-29, no. 2, pp. 254-272, Apr. 1981. (Pubitemid 11495877)
-
(1981)
IEEE Transactions on Acoustics, Speech, and Signal Processing
, vol.ASSP-29
, Issue.2
, pp. 254-272
-
-
Furui Sadaoki1
-
5
-
-
0030711157
-
Transcription of broadcast television and radio news: The 1996 ABBOT system
-
G. Cook, D. Kershaw, J. Christie, C. Seymour, and S. Waterhouse, "Transcription of broadcast television and radio news: The 1996 ABBOT system," Proc. Int. Acoustics Speech Signal Process., pp. 723-726, 1997.
-
(1997)
Proc. Int. Acoustics Speech Signal Process.
, pp. 723-726
-
-
Cook, G.1
Kershaw, D.2
Christie, J.3
Seymour, C.4
Waterhouse, S.5
-
6
-
-
42549139762
-
MVA processing of speech features
-
Jan
-
C. Chen and J. Bilmes, "MVA processing of speech features," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 257-270, Jan. 2007.
-
(2007)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.15
, Issue.1
, pp. 257-270
-
-
Chen, C.1
Bilmes, J.2
-
9
-
-
9644298434
-
-
Ph.D. dissertation, Oregon Graduate Inst. of Sci. Technol., Portland, OR
-
S. Sharma, "Multi-stream approach to robust speech recognition," Ph.D. dissertation, Oregon Graduate Inst. of Sci. Technol., Portland, OR, 1999.
-
(1999)
Multi-stream Approach to Robust Speech Recognition
-
-
Sharma, S.1
-
10
-
-
0141697346
-
-
Ph.D. dissertation, Lab. Intell. Artif. Perceptive, École Polytechnique Fédérale, Lausanne, Switzerland
-
A. Hagen, "Robust speech recognition based on multi-stream processing," Ph.D. dissertation, Lab. Intell. Artif. Perceptive, École Polytechnique Fédérale, Lausanne, Switzerland, 2001.
-
(2001)
Robust Speech Recognition Based on Multi-stream Processing
-
-
Hagen, A.1
-
11
-
-
73649085443
-
Multi-stream speech recognition based on Dempster-Shafer combination rule
-
F. Valente, "Multi-stream speech recognition based on Dempster-Shafer combination rule," Speech Commun., vol. 52, no. 3, pp. 213-222, 2010.
-
(2010)
Speech Commun.
, vol.52
, Issue.3
, pp. 213-222
-
-
Valente, F.1
-
12
-
-
70450216114
-
Multi-stream to many-stream: Using spectro-temporal features for ASR
-
R. S. and M. N.
-
S. Y. Zhao, R. S., and M. N., "Multi-stream to many-stream: Using spectro-temporal features for ASR," in Proc. INTERSPEECH, 2009, pp. 2951-2954.
-
(2009)
Proc. INTERSPEECH
, pp. 2951-2954
-
-
Zhao, S.Y.1
-
13
-
-
79959816304
-
A multistream multiresolution framework for phoneme recognition
-
N. Mesgarani, S. Thomas, and H. Hermansky, "A multistream multiresolution framework for phoneme recognition," in Proc. INTERSPEECH, 2010, pp. 318-321.
-
(2010)
Proc. INTERSPEECH
, pp. 318-321
-
-
Mesgarani, N.1
Thomas, S.2
Hermansky, H.3
-
14
-
-
0034825241
-
Multi-stream adaptive evidence combination for noise robust ASR
-
DOI 10.1016/S0167-6393(00)00044-3
-
A. Morris, A. Hagen, H. Glotin, and H. Bourlard, "Multi-stream adaptive evidence combination for noise robust ASR," Speech Commun., vol. 34, pp. 25-40, 2001. (Pubitemid 32874681)
-
(2001)
Speech Communication
, vol.34
, Issue.1-2
, pp. 25-40
-
-
Morris, A.1
Hagen, A.2
Glotin, H.3
Bourlard, H.4
-
15
-
-
45549100188
-
Speech analysis in a model of the central auditory system
-
Aug
-
J. Woojay and B. Juang, "Speech analysis in a model of the central auditory system," IEEE Trans. Speech Audio Process., vol. 15, no. 6, pp. 1802-1817, Aug. 2007.
-
(2007)
IEEE Trans. Speech Audio Process.
, vol.15
, Issue.6
, pp. 1802-1817
-
-
Woojay, J.1
Juang, B.2
-
16
-
-
84871848126
-
Spectro-temporal Gabor features as a front end for automatic speech recognition
-
M. Kleinschmidt, "Spectro-temporal Gabor features as a front end for automatic speech recognition," in Forum Acusticum, 2002.
-
(2002)
Forum Acusticum
-
-
Kleinschmidt, M.1
-
17
-
-
84865769808
-
Comparing different flavors of spectro-temporal features for ASR
-
B. Meyer, S. Ravuri, M. Schädler, and N. Morgan, "Comparing different flavors of spectro-temporal features for ASR," Proc. INTERSPEECH, vol. 1, pp. 1269-1272, 2011.
-
(2011)
Proc. INTERSPEECH
, vol.1
, pp. 1269-1272
-
-
Meyer, B.1
Ravuri, S.2
Schädler, M.3
Morgan, N.4
-
18
-
-
84867619222
-
Spectro-temporal Gabor features for speaker recognition
-
H. Lei, B. Meyer, and N. Mirghafori, "Spectro-temporal Gabor features for speaker recognition," in Proc. IEEE Conf. Acoust., Speech, Signal Process., 2012, pp. 4241-4244.
-
(2012)
Proc. IEEE Conf. Acoust., Speech, Signal Process.
, pp. 4241-4244
-
-
Lei, H.1
Meyer, B.2
Mirghafori, N.3
-
19
-
-
84865738978
-
Multistream bandpass modulation features for robust speech recognition
-
S. Nemala, K. Patil, and M. Elhilali, "Multistream bandpass modulation features for robust speech recognition," in Proc. ISCA, 2011, pp. 1277-1280.
-
(2011)
Proc. ISCA
, pp. 1277-1280
-
-
Nemala, S.1
Patil, K.2
Elhilali, M.3
-
20
-
-
0026626445
-
Auditory representations of acoustic signals
-
Mar
-
X. Yang, K. Wang, and S. A. Shamma, "Auditory representations of acoustic signals," IEEE Trans. Inf. Theory, vol. 38, no. 2, pp. 824-839, Mar. 1992.
-
(1992)
IEEE Trans. Inf. Theory
, vol.38
, Issue.2
, pp. 824-839
-
-
Yang, X.1
Wang, K.2
Shamma, S.A.3
-
21
-
-
23744508888
-
Multiresolution spectrotemporal analysis of complex sounds
-
DOI 10.1121/1.1945807
-
T. Chi, P. Ru, and S. Shamma, "Multiresolution spectrotemporal analysis of complex sounds," J. Acoust. Soc. Amer., vol. 118, pp. 887-906, 2005. (Pubitemid 41129224)
-
(2005)
Journal of the Acoustical Society of America
, vol.118
, Issue.2
, pp. 887-906
-
-
Chi, T.1
Ru, P.2
Shamma, S.A.3
-
22
-
-
34247487053
-
The cortical organization of speech processing
-
DOI 10.1038/nrn2113, PII NRN2113
-
G. Hickok and D. Poeppel, "The cortical organization of speech processing," Nature Rev. Neurosci., vol. 8, pp. 393-402, 2007. (Pubitemid 46652465)
-
(2007)
Nature Reviews Neuroscience
, vol.8
, Issue.5
, pp. 393-402
-
-
Hickok, G.1
Poeppel, D.2
-
23
-
-
0032142971
-
Cortical processing of complex sounds
-
DOI 10.1016/S0959-4388(98)80040-8
-
J. P. Rauschecker, "Cortical processing of complex sounds," Curr. Opin. Neurobiol., vol. 8, pp. 516-521, 1998. (Pubitemid 28431742)
-
(1998)
Current Opinion in Neurobiology
, vol.8
, Issue.4
, pp. 516-521
-
-
Rauschecker, J.P.1
-
24
-
-
0018564438
-
Temporal modulation transfer functions based upon modulation thresholds
-
N. F. Viemeister, "Temporal modulation transfer functions based upon modulation thresholds," J. Acoust. Soc. Amer., vol. 66, no. 5, pp. 1364-1380, Nov. 1979. (Pubitemid 10098323)
-
(1979)
Journal of the Acoustical Society of America
, vol.66
, Issue.5
, pp. 1364-1380
-
-
Viemeister, N.F.1
-
25
-
-
0039816305
-
-
Cambridge, MA: Plenum, ch. Frequency and the detection of spectral shape change
-
D. Green, "Frequency and the detection of spectral shape change," in Auditory Frequency Selectivity. Cambridge, MA: Plenum, 1986, pp. 351-359.
-
(1986)
Auditory Frequency Selectivity
, pp. 351-359
-
-
Green, D.1
-
26
-
-
0040290402
-
Spectrotemporal modulation transfer functions and speech intelligibility
-
T. Chi, Y. Gao, M. C. Guyton, P. Ru, and S. A. Shamma, "Spectrotemporal modulation transfer functions and speech intelligibility," J. Acoust. Soc. Amer., vol. 106, pp. 2719-2732, 1999.
-
(1999)
J. Acoust. Soc. Amer.
, vol.106
, pp. 2719-2732
-
-
Chi, T.1
Gao, Y.2
Guyton, M.C.3
Ru, P.4
Shamma, S.A.5
-
27
-
-
0027957839
-
Effect of temporal envelope smearing on speech reception
-
R. Drullman, J. Festen, and R. Plomp, "Effect of temporal envelope smearing on speech reception," J. Acoust. Soc. Amer., vol. 95, pp. 1053-1064, 1994. (Pubitemid 24056370)
-
(1994)
Journal of the Acoustical Society of America
, vol.95
, Issue.2
, pp. 1053-1064
-
-
Drullman, R.1
Festen, J.M.2
Plomp, R.3
-
28
-
-
0038711696
-
A spectro-temporal modulation index (STMI) for assessment of speech intelligibility
-
M. Elhilali, T. Chi, and S. A. Shamma, "A spectro-temporal modulation index (STMI) for assessment of speech intelligibility," Speech Commun., vol. 41, pp. 331-348, 2003.
-
(2003)
Speech Commun.
, vol.41
, pp. 331-348
-
-
Elhilali, M.1
Chi, T.2
Shamma, S.A.3
-
29
-
-
14044252930
-
Speech recognition with amplitude and frequency modulations
-
DOI 10.1073/pnas.0406460102
-
F.-G. Zeng, K. Nie, G. S. Stickney, Y.-Y. Kong, M. Vongphoe, A. Bhargave, C. Wei, and K. Cao, "Speech recognition with amplitude and frequency modulations," Proc. National Acad. Sci. USA, vol. 102, no. 7, pp. 2293-2298, Feb. 2005. (Pubitemid 40279369)
-
(2005)
Proceedings of the National Academy of Sciences of the United States of America
, vol.102
, Issue.7
, pp. 2293-2298
-
-
Zeng, F.-G.1
Nie, K.2
Stickney, G.S.3
Kong, Y.-Y.4
Vongphoe, M.5
Bhargave, A.6
Wei, C.7
Cao, K.8
-
30
-
-
63549114783
-
The modulation transfer function for speech intelligibility
-
T. Elliott and F. Theunissen, "The modulation transfer function for speech intelligibility," PLoS Comput. Biol., vol. 5, p. e1000302, 2009.
-
(2009)
PLoS Comput. Biol.
, vol.5
-
-
Elliott, T.1
Theunissen, F.2
-
31
-
-
0003548585
-
-
Philadelphia, PA: Linguistic Data Consortium
-
J. S. Garofolo, L. F. Lamel, W. M. Fisher, J. G. Fiscus, D. S. Pallett, and N. L. Dahlgren, DARPA TIMIT Acoustic Phonetic Continuous Speech Corpus. Philadelphia, PA: Linguistic Data Consortium, 1993, p. LDC93S1.
-
(1993)
DARPA TIMIT Acoustic Phonetic Continuous Speech Corpus
-
-
Garofolo, J.S.1
Lamel, L.F.2
Fisher, W.M.3
Fiscus, J.G.4
Pallett, D.S.5
Dahlgren, N.L.6
-
33
-
-
0024768209
-
Speaker-independent phone recognition using hidden Markov models
-
Nov
-
K. F. Lee and H. W. Hon, "Speaker-independent phone recognition using hidden Markov models," IEEE Trans. Acoust., Speech, Signal Process., vol. 37, no. 11, pp. 1641-1648, Nov. 1989.
-
(1989)
IEEE Trans. Acoust., Speech, Signal Process.
, vol.37
, Issue.11
, pp. 1641-1648
-
-
Lee, K.F.1
Hon, H.W.2
-
34
-
-
77957731201
-
Data-driven and feedback based spectro-temporal features for speech recognition
-
Nov.
-
S. Garimella, S. Nemala, N. Mesgarani, and H. Hermansky, "Data-driven and feedback based spectro-temporal features for speech recognition," IEEE Signal Process. Lett., vol. 17, no. 11, pp. 957-960, Nov. 2010.
-
(2010)
IEEE Signal Process. Lett.
, vol.17
, Issue.11
, pp. 957-960
-
-
Garimella, S.1
Nemala, S.2
Mesgarani, N.3
Hermansky, H.4
-
35
-
-
11144222882
-
Comparison and combination of features in a hybrid HMM/MLP and a HMM/GMM speech recognition system
-
DOI 10.1109/TSA.2004.834466
-
P. Pujol, S. Pol, C. Nadeu, A. Hagen, and H. Bourlard, "Comparison and combination of features in a hybrid HMM/MLP and a HMM/GMM speech recognition system," IEEE Trans. Speech Audio Process., vol. 13, no. 1, pp. 14-22, Jan. 2005. (Pubitemid 40049936)
-
(2005)
IEEE Transactions on Speech and Audio Processing
, vol.13
, Issue.1
, pp. 14-22
-
-
Pujol, P.1
Pol, S.2
Nadeu, C.3
Hagen, A.4
Bourlard, H.5
-
36
-
-
0001595997
-
Neural network classifiers estimate Bayesian a posteriori probabilities
-
M. Richard and R. Lippmann, "Neural network classifiers estimate Bayesian a posteriori probabilities," Neural Computation, vol. 3, no. 4, pp. 461-483, 1991.
-
(1991)
Neural Computation
, vol.3
, Issue.4
, pp. 461-483
-
-
Richard, M.1
Lippmann, R.2
-
37
-
-
78049251448
-
Analyzing MLP-based hierarchical phoneme posterior probability estimator
-
J. Pinto, S. Garimella, M. Magimai-Doss, H. Hermansky, and H. Bourlard, "Analyzing MLP-based hierarchical phoneme posterior probability estimator," IEEE Trans. Speech and Audio Process., vol. 19, pp. 225-241, 2011.
-
(2011)
IEEE Trans. Speech and Audio Process.
, vol.19
, pp. 225-241
-
-
Pinto, J.1
Garimella, S.2
Magimai-Doss, M.3
Hermansky, H.4
Bourlard, H.5
-
38
-
-
70450144093
-
-
Ph.D. dissertation, UC Berkeley, Berkeley, CA
-
D. Gelbart, "Ensemble feature selection for multi-stream automatic speech recognition," Ph.D. dissertation, UC Berkeley, Berkeley, CA, 2008.
-
(2008)
Ensemble Feature Selection for Multi-stream Automatic Speech Recognition
-
-
Gelbart, D.1
-
39
-
-
0004319968
-
-
Defense Research Agency, Malvern, U.K., Tech. Rep.
-
A. Varga, H. Steeneken, M. Tomlinson, and D. Jones, "The Noisex-92 study on the effect of additive noise on automatic speech recognition," Speech Research Unit, Defense Research Agency, Malvern, U.K., Tech. Rep., 1992.
-
(1992)
The Noisex-92 Study on the Effect of Additive Noise on Automatic Speech Recognition
-
-
Varga, A.1
Steeneken, H.2
Tomlinson, M.3
Jones, D.4
-
40
-
-
79551573428
-
-
[Online]. Available date last viewed 11/25/2011
-
H. Hirsch, FaNT: Filtering and Noise Adding Tool, 2005. [Online]. Available: http://dnt.kr.hsnr.de/download.html (date last viewed 11/25/2011).
-
(2005)
FaNT: Filtering and Noise Adding Tool
-
-
Hirsch, H.1
-
42
-
-
84871840545
-
-
Philadelphia, PA: Linguistic Data Consortium
-
D. Reynolds, HTIMIT. Philadelphia, PA: Linguistic Data Consortium, 1998, p. LDC98S67.
-
(1998)
HTIMIT
-
-
Reynolds, D.1
-
44
-
-
0028517164
-
RASTA processing of speech
-
Oct
-
H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 382-395, Oct. 1994.
-
(1994)
IEEE Trans. Speech Audio Process.
, vol.2
, Issue.4
, pp. 382-395
-
-
Hermansky, H.1
Morgan, N.2
-
45
-
-
79952171347
-
Temporal envelope compensation for robust phoneme recognition using modulation spectrum
-
S. Ganapathy, S. Thomas, and H. Hermansky, "Temporal envelope compensation for robust phoneme recognition using modulation spectrum," J. Acoust. Soc. Amer., vol. 128, pp. 3769-3780, 2010.
-
(2010)
J. Acoust. Soc. Amer.
, vol.128
, pp. 3769-3780
-
-
Ganapathy, S.1
Thomas, S.2
Hermansky, H.3
-
46
-
-
0037828299
-
Effects of simulated cochlear-implant processing on speech reception in fluctuating maskers
-
DOI 10.1121/1.1579009
-
M. K. Qin and A. J. Oxenham, "Effects of simulated cochlear-implant processing on speech reception in fluctuating maskers," J. Acoust. Soc. Amer., vol. 114, no. 1, pp. 446-454, Jul. 2003. (Pubitemid 36835514)
-
(2003)
Journal of the Acoustical Society of America
, vol.114
, Issue.1
, pp. 446-454
-
-
Qin, M.K.1
Oxenham, A.J.2
-
47
-
-
84871829391
-
Two time scales in speech processing
-
New York
-
M. Chait, S. Greenberg, T. Arai, J. Simon, and D. Poeppel, "Two time scales in speech processing," in Proc. Annu. Meeting Cognitive Neurosci. Soc., New York, 2005.
-
(2005)
Proc. Annu. Meeting Cognitive Neurosci. Soc.
-
-
Chait, M.1
Greenberg, S.2
Arai, T.3
Simon, J.4
Poeppel, D.5
-
48
-
-
79960669709
-
Toward optimizing stream fusion in multistream recognition of speech
-
N. Mesgarani, S. Thomas, and H. Hermansky, "Toward optimizing stream fusion in multistream recognition of speech," J. Acoust. Soc. Amer., vol. 130, pp. 14-18, 2011.
-
(2011)
J. Acoust. Soc. Amer.
, vol.130
, pp. 14-18
-
-
Mesgarani, N.1
Thomas, S.2
Hermansky, H.3
-
49
-
-
84878421888
-
Robust phoneme recognition based on biomimetic speech contours
-
INTERSPEECH
-
M. Carlin, K. Patil, S. Nemala, and M. Elhilali, "Robust phoneme recognition based on biomimetic speech contours," in Proc. 13th Annu. Conf. Int. Speech Commun. Assoc. (INTERSPEECH), 2012.
-
(2012)
Proc. 13th Annu. Conf. Int. Speech Commun. Assoc.
-
-
Carlin, M.1
Patil, K.2
Nemala, S.3
Elhilali, M.4
-
50
-
-
0033709098
-
Tandem connectionist feature extraction for conventional HMM systems
-
H. Hermansky, D. P. Ellis, and S. Sharma, "Tandem connectionist feature extraction for conventional HMM systems," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2000, pp. 1635-1638.
-
(2000)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process.
, pp. 1635-1638
-
-
Hermansky, H.1
Ellis, D.P.2
Sharma, S.3