-
1
-
-
0027167185
-
A dynamic cepstrum incorporating time-frequency masking and its application to continuous speech recognition
-
Minneapolis, MN
-
Aikawa, K., Singer, H., Kawahara, H., Tohkura, Y., 1993. A dynamic cepstrum incorporating time-frequency masking and its application to continuous speech recognition. In: Proceedings of the International Conference on Acoust. Speech and Signal Processing, Minneapolis, MN, pp. II-668-671.
-
(1993)
Proceedings of the International Conference on Acoust. Speech and Signal Processing
-
-
Aikawa, K.1
Singer, H.2
Kawahara, H.3
Tohkura, Y.4
-
2
-
-
0028516073
-
How do humans process and recognize speech?
-
Allen, J.B., 1994. How do humans process and recognize speech?. IEEE Trans. Speech Audio Process. 2 (4), 567-577.
-
(1994)
IEEE Trans. Speech Audio Process.
, vol.2
, Issue.4
, pp. 567-577
-
-
Allen, J.B.1
-
3
-
-
0030369532
-
Intelligibility of speech with filtered time trajectories of spectral envelopes
-
Philadelphia
-
Arai, T.M., Pavel, H.H., Avendano, C., 1996. Intelligibility of speech with filtered time trajectories of spectral envelopes. In: Proceedings of the International Conference on Spoken Language Processing, Philadelphia, pp. 2490-2493.
-
(1996)
Proceedings of the International Conference on Spoken Language Processing
, pp. 2490-2493
-
-
Arai, T.M.1
Pavel, H.H.2
Avendano, C.3
-
4
-
-
84898992685
-
Coding of naturalistic stimuli by auditory midbrain neurons
-
Morgan Kaufmann, Los Altos, CA.
-
Attias, H., Schreiner, C.E., 1998. Coding of naturalistic stimuli by auditory midbrain neurons. In: Advances in Neural Information Processing Systems, Vol. 10. Morgan Kaufmann, Los Altos, CA.
-
(1998)
Advances in Neural Information Processing Systems
, vol.10
-
-
Attias, H.1
Schreiner, C.E.2
-
6
-
-
0031347666
-
On the properties of temporal processing for speech in adverse environments
-
Mohonk Mountain House, New Paltz, New York
-
Avendano, C., Hermansky, H., 1997. On the properties of temporal processing for speech in adverse environments. In: Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics, Mohonk Mountain House, New Paltz, New York.
-
(1997)
Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics
-
-
Avendano, C.1
Hermansky, H.2
-
7
-
-
0020932333
-
Two-formant models of vowel perception: Shortcomings and enhancements
-
Bladon, A., 1983. Two-formant models of vowel perception: Shortcomings and enhancements. Speech Communication 2, 305-313.
-
(1983)
Speech Communication
, vol.2
, pp. 305-313
-
-
Bladon, A.1
-
9
-
-
0004988601
-
Copernicus and ASR challenge: Waiting for Kepler
-
Arden House, NY
-
Bourlard, H., Hermansky, H., Morgan, N., 1996. Copernicus and ASR challenge: Waiting for Kepler. In: Proceedings of the ARPA ASR Workshop Spring 1996, Arden House, NY, pp. 157-162.
-
(1996)
Proceedings of the ARPA ASR Workshop Spring 1996
, pp. 157-162
-
-
Bourlard, H.1
Hermansky, H.2
Morgan, N.3
-
11
-
-
0347387977
-
An experimental automatic word recognition system
-
Joint Speech Research Unit, Ruislip, England
-
Bridle, J.S., Brown, M.D., 1974. An experimental automatic word recognition system. JSRU Report No. 1003, Joint Speech Research Unit, Ruislip, England.
-
(1974)
JSRU Report No. 1003
, vol.1003
-
-
Bridle, J.S.1
Brown, M.D.2
-
12
-
-
25044464569
-
The front cavity/F2' hypothesis tested by data on tongue movements
-
Broad, D., Hermansky, H., 1989. The front cavity/F2' hypothesis tested by data on tongue movements. J. Acoust. Soc. Amer. 86 (Suppl. 1), S13-S14.
-
(1989)
J. Acoust. Soc. Amer.
, vol.86
, Issue.1 SUPPL.
-
-
Broad, D.1
Hermansky, H.2
-
14
-
-
0024392496
-
Application of an auditory model to speech recognition
-
Cohen, J.R., 1989. Application of an auditory model to speech recognition. J. Acoust. Soc. Amer. 85 (6), 2623-2629.
-
(1989)
J. Acoust. Soc. Amer.
, vol.85
, Issue.6
, pp. 2623-2629
-
-
Cohen, J.R.1
-
15
-
-
84955042239
-
Some experiments on the perception of synthetic speech sounds
-
Cooper, F.S., Delattre, P.C., Liberman, A.M., Borst, J.M., Gerstman, L.J., 1952. Some experiments on the perception of synthetic speech sounds. J. Acoust. Soc. Amer. 24, 579-606.
-
(1952)
J. Acoust. Soc. Amer.
, vol.24
, pp. 579-606
-
-
Cooper, F.S.1
Delattre, P.C.2
Liberman, A.M.3
Borst, J.M.4
Gerstman, L.J.5
-
16
-
-
0029725367
-
Real-time recognition of broadcast radio speech
-
Cook, G.D., Christie, J.D., Clarkson, P.R., Hochberg, M.M., Logan, B.T., Robinson, A.J., 1996. Real-time recognition of broadcast radio speech. In: Proceedings of the International Conference on Acoust. Speech and Signal Processing, pp. 141-144.
-
(1996)
Proceedings of the International Conference on Acoust. Speech and Signal Processing
, pp. 141-144
-
-
Cook, G.D.1
Christie, J.D.2
Clarkson, P.R.3
Hochberg, M.M.4
Logan, B.T.5
Robinson, A.J.6
-
17
-
-
0021906779
-
Central auditory processing of peripheral vowel spectra
-
Chistovich, L.A., 1985. Central auditory processing of peripheral vowel spectra. J. Acoust. Soc. Amer. 77, 789-805.
-
(1985)
J. Acoust. Soc. Amer.
, vol.77
, pp. 789-805
-
-
Chistovich, L.A.1
-
18
-
-
0019053271
-
Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
-
Davis, S.B., Mermelstein, P., 1980. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans. Acoust. Speech Signal Process. 28 (4), 357-366.
-
(1980)
IEEE Trans. Acoust. Speech Signal Process.
, vol.28
, Issue.4
, pp. 357-366
-
-
Davis, S.B.1
Mermelstein, P.2
-
19
-
-
0347387973
-
Sound feature decomposition by the primary auditory cortex
-
Breckenridge, Colorado (submitted to Science, also unpublished technical memo)
-
deCharms, C.R., Blake, D., Merzenich, M.M., 1997. Sound feature decomposition by the primary auditory cortex. In: 1997 Workshop on Advances in Neural Information Processing, Breckenridge, Colorado (submitted to Science, also unpublished technical memo).
-
(1997)
1997 Workshop on Advances in Neural Information Processing
-
-
DeCharms, C.R.1
Blake, D.2
Merzenich, M.M.3
-
20
-
-
0027957839
-
Effect of temporal envelope smearing on speech reception
-
Drullman, R., Festen, J.M., Plomp, R., 1994. Effect of temporal envelope smearing on speech reception. J. Acoust. Soc. Amer. 95, 1053-1064.
-
(1994)
J. Acoust. Soc. Amer.
, vol.95
, pp. 1053-1064
-
-
Drullman, R.1
Festen, J.M.2
Plomp, R.3
-
21
-
-
0028287770
-
Effect of reducing slow temporal modulations on speech reception
-
Drullman, R., Festen, J.M., Plomp, R., 1994. Effect of reducing slow temporal modulations on speech reception. J. Acoust. Soc. Amer. 95, 2670-2680.
-
(1994)
J. Acoust. Soc. Amer.
, vol.95
, pp. 2670-2680
-
-
Drullman, R.1
Festen, J.M.2
Plomp, R.3
-
22
-
-
84885525095
-
Auditory matching of vowels with two formant synthetic sounds
-
Speech Transmission Laboratory, Royal Institute of Technology, Stockholm
-
Fant, G., Risberg, A., 1962. Auditory matching of vowels with two formant synthetic sounds. Quarterly Progress and Status Report 4, Speech Transmission Laboratory, Royal Institute of Technology, Stockholm.
-
(1962)
Quarterly Progress and Status Report
, vol.4
-
-
Fant, G.1
Risberg, A.2
-
23
-
-
0038747568
-
Acoustic description and classification of phonetic units
-
Fant, G., 1965. Acoustic description and classification of phonetic units. Ericsson Technics, No. 1, reprinted in: Fant, G., 1973. Speech Sounds and Features. MIT Press, Cambridge, MA.
-
(1965)
Ericsson Technics
, vol.1
-
-
Fant, G.1
-
24
-
-
0004110342
-
-
reprinted MIT Press, Cambridge, MA.
-
Fant, G., 1965. Acoustic description and classification of phonetic units. Ericsson Technics, No. 1, reprinted in: Fant, G., 1973. Speech Sounds and Features. MIT Press, Cambridge, MA.
-
(1973)
Speech Sounds and Features
-
-
Fant, G.1
-
27
-
-
0014113409
-
On the second spectral peak of front vowels: A perceptual study of the role of the second and third formants
-
Fujimura, O., 1964. On the second spectral peak of front vowels: A perceptual study of the role of the second and third formants. Language and Speech 10, 181-193.
-
(1964)
Language and Speech
, vol.10
, pp. 181-193
-
-
Fujimura, O.1
-
28
-
-
0019555090
-
Cepstral analysis technique for automatic speaker verification
-
Furui, S., 1981. Cepstral analysis technique for automatic speaker verification. IEEE Trans. Acoust. Speech Signal Process. 29, 254-272.
-
(1981)
IEEE Trans. Acoust. Speech Signal Process.
, vol.29
, pp. 254-272
-
-
Furui, S.1
-
29
-
-
0001942829
-
Neural networks and the bias/variance dilemma
-
Geman, S., Bienenstock, E., Doursat, R., 1992. Neural networks and the bias/variance dilemma. Neural Computation 4 (1), 1-58.
-
(1992)
Neural Computation
, vol.4
, Issue.1
, pp. 1-58
-
-
Geman, S.1
Bienenstock, E.2
Doursat, R.3
-
30
-
-
0028996921
-
Auditory scene analysis and hidden Markov model recognition of speech in noise
-
Detroit, MI
-
Green, P.D., Cooke, M.P., Crawford, M.D., 1995. Auditory scene analysis and hidden Markov model recognition of speech in noise. In: Proceedings of International Conference on Acoust. Speech and Signal Processing, Detroit, MI, pp. 401-404.
-
(1995)
Proceedings of International Conference on Acoust. Speech and Signal Processing
, pp. 401-404
-
-
Green, P.D.1
Cooke, M.P.2
Crawford, M.D.3
-
32
-
-
0141629798
-
Spectral dynamics for speech recognition under adverse conditions
-
Lee, C.H., Soong, F.K., Paliwal, K.K. (Eds.), Kluwer Academic Publishers, Dordrecht
-
Hanson, B.A., Applebaum, T.H., Junqua, J.C., 1996. Spectral dynamics for speech recognition under adverse conditions. In: Lee, C.H., Soong, F.K., Paliwal, K.K. (Eds.), Automatic Speech and Speaker Recognition. Kluwer Academic Publishers, Dordrecht.
-
(1996)
Automatic Speech and Speaker Recognition
-
-
Hanson, B.A.1
Applebaum, T.H.2
Junqua, J.C.3
-
33
-
-
0021122763
-
The harmonic magnitude suppression (HMS) technique for intelligibility enhancement in the presence in interfering speech
-
Hanson, B., Wong, D., 1984. The harmonic magnitude suppression (HMS) technique for intelligibility enhancement in the presence in interfering speech. In: Proceedings of International Conference on Acoust. Speech and Signal Processing, pp. 18.A.5.1-18.A.5.4.
-
(1984)
Proceedings of International Conference on Acoust. Speech and Signal Processing
-
-
Hanson, B.1
Wong, D.2
-
35
-
-
0025041264
-
Perceptual linear predictive (PLP) analysis of speech
-
Hermansky, H., 1990. Perceptual linear predictive (PLP) analysis of speech. J. Acoust. Soc. Amer. 87 (4), 1738-1752.
-
(1990)
J. Acoust. Soc. Amer.
, vol.87
, Issue.4
, pp. 1738-1752
-
-
Hermansky, H.1
-
36
-
-
0348018654
-
Exploring temporal domain for robustness in speech recognition
-
Trondheim, Norway
-
Hermansky, H., 1995. Exploring temporal domain for robustness in speech recognition. In: Proceedings of the 15th International Congress on Acoustics, Vol. II, Trondheim, Norway, pp. 61-64.
-
(1995)
Proceedings of the 15th International Congress on Acoustics
, vol.2
, pp. 61-64
-
-
Hermansky, H.1
-
37
-
-
0030365517
-
Towards ASR on partially corrupted speech
-
Philadelphia, PA
-
Hermansky, H., Tibrewala, S., Pavel, M., 1996. Towards ASR on partially corrupted speech. In: Proceedings of International Conference on Spoken Language Processing, Philadelphia, PA, pp. 462-465.
-
(1996)
Proceedings of International Conference on Spoken Language Processing
, pp. 462-465
-
-
Hermansky, H.1
Tibrewala, S.2
Pavel, M.3
-
38
-
-
0020542318
-
Analysis and synthesis of speech based on spectral transform linear predictive method
-
Boston, MA
-
Hermansky, H., Fujisaki, H., Sato, Y., 1983. Analysis and synthesis of speech based on spectral transform linear predictive method. In: Proceedings of International Conference on Acoust. Speech and Signal Processing, Boston, MA, pp. 777-780.
-
(1983)
Proceedings of International Conference on Acoust. Speech and Signal Processing
, pp. 777-780
-
-
Hermansky, H.1
Fujisaki, H.2
Sato, Y.3
-
39
-
-
0028517164
-
RASTA processing of speech
-
Hermansky, H., Morgan, N., 1994. RASTA processing of speech. IEEE Trans. Speech Audio Process. 2 (4), 578-589.
-
(1994)
IEEE Trans. Speech Audio Process.
, vol.2
, Issue.4
, pp. 578-589
-
-
Hermansky, H.1
Morgan, N.2
-
40
-
-
0028996922
-
Speech enhancement based on temporal processing
-
Detroit, MI
-
Hermansky, H., Wan, E., Avendano, C., 1995. Speech enhancement based on temporal processing. In: Proceedings of International Conference on Acoust. Speech and Signal Processing, Detroit, MI, pp. 405-408.
-
(1995)
Proceedings of International Conference on Acoust. Speech and Signal Processing
, pp. 405-408
-
-
Hermansky, H.1
Wan, E.2
Avendano, C.3
-
41
-
-
0024879199
-
The effective second formant F2' and the vocal tract front cavity
-
Glasgow, Scotland
-
Hermansky, H., Broad, D., 1989. The effective second formant F2' and the vocal tract front cavity. In: Proceedings of International Conference on Acoust. Speech and Signal Processing, Glasgow, Scotland, pp. 480-483.
-
(1989)
Proceedings of International Conference on Acoust. Speech and Signal Processing
, pp. 480-483
-
-
Hermansky, H.1
Broad, D.2
-
42
-
-
85135377175
-
Compensation for the effect of the communication channel in auditory-like analysis of speech (RASTA-PLP)
-
Genova, Italy
-
Hermansky, H., Morgan, N., Bayya, A., Kohn, P., 1991. Compensation for the effect of the communication channel in auditory-like analysis of speech (RASTA-PLP). In: Proceedings of Eurospeech'91, Genova, Italy, pp. 1367-1371.
-
(1991)
Proceedings of Eurospeech'91
, pp. 1367-1371
-
-
Hermansky, H.1
Morgan, N.2
Bayya, A.3
Kohn, P.4
-
43
-
-
0347387953
-
Psychophysics of speech engineering systems
-
Invited paper, Stockholm, Sweden
-
Hermansky, H., Pavel, M., 1995. Psychophysics of speech engineering systems. Invited paper, 13th International Congress on Phonetic Sciences, Stockholm, Sweden, pp. 42-49.
-
(1995)
13th International Congress on Phonetic Sciences
, pp. 42-49
-
-
Hermansky, H.1
Pavel, M.2
-
44
-
-
3543081154
-
Modulation spectrum in speech processing
-
Prochazka, A., Uhlir, J., Rayner, P.J.W., Kingsbury, N.G. (Eds.), Birkhauser, Boston
-
Hermansky, H., 1988. Modulation spectrum in speech processing. In: Prochazka, A., Uhlir, J., Rayner, P.J.W., Kingsbury, N.G. (Eds.), Signal Analysis and Prediction. Birkhauser, Boston.
-
(1988)
Signal Analysis and Prediction
-
-
Hermansky, H.1
-
45
-
-
0011823639
-
Improved speech recognition using high-pass filtering of subband envelopes
-
Genova, Italy
-
Hirsch, H.G., Meyer, P., Ruehl, H., 1991. Improved speech recognition using high-pass filtering of subband envelopes. In: Proceedings of Eurospeech'91, Genova, Italy, pp. 413-416.
-
(1991)
Proceedings of Eurospeech'91
, pp. 413-416
-
-
Hirsch, H.G.1
Meyer, P.2
Ruehl, H.3
-
46
-
-
84873312246
-
A review of the MTF concept in room acoustics and its use for estimating speech intelligibility in auditoria
-
Houtgast, T., Steeneken, H.J.M., 1985. A review of the MTF concept in room acoustics and its use for estimating speech intelligibility in auditoria. J. Acoust. Soc. Amer. 77 (3), 1069-1077.
-
(1985)
J. Acoust. Soc. Amer.
, vol.77
, Issue.3
, pp. 1069-1077
-
-
Houtgast, T.1
Steeneken, H.J.M.2
-
47
-
-
0038133932
-
A statistical approach to metrics for word and syllable recognition
-
Hunt, M.J., 1979. A statistical approach to metrics for word and syllable recognition. J. Acoust. Soc. Amer. 66 (S1), S35(A).
-
(1979)
J. Acoust. Soc. Amer.
, vol.66
, Issue.S1
-
-
Hunt, M.J.1
-
48
-
-
0024905238
-
A comparison of several acoustic representations for speech recognition with degraded and undegraded speech
-
Glasgow, Scotland
-
Hunt, M., Lefebvre, C., 1989. A comparison of several acoustic representations for speech recognition with degraded and undegraded speech. In: Proceedings of International Conference on Acoust. Speech and Signal Processing, Glasgow, Scotland, pp. 262-265.
-
(1989)
Proceedings of International Conference on Acoust. Speech and Signal Processing
, pp. 262-265
-
-
Hunt, M.1
Lefebvre, C.2
-
49
-
-
0037662539
-
Automatic formant extraction utilizing mel scale and equal loudness contour
-
Philadelphia, PA
-
Itahashi, S., Yokoyama, S., 1976. Automatic formant extraction utilizing mel scale and equal loudness contour. In: Proceedings of International Conference on Acoust. Speech and Signal Processing, Philadelphia, PA, pp. 310-313.
-
(1976)
Proceedings of International Conference on Acoust. Speech and Signal Processing
, pp. 310-313
-
-
Itahashi, S.1
Yokoyama, S.2
-
50
-
-
0026626584
-
Speaker independent phonetic classification in continuous English letters
-
Seattle, WA
-
Janseen, R.D.T., Fanty, M., Cole, R.A., 1991. Speaker independent phonetic classification in continuous English letters. In: Proceedings of International Joint Conference on Neural Networks, Seattle, WA, pp. II-801-808.
-
(1991)
Proceedings of International Joint Conference on Neural Networks
-
-
Janseen, R.D.T.1
Fanty, M.2
Cole, R.A.3
-
51
-
-
0020117798
-
Forward masking as a function of frequency, masker level, and signal delay
-
Jestead, W., Bacon, S.P., Lehman, J.R., 1982. Forward masking as a function of frequency, masker level, and signal delay. J. Acoust. Soc. Amer. 950-962.
-
(1982)
J. Acoust. Soc. Amer.
, pp. 950-962
-
-
Jestead, W.1
Bacon, S.P.2
Lehman, J.R.3
-
52
-
-
84883097102
-
On the importance of various modulation frequencies for speech recognition
-
Rhodos, Greece
-
Kanedera, N., Arai, T., Hermansky, H., Pavel, M., 1997. On the importance of various modulation frequencies for speech recognition. In: Proceedings of Eurospeech'97, Rhodos, Greece, pp. 1079-1082.
-
(1997)
Proceedings of Eurospeech'97
, pp. 1079-1082
-
-
Kanedera, N.1
Arai, T.2
Hermansky, H.3
Pavel, M.4
-
53
-
-
0346126997
-
-
Submitted to Speech Communication
-
Kanedera, N., Arai, T., Hermansky, H., Pavel, M., 1997. On the relative importance of various components of the modulation spectrum for automatic speech recognition. Submitted to Speech Communication.
-
(1997)
On the Relative Importance of Various Components of the Modulation Spectrum for Automatic Speech Recognition
-
-
Kanedera, N.1
Arai, T.2
Hermansky, H.3
Pavel, M.4
-
54
-
-
0001490199
-
Speech processing strategies based on auditory models
-
Carlson, R., Granstrom, B. (Eds.), Elsevier Biomedical Press, New York
-
Klatt, D.H., 1982. Speech processing strategies based on auditory models. In: Carlson, R., Granstrom, B. (Eds.), The Representation of Speech in The Peripheral Auditory System. Elsevier Biomedical Press, New York, pp. 181-202.
-
(1982)
The Representation of Speech in the Peripheral Auditory System
, pp. 181-202
-
-
Klatt, D.H.1
-
57
-
-
0018478297
-
Spectral root homomorphic deconvolution system
-
Lim, J.S., 1979. Spectral root homomorphic deconvolution system. IEEE Trans. Acoust. Speech Signal Process. 27 (3), 223-233.
-
(1979)
IEEE Trans. Acoust. Speech Signal Process.
, vol.27
, Issue.3
, pp. 223-233
-
-
Lim, J.S.1
-
58
-
-
0029754956
-
Accurate consonant perception without mid-frequency speech energy
-
Lippmann, R.P., 1995. Accurate consonant perception without mid-frequency speech energy. IEEE Trans. Speech and Audio 4 (1), 66-69.
-
(1995)
IEEE Trans. Speech and Audio
, vol.4
, Issue.1
, pp. 66-69
-
-
Lippmann, R.P.1
-
59
-
-
0020544161
-
Recognition of consonant based on the perceptron model
-
Boston, MA
-
Makino, S., Kawabata, T., Kido, K., 1983. Recognition of consonant based on the perceptron model. In: Proceedings of International Conference on Acoust. Speech and Signal Processing, Boston, MA, pp. 738-741.
-
(1983)
Proceedings of International Conference on Acoust. Speech and Signal Processing
, pp. 738-741
-
-
Makino, S.1
Kawabata, T.2
Kido, K.3
-
60
-
-
33646771442
-
Towards decomposing the sources of variability in speech
-
Rhodos, Greece
-
Malayath, N., Hermansky, H., Kain, A., 1997. Towards decomposing the sources of variability in speech. In: Proceedings of Eurospeech'97, Rhodos, Greece.
-
(1997)
Proceedings of Eurospeech'97
-
-
Malayath, N.1
Hermansky, H.2
Kain, A.3
-
61
-
-
0003834557
-
-
Freeman, San Francisco, CA.
-
Marr, D., 1982. Vision. Freeman, San Francisco, CA.
-
(1982)
Vision
-
-
Marr, D.1
-
62
-
-
0038133939
-
Distance measures for speech recognition, psychological and instrumental
-
Chen, R.C.H. (Ed.), Academic Press, New York
-
Mermelstein, P., 1976. Distance measures for speech recognition, psychological and instrumental. In: Chen, R.C.H. (Ed.), Pattern Recognition and Artificial Intelligence. Academic Press, New York, pp. 374-388.
-
(1976)
Pattern Recognition and Artificial Intelligence
, pp. 374-388
-
-
Mermelstein, P.1
-
63
-
-
0002127129
-
Probabilistic optimum filtering for robust speech recognition
-
Adelaide, Australia
-
Neumayer, L., Weintraub, M., 1994. Probabilistic optimum filtering for robust speech recognition. In: Proceedings of International Conference on Acoust. Speech and Signal Processing, Adelaide, Australia, pp. I-417-420.
-
(1994)
Proceedings of International Conference on Acoust. Speech and Signal Processing
-
-
Neumayer, L.1
Weintraub, M.2
-
65
-
-
0007636578
-
Temporal masking in automatic speech recognition
-
Pavel, M., Hermansky, H., 1994. Temporal masking in automatic speech recognition. J. Acoust. Soc. Amer. A 95, 2876.
-
(1994)
J. Acoust. Soc. Amer. A
, vol.95
, pp. 2876
-
-
Pavel, M.1
Hermansky, H.2
-
66
-
-
0015129120
-
Real-time recognition of spoken words
-
Pols, L.C.W., 1971. Real-time recognition of spoken words. IEEE Trans. Comput. 20 (C) 972-978.
-
(1971)
IEEE Trans. Comput.
, vol.20
, Issue.C
, pp. 972-978
-
-
Pols, L.C.W.1
-
67
-
-
84881675408
-
Cepstral channel normalization techniques for HMM-based speaker verification
-
Yokohama, Japan
-
Rosenberg, A.E., Lee, C., Soong, F.K., 1994. Cepstral channel normalization techniques for HMM-based speaker verification. In: Proceedings of International Conference on Spoken Language Processing, Yokohama, Japan, pp. 1835-1838.
-
(1994)
Proceedings of International Conference on Spoken Language Processing
, pp. 1835-1838
-
-
Rosenberg, A.E.1
Lee, C.2
Soong, F.K.3
-
68
-
-
84928837806
-
A joint synchrony/mean-rate model of auditory speech processing
-
Seneff, S., 1985. A joint synchrony/mean-rate model of auditory speech processing. J. Phonetics 16 (1), 55-76.
-
(1985)
J. Phonetics
, vol.16
, Issue.1
, pp. 55-76
-
-
Seneff, S.1
-
69
-
-
0011405405
-
Brightness and loudness as functions of stimulus duration
-
Stevens, J.C., Hall, J.W., 1966. Brightness and loudness as functions of stimulus duration. Perception and Psychophysics 1, 319-327.
-
(1966)
Perception and Psychophysics
, vol.1
, pp. 319-327
-
-
Stevens, J.C.1
Hall, J.W.2
-
70
-
-
0002220140
-
Applying phonetic knowledge to lexical access
-
Madrid, Spain
-
Stevens, K.N., 1996. Applying phonetic knowledge to lexical access. In: Proceedings of Eurospeech'95, Madrid, Spain, p. 3.
-
(1996)
Proceedings of Eurospeech'95
, pp. 3
-
-
Stevens, K.N.1
-
71
-
-
85135190755
-
Multi-band and adaptation approaches to robust speech recognition
-
Rhodos, Greece
-
Tibrewala, S., Hermansky, H., 1997. Multi-band and adaptation approaches to robust speech recognition. In: Proceedings of Eurospeech'97, Rhodos, Greece, pp. 2619-2622.
-
(1997)
Proceedings of Eurospeech'97
, pp. 2619-2622
-
-
Tibrewala, S.1
Hermansky, H.2
-
72
-
-
84947590142
-
Data-driven design of RASTA-like filters
-
Rhodos, Greece
-
van Vuuren, S., Hermansky, H., 1997. Data-driven design of RASTA-like filters. In: Proceedings of Eurospeech'97, Rhodos, Greece, pp. 409-412.
-
(1997)
Proceedings of Eurospeech'97
, pp. 409-412
-
-
Van Vuuren, S.1
Hermansky, H.2
-
73
-
-
0023833469
-
Phoneme recognition using time-delay neural networks
-
New York
-
Waibel, A., Hanazawa, T., Hinton, G., Shikano, K., Lang, K., 1988. Phoneme recognition using time-delay neural networks, Proceedings of International Conference on Acoust. Speech and Signal Processing, New York, pp. 107-110.
-
(1988)
Proceedings of International Conference on Acoust. Speech and Signal Processing
, pp. 107-110
-
-
Waibel, A.1
Hanazawa, T.2
Hinton, G.3
Shikano, K.4
Lang, K.5
-
74
-
-
0029378080
-
Spectral shape analysis in the central auditory system
-
Wang, K., Shamma, S.S., 1995. Spectral shape analysis in the central auditory system. IEEE Trans. Speech Audio Process. 3 (5), 382-394.
-
(1995)
IEEE Trans. Speech Audio Process.
, vol.3
, Issue.5
, pp. 382-394
-
-
Wang, K.1
Shamma, S.S.2
-
75
-
-
0030028881
-
Some effects of filtered context on the perception of vowels and fricatives
-
Watkins, A.J., Makin, S.J., 1997. Some effects of filtered context on the perception of vowels and fricatives. J. Acoust. Soc. Amer. 99 (1), 588-594.
-
(1997)
J. Acoust. Soc. Amer.
, vol.99
, Issue.1
, pp. 588-594
-
-
Watkins, A.J.1
Makin, S.J.2
-
76
-
-
0029726509
-
Improving environmental robustness in large vocabulary speech recognition
-
Woodland, P.C., Gales, M.J.F., Pye, D., 1996. Improving environmental robustness in large vocabulary speech recognition. In: Proceedings of International Conference on Acoust. Speech and Signal Processing, pp. 65-68.
-
(1996)
Proceedings of International Conference on Acoust. Speech and Signal Processing
, pp. 65-68
-
-
Woodland, P.C.1
Gales, M.J.F.2
Pye, D.3
-
77
-
-
0039777029
-
Scaling
-
Keidel O., Neff W. (Eds.), Springer, Berlin
-
Zwicker, E., 1975. Scaling. In: Keidel O., Neff W. (Eds.), Handbook of Sensory Physiology, Vol. V.3. Springer, Berlin, pp. 401-448.
-
(1975)
Handbook of Sensory Physiology
, vol.3
, pp. 401-448
-
-
Zwicker, E.1
|