SCOPUS 정보 검색 플랫폼

Speech Communication

Volumn 25, Issue 1-3, 1998, Pages 3-27

Should recognizers have ears?

(1) Hermansky, Hynek a,b,c

a OREGON HEALTH AND SCIENCE UNIVERSITY (United States)

b INTERNATIONAL COMPUTER SCIENCE INSTITUTE (United States)

c BRNO UNIVERSITY OF TECHNOLOGY (Czech Republic)

Author keywords

Auditory modeling; Automatic speech recognition; Human like processing; Modulation frequency

Indexed keywords

OPTIMIZATION; RANDOM PROCESSES; SENSORY PERCEPTION; SPECTRUM ANALYSIS; SPEECH;

SPEECH RECOGNIZERS;

SPEECH RECOGNITION;

EID: 0032139768 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/S0167-6393(98)00027-2 Document Type: Article

Times cited : (124)

References (77)

1
- 0027167185
- A dynamic cepstrum incorporating time-frequency masking and its application to continuous speech recognition
- Minneapolis, MN
- Aikawa, K., Singer, H., Kawahara, H., Tohkura, Y., 1993. A dynamic cepstrum incorporating time-frequency masking and its application to continuous speech recognition. In: Proceedings of the International Conference on Acoust. Speech and Signal Processing, Minneapolis, MN, pp. II-668-671.
- (1993) Proceedings of the International Conference on Acoust. Speech and Signal Processing
- Aikawa, K.¹ Singer, H.² Kawahara, H.³ Tohkura, Y.⁴

2
- 0028516073
- How do humans process and recognize speech?
- Allen, J.B., 1994. How do humans process and recognize speech?. IEEE Trans. Speech Audio Process. 2 (4), 567-577.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 567-577
- Allen, J.B.¹

3
- 0030369532
- Intelligibility of speech with filtered time trajectories of spectral envelopes
- Philadelphia
- Arai, T.M., Pavel, H.H., Avendano, C., 1996. Intelligibility of speech with filtered time trajectories of spectral envelopes. In: Proceedings of the International Conference on Spoken Language Processing, Philadelphia, pp. 2490-2493.
- (1996) Proceedings of the International Conference on Spoken Language Processing , pp. 2490-2493
- Arai, T.M.¹ Pavel, H.H.² Avendano, C.³

4
- 84898992685
- Coding of naturalistic stimuli by auditory midbrain neurons
- Morgan Kaufmann, Los Altos, CA.
- Attias, H., Schreiner, C.E., 1998. Coding of naturalistic stimuli by auditory midbrain neurons. In: Advances in Neural Information Processing Systems, Vol. 10. Morgan Kaufmann, Los Altos, CA.
- (1998) Advances in Neural Information Processing Systems , vol.10
- Attias, H.¹ Schreiner, C.E.²

5
- 0030374936
- Data based filter design for RASTA-like channel normalization in ASR
- Philadelphia
- Avendano, C. van Vuuren, S., Hermansky, H., 1996. Data based filter design for RASTA-like channel normalization in ASR. In: Proceedings of the International Conference on Spoken Language Processing, Philadelphia.
- (1996) Proceedings of the International Conference on Spoken Language Processing
- Avendano, C.¹ Van Vuuren, S.² Hermansky, H.³

6
- 0031347666
- On the properties of temporal processing for speech in adverse environments
- Mohonk Mountain House, New Paltz, New York
- Avendano, C., Hermansky, H., 1997. On the properties of temporal processing for speech in adverse environments. In: Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics, Mohonk Mountain House, New Paltz, New York.
- (1997) Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics
- Avendano, C.¹ Hermansky, H.²

7
- 0020932333
- Two-formant models of vowel perception: Shortcomings and enhancements
- Bladon, A., 1983. Two-formant models of vowel perception: Shortcomings and enhancements. Speech Communication 2, 305-313.
- (1983) Speech Communication , vol.2 , pp. 305-313
- Bladon, A.¹

8
- 0003573244
- Kluwer Academic Publishers, Dordrecht
- Bourlard, H., Morgan, N., 1994. Connectionist Speech Recognition - A Hybrid Approach. Kluwer Academic Publishers, Dordrecht.
- (1994) Connectionist Speech Recognition - A Hybrid Approach
- Bourlard, H.¹ Morgan, N.²

9
- 0004988601
- Copernicus and ASR challenge: Waiting for Kepler
- Arden House, NY
- Bourlard, H., Hermansky, H., Morgan, N., 1996. Copernicus and ASR challenge: Waiting for Kepler. In: Proceedings of the ARPA ASR Workshop Spring 1996, Arden House, NY, pp. 157-162.
- (1996) Proceedings of the ARPA ASR Workshop Spring 1996 , pp. 157-162
- Bourlard, H.¹ Hermansky, H.² Morgan, N.³

10
- 0030355935
- A new ASR approach based on independent processing and re-combination of partial frequency bands
- Philadelphia
- Bourlard, H., Dupont, S., 1996. A new ASR approach based on independent processing and re-combination of partial frequency bands. In: Proceedings of the International Conference on Spoken Language Processing, Philadelphia, pp. 426-429.
- (1996) Proceedings of the International Conference on Spoken Language Processing , pp. 426-429
- Bourlard, H.¹ Dupont, S.²

11
- 0347387977
- An experimental automatic word recognition system
- Joint Speech Research Unit, Ruislip, England
- Bridle, J.S., Brown, M.D., 1974. An experimental automatic word recognition system. JSRU Report No. 1003, Joint Speech Research Unit, Ruislip, England.
- (1974) JSRU Report No. 1003 , vol.1003
- Bridle, J.S.¹ Brown, M.D.²

12
- 25044464569
- The front cavity/F2' hypothesis tested by data on tongue movements
- Broad, D., Hermansky, H., 1989. The front cavity/F2' hypothesis tested by data on tongue movements. J. Acoust. Soc. Amer. 86 (Suppl. 1), S13-S14.
- (1989) J. Acoust. Soc. Amer. , vol.86 , Issue.1 SUPPL.
- Broad, D.¹ Hermansky, H.²

13
- 0003572996
- Ph.D. Thesis, Computer Science Department, Carnegie Mellon University
- Brown, P., 1987. The acoustic-modeling problem in automatic speech recognition. Ph.D. Thesis, Computer Science Department, Carnegie Mellon University.
- (1987) The Acoustic-modeling Problem in Automatic Speech Recognition
- Brown, P.¹

14
- 0024392496
- Application of an auditory model to speech recognition
- Cohen, J.R., 1989. Application of an auditory model to speech recognition. J. Acoust. Soc. Amer. 85 (6), 2623-2629.
- (1989) J. Acoust. Soc. Amer. , vol.85 , Issue.6 , pp. 2623-2629
- Cohen, J.R.¹

15
- 84955042239
- Some experiments on the perception of synthetic speech sounds
- Cooper, F.S., Delattre, P.C., Liberman, A.M., Borst, J.M., Gerstman, L.J., 1952. Some experiments on the perception of synthetic speech sounds. J. Acoust. Soc. Amer. 24, 579-606.
- (1952) J. Acoust. Soc. Amer. , vol.24 , pp. 579-606
- Cooper, F.S.¹ Delattre, P.C.² Liberman, A.M.³ Borst, J.M.⁴ Gerstman, L.J.⁵

16
- 0029725367
- Real-time recognition of broadcast radio speech
- Cook, G.D., Christie, J.D., Clarkson, P.R., Hochberg, M.M., Logan, B.T., Robinson, A.J., 1996. Real-time recognition of broadcast radio speech. In: Proceedings of the International Conference on Acoust. Speech and Signal Processing, pp. 141-144.
- (1996) Proceedings of the International Conference on Acoust. Speech and Signal Processing , pp. 141-144
- Cook, G.D.¹ Christie, J.D.² Clarkson, P.R.³ Hochberg, M.M.⁴ Logan, B.T.⁵ Robinson, A.J.⁶

17
- 0021906779
- Central auditory processing of peripheral vowel spectra
- Chistovich, L.A., 1985. Central auditory processing of peripheral vowel spectra. J. Acoust. Soc. Amer. 77, 789-805.
- (1985) J. Acoust. Soc. Amer. , vol.77 , pp. 789-805
- Chistovich, L.A.¹

18
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- Davis, S.B., Mermelstein, P., 1980. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans. Acoust. Speech Signal Process. 28 (4), 357-366.
- (1980) IEEE Trans. Acoust. Speech Signal Process. , vol.28 , Issue.4 , pp. 357-366
- Davis, S.B.¹ Mermelstein, P.²

19
- 0347387973
- Sound feature decomposition by the primary auditory cortex
- Breckenridge, Colorado (submitted to Science, also unpublished technical memo)
- deCharms, C.R., Blake, D., Merzenich, M.M., 1997. Sound feature decomposition by the primary auditory cortex. In: 1997 Workshop on Advances in Neural Information Processing, Breckenridge, Colorado (submitted to Science, also unpublished technical memo).
- (1997) 1997 Workshop on Advances in Neural Information Processing
- DeCharms, C.R.¹ Blake, D.² Merzenich, M.M.³

20
- 0027957839
- Effect of temporal envelope smearing on speech reception
- Drullman, R., Festen, J.M., Plomp, R., 1994. Effect of temporal envelope smearing on speech reception. J. Acoust. Soc. Amer. 95, 1053-1064.
- (1994) J. Acoust. Soc. Amer. , vol.95 , pp. 1053-1064
- Drullman, R.¹ Festen, J.M.² Plomp, R.³

21
- 0028287770
- Effect of reducing slow temporal modulations on speech reception
- Drullman, R., Festen, J.M., Plomp, R., 1994. Effect of reducing slow temporal modulations on speech reception. J. Acoust. Soc. Amer. 95, 2670-2680.
- (1994) J. Acoust. Soc. Amer. , vol.95 , pp. 2670-2680
- Drullman, R.¹ Festen, J.M.² Plomp, R.³

22
- 84885525095
- Auditory matching of vowels with two formant synthetic sounds
- Speech Transmission Laboratory, Royal Institute of Technology, Stockholm
- Fant, G., Risberg, A., 1962. Auditory matching of vowels with two formant synthetic sounds. Quarterly Progress and Status Report 4, Speech Transmission Laboratory, Royal Institute of Technology, Stockholm.
- (1962) Quarterly Progress and Status Report , vol.4
- Fant, G.¹ Risberg, A.²

23
- 0038747568
- Acoustic description and classification of phonetic units
- Fant, G., 1965. Acoustic description and classification of phonetic units. Ericsson Technics, No. 1, reprinted in: Fant, G., 1973. Speech Sounds and Features. MIT Press, Cambridge, MA.
- (1965) Ericsson Technics , vol.1
- Fant, G.¹

24
- 0004110342
- reprinted MIT Press, Cambridge, MA.
- Fant, G., 1965. Acoustic description and classification of phonetic units. Ericsson Technics, No. 1, reprinted in: Fant, G., 1973. Speech Sounds and Features. MIT Press, Cambridge, MA.
- (1973) Speech Sounds and Features
- Fant, G.¹

25
- 0003757962
- Springer, Berlin
- Flanagan, J.L., 1972. Speech Analysis Synthesis and Perception, second edition. Springer, Berlin.
- (1972) Speech Analysis Synthesis and Perception, Second Edition
- Flanagan, J.L.¹

26
- 0003549684
- Krieger, New York
- Fletcher, H., 1953. Speech and Hearing in Communication. Krieger, New York.
- (1953) Speech and Hearing in Communication
- Fletcher, H.¹

27
- 0014113409
- On the second spectral peak of front vowels: A perceptual study of the role of the second and third formants
- Fujimura, O., 1964. On the second spectral peak of front vowels: A perceptual study of the role of the second and third formants. Language and Speech 10, 181-193.
- (1964) Language and Speech , vol.10 , pp. 181-193
- Fujimura, O.¹

28
- 0019555090
- Cepstral analysis technique for automatic speaker verification
- Furui, S., 1981. Cepstral analysis technique for automatic speaker verification. IEEE Trans. Acoust. Speech Signal Process. 29, 254-272.
- (1981) IEEE Trans. Acoust. Speech Signal Process. , vol.29 , pp. 254-272
- Furui, S.¹

29
- 0001942829
- Neural networks and the bias/variance dilemma
- Geman, S., Bienenstock, E., Doursat, R., 1992. Neural networks and the bias/variance dilemma. Neural Computation 4 (1), 1-58.
- (1992) Neural Computation , vol.4 , Issue.1 , pp. 1-58
- Geman, S.¹ Bienenstock, E.² Doursat, R.³

30
- 0028996921
- Auditory scene analysis and hidden Markov model recognition of speech in noise
- Detroit, MI
- Green, P.D., Cooke, M.P., Crawford, M.D., 1995. Auditory scene analysis and hidden Markov model recognition of speech in noise. In: Proceedings of International Conference on Acoust. Speech and Signal Processing, Detroit, MI, pp. 401-404.
- (1995) Proceedings of International Conference on Acoust. Speech and Signal Processing , pp. 401-404
- Green, P.D.¹ Cooke, M.P.² Crawford, M.D.³

31
- 0039881085
- On the origins of speech intelligibility in the real world
- Pont-a-Mousson, France
- Greenberg, S., 1997. On the origins of speech intelligibility in the real world. In: Proceedings of ESCA-NATO Tutorial and Research Workshop on Robust speech recognition for unknown communication channels, Pont-a-Mousson, France, pp. 23-32.
- (1997) Proceedings of ESCA-NATO Tutorial and Research Workshop on Robust Speech Recognition for Unknown Communication Channels , pp. 23-32
- Greenberg, S.¹

32
- 0141629798
- Spectral dynamics for speech recognition under adverse conditions
- Lee, C.H., Soong, F.K., Paliwal, K.K. (Eds.), Kluwer Academic Publishers, Dordrecht
- Hanson, B.A., Applebaum, T.H., Junqua, J.C., 1996. Spectral dynamics for speech recognition under adverse conditions. In: Lee, C.H., Soong, F.K., Paliwal, K.K. (Eds.), Automatic Speech and Speaker Recognition. Kluwer Academic Publishers, Dordrecht.
- (1996) Automatic Speech and Speaker Recognition
- Hanson, B.A.¹ Applebaum, T.H.² Junqua, J.C.³

33
- 0021122763
- The harmonic magnitude suppression (HMS) technique for intelligibility enhancement in the presence in interfering speech
- Hanson, B., Wong, D., 1984. The harmonic magnitude suppression (HMS) technique for intelligibility enhancement in the presence in interfering speech. In: Proceedings of International Conference on Acoust. Speech and Signal Processing, pp. 18.A.5.1-18.A.5.4.
- (1984) Proceedings of International Conference on Acoust. Speech and Signal Processing
- Hanson, B.¹ Wong, D.²

34
- 0004220068
- Dover, New York
- Helmholtz, H., 1954. On the Sensation of Tone. Dover, New York.
- (1954) On the Sensation of Tone
- Helmholtz, H.¹

35
- 0025041264
- Perceptual linear predictive (PLP) analysis of speech
- Hermansky, H., 1990. Perceptual linear predictive (PLP) analysis of speech. J. Acoust. Soc. Amer. 87 (4), 1738-1752.
- (1990) J. Acoust. Soc. Amer. , vol.87 , Issue.4 , pp. 1738-1752
- Hermansky, H.¹

36
- 0348018654
- Exploring temporal domain for robustness in speech recognition
- Trondheim, Norway
- Hermansky, H., 1995. Exploring temporal domain for robustness in speech recognition. In: Proceedings of the 15th International Congress on Acoustics, Vol. II, Trondheim, Norway, pp. 61-64.
- (1995) Proceedings of the 15th International Congress on Acoustics , vol.2 , pp. 61-64
- Hermansky, H.¹

37
- 0030365517
- Towards ASR on partially corrupted speech
- Philadelphia, PA
- Hermansky, H., Tibrewala, S., Pavel, M., 1996. Towards ASR on partially corrupted speech. In: Proceedings of International Conference on Spoken Language Processing, Philadelphia, PA, pp. 462-465.
- (1996) Proceedings of International Conference on Spoken Language Processing , pp. 462-465
- Hermansky, H.¹ Tibrewala, S.² Pavel, M.³

38
- 0020542318
- Analysis and synthesis of speech based on spectral transform linear predictive method
- Boston, MA
- Hermansky, H., Fujisaki, H., Sato, Y., 1983. Analysis and synthesis of speech based on spectral transform linear predictive method. In: Proceedings of International Conference on Acoust. Speech and Signal Processing, Boston, MA, pp. 777-780.
- (1983) Proceedings of International Conference on Acoust. Speech and Signal Processing , pp. 777-780
- Hermansky, H.¹ Fujisaki, H.² Sato, Y.³

39
- 0028517164
- RASTA processing of speech
- Hermansky, H., Morgan, N., 1994. RASTA processing of speech. IEEE Trans. Speech Audio Process. 2 (4), 578-589.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

40
- 0028996922
- Speech enhancement based on temporal processing
- Detroit, MI
- Hermansky, H., Wan, E., Avendano, C., 1995. Speech enhancement based on temporal processing. In: Proceedings of International Conference on Acoust. Speech and Signal Processing, Detroit, MI, pp. 405-408.
- (1995) Proceedings of International Conference on Acoust. Speech and Signal Processing , pp. 405-408
- Hermansky, H.¹ Wan, E.² Avendano, C.³

41
- 0024879199
- The effective second formant F2' and the vocal tract front cavity
- Glasgow, Scotland
- Hermansky, H., Broad, D., 1989. The effective second formant F2' and the vocal tract front cavity. In: Proceedings of International Conference on Acoust. Speech and Signal Processing, Glasgow, Scotland, pp. 480-483.
- (1989) Proceedings of International Conference on Acoust. Speech and Signal Processing , pp. 480-483
- Hermansky, H.¹ Broad, D.²

42
- 85135377175
- Compensation for the effect of the communication channel in auditory-like analysis of speech (RASTA-PLP)
- Genova, Italy
- Hermansky, H., Morgan, N., Bayya, A., Kohn, P., 1991. Compensation for the effect of the communication channel in auditory-like analysis of speech (RASTA-PLP). In: Proceedings of Eurospeech'91, Genova, Italy, pp. 1367-1371.
- (1991) Proceedings of Eurospeech'91 , pp. 1367-1371
- Hermansky, H.¹ Morgan, N.² Bayya, A.³ Kohn, P.⁴

43
- 0347387953
- Psychophysics of speech engineering systems
- Invited paper, Stockholm, Sweden
- Hermansky, H., Pavel, M., 1995. Psychophysics of speech engineering systems. Invited paper, 13th International Congress on Phonetic Sciences, Stockholm, Sweden, pp. 42-49.
- (1995) 13th International Congress on Phonetic Sciences , pp. 42-49
- Hermansky, H.¹ Pavel, M.²

44
- 3543081154
- Modulation spectrum in speech processing
- Prochazka, A., Uhlir, J., Rayner, P.J.W., Kingsbury, N.G. (Eds.), Birkhauser, Boston
- Hermansky, H., 1988. Modulation spectrum in speech processing. In: Prochazka, A., Uhlir, J., Rayner, P.J.W., Kingsbury, N.G. (Eds.), Signal Analysis and Prediction. Birkhauser, Boston.
- (1988) Signal Analysis and Prediction
- Hermansky, H.¹

45
- 0011823639
- Improved speech recognition using high-pass filtering of subband envelopes
- Genova, Italy
- Hirsch, H.G., Meyer, P., Ruehl, H., 1991. Improved speech recognition using high-pass filtering of subband envelopes. In: Proceedings of Eurospeech'91, Genova, Italy, pp. 413-416.
- (1991) Proceedings of Eurospeech'91 , pp. 413-416
- Hirsch, H.G.¹ Meyer, P.² Ruehl, H.³

46
- 84873312246
- A review of the MTF concept in room acoustics and its use for estimating speech intelligibility in auditoria
- Houtgast, T., Steeneken, H.J.M., 1985. A review of the MTF concept in room acoustics and its use for estimating speech intelligibility in auditoria. J. Acoust. Soc. Amer. 77 (3), 1069-1077.
- (1985) J. Acoust. Soc. Amer. , vol.77 , Issue.3 , pp. 1069-1077
- Houtgast, T.¹ Steeneken, H.J.M.²

47
- 0038133932
- A statistical approach to metrics for word and syllable recognition
- Hunt, M.J., 1979. A statistical approach to metrics for word and syllable recognition. J. Acoust. Soc. Amer. 66 (S1), S35(A).
- (1979) J. Acoust. Soc. Amer. , vol.66 , Issue.S1
- Hunt, M.J.¹

48
- 0024905238
- A comparison of several acoustic representations for speech recognition with degraded and undegraded speech
- Glasgow, Scotland
- Hunt, M., Lefebvre, C., 1989. A comparison of several acoustic representations for speech recognition with degraded and undegraded speech. In: Proceedings of International Conference on Acoust. Speech and Signal Processing, Glasgow, Scotland, pp. 262-265.
- (1989) Proceedings of International Conference on Acoust. Speech and Signal Processing , pp. 262-265
- Hunt, M.¹ Lefebvre, C.²

49
- 0037662539
- Automatic formant extraction utilizing mel scale and equal loudness contour
- Philadelphia, PA
- Itahashi, S., Yokoyama, S., 1976. Automatic formant extraction utilizing mel scale and equal loudness contour. In: Proceedings of International Conference on Acoust. Speech and Signal Processing, Philadelphia, PA, pp. 310-313.
- (1976) Proceedings of International Conference on Acoust. Speech and Signal Processing , pp. 310-313
- Itahashi, S.¹ Yokoyama, S.²

50
- 0026626584
- Speaker independent phonetic classification in continuous English letters
- Seattle, WA
- Janseen, R.D.T., Fanty, M., Cole, R.A., 1991. Speaker independent phonetic classification in continuous English letters. In: Proceedings of International Joint Conference on Neural Networks, Seattle, WA, pp. II-801-808.
- (1991) Proceedings of International Joint Conference on Neural Networks
- Janseen, R.D.T.¹ Fanty, M.² Cole, R.A.³

51
- 0020117798
- Forward masking as a function of frequency, masker level, and signal delay
- Jestead, W., Bacon, S.P., Lehman, J.R., 1982. Forward masking as a function of frequency, masker level, and signal delay. J. Acoust. Soc. Amer. 950-962.
- (1982) J. Acoust. Soc. Amer. , pp. 950-962
- Jestead, W.¹ Bacon, S.P.² Lehman, J.R.³

52
- 84883097102
- On the importance of various modulation frequencies for speech recognition
- Rhodos, Greece
- Kanedera, N., Arai, T., Hermansky, H., Pavel, M., 1997. On the importance of various modulation frequencies for speech recognition. In: Proceedings of Eurospeech'97, Rhodos, Greece, pp. 1079-1082.
- (1997) Proceedings of Eurospeech'97 , pp. 1079-1082
- Kanedera, N.¹ Arai, T.² Hermansky, H.³ Pavel, M.⁴

53
- 0346126997
- Submitted to Speech Communication
- Kanedera, N., Arai, T., Hermansky, H., Pavel, M., 1997. On the relative importance of various components of the modulation spectrum for automatic speech recognition. Submitted to Speech Communication.
- (1997) On the Relative Importance of Various Components of the Modulation Spectrum for Automatic Speech Recognition
- Kanedera, N.¹ Arai, T.² Hermansky, H.³ Pavel, M.⁴

54
- 0001490199
- Speech processing strategies based on auditory models
- Carlson, R., Granstrom, B. (Eds.), Elsevier Biomedical Press, New York
- Klatt, D.H., 1982. Speech processing strategies based on auditory models. In: Carlson, R., Granstrom, B. (Eds.), The Representation of Speech in The Peripheral Auditory System. Elsevier Biomedical Press, New York, pp. 181-202.
- (1982) The Representation of Speech in the Peripheral Auditory System , pp. 181-202
- Klatt, D.H.¹

55
- 0004288806
- Translated from Russian by US Department of Commerce
- Kozhevnikov, V.A., Chistovich, L.A., 1967. Speech: Articulation and Perception. Translated from Russian by US Department of Commerce, p. 250, 251.
- (1967) Speech: Articulation and Perception , pp. 250
- Kozhevnikov, V.A.¹ Chistovich, L.A.²

56
- 0003636274
- Oxford Univ. Press, Oxford
- Ladefoged, P., 1967. Three Areas of Experimental Phonetics. Oxford Univ. Press, Oxford, p. 65.
- (1967) Three Areas of Experimental Phonetics , pp. 65
- Ladefoged, P.¹

57
- 0018478297
- Spectral root homomorphic deconvolution system
- Lim, J.S., 1979. Spectral root homomorphic deconvolution system. IEEE Trans. Acoust. Speech Signal Process. 27 (3), 223-233.
- (1979) IEEE Trans. Acoust. Speech Signal Process. , vol.27 , Issue.3 , pp. 223-233
- Lim, J.S.¹

58
- 0029754956
- Accurate consonant perception without mid-frequency speech energy
- Lippmann, R.P., 1995. Accurate consonant perception without mid-frequency speech energy. IEEE Trans. Speech and Audio 4 (1), 66-69.
- (1995) IEEE Trans. Speech and Audio , vol.4 , Issue.1 , pp. 66-69
- Lippmann, R.P.¹

59
- 0020544161
- Recognition of consonant based on the perceptron model
- Boston, MA
- Makino, S., Kawabata, T., Kido, K., 1983. Recognition of consonant based on the perceptron model. In: Proceedings of International Conference on Acoust. Speech and Signal Processing, Boston, MA, pp. 738-741.
- (1983) Proceedings of International Conference on Acoust. Speech and Signal Processing , pp. 738-741
- Makino, S.¹ Kawabata, T.² Kido, K.³

60
- 33646771442
- Towards decomposing the sources of variability in speech
- Rhodos, Greece
- Malayath, N., Hermansky, H., Kain, A., 1997. Towards decomposing the sources of variability in speech. In: Proceedings of Eurospeech'97, Rhodos, Greece.
- (1997) Proceedings of Eurospeech'97
- Malayath, N.¹ Hermansky, H.² Kain, A.³

61
- 0003834557
- Freeman, San Francisco, CA.
- Marr, D., 1982. Vision. Freeman, San Francisco, CA.
- (1982) Vision
- Marr, D.¹

62
- 0038133939
- Distance measures for speech recognition, psychological and instrumental
- Chen, R.C.H. (Ed.), Academic Press, New York
- Mermelstein, P., 1976. Distance measures for speech recognition, psychological and instrumental. In: Chen, R.C.H. (Ed.), Pattern Recognition and Artificial Intelligence. Academic Press, New York, pp. 374-388.
- (1976) Pattern Recognition and Artificial Intelligence , pp. 374-388
- Mermelstein, P.¹

63
- 0002127129
- Probabilistic optimum filtering for robust speech recognition
- Adelaide, Australia
- Neumayer, L., Weintraub, M., 1994. Probabilistic optimum filtering for robust speech recognition. In: Proceedings of International Conference on Acoust. Speech and Signal Processing, Adelaide, Australia, pp. I-417-420.
- (1994) Proceedings of International Conference on Acoust. Speech and Signal Processing
- Neumayer, L.¹ Weintraub, M.²

64
- 0346757724
- Ph.D. Thesis, New York University
- Pavel, M., 1980. Homogeneity in complete and partial masking. Ph.D. Thesis, New York University.
- (1980) Homogeneity in Complete and Partial Masking
- Pavel, M.¹

65
- 0007636578
- Temporal masking in automatic speech recognition
- Pavel, M., Hermansky, H., 1994. Temporal masking in automatic speech recognition. J. Acoust. Soc. Amer. A 95, 2876.
- (1994) J. Acoust. Soc. Amer. A , vol.95 , pp. 2876
- Pavel, M.¹ Hermansky, H.²

66
- 0015129120
- Real-time recognition of spoken words
- Pols, L.C.W., 1971. Real-time recognition of spoken words. IEEE Trans. Comput. 20 (C) 972-978.
- (1971) IEEE Trans. Comput. , vol.20 , Issue.C , pp. 972-978
- Pols, L.C.W.¹

67
- 84881675408
- Cepstral channel normalization techniques for HMM-based speaker verification
- Yokohama, Japan
- Rosenberg, A.E., Lee, C., Soong, F.K., 1994. Cepstral channel normalization techniques for HMM-based speaker verification. In: Proceedings of International Conference on Spoken Language Processing, Yokohama, Japan, pp. 1835-1838.
- (1994) Proceedings of International Conference on Spoken Language Processing , pp. 1835-1838
- Rosenberg, A.E.¹ Lee, C.² Soong, F.K.³

68
- 84928837806
- A joint synchrony/mean-rate model of auditory speech processing
- Seneff, S., 1985. A joint synchrony/mean-rate model of auditory speech processing. J. Phonetics 16 (1), 55-76.
- (1985) J. Phonetics , vol.16 , Issue.1 , pp. 55-76
- Seneff, S.¹

69
- 0011405405
- Brightness and loudness as functions of stimulus duration
- Stevens, J.C., Hall, J.W., 1966. Brightness and loudness as functions of stimulus duration. Perception and Psychophysics 1, 319-327.
- (1966) Perception and Psychophysics , vol.1 , pp. 319-327
- Stevens, J.C.¹ Hall, J.W.²

70
- 0002220140
- Applying phonetic knowledge to lexical access
- Madrid, Spain
- Stevens, K.N., 1996. Applying phonetic knowledge to lexical access. In: Proceedings of Eurospeech'95, Madrid, Spain, p. 3.
- (1996) Proceedings of Eurospeech'95 , pp. 3
- Stevens, K.N.¹

71
- 85135190755
- Multi-band and adaptation approaches to robust speech recognition
- Rhodos, Greece
- Tibrewala, S., Hermansky, H., 1997. Multi-band and adaptation approaches to robust speech recognition. In: Proceedings of Eurospeech'97, Rhodos, Greece, pp. 2619-2622.
- (1997) Proceedings of Eurospeech'97 , pp. 2619-2622
- Tibrewala, S.¹ Hermansky, H.²

72
- 84947590142
- Data-driven design of RASTA-like filters
- Rhodos, Greece
- van Vuuren, S., Hermansky, H., 1997. Data-driven design of RASTA-like filters. In: Proceedings of Eurospeech'97, Rhodos, Greece, pp. 409-412.
- (1997) Proceedings of Eurospeech'97 , pp. 409-412
- Van Vuuren, S.¹ Hermansky, H.²

73
- 0023833469
- Phoneme recognition using time-delay neural networks
- New York
- Waibel, A., Hanazawa, T., Hinton, G., Shikano, K., Lang, K., 1988. Phoneme recognition using time-delay neural networks, Proceedings of International Conference on Acoust. Speech and Signal Processing, New York, pp. 107-110.
- (1988) Proceedings of International Conference on Acoust. Speech and Signal Processing , pp. 107-110
- Waibel, A.¹ Hanazawa, T.² Hinton, G.³ Shikano, K.⁴ Lang, K.⁵

74
- 0029378080
- Spectral shape analysis in the central auditory system
- Wang, K., Shamma, S.S., 1995. Spectral shape analysis in the central auditory system. IEEE Trans. Speech Audio Process. 3 (5), 382-394.
- (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.5 , pp. 382-394
- Wang, K.¹ Shamma, S.S.²

75
- 0030028881
- Some effects of filtered context on the perception of vowels and fricatives
- Watkins, A.J., Makin, S.J., 1997. Some effects of filtered context on the perception of vowels and fricatives. J. Acoust. Soc. Amer. 99 (1), 588-594.
- (1997) J. Acoust. Soc. Amer. , vol.99 , Issue.1 , pp. 588-594
- Watkins, A.J.¹ Makin, S.J.²

76
- 0029726509
- Improving environmental robustness in large vocabulary speech recognition
- Woodland, P.C., Gales, M.J.F., Pye, D., 1996. Improving environmental robustness in large vocabulary speech recognition. In: Proceedings of International Conference on Acoust. Speech and Signal Processing, pp. 65-68.
- (1996) Proceedings of International Conference on Acoust. Speech and Signal Processing , pp. 65-68
- Woodland, P.C.¹ Gales, M.J.F.² Pye, D.³

77
- 0039777029
- Scaling
- Keidel O., Neff W. (Eds.), Springer, Berlin
- Zwicker, E., 1975. Scaling. In: Keidel O., Neff W. (Eds.), Handbook of Sensory Physiology, Vol. V.3. Springer, Berlin, pp. 401-448.
- (1975) Handbook of Sensory Physiology , vol.3 , pp. 401-448
- Zwicker, E.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.