SCOPUS 정보 검색 플랫폼

Journal of the Acoustical Society of America

Volumn 87, Issue 4, 1990, Pages 1738-1752

Perceptual linear predictive (PLP) analysis of speech

(1) Hermansky, Hynek a

a Panasonic Corporation of North America (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ARTICLE; HEARING; NONHUMAN; PSYCHOPHYSICS; SPECTROSCOPY; SPEECH ANALYSIS;

ADULT; ATTENTION; CHILD, PRESCHOOL; FOURIER ANALYSIS; HUMAN; LOUDNESS PERCEPTION; MALE; PERCEPTUAL DISTORTION; PHONETICS; PITCH DISCRIMINATION; SIGNAL PROCESSING, COMPUTER-ASSISTED; SOUND SPECTROGRAPHY; SPEECH ACOUSTICS; SPEECH PERCEPTION;

EID: 0025041264 PISSN: 00014966 EISSN: NA Source Type: Journal
DOI: 10.1121/1.399423 Document Type: Article

Times cited : (2056)

References (52)

1
- 0020932333
- Two-formant models of vowel perception: shortcomings and enhancements
- Bladon, A. (1983). “Two-formant models of vowel perception: shortcomings and enhancements,” Speech Commun. 2, 305–313.
- (1983) Speech Commun , vol.2 , pp. 305-313
- Bladon, A.¹

2
- 84955035811
- Fant and, G., (1978). “A two-formant model and the cardinal vowels,” STL-QPRS 1, 1–8, Royal Institute of Technology, Stockholm.
- Bladon, A., and Fant, G., (1978). “A two-formant model and the cardinal vowels,” STL-QPRS 1, 1–8, Royal Institute of Technology, Stockholm.
- Bladon, A.¹

3
- 84955044888
- Ladefoged and, P. (1982). “A further test of a two-formant model,” J. Acoust. Soc. Am. Suppl. 1 71, S104.
- Bladon, A., and Ladefoged, P. (1982). “A further test of a two-formant model,” J. Acoust. Soc. Am. Suppl. 1 71, S104.
- Bladon, A.¹

4
- 0019461948
- Lindblom and, B. (1981). “Modeling the judgment of vowel quality differences,” J. Acoust. Soc. Am. 69, 1414–1422.
- Bladon, A., and Lindblom, B. (1981). “Modeling the judgment of vowel quality differences,” J. Acoust. Soc. Am. 69, 1414–1422.
- Bladon, A.¹

5
- 0347387977
- An experimental automatic word recognition system
- JSRU Report No. 1003, Joint Speech Research Unit, Ruislip, England.
- Bridle, J. S., and Brown, M. D. (1974). “An experimental automatic word recognition system,” JSRU Report No. 1003, Joint Speech Research Unit, Ruislip, England.
- (1974)
- Bridle, J.S.¹ Brown, M.D.²

6
- 84955042539
- Hermansky, H. (1989). The front-cavity/F2' hypothesis tested by data on tongue movements, J. Acoust. Soc. Am. Suppl. 1 86, S13-S14.
- Broad, D. J., and Hermansky, H. (1989). The front-cavity/F2' hypothesis tested by data on tongue movements, J. Acoust. Soc. Am. Suppl. 1 86, S13-S14.
- Broad, D.J.¹

7
- 43549085820
- Some studies concerning perception of isolated vowels
- STL-QPRS 2–3, 19–35, Royal Institute of Technology, Stockholm.
- Carlson, R., Granstrom, B., and Fant, G. (1970). “Some studies concerning perception of isolated vowels,” STL-QPRS 2–3, 19–35, Royal Institute of Technology, Stockholm.
- (1970)
- Carlson, R.¹ Granstrom, B.² Fant, G.³

8
- 0002355130
- Two-formant models, pitch and vowel perception
- edited by G. S. Fant and M. A. A. Tatham (Academic, New York), pp. 55–82.
- Carlson, R., Fant, G., and Granstrom, B. (1975). “Two-formant models, pitch and vowel perception,” in Auditory Analysis and Perception of Speech, edited by G. S. Fant and M. A. A. Tatham (Academic, New York), pp. 55–82.
- (1975) Auditory Analysis and Perception of Speech
- Carlson, R.¹ Fant, G.² Granstrom, B.³

9
- 4243571925
- Vowel perception: the relative perceptual salience of selected acoustic manipulations
- STL-QPSR 3–4, 73–83, Royal Institute of Technology, Stockholm.
- Carlson, R., Granstrom, B., and Klatt, D. (1979). “Vowel perception: the relative perceptual salience of selected acoustic manipulations,” STL-QPSR 3–4, 73–83, Royal Institute of Technology, Stockholm.
- (1979)
- Carlson, R.¹ Granstrom, B.² Klatt, D.³

10
- 84955043059
- Kajiyama and, M. (1941). The Vowel: Its Nature and Structure (Tokyo Kaiseikan, Tokyo).
- Chiba, T., and Kajiyama, M. (1941). The Vowel: Its Nature and Structure (Tokyo Kaiseikan, Tokyo).
- Chiba, T.¹

11
- 84955031999
- in Frontiers of Speech Communication Research, edited by B. Linblom and S. Ohman (Academic, New York), pp. 143–157.
- Chistovich, L. A., Sheikin, R. L., and Lublinskaja, V. V. (1978). “‘Centers of gravity’ and spectral peaks as the determinants of vowel quality,” in Frontiers of Speech Communication Research, edited by B. Linblom and S. Ohman (Academic, New York), pp. 143–157.
- Chistovich, L.A.¹

12
- 0021906779
- Central auditory processing of peripheral vowel spectra
- Chistovich, L. A. (1985). “Central auditory processing of peripheral vowel spectra,” J. Acoust. Soc. Am. 77, 789–805.
- (1985) J. Acoust. Soc. Am. 77 , pp. 789-805
- Chistovich, L.A.¹

13
- 0000955505
- An experimental study of the acoustic determinants of vowel color
- Delattre, P., Liberman, A. M., Cooper, F. S., and Gerstman, L. J. (1952). “An experimental study of the acoustic determinants of vowel color,” Word 8, 195–210.
- (1952) Word , vol.8 , pp. 195-210
- Delattre, P.¹ Liberman, A.M.² Cooper, F.S.³ Gerstman, L.J.⁴

14
- 0023167190
- in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 87, pp. 320–323.
- El Jaroudi, A., and Makhoul, J. (1987). “Discrete all-pole modeling,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 87, pp. 320–323.
- El Jaroudi, A.¹

15
- 84955014830
- STL-QPRS 4, 7–11, Royal Institute of Technology, Stockholm.
- Fant, G., and Risberg, A. (1962). “Auditory matching of vowels with two formant synthetic sounds,” STL-QPRS 4, 7–11, Royal Institute of Technology, Stockholm.
- Fant, G.¹

16
- 0003418124
- Acoustic Theory of Speech Production
- (Mouton, The Hague), 2nd printing, p. 123.
- Fant, G. (1970). Acoustic Theory of Speech Production (Mouton, The Hague), 2nd printing, p. 123.
- (1970)
- Fant, G.¹

17
- 30244446534
- Vocal tract wall effects, losses and resonance bandwidths
- STL-QPRS 2–3, 28–52, Royal Institute of Technology, Stockholm.
- Fant, G. (1972). “Vocal tract wall effects, losses and resonance bandwidths,” STL-QPRS 2–3, 28–52, Royal Institute of Technology, Stockholm.
- (1972)
- Fant, G.¹

18
- 84912675689
- Difference limen for vowel formant frequency
- Flanagan, J. (1955). “Difference limen for vowel formant frequency,” J. Acoust. Soc. Am. 27, 613–617.
- (1955) J. Acoust. Soc. Am. 27 , pp. 613-617
- Flanagan, J.¹

19
- 0005065011
- Estimates of maximum precision necessary in quantizing certain dimensions of vowel sounds
- J. Acoust. Soc. Am. 29, 533—534.
- Flanagan, J. (1957). “Estimates of maximum precision necessary in quantizing certain dimensions of vowel sounds,” J. Acoust. Soc. Am. 29, 533—534.
- (1957)
- Flanagan, J.¹

20
- 0002439510
- Auditory patterns
- Fletcher, H. (1940). “Auditory patterns,” Rev. Mod. Phys. 12, 47–65.
- (1940) Rev. Mod. Phys. 12 , pp. 47-65
- Fletcher, H.¹

21
- 0014113409
- On the second spectral peak of front vowels: a perceptual study of the role of the second and third formants
- Fujimura, O. (1967). “On the second spectral peak of front vowels: a perceptual study of the role of the second and third formants,” Lang. Speech, 10, 181–193.
- (1967) Lang. Speech , vol.10 , pp. 181-193
- Fujimura, O.¹

22
- 84955047944
- Trans. Comm. Speech Res., Acoust. Soc. Japan, December 1973 (in Japanese).
- Fujisaki, H., and Sato, Y. (1973). “Comparison of errors in formant frequencies obtained by various methods of formant extraction,” Trans. Comm. Speech Res., Acoust. Soc. Japan, December 1973 (in Japanese).
- Fujisaki, H.¹

23
- 84955027174
- in Proceedings of International Conference on Chinese Information Processing, Beijung, China.
- Gu, Y., and Mason, J. S. D. (1987). “Vocal tract and auditory feature analysis using Chinese utterance in ASR system,” in Proceedings of International Conference on Chinese Information Processing, Beijung, China.
- Gu, Y.¹

24
- 84955040649
- Improved linear predictive analysis of speech based on spectral processing
- Ph.D. dissertation, University of Tokyo.
- Hermansky, H. (1982). “Improved linear predictive analysis of speech based on spectral processing,” Ph.D. dissertation, University of Tokyo.
- (1982)
- Hermansky, H.¹

25
- 0021124704
- Spectral envelope sampling and interpolation in linear predictive analysis of speech
- Hermansky, H., Fujisaki, H., and Sato, Y. (1984). “Spectral envelope sampling and interpolation in linear predictive analysis of speech” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 84, pp. 221–224.
- (1984) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 84, pp , pp. 221-224
- Hermansky, H.¹ Fujisaki, H.² Sato, Y.³

26
- 0022112505
- Low-dimensional representation of vowels based on all-pole modeling in the psychophysical domain
- Hermansky, H., Hanson, B. A., and Wakita, H. (1985). “Low-dimensional representation of vowels based on all-pole modeling in the psychophysical domain,” Speech Commun. 4, (1–3), 181–187.
- (1985) Speech Commun. 4, (1–3) , pp. 181-187
- Hermansky, H.¹ Hanson, B.A.² Wakita, H.³

27
- 0023167028
- in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 87
- Hermansky, H. (1987a). “An efficient speaker-independent automatic speech recognition by simulation of some properties of human auditory perception,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 87, pp. 1159–1162.

28
- 84955024049
- J. Acoust. Soc. Am. Suppl. 1 81, SI8; full text in STL Research Reports No. 1, Santa Barbara
- Hermansky, H. (1987b). “Why is the formant frequency DL curve asymmetric?,” J. Acoust. Soc. Am. Suppl. 1 81, SI8; full text in STL Research Reports No. 1, Santa Barbara, 1987.
- (1987)

29
- 0023839642
- Optimization of perceptually based ASR front-end
- Paper S5.10
- Hermansky, H., and Junqua, J. C. (1988). “Optimization of perceptually based ASR front-end,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 88, Paper S5.10, pp. 219—222.
- (1988) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 88 , pp. 219-222
- Hermansky, H.¹ Junqua, J.C.²

30
- 0024879199
- The effective second formant F2' and the vocal tract front cavity
- Paper S10a.4, pp. 480–483.
- Hermansky, H., and Broad, D. J. (1989). “The effective second formant F2' and the vocal tract front cavity,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 89, Paper S10a.4, pp. 480–483.
- (1989) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 89
- Hermansky, H.¹ Broad, D.J.²

31
- 60049088346
- On the role of fundamental frequency in vowel perception
- 1 84, SI56.
- Hirahara, T. (1988). “On the role of fundamental frequency in vowel perception,” J. Acoust. Soc. Am. Suppl. 1 84, SI56.
- (1988) J. Acoust. Soc. Am. Suppl
- Hirahara, T.¹

32
- 0037662539
- in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 76, Paper 9.2, pp. 310–313.
- Itahashi, S., and Yokoyama, S. (1976). “Automatic formant extraction utilizing mel scale and equal loudness contour,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 76, Paper 9.2, pp. 310–313.
- Itahashi, S.¹

33
- 84955019699
- Evaluation of ASR front-ends in speaker dependent and speaker independent recognition
- J. Acoust. Soc. Am. Suppl. 1 81, S93; Full text in STL Research Reports No. 1, Santa Barbara, 1987.
- Junqua, J. C. (1987). “Evaluation of ASR front-ends in speaker dependent and speaker independent recognition,” J. Acoust. Soc. Am. Suppl. 1 81, S93; Full text in STL Research Reports No. 1, Santa Barbara, 1987.
- (1987)
- Junqua, J.C.¹

34
- 84955050098
- J. Acoust. Soc. Am. Suppl. 1 78, S82.
- Kamm, C., and Kahn, D. (1985). “Relationship between LP residual spectral distances and phonetic judgement,” J. Acoust. Soc. Am. Suppl. 1 78, S82.
- Kamm, C.¹

35
- 0016542558
- On the front cavity resonance and its possible role in speech perception
- Kuhn, G. M. (1975). “On the front cavity resonance and its possible role in speech perception,” J. Acoust. Soc. Am. 58, 428–433.
- (1975) J. Acoust. Soc. Am. 58 , pp. 428-433
- Kuhn, G.M.¹

36
- 0018387712
- Stop consonant place perception with single-for-mant stimuli: Evidence for the role of the front-cavity resonance.
- Kuhn, G. M. (1978). “Stop consonant place perception with single-for-mant stimuli: Evidence for the role of the front-cavity resonance.”, J. Acoust. Soc. Am. 65, 774–788.
- (1978) J. Acoust. Soc. Am. 65 , pp. 774-788
- Kuhn, G.M.¹

37
- 84989489267
- Prediction of perceived phonetic distance from critical-band spectra: a first step
- Klatt, D. (1982). “Prediction of perceived phonetic distance from critical-band spectra: a first step,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 82, pp. 1278–1281.
- (1982) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 82 , pp. 1278-1281
- Klatt, D.¹

38
- 0016519041
- Spectral linear prediction: properties and applications
- Makhoul, J. (1975). “Spectral linear prediction: properties and applications,” IEEE Trans. ASSP-23, 283–296.
- (1975) IEEE Trans. ASSP-23 , pp. 283-296
- Makhoul, J.¹

39
- 85067594456
- in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 76, pp. Philadelphia.
- Makhoul, J., and Cosell, L. (1976). “LPCW: An LPC vocoder with linear predictive spectral warping,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 76, pp. 466-469, Philadelphia.
- Makhoul, J.¹

40
- 33845891520
- Perceptually-based features in ASR
- London.
- Mason, J. S., and Gu, Y. (1988). “Perceptually-based features in ASR,” in Proceedings of IEEE Colloquium on Speech Processing, London.
- (1988) Proceedings of IEEE Colloquium on Speech Processing
- Mason, J.S.¹ Gu, Y.²

41
- 0038133939
- Distance measures for speech recognition, psychological and instrumental
- edited by C. H. Chen (Academic, New York), pp. 374—388.
- Mermelstein, P. (1976). “Distance measures for speech recognition, psychological and instrumental,” in Pattern Recognition and Artificial Intelligence, edited by C. H. Chen (Academic, New York), pp. 374—388.
- (1976) Pattern Recognition and Artificial Intelligence
- Mermelstein, P.¹

42
- 0020905044
- Speech Commun. 2 (4), 295–303.
- Paliwal, K. K., Lindsay, D., and Ainsworth, W. A. (1983). “A study of two-formant models for vowel identification,” Speech Commun. 2 (4), 295–303.
- Paliwal, K.K.¹

43
- 0000146349
- Br. J. Appl. Phys.
- Robinson, D.W., and Dadson, R.S. (1956). “A redetermination of the equal-loudness relations for pure tones,” Br. J. Appl. Phys. 7, 166–181.
- , vol.7 , pp. 166-181

44
- 0015008817
- J. Acoust. Soc. Am. 49
- Rosenberg, A. (1970). “Effect of glottal pulse shape on the quality of natural vowels,” J. Acoust. Soc. Am. 49, 583–590.

45
- 17344367464
- Recognition of Complex Acoustic Signals, Life Sciences Research Report 5
- edited by T. H. Bullock (Abakon Verlag, Berlin), p. 324
- Schroeder, M. R. (1977). Recognition of Complex Acoustic Signals, Life Sciences Research Report 5, edited by T. H. Bullock (Abakon Verlag, Berlin), p. 324.
- (1977)
- Schroeder, M.R.¹

46
- 84928841878
- The acoustic features of speech phonemes in a model of auditory processing: Vowels and unvoiced fricatives
- Shamma, S. A. (1988). “The acoustic features of speech phonemes in a model of auditory processing: Vowels and unvoiced fricatives,” J. Phon. 16,79-91.
- (1988) J. Phon , vol.16 , pp. 79-91
- Shamma, S.A.¹

47
- 34447546202
- On the psychophysical law
- Psychol. Rev. 64, 153—181.
- Stevens, S. S. (1957). “On the psychophysical law,” Psychol. Rev. 64, 153—181.
- (1957)
- Stevens, S.S.¹

48
- 0019068177
- Linear prediction on a warped frequency scale
- Strube, H. W. (1980). “Linear prediction on a warped frequency scale,” J. Acoust. Soc. Am. 68, 1071–1076.
- (1980) J. Acoust. Soc. Am , vol.68 , pp. 1071-1076
- Strube, H.W.¹

49
- 84955025636
- IEEE Trans. ASSP-26
- Vishwanathan, R., and Makhoul, J. (1975). “Quantization properties of transmission parameters in linear predictive systems,” IEEE Trans. ASSP-26, 587–596.
- Vishwanathan, R.¹

50
- 0017969757
- Formant extraction from linear prediction phase spectra
- Yegnanarayana, B. (1977). “Formant extraction from linear prediction phase spectra,” J. Acoust. Soc. Am. 63, 1638–1640.
- (1977) J. Acoust. Soc. Am , vol.63 , pp. 1638-1640
- Yegnanarayana, B.¹

51
- 0018319567
- in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 79, pp. 744—747.
- Yegnanarayana, B., and Reddy, R. (1979). “A distance measure derived from the first derivative of the linear prediction phase spectra,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 79, pp. 744—747.
- Yegnanarayana, B.¹

52
- 0002237909
- Masking and psychological excitation as consequences of ear's frequency analysis
- edited by R. Plomp and G. F. Smoorenburg (Sijthoff, Leyden, The Netherlands).
- Zwicker, E. (1970). “Masking and psychological excitation as consequences of ear's frequency analysis,” in Frequency Analysis and Periodicity Detection in Hearing, edited by R. Plomp and G. F. Smoorenburg (Sijthoff, Leyden, The Netherlands).
- (1970) Frequency Analysis and Periodicity Detection in Hearing
- Zwicker, E.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.