



Volume 87, Issue 4, 1990, Pages 1738-1752

Perceptual linear predictive (PLP) analysis of speech

Author keywords

[No Author keywords available]

Indexed keywords

ARTICLE; HEARING; NONHUMAN; PSYCHOPHYSICS; SPECTROSCOPY; SPEECH ANALYSIS

EID: 0025041264     PISSN: 00014966     EISSN: NA     Source Type: Journal    
DOI: 10.1121/1.399423     Document Type: Article
Times cited: 2056

References (52)
  • 1
    • 0020932333
    • Two-formant models of vowel perception: shortcomings and enhancements
    • Bladon, A. (1983). “Two-formant models of vowel perception: shortcomings and enhancements,” Speech Commun. 2, 305–313.
    • (1983) Speech Commun , vol.2 , pp. 305-313
    • Bladon, A.1
  • 2
    • 84955035811
    • Bladon, A., and Fant, G. (1978). “A two-formant model and the cardinal vowels,” STL-QPSR 1, 1–8, Royal Institute of Technology, Stockholm.
    • Bladon, A.1
  • 3
    • 84955044888
    • Bladon, A., and Ladefoged, P. (1982). “A further test of a two-formant model,” J. Acoust. Soc. Am. Suppl. 1 71, S104.
    • Bladon, A.1
  • 4
    • 0019461948
    • Bladon, A., and Lindblom, B. (1981). “Modeling the judgment of vowel quality differences,” J. Acoust. Soc. Am. 69, 1414–1422.
    • Bladon, A.1
  • 5
    • 0347387977
    • An experimental automatic word recognition system
    • JSRU Report No. 1003, Joint Speech Research Unit, Ruislip, England.
    • Bridle, J. S., and Brown, M. D. (1974). “An experimental automatic word recognition system,” JSRU Report No. 1003, Joint Speech Research Unit, Ruislip, England.
    • (1974)
    • Bridle, J.S.1    Brown, M.D.2
  • 6
    • 84955042539
    • Broad, D. J., and Hermansky, H. (1989). “The front-cavity/F2' hypothesis tested by data on tongue movements,” J. Acoust. Soc. Am. Suppl. 1 86, S13–S14.
    • Broad, D.J.1
  • 7
    • 43549085820
    • Some studies concerning perception of isolated vowels
    • STL-QPSR 2–3, 19–35, Royal Institute of Technology, Stockholm.
    • Carlson, R., Granstrom, B., and Fant, G. (1970). “Some studies concerning perception of isolated vowels,” STL-QPSR 2–3, 19–35, Royal Institute of Technology, Stockholm.
    • (1970)
    • Carlson, R.1    Granstrom, B.2    Fant, G.3
  • 8
    • 0002355130
    • Two-formant models, pitch and vowel perception
    • edited by G. S. Fant and M. A. A. Tatham (Academic, New York), pp. 55–82.
    • Carlson, R., Fant, G., and Granstrom, B. (1975). “Two-formant models, pitch and vowel perception,” in Auditory Analysis and Perception of Speech, edited by G. S. Fant and M. A. A. Tatham (Academic, New York), pp. 55–82.
    • (1975) Auditory Analysis and Perception of Speech
    • Carlson, R.1    Fant, G.2    Granstrom, B.3
  • 9
    • 4243571925
    • Vowel perception: the relative perceptual salience of selected acoustic manipulations
    • STL-QPSR 3–4, 73–83, Royal Institute of Technology, Stockholm.
    • Carlson, R., Granstrom, B., and Klatt, D. (1979). “Vowel perception: the relative perceptual salience of selected acoustic manipulations,” STL-QPSR 3–4, 73–83, Royal Institute of Technology, Stockholm.
    • (1979)
    • Carlson, R.1    Granstrom, B.2    Klatt, D.3
  • 10
    • 84955043059
    • Chiba, T., and Kajiyama, M. (1941). The Vowel: Its Nature and Structure (Tokyo Kaiseikan, Tokyo).
    • Chiba, T.1
  • 11
    • 84955031999
    • in Frontiers of Speech Communication Research, edited by B. Lindblom and S. Ohman (Academic, New York), pp. 143–157.
    • Chistovich, L. A., Sheikin, R. L., and Lublinskaja, V. V. (1978). “‘Centers of gravity’ and spectral peaks as the determinants of vowel quality,” in Frontiers of Speech Communication Research, edited by B. Lindblom and S. Ohman (Academic, New York), pp. 143–157.
    • Chistovich, L.A.1
  • 12
    • 0021906779
    • Central auditory processing of peripheral vowel spectra
    • Chistovich, L. A. (1985). “Central auditory processing of peripheral vowel spectra,” J. Acoust. Soc. Am. 77, 789–805.
    • (1985) J. Acoust. Soc. Am. 77 , pp. 789-805
    • Chistovich, L.A.1
  • 13
    • 0000955505
    • An experimental study of the acoustic determinants of vowel color
    • Delattre, P., Liberman, A. M., Cooper, F. S., and Gerstman, L. J. (1952). “An experimental study of the acoustic determinants of vowel color,” Word 8, 195–210.
    • (1952) Word , vol.8 , pp. 195-210
    • Delattre, P.1    Liberman, A.M.2    Cooper, F.S.3    Gerstman, L.J.4
  • 14
    • 0023167190
    • in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 87, pp. 320–323.
    • El Jaroudi, A., and Makhoul, J. (1987). “Discrete all-pole modeling,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 87, pp. 320–323.
    • El Jaroudi, A.1
  • 15
    • 84955014830
    • STL-QPSR 4, 7–11, Royal Institute of Technology, Stockholm.
    • Fant, G., and Risberg, A. (1962). “Auditory matching of vowels with two formant synthetic sounds,” STL-QPSR 4, 7–11, Royal Institute of Technology, Stockholm.
    • Fant, G.1
  • 16
    • 0003418124
    • Acoustic Theory of Speech Production
    • (Mouton, The Hague), 2nd printing, p. 123.
    • Fant, G. (1970). Acoustic Theory of Speech Production (Mouton, The Hague), 2nd printing, p. 123.
    • (1970)
    • Fant, G.1
  • 17
    • 30244446534
    • Vocal tract wall effects, losses and resonance bandwidths
    • STL-QPSR 2–3, 28–52, Royal Institute of Technology, Stockholm.
    • Fant, G. (1972). “Vocal tract wall effects, losses and resonance bandwidths,” STL-QPSR 2–3, 28–52, Royal Institute of Technology, Stockholm.
    • (1972)
    • Fant, G.1
  • 18
    • 84912675689
    • Difference limen for vowel formant frequency
    • Flanagan, J. (1955). “Difference limen for vowel formant frequency,” J. Acoust. Soc. Am. 27, 613–617.
    • (1955) J. Acoust. Soc. Am. 27 , pp. 613-617
    • Flanagan, J.1
  • 19
    • 0005065011
    • Estimates of maximum precision necessary in quantizing certain dimensions of vowel sounds
    • J. Acoust. Soc. Am. 29, 533–534.
    • Flanagan, J. (1957). “Estimates of maximum precision necessary in quantizing certain dimensions of vowel sounds,” J. Acoust. Soc. Am. 29, 533–534.
    • (1957)
    • Flanagan, J.1
  • 20
  • 21
    • 0014113409
    • On the second spectral peak of front vowels: a perceptual study of the role of the second and third formants
    • Fujimura, O. (1967). “On the second spectral peak of front vowels: a perceptual study of the role of the second and third formants,” Lang. Speech, 10, 181–193.
    • (1967) Lang. Speech , vol.10 , pp. 181-193
    • Fujimura, O.1
  • 22
    • 84955047944
    • Trans. Comm. Speech Res., Acoust. Soc. Japan, December 1973 (in Japanese).
    • Fujisaki, H., and Sato, Y. (1973). “Comparison of errors in formant frequencies obtained by various methods of formant extraction,” Trans. Comm. Speech Res., Acoust. Soc. Japan, December 1973 (in Japanese).
    • Fujisaki, H.1
  • 23
    • 84955027174
    • in Proceedings of International Conference on Chinese Information Processing, Beijing, China.
    • Gu, Y., and Mason, J. S. D. (1987). “Vocal tract and auditory feature analysis using Chinese utterance in ASR system,” in Proceedings of International Conference on Chinese Information Processing, Beijing, China.
    • Gu, Y.1
  • 24
    • 84955040649
    • Improved linear predictive analysis of speech based on spectral processing
    • Ph.D. dissertation, University of Tokyo.
    • Hermansky, H. (1982). “Improved linear predictive analysis of speech based on spectral processing,” Ph.D. dissertation, University of Tokyo.
    • (1982)
    • Hermansky, H.1
  • 26
    • 0022112505
    • Low-dimensional representation of vowels based on all-pole modeling in the psychophysical domain
    • Hermansky, H., Hanson, B. A., and Wakita, H. (1985). “Low-dimensional representation of vowels based on all-pole modeling in the psychophysical domain,” Speech Commun. 4, (1–3), 181–187.
    • (1985) Speech Commun. 4, (1–3) , pp. 181-187
    • Hermansky, H.1    Hanson, B.A.2    Wakita, H.3
  • 27
    • 0023167028
    • in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 87
    • Hermansky, H. (1987a). “An efficient speaker-independent automatic speech recognition by simulation of some properties of human auditory perception,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 87, pp. 1159–1162.
  • 28
    • 84955024049
    • J. Acoust. Soc. Am. Suppl. 1 81, S18; full text in STL Research Reports No. 1, Santa Barbara
    • Hermansky, H. (1987b). “Why is the formant frequency DL curve asymmetric?,” J. Acoust. Soc. Am. Suppl. 1 81, S18; full text in STL Research Reports No. 1, Santa Barbara, 1987.
    • (1987)
  • 31
    • 60049088346
    • On the role of fundamental frequency in vowel perception
    • 1 84, S156.
    • Hirahara, T. (1988). “On the role of fundamental frequency in vowel perception,” J. Acoust. Soc. Am. Suppl. 1 84, S156.
    • (1988) J. Acoust. Soc. Am. Suppl
    • Hirahara, T.1
  • 32
    • 0037662539
    • in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 76, Paper 9.2, pp. 310–313.
    • Itahashi, S., and Yokoyama, S. (1976). “Automatic formant extraction utilizing mel scale and equal loudness contour,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 76, Paper 9.2, pp. 310–313.
    • Itahashi, S.1
  • 33
    • 84955019699
    • Evaluation of ASR front-ends in speaker dependent and speaker independent recognition
    • J. Acoust. Soc. Am. Suppl. 1 81, S93; Full text in STL Research Reports No. 1, Santa Barbara, 1987.
    • Junqua, J. C. (1987). “Evaluation of ASR front-ends in speaker dependent and speaker independent recognition,” J. Acoust. Soc. Am. Suppl. 1 81, S93; Full text in STL Research Reports No. 1, Santa Barbara, 1987.
    • (1987)
    • Junqua, J.C.1
  • 34
    • 84955050098
    • J. Acoust. Soc. Am. Suppl. 1 78, S82.
    • Kamm, C., and Kahn, D. (1985). “Relationship between LP residual spectral distances and phonetic judgement,” J. Acoust. Soc. Am. Suppl. 1 78, S82.
    • Kamm, C.1
  • 35
    • 0016542558
    • On the front cavity resonance and its possible role in speech perception
    • Kuhn, G. M. (1975). “On the front cavity resonance and its possible role in speech perception,” J. Acoust. Soc. Am. 58, 428–433.
    • (1975) J. Acoust. Soc. Am. 58 , pp. 428-433
    • Kuhn, G.M.1
  • 36
    • 0018387712
    • Stop consonant place perception with single-formant stimuli: Evidence for the role of the front-cavity resonance
    • Kuhn, G. M. (1978). “Stop consonant place perception with single-formant stimuli: Evidence for the role of the front-cavity resonance,” J. Acoust. Soc. Am. 65, 774–788.
    • (1978) J. Acoust. Soc. Am. 65 , pp. 774-788
    • Kuhn, G.M.1
  • 38
    • 0016519041
    • Spectral linear prediction: properties and applications
    • Makhoul, J. (1975). “Spectral linear prediction: properties and applications,” IEEE Trans. ASSP-23, 283–296.
    • (1975) IEEE Trans. ASSP-23 , pp. 283-296
    • Makhoul, J.1
  • 39
    • 85067594456
    • in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 76, pp. 466–469, Philadelphia.
    • Makhoul, J., and Cosell, L. (1976). “LPCW: An LPC vocoder with linear predictive spectral warping,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 76, pp. 466–469, Philadelphia.
    • Makhoul, J.1
  • 41
    • 0038133939
    • Distance measures for speech recognition, psychological and instrumental
    • edited by C. H. Chen (Academic, New York), pp. 374–388.
    • Mermelstein, P. (1976). “Distance measures for speech recognition, psychological and instrumental,” in Pattern Recognition and Artificial Intelligence, edited by C. H. Chen (Academic, New York), pp. 374–388.
    • (1976) Pattern Recognition and Artificial Intelligence
    • Mermelstein, P.1
  • 42
    • 0020905044
    • Speech Commun. 2 (4), 295–303.
    • Paliwal, K. K., Lindsay, D., and Ainsworth, W. A. (1983). “A study of two-formant models for vowel identification,” Speech Commun. 2 (4), 295–303.
    • Paliwal, K.K.1
  • 43
    • 0000146349
    • Br. J. Appl. Phys.
    • Robinson, D.W., and Dadson, R.S. (1956). “A redetermination of the equal-loudness relations for pure tones,” Br. J. Appl. Phys. 7, 166–181.
    • (1956) Br. J. Appl. Phys. , vol.7 , pp. 166-181
  • 44
    • 0015008817
    • J. Acoust. Soc. Am. 49
    • Rosenberg, A. (1970). “Effect of glottal pulse shape on the quality of natural vowels,” J. Acoust. Soc. Am. 49, 583–590.
  • 45
    • 17344367464
    • Recognition of Complex Acoustic Signals, Life Sciences Research Report 5
    • edited by T. H. Bullock (Abakon Verlag, Berlin), p. 324
    • Schroeder, M. R. (1977). Recognition of Complex Acoustic Signals, Life Sciences Research Report 5, edited by T. H. Bullock (Abakon Verlag, Berlin), p. 324.
    • (1977)
    • Schroeder, M.R.1
  • 46
    • 84928841878
    • The acoustic features of speech phonemes in a model of auditory processing: Vowels and unvoiced fricatives
    • Shamma, S. A. (1988). “The acoustic features of speech phonemes in a model of auditory processing: Vowels and unvoiced fricatives,” J. Phon. 16, 79–91.
    • (1988) J. Phon , vol.16 , pp. 79-91
    • Shamma, S.A.1
  • 47
    • 34447546202
    • On the psychophysical law
    • Psychol. Rev. 64, 153–181.
    • Stevens, S. S. (1957). “On the psychophysical law,” Psychol. Rev. 64, 153–181.
    • (1957)
    • Stevens, S.S.1
  • 48
    • 0019068177
    • Linear prediction on a warped frequency scale
    • Strube, H. W. (1980). “Linear prediction on a warped frequency scale,” J. Acoust. Soc. Am. 68, 1071–1076.
    • (1980) J. Acoust. Soc. Am , vol.68 , pp. 1071-1076
    • Strube, H.W.1
  • 49
    • 84955025636
    • IEEE Trans. ASSP-26
    • Vishwanathan, R., and Makhoul, J. (1975). “Quantization properties of transmission parameters in linear predictive systems,” IEEE Trans. ASSP-26, 587–596.
    • Vishwanathan, R.1
  • 50
    • 0017969757
    • Formant extraction from linear prediction phase spectra
    • Yegnanarayana, B. (1977). “Formant extraction from linear prediction phase spectra,” J. Acoust. Soc. Am. 63, 1638–1640.
    • (1977) J. Acoust. Soc. Am , vol.63 , pp. 1638-1640
    • Yegnanarayana, B.1
  • 51
    • 0018319567
    • in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 79, pp. 744–747.
    • Yegnanarayana, B., and Reddy, R. (1979). “A distance measure derived from the first derivative of the linear prediction phase spectra,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 79, pp. 744–747.
    • Yegnanarayana, B.1
  • 52
    • 0002237909
    • Masking and psychological excitation as consequences of ear's frequency analysis
    • edited by R. Plomp and G. F. Smoorenburg (Sijthoff, Leyden, The Netherlands).
    • Zwicker, E. (1970). “Masking and psychological excitation as consequences of ear's frequency analysis,” in Frequency Analysis and Periodicity Detection in Hearing, edited by R. Plomp and G. F. Smoorenburg (Sijthoff, Leyden, The Netherlands).
    • (1970) Frequency Analysis and Periodicity Detection in Hearing
    • Zwicker, E.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.