



Volume 95, Issue 5, 1994, Pages 2702-2719

A statistical approach to automatic speech recognition using the atomic speech units constructed from overlapping articulatory features

Author keywords

[No Author keywords available]

Indexed keywords

ARTICLE; AUTOMATION; DATA BASE; MATHEMATICAL ANALYSIS; PRIORITY JOURNAL; SPEECH DISCRIMINATION; WORD RECOGNITION;

EID: 0028234947     PISSN: 00014966     EISSN: NA     Source Type: Journal    
DOI: 10.1121/1.409839     Document Type: Article
Times cited : (101)

References (39)
  • 2
    • 0001862769
    • An inequality and associated maximization technique in statistical estimation for probabilistic functions of Markov processes
    • Baum, L. E. (1972). “An inequality and associated maximization technique in statistical estimation for probabilistic functions of Markov processes,” Inequalities 3, 1–8.
    • (1972) Inequalities , vol.3 , pp. 1-8
    • Baum, L.E.1
  • 3
    • 0027024362
    • Articulatory phonology: An overview
    • Browman, C., and Goldstein, L. (1992). “Articulatory phonology: An overview,” Phonetica 49, 155–180.
    • (1992) Phonetica 49 , pp. 155-180
    • Browman, C.1    Goldstein, L.2
  • 4
    • 84955535347
    • Gestural specification using dynamically-defined articulatory structures
    • Browman, C., and Goldstein, L. (1990). “Gestural specification using dynamically-defined articulatory structures,” J. Phon. 18, 299–320.
    • (1990) J. Phon. 18 , pp. 299-320
    • Browman, C.1    Goldstein, L.2
  • 5
    • 84955548400
    • Towards an articulatory phonology
    • Browman, C., and Goldstein, L. (1986). “Towards an articulatory phonology,” Phonol. Yearbook 3, 219–252.
    • (1986) Phonol. Yearbook , vol.3 , pp. 219-252
    • Browman, C.1    Goldstein, L.2
  • 6
    • 0004119259
    • The Sound Pattern of English
    • Harper and Row, New York
    • Chomsky, N., and Halle, M. (1968). The Sound Pattern of English (Harper and Row, New York).
    • (1968)
    • Chomsky, N.1    Halle, M.2
  • 7
    • 84955553395
    • The geometry of phonological features
    • Clements, G. N. (1985). “The geometry of phonological features,” Phonol. Yearbook 2, 225–252.
    • (1985) Phonol. Yearbook , vol.2 , pp. 225-252
    • Clements, G.N.1
  • 8
    • 0019053271
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Davis, S., and Mermelstein, P. (1980). “Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences,” IEEE Trans. Acoust. Speech Signal Process. 28 (4), 357–365.
    • (1980) IEEE Trans. Acoust. Speech Signal Process. , vol.28 , Issue.4 , pp. 357-365
    • Davis, S.1    Mermelstein, P.2
  • 9
    • 0027678649
    • A stochastic model of speech incorporating hierarchical nonstationarity
    • Deng, L. (1993). “A stochastic model of speech incorporating hierarchical nonstationarity,” IEEE Trans. Speech Audio Process. 1 (4), 471–474.
    • (1993) IEEE Trans. Speech Audio Process. , vol.1 , Issue.4 , pp. 471-474
    • Deng, L.1
  • 10
    • 0026854213
    • A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal
    • Deng, L. (1992). “A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal,” Signal Process. 27 (1), 65–78.
    • (1992) Signal Process. , vol.27 , Issue.1 , pp. 65-78
    • Deng, L.1
  • 11
    • 0026185698
    • The semi-relaxed algorithm for parameter estimation of hidden Markov models
    • Deng, L. (1991). “The semi-relaxed algorithm for parameter estimation of hidden Markov models,” Comput. Speech Language 5 (3), 231–236.
    • (1991) Comput. Speech Language , vol.5 , Issue.3 , pp. 231-236
    • Deng, L.1
  • 12
    • 0026458724
    • Structural design of a hidden Markov model based speech recognizer using multivalued phonetic features: Comparison with segmental speech units
    • Deng, L., and Erler, K. (1992). “Structural design of a hidden Markov model based speech recognizer using multivalued phonetic features: Comparison with segmental speech units,” J. Acoust. Soc. Am. 92, 3058–3067.
    • (1992) J. Acoust. Soc. Am. , vol.92 , pp. 3058-3067
    • Deng, L.1    Erler, K.2
  • 13
    • 0026189555
    • Phonemic hidden Markov models with continuous mixture output densities for large vocabulary word recognition
    • Deng, L., Kenny, P., Lennig, M., Gupta, V., Seitz, F., and Mermelstein, P. (1991). “Phonemic hidden Markov models with continuous mixture output densities for large vocabulary word recognition,” IEEE Trans. Acoust. Speech, Signal Process. 39 (7), 1677–1681.
    • (1991) IEEE Trans. Acoust. Speech, Signal Process. , vol.39 , Issue.7 , pp. 1677-1681
    • Deng, L.1    Kenny, P.2    Lennig, M.3    Gupta, V.4    Seitz, F.5    Mermelstein, P.6
  • 14
    • 0026821564
    • Modeling acoustic transitions in speech by state-interpolation hidden Markov models
    • Deng, L., Kenny, P., Lennig, M., and Mermelstein, P. (1992). “Modeling acoustic transitions in speech by state-interpolation hidden Markov models,” IEEE Trans. Signal Process. 40 (2), 265–272.
    • (1992) IEEE Trans. Signal Process. , vol.40 , Issue.2 , pp. 265-272
    • Deng, L.1    Kenny, P.2    Lennig, M.3    Mermelstein, P.4
  • 15
    • 0024382082
    • Use of vowel duration information in a large vocabulary word recognizer
    • Deng, L., Lennig, M., and Mermelstein, P. (1989). “Use of vowel duration information in a large vocabulary word recognizer,” J. Acoust. Soc. Am. 86, 540–548.
    • (1989) J. Acoust. Soc. Am. 86 , pp. 540-548
    • Deng, L.1    Lennig, M.2    Mermelstein, P.3
  • 16
    • 0025145948
    • Modeling microsegments of stop consonants in a hidden Markov model based word recognizer
    • Deng, L., Lennig, M., and Mermelstein, P. (1990). “Modeling microsegments of stop consonants in a hidden Markov model based word recognizer,” J. Acoust. Soc. Am. 87, 2738–2747.
    • (1990) J. Acoust. Soc. Am. 87 , pp. 2738-2747
    • Deng, L.1    Lennig, M.2    Mermelstein, P.3
  • 17
    • 10244257175
    • Large vocabulary word recognition using context-dependent allophonic hidden Markov models
    • Deng, L., Lennig, M., Seitz, F., and Mermelstein, P. (1990). “Large vocabulary word recognition using context-dependent allophonic hidden Markov models,” Comput. Speech Language 4, 345–357.
    • (1990) Comput. Speech Language , vol.4 , pp. 345-357
    • Deng, L.1    Lennig, M.2    Seitz, F.3    Mermelstein, P.4
  • 18
    • 0027627252
    • Hidden Markov model representation of quantized articulatory features for speech recognition
    • Erler, K., and Deng, L. (1993). “Hidden Markov model representation of quantized articulatory features for speech recognition,” Comput. Speech Language 7 (3), 101–118.
    • (1993) Comput. Speech Language , vol.7 , Issue.3 , pp. 101-118
    • Erler, K.1    Deng, L.2
  • 19
    • 0008776682
    • Knowledge of language and the sounds of speech
    • edited by J. Sundberg, L. Nord, and R. Carlson (MacMillan, London)
    • Halle, M., and Stevens, K. (1991). “Knowledge of language and the sounds of speech,” in Music, Language, Speech, and Brain, edited by J. Sundberg, L. Nord, and R. Carlson (MacMillan, London), pp. 1–19.
    • (1991) Music, Language, Speech, and Brain , pp. 1-19
    • Halle, M.1    Stevens, K.2
  • 22
    • 0025419316
    • Context-dependent phonetic hidden Markov models for continuous speech recognition
    • Lee, K., and Hon, H. (1990). “Context-dependent phonetic hidden Markov models for continuous speech recognition,” IEEE Trans. Signal Process. 38 (4), 599–609.
    • (1990) IEEE Trans. Signal Process. , vol.38 , Issue.4 , pp. 599-609
    • Lee, K.1    Hon, H.2
  • 23
    • 0024768209
    • Speaker-independent phone recognition using hidden Markov models
    • Lee, K., and Hon, H. (1989). “Speaker-independent phone recognition using hidden Markov models,” IEEE Trans. Signal Process. 37 (11), 1641–1648.
    • (1989) IEEE Trans. Signal Process. , vol.37 , Issue.11 , pp. 1641-1648
    • Lee, K.1    Hon, H.2
  • 24
    • 0003791727
    • Readings in Acoustic Phonetics
    • MIT, Cambridge, MA
    • Lehiste, I. (1967). Readings in Acoustic Phonetics (MIT, Cambridge, MA).
    • (1967)
    • Lehiste, I.1
  • 26
    • 84940803687
    • Feature geometry and dependency: a review
    • McCarthy, J. (1988). “Feature geometry and dependency: a review,” Phonetica 43, 84–108.
    • (1988) Phonetica 43 , pp. 84-108
    • McCarthy, J.1
  • 27
    • 0015613574
    • Articulatory model for the study of speech production
    • Mermelstein, P. (1973). “Articulatory model for the study of speech production,” J. Acoust. Soc. Am. 53, 1070–1082.
    • (1973) J. Acoust. Soc. Am. 53 , pp. 1070-1082
    • Mermelstein, P.1
  • 29
    • 0024610919
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Rabiner, L. (1989). “A tutorial on hidden Markov models and selected applications in speech recognition,” Proc. IEEE 77, 257–285.
    • (1989) Proc. IEEE 77 , pp. 257-285
    • Rabiner, L.1
  • 30
    • 0023032731
    • Task dynamic coordination of the speech articulators: A preliminary model
    • edited by H. Heuer and C. Fromm (Springer-Verlag, New York)
    • Saltzman, E. (1986). “Task dynamic coordination of the speech articulators: A preliminary model,” in Generation and Modulation of Action Patterns, edited by H. Heuer and C. Fromm (Springer-Verlag, New York), pp. 129–144.
    • (1986) Generation and Modulation of Action Patterns , pp. 129-144
    • Saltzman, E.1
  • 31
    • 77956779481
    • A dynamical approach to gestural patterning in speech production
    • Saltzman, E., and Munhall, K. (1989). “A dynamical approach to gestural patterning in speech production,” Ecol. Psychol. 1 (4), 333–382.
    • (1989) Ecol. Psychol. , vol.1 , Issue.4 , pp. 333-382
    • Saltzman, E.1    Munhall, K.2
  • 32
    • 0021142214
    • Improved hidden Markov modeling of phonemes for continuous speech recognition
    • Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 35.6.1-35.6.4
    • Schwartz, R., Chow, Y., Roucos, S., Krasner, M., and Makhoul, J. (1984). “Improved hidden Markov modeling of phonemes for continuous speech recognition,” Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 35.6.1-35.6.4.
    • (1984) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing , pp. 35.6.1-35.6.4
    • Schwartz, R.1    Chow, Y.2    Roucos, S.3    Krasner, M.4    Makhoul, J.5
  • 33
    • 0039201394
    • Models of phonetic recognition II: An approach to feature-based recognition
    • Symposium on Units and Their Representation in Speech Recognition, edited by P. Mermelstein (12th Int’l Congress on Acoustics, Montreal)
    • Stevens, K. (1986). “Models of phonetic recognition II: An approach to feature-based recognition,” in Symposium on Units and Their Representation in Speech Recognition, edited by P. Mermelstein (12th Int’l Congress on Acoustics, Montreal), pp. 67–69.
    • (1986) Symposium on Units and Their Representation in Speech Recognition , pp. 67-69
    • Stevens, K.1
  • 34
    • 84936526529
    • On the quantal nature of speech
    • Stevens, K. (1989). “On the quantal nature of speech,” J. Phon. 17, 3–45.
    • (1989) J. Phon. 17 , pp. 3-45
    • Stevens, K.1
  • 36
    • 0022151324
    • The use of speech knowledge in automatic speech recognition
    • Zue, V. (1985). “The use of speech knowledge in automatic speech recognition,” Proc. IEEE 73, 1602–1550.
    • (1985) Proc. IEEE 73 , pp. 1550-1602
    • Zue, V.1
  • 37
    • 0040083548
    • Notes on Speech Spectrogram Reading
    • course notes, M.I.T., Cambridge, MA
    • Zue, V. (1991). “Notes on Speech Spectrogram Reading”, course notes, M.I.T., Cambridge, MA.
    • (1991)
    • Zue, V.1
  • 39
    • 84956048285
    • Nasal articulation in homorganic clusters in American English
    • Zue, V., and Sia, E. (1982). “Nasal articulation in homorganic clusters in American English,” Working Papers in MIT Speech Commun. Group 1, pp. 9–17.
    • (1982) Working Papers in MIT Speech Commun. Group , pp. 9-17
    • Zue, V.1    Sia, E.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.