SCOPUS 정보 검색 플랫폼

Journal of the Acoustical Society of America

Volumn 95, Issue 5, 1994, Pages 2702-2719

A statistical approach to automatic speech recognition using the atomic speech units constructed from overlapping articulatory features

(2) Deng, Li a Sun, Don X b

a UNIVERSITY OF WATERLOO (Canada)

b STONY BROOK UNIVERSITY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ARTICLE; AUTOMATION; DATA BASE; MATHEMATICAL ANALYSIS; PRIORITY JOURNAL; SPEECH DISCRIMINATION; WORD RECOGNITION;

EID: 0028234947 PISSN: 00014966 EISSN: NA Source Type: Journal
DOI: 10.1121/1.409839 Document Type: Article

Times cited : (101)

References (39)

1
- 0039046406
- Coarticulation modeling with continuous-state HMMs
- Arden House, Harriman, New York
- Bakis, R. (1991). “Coarticulation modeling with continuous-state HMMs,” Proceedings of the 1991 IEEE Workshop on Automatic Speech Recognition (Arden House, Harriman, New York), pp. 20–21.
- (1991) Proceedings of the 1991 IEEE Workshop on Automatic Speech Recognition , pp. 20-21
- Bakis, R.¹

2
- 0001862769
- An inequality and associated maximization technique in statistical estimation for probabilistic functions of Markov processes
- Baum, L. E. (1972). “An inequality and associated maximization technique in statistical estimation for probabilistic functions of Markov processes,” Inequalities 3, 1–8.
- (1972) Inequalities , vol.3 , pp. 1-8
- Baum, L.E.¹

3
- 0027024362
- Articulatory phonology: An overview
- Browman, C., and Goldstein, L. (1992). “Articulatory phonology: An overview,” Phonetica 49, 155–180.
- (1992) Phonetica 49 , pp. 155-180
- Browman, C.¹ Goldstein, L.²

4
- 84955535347
- Gestural specification using dynamically-defined articulatory structures
- Browman, C., and Goldstein, L. (1990). “Gestural specification using dynamically-defined articulatory structures,” J. Phon. 18, 299–320.
- (1990) J. Phon. 18 , pp. 299-320
- Browman, C.¹ Goldstein, L.²

5
- 84955548400
- Towards an articulatory phonology
- Browman, C., and Goldstein, L. (1986). “Towards an articulatory phonology,” Phonol. Yearbook 3, 219–252.
- (1986) Phonol. Yearbook , vol.3 , pp. 219-252
- Browman, C.¹ Goldstein, L.²

6
- 0004119259
- The Sound Pattern of English
- Harper and Row, New York
- Chomsky, N., and Halle, M. (1968). The Sound Pattern of English (Harper and Row, New York).
- (1968)
- Chomsky, N.¹ Halle, M.²

7
- 84955553395
- The geometry of phonological features
- Clements, G. N. (1985). “The geometry of phonological features,” Pho-nol. Yearbook 2, 225–252.
- (1985) Pho-nol. Yearbook , vol.2 , pp. 225-252
- Clements, G.N.¹

8
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- Davis, S., and Mermelstein, P. (1980). “Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences,” IEEE Trans. Acoust. Speech Signal Process. 28 (4), 357–365.
- (1980) IEEE Trans. Acoust. Speech Signal Process. , vol.28 , Issue.4 , pp. 357-365
- Davis, S.¹ Mermelstein, P.²

9
- 0027678649
- A stochastic model of speech incorporating hierarchical nonstationarity
- Deng, L. (1993). “A stochastic model of speech incorporating hierarchical nonstationarity,” IEEE Trans. Speech Audio Process. 1 (4), 471–474.
- (1993) IEEE Trans. Speech Audio Process. , vol.1 , Issue.4 , pp. 471-474
- Deng, L.¹

10
- 0026854213
- A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal
- Deng, L. (1992). “A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal,” Signal Process. 27 (1), 65–78.
- (1992) Signal Process. , vol.27 , Issue.1 , pp. 65-78
- Deng, L.¹

11
- 0026185698
- The semi-relaxed algorithm for parameter estimation of hidden Markov models
- Deng, L. (1991). “The semi-relaxed algorithm for parameter estimation of hidden Markov models,” Comput. Speech Language 5 (3), 231–236.
- (1991) Comput. Speech Language , vol.5 , Issue.3 , pp. 231-236
- Deng, L.¹

12
- 0026458724
- Structural design of a hidden Markov model based speech recognizer using multivalued phonetic features: Comparison with segmental speech units
- Deng, L., and Erler, K. (1992). “Structural design of a hidden Markov model based speech recognizer using multivalued phonetic features: Comparison with segmental speech units,” J. Acoust. Soc. Am. 92, 3058–3067.
- (1992) J. Acoust. Soc. Am. , vol.92 , pp. 3058-3067
- Deng, L.¹ Erler, K.²

13
- 0026189555
- Phonemic hidden Markov models with continuous mixture output densities for large vocabulary word recognition
- Deng, L., Kenny, P. Lennig, M., Gupta, V., Seitz, F., and Mermelstein, P. (1991). “Phonemic hidden Markov models with continuous mixture output densities for large vocabulary word recognition,” IEEE Trans. Acoust. Speech, Signal Process. 39 (7), 1677–1681.
- (1681) IEEE Trans. Acoust. Speech, Signal Process. , vol.39 , Issue.7
- Deng, L.¹ Kenny, P.² Lennig, M.³ Gupta, V.⁴ Seitz, F.⁵ Mermelstein, P.⁶

14
- 0026821564
- Modeling acoustic transitions in speech by state-interpolation hidden Markov models
- Deng, L., Kenny, P., Lennig, M., and Mermelstein, P. (1992). “Modeling acoustic transitions in speech by state-interpolation hidden Markov models,” IEEE Trans. Signal Process. 40 (2), 265–272.
- (1992) IEEE Trans. Signal Process. , vol.40 , Issue.2 , pp. 265-272
- Deng, L.¹ Kenny, P.² Lennig, M.³ Mermelstein, P.⁴

15
- 0024382082
- Use of vowel duration information in a large vocabulary word recognizer
- Deng, L., Lennig, M., and Mermelstein, P. (1989). “Use of vowel duration information in a large vocabulary word recognizer,” J. Acoust. Soc. Am. 86, 540–548.
- (1989) J. Acoust. Soc. Am. 86 , pp. 540-548
- Deng, L.¹ Lennig, M.² Mermelstein, P.³

16
- 0025145948
- Modeling microsegments of stop consonants in a hidden Markov model based word recognizer
- Deng, L., Lennig, M., and Mermelstein, P. (1990). “Modeling microsegments of stop consonants in a hidden Markov model based word recognizer,” J. Acoust. Soc. Am. 87, 2738–2747.
- (1990) J. Acoust. Soc. Am. 87 , pp. 2738-2747
- Deng, L.¹ Lennig, M.² Mermelstein, P.³

17
- 10244257175
- Large vocabulary word recognition using context-dependent allophonic hidden Markov models
- Deng, L., Lennig, M., Seitz, F., and Mermelstein, P. (1990). “Large vocabulary word recognition using context-dependent allophonic hidden Markov models,” Comput. Speech Language 4, 345–357.
- (1990) Comput. Speech Language , vol.4 , pp. 345-357
- Deng, L.¹ Lennig, M.² Seitz, F.³ Mermelstein, P.⁴

18
- 0027627252
- Hidden Markov model representation of quantized articulatory features for speech recognition
- Erler, K., and Deng, L. (1993). “Hidden Markov model representation of quantized articulatory features for speech recognition,” Comput. Speech Language 7 (3), 101–118.
- (1993) Comput. Speech Language , vol.7 , Issue.3 , pp. 101-118
- Erler, K.¹ Deng, L.²

19
- 0008776682
- Knowledge of language and the sounds of speech
- edited by J. Sundberg, L. Nord, and R. Carlson (MacMillan, London)
- Halle, M., and Stevens, K. (1991). “Knowledge of language and the sounds of speech,” in Music, Language, Speech, and Brain, edited by J. Sundberg, L. Nord, and R. Carlson (MacMillan, London), pp. 1–19.
- (1991) Music, Language, Speech, and Brain , pp. 1-19
- Halle, M.¹ Stevens, K.²

20
- 0027153655
- Predicting unseen triphones with senones
- Hwang, M., Huang, X., and Alieva, F. (1993). “Predicting unseen triphones with senones,” Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 1, pp. 311–315.
- (1993) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing , vol.1 , pp. 311-315
- Hwang, M.¹ Huang, X.² Alieva, F.³

21
- 0039638379
- Articulatory Markov models
- Arden House, Harriman, New York
- Kenny, P., Zhao, R., Gupta, V., Lennig, M., and O’Shaughnessy, D. (1991). “Articulatory Markov models,” Proceedings of the 1991 IEEE Workshop on Automatic Speech Recognition (Arden House, Harriman, New York), pp. 22–23.
- (1991) Proceedings of the 1991 IEEE Workshop on Automatic Speech Recognition , pp. 22-23
- Kenny, P.¹ Zhao, R.² Gupta, V.³ Lennig, M.⁴ O’Shaughnessy, D.⁵

22
- 0025419316
- Context-dependent phonetic hidden Markov models for continuous speech recognition
- Lee, K., and Hon, H. (1990). “Context-dependent phonetic hidden Markov models for continuous speech recognition,” IEEE Trans. Signal Process. 38 (4), 599–609.
- (1990) IEEE Trans. Signal Process. , vol.38 , Issue.4 , pp. 599-609
- Lee, K.¹ Hon, H.²

23
- 0024768209
- Speaker-independent phone recognition using hidden Markov models
- Lee, K., and Hon, H. (1989). “Speaker-independent phone recognition using hidden Markov models,” IEEE Trans. Signal Process. 37 (11), 1641–1648.
- (1989) IEEE Trans. Signal Process. , vol.37 , Issue.11 , pp. 1641-1648
- Lee, K.¹ Hon, H.²

24
- 0003791727
- Readings in Acoustic Phonetics
- MIT, Cambridge, MA
- Lehiste, I. (1967). Readings in Acoustic Phonetics (MIT, Cambridge, MA).
- (1967)
- Lehiste, I.¹

25
- 0027166399
- A comparative study of signal representations and classification techniques for speech recognition
- Leung, H., Chigier, B., and Glass, J. (1993). “A comparative study of signal representations and classification techniques for speech recognition,” Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 1, pp. 680–683.
- (1993) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing , vol.1 , pp. 680-683
- Leung, H.¹ Chigier, B.² Glass, J.³

26
- 84940803687
- Feature geometry and dependency: a review
- McCarthy, J. (1988). “Feature geometry and dependency: a review,” Phonetica 43, 84–108.
- (1988) Phonetica 43 , pp. 84-108
- McCarthy, J.¹

27
- 0015613574
- Articulatory model for the study of speech production
- Mermelstein, P. (1973). “Articulatory model for the study of speech production,” J. Acoust. Soc. Am. 53, 1070–1082.
- (1973) J. Acoust. Soc. Am. 53 , pp. 1070-1082
- Mermelstein, P.¹

28
- 84914802505
- Automatic discovery of acoustic measurements for phonetic classification
- Phillips, M., Glass, J., and, Zue, V. (1992). “Automatic discovery of acoustic measurements for phonetic classification,” Proceedings of the IEEE International Conference on Spoken Language Processing, Vol. 1, pp. 795–798.
- (1992) Proceedings of the IEEE International Conference on Spoken Language Processing , vol.1 , pp. 795-798
- Phillips, M.¹ Glass, J.² Zue, V.³

29
- 0024610919
- A tutorial on hidden Markov models and selected applications in speech recognition
- Rabiner, L. (1989). “A tutorial on hidden Markov models and selected applications in speech recognition,” Proc. IEEE 77, 257–285.
- (1989) Proc. IEEE 77 , pp. 257-285
- Rabiner, L.¹

30
- 0023032731
- Task dynamic coordination of the speech articulators: A preliminary model
- edited by H. Heuer and C. Fromm (Springer-Verlag, New York)
- Saltzman, E. (1986). “Task dynamic coordination of the speech articulators: A preliminary model,” in Generation and Modulation of Action Patterns, edited by H. Heuer and C. Fromm (Springer-Verlag, New York), pp. 129–144.
- (1986) Generation and Modulation of Action Patterns , pp. 129-144
- Saltzman, E.¹

31
- 77956779481
- A dynamical approach to gestural patterning in speech production
- Saltzman, E., and Munhall, K. (1989). “A dynamical approach to gestural patterning in speech production,” Ecol. Psychol. 1 (4), 333–382.
- (1989) Ecol. Psychol. , vol.1 , Issue.4 , pp. 333-382
- Saltzman, E.¹ Munhall, K.²

32
- 0021142214
- Improved hidden Markov modeling of phonemes for continuous speech recognition
- Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing 6.1-35.6.4.
- Schwartz, R., Chow, Y., Roucos, S., Krasner, M., and Makhoul, J. (1984). “Improved hidden Markov modeling of phonemes for continuous speech recognition,” Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 35.6.1-35.6.4.
- (1984) , pp. 35
- Schwartz, R.¹ Chow, Y.² Roucos, S.³ Krasner, M.⁴ Makhoul, J.⁵

33
- 0039201394
- Models of phonetic recognition II: An approach to feature-based recognition
- Representation in Speech Recognition, edited by P. Mermelstein (12th Int’l Congress on Acoustics, Montreal)
- Stevens, K. (1986). “Models of phonetic recognition II: An approach to feature-based recognition,” in Symposium on Units and Their Representation in Speech Recognition, edited by P. Mermelstein (12th Int’l Congress on Acoustics, Montreal), pp. 67–69.
- (1986) Symposium on Units and Their , pp. 67-69
- Stevens, K.¹

34
- 84936526529
- On the quantal nature of speech
- Stevens, K. (1989). “On the quantal nature of speech,” J. Phon. 17, 3–45.
- (1989) J. Phon. 17 , pp. 3-45
- Stevens, K.¹

35
- 85135109310
- Implementation of a model for lexical access based on features
- Stevens, K., Manuel, S., Shattuck-Hufnagel, S., and Liu, S. (1992). “Implementation of a model for lexical access based on features,” Proc. Int. Conf. Spoken Language Process. 1, 499–502.
- (1992) Proc. Int. Conf. Spoken Language Process. , vol.1 , pp. 499-502
- Stevens, K.¹ Manuel, S.² Shattuck-Hufnagel, S.³ Liu, S.⁴

36
- 0022151324
- The use of speech knowledge in automatic speech recognition
- Zue, V. (1985). “The use of speech knowledge in automatic speech recognition,” Proc. IEEE 73, 1602–1550.
- (1985) Proc. IEEE 73 , pp. 1550-1602
- Zue, V.¹

37
- 0040083548
- Notes on Speech Spectrogram Reading
- course notes, M.I.T., Cambridge, MA
- Zue, V. (1991). “Notes on Speech Spectrogram Reading”, course notes, M.I.T., Cambridge, MA.
- (1991)
- Zue, V.¹

38
- 0025587109
- The SUMMIT speech recognition system: Phonological modeling and lexical access
- Zue, V., Glass, J., Goodine, D., Phillips, M., and Seneff, S. (1990). “The SUMMIT speech recognition system: Phonological modeling and lexical access,” Proc. IEEE Int. Conf. Acoust. Speech, Signal Process. 1, 49–52.
- (1990) Proc. IEEE Int. Conf. Acoust. Speech, Signal Process. , vol.1 , pp. 49-52
- Zue, V.¹ Glass, J.² Goodine, D.³ Phillips, M.⁴ Seneff, S.⁵

39
- 84956048285
- Nasal articulation in homorganic clusters in American English
- Zue, V., and Sia, E. (1982). “Nasal articulation in homorganic clusters in American English,” Working Papers in MIT Speech Commun. Group 1, pp. 9–17.
- (1982) Working Papers in MIT Speech Commun. Group , pp. 9-17
- Zue, V.¹ Sia, E.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.