SCOPUS 정보 검색 플랫폼

IEEE Transactions on Speech and Audio Processing

Volumn 4, Issue 6, 1996, Pages 430-445

High-performance alphabet recognition

(2) Loizou, P C a Spanias, Andreas S a

a University of Arkansas (United States)

Author keywords

[No Author keywords available]

Indexed keywords

CHARACTER SETS; MARKOV PROCESSES; MATHEMATICAL MODELS; PATTERN RECOGNITION SYSTEMS; PERFORMANCE; SPEECH COMMUNICATION;

ALPHABET RECOGNITION; ALPHABET RECOGNIZER; ALPHABET VOCABULARY; HIDDEN MARKOV MODELS;

CHARACTER RECOGNITION;

EID: 0030286185 PISSN: 10636676 EISSN: None Source Type: Journal
DOI: 10.1109/89.544528 Document Type: Article

Times cited : (62)

References (57)

1
- 0028378020
- Applications of voice processing, to telecommunications
- L. Rabiner, "Applications of voice processing, to telecommunications," in Proc. IEEE, vol. 82, no. 2, pp. 199-228, Feb. 1994.
- (1994) Proc. IEEE , vol.82 , Issue.2 , pp. 199-228
- Rabiner, L.¹

2
- 0003572996
- Ph.D. thesis, Carnegie Mellon Univ., Pittsburgh, PA, Apr.
- P. Brown, "The acoustic-modeling problem in automatic speech recognition,"Ph.D. thesis, Carnegie Mellon Univ., Pittsburgh, PA, Apr. 1987.
- (1987) The acoustic-modeling problem in automatic speech recognition
- Brown, P.¹

3
- 0027579316
- Discriminative training of dynamic programming based speech recognizers
- Apr.
- P. Chang and B. Juang, "Discriminative training of dynamic programming based speech recognizers," IEEE Trans. Speech Audio Processing, vol. 1, no. 2, pp. 135-143, Apr. 1993.
- (1993) IEEE Trans. Speech Audio Processing , vol.1 , Issue.2 , pp. 135-143
- Chang, P.¹ Juang, B.²

4
- 0025588058
- A probabilistic acoustic MAP based discriminative HMM training
- E. Huang and F. Soong, "A probabilistic acoustic MAP based discriminative HMM training," in Proc. Int. Conf. Acoustics, Speech, Signal Processing, 1990, pp. 693-696.
- (1990) Proc. Int. Conf. Acoustics, Speech, Signal Processing , pp. 693-696
- Huang, E.¹ Soong, F.²

5
- 0028195650
- Speech recognition using weighted HMM and subspace projection approaches
- Jan.
- K. Su and C. Lee, "Speech recognition using weighted HMM and subspace projection approaches," IEEE Trans. Speech Audio Processing, vol. 2, no. 1, pp. 69-79, Jan. 1994.
- (1994) IEEE Trans. Speech Audio Processing , vol.2 , Issue.1 , pp. 69-79
- Su, K.¹ Lee, C.²

6
- 0001462521
- A cross-language study of voicing in initial stops: Acoustical measurements
- L. Lisker and S. Abramson, "A cross-language study of voicing in initial stops: Acoustical measurements," Word, vol. 20, pp. 384-422, 1964.
- (1964) Word , vol.20 , pp. 384-422
- Lisker, L.¹ Abramson, S.²

7
- 0003467241
- Ph.D. thesis, Mass. Inst. of Technol., Cambridge, MA, May
- V. Zue, "Acoustic characteristics of stop consonants: A controlled study,"Ph.D. thesis, Mass. Inst. of Technol., Cambridge, MA, May 1976.
- (1976) Acoustic characteristics of stop consonants: A controlled study
- Zue, V.¹

8
- 0003418124
- The Hague, Netherlands: Mouton
- G. Fant, Acoustic Theory of Speech Production. The Hague, Netherlands: Mouton, 1970.
- (1970) Acoustic Theory of Speech Production
- Fant, G.¹

9
- 0004225947
- San Diego, CA: Singular
- R. Kent and C. Read, The Acoustic Analysis of Speech. San Diego, CA: Singular, 1992.
- (1992) The Acoustic Analysis of Speech
- Kent, R.¹ Read, C.²

10
- 0001559782
- Analysis of nasal consonants
- Dec.
- O. Fujimura, "Analysis of nasal consonants," J. Acoust. Soc. Amer., vol. 34, no. 12, pp. 1865-1875, Dec. 1962.
- (1962) J. Acoust. Soc. Amer. , vol.34 , Issue.12 , pp. 1865-1875
- Fujimura, O.¹

11
- 0008457913
- "Speech coding and recognition: A review,"
- Feb.
- A. Spanias and F. Wu, "Speech coding and recognition: A review," IEICE Trans. Fundamentals, vol. E75-A, no. 2, pp. 132-148, Feb. 1992.
- (1992) IEICE Trans. Fundamentals , vol.E75-A , Issue.2 , pp. 132-148
- Spanias, A.¹ Wu, F.²

12
- 0016467604
- "Minimum prediction residual applied to speech recognition
- F. Itakura, "Minimum prediction residual applied to speech recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-23, no. 1, pp. 67-72, Feb. 1975.
- (1975) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-23 , Issue.1 , pp. 67-72
- Itakura, F.¹

13
- 0018656519
- "Speaker independent recognition of isolated words using clustering techniques,"
- L. Rabiner, S. Levinson, A. Rosenberg, and J. Wilpon, "Speaker independent recognition of isolated words using clustering techniques," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-27, pp. 336-349, Aug. 1979.
- (1979) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-27 , pp. 336-349
- Rabiner, L.¹ Levinson, S.² Rosenberg, A.³ Wilpon, J.⁴

14
- 0019680113
- Isolated word recognition using a two-pass pattern recognition approach
- L. Rabiner and J. Wilpon, "Isolated word recognition using a two-pass pattern recognition approach," in Proc. Int. Conf. Acoustics, Speech, Signal Processing, vol. 2, 1981, pp. 724-727.
- (1981) Proc. Int. Conf. Acoustics, Speech, Signal Processing , vol.2 , pp. 724-727
- Rabiner, L.¹ Wilpon, J.²

15
- 85061400113
- Comparison of learning techniques in speech recognition
- G. Bradshaw, R. Cole, and L. Zi, "Comparison of learning techniques in speech recognition," in Proc. Int. Conf. Acoustics, Speech, Signal Processing, 1982, pp. 554-557.
- (1982) Proc. Int. Conf. Acoustics, Speech, Signal Processing , pp. 554-557
- Bradshaw, G.¹ Cole, R.² Zi, L.³

16
- 33646908717
- "Performance improvement in a dynamic-programming based isolated word recognition system for the alpha-digit task
- L. Lamel and V. Zue, "Performance improvement in a dynamic-programming based isolated word recognition system for the alpha-digit task," in Proc. Int. Conf. Acoust., Speech, Signal Processing, 1982, pp. 558-561.
- (1982) Proc. Int. Conf. Acoust., Speech, Signal Processing , pp. 558-561
- Lamel, L.¹ Zue, V.²

17
- 0001887625
- "Performing fine phonetic distinctions: Templates vs. features
- J. Perkell and D. Klatt, Eds. New York: Lawrence Erlbaum
- R. Cole, R. Stern, and M. Lasry, "Performing fine phonetic distinctions: Templates vs. features," in Invariance and Variability of Speech Processes, J. Perkell and D. Klatt, Eds. New York: Lawrence Erlbaum, 1986, pp. 325-341.
- (1986) Invariance and Variability of Speech Processes , pp. 325-341
- Cole, R.¹ Stern, R.² Lasry, M.³

18
- 0039627352
- "Speech as patterns on paper
- R. Cole, Ed. New York: Lawrence Erlbaum
- R. Cole, A. Rudnicky, V. Zue, and R. Reddy, "Speech as patterns on paper," in Perception and Production of Fluent Speech, R. Cole, Ed. New York: Lawrence Erlbaum, 1978.
- (1978) Perception and Production of Fluent Speech
- Cole, R.¹ Rudnicky, A.² Zue, V.³ Reddy, R.⁴

19
- 0004989362
- "Some performance benchmarks for isolated word speech recognition systems
- L. Rabiner and J. Wilpon, "Some performance benchmarks for isolated word speech recognition systems," Comput. Speech Language, vol. 2, pp. 343-357, 1987.
- (1987) Comput. Speech Language , vol.2 , pp. 343-357
- Rabiner, L.¹ Wilpon, J.²

20
- 0025659601
- Statistical segmentation and word modeling techniques in isolated word recognition
- S. Euler, B. Juang, C. Lee, and F. Soong, Statistical segmentation and word modeling techniques in isolated word recognition," in Proc. Int. Conf. Acoust., Speech, Signal Processing, 1990, pp. 745-748.
- (1990) Proc. Int. Conf. Acoust., Speech, Signal Processing , pp. 745-748
- Euler B Juang, S.¹ Lee, C.² Soong, F.³

21
- 0025557590
- "Speaker-independent recognition of spoken English letters
- June
- R. Cole, M. Fanty, Y. Muthusamy, and M. Gopalakrishnan, "Speaker-independent recognition of spoken English letters," in Proc. Int. Joint Conf. Neural Networks, vol. 2, June 1990, pp. 45-51.
- (1990) Proc. Int. Joint Conf. Neural Networks , vol.2 , pp. 45-51
- Cole, R.¹ Fanty, M.² Muthusamy, Y.³ Gopalakrishnan, M.⁴

22
- 0002516752
- "Spoken letter recognition
- M. Fanty and R. Cole, "Spoken letter recognition," in Proc. Neural Inform. Processing Syst. Conf., Nov. 1990, pp. 220-226.
- (1990) Proc. Neural Inform. Processing Syst. Conf., Nov. , pp. 220-226
- Fanty, M.¹ Cole, R.²

23
- 33646901212
- "English alphabet recognition of telephone speech
- R. Cole, K. Roginski, and M. Fanty, "English alphabet recognition of telephone speech," in Proc. 2nd Euro. Conf. Speech Commun. Technol., 1991, pp. 24-26.
- (1991) Proc. 2nd Euro. Conf. Speech Commun. Technol. , pp. 24-26
- Cole, R.¹ Roginski, K.² Fanty, M.³

24
- 0003640523
- "The ISOLET spoken letter database
- Oregon Graduate Inst.
- R. Cole, Y. Muthusamy, and M. Fanty, "The ISOLET spoken letter database," Tech. Rep. 90-004, Oregon Graduate Inst., 1990.
- (1990) Tech. Rep. 90-004
- Cole, R.¹ Muthusamy, Y.² Fanty, M.³

25
- 0002583871
- "Speech database development: Design and analysis of the acoustic phonetic corpus
- L. Lamel, R. Kassel, and S. Seneff, "Speech database development: Design and analysis of the acoustic phonetic corpus," in Proc. DARPA Speech Recognition Workshop, 1986, pp. 100-109.
- (1986) Proc. DARPA Speech Recognition Workshop , pp. 100-109
- Lamel, L.¹ Kassel, R.² Seneff, S.³

26
- 0019053271
- "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- Aug.
- B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-28, no. 4, pp. 357-366, Aug. 1980.
- (1980) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-28 , Issue.4 , pp. 357-366
- Davis, B.¹ Mermelstein, P.²

27
- 0025493667
- "The segmental k-means algorithm for estimating parameters of hidden Markov models,"
- B. Juang and L. Rabiner, "The segmental k-means algorithm for estimating parameters of hidden Markov models," IEEE Trans. Acoust., Speech, Signal Processing, vol. 38, no. 9, pp. 1639-1641, 1990.
- (1990) IEEE Trans. Acoust., Speech, Signal Processing , vol.38 , Issue.9 , pp. 1639-1641
- Juang, B.¹ Rabiner, L.²

28
- 0001882615
- "Self-organized language modeling for speech recognition
- A. Waibel and K. Lee, Eds. San Francisco, CA: Morgan Kaufmann
- F. Jelinek, "Self-organized language modeling for speech recognition," in Readings in Speech Rcognition, A. Waibel and K. Lee, Eds. San Francisco, CA: Morgan Kaufmann, 1990, pp. 450-506.
- (1990) Readings in Speech Rcognition , pp. 450-506
- Jelinek, F.¹

29
- 0028573857
- "Context-dependent modeling in alphabet recognition
- P. Loizou and A. Spanias, "Context-dependent modeling in alphabet recognition," in Proc. Int. Symp. Circuits Syst., 1994, pp. 189-192.
- (1994) Proc. Int. Symp. Circuits Syst. , pp. 189-192
- Loizou, P.¹ Spanias, A.²

30
- 0022859679
- The role of word-dependent coarticulatory effects in a phoneme-based speech recognition system
- Y. Chow et al., "The role of word-dependent coarticulatory effects in a phoneme-based speech recognition system," in Proc. Int. Conf. Acoust., Speech, Signal Processing, 1986, pp. 1593-1596.
- (1986) Proc. Int. Conf. Acoust., Speech, Signal Processing , pp. 1593-1596
- Chow, Y.¹

31
- 0003539541
- Ph.D. thesis, Carnegie Mellon Univ., Pittsburgh, PA, Apr.
- K. Lee, "Large vocabulary speaker-independent continuous speech recognition: The SPHINX system,"Ph.D. thesis, Carnegie Mellon Univ., Pittsburgh, PA, Apr. 1988.
- (1988) Large Vocabulary Speaker-independent Continuous Speech Recognition: the SPHINX System
- Lee, K.¹

32
- 0003874959
- New York, Springer-Verlag
- J. Markel and A. Gray, Linear Prediction of Speech. New York, Springer-Verlag, 1976.
- (1976) Linear Prediction of Speech
- Markel, J.¹ Gray, A.²

33
- 0003459982
- "Evaluation of LPC spectral matching measures for phonetic unit recognition
- Carnegie Mellon Univ.
- K. Shikano, "Evaluation of LPC spectral matching measures for phonetic unit recognition," Tech. Rep. CMU-CS-86-108, Carnegie Mellon Univ., 1986.
- (1986) Tech. Rep. , vol.CMU-CS-86-108
- Shikano, K.¹

34
- 0022914334
- Detection and recognition of nasal consonants in American english
- J. Glass and V. Zue, "Detection and recognition of nasal consonants in American english," in Proc. Int. Conf. Acoust., Speech, Signal Processing, 1986, pp. 2767-2770.
- (1986) Proc. Int. Conf. Acoust., Speech, Signal Processing , pp. 2767-2770
- Glass, J.¹ Zue, V.²

35
- 0021475513
- "Perceptual integration of the murmur and formant transitions for place of articulation in nasal consonants
- K. Kurowski and S. Blumstein, "Perceptual integration of the murmur and formant transitions for place of articulation in nasal consonants," J. Acoust. Soc. Amer., vol. 76, pp. 383-390, 1984.
- (1984) J. Acoust. Soc. Amer. , vol.76 , pp. 383-390
- Kurowski, K.¹ Blumstein, S.²

36
- 0022523182
- Perception of the [m]-[n] distinction in CV syllables
- B. Repp, "Perception of the [m]-[n] distinction in CV syllables," J. Acoust. Soc. Amer., vol. 79, pp. 1987-1999, 1986.
- (1986) J. Acoust. Soc. Amer. , vol.79 , pp. 1987-1999
- Repp, B.¹

37
- 0000629601
- Acoustic cues for nasal consonants: An experimental study involving a tape-splicing technique
- A. Malecot, "Acoustic cues for nasal consonants: An experimental study involving a tape-splicing technique," Language, vol. 32, pp. 274-284, 1956.
- (1956) Language , vol.32 , pp. 274-284
- Malecot, A.¹

38
- 0028936631
- Automatic recognition of syllable-final nasals preceded by /eh
- Mar.
- P. Loizou, M. Dorman, and A. Spanias, "Automatic recognition of syllable-final nasals preceded by /eh/," J. Acoust. Soc. Amer., vol. 97, no. 3, pp. 1925-1928, Mar. 1995.
- (1995) J. Acoust. Soc. Amer. , vol.97 , Issue.3 , pp. 1925-1928
- Loizou, P.¹ Dorman, M.² Spanias, A.³

39
- 33646920550
- Ph.D. thesis, Arizona State Univ., Tempe, AZ
- P. Loizou, "Robust speaker-independent recognition of a confusable vocabulary,"Ph.D. thesis, Arizona State Univ., Tempe, AZ, 1995.
- (1995) Robust speaker-independent recognition of a confusable vocabulary
- Loizou, P.¹

40
- 0004257992
- New York: Wiley
- S. Kullback, Information Theory and Statistics. New York: Wiley, 1958.
- (1958) Information Theory and Statistics
- Kullback, S.¹

41
- 0002215069
- "On a measure of divergence between two statistical populations defined by their probability distributions
- A. Bhattacharyya, "On a measure of divergence between two statistical populations defined by their probability distributions," Bull. Calcutta Math. Soc., vol. 35, pp. 99-109, 1943.,
- (1943) Bull. Calcutta Math. Soc. , vol.35 , pp. 99-109
- Bhattacharyya, A.¹

42
- 65249157560
- "The divergence and Bhattacharyya distance measures in signal selection,"
- T. Kailath, "The divergence and Bhattacharyya distance measures in signal selection," IEEE Trans. Commun. Technol., vol. COM-15, no. 1, pp. 52-60, 1967.
- (1967) IEEE Trans. Commun. Technol. , vol.COM-15 , Issue.1 , pp. 52-60
- Kailath, T.¹

43
- 0000042860
- Signal selection in communication and radar systems
- Oct.
- T. Grettenberg, "Signal selection in communication and radar systems," IEEE Trans. Inform. Theory, vol. IT-9, pp. 265-275, Oct. 1963.
- (1963) IEEE Trans. Inform. Theory , vol.IT-9 , pp. 265-275
- Grettenberg, T.¹

44
- 84914813506
- On the effectiveness of receptors in recognition systems
- T. Marill and M. Green, "On the effectiveness of receptors in recognition systems," IEEE Trans. Inform. Theory, vol. IT-9, pp. 11-17, 1963.
- (1963) IEEE Trans. Inform. Theory , vol.IT-9 , pp. 11-17
- Marill, T.¹ Green, M.²

45
- 0009061528
- Some approaches to optimum feature extraction
- J. Tou, Ed. New York: Academic
- J. Tou and R. Heydorn, "Some approaches to optimum feature extraction," Computer and Information Sciences-II, J. Tou, Ed. New York: Academic, 1967, pp. 57-89.
- (1967) Computer and Information Sciences-II , pp. 57-89
- Tou, J.¹ Heydorn, R.²

46
- 0004210306
- New York: Addison-Wesley
- J. Tou and R. Gonzalez, Pattern Recognition Principles. New York: Addison-Wesley, 1974.
- (1974) Pattern Recognition Principles
- Tou, J.¹ Gonzalez, R.²

47
- 0014604351
- A class of upper bounds on probability of error for multihypothesis pattern recognition
- G. Lainiolis, "A class of upper bounds on probability of error for multihypothesis pattern recognition," IEEE Trans. Inform. Theory, vol. IT-15, pp. 730-731, 1969.
- (1969) IEEE Trans. Inform. Theory , vol.IT-15 , pp. 730-731
- Lainiolis, G.¹

48
- 0346838156
- "English alphabet recognition with telephone speech
- J. Moody, S. Hanson, and R. Lippmann, Eds. San Francisco, CA: Morgan Kaufmann
- M. Fanty, R. Cole, and K. Roginsky, "English alphabet recognition with telephone speech," in Advances in Neural Information Processing Systems 4, J. Moody, S. Hanson, and R. Lippmann, Eds. San Francisco, CA: Morgan Kaufmann, 1992.
- (1992) Advances in Neural Information Processing Systems 4
- Fanty, M.¹ Cole, R.² Roginsky, K.³

49
- 85135100178
- A telephone speech database of spelled and spoken names
- R. Cole, M. Fanty, and K. Roginsky, "A telephone speech database of spelled and spoken names," in Proc. Int. Conf. Spoken Language Processing, 1992, pp. 891-893.
- (1992) Proc. Int. Conf. Spoken Language Processing , pp. 891-893
- Cole, R.¹ Fanty, M.² Roginsky, K.³

50
- 0025145948
- Modeling the microsegments of stop consonants in a hidden Markov model based recognizer
- June
- L. Deng, M. Lennig, and P. Mermelstein, "Modeling the microsegments of stop consonants in a hidden Markov model based recognizer," J. Acoust. Soc. Amer., vol. 87, no. 6, pp. 2738-2747, June 1990.
- (1990) J. Acoust. Soc. Amer. , vol.87 , Issue.6 , pp. 2738-2747
- Deng, L.¹ Lennig, M.² Mermelstein, P.³

51
- 0010568388
- "Telephone alphabet recognition for name-retrieval applications
- Oct.
- P. Loizou, A. Mekkoth, and A. Spanias, "Telephone alphabet recognition for name-retrieval applications," in Proc. Int. Conf. Signal Processing Applications Technol., vol. II, Oct. 1995, pp. 2014-2018.
- (1995) Proc. Int. Conf. Signal Processing Applications Technol. , vol.2 , pp. 2014-2018
- Loizou, P.¹ Mekkoth, A.² Spanias, A.³

52
- 1842272975
- "Improved speech recognition using the weighted average divergence measure
- P. Loizou and A. Spanias, "Improved speech recognition using the weighted average divergence measure," in Proc. Int. Conf. Digital Signal Processing, 1995, pp. 90-95.
- (1995) Proc. Int. Conf. Digital Signal Processing , pp. 90-95
- Loizou, P.¹ Spanias, A.²

53
- 0003723178
- New York: Wiley
- S. Searle, Matrix Algebra Useful for Statistics. New York: Wiley, 1982.
- (1982) Matrix Algebra Useful for Statistics
- Searle, S.¹

54
- 0032097263
- San Diego, CA: Academic
- K. Fukunaga, Introduction to Statistical Pattern Recognition. San Diego, CA: Academic, 1990.
- (1990) Introduction to Statistical Pattern Recognition
- Fukunaga, K.¹

55
- 6244257245
- "Comparative study of nonlinear time warping techniques in isolated word speech recognition systems
- Carnegie Mellon Univ., Pittsburgh, PA
- A. Waibel and B. Yegnanarayana, "Comparative study of nonlinear time warping techniques in isolated word speech recognition systems," Tech. Rep. CMU-CS-81-125, Carnegie Mellon Univ., Pittsburgh, PA, 1981.
- (1981) Tech. Rep. CMU-CS-81-125
- Waibel, A.¹ Yegnanarayana, B.²

56
- 0026370985
- "Optimising hidden Markov models using discriminative output distributions
- P. Woodland and D. Cole, "Optimising hidden Markov models using discriminative output distributions," in Proc. Int. Conf. Acoust., Speech, Signal Processing, 1991, pp. 545-548.
- Proc. Int. Conf. Acoust., Speech, Signal Processing , vol.1991
- Woodland, P.¹ Cole, D.²

57
- 0028251797
- Stochastic modeling of temporal information in speech for Hidden Markov Models
- Jan.
- J. Dai, I. MacKenzie, and J. Tyler, "Stochastic modeling of temporal information in speech for Hidden Markov Models," IEEE Trans. Speech Audio Processing, vol. 2, no. 1, pp. 102-104, Jan. 1994.
- (1994) IEEE Trans. Speech Audio Processing , vol.2 , Issue.1 , pp. 102-104
- Dai, J.¹ MacKenzie, I.² Tyler, J.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.