Volume 4, Issue 6, 1996, Pages 430-445

High-performance alphabet recognition

Author keywords

[No Author keywords available]

Indexed keywords

CHARACTER SETS; MARKOV PROCESSES; MATHEMATICAL MODELS; PATTERN RECOGNITION SYSTEMS; PERFORMANCE; SPEECH COMMUNICATION;

EID: 0030286185     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/89.544528     Document Type: Article
Times cited: 62

References (57)
  • 1
    • 0028378020
    • Applications of voice processing to telecommunications
    • L. Rabiner, "Applications of voice processing to telecommunications," in Proc. IEEE, vol. 82, no. 2, pp. 199-228, Feb. 1994.
    • (1994) Proc. IEEE , vol.82 , Issue.2 , pp. 199-228
    • Rabiner, L.1
  • 3
    • 0027579316
    • Discriminative training of dynamic programming based speech recognizers
    • P. Chang and B. Juang, "Discriminative training of dynamic programming based speech recognizers," IEEE Trans. Speech Audio Processing, vol. 1, no. 2, pp. 135-143, Apr. 1993.
    • (1993) IEEE Trans. Speech Audio Processing , vol.1 , Issue.2 , pp. 135-143
    • Chang, P.1    Juang, B.2
  • 5
    • 0028195650
    • Speech recognition using weighted HMM and subspace projection approaches
    • K. Su and C. Lee, "Speech recognition using weighted HMM and subspace projection approaches," IEEE Trans. Speech Audio Processing, vol. 2, no. 1, pp. 69-79, Jan. 1994.
    • (1994) IEEE Trans. Speech Audio Processing , vol.2 , Issue.1 , pp. 69-79
    • Su, K.1    Lee, C.2
  • 6
    • 0001462521
    • A cross-language study of voicing in initial stops: Acoustical measurements
    • L. Lisker and S. Abramson, "A cross-language study of voicing in initial stops: Acoustical measurements," Word, vol. 20, pp. 384-422, 1964.
    • (1964) Word , vol.20 , pp. 384-422
    • Lisker, L.1    Abramson, S.2
  • 10
    • 0001559782
    • Analysis of nasal consonants
    • O. Fujimura, "Analysis of nasal consonants," J. Acoust. Soc. Amer., vol. 34, no. 12, pp. 1865-1875, Dec. 1962.
    • (1962) J. Acoust. Soc. Amer. , vol.34 , Issue.12 , pp. 1865-1875
    • Fujimura, O.1
  • 11
    • 0008457913
    • Speech coding and recognition: A review
    • A. Spanias and F. Wu, "Speech coding and recognition: A review," IEICE Trans. Fundamentals, vol. E75-A, no. 2, pp. 132-148, Feb. 1992.
    • (1992) IEICE Trans. Fundamentals , vol.E75-A , Issue.2 , pp. 132-148
    • Spanias, A.1    Wu, F.2
  • 12
    • 0016467604
    • Minimum prediction residual applied to speech recognition
    • F. Itakura, "Minimum prediction residual applied to speech recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-23, no. 1, pp. 67-72, Feb. 1975.
    • (1975) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-23 , Issue.1 , pp. 67-72
    • Itakura, F.1
  • 14
  • 16
    • 33646908717
    • Performance improvement in a dynamic-programming based isolated word recognition system for the alpha-digit task
    • L. Lamel and V. Zue, "Performance improvement in a dynamic-programming based isolated word recognition system for the alpha-digit task," in Proc. Int. Conf. Acoust., Speech, Signal Processing, 1982, pp. 558-561.
    • (1982) Proc. Int. Conf. Acoust., Speech, Signal Processing , pp. 558-561
    • Lamel, L.1    Zue, V.2
  • 17
    • 0001887625
    • Performing fine phonetic distinctions: Templates vs. features
    • R. Cole, R. Stern, and M. Lasry, "Performing fine phonetic distinctions: Templates vs. features," in Invariance and Variability of Speech Processes, J. Perkell and D. Klatt, Eds. New York: Lawrence Erlbaum, 1986, pp. 325-341.
    • (1986) Invariance and Variability of Speech Processes , pp. 325-341
    • Cole, R.1    Stern, R.2    Lasry, M.3
  • 19
    • 0004989362
    • Some performance benchmarks for isolated word speech recognition systems
    • L. Rabiner and J. Wilpon, "Some performance benchmarks for isolated word speech recognition systems," Comput. Speech Language, vol. 2, pp. 343-357, 1987.
    • (1987) Comput. Speech Language , vol.2 , pp. 343-357
    • Rabiner, L.1    Wilpon, J.2
  • 24
    • 0003640523
    • The ISOLET spoken letter database
    • R. Cole, Y. Muthusamy, and M. Fanty, "The ISOLET spoken letter database," Tech. Rep. 90-004, Oregon Graduate Inst., 1990.
    • (1990) Tech. Rep. 90-004
    • Cole, R.1    Muthusamy, Y.2    Fanty, M.3
  • 25
    • 0002583871
    • Speech database development: Design and analysis of the acoustic phonetic corpus
    • L. Lamel, R. Kassel, and S. Seneff, "Speech database development: Design and analysis of the acoustic phonetic corpus," in Proc. DARPA Speech Recognition Workshop, 1986, pp. 100-109.
    • (1986) Proc. DARPA Speech Recognition Workshop , pp. 100-109
    • Lamel, L.1    Kassel, R.2    Seneff, S.3
  • 26
    • 0019053271
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-28, no. 4, pp. 357-366, Aug. 1980.
    • (1980) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-28 , Issue.4 , pp. 357-366
    • Davis, B.1    Mermelstein, P.2
  • 27
    • 0025493667
    • The segmental k-means algorithm for estimating parameters of hidden Markov models
    • B. Juang and L. Rabiner, "The segmental k-means algorithm for estimating parameters of hidden Markov models," IEEE Trans. Acoust., Speech, Signal Processing, vol. 38, no. 9, pp. 1639-1641, 1990.
    • (1990) IEEE Trans. Acoust., Speech, Signal Processing , vol.38 , Issue.9 , pp. 1639-1641
    • Juang, B.1    Rabiner, L.2
  • 28
    • 0001882615
    • Self-organized language modeling for speech recognition
    • F. Jelinek, "Self-organized language modeling for speech recognition," in Readings in Speech Recognition, A. Waibel and K. Lee, Eds. San Francisco, CA: Morgan Kaufmann, 1990, pp. 450-506.
    • (1990) Readings in Speech Recognition , pp. 450-506
    • Jelinek, F.1
  • 29
    • 0028573857
    • Context-dependent modeling in alphabet recognition
    • P. Loizou and A. Spanias, "Context-dependent modeling in alphabet recognition," in Proc. Int. Symp. Circuits Syst., 1994, pp. 189-192.
    • (1994) Proc. Int. Symp. Circuits Syst. , pp. 189-192
    • Loizou, P.1    Spanias, A.2
  • 30
    • 0022859679
    • The role of word-dependent coarticulatory effects in a phoneme-based speech recognition system
    • Y. Chow et al., "The role of word-dependent coarticulatory effects in a phoneme-based speech recognition system," in Proc. Int. Conf. Acoust., Speech, Signal Processing, 1986, pp. 1593-1596.
    • (1986) Proc. Int. Conf. Acoust., Speech, Signal Processing , pp. 1593-1596
    • Chow, Y.1
  • 33
    • 0003459982
    • Evaluation of LPC spectral matching measures for phonetic unit recognition
    • K. Shikano, "Evaluation of LPC spectral matching measures for phonetic unit recognition," Tech. Rep. CMU-CS-86-108, Carnegie Mellon Univ., 1986.
    • (1986) Tech. Rep. , vol.CMU-CS-86-108
    • Shikano, K.1
  • 35
    • 0021475513
    • Perceptual integration of the murmur and formant transitions for place of articulation in nasal consonants
    • K. Kurowski and S. Blumstein, "Perceptual integration of the murmur and formant transitions for place of articulation in nasal consonants," J. Acoust. Soc. Amer., vol. 76, pp. 383-390, 1984.
    • (1984) J. Acoust. Soc. Amer. , vol.76 , pp. 383-390
    • Kurowski, K.1    Blumstein, S.2
  • 36
    • 0022523182
    • Perception of the [m]-[n] distinction in CV syllables
    • B. Repp, "Perception of the [m]-[n] distinction in CV syllables," J. Acoust. Soc. Amer., vol. 79, pp. 1987-1999, 1986.
    • (1986) J. Acoust. Soc. Amer. , vol.79 , pp. 1987-1999
    • Repp, B.1
  • 37
    • 0000629601
    • Acoustic cues for nasal consonants: An experimental study involving a tape-splicing technique
    • A. Malecot, "Acoustic cues for nasal consonants: An experimental study involving a tape-splicing technique," Language, vol. 32, pp. 274-284, 1956.
    • (1956) Language , vol.32 , pp. 274-284
    • Malecot, A.1
  • 38
    • 0028936631
    • Automatic recognition of syllable-final nasals preceded by /eh/
    • P. Loizou, M. Dorman, and A. Spanias, "Automatic recognition of syllable-final nasals preceded by /eh/," J. Acoust. Soc. Amer., vol. 97, no. 3, pp. 1925-1928, Mar. 1995.
    • (1995) J. Acoust. Soc. Amer. , vol.97 , Issue.3 , pp. 1925-1928
    • Loizou, P.1    Dorman, M.2    Spanias, A.3
  • 41
    • 0002215069
    • On a measure of divergence between two statistical populations defined by their probability distributions
    • A. Bhattacharyya, "On a measure of divergence between two statistical populations defined by their probability distributions," Bull. Calcutta Math. Soc., vol. 35, pp. 99-109, 1943.
    • (1943) Bull. Calcutta Math. Soc. , vol.35 , pp. 99-109
    • Bhattacharyya, A.1
  • 42
    • 65249157560
    • The divergence and Bhattacharyya distance measures in signal selection
    • T. Kailath, "The divergence and Bhattacharyya distance measures in signal selection," IEEE Trans. Commun. Technol., vol. COM-15, no. 1, pp. 52-60, 1967.
    • (1967) IEEE Trans. Commun. Technol. , vol.COM-15 , Issue.1 , pp. 52-60
    • Kailath, T.1
  • 43
    • 0000042860
    • Signal selection in communication and radar systems
    • T. Grettenberg, "Signal selection in communication and radar systems," IEEE Trans. Inform. Theory, vol. IT-9, pp. 265-275, Oct. 1963.
    • (1963) IEEE Trans. Inform. Theory , vol.IT-9 , pp. 265-275
    • Grettenberg, T.1
  • 44
    • 84914813506
    • On the effectiveness of receptors in recognition systems
    • T. Marill and M. Green, "On the effectiveness of receptors in recognition systems," IEEE Trans. Inform. Theory, vol. IT-9, pp. 11-17, 1963.
    • (1963) IEEE Trans. Inform. Theory , vol.IT-9 , pp. 11-17
    • Marill, T.1    Green, M.2
  • 45
    • 0009061528
    • Some approaches to optimum feature extraction
    • J. Tou and R. Heydorn, "Some approaches to optimum feature extraction," Computer and Information Sciences-II, J. Tou, Ed. New York: Academic, 1967, pp. 57-89.
    • (1967) Computer and Information Sciences-II , pp. 57-89
    • Tou, J.1    Heydorn, R.2
  • 47
    • 0014604351
    • A class of upper bounds on probability of error for multihypothesis pattern recognition
    • G. Lainiotis, "A class of upper bounds on probability of error for multihypothesis pattern recognition," IEEE Trans. Inform. Theory, vol. IT-15, pp. 730-731, 1969.
    • (1969) IEEE Trans. Inform. Theory , vol.IT-15 , pp. 730-731
    • Lainiotis, G.1
  • 48
    • 0346838156
    • English alphabet recognition with telephone speech
    • M. Fanty, R. Cole, and K. Roginsky, "English alphabet recognition with telephone speech," in Advances in Neural Information Processing Systems 4, J. Moody, S. Hanson, and R. Lippmann, Eds. San Francisco, CA: Morgan Kaufmann, 1992.
    • (1992) Advances in Neural Information Processing Systems 4
    • Fanty, M.1    Cole, R.2    Roginsky, K.3
  • 50
    • 0025145948
    • Modeling the microsegments of stop consonants in a hidden Markov model based recognizer
    • L. Deng, M. Lennig, and P. Mermelstein, "Modeling the microsegments of stop consonants in a hidden Markov model based recognizer," J. Acoust. Soc. Amer., vol. 87, no. 6, pp. 2738-2747, June 1990.
    • (1990) J. Acoust. Soc. Amer. , vol.87 , Issue.6 , pp. 2738-2747
    • Deng, L.1    Lennig, M.2    Mermelstein, P.3
  • 52
    • 1842272975
    • Improved speech recognition using the weighted average divergence measure
    • P. Loizou and A. Spanias, "Improved speech recognition using the weighted average divergence measure," in Proc. Int. Conf. Digital Signal Processing, 1995, pp. 90-95.
    • (1995) Proc. Int. Conf. Digital Signal Processing , pp. 90-95
    • Loizou, P.1    Spanias, A.2
  • 55
    • 6244257245
    • Comparative study of nonlinear time warping techniques in isolated word speech recognition systems
    • A. Waibel and B. Yegnanarayana, "Comparative study of nonlinear time warping techniques in isolated word speech recognition systems," Tech. Rep. CMU-CS-81-125, Carnegie Mellon Univ., Pittsburgh, PA, 1981.
    • (1981) Tech. Rep. CMU-CS-81-125
    • Waibel, A.1    Yegnanarayana, B.2
  • 57
    • 0028251797
    • Stochastic modeling of temporal information in speech for Hidden Markov Models
    • J. Dai, I. MacKenzie, and J. Tyler, "Stochastic modeling of temporal information in speech for Hidden Markov Models," IEEE Trans. Speech Audio Processing, vol. 2, no. 1, pp. 102-104, Jan. 1994.
    • (1994) IEEE Trans. Speech Audio Processing , vol.2 , Issue.1 , pp. 102-104
    • Dai, J.1    MacKenzie, I.2    Tyler, J.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.