메뉴 건너뛰기




Volumn 88, Issue 8, 2000, Pages 1142-1165

Automatic recognition and understanding of spoken language - A first step toward natural human-machine communication

Author keywords

Acoustic modeling; Acoustic phonetics; Articulation; Automatic recognition and understanding; Bayes risk; Cepsiral distance; Continuous speech recognition; Detection based approach; Dialogue systems; Discriminative training; Dynamic programming

Indexed keywords


EID: 0000763574     PISSN: 00189219     EISSN: None     Source Type: Journal    
DOI: 10.1109/5.880077     Document Type: Article
Times cited : (101)

References (53)
  • 1
    • 84955014394 scopus 로고
    • Automatic recognition of spoken digits
    • K. H. Davis, R. Biddulph, and S. Balashek, "Automatic recognition of spoken digits." J. Acoust. Soc. Amer., vol. 24, no. 6, pp. 637-642, 1952.
    • (1952) J. Acoust. Soc. Amer. , vol.24 , Issue.6 , pp. 637-642
    • Davis, K.H.1    Biddulph, R.2    Balashek, S.3
  • 2
    • 33646933259 scopus 로고
    • Phonetic typewriter
    • H. F. Olson and H. Belar, "Phonetic typewriter," J. Acoust. Soc. Amer., vol. 28, no. 6, pp. 1072-1081, 1956.
    • (1956) J. Acoust. Soc. Amer. , vol.28 , Issue.6 , pp. 1072-1081
    • Olson, H.F.1    Belar, H.2
  • 3
    • 0343948927 scopus 로고
    • Results obtained from a vowel recognition computer program
    • J, W. Forgie and C. D. Forgie, "Results obtained from a vowel recognition computer program," J. Acoust. Soc. Amer., vol. 31, no. 11, pp. 1480-1489, 1959.
    • (1959) J. Acoust. Soc. Amer. , vol.31 , Issue.11 , pp. 1480-1489
    • Forgie, J.W.1    Forgie, C.D.2
  • 4
    • 33646897634 scopus 로고
    • Recognition of Japanese vowels - Preliminary to the recognition of speech
    • J. Suzuki and K. Nakata, "Recognition of Japanese vowels - Preliminary to the recognition of speech," J. Radio Res. Lab, vol. 37, no. 8, pp. 193-212, 1961.
    • (1961) J. Radio Res. Lab , vol.37 , Issue.8 , pp. 193-212
    • Suzuki, J.1    Nakata, K.2
  • 5
    • 84878853840 scopus 로고
    • The phonetic typewriter, information processing 1962
    • Munich, Germany
    • T. Sakai and S. Doshita, "The phonetic typewriter, information processing 1962," presented at the Proc. IFIP Congr., Munich, Germany, 1962.
    • (1962) Proc. IFIP Congr.
    • Sakai, T.1    Doshita, S.2
  • 6
    • 33646950726 scopus 로고
    • Spoken digit recognizer for Japanese language
    • K. Nagata, Y. Kato, and S. Chiba, "Spoken digit recognizer for Japanese language," NEC Res. Develop., no. 6, 1963.
    • (1963) NEC Res. Develop. , Issue.6
    • Nagata, K.1    Kato, Y.2    Chiba, S.3
  • 7
    • 33646909415 scopus 로고
    • Theoretical aspects of the mechanical speech recognition
    • D. B. Fry, "Theoretical aspects of the mechanical speech recognition," J. Br. Inst. Radio Eng., vol. 19, no. 4, pp. 211-229, 1959.
    • (1959) J. Br. Inst. Radio Eng. , vol.19 , Issue.4 , pp. 211-229
    • Fry, D.B.1
  • 9
    • 0010727514 scopus 로고
    • Speech discrimination by dynamic programming
    • Jan.-Feb.
    • T. K. Vintsyuk, "Speech discrimination by dynamic programming," Kibernetika, vol. 4, pp. 81-88, Jan.-Feb. 1968.
    • (1968) Kibernetika , vol.4 , pp. 81-88
    • Vintsyuk, T.K.1
  • 10
    • 0017930815 scopus 로고
    • Dynamic programming algorithm optimization for spoken word recognition
    • Feb.
    • H. Sakoe and S. Chiba, "Dynamic programming algorithm optimization for spoken word recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-26. pp. 43-49, Feb. 1978.
    • (1978) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-26 , pp. 43-49
    • Sakoe, H.1    Chiba, S.2
  • 11
    • 0016507833 scopus 로고
    • Design of a linguistic statistical decoder for the recognition of continuous speech
    • F. Jelinek, L. R. Bahl, and R. L. Mercer, "Design of a linguistic statistical decoder for the recognition of continuous speech," IEEE Trans. Inform. Theory, vol. IT-21, pp. 250-256, 1975.
    • (1975) IEEE Trans. Inform. Theory , vol.IT-21 , pp. 250-256
    • Jelinek, F.1    Bahl, L.R.2    Mercer, R.L.3
  • 12
    • 0022150487 scopus 로고
    • The development of an experimental discrete dictation recognizer
    • Nov.
    • F. Jelinek, "The development of an experimental discrete dictation recognizer," in Proc. IEEE, vol. 73, Nov. 1985, pp. 1616-1624.
    • (1985) Proc. IEEE , vol.73 , pp. 1616-1624
    • Jelinek, F.1
  • 14
    • 0016467604 scopus 로고
    • Minimum prediction residual applied to speech recognition
    • Feb.
    • F. Itakura, "Minimum prediction residual applied to speech recognition," IEEE Trans. Aconsl., Speech. Signal Processing, vol. ASSP-23, pp. 67-72, Feb. 1975.
    • (1975) IEEE Trans. Aconsl., Speech. Signal Processing , vol.ASSP-23 , pp. 67-72
    • Itakura, F.1
  • 16
    • 0022082035 scopus 로고
    • A modified K-means clustering algorithm for use in isolated word recognition
    • June
    • J. G. Wilpon and L. R. Rabiner, "A modified K-means clustering algorithm for use in isolated word recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-33, pp. 587-594, June 1985.
    • (1985) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-33 , pp. 587-594
    • Wilpon, J.G.1    Rabiner, L.R.2
  • 18
    • 26744458175 scopus 로고
    • An approach to computer speech recognition by direct analysis of the speech wave
    • Comput. Sci. Dept., Stanford Univ., Sept.
    • D. R. Reddy, "An approach to computer speech recognition by direct analysis of the speech wave," Comput. Sci. Dept., Stanford Univ., Tech. Rep. C549, Sept. 1966.
    • (1966) Tech. Rep. , vol.C549
    • Reddy, D.R.1
  • 20
    • 33646909120 scopus 로고
    • J. Ferguson, Ed., Princeton, NJ: IDA
    • J. Ferguson, Ed., Hidden Markov Models for Speech. Princeton, NJ: IDA, 1980.
    • (1980) Hidden Markov Models for Speech.
  • 21
    • 0022097649 scopus 로고
    • Maximum likelihood estimation for mixture multivariale stochastic observations of Markov chains
    • B. H. Juang, "Maximum likelihood estimation for mixture multivariale stochastic observations of Markov chains," AT&T Tech. J., vol. 64, 1985.
    • (1985) AT&T Tech. J. , vol.64
    • Juang, B.H.1
  • 22
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Feb.
    • L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," in Proc. IEEE, vol. 77, Feb. 1989, pp. 257-286.
    • (1989) Proc. IEEE , vol.77 , pp. 257-286
    • Rabiner, L.R.1
  • 23
    • 0347105140 scopus 로고
    • Stochastic representation of semantic structure for speech understanding
    • Genova, Italy, Sept.
    • R. Pieraccini and E. Levin, "Stochastic representation of semantic structure for speech understanding," in Proc. Eurospeech 91, Genova, Italy, Sept. 1991.
    • (1991) Proc. Eurospeech 91
    • Pieraccini, R.1    Levin, E.2
  • 25
    • 0027271235 scopus 로고
    • A novel approach to the speaker identification over telephone networks
    • Minneapolis, MN, Apr.
    • H. C. Wang, M.-S, Chen, and T. Yang, "A novel approach to the speaker identification over telephone networks," in Proc. ICASSP-93 Minneapolis, MN, Apr. 1993, vol. 2, pp. 407-410.
    • (1993) Proc. ICASSP-93 , vol.2 , pp. 407-410
    • Wang, H.C.1    M-S2    Chen3    Yang, T.4
  • 26
    • 0042660763 scopus 로고    scopus 로고
    • Speech and language processing for next-millenium communications services
    • Aug.
    • R. V. Cox, et al., "Speech and language processing for next-millenium communications services," Proc. IEEE, vol. 88, pp. 1314-1337, Aug. 2000.
    • (2000) Proc. IEEE , vol.88 , pp. 1314-1337
    • Cox, R.V.1
  • 27
    • 33646934064 scopus 로고    scopus 로고
    • Automatic speech recognition: Problems, progress & prospects
    • Kyoto, Japan, Oct.
    • B. H. Juang, "Automatic speech recognition: Problems, progress & prospects," presented at the IEEE Workshop Neural Networks for Signal Processing, Kyoto, Japan, Oct. 1996.
    • (1996) IEEE Workshop Neural Networks for Signal Processing
    • Juang, B.H.1
  • 28
    • 33646936057 scopus 로고    scopus 로고
    • National Institute of Science and Technology, Feb.
    • D, Palleu, et al., "DARPA HUB-4 rep.," National Institute of Science and Technology, Feb. 1999.
    • (1999) "DARPA HUB-4 Rep.
    • Palleu, D.1
  • 32
    • 0025041264 scopus 로고
    • Perceptual linear predictive (PLP) analysis of speech
    • H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech," J. Acoust. Soc. Amer., vol. 87, no. 4, pp. 1738-1752, 1990.
    • (1990) J. Acoust. Soc. Amer. , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 33
    • 0031221099 scopus 로고    scopus 로고
    • Filtering the time sequences of spectral parameters for speech recognition
    • C. Nadeu, P. Paches-Leal, and B. H. Juang, "Filtering the time sequences of spectral parameters for speech recognition," Speech Commun., vol. 22, pp. 315-332, 1997.
    • (1997) Speech Commun. , vol.22 , pp. 315-332
    • Nadeu, C.1    Paches-Leal, P.2    Juang, B.H.3
  • 35
    • 0022667694 scopus 로고
    • Speaker independent isolated word recognition using dynamic features of speech spectrum
    • Feb.
    • S. Furui, "Speaker independent isolated word recognition using dynamic features of speech spectrum,'' IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-34, pp. 52-59, Feb. 1986.
    • (1986) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-34 , pp. 52-59
    • Furui, S.1
  • 36
    • 0019555090 scopus 로고
    • Cepstral analysis technique for automatic speaker verification
    • Apr.
    • _, "Cepstral analysis technique for automatic speaker verification," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-29, pp. 254-272, Apr. 1981.
    • (1981) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-29 , pp. 254-272
  • 38
    • 0000353178 scopus 로고
    • A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains
    • L. E. Baum, T. Petri, G. Soules, and N. Weiss, "A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains," Ann. Math. Statist., vol. 41, pp. 164-171, 1970.
    • (1970) Ann. Math. Statist. , vol.41 , pp. 164-171
    • Baum, L.E.1    Petri, T.2    Soules, G.3    Weiss, N.4
  • 39
    • 85007758808 scopus 로고
    • Discriminative training
    • B. H. Juang and S. Katagiri, "Discriminative training," J. Acoust. Soc. Jpn (E), vol. 13, no. 6, pp. 333-339, 1992.
    • (1992) J. Acoust. Soc. Jpn (E) , vol.13 , Issue.6 , pp. 333-339
    • Juang, B.H.1    Katagiri, S.2
  • 40
    • 0031139839 scopus 로고    scopus 로고
    • Minimum classification error rate methods for speech recognition
    • May
    • B. H. Juang, W. Chou, and C. H. Lee, "Minimum classification error rate methods for speech recognition," IEEE Trans. Speech Audio Processing, vol. 5, pp. 257-265. May 1997.
    • (1997) IEEE Trans. Speech Audio Processing , vol.5 , pp. 257-265
    • Juang, B.H.1    Chou, W.2    Lee, C.H.3
  • 41
    • 0022018101 scopus 로고
    • A probabilistic distance measure for hidden Markov modeis
    • Feb.
    • B. H. Juang and L. R. Rabiner, "A probabilistic distance measure for hidden Markov modeis," AT&T Tech. J., vol. 64, pp. 391-408, Feb. 1985.
    • (1985) AT&T Tech. J. , vol.64 , pp. 391-408
    • Juang, B.H.1    Rabiner, L.R.2
  • 43
    • 33646917477 scopus 로고    scopus 로고
    • Using natural-language knowledge sources in speech recognition
    • K. Ponting, Ed. Berlin, Germany: Springer-Verlag
    • R. C. Moore, "Using natural-language knowledge sources in speech recognition," in Computational Models of Speech Pattern Processing, K. Ponting, Ed. Berlin, Germany: Springer-Verlag, 1997, pp. 304-327.
    • (1997) Computational Models of Speech Pattern Processing , pp. 304-327
    • Moore, R.C.1
  • 44
    • 0000635720 scopus 로고    scopus 로고
    • Progress in dynamic programming search for LVCSR
    • Aug.
    • H. Nev and S. Ortmanns, "Progress in dynamic programming search for LVCSR," Proc. IEEE, vol. 88, pp. 1224-1240, Aug. 2000.
    • (2000) Proc. IEEE , vol.88 , pp. 1224-1240
    • Nev, H.1    Ortmanns, S.2
  • 47
    • 84989525001 scopus 로고
    • Indexing by latent semantic analysis
    • S. Deerwester, et al., "Indexing by latent semantic analysis," J. Amer. Soc. Inform. Sci., vol. 41, pp. 391-407, 1990.
    • (1990) J. Amer. Soc. Inform. Sci. , vol.41 , pp. 391-407
    • Deerwester, S.1
  • 48
    • 0030682289 scopus 로고    scopus 로고
    • Combining key-phrase detection and subword based verification for flexible speech understanding
    • May
    • T. Kawahara, C. H. Lee, and B. H. Juang, "Combining key-phrase detection and subword based verification for flexible speech understanding," in Proc. IEEE ICASSP97, May 1997.
    • (1997) Proc. IEEE ICASSP97
    • Kawahara, T.1    Lee, C.H.2    Juang, B.H.3
  • 51
    • 0000159105 scopus 로고    scopus 로고
    • On adaptive decision rules and decision parameter adaptation for automatic speech recognition
    • Aug.
    • C. H. Lee and Q. Huo, "On adaptive decision rules and decision parameter adaptation for automatic speech recognition," Proc. IEEE, vol. 88, pp. 1241-1269, Aug. 2000.
    • (2000) Proc. IEEE , vol.88 , pp. 1241-1269
    • Lee, C.H.1    Huo, Q.2
  • 53
    • 33646909415 scopus 로고
    • The design and operation of the mechanical speech recognizer at University College London
    • P. Denes, "The design and operation of the mechanical speech recognizer at University College London," J. Br. Inst. Radio Eng., vol. 19, no. 4, pp. 211-229, 1959.
    • (1959) J. Br. Inst. Radio Eng. , vol.19 , Issue.4 , pp. 211-229
    • Denes, P.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.