SCOPUS 정보 검색 플랫폼

Proceedings of the IEEE

Volumn 88, Issue 8, 2000, Pages 1142-1165

Automatic recognition and understanding of spoken language - A first step toward natural human-machine communication

(2) Juang, Biing Hwang a,c Furui, Sadaoki b

a Lucent Technologics Inc (United States)

b TOKYO INSTITUTE OF TECHNOLOGY (Japan)

c LUCENT TECHNOLOGIES (United States)

Author keywords

Acoustic modeling; Acoustic phonetics; Articulation; Automatic recognition and understanding; Bayes risk; Cepsiral distance; Continuous speech recognition; Detection based approach; Dialogue systems; Discriminative training; Dynamic programming

Indexed keywords

EID: 0000763574 PISSN: 00189219 EISSN: None Source Type: Journal
DOI: 10.1109/5.880077 Document Type: Article

Times cited : (101)

References (53)

1
- 84955014394
- Automatic recognition of spoken digits
- K. H. Davis, R. Biddulph, and S. Balashek, "Automatic recognition of spoken digits." J. Acoust. Soc. Amer., vol. 24, no. 6, pp. 637-642, 1952.
- (1952) J. Acoust. Soc. Amer. , vol.24 , Issue.6 , pp. 637-642
- Davis, K.H.¹ Biddulph, R.² Balashek, S.³

2
- 33646933259
- Phonetic typewriter
- H. F. Olson and H. Belar, "Phonetic typewriter," J. Acoust. Soc. Amer., vol. 28, no. 6, pp. 1072-1081, 1956.
- (1956) J. Acoust. Soc. Amer. , vol.28 , Issue.6 , pp. 1072-1081
- Olson, H.F.¹ Belar, H.²

3
- 0343948927
- Results obtained from a vowel recognition computer program
- J, W. Forgie and C. D. Forgie, "Results obtained from a vowel recognition computer program," J. Acoust. Soc. Amer., vol. 31, no. 11, pp. 1480-1489, 1959.
- (1959) J. Acoust. Soc. Amer. , vol.31 , Issue.11 , pp. 1480-1489
- Forgie, J.W.¹ Forgie, C.D.²

4
- 33646897634
- Recognition of Japanese vowels - Preliminary to the recognition of speech
- J. Suzuki and K. Nakata, "Recognition of Japanese vowels - Preliminary to the recognition of speech," J. Radio Res. Lab, vol. 37, no. 8, pp. 193-212, 1961.
- (1961) J. Radio Res. Lab , vol.37 , Issue.8 , pp. 193-212
- Suzuki, J.¹ Nakata, K.²

5
- 84878853840
- The phonetic typewriter, information processing 1962
- Munich, Germany
- T. Sakai and S. Doshita, "The phonetic typewriter, information processing 1962," presented at the Proc. IFIP Congr., Munich, Germany, 1962.
- (1962) Proc. IFIP Congr.
- Sakai, T.¹ Doshita, S.²

6
- 33646950726
- Spoken digit recognizer for Japanese language
- K. Nagata, Y. Kato, and S. Chiba, "Spoken digit recognizer for Japanese language," NEC Res. Develop., no. 6, 1963.
- (1963) NEC Res. Develop. , Issue.6
- Nagata, K.¹ Kato, Y.² Chiba, S.³

7
- 33646909415
- Theoretical aspects of the mechanical speech recognition
- D. B. Fry, "Theoretical aspects of the mechanical speech recognition," J. Br. Inst. Radio Eng., vol. 19, no. 4, pp. 211-229, 1959.
- (1959) J. Br. Inst. Radio Eng. , vol.19 , Issue.4 , pp. 211-229
- Fry, D.B.¹

8
- 33646944695
- Speech recognition by feature abstraction techniques
- T. B. Martin, A. L. Nelson, and H. J. Zadell, "Speech recognition by feature abstraction techniques," Air Force Avionics Lab, Tech. Rep. AL-TDR-64-176, 1964.
- (1964) Air Force Avionics Lab, Tech. Rep. , vol.AL-TDR-64-176
- Martin, T.B.¹ Nelson, A.L.² Zadell, H.J.³

9
- 0010727514
- Speech discrimination by dynamic programming
- Jan.-Feb.
- T. K. Vintsyuk, "Speech discrimination by dynamic programming," Kibernetika, vol. 4, pp. 81-88, Jan.-Feb. 1968.
- (1968) Kibernetika , vol.4 , pp. 81-88
- Vintsyuk, T.K.¹

10
- 0017930815
- Dynamic programming algorithm optimization for spoken word recognition
- Feb.
- H. Sakoe and S. Chiba, "Dynamic programming algorithm optimization for spoken word recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-26. pp. 43-49, Feb. 1978.
- (1978) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-26 , pp. 43-49
- Sakoe, H.¹ Chiba, S.²

11
- 0016507833
- Design of a linguistic statistical decoder for the recognition of continuous speech
- F. Jelinek, L. R. Bahl, and R. L. Mercer, "Design of a linguistic statistical decoder for the recognition of continuous speech," IEEE Trans. Inform. Theory, vol. IT-21, pp. 250-256, 1975.
- (1975) IEEE Trans. Inform. Theory , vol.IT-21 , pp. 250-256
- Jelinek, F.¹ Bahl, L.R.² Mercer, R.L.³

12
- 0022150487
- The development of an experimental discrete dictation recognizer
- Nov.
- F. Jelinek, "The development of an experimental discrete dictation recognizer," in Proc. IEEE, vol. 73, Nov. 1985, pp. 1616-1624.
- (1985) Proc. IEEE , vol.73 , pp. 1616-1624
- Jelinek, F.¹

13
- 0018656519
- Speaker independent recognition of isolated words using clustering techniques
- Aug.
- L. R. Rabiner, S. E. Levinson, A. E. Rosenberg, and J. G. Wilpon, "Speaker independent recognition of isolated words using clustering techniques," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-27, pp. 336-349. Aug. 1979.
- (1979) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-27 , pp. 336-349
- Rabiner, L.R.¹ Levinson, S.E.² Rosenberg, A.E.³ Wilpon, J.G.⁴

14
- 0016467604
- Minimum prediction residual applied to speech recognition
- Feb.
- F. Itakura, "Minimum prediction residual applied to speech recognition," IEEE Trans. Aconsl., Speech. Signal Processing, vol. ASSP-23, pp. 67-72, Feb. 1975.
- (1975) IEEE Trans. Aconsl., Speech. Signal Processing , vol.ASSP-23 , pp. 67-72
- Itakura, F.¹

15
- 0023165215
- On the use of bandpass liftering in speech recognition
- July
- B. H. Juang, L. R. Rabiner, and J. G. Wilpon, "On the use of bandpass liftering in speech recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-35, pp. 947-954, July 1987.
- (1987) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-35 , pp. 947-954
- Juang, B.H.¹ Rabiner, L.R.² Wilpon, J.G.³

16
- 0022082035
- A modified K-means clustering algorithm for use in isolated word recognition
- June
- J. G. Wilpon and L. R. Rabiner, "A modified K-means clustering algorithm for use in isolated word recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-33, pp. 587-594, June 1985.
- (1985) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-33 , pp. 587-594
- Wilpon, J.G.¹ Rabiner, L.R.²

17
- 0025517070
- Automatic recognition of keywords in unconstrained speech using hidden Markov models
- Nov.
- J. G. Wilpon, L. R. Rabiner, C. H. Lee, and E. Goldman, "Automatic recognition of keywords in unconstrained speech using hidden Markov models," IEEE Trans. Acoust., Speech, Signal Processing, vol. 38, pp. 1870-1878, Nov. 1990.
- (1990) IEEE Trans. Acoust., Speech, Signal Processing , vol.38 , pp. 1870-1878
- Wilpon, J.G.¹ Rabiner, L.R.² Lee, C.H.³ Goldman, E.⁴

18
- 26744458175
- An approach to computer speech recognition by direct analysis of the speech wave
- Comput. Sci. Dept., Stanford Univ., Sept.
- D. R. Reddy, "An approach to computer speech recognition by direct analysis of the speech wave," Comput. Sci. Dept., Stanford Univ., Tech. Rep. C549, Sept. 1966.
- (1966) Tech. Rep. , vol.C549
- Reddy, D.R.¹

19
- 0016610301
- Organization of the hearsay - II: Speech understanding system
- June
- V. R. Lesser, R. D. Fennell, L. D. Ermar, and D. R. Reddy, "Organization of the hearsay - II: Speech understanding system," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-23, pp. 11-23, June 1975.
- (1975) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-23 , pp. 11-23
- Lesser, V.R.¹ Fennell, R.D.² Ermar, L.D.³ Reddy, D.R.⁴

20
- 33646909120
- J. Ferguson, Ed., Princeton, NJ: IDA
- J. Ferguson, Ed., Hidden Markov Models for Speech. Princeton, NJ: IDA, 1980.
- (1980) Hidden Markov Models for Speech.

21
- 0022097649
- Maximum likelihood estimation for mixture multivariale stochastic observations of Markov chains
- B. H. Juang, "Maximum likelihood estimation for mixture multivariale stochastic observations of Markov chains," AT&T Tech. J., vol. 64, 1985.
- (1985) AT&T Tech. J. , vol.64
- Juang, B.H.¹

22
- 0024610919
- A tutorial on hidden Markov models and selected applications in speech recognition
- Feb.
- L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," in Proc. IEEE, vol. 77, Feb. 1989, pp. 257-286.
- (1989) Proc. IEEE , vol.77 , pp. 257-286
- Rabiner, L.R.¹

23
- 0347105140
- Stochastic representation of semantic structure for speech understanding
- Genova, Italy, Sept.
- R. Pieraccini and E. Levin, "Stochastic representation of semantic structure for speech understanding," in Proc. Eurospeech 91, Genova, Italy, Sept. 1991.
- (1991) Proc. Eurospeech 91
- Pieraccini, R.¹ Levin, E.²

24
- 0003757962
- New York: Springer-Verlag
- J. L. Flanagan, Speech Analysis, Synthesis and Perception. New York: Springer-Verlag, 1972.
- (1972) Speech Analysis, Synthesis and Perception
- Flanagan, J.L.¹

25
- 0027271235
- A novel approach to the speaker identification over telephone networks
- Minneapolis, MN, Apr.
- H. C. Wang, M.-S, Chen, and T. Yang, "A novel approach to the speaker identification over telephone networks," in Proc. ICASSP-93 Minneapolis, MN, Apr. 1993, vol. 2, pp. 407-410.
- (1993) Proc. ICASSP-93 , vol.2 , pp. 407-410
- Wang, H.C.¹ M-S² Chen³ Yang, T.⁴

26
- 0042660763
- Speech and language processing for next-millenium communications services
- Aug.
- R. V. Cox, et al., "Speech and language processing for next-millenium communications services," Proc. IEEE, vol. 88, pp. 1314-1337, Aug. 2000.
- (2000) Proc. IEEE , vol.88 , pp. 1314-1337
- Cox, R.V.¹

27
- 33646934064
- Automatic speech recognition: Problems, progress & prospects
- Kyoto, Japan, Oct.
- B. H. Juang, "Automatic speech recognition: Problems, progress & prospects," presented at the IEEE Workshop Neural Networks for Signal Processing, Kyoto, Japan, Oct. 1996.
- (1996) IEEE Workshop Neural Networks for Signal Processing
- Juang, B.H.¹

28
- 33646936057
- National Institute of Science and Technology, Feb.
- D, Palleu, et al., "DARPA HUB-4 rep.," National Institute of Science and Technology, Feb. 1999.
- (1999) "DARPA HUB-4 Rep.
- Palleu, D.¹

29
- 0003472470
- New York: Wiley
- R. O. Duda and P. E. Hart, Pattern Classification and Scene Analysis. New York: Wiley, 1973.
- (1973) Pattern Classification and Scene Analysis
- Duda, R.O.¹ Hart, P.E.²

30
- 0141629128
- Experiments in vocal tract normalization
- A. Andreou, T. Kainni, and J. Cohen, "Experiments in vocal tract normalization," presented at the Proc. CAIP/Rutgers Workshop: Frontiers in Speech Recognition II, 1994.
- (1994) Proc. CAIP/Rutgers Workshop: Frontiers in Speech Recognition II
- Andreou, A.¹ Kainni, T.² Cohen, J.³

31
- 0003874959
- Berlin, Germany: Springer-Verlag
- J. D. Markel and A. H. Gray Jr., Linear Prediction of Speech. Berlin, Germany: Springer-Verlag, 1976.
- (1976) Linear Prediction of Speech
- Markel, J.D.¹ Gray Jr., A.H.²

32
- 0025041264
- Perceptual linear predictive (PLP) analysis of speech
- H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech," J. Acoust. Soc. Amer., vol. 87, no. 4, pp. 1738-1752, 1990.
- (1990) J. Acoust. Soc. Amer. , vol.87 , Issue.4 , pp. 1738-1752
- Hermansky, H.¹

33
- 0031221099
- Filtering the time sequences of spectral parameters for speech recognition
- C. Nadeu, P. Paches-Leal, and B. H. Juang, "Filtering the time sequences of spectral parameters for speech recognition," Speech Commun., vol. 22, pp. 315-332, 1997.
- (1997) Speech Commun. , vol.22 , pp. 315-332
- Nadeu, C.¹ Paches-Leal, P.² Juang, B.H.³

34
- 0028517164
- RASTA processing of speech
- Oct.
- H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Trans. Speech Audio Processing, vol. 2, pp. 578-589, Oct. 1994.
- (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

35
- 0022667694
- Speaker independent isolated word recognition using dynamic features of speech spectrum
- Feb.
- S. Furui, "Speaker independent isolated word recognition using dynamic features of speech spectrum,'' IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-34, pp. 52-59, Feb. 1986.
- (1986) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-34 , pp. 52-59
- Furui, S.¹

36
- 0019555090
- Cepstral analysis technique for automatic speaker verification
- Apr.
- _, "Cepstral analysis technique for automatic speaker verification," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-29, pp. 254-272, Apr. 1981.
- (1981) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-29 , pp. 254-272

37
- 0004244302
- Englewood Cliffs, NJ: Prentice-Hall
- L. R. Rabiner and B. H. Juang, Fundamentals of Speech Recognition. Englewood Cliffs, NJ: Prentice-Hall, 1993.
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.R.¹ Juang, B.H.²

38
- 0000353178
- A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains
- L. E. Baum, T. Petri, G. Soules, and N. Weiss, "A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains," Ann. Math. Statist., vol. 41, pp. 164-171, 1970.
- (1970) Ann. Math. Statist. , vol.41 , pp. 164-171
- Baum, L.E.¹ Petri, T.² Soules, G.³ Weiss, N.⁴

39
- 85007758808
- Discriminative training
- B. H. Juang and S. Katagiri, "Discriminative training," J. Acoust. Soc. Jpn (E), vol. 13, no. 6, pp. 333-339, 1992.
- (1992) J. Acoust. Soc. Jpn (E) , vol.13 , Issue.6 , pp. 333-339
- Juang, B.H.¹ Katagiri, S.²

40
- 0031139839
- Minimum classification error rate methods for speech recognition
- May
- B. H. Juang, W. Chou, and C. H. Lee, "Minimum classification error rate methods for speech recognition," IEEE Trans. Speech Audio Processing, vol. 5, pp. 257-265. May 1997.
- (1997) IEEE Trans. Speech Audio Processing , vol.5 , pp. 257-265
- Juang, B.H.¹ Chou, W.² Lee, C.H.³

41
- 0022018101
- A probabilistic distance measure for hidden Markov modeis
- Feb.
- B. H. Juang and L. R. Rabiner, "A probabilistic distance measure for hidden Markov modeis," AT&T Tech. J., vol. 64, pp. 391-408, Feb. 1985.
- (1985) AT&T Tech. J. , vol.64 , pp. 391-408
- Juang, B.H.¹ Rabiner, L.R.²

42
- 0003786003
- Cambridge, MA: MIT Press
- F. Jelinek, Statistical Methods for Speech Recognition. Cambridge, MA: MIT Press, 1997.
- (1997) Statistical Methods for Speech Recognition
- Jelinek, F.¹

43
- 33646917477
- Using natural-language knowledge sources in speech recognition
- K. Ponting, Ed. Berlin, Germany: Springer-Verlag
- R. C. Moore, "Using natural-language knowledge sources in speech recognition," in Computational Models of Speech Pattern Processing, K. Ponting, Ed. Berlin, Germany: Springer-Verlag, 1997, pp. 304-327.
- (1997) Computational Models of Speech Pattern Processing , pp. 304-327
- Moore, R.C.¹

44
- 0000635720
- Progress in dynamic programming search for LVCSR
- Aug.
- H. Nev and S. Ortmanns, "Progress in dynamic programming search for LVCSR," Proc. IEEE, vol. 88, pp. 1224-1240, Aug. 2000.
- (2000) Proc. IEEE , vol.88 , pp. 1224-1240
- Nev, H.¹ Ortmanns, S.²

45
- 33646911672
- ATIS Tech. Rep
- Austin, TX
- "ATIS Tech. Rep.," in Proc. ARPA Spoken Language Systems Technology Workshop, Austin, TX, 1995, pp. 241-280.
- (1995) Proc. ARPA Spoken Language Systems Technology Workshop , pp. 241-280

46
- 0006455466
- Structured networks for adaptive language acquisition
- L. G. Miller and A. Gorin, "Structured networks for adaptive language acquisition," Int. J. Pattern Recognit. Artif. Intell. (Special Issue on Neural Networks), vol. 7, no. 4, pp. 873-898, 1993.
- (1993) Int. J. Pattern Recognit. Artif. Intell. (Special Issue on Neural Networks) , vol.7 , Issue.4 , pp. 873-898
- Miller, L.G.¹ Gorin, A.²

47
- 84989525001
- Indexing by latent semantic analysis
- S. Deerwester, et al., "Indexing by latent semantic analysis," J. Amer. Soc. Inform. Sci., vol. 41, pp. 391-407, 1990.
- (1990) J. Amer. Soc. Inform. Sci. , vol.41 , pp. 391-407
- Deerwester, S.¹

48
- 0030682289
- Combining key-phrase detection and subword based verification for flexible speech understanding
- May
- T. Kawahara, C. H. Lee, and B. H. Juang, "Combining key-phrase detection and subword based verification for flexible speech understanding," in Proc. IEEE ICASSP97, May 1997.
- (1997) Proc. IEEE ICASSP97
- Kawahara, T.¹ Lee, C.H.² Juang, B.H.³

49
- 0003770709
- Boston, MA: Kluwer
- J.-C. Junqua and J.-P. Haton, Robustness in Automatic Speech Recognition. Boston, MA: Kluwer, 1996.
- (1996) Robustness in Automatic Speech Recognition
- Junqua, J.-C.¹ Haton, J.-P.²

50
- 0002960982
- Recent advances in robust speech recognition
- Pont-a-Mouson, France
- S. Furui, "Recent advances in robust speech recognition," in Proc. ESCA-NATO Workshop on Robust Speech Recognition for Unknown Communication Channels, Pont-a-Mouson, France, 1997, pp. 11-20.
- (1997) Proc. ESCA-NATO Workshop on Robust Speech Recognition for Unknown Communication Channels , pp. 11-20
- Furui, S.¹

51
- 0000159105
- On adaptive decision rules and decision parameter adaptation for automatic speech recognition
- Aug.
- C. H. Lee and Q. Huo, "On adaptive decision rules and decision parameter adaptation for automatic speech recognition," Proc. IEEE, vol. 88, pp. 1241-1269, Aug. 2000.
- (2000) Proc. IEEE , vol.88 , pp. 1241-1269
- Lee, C.H.¹ Huo, Q.²

52
- 0024076692
- On a model-robust training method for speech recognition
- A,. Nadas, D. Nahamoo, and M. A. Picheny, "On a model-robust training method for speech recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. 36. pp. 1432-1436, 1988.
- (1988) IEEE Trans. Acoust., Speech, Signal Processing , vol.36 , pp. 1432-1436
- Nadas, A.¹ Nahamoo, D.² Picheny, M.A.³

53
- 33646909415
- The design and operation of the mechanical speech recognizer at University College London
- P. Denes, "The design and operation of the mechanical speech recognizer at University College London," J. Br. Inst. Radio Eng., vol. 19, no. 4, pp. 211-229, 1959.
- (1959) J. Br. Inst. Radio Eng. , vol.19 , Issue.4 , pp. 211-229
- Denes, P.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.