SCOPUS 정보 검색 플랫폼

IEEE Transactions on Neural Networks

Volumn 8, Issue 2, 1997, Pages 194-204

Robust speech recognition based on joint model and feature space optimization of hidden Markov models

(2) Moon, Seokyong a,b Hwang, Jenq Neng a,c

a IEEE (South Korea)

b SAMSUNG Electronics (South Korea)

c University of Washington (United States)

Author keywords

Baum Welch inversion; Baum Welch reestimation; Hidden Markov model; Maximum likelihood; Minimax optimization; Minimum mean squared error; Mismatch compensation; Neural network inversion; Robust speech recognition

Indexed keywords

ALGORITHMS; ERROR COMPENSATION; LEARNING SYSTEMS; MATHEMATICAL MODELS; NEURAL NETWORKS; OPTIMIZATION; SIGNAL TO NOISE RATIO;

BAUM-WELCH INVERSION; BAUM-WELCH REESTIMATION; FEATURE SPACE OPTIMIZATION; HIDDEN MARKOV MODELS; MAXIMUM LIKELIHOOD; MINIMAX OPTIMIZATION; MINIMUM MEAN SQUARED ERROR; MISMATCH COMPENSATION; NEURAL NETWORK INVERSION;

SPEECH RECOGNITION;

EID: 0031100269 PISSN: 10459227 EISSN: None Source Type: Journal
DOI: 10.1109/72.557656 Document Type: Article

Times cited : (32)

References (33)

1
- 0025628728
- Environmental robustness in automatic speech recognition
- Apr.
- A. Acero and R. M. Stern, "Environmental robustness in automatic speech recognition," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, Apr. 1990, pp. 849-852.
- (1990) Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , pp. 849-852
- Acero, A.¹ Stern, R.M.²

2
- 0026835134
- Global optimization of a neural network-hidden Markov model hybrid
- Mar.
- Y. Bengio, R. D. Mori, G. Flammia, and R. Kompe, "Global optimization of a neural network-hidden Markov model hybrid," IEEE Trans. Neural Networks, vol. 3, pp. 252-259, Mar. 1992.
- (1992) IEEE Trans. Neural Networks , vol.3 , pp. 252-259
- Bengio, Y.¹ Mori, R.D.² Flammia, G.³ Kompe, R.⁴

3
- 0018455310
- Suppression of acoustic noise in speech using spectral subtraction
- Apr.
- S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-29, pp. 113-120, Apr. 1979.
- (1979) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-29 , pp. 113-120
- Boll, S.F.¹

4
- 0003572996
- Ph.D. dissertation, Carnegie Mellon University, Pittsburgh, PA
- P. F. Brown, "Acoustic-phonetic modeling problem in automatic speech recognition," Ph.D. dissertation, Carnegie Mellon University, Pittsburgh, PA, 1987.
- (1987) Acoustic-phonetic Modeling Problem in Automatic Speech Recognition
- Brown, P.F.¹

5
- 0028317510
- A projection-based likelihood measure for speech recognition in noise
- Jan.
- B. A. Carlson and M. A. Clements, "A projection-based likelihood measure for speech recognition in noise," IEEE Trans. Speech Audio Processing, vol. 2, pp. 97-102, Jan. 1994.
- (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 97-102
- Carlson, B.A.¹ Clements, M.A.²

6
- 0027629444
- Retrieval of snow parameters by iterative inversion of a neural network
- July
- D. T. Davis, Z. Chen, L. Tsang, J. N. Hwang, and A. T. C. Chang, "Retrieval of snow parameters by iterative inversion of a neural network," IEEE Trans. Geosci. Remote Sensing, vol. 31, pp. 842-852, July 1993.
- (1993) IEEE Trans. Geosci. Remote Sensing , vol.31 , pp. 842-852
- Davis, D.T.¹ Chen, Z.² Tsang, L.³ Hwang, J.N.⁴ Chang, A.T.C.⁵

7
- 0028312802
- Auditory models and human performance in tasks related to speech coding and speech recognition
- Jan.
- O. Ghitza, "Auditory models and human performance in tasks related to speech coding and speech recognition," IEEE Trans. Speech Audio Processing, vol. 2, Pt. II, pp. 115-132, Jan. 1994.
- (1994) IEEE Trans. Speech Audio Processing , vol.2 , Issue.2 PART , pp. 115-132
- Ghitza, O.¹

8
- 0026838119
- Iterative inversion of neural networks and its application to adaptive control
- Mar.
- D. Hoskins, J. N. Hwang, and J. Vagners, "Iterative inversion of neural networks and its application to adaptive control," IEEE Trans. Neural Networks, vol. 3, pp. 292-301, Mar. 1992.
- (1992) IEEE Trans. Neural Networks , vol.3 , pp. 292-301
- Hoskins, D.¹ Hwang, J.N.² Vagners, J.³

9
- 0000375621
- A robust version of the probability ratio test
- P. J. Huber, "A robust version of the probability ratio test," Ann. Math. Stat., vol. 36, no. 4, pp. 1753-1758, 1965.
- (1965) Ann. Math. Stat. , vol.36 , Issue.4 , pp. 1753-1758
- Huber, P.J.¹

10
- 0024915502
- A systolic neural network architecture for hidden Markov models
- Dec.
- J. N. Hwang, J. A. Vlontzos, and S. Y. Kung, "A systolic neural network architecture for hidden Markov models," IEEE Trans. Acoust., Speech, Signal Processing, vol. 37, pp. 1967-1979, Dec. 1989.
- (1989) IEEE Trans. Acoust., Speech, Signal Processing , vol.37 , pp. 1967-1979
- Hwang, J.N.¹ Vlontzos, J.A.² Kung, S.Y.³

11
- 16444380833
- Iterative constrained inversion of neural networks and its applications
- Princeton, NJ, Mar.
- J. N. Hwang and C. H. Chan, "Iterative constrained inversion of neural networks and its applications, in Proc. 24th Conf. Inform. Syst. Sci., Princeton, NJ, Mar. 1990, pp. 754-759.
- (1990) Proc. 24th Conf. Inform. Syst. Sci. , pp. 754-759
- Hwang, J.N.¹ Chan, C.H.²

12
- 0025721732
- Query learning applied to partially trained multilayer perceptrons
- Jan.
- J. N. Hwang, J. J. Choi, S. Oh, and R. J. Marks, II, "Query learning applied to partially trained multilayer perceptrons," IEEE Trans. Neural Networks, vol. 2, pp. 131-136, Jan. 1991.
- (1991) IEEE Trans. Neural Networks , vol.2 , pp. 131-136
- Hwang, J.N.¹ Choi, J.J.² Oh, S.³ Marks II, R.J.⁴

13
- 5544303185
- Interactive query learning for isolated speech recognition
- Helsinger, Denmark, Sept.
- J. N. Hwang and H. Li, "Interactive query learning for isolated speech recognition," in Proc. IEEE Int. Wkshp. Neural Networks Signal Processing, Helsinger, Denmark, Sept. 1992, pp. 93-102.
- (1992) Proc. IEEE Int. Wkshp. Neural Networks Signal Processing , pp. 93-102
- Hwang, J.N.¹ Li, H.²

14
- 0003690086
- New York: Springer-Verlag
- A. Isidori, Nonlinear Control Systems. New York: Springer-Verlag, 1995.
- (1995) Nonlinear Control Systems
- Isidori, A.¹

15
- 0022097649
- Maximum-likelihood estimation for mixture multivariate stochastic observations of Markov chains
- July
- B. H. Juang, "Maximum-likelihood estimation for mixture multivariate stochastic observations of Markov chains," AT&T Tech. J., vol. 64, no. 6, pp. 1235-1249, July 1985.
- (1985) AT&T Tech. J. , vol.64 , Issue.6 , pp. 1235-1249
- Juang, B.H.¹

16
- 0026189808
- Speech recognition in adverse environment
- _, "Speech recognition in adverse environment," Comput. Speech Language, vol. 5, no. 3, pp. 275-294, 1991.
- (1991) Comput. Speech Language , vol.5 , Issue.3 , pp. 275-294

17
- 0026925484
- Hidden Markov models with first-order equalization for noisy speech recognition
- Sept.
- B. H. Juang and K. K. Paliwal, "Hidden Markov models with first-order equalization for noisy speech recognition," IEEE Trans. Signal Processing, vol. 40, pp. 2136-2143, Sept. 1992.
- (1992) IEEE Trans. Signal Processing , vol.40 , pp. 2136-2143
- Juang, B.H.¹ Paliwal, K.K.²

18
- 0026982122
- Discriminative learning for minimum error classification
- Dec.
- B. H. Juang and S. Katagiri, "Discriminative learning for minimum error classification," IEEE Trans. Signal Processing, vol. 40, pp. 3043-3054, Dec. 1992.
- (1992) IEEE Trans. Signal Processing , vol.40 , pp. 3043-3054
- Juang, B.H.¹ Katagiri, S.²

19
- 0026271562
- New discriminative training algorithm based on the generalized probabilistic descent method
- Piscataway, NJ, Aug.
- S. Katagiri, C. H. Lee, and B. H. Juang, "New discriminative training algorithm based on the generalized probabilistic descent method," in Proc. IEEE Wkshp. Neural Networks Signal Processing, Piscataway, NJ, Aug. 1991, pp. 299-308.
- (1991) Proc. IEEE Wkshp. Neural Networks Signal Processing , pp. 299-308
- Katagiri, S.¹ Lee, C.H.² Juang, B.H.³

20
- 0017980972
- All-pole modeling of degraded speech
- June
- J. S. Lim and A. V. Oppenheim, "All-pole modeling of degraded speech," IEEE Trans. Acoust., Speech, Signal Processing, vol. 26, no. 3, pp. 197-210, June 1978.
- (1978) IEEE Trans. Acoust., Speech, Signal Processing , vol.26 , Issue.3 , pp. 197-210
- Lim, J.S.¹ Oppenheim, A.V.²

21
- 0018642851
- Enhancement and bandwidth compression of noisy speech (Invited Paper)
- Dec.
- _, "Enhancement and bandwidth compression of noisy speech (Invited Paper)," Proc. IEEE, vol. 67, no. 12, pp. 1586-1604, Dec. 1979.
- (1979) Proc. IEEE , vol.67 , Issue.12 , pp. 1586-1604

22
- 0024921866
- Inversion of multilayer nets
- Washington, D.C., June
- A. Linden and J. Kindermann, "Inversion of multilayer nets," in Proc. Int. Joint Conf. Neural Networks, Washington, D.C., June 1989, pp. 425-430.
- (1989) Proc. Int. Joint Conf. Neural Networks , pp. 425-430
- Linden, A.¹ Kindermann, J.²

23
- 0024766457
- A family of distortion measures based upon projection operation for robust speech recognition
- Nov.
- D. Mansour and B. H. Juang, "A family of distortion measures based upon projection operation for robust speech recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. 37, pp. 1659-1671, Nov. 1989.
- (1989) IEEE Trans. Acoust., Speech, Signal Processing , vol.37 , pp. 1659-1671
- Mansour, D.¹ Juang, B.H.²

24
- 0002671953
- A minimax classification approach with application to robust speech recognition
- Jan.
- N. Merhav and C. H. Lee, "A minimax classification approach with application to robust speech recognition," IEEE Trans. Speech Audio Processing, vol. 1, pp. 90-100, Jan. 1993.
- (1993) IEEE Trans. Speech Audio Processing , vol.1 , pp. 90-100
- Merhav, N.¹ Lee, C.H.²

25
- 0006528820
- Noisy speech recognition via wavelet coefficient enhancement
- Monterey, CA, Oct.
- S. Y. Moon and J. N. Hwang, "Noisy speech recognition via wavelet coefficient enhancement," in Proc. IEEE 26th Asilomar Conf. Signals, Syst., Comput., Monterey, CA, Oct. 1992, pp. 1086-1090.
- (1992) Proc. IEEE 26th Asilomar Conf. Signals, Syst., Comput. , pp. 1086-1090
- Moon, S.Y.¹ Hwang, J.N.²

26
- 33747640657
- Robust noisy speech enhancement using wavelets
- Seoul, Korea, Aug.
- _, "Robust noisy speech enhancement using wavelets," in Proc. 1st Asia Pacific Conf. Commun., vol. 2, Seoul, Korea, Aug. 1993.
- (1993) Proc. 1st Asia Pacific Conf. Commun. , vol.2

27
- 0027269023
- Coordinated training of noise removing networks
- Minneapolis, MN, Apr.
- _, "Coordinated training of noise removing networks," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, vol. 1, Minneapolis, MN, Apr. 1993, pp. 573-576.
- (1993) Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , vol.1 , pp. 573-576

28
- 0028996864
- Noisy speech recognition using robust inversion of hidden Markov models
- Detroit, MI, May
- _, "Noisy speech recognition using robust inversion of hidden Markov models," in Proc. EEE Int. Conf. Acoust., Speech, Signal Processing, Detroit, MI, May 1995, pp. 145-148.
- (1995) Proc. EEE Int. Conf. Acoust., Speech, Signal Processing , pp. 145-148

29
- 33747646318
- Ph.D. dissertation, Brown University, Providence, RI, May
- L. T. Niles, "Modeling and learning in speech recognition: The relationship between stochastic pattern classifiers and neural networks," Ph.D. dissertation, Brown University, Providence, RI, May 1991.
- (1991) Modeling and Learning in Speech Recognition: The Relationship between Stochastic Pattern Classifiers and Neural Networks
- Niles, L.T.¹

30
- 0004244302
- Englewood Cliffs, NJ: Prentice-Hall
- L. R. Rabiner and B. H. Juang, Fundamentals of Speech Recognition. Englewood Cliffs, NJ: Prentice-Hall, 1993.
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.R.¹ Juang, B.H.²

31
- 0000646059
- Learning internal representations by error propagation
- Cambridge, MA: MIT Press, ch. 8
- D. E. Rumelhart, G. E. Hinton, and R. J. Williams, "Learning internal representations by error propagation," Parallel Distributed Processing (PDP): Exploration in the Microstructure of Cognition, vol. 1. Cambridge, MA: MIT Press, 1986, ch. 8, pp. 318-362.
- (1986) Parallel Distributed Processing (PDP): Exploration in the Microstructure of Cognition , vol.1 , pp. 318-362
- Rumelhart, D.E.¹ Hinton, G.E.² Williams, R.J.³

32
- 0023739472
- Noise reduction using connectionist models
- S. Tamura and A. Waibel, "Noise reduction using connectionist models," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, 1988, pp. 553-556.
- (1988) Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , pp. 553-556
- Tamura, S.¹ Waibel, A.²

33
- 0000243355
- Learning in artificial neural networks: A statistical perspective
- winter
- H. White, "Learning in artificial neural networks: A statistical perspective," Neural Computa., vol. 1, no. 4, pp. 425-464, winter 1989.
- (1989) Neural Computa. , vol.1 , Issue.4 , pp. 425-464
- White, H.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.