메뉴 건너뛰기




Volumn 8, Issue 2, 1997, Pages 194-204

Robust speech recognition based on joint model and feature space optimization of hidden Markov models

Author keywords

Baum Welch inversion; Baum Welch reestimation; Hidden Markov model; Maximum likelihood; Minimax optimization; Minimum mean squared error; Mismatch compensation; Neural network inversion; Robust speech recognition

Indexed keywords

ALGORITHMS; ERROR COMPENSATION; LEARNING SYSTEMS; MATHEMATICAL MODELS; NEURAL NETWORKS; OPTIMIZATION; SIGNAL TO NOISE RATIO;

EID: 0031100269     PISSN: 10459227     EISSN: None     Source Type: Journal    
DOI: 10.1109/72.557656     Document Type: Article
Times cited : (32)

References (33)
  • 2
    • 0026835134 scopus 로고
    • Global optimization of a neural network-hidden Markov model hybrid
    • Mar.
    • Y. Bengio, R. D. Mori, G. Flammia, and R. Kompe, "Global optimization of a neural network-hidden Markov model hybrid," IEEE Trans. Neural Networks, vol. 3, pp. 252-259, Mar. 1992.
    • (1992) IEEE Trans. Neural Networks , vol.3 , pp. 252-259
    • Bengio, Y.1    Mori, R.D.2    Flammia, G.3    Kompe, R.4
  • 3
    • 0018455310 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • Apr.
    • S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-29, pp. 113-120, Apr. 1979.
    • (1979) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-29 , pp. 113-120
    • Boll, S.F.1
  • 5
    • 0028317510 scopus 로고
    • A projection-based likelihood measure for speech recognition in noise
    • Jan.
    • B. A. Carlson and M. A. Clements, "A projection-based likelihood measure for speech recognition in noise," IEEE Trans. Speech Audio Processing, vol. 2, pp. 97-102, Jan. 1994.
    • (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 97-102
    • Carlson, B.A.1    Clements, M.A.2
  • 7
    • 0028312802 scopus 로고
    • Auditory models and human performance in tasks related to speech coding and speech recognition
    • Jan.
    • O. Ghitza, "Auditory models and human performance in tasks related to speech coding and speech recognition," IEEE Trans. Speech Audio Processing, vol. 2, Pt. II, pp. 115-132, Jan. 1994.
    • (1994) IEEE Trans. Speech Audio Processing , vol.2 , Issue.2 PART , pp. 115-132
    • Ghitza, O.1
  • 8
    • 0026838119 scopus 로고
    • Iterative inversion of neural networks and its application to adaptive control
    • Mar.
    • D. Hoskins, J. N. Hwang, and J. Vagners, "Iterative inversion of neural networks and its application to adaptive control," IEEE Trans. Neural Networks, vol. 3, pp. 292-301, Mar. 1992.
    • (1992) IEEE Trans. Neural Networks , vol.3 , pp. 292-301
    • Hoskins, D.1    Hwang, J.N.2    Vagners, J.3
  • 9
    • 0000375621 scopus 로고
    • A robust version of the probability ratio test
    • P. J. Huber, "A robust version of the probability ratio test," Ann. Math. Stat., vol. 36, no. 4, pp. 1753-1758, 1965.
    • (1965) Ann. Math. Stat. , vol.36 , Issue.4 , pp. 1753-1758
    • Huber, P.J.1
  • 11
    • 16444380833 scopus 로고
    • Iterative constrained inversion of neural networks and its applications
    • Princeton, NJ, Mar.
    • J. N. Hwang and C. H. Chan, "Iterative constrained inversion of neural networks and its applications, in Proc. 24th Conf. Inform. Syst. Sci., Princeton, NJ, Mar. 1990, pp. 754-759.
    • (1990) Proc. 24th Conf. Inform. Syst. Sci. , pp. 754-759
    • Hwang, J.N.1    Chan, C.H.2
  • 12
    • 0025721732 scopus 로고
    • Query learning applied to partially trained multilayer perceptrons
    • Jan.
    • J. N. Hwang, J. J. Choi, S. Oh, and R. J. Marks, II, "Query learning applied to partially trained multilayer perceptrons," IEEE Trans. Neural Networks, vol. 2, pp. 131-136, Jan. 1991.
    • (1991) IEEE Trans. Neural Networks , vol.2 , pp. 131-136
    • Hwang, J.N.1    Choi, J.J.2    Oh, S.3    Marks II, R.J.4
  • 13
    • 5544303185 scopus 로고
    • Interactive query learning for isolated speech recognition
    • Helsinger, Denmark, Sept.
    • J. N. Hwang and H. Li, "Interactive query learning for isolated speech recognition," in Proc. IEEE Int. Wkshp. Neural Networks Signal Processing, Helsinger, Denmark, Sept. 1992, pp. 93-102.
    • (1992) Proc. IEEE Int. Wkshp. Neural Networks Signal Processing , pp. 93-102
    • Hwang, J.N.1    Li, H.2
  • 15
    • 0022097649 scopus 로고
    • Maximum-likelihood estimation for mixture multivariate stochastic observations of Markov chains
    • July
    • B. H. Juang, "Maximum-likelihood estimation for mixture multivariate stochastic observations of Markov chains," AT&T Tech. J., vol. 64, no. 6, pp. 1235-1249, July 1985.
    • (1985) AT&T Tech. J. , vol.64 , Issue.6 , pp. 1235-1249
    • Juang, B.H.1
  • 16
    • 0026189808 scopus 로고
    • Speech recognition in adverse environment
    • _, "Speech recognition in adverse environment," Comput. Speech Language, vol. 5, no. 3, pp. 275-294, 1991.
    • (1991) Comput. Speech Language , vol.5 , Issue.3 , pp. 275-294
  • 17
    • 0026925484 scopus 로고
    • Hidden Markov models with first-order equalization for noisy speech recognition
    • Sept.
    • B. H. Juang and K. K. Paliwal, "Hidden Markov models with first-order equalization for noisy speech recognition," IEEE Trans. Signal Processing, vol. 40, pp. 2136-2143, Sept. 1992.
    • (1992) IEEE Trans. Signal Processing , vol.40 , pp. 2136-2143
    • Juang, B.H.1    Paliwal, K.K.2
  • 18
    • 0026982122 scopus 로고
    • Discriminative learning for minimum error classification
    • Dec.
    • B. H. Juang and S. Katagiri, "Discriminative learning for minimum error classification," IEEE Trans. Signal Processing, vol. 40, pp. 3043-3054, Dec. 1992.
    • (1992) IEEE Trans. Signal Processing , vol.40 , pp. 3043-3054
    • Juang, B.H.1    Katagiri, S.2
  • 19
    • 0026271562 scopus 로고
    • New discriminative training algorithm based on the generalized probabilistic descent method
    • Piscataway, NJ, Aug.
    • S. Katagiri, C. H. Lee, and B. H. Juang, "New discriminative training algorithm based on the generalized probabilistic descent method," in Proc. IEEE Wkshp. Neural Networks Signal Processing, Piscataway, NJ, Aug. 1991, pp. 299-308.
    • (1991) Proc. IEEE Wkshp. Neural Networks Signal Processing , pp. 299-308
    • Katagiri, S.1    Lee, C.H.2    Juang, B.H.3
  • 21
    • 0018642851 scopus 로고
    • Enhancement and bandwidth compression of noisy speech (Invited Paper)
    • Dec.
    • _, "Enhancement and bandwidth compression of noisy speech (Invited Paper)," Proc. IEEE, vol. 67, no. 12, pp. 1586-1604, Dec. 1979.
    • (1979) Proc. IEEE , vol.67 , Issue.12 , pp. 1586-1604
  • 23
    • 0024766457 scopus 로고
    • A family of distortion measures based upon projection operation for robust speech recognition
    • Nov.
    • D. Mansour and B. H. Juang, "A family of distortion measures based upon projection operation for robust speech recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. 37, pp. 1659-1671, Nov. 1989.
    • (1989) IEEE Trans. Acoust., Speech, Signal Processing , vol.37 , pp. 1659-1671
    • Mansour, D.1    Juang, B.H.2
  • 24
    • 0002671953 scopus 로고
    • A minimax classification approach with application to robust speech recognition
    • Jan.
    • N. Merhav and C. H. Lee, "A minimax classification approach with application to robust speech recognition," IEEE Trans. Speech Audio Processing, vol. 1, pp. 90-100, Jan. 1993.
    • (1993) IEEE Trans. Speech Audio Processing , vol.1 , pp. 90-100
    • Merhav, N.1    Lee, C.H.2
  • 25
    • 0006528820 scopus 로고
    • Noisy speech recognition via wavelet coefficient enhancement
    • Monterey, CA, Oct.
    • S. Y. Moon and J. N. Hwang, "Noisy speech recognition via wavelet coefficient enhancement," in Proc. IEEE 26th Asilomar Conf. Signals, Syst., Comput., Monterey, CA, Oct. 1992, pp. 1086-1090.
    • (1992) Proc. IEEE 26th Asilomar Conf. Signals, Syst., Comput. , pp. 1086-1090
    • Moon, S.Y.1    Hwang, J.N.2
  • 26
    • 33747640657 scopus 로고
    • Robust noisy speech enhancement using wavelets
    • Seoul, Korea, Aug.
    • _, "Robust noisy speech enhancement using wavelets," in Proc. 1st Asia Pacific Conf. Commun., vol. 2, Seoul, Korea, Aug. 1993.
    • (1993) Proc. 1st Asia Pacific Conf. Commun. , vol.2
  • 27
    • 0027269023 scopus 로고
    • Coordinated training of noise removing networks
    • Minneapolis, MN, Apr.
    • _, "Coordinated training of noise removing networks," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, vol. 1, Minneapolis, MN, Apr. 1993, pp. 573-576.
    • (1993) Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , vol.1 , pp. 573-576
  • 28
    • 0028996864 scopus 로고
    • Noisy speech recognition using robust inversion of hidden Markov models
    • Detroit, MI, May
    • _, "Noisy speech recognition using robust inversion of hidden Markov models," in Proc. EEE Int. Conf. Acoust., Speech, Signal Processing, Detroit, MI, May 1995, pp. 145-148.
    • (1995) Proc. EEE Int. Conf. Acoust., Speech, Signal Processing , pp. 145-148
  • 33
    • 0000243355 scopus 로고
    • Learning in artificial neural networks: A statistical perspective
    • winter
    • H. White, "Learning in artificial neural networks: A statistical perspective," Neural Computa., vol. 1, no. 4, pp. 425-464, winter 1989.
    • (1989) Neural Computa. , vol.1 , Issue.4 , pp. 425-464
    • White, H.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.