메뉴 건너뛰기




Volumn 5, Issue 2, 1997, Pages 161-172

On-Line Adaptive Learning of the Continuous Density Hidden Markov Model Based on Approximate Recursive Bayes Estimate

Author keywords

Automatic speech recognition, speaker adaptation; Em algorithm; Hidden markov model; Incremental maximum likelihood estimation; Recursive bayesian estimation

Indexed keywords

ADAPTIVE ALGORITHMS; LEARNING SYSTEMS; LOUDSPEAKERS; MARKOV PROCESSES; ONLINE SYSTEMS; RECURSIVE FUNCTIONS; TIME VARYING SYSTEMS;

EID: 0031103160     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/89.554778     Document Type: Article
Times cited : (108)

References (42)
  • 2
    • 0016067897 scopus 로고
    • Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
    • B. S. Atal, "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," J. Acoust. Soc. Amer., vol. 55, pp. 1304-1312, 1974.
    • (1974) J. Acoust. Soc. Amer. , vol.55 , pp. 1304-1312
    • Atal, B.S.1
  • 3
    • 0001862769 scopus 로고
    • An inequality and associated maximization techniques in statistical estimation for probabilistic functions of Markov processes
    • L. E. Baum, "An inequality and associated maximization techniques in statistical estimation for probabilistic functions of Markov processes," Inequalities, vol. 3, pp. 1-8, 1972.
    • (1972) Inequalities , vol.3 , pp. 1-8
    • Baum, L.E.1
  • 4
    • 0000353178 scopus 로고
    • A maximization technique occurring in the statistical analysis of probabilistic function of Markov chains
    • L. E. Baum, T. Pétrie, G. Soûles, and N. Weiss, "A maximization technique occurring in the statistical analysis of probabilistic function of Markov chains," Ann. Math. Stat., vol. 41, pp. 164-171, 1970.
    • (1970) Ann. Math. Stat. , vol.41 , pp. 164-171
    • Baum, L.E.1    Pétrie, T.2    Soûles, G.3    Weiss, N.4
  • 5
    • 33646949412 scopus 로고
    • Approximations in statistics from a decision-theoretical viewpoint
    • R. Viertl, Ed. New York: Plenum
    • J. M. Bernardo, "Approximations in statistics from a decision-theoretical viewpoint," in Probability and Bayesian Statistics, R. Viertl, Ed. New York: Plenum, pp. 53-60, 1987.
    • (1987) Probability and Bayesian Statistics , pp. 53-60
    • Bernardo, J.M.1
  • 6
    • 0002858519 scopus 로고
    • A Bayesian analysis of simple mixture problems
    • J. M. Bernardo, M. H. DeGroot, D. V. Lindley, and A. F. M. Smith, Eds. Oxford, UK: Oxford Univ. Press
    • J. M. Bernardo and F. J. Giron, "A Bayesian analysis of simple mixture problems," Bayesian Statistics 3, J. M. Bernardo, M. H. DeGroot, D. V. Lindley, and A. F. M. Smith, Eds. Oxford, UK: Oxford Univ. Press, 1988, pp. 67-78.
    • (1988) Bayesian Statistics 3 , pp. 67-78
    • Bernardo, J.M.1    Giron, F.J.2
  • 7
    • 0024940640 scopus 로고    scopus 로고
    • Unsupervised speaker adaptation by probabilistic spectrum fitting
    • S. Cox and J. Bridle, "Unsupervised speaker adaptation by probabilistic spectrum fitting," in Proc. 1CASSP-89, pp. 294-297.
    • Proc. 1CASSP-89 , pp. 294-297
    • Cox, S.1    Bridle, J.2
  • 9
    • 0002629270 scopus 로고    scopus 로고
    • Maximum likelihood from incomplete data via the EM algorithm
    • A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. Roy. Slat. Soc, Ser. B, vol. 39, no. 1, pp. 1-38.
    • J. Roy. Slat. Soc, Ser. B , vol.39 , Issue.1 , pp. 1-38
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 10
  • 11
    • 0029725604 scopus 로고    scopus 로고
    • A parametric approach to vocal tract length normalization
    • Atlanta, GA, May
    • E. Eide and H. Gish, "A parametric approach to vocal tract length normalization," in Proc. 1CASSP-96, Atlanta, GA, May 1996, pp. 346-349.
    • (1996) Proc. 1CASSP-96 , pp. 346-349
    • Eide, E.1    Gish, H.2
  • 12
    • 0028419019 scopus 로고
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • Apr.
    • J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Processing, vol. 2, pp. 291-298, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 291-298
    • Gauvain, J.-L.1    Lee, C.-H.2
  • 13
    • 0028996947 scopus 로고    scopus 로고
    • Incremental MAP estimation of HMM's for efficient training and improved performance
    • Detroit, MI
    • Y. Gotoh, M. M. Hochberg, D. J. Mashao, and H. F. Silverman, "Incremental MAP estimation of HMM's for efficient training and improved performance," in Proc. ICASSP-95, Detroit, MI, pp. I-457-I-460.
    • Proc. ICASSP-95
    • Gotoh, Y.1    Hochberg, M.M.2    Mashao, D.J.3    Silverman, H.F.4
  • 14
    • 0029377113 scopus 로고    scopus 로고
    • Bayesian adaptive learning of the parameters of hidden Markov model for speech recognition
    • Q. Huo, C. Chan, and C.-H. Lee,'"Bayesian adaptive learning of the parameters of hidden Markov model for speech recognition," IEEE Trans. Speech Audio Processing, vol. 3, no. 5, pp. 334-345.
    • IEEE Trans. Speech Audio Processing , vol.3 , Issue.5 , pp. 334-345
    • Huo, Q.1    Chan, C.2
  • 15
    • 0030105005 scopus 로고    scopus 로고
    • On-line adaptation of the SCHMM parameters based on the segmental quasi-Bayes learning for speech recognition
    • _, "On-line adaptation of the SCHMM parameters based on the segmental quasi-Bayes learning for speech recognition," IEEE Trans. Speech Audio Processing, vol. 4, pp. 141-144, 1996.
    • (1996) IEEE Trans. Speech Audio Processing , vol.4 , pp. 141-144
  • 16
    • 0030359777 scopus 로고    scopus 로고
    • On-line adaptive learning of the correlated continuous density hidden Markov models for speech recognition
    • Philadelphia, PA, Oct.
    • Q. Huo and C.-H. Lee, "On-line adaptive learning of the correlated continuous density hidden Markov models for speech recognition," in Proc. ICSLP-96, Philadelphia, PA, Oct. 1996, pp. 985-988.
    • (1996) Proc. ICSLP-96 , pp. 985-988
    • Huo, Q.1    Lee, C.-H.2
  • 17
    • 0022691022 scopus 로고    scopus 로고
    • Maximum likelihood estimation for multivariate mixture observations of Markov chains
    • B.-H. Juang, S. E. Levinson, and M. M. Sondhi, "Maximum likelihood estimation for multivariate mixture observations of Markov chains," IEEE Trans. Inform. Tlieory, vol. IT-32, no. 2, pp. 307-309.
    • IEEE Trans. Inform. Tlieory , vol.IT-32 , Issue.2 , pp. 307-309
    • Juang, B.-H.1    Levinson, S.E.2    Sondhi, M.M.3
  • 18
    • 0027797470 scopus 로고    scopus 로고
    • On-line estimation of hidden Markov model parameters based on the Kullback-Leibler information measure
    • V. Krishnamurthy and J. B. Moore, "On-line estimation of hidden Markov model parameters based on the Kullback-Leibler information measure," IEEE Trans. Signal Processing, vol. 41, no. 8, pp. 2557-2573.
    • IEEE Trans. Signal Processing , vol.41 , Issue.8 , pp. 2557-2573
    • Krishnamurthy, V.1    Moore, J.B.2
  • 19
    • 0021458298 scopus 로고    scopus 로고
    • A posteriori estimation of correlated jointly Gaussian mean vectors
    • M. J. Lasry and R. M. Stem, "A posteriori estimation of correlated jointly Gaussian mean vectors," IEEE Trans. Pattern Anal. Machine Intell., vol. PAMI-6, no. 4, pp. 530-535.
    • IEEE Trans. Pattern Anal. Machine Intell. , vol.PAMI-6 , Issue.4 , pp. 530-535
    • Lasry, M.J.1    Stem, R.M.2
  • 21
    • 0026142334 scopus 로고
    • A study on speaker adaptation of the parameters of continuous density hidden Markov models
    • Apr.
    • C.-H. Lee, C.-H. Lin, and B.-H. Juang, "A study on speaker adaptation of the parameters of continuous density hidden Markov models," IEEE Trans. Signal Processing, vol. 39, pp. 806-814, Apr. 1991.
    • (1991) IEEE Trans. Signal Processing , vol.39 , pp. 806-814
    • Lee, C.-H.1    Lin, C.-H.2    Juang, B.-H.3
  • 22
    • 0029747183 scopus 로고    scopus 로고
    • Speaker normalization using efficient frequency warping procedures
    • Atlanta, GA
    • L. Lee and R. C. Rose, "Speaker normalization using efficient frequency warping procedures," in Proc. ICASSP-96, Atlanta, GA, pp. 353-356.
    • Proc. ICASSP-96 , pp. 353-356
    • Lee, L.1    Rose, R.C.2
  • 23
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 9, pp. 171-185, 1995.
    • (1995) Comput. Speech Lang. , vol.9 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 24
    • 0020180460 scopus 로고    scopus 로고
    • Maximum likelihood estimation for multivariate observations of Markov sources
    • L. R. Liporace, "Maximum likelihood estimation for multivariate observations of Markov sources," IEEE Trans. Inform. Tlieory, vol. IT-28, pp. 729-734.
    • IEEE Trans. Inform. Tlieory , vol.IT-28 , pp. 729-734
    • Liporace, L.R.1
  • 25
    • 0017553461 scopus 로고    scopus 로고
    • A quasi-Bayes unsupervised learning procedure for priors
    • U. E. Makov and A. F. M. Smith, "A quasi-Bayes unsupervised learning procedure for priors," IEEE Trans. Inform. Tlieory, vol. IT-23, no. 6, pp. 761-764.
    • IEEE Trans. Inform. Tlieory , vol.IT-23 , Issue.6 , pp. 761-764
    • Makov, U.E.1    Smith, A.F.M.2
  • 28
    • 0040262048 scopus 로고    scopus 로고
    • A study of on-line Bayesian adaptation for HMM-based speech recognition
    • Berlin, Germany
    • T. Matsuoka and C.-H. Lee, "A study of on-line Bayesian adaptation for HMM-based speech recognition," in Proc. EUROSPEECH-93, Berlin, Germany, pp. 815-818.
    • Proc. EUROSPEECH-93 , pp. 815-818
    • Matsuoka, T.1    Lee, C.-H.2
  • 29
    • 0024610919 scopus 로고    scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," in Proc. IEEE, vol. 77, no. 2, pp. 257-286.
    • Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 30
    • 0029769867 scopus 로고    scopus 로고
    • Signal bias removal by maximum likelihood estimation for robust telephone speech recognition
    • Jan.
    • M. G. Rahim and B.-H. Juang, "Signal bias removal by maximum likelihood estimation for robust telephone speech recognition," IEEE Trans. Speech Audio Processing, vol. 4, pp. 19-30, Jan. 1996.
    • (1996) IEEE Trans. Speech Audio Processing , vol.4 , pp. 19-30
    • Rahim, M.G.1    Juang, B.-H.2
  • 31
    • 0030127017 scopus 로고    scopus 로고
    • Signal conditioning techniques for robust speech recognition
    • Apr.
    • M. G. Rahim, B.-H. Juang, W. Chou, and E. Buhrke, "Signal conditioning techniques for robust speech recognition," IEEE Signal Processing Lett., vol. 3, pp. 107-109, Apr. 1996.
    • (1996) IEEE Signal Processing Lett. , vol.3 , pp. 107-109
    • Rahim, M.G.1    Juang, B.-H.2    Chou, W.3    Buhrke, E.4
  • 32
    • 0002533801 scopus 로고
    • The empirical Bayes approach to statistical decision problems
    • H. Robbins, "The empirical Bayes approach to statistical decision problems," Ann. Math. Stat., vol. 35, pp. 1-20, 1964.
    • (1964) Ann. Math. Stat. , vol.35 , pp. 1-20
    • Robbins, H.1
  • 33
    • 0030149866 scopus 로고    scopus 로고
    • A maximum likelihood approach to stochastic matching for robust speech recognition
    • A. Sankar and C.-H. Lee, "A maximum likelihood approach to stochastic matching for robust speech recognition," IEEE Trans. Speech Audio Processing, vol. 4, no. 3, pp. 190-202.
    • IEEE Trans. Speech Audio Processing , vol.4 , Issue.3 , pp. 190-202
    • Sankar, A.1    Lee, C.-H.2
  • 34
    • 0001526001 scopus 로고    scopus 로고
    • A quasi-Bayes sequential procedure for mixtures
    • A. F. M. Smith and U. E. Makov, "A quasi-Bayes sequential procedure for mixtures," J. Roy. Slat. Soc., Ser. B, vol. 40, no. 1, pp. 106-112.
    • J. Roy. Slat. Soc., Ser. B , vol.40 , Issue.1 , pp. 106-112
    • Smith, A.F.M.1    Makov, U.E.2
  • 35
    • 0010049861 scopus 로고    scopus 로고
    • A note on the iterative application of Bayes' rule
    • J. Spragins, "A note on the iterative application of Bayes' rule," IEEE Trans. Inform. Theory, vol. IT-11, no. 4, pp. 544-549.
    • IEEE Trans. Inform. Theory , vol.IT-11 , Issue.4 , pp. 544-549
    • Spragins, J.1
  • 36
    • 0028997003 scopus 로고    scopus 로고
    • Vector-field-smoothed Bayesian learning for incremental speaker adaptation
    • Detroit, MI
    • J. Takahashi and S. Sagayama, "Vector-field-smoothed Bayesian learning for incremental speaker adaptation," in Proc. ICASSP-95, Detroit, MI, pp. I-696-I-699.
    • Proc. ICASSP-95
    • Takahashi, J.1    Sagayama, S.2
  • 37
    • 0001593436 scopus 로고    scopus 로고
    • Recursive parameter estimation using incomplete data
    • D. M. Titterington, "Recursive parameter estimation using incomplete data," J. Roy. Stat. Soc., Ser. B, vol. 46, no. 2, pp. 257-267.
    • J. Roy. Stat. Soc., Ser. B , vol.46 , Issue.2 , pp. 257-267
    • Titterington, D.M.1
  • 38
    • 0028997002 scopus 로고    scopus 로고
    • Speaker adaptation based on transfer vector field smoothing using maximum a posteriori probability estimation
    • Detroit, MI
    • M. Tonomura, T. Kosaka, and S. Matsunaga, "Speaker adaptation based on transfer vector field smoothing using maximum a posteriori probability estimation," in Proc. ICASSP-95, Detroit, MI, pp. I-688-I-691.
    • Proc. ICASSP-95
    • Tonomura, M.1    Kosaka, T.2    Matsunaga, S.3
  • 39
    • 0029764708 scopus 로고    scopus 로고
    • Speaker normalization on conversational telephone speech
    • Atlanta, GA
    • S. Wegmann, D. McAlIaster, J. Orloff, and B. Peskin, "Speaker normalization on conversational telephone speech," in Proc. ICASSP-96, Atlanta, GA, pp. 339-341.
    • Proc. ICASSP-96 , pp. 339-341
    • Wegmann, S.1    McAliaster, D.2    Orloff, J.3    Peskin, B.4
  • 40
    • 0025494624 scopus 로고    scopus 로고
    • Sequential algorithms for parameter estimation based on the Kullback-Leibler information measure
    • E. Weinstein, M. Feder, and A. V. Oppenheim, "Sequential algorithms for parameter estimation based on the Kullback-Leibler information measure," IEEE Trans. Acoust, Speech, Signal Processing, vol. 38, no. 9, pp. 1652-1654.
    • IEEE Trans. Acoust, Speech, Signal Processing , vol.38 , Issue.9 , pp. 1652-1654
    • Weinstein, E.1    Feder, M.2    Oppenheim, A.V.3
  • 41
    • 0028460810 scopus 로고    scopus 로고
    • An acoustic-phonetic-based speaker adaptation technique for improving speaker-independent continuous speech recognition
    • Y.-X. Zhao, "An acoustic-phonetic-based speaker adaptation technique for improving speaker-independent continuous speech recognition," IEEE Trans. Speech Audio Processing, vol. 2, no. 3, pp. 380-394.
    • IEEE Trans. Speech Audio Processing , vol.2 , Issue.3 , pp. 380-394
    • Zhao, Y.-X.1
  • 42
    • 0029770844 scopus 로고    scopus 로고
    • Self-learning speaker and channel adaptation based on spectral variation source decomposition
    • Y.-X.' Zhao, "Self-learning speaker and channel adaptation based on spectral variation source decomposition," Speech Commun., vol. 18, pp. 65-77, 1996.
    • (1996) Speech Commun. , vol.18 , pp. 65-77
    • Zhao, Y.-X.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.