



Volume 74, Issue 3, 2014, Pages 341-358

Structural Bayesian linear regression for hidden Markov models

Author keywords

Hidden Markov model; Linear regression; Structural prior; Variational Bayes

Indexed keywords

CONTINUOUS SPEECH RECOGNITION; FORESTRY; LINEAR REGRESSION; SIGNAL PROCESSING; SPEECH PROCESSING; TREES (MATHEMATICS); VARIATIONAL TECHNIQUES

EID: 84897393748     PISSN: 19398018     EISSN: 19398115     Source Type: Journal    
DOI: 10.1007/s11265-013-0785-8     Document Type: Article
Times cited: 11

References (47)
  • 1
    • 0029288633
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • 10.1006/csla.1995.0010
    • Leggetter, C.J.; & Woodland, P.C. (1995). Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models. Computer Speech and Language, 9, 171-185.
    • (1995) Computer Speech and Language , vol.9 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 3
    • 0000159105
    • On adaptive decision rules and decision parameter adaptation for automatic speech recognition
    • Lee, C.-H.; & Huo, Q. (2000). On adaptive decision rules and decision parameter adaptation for automatic speech recognition. In Proceedings of the IEEE (Vol. 88, pp. 1241-1269).
    • (2000) Proceedings of the IEEE , vol.88 , pp. 1241-1269
    • Lee, C.-H.1    Huo, Q.2
  • 4
    • 77956865237
    • Acoustic model adaptation for speech recognition
    • 10.1587/transinf.E93.D.2348
    • Shinoda, K. (2010). Acoustic model adaptation for speech recognition. IEICE Transactions on Information and Systems, 93(9), 2348-2362.
    • (2010) IEICE Transactions on Information and Systems , vol.93 , Issue.9 , pp. 2348-2362
    • Shinoda, K.1
  • 5
    • 0030149866
    • A maximum-likelihood approach to stochastic matching for robust speech recognition
    • Sankar, A.; & Lee, C.-H. (1996). A maximum-likelihood approach to stochastic matching for robust speech recognition. IEEE Transactions on Speech and Audio Processing, 4(3), 190-202.
    • (1996) IEEE Transactions on Speech and Audio Processing , vol.4 , Issue.3 , pp. 190-202
    • Sankar, A.1    Lee, C.-H.2
  • 6
    • 0030643678
    • Improved Bayesian learning of hidden Markov models for speaker adaptation
    • IEEE
    • Chien, J.-T.; Lee, C.-H.; Wang, H.-C. (1997). Improved Bayesian learning of hidden Markov models for speaker adaptation. In Proceedings of ICASSP (Vol. 2, pp. 1027-1030). IEEE.
    • (1997) Proceedings of ICASSP , vol.2 , pp. 1027-1030
    • Chien, J.-T.1    Lee, C.-H.2    Wang, H.-C.3
  • 7
    • 85009097035
    • Fast speaker adaptation using eigenspace-based maximum likelihood linear regression
    • Chen, K.-T.; Liau, W.-W.; Wang, H.-W.; Lee, L.-S. (2000). Fast speaker adaptation using eigenspace-based maximum likelihood linear regression. In Proceedings of ICSLP (Vol. 3, pp. 742-745).
    • (2000) Proceedings of ICSLP , vol.3 , pp. 742-745
    • Chen, K.-T.1    Liau, W.-W.2    Wang, H.-W.3    Lee, L.-S.4
  • 9
    • 70350450398
    • Static and dynamic variance compensation for recognition of reverberant speech with dereverberation preprocessing
    • Delcroix, M.; Nakatani, T.; Watanabe, S. (2009). Static and dynamic variance compensation for recognition of reverberant speech with dereverberation preprocessing. IEEE Transactions on Audio, Speech and Language Processing, 17(2), 324-334.
    • (2009) IEEE Transactions on Audio, Speech and Language Processing , vol.17 , Issue.2 , pp. 324-334
    • Delcroix, M.1    Nakatani, T.2    Watanabe, S.3
  • 10
    • 0034842740
    • Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR
    • Tamura, M.; Masuko, T.; Tokuda, K.; Kobayashi, T. (2001). Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR. In Proceedings of ICASSP (Vol. 2, pp. 806-808).
    • (2001) Proceedings of ICASSP , vol.2 , pp. 806-808
    • Tamura, M.1    Masuko, T.2    Tokuda, K.3    Kobayashi, T.4
  • 12
    • 27744546990
    • On transforming statistical models for non-frontal face verification
    • 10.1016/j.patcog.2005.07.001
    • Sanderson, C.; Bengio, S.; Gao, Y. (2006). On transforming statistical models for non-frontal face verification. Pattern Recognition, 39(2), 288-302.
    • (2006) Pattern Recognition , vol.39 , Issue.2 , pp. 288-302
    • Sanderson, C.1    Bengio, S.2    Gao, Y.3
  • 15
    • 85135272864
    • Maximum a posteriori linear regression for hidden Markov model adaptation
    • Chesta, C.; Siohan, O.; Lee, C.-H. (1999). Maximum a posteriori linear regression for hidden Markov model adaptation. In Proceedings of Eurospeech (Vol. 1, pp. 211-214).
    • (1999) Proceedings of Eurospeech , vol.1 , pp. 211-214
    • Chesta, C.1    Siohan, O.2    Lee, C.-H.3
  • 16
    • 0036649879
    • Quasi-Bayes linear regression for sequential learning of hidden Markov models
    • 10.1109/TSA.2002.800555 1009230
    • Chien, J.-T. (2002). Quasi-Bayes linear regression for sequential learning of hidden Markov models. IEEE Transactions on Speech and Audio Processing, 10(5), 268-278.
    • (2002) IEEE Transactions on Speech and Audio Processing , vol.10 , Issue.5 , pp. 268-278
    • Chien, J.-T.1
  • 18
    • 0036461005
    • Structural maximum a posteriori linear regression for fast HMM adaptation
    • 10.1006/csla.2001.0181
    • Siohan, O.; Myrvoll, T.A.; Lee, C.H. (2002). Structural maximum a posteriori linear regression for fast HMM adaptation. Computer Speech & Language, 16(1), 5-24.
    • (2002) Computer Speech & Language , vol.16 , Issue.1 , pp. 5-24
    • Siohan, O.1    Myrvoll, T.A.2    Lee, C.H.3
  • 20
    • 0002788893
    • A view of the EM algorithm that justifies incremental, sparse, and other variants
    • Neal, R.M.; & Hinton, G.E. (1998). A view of the EM algorithm that justifies incremental, sparse, and other variants. Learning in Graphical Models, 355-368.
    • (1998) Learning in Graphical Models , pp. 355-368
    • Neal, R.M.1    Hinton, G.E.2
  • 21
    • 0033225865
    • An introduction to variational methods for graphical models
    • 10.1023/A:1007665907178 0945.68164
    • Jordan, M.I.; Ghahramani, Z.; Jaakkola, T.S.; Saul, L.K. (1999). An introduction to variational methods for graphical models. Machine Learning, 37(2), 183-233.
    • (1999) Machine Learning , vol.37 , Issue.2 , pp. 183-233
    • Jordan, M.I.1    Ghahramani, Z.2    Jaakkola, T.S.3    Saul, L.K.4
  • 22
    • 0003278032
    • Inferring parameters and structure of latent variable models by variational Bayes
    • Attias, H. (1999). Inferring parameters and structure of latent variable models by variational Bayes. In Proceedings of Uncertainty in Artificial Intelligence (UAI) (Vol. 15, pp. 21-30).
    • (1999) Proceedings of Uncertainty in Artificial Intelligence (UAI) , vol.15 , pp. 21-30
    • Attias, H.1
  • 23
    • 0036887504
    • Bayesian model search for mixture models based on optimizing variational bounds
    • 10.1016/S0893-6080(02)00040-0
    • Ueda, N.; & Ghahramani, Z. (2002). Bayesian model search for mixture models based on optimizing variational bounds. Neural Networks, 15, 1223-1241.
    • (2002) Neural Networks , vol.15 , pp. 1223-1241
    • Ueda, N.1    Ghahramani, Z.2
  • 24
    • 79957689964
    • Application of variational Bayesian approach to speech recognition
    • MIT Press
    • Watanabe, S.; Minami, Y.; Nakamura, A.; Ueda, N. (2002). Application of variational Bayesian approach to speech recognition. NIPS 2002: MIT Press.
    • (2002) NIPS 2002
    • Watanabe, S.1    Minami, Y.2    Nakamura, A.3    Ueda, N.4
  • 25
    • 85009174866
    • Variational Bayesian GMM for speech recognition
    • Valente, F.; & Wellekens, C. (2003). Variational Bayesian GMM for speech recognition. In Proceedings of Eurospeech (pp. 441-444).
    • (2003) Proceedings of Eurospeech , pp. 441-444
    • Valente, F.1    Wellekens, C.2
  • 27
    • 82455177766
    • Comparison of ML, MAP, and VB based acoustic models in large vocabulary speech recognition
    • Somervuo, P. (2004). Comparison of ML, MAP, and VB based acoustic models in large vocabulary speech recognition. In Proceedings of ICSLP (Vol. 1, pp. 830-833).
    • (2004) Proceedings of ICSLP , vol.1 , pp. 830-833
    • Somervuo, P.1
  • 28
    • 4544253566
    • Automatic generation of non-uniform HMM structures based on variational Bayesian approach
    • Jitsuhiro, T.; & Nakamura, S. (2004). Automatic generation of non-uniform HMM structures based on variational Bayesian approach. In Proceedings of ICASSP (Vol. 1, pp. 805-808).
    • (2004) Proceedings of ICASSP , vol.1 , pp. 805-808
    • Jitsuhiro, T.1    Nakamura, S.2
  • 29
    • 84867213785
    • Bayesian context clustering using cross valid prior distribution for HMM-based speech recognition
    • Hashimoto, K.; Zen, H.; Nankaku, Y.; Lee, A.; Tokuda, K. (2008). Bayesian context clustering using cross valid prior distribution for HMM-based speech recognition. In Proceedings of Interspeech.
    • (2008) Proceedings of Interspeech
    • Hashimoto, K.1    Zen, H.2    Nankaku, Y.3    Lee, A.4    Tokuda, K.5
  • 30
    • 51449089545
    • Weighted distance measures for efficient reduction of Gaussian mixture components in HMM-based acoustic model
    • Ogawa, A.; & Takahashi, S. (2008). Weighted distance measures for efficient reduction of Gaussian mixture components in HMM-based acoustic model. In Proceedings of ICASSP (pp. 4173-4176).
    • (2008) Proceedings of ICASSP , pp. 4173-4176
    • Ogawa, A.1    Takahashi, S.2
  • 31
    • 78049394635
    • Variational nonparametric Bayesian hidden Markov model
    • Ding, N.; & Ou, Z. (2010). Variational nonparametric Bayesian hidden Markov model. In Proceedings of ICASSP (pp. 2098-2101).
    • (2010) Proceedings of ICASSP , pp. 2098-2101
    • Ding, N.1    Ou, Z.2
  • 32
    • 85009135071
    • Acoustic model adaptation based on coarse/fine training of transfer vectors and its application to a speaker adaptation task
    • Watanabe, S.; & Nakamura, A. (2004). Acoustic model adaptation based on coarse/fine training of transfer vectors and its application to a speaker adaptation task. In Proceedings of ICSLP (pp. 2933-2936).
    • (2004) Proceedings of ICSLP , pp. 2933-2936
    • Watanabe, S.1    Nakamura, A.2
  • 33
    • 33947643186
    • Incremental adaptation using Bayesian inference
    • Yu, K.; & Gales, M.J.F. (2006). Incremental adaptation using Bayesian inference. In Proceedings of ICASSP (Vol. 1, pp. 217-220).
    • (2006) Proceedings of ICASSP , vol.1 , pp. 217-220
    • Yu, K.1    Gales, M.J.F.2
  • 36
    • 0000043041
    • Some matrix-variate distribution theory: Notational considerations and a Bayesian application
    • 10.1093/biomet/68.1.265 0464.62039 614963
    • Dawid, A.P. (1981). Some matrix-variate distribution theory: notational considerations and a Bayesian application. Biometrika, 68(1), 265-274.
    • (1981) Biometrika , vol.68 , Issue.1 , pp. 265-274
    • Dawid, A.P.1
  • 37
    • 82455212515
    • Bayesian linear regression for hidden Markov model based on optimizing variational bounds
    • Watanabe, S.; Nakamura, A.; Juang, B.H. (2011). Bayesian linear regression for hidden Markov model based on optimizing variational bounds. In Proceedings of MLSP (pp. 1-6).
    • (2011) Proceedings of MLSP , pp. 1-6
    • Watanabe, S.1    Nakamura, A.2    Juang, B.H.3
  • 40
    • 78049488169
    • Evaluation of the SOLON speech recognition system: 2006 benchmark using the corpus of spontaneous Japanese
    • (in Japanese)
    • Nakamura, A.; Oba, T.; Watanabe, S.; Ishizuka, K.; Fujimoto, M.; Hori, T.; McDermott, E.; Minami, Y. (2006). Evaluation of the SOLON speech recognition system: 2006 benchmark using the corpus of spontaneous Japanese. IPSJ SIG Notes, 2006(136), 251-256. (in Japanese).
    • (2006) IPSJ SIG Notes , Issue.136 , pp. 251-256
    • Nakamura, A.1    Oba, T.2    Watanabe, S.3    Ishizuka, K.4    Fujimoto, M.5    Hori, T.6    McDermott, E.7    Minami, Y.8
  • 43
    • 84878411087
    • Speaker adaptation using variational Bayesian linear regression in normalized feature space
    • Hahm, S.J.; Ogawa, A.; Fujimoto, M.; Hori, T.; Nakamura, A. (2012). Speaker adaptation using variational Bayesian linear regression in normalized feature space. In Proceedings of Interspeech (pp. 803-806).
    • (2012) Proceedings of Interspeech , pp. 803-806
    • Hahm, S.J.1    Ogawa, A.2    Fujimoto, M.3    Hori, T.4    Nakamura, A.5
  • 44
    • 84890534474
    • Feature space variational Bayesian linear regression and its combination with model space VBLR
    • Hahm, S.J.; Ogawa, A.; Fujimoto, M.; Hori, T.; Nakamura, A. (2013). Feature space variational Bayesian linear regression and its combination with model space VBLR. In Proceedings of ICASSP (pp. 7898-7902).
    • (2013) Proceedings of ICASSP , pp. 7898-7902
    • Hahm, S.J.1    Ogawa, A.2    Fujimoto, M.3    Hori, T.4    Nakamura, A.5
  • 45
    • 84867186048
    • Variational inference for Dirichlet process mixtures
    • Blei, D.M.; & Jordan, M.I. (2006). Variational inference for Dirichlet process mixtures. Bayesian Analysis, 1(1), 121-144.
    • (2006) Bayesian Analysis , vol.1 , Issue.1 , pp. 121-144
    • Blei, D.M.1    Jordan, M.I.2
  • 47
    • 79959828521
    • A regularized discriminative training method of acoustic models derived by minimum relative entropy discrimination
    • Kubo, Y.; Watanabe, S.; Nakamura, A.; Kobayashi, T. (2010). A regularized discriminative training method of acoustic models derived by minimum relative entropy discrimination. In Proceedings of Interspeech (pp. 2954-2957).
    • (2010) Proceedings of Interspeech , pp. 2954-2957
    • Kubo, Y.1    Watanabe, S.2    Nakamura, A.3    Kobayashi, T.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.