SCOPUS 정보 검색 플랫폼

Journal of Signal Processing Systems

Volumn 74, Issue 3, 2014, Pages 341-358

Structural bayesian linear regression for hidden markov models

(3) Watanabe, Shinji a Nakamura, Atsushi b Juang, Biing Hwang c

a MITSUBISHI ELECTRIC RESEARCH LABORATORIES (United States)

b Nippon Telegraph and Telephone Corporation (Japan)

c GEORGIA INSTITUTE OF TECHNOLOGY (United States)

Author keywords

Hidden Markov model; Linear regression; Structural prior; Variational bayes

Indexed keywords

CONTINUOUS SPEECH RECOGNITION; FORESTRY; LINEAR REGRESSION; SIGNAL PROCESSING; SPEECH PROCESSING; TREES (MATHEMATICS); VARIATIONAL TECHNIQUES;

ADAPTIVE TRAINING; GAUSSIAN CLUSTERS; LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION; OBJECTIVE FUNCTIONS; REGRESSION PARAMETERS; STRUCTURAL PRIOR; TIME SERIES PATTERNS; VARIATIONAL BAYES;

HIDDEN MARKOV MODELS;

COMMUNICATION; FORESTRY; MATHEMATICAL MODELS; PATTERN RECOGNITION; REGRESSION ANALYSIS; STRUCTURAL ANALYSIS;

EID: 84897393748 PISSN: 19398018 EISSN: 19398115 Source Type: Journal
DOI: 10.1007/s11265-013-0785-8 Document Type: Article

Times cited : (11)

References (47)

1
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- 10.1006/csla.1995.0010
- Leggetter, C.J.; & Woodland, P.C. (1995). Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models. Computer Speech and Language, 9, 171-185.
- (1995) Computer Speech and Language , vol.9 , pp. 171-185
- Leggetter, C.J.¹ Woodland, P.C.²

2
- 0029375590
- Speaker adaptation using constrained reestimation of Gaussian mixtures
- Digalakis, V.; Ritischev, D.; Neumeyer, L. (1995). Speaker adaptation using constrained reestimation of Gaussian mixtures. IEEE Transactions on Speech and Audio Processing, 3, 357-366.
- (1995) IEEE Transactions on Speech and Audio Processing , vol.3 , pp. 357-366
- Digalakis, V.¹ Ritischev, D.² Neumeyer, L.³

3
- 0000159105
- On adaptive decision rules and decision parameter adaptation for automatic speech recognition
- Lee, C.-H.; & Huo, Q. (2000). On adaptive decision rules and decision parameter adaptation for automatic speech recognition. In Proceedings of the IEEE (Vol. 88, pp. 1241-1269).
- (2000) Proceedings of the IEEE , vol.88 , pp. 1241-1269
- Lee C., .-H.¹ Huo, Q.²

4
- 77956865237
- Acoustic model adaptation for speech recognition
- 10.1587/transinf.E93.D.2348
- Shinoda, K. (2010). Acoustic model adaptation for speech recognition. IEICE Transactions on Information and Systems, 93(9), 2348-2362.
- (2010) IEICE Transactions on Information and Systems , vol.93 , Issue.9 , pp. 2348-2362
- Shinoda, K.¹

5
- 0030149866
- A maximum-likelihood approach to stochastic matching for robust speech recognition
- Sankar, A.; & Lee, C.-H. (1996). A maximum-likelihood approach to stochastic matching for robust speech recognition. IEEE Transactions on Speech and Audio Processing, 4(3), 190-202.
- (1996) IEEE Transactions on Speech and Audio Processing , vol.4 , Issue.3 , pp. 190-202
- Sankar, A.¹ Lee C., .-H.²

6
- 0030643678
- Improved bayesian learning of hidden Markov models for speaker adaptation
- IEEE
- Chien, J.-T.; Lee, C.-H.; Wang, H.-C. (1997). Improved bayesian learning of hidden Markov models for speaker adaptation. In Processing of ICASSP (Vol. 2, pp. 1027-1030). IEEE
- (1997) Processing of ICASSP , vol.2 , pp. 1027-1030
- Chien J., .-T.¹ Lee C., .-H.² Wang H., .-C.³

7
- 85009097035
- Fast speaker adaptation using eigenspace-based maximum likelihood linear regression
- Chen, K.-T.; Liau, W.-W.; Wang, H.-W.; Lee, L.-S. (2000). Fast speaker adaptation using eigenspace-based maximum likelihood linear regression. In Proceedings of ICSLP (Vol. 3, pp. 742-745).
- (2000) Proceedings of ICSLP , vol.3 , pp. 742-745
- Chen K., .-T.¹ Liau W., .-W.² Wang H., .-W.³ Lee L. -S. (⁴

8
- 27644511614
- Kernel eigenvoice speaker adaptation
- 10.1109/TSA.2005.851971
- Mak, B.; Kwok, J.T.; Ho, S. (2005). Kernel eigenvoice speaker adaptation. IEEE Transactions on Speech and Audio Processing, 13(5), 984-992.
- (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.5 , pp. 984-992
- Mak, B.¹ Kwok, J.T.² Ho, S.³

9
- 70350450398
- Static and dynamic variance compensation for recognition of reverberant speech with dereverberation preprocessing
- Delcroix, M.; Nakatani, T.; Watanabe, S. (2009). Static and dynamic variance compensation for recognition of reverberant speech with dereverberation preprocessing. IEEE Transactions on Audio, Speech and Language Processing, 17(2), 324-334.
- (2009) IEEE Transactions on Audio, Speech and Language Processing , vol.17 , Issue.2 , pp. 324-334
- Delcroix, M.¹ Nakatani, T.² Watanabe, S.³

10
- 0034842740
- Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR
- Tamura, M.; Masuko, T.; Tokuda, K.; Kobayashi, T. (2001). Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR. In Proceedings of ICASSP (Vol. 2, pp. 806-808).
- (2001) Proceedings of ICASSP , vol.2 , pp. 806-808
- Tamura, M.¹ Masuko, T.² Tokuda, K.³ Kobayashi, T.⁴

11
- 33745216683
- MLLR transforms as features in speaker recognition
- Stolcke, A.; Ferrer, L.; Kajarekar, S.; Shriberg, E.; Venkataraman, A. (2005). MLLR transforms as features in speaker recognition. In Proceedings of Interspeech (pp. 2425-2428).
- (2005) Proceedings of Interspeech , pp. 2425-2428
- Stolcke, A.¹ Ferrer, L.² Kajarekar, S.³ Shriberg, E.⁴ Venkataraman, A.⁵

12
- 27744546990
- On transforming statistical models for non-frontal face verification
- 10.1016/j.patcog.2005.07.001
- Sanderson, C.; Bengio, S.; Gao, Y. (2006). On transforming statistical models for non-frontal face verification. Pattern Recognition, 39(2), 288-302.
- (2006) Pattern Recognition , vol.39 , Issue.2 , pp. 288-302
- Sanderson, C.¹ Bengio, S.² Gao, Y.³

13
- 80051981740
- Unsupervised activity recognition with user's physical characteristics data
- Maekawa, T.; & Watanabe, S. (2011). Unsupervised activity recognition with user's physical characteristics data. In Proceedings of international symposium on wearable computers (ISWC 2011), (pp. 89-96).
- (2011) Proceedings of International Symposium on Wearable Computers (ISWC 2011) , pp. 89-96
- Maekawa, T.¹ Watanabe, S.²

14
- 0003571976
- Cambridge University Engineering Department Cambridge
- Young, S.; Evermann, G.; Gales, M.; Hain, T.; Kershaw, D.; Liu, X.; Moore, G.; Odell, J.; Ollason, D.; Povey, D. (2006). The HTK book (for HTK version 3.4). Cambridge: Cambridge University Engineering Department.
- (2006) The HTK Book (For HTK Version 3.4)
- Young, S.¹ Evermann, G.² Gales, M.³ Hain, T.⁴ Kershaw, D.⁵ Liu, X.⁶ Moore, G.⁷ Odell, J.⁸ Ollason, D.⁹ Povey, D.¹⁰

15
- 85135272864
- Maximum a posteriori linear regression for hidden Markov model adaptation
- Chesta, C.; Siohan, O.; Lee, C.-H. (1999). Maximum a posteriori linear regression for hidden Markov model adaptation. In Proceedings of Eurospeech (Vol. 1, pp. 211-214).
- (1999) Proceedings of Eurospeech , vol.1 , pp. 211-214
- Chesta, C.¹ Siohan, O.² Lee C., .-H.³

16
- 0036649879
- Quasi-Bayes linear regression for sequential learning of hidden Markov models
- 10.1109/TSA.2002.800555 1009230
- Chien, J.-T. (2002). Quasi-Bayes linear regression for sequential learning of hidden Markov models. IEEE Transactions on Speech and Audio Processing, 10(5), 268-278
- (2002) IEEE Transactions on Speech and Audio Processing , vol.10 , Issue.5 , pp. 268-278
- Chien, J.-T.¹

17
- 0035279111
- A structural Bayes approach to speaker adaptation
- Shinoda, K.; & Lee, C.-H. (2001). A structural Bayes approach to speaker adaptation. IEEE Transactions on Speech and Audio Processing, 9, 276-287).
- (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , pp. 276-287
- Shinoda, K.¹ Lee C., .-H.²

18
- 0036461005
- Structural maximum a posteriori linear regression for fast HMM adaptation
- 10.1006/csla.2001.0181
- Siohan, O.; Myrvoll, T.A.; Lee, C.H. (2002). Structural maximum a posteriori linear regression for fast HMM adaptation. Computer Speech & Language, 16(1), 5-24.
- (2002) Computer Speech & Language , vol.16 , Issue.1 , pp. 5-24
- Siohan, O.¹ Myrvoll, T.A.² Lee, C.H.³

19
- 0003598536
- University of Cambridge Cavendish Laboratory
- MacKay, D.J.C. (1997). Ensemble learning for hidden Markov models. Technical report, Cavendish Laboratory: University of Cambridge.
- (1997) Ensemble Learning for Hidden Markov Models. Technical Report, Technical Report
- Mackay, D.J.C.¹

20
- 0002788893
- A view of the em algorithm that justifies incremental, sparse, and other variants
- Neal, R.M.; & Hinton, G.E. (1998). A view of the EM algorithm that justifies incremental, sparse, and other variants. Learning in Graphical Models, 355-368.
- (1998) Learning in Graphical Models , pp. 355-368
- Neal R., .M.¹ Hinton G., .E.²

21
- 0033225865
- An introduction to variational methods for graphical models
- 10.1023/A:1007665907178 0945.68164
- Jordan, M.I.; Ghahramani, Z.; Jaakkola, T.S.; Saul, L.K. (1999). An introduction to variational methods for graphical models. Machine Learning, 37(2), 183-233.
- (1999) Machine Learning , vol.37 , Issue.2 , pp. 183-233
- Jordan, M.I.¹ Ghahramani, Z.² Jaakkola, T.S.³ Saul, L.K.⁴

22
- 0003278032
- Inferring parameters structure of latent variable models by variational Bayes
- Attias, H. (1999). Inferring parameters structure of latent variable models by variational Bayes. In Proceedings of uncertainty in artificial intelligence (UAI) (Vol. 15, pp. 21-30).
- (1999) Proceedings of Uncertainty in Artificial Intelligence (UAI) , vol.15 , pp. 21-30
- Attias, H.¹

23
- 0036887504
- Bayesian model search for mixture models based on optimizing variational bounds
- 10.1016/S0893-6080(02)00040-0
- Ueda, N.; & Ghahramani, Z. (2002). Bayesian model search for mixture models based on optimizing variational bounds. Neural Networks, 15, 1223-1241.
- (2002) Neural Networks , vol.15 , pp. 1223-1241
- Ueda, N.¹ Ghahramani, Z.²

24
- 79957689964
- Application of variational Bayesian approach to speech recognition
- MIT Press
- Watanabe, S.; Minami, Y.; Nakamura, A.; Ueda, N. (2002). Application of variational Bayesian approach to speech recognition. NIPS 2002: MIT Press.
- (2002) NIPS 2002
- Watanabe, S.¹ Minami, Y.² Nakamura, A.³ Ueda, N.⁴

25
- 85009174866
- Variational Bayesian GMM for speech recognition
- Valente, F.; & Wellekens, C. (2003). Variational Bayesian GMM for speech recognition. In Proceedings of Eurospeech (pp. 441-444).
- (2003) Proceedings of Eurospeech , pp. 441-444
- Valente, F.¹ Wellekens, C.²

26
- 3042741069
- Variational bayesian estimation and clustering for speech recognition
- Watanabe, S.; Minami, Y.; Nakamura, A.; Ueda, N. (2004). Variational bayesian estimation and clustering for speech recognition. IEEE Transactions on Speech and Audio Processing, 12, 365-381.
- (2004) IEEE Transactions on Speech and Audio Processing , vol.12 , pp. 365-381
- Watanabe, S.¹ Minami, Y.² Nakamura, A.³ Ueda, N.⁴

27
- 82455177766
- Comparison of ML, MAP, and VB based acoustic models in large vocabulary speech recognition
- Somervuo, P. (2004). Comparison of ML, MAP, and VB based acoustic models in large vocabulary speech recognition. In Proceedings of ICSL (Vol. 1, pp. 830-833).
- (2004) Proceedings of ICSL , vol.1 , pp. 830-833
- Somervuo, P.¹

28
- 4544253566
- Automatic generation of non-uniform HMM structures based on variational Bayesian approach
- Jitsuhiro, T.; & Nakamura, S. (2004). Automatic generation of non-uniform HMM structures based on variational Bayesian approach. In Proceedings of ICASSP (Vol. 1, pp. 805-808).
- (2004) Proceedings of ICASSP , vol.1 , pp. 805-808
- Jitsuhiro, T.¹ Nakamura, S.²

29
- 84867213785
- Bayesian context clustering using cross valid prior distribution for HMM-based speech recognition
- Hashimoto, K.; Zen, H.; Nankaku, Y.; Lee, A.; Tokuda, K. (2008). Bayesian context clustering using cross valid prior distribution for HMM-based speech recognition. In Proceedings of Interspeech.
- (2008) Proceedings of Interspeech
- Hashimoto, K.¹ Zen, H.² Nankaku, Y.³ Lee, A.⁴ Tokuda, K.⁵

30
- 51449089545
- Weighted distance measures for efficient reduction of Gaussian mixture components in HMM-based acoustic model
- Ogawa, A.; & Takahashi, S. (2008). Weighted distance measures for efficient reduction of Gaussian mixture components in HMM-based acoustic model. In Proceedings of ICASSP (pp. 4173-4176).
- (2008) Proceedings of ICASSP , pp. 4173-4176
- Ogawa, A.¹ Takahashi, S.²

31
- 78049394635
- Variational nonparametric Bayesian hidden Markov model
- Ding, N.; & Ou, Z. (2010). Variational nonparametric Bayesian hidden Markov model. In Proceedings of ICASSP (pp. 2098-2101).
- (2010) Proceedings of ICASSP , pp. 2098-2101
- Ding, N.¹ Ou, Z.²

32
- 85009135071
- Acoustic model adaptation based on coarse/fine training of transfer vectors and its application to a speaker adaptation task
- Watanabe, S.; & Nakamura, A. (2004). Acoustic model adaptation based on coarse/fine training of transfer vectors and its application to a speaker adaptation task. In Proceedings of ICSLP (pp. 2933-2936).
- (2004) Proceedings of ICSLP , pp. 2933-2936
- Watanabe, S.¹ Nakamura, A.²

33
- 33947643186
- Incremental adaptation using bayesian inference
- Yu, K.; & Gales, M.J.F. (2006). Incremental adaptation using bayesian inference. In Proceedings of ICASSP (Vol. 1, pp. 217-220).
- (2006) Proceedings of ICASSP , vol.1 , pp. 217-220
- Yu, K.¹ Gales M. .J., .F.²

34
- 21844450606
- Variational message passing
- 2249835
- Winn, J.; & Bishop, C.M. (2006). Variational message passing. Journal of Machine Learning Research, 6(1), 661.
- (2006) Journal of Machine Learning Research , vol.6 , Issue.1 , pp. 661
- Winn, J.¹ Bishop, C.M.²

35
- 1642372928
- Variance compensation within the MLLR framework
- Gales, M.J.F.; & Woodland, P.C. (1996). Variance compensation within the MLLR framework, Technical Report 242: Cambridge University Engineering Department.
- (1996) Technical Report 242, Cambridge University Engineering Department
- Gales M. .J., .F.¹ Woodland P., .C.²

36
- 0000043041
- Some matrix-variate distribution theory: Notational considerations and a Bayesian application
- 10.1093/biomet/68.1.265 0464.62039 614963
- Dawid, A.P. (1981). Some matrix-variate distribution theory: notational considerations and a Bayesian application. Biometrika, 68(1), 265-274.
- (1981) Biometrika , vol.68 , Issue.1 , pp. 265-274
- Dawid, A.P.¹

37
- 82455212515
- Bayesian linear regression for hidden Markov model based on optimizing variational bounds
- Watanabe, S.; Nakamura, A.; Juang, B.H. (2011). Bayesian linear regression for hidden Markov model based on optimizing variational bounds. In Proceedings of MLSP (pp. 1-6).
- (2011) Proceedings of MLSP , pp. 1-6
- Watanabe, S.¹ Nakamura, A.² Juang B., .H.³

38
- 0003805597
- PhD thesis: Cambridge University
- Odell, J.J. (1995). The use of context in large vocabulary speech recognition. PhD thesis: Cambridge University.
- (1995) The Use of Context in Large Vocabulary Speech Recognition
- Odell, J.J.¹

39
- 85037142779
- Spontaneous speech corpus of Japanese
- Maekawa, K.; Koiso, H.; Furui, S.; Isahara, H. (2000). Spontaneous speech corpus of Japanese. In Proceedings of LREC (Vol. 2, pp. 947-952).
- (2000) Proceedings of LREC , vol.2 , pp. 947-952
- Maekawa, K.¹ Koiso, H.² Furui, S.³ Isahara, H.⁴

40
- 78049488169
- Evaluation of the SOLON speech recognition system: 2006 benchmark using the corpus of spontaneous Japanese
- (in Japanese)
- Nakamura, A.; Oba, T.; Watanabe, S.; Ishizuka, K.; Fujimoto, M.; Hori, T.; McDermott, E.; Minami, Y. (2006). Evaluation of the SOLON speech recognition system: 2006 benchmark using the corpus of spontaneous japanese. IPSJ SIG Notes, 2006(136), 251-256. (in Japanese).
- (2006) IPSJ SIG Notes , Issue.136 , pp. 251-256
- Nakamura, A.¹ Oba, T.² Watanabe, S.³ Ishizuka, K.⁴ Fujimoto, M.⁵ Hori, T.⁶ McDermott, E.⁷ Minami, Y.⁸

41
- 34547522070
- Discriminative training for large-vocabulary speech recognition using minimum classification error
- McDermott, E.; Hazen, T.J.; Le Roux, J.; Nakamura, A.; Katagiri, S. (2007). Discriminative training for large-vocabulary speech recognition using minimum classification error. IEEE Transactions on Audio, Speech, and Language Processing, 15(1), 203-223.
- (2007) IEEE Transactions on Audio, Speech, and Language Processing , vol.15 , Issue.1 , pp. 203-223
- McDermott, E.¹ Hazen T., .J.² Le Roux, J.³ Nakamura, A.⁴ Katagiri, S.⁵

42
- 33645758265
- NTT speech recognizer with outlook on the next generation: SOLON
- Hori, T. (2004). NTT speech recognizer with outlook on the next generation: SOLON. In Proceedings of NTT workshop on communication scene analysis (Vol. 1, p. SP-6.)
- (2004) Proceedings of NTT Workshop on Communication Scene Analysis , vol.1
- Hori, T.¹

43
- 84878411087
- Speaker adaptation using variational Bayesian linear regression in normalized feature space
- Hahm, S.J.; Ogawa, A.; Fujimoto, M.; Hori, T.; Nakamura, A. (2012). Speaker adaptation using variational Bayesian linear regression in normalized feature space. In Proceedings of Interspeech (pp. 803-806).
- (2012) Proceedings of Interspeech , pp. 803-806
- Hahm S., .J.¹ Ogawa, A.² Fujimoto, M.³ Hori, T.⁴ Nakamura, A.⁵

44
- 84890534474
- Feature space variational Bayesian linear regression and its combination with model space VBLR
- Hahm, S.J.; Ogawa, A.; Fujimoto, M.; Hori, T.; Nakamura, A. (2013). Feature space variational Bayesian linear regression and its combination with model space VBLR. In Proceedings of ICASSP (pp. 7898-7902).
- (2013) Proceedings of ICASSP , pp. 7898-7902
- Hahm S., .J.¹ Ogawa, A.² Fujimoto, M.³ Hori, T.⁴ Nakamura, A.⁵

45
- 84867186048
- Variational inference for Dirichlet process mixtures
- Blei, D.M.; & Jordan, M.I. (2006). Variational inference for Dirichlet process mixtures. Bayesian Analysis, 1(1), 121-144.
- (2006) Bayesian Analysis , vol.1 , Issue.1 , pp. 121-144
- Blei D., .M.¹ Jordan M., .I.²

46
- 4344625659
- Springer
- Jebara, T. (2004). Machine learning: discriminative and generative (Vol. 755). Springer.
- (2004) Machine Learning: Discriminative and Generative , vol.755
- Jebara, T.¹

47
- 79959828521
- A regularized discriminative training method of acoustic models derived by minimum relative entropy discrimination
- Kubo, Y.; Watanabe, S.; Nakamura, A.; Kobayashi, T. (2010). A regularized discriminative training method of acoustic models derived by minimum relative entropy discrimination. In Proceedings of Interspeech (pp. 2954-2957).
- (2010) Proceedings of Interspeech , pp. 2954-2957
- Kubo, Y.¹ Watanabe, S.² Nakamura, A.³ Kobayashi, T.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.