SCOPUS 정보 검색 플랫폼

APSIPA Transactions on Signal and Information Processing

Volumn 1, Issue , 2012, Pages

Bayesian approaches to acoustic modeling: A review

(2) Watanabe, Shinji a Nakamura, Atsushi b

a MITSUBISHI ELECTRIC RESEARCH LABORATORIES (United States)

b Nippon Telegraph and Telephone Corporation (Japan)

Author keywords

Approximate bayesian inference; Bayesian approach; Machine learning; Speech processing

Indexed keywords

ARTIFICIAL INTELLIGENCE; BAYESIAN NETWORKS; INFERENCE ENGINES; LEARNING SYSTEMS; MARKOV PROCESSES; MAXIMUM LIKELIHOOD; MONTE CARLO METHODS; SPEECH; SPEECH PROCESSING; VARIATIONAL TECHNIQUES;

APPROXIMATE BAYESIAN INFERENCE; ASYMPTOTIC APPROXIMATION; BAYESIAN APPROACHES; BAYESIAN INFORMATION CRITERION; GENERALIZATION CAPABILITY; MARKOV CHAIN MONTE-CARLO; NUMERICAL COMPUTATIONS; VARIATIONAL APPROXIMATION;

SPEECH RECOGNITION;

EID: 84887091716 PISSN: None EISSN: 20487703 Source Type: Journal
DOI: 10.1017/ATSIP.2012.6 Document Type: Review

Times cited : (4)

References (93)

1
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- Dempster, A.P.; Laird, N.M.; Rubin, D.B.: Maximum likelihood from incomplete data via the EM algorithm. J. Roy. Stat. Soc. B, 39, (1976) 1-38.
- (1976) J. Roy. Stat. Soc. B , vol.39 , pp. 1-38
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

2
- 0016939124
- Continuous speech recognition by statistical methods
- Jelinek, F.: Continuous speech recognition by statistical methods. Proc. IEEE, 64(4), (1976) 532-556.
- (1976) Proc. IEEE , vol.64 , Issue.4 , pp. 532-556
- Jelinek, F.¹

3
- 0003462715
- Edinburgh University Press
- Huang, X.D.; Ariki, Y.; Jack, M.A.: Hidden Markov Models for Speech Recognition, Edinburgh University Press, 1990.
- (1990) Hidden Markov Models for Speech Recognition
- Huang, X.D.¹ Ariki, Y.² Jack, M.A.³

4
- 70349227947
- The application of hidden markov models in speech recognition
- Gales, M.; Young, S.: The application of hidden Markov models in speech recognition. Signal Process., 1, (3), (2007) 195-304.
- (2007) Signal Process , vol.1 , Issue.3 , pp. 195-304
- Gales, M.¹ Young, S.²

5
- 3042730370
- Recent advances in spontaneous speech recognition and understanding
- Furui, S.: Recent advances in spontaneous speech recognition and understanding. in Proc. SSPR 2003, 2003, 1-6.
- (2003) Proc. SSPR 2003 , pp. 1-6
- Furui, S.¹

6
- 0003757758
- 2nd ed. Springer-Verlag
- Berger, J.O.: Statistical Decision Theory and Bayesian Analysis, 2nd ed., Springer-Verlag, 1985.
- (1985) Statistical Decision Theory and Bayesian Analysis
- Berger, J.O.¹

7
- 84981748011
- John Wiley & Sons Ltd
- Bernardo, J.M.; Smith, A.F.M.: Bayesian Theory, John Wiley & Sons Ltd, 1994.
- (1994) Bayesian Theory
- Bernardo, J.M.¹ Smith, A.F.M.²

8
- 33846516584
- Springer New York
- Bishop, C.M.: Pattern Recognition and Machine Learning, vol. 4, Springer New York, 2006.
- (2006) Pattern Recognition and Machine Learning , vol.4
- Bishop, C.M.¹

9
- 26844565415
- Unsupervised learning
- Springer
- Ghahramani, Z.: Unsupervised learning. Advanced Lectures on Machine Learning, 2004, 72-112, Springer.
- (2004) Advanced Lectures on Machine Learning , pp. 72-112
- Ghahramani, Z.¹

10
- 0026142334
- A study on speaker adaptation of the parameters of continuous density hidden markov models
- Lee, C.-H.; Lin, C.H.; Juang, B-H.: A study on speaker adaptation of the parameters of continuous density hidden Markov models. IEEE Trans.Acoust. Speech Signal Process., 39, (1991) 806-814.
- (1991) IEEE Trans.Acoust. Speech Signal Process. , vol.39 , pp. 806-814
- Lee, C.-H.¹ Lin, C.H.² Juang, B.-H.³

11
- 85009065028
- Improved acoustic modeling with bayesian learning
- Gauvain, J.L.; Lee, C.H.: Improved acoustic modeling with Bayesian learning. in ICASSP'92, 1, (1992) 481-484.
- (1992) ICASSP'92 , vol.1 , pp. 481-484
- Gauvain, J.L.¹ Lee, C.H.²

12
- 0028419019
- Maximum a posteriori estimation for multivariate gaussian mixture observations of markov chains
- Gauvain, J.-L.; Lee, C.-H.: Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains. IEEE Trans. Speech Audio Process., 2, (1994) 291-298.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , pp. 291-298
- Gauvain, J.-L.¹ Lee, C.-H.²

13
- 0033884858
- Speaker verification using adapted gaussian mixture models
- Reynolds, D.A.; Quatieri, T.F.; Dunn, R.B.: Speaker verification using adapted Gaussian mixture models. Digit. Signal Process., 10, (1-3), (2000) 19-41.
- (2000) Digit. Signal Process. , vol.10 , Issue.1-3 , pp. 19-41
- Reynolds, D.A.¹ Quatieri, T.F.² Dunn, R.B.³

14
- 0000120766
- Estimating the dimension of a model
- Schwarz, G.: Estimating the dimension of a model. Ann. Stat., 6, (1978) 461-464.
- (1978) Ann. Stat. , vol.6 , pp. 461-464
- Schwarz, G.¹

15
- 51849177370
- Likelihood and the bayes procedure
- J.M.Bernardo, M. H. DeGroot, D. V. Lindley, andA. F.M. Smith, eds., University Press, Valencia, Spain
- Akaike, H.: Likelihood and the Bayes procedure. in Bayesian Statistics, J.M.Bernardo, M. H. DeGroot, D. V. Lindley, andA. F.M. Smith, eds. 1980, 143-166, University Press, Valencia, Spain.
- (1980) Bayesian Statistics , pp. 143-166
- Akaike, H.¹

16
- 0033906251
- MDL-based context-dependent subword modeling for speech recognition
- Shinoda, K.; Watanabe, T.: MDL-based context-dependent subword modeling for speech recognition. J. Acoust. Soc. Jpn. (E), 21, (2000) 79-86.
- (2000) J. Acoust. Soc. Jpn. (E) , vol.21 , pp. 79-86
- Shinoda, K.¹ Watanabe, T.²

17
- 0032658258
- Decision tree state tying based on penalized bayesian information criterion
- Chou, W.; Reichl, W.: Decision tree state tying based on penalized Bayesian information criterion, in Proc. ICASSP 1999, 1, (1999) 345-348.
- (1999) Proc. ICASSP 1999 , vol.1 , pp. 345-348
- Chou, W.¹ Reichl, W.²

18
- 0009685440
- Model selection in acoustic modeling
- Chen, S.; Gopinath, R.: Model selection in acoustic modeling. in Proc. Eurospeech 1999, 3, (1999) 1087-1090.
- (1999) Proc. Eurospeech 1999 , vol.3 , pp. 1087-1090
- Chen, S.¹ Gopinath, R.²

19
- 0036305005
- Efficient reduction of gaussian components using MDL criterion for HMM-based speech recognition
- Shinoda, K.; Iso, K.: Efficient reduction of Gaussian components using MDL criterion for HMM-based speech recognition. in Proc. ICASSP 2001, 1, (2001) 869-872.
- (2001) Proc. ICASSP 2001 , vol.1 , pp. 869-872
- Shinoda, K.¹ Iso, K.²

20
- 79957689964
- Application of variational bayesian approach to speech recognition
- MIT Press
- Watanabe, S.; Minami, Y.; Nakamura, A.; Ueda, N.: Application of Variational Bayesian Approach to Speech Recognition, NIPS 2002, MIT Press, 2002, 1261-1268.
- (2002) NIPS 2002 , pp. 1261-1268
- Watanabe, S.¹ Minami, Y.² Nakamura, A.³ Ueda, N.⁴

21
- 3042741069
- Variational bayesian estimation and clustering for speech recognition
- Watanabe, S.; Minami, Y.; Nakamura, A.; Ueda, N.: Variational Bayesian estimation and clustering for speech recognition. IEEE Trans. Speech Audio Process., 12, (2004) 365-381.
- (2004) IEEE Trans. Speech Audio Process. , vol.12 , pp. 365-381
- Watanabe, S.¹ Minami, Y.² Nakamura, A.³ Ueda, N.⁴

22
- 56449084167
- An HDP-HMM for systems with state persistence
- Fox, E.B.; Sudderth, E.B.; Jordan, M.I.;Willsky, A.S.: An HDP-HMM for systems with state persistence. in Proc. of ICML, 2008, 312-319.
- (2008) Proc. of ICML , pp. 312-319
- Fox, E.B.¹ Sudderth, E.B.² Jordan, M.I.³ Willsky, A.S.⁴

23
- 84865792512
- Speaker clustering based on utterance-oriented dirichlet process mixture model
- Tawara, N.;Watanabe, S.; Ogawa, T.;Kobayashi, T.: Speaker clustering based on utterance-oriented Dirichlet process mixture model. in Proc. Interspeech'11, 2011, 2905-2908.
- (2011) Proc. Interspeech'11 , pp. 2905-2908
- Tawara, N.¹ Watanabe, S.² Ogawa, T.³ Kobayashi, T.⁴

24
- 0032122203
- On-line adaptive learning of the correlated continuous density hidden markov models for speech recognition
- Huo, Q.; Lee, C.-H.: On-line adaptive learning of the correlated continuous density hiddenMarkov models for speech recognition. IEEE Trans. Speech Audio Process., 6, (1998) 386-397.
- (1998) IEEE Trans. Speech Audio Process. , vol.6 , pp. 386-397
- Huo, Q.¹ Lee, C.-H.²

25
- 85008538758
- Predictor-corrector adaptation by using time evolution system with macroscopic time scale
- Watanabe, S.;Nakamura, A.: Predictor-corrector adaptation by using time evolution system with macroscopic time scale. IEEE Trans. Audio Speech Lang. Process., 18, (2), (2010) 395-406.
- (2010) IEEE Trans. Audio Speech Lang. Process. , vol.18 , Issue.2 , pp. 395-406
- Watanabe, S.¹ Nakamura, A.²

26
- 0035279111
- A structural bayes approach to speaker adaptation
- Shinoda, K.; Lee, C.-H.: A structural Bayes approach to speaker adaptation. IEEE Trans. Speech Audio Process., 9, (2001) 276-287.
- (2001) IEEE Trans. Speech Audio Process. , vol.9 , pp. 276-287
- Shinoda, K.¹ Lee, C.-H.²

27
- 0036461005
- Structural maximuma posteriori linear regression for fas HMM adaptation
- Siohan, O.;Myrvoll, T.A.; Lee, C.H.: Structural maximuma posteriori linear regression for fas HMM adaptation.Comput. Speech Lang., 16, (1), (2002) 5-24.
- (2002) Comput. Speech Lang. , vol.16 , Issue.1 , pp. 5-24
- Siohan, O.¹ Myrvoll, T.A.² Lee, C.H.³

28
- 0017553461
- A quasi-bayes unsupervised learning procedure for priors
- Makov, U.E.; Smith, A.F.M.: A quasi-Bayes unsupervised learning procedure for priors. IEEE Trans. Inf. Theory, 23, (1977) 761-764.
- (1977) IEEE Trans. Inf. Theory , vol.23 , pp. 761-764
- Makov, U.E.¹ Smith, A.F.M.²

29
- 0030105005
- On-line adaptation of the SCHMM parameters based on the segmental quasi-bayes learning for speech recognition
- Huo, Q.; Chan, C.; Lee, C.-H.: On-line adaptation of the SCHMM parameters based on the segmental quasi-Bayes learning for speech recognition. IEEE Trans. Speech Audio Process., 4, (1996) 141-144.
- (1996) IEEE Trans. Speech Audio Process. , vol.4 , pp. 141-144
- Huo, Q.¹ Chan, C.² Lee, C.-H.³

30
- 0036649879
- Quasi-bayes linear regression for sequential learning of hidden markov models
- Chien, J.T.: Quasi-Bayes linear regression for sequential learning of hidden Markov models. IEEE Trans. Speech Audio Process., 10, (2002) 268-278.
- (2002) IEEE Trans. Speech Audio Process. , vol.10 , pp. 268-278
- Chien, J.T.¹

31
- 0031624532
- Speech recognition with dynamic bayesian networks
- Zweig, G.; Russell, S.: Speech recognition with dynamic Bayesian networks, in Proc. Nat. Conf. Artificial Intelligence, 1998, 173-180.
- (1998) Proc. Nat. Conf. Artificial Intelligence , pp. 173-180
- Zweig, G.¹ Russell, S.²

32
- 0036293559
- The graphical models toolkit: An open source software system for speech and time-series processing
- Bilmes, J.; Zweig, G.: The Graphical Models Toolkit: An open source software system for speech and time-series processing, in Proc. ICASSP'02, 2002, vol. 4, 3916-3919.
- (2002) Proc. ICASSP'02 , vol.4 , pp. 3916-3919
- Bilmes, J.¹ Zweig, G.²

33
- 85032751986
- Single channel multi-talker speech recognition: Graphical modeling approaches
- Rennie, S.; Hershey, J.R.; Olsen, P.A.: Single channel multi-talker speech recognition: Graphical modeling approaches. IEEE Signal Process.Mag. Spec. Issue Graph. Models, 27, (6), (2010) 66-80.
- (2010) IEEE Signal Process.Mag. Spec. Issue Graph. Models , vol.27 , Issue.6 , pp. 66-80
- Rennie, S.¹ Hershey, J.R.² Olsen, P.A.³

34
- 80051625262
- Bayesian sensing hidden markov models for speech recognition
- IEEE
- Saon, G.; Chien, J.T.: Bayesian sensing hidden Markov models for speech recognition. in Proc. ICASSP'11. IEEE, 2011, 5056-5059.
- (2011) Proc. ICASSP'11 , pp. 5056-5059
- Saon, G.¹ Chien, J.T.²

35
- 0000159105
- On adaptive decision rules and decision parameter adaptation for automatic speech recognition
- Lee, C.-H.; Huo, Q.: On adaptive decision rules and decision parameter adaptation for automatic speech recognition. in Proc. IEEE, 88, (2000) 1241-1269.
- (2000) Proc. IEEE , vol.88 , pp. 1241-1269
- Lee, C.-H.¹ Huo, Q.²

36
- 85032752364
- Graphical model architectures for speech recognition
- Bilmes, J.; Bartels, C.: Graphical model architectures for speech recognition. IEEE Signal Process. Mag., 22, (5), (2005) 89-100.
- (2005) IEEE Signal Process. Mag. , vol.22 , Issue.5 , pp. 89-100
- Bilmes, J.¹ Bartels, C.²

37
- 84906883334
- Tutorial: Bayesian learning for speech and language processing T-10
- Watanabe, S.; Chien, J.T.: Tutorial: Bayesian learning for speech and language processing T-10, ICASSP'12, 2012.
- (2012) ICASSP'12
- Watanabe, S.¹ Chien, J.T.²

38
- 0023776398
- The DARPA 1000- word resource management database for continuous speech recognition
- Price, P.; Fisher, W.M.; Bernstein, J.; Pallett, D.S.: The DARPA 1000- word resource management database for continuous speech recognition, in Proc. ICASSP'88, 1988, 651-654.
- (1988) Proc. ICASSP'88 , pp. 651-654
- Price, P.¹ Fisher, W.M.² Bernstein, J.³ Pallett, D.S.⁴

39
- 4544265717
- Ph.D. thesis, Cambridge University
- Povey, D.: Discriminative training for large vocabulary speech recognition, Ph.D. thesis, Cambridge University, 2003.
- (2003) Discriminative Training for Large Vocabulary Speech Recognition
- Povey, D.¹

40
- 0002144369
- Tree-based state tying for high accuracy acoustic modelling
- Young, S.J.; Odell, JJ; Woodland, PC: Tree-based state tying for high accuracy acoustic modelling, in Proc. Workshop on Human Language Technology, 1994, 307-312.
- (1994) Proc. Workshop on Human Language Technology , pp. 307-312
- Young, S.J.¹ Odell, J.J.² Woodland, P.C.³

41
- 64549109650
- Knowledge-based adaptive decision tree state tying for conversational speech recognition
- Hu, R.; Zhao, Y.: Knowledge-based adaptive decision tree state tying for conversational speech recognition. IEEE Trans. Audio Speech Lang. Process., 15, (7), (2007) 2160-2168.
- (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , Issue.7 , pp. 2160-2168
- Hu, R.¹ Zhao, Y.²

42
- 85008530405
- Speaker diarization: A review of recent research
- Anguera Miro, X.; Bozonnet, S.; Evans, N.; Fredouille, C.; Friedland, G.; Vinyals, O.: Speaker diarization: A review of recent research. IEEE Trans. Audio Speech Lang. Process., 20, (2), (2012) 356-370.
- (2012) IEEE Trans. Audio Speech Lang. Process. , vol.20 , Issue.2 , pp. 356-370
- Anguera Miro, X.¹ Bozonnet, S.² Evans, N.³ Fredouille, C.⁴ Friedland, G.⁵ Vinyals, O.⁶

43
- 0032685060
- Robust speech recognition based on a bayesian prediction approach
- Jiang, H.; Hirose, K.; Huo, Q.: Robust speech recognition based on a Bayesian prediction approach. IEEE Trans. Speech Audio Process., 7, (1999) 426-440.
- (1999) IEEE Trans. Speech Audio Process. , vol.7 , pp. 426-440
- Jiang, H.¹ Hirose, K.² Huo, Q.³

44
- 0033900150
- A bayesian predictive classification approach to robust speech recognition
- Huo, Q.; Lee, C.-H.: A Bayesian predictive classification approach to robust speech recognition. IEEE Trans. Speech Audio Process., 8, (2000) 200-204.
- (2000) IEEE Trans. Speech Audio Process. , vol.8 , pp. 200-204
- Huo, Q.¹ Lee, C.-H.²

45
- 0033225865
- An introduction to variational methods for graphical models
- Jordan, M.I.; Ghahramani, Z.; Jaakkola, T.S.; Saul, L.K.: An introduction to variational methods for graphical models. Mach. Learn., 37, (1997) 183-233.
- (1997) Mach. Learn. , vol.37 , pp. 183-233
- Jordan, M.I.¹ Ghahramani, Z.² Jaakkola, T.S.³ Saul, L.K.⁴

46
- 85156191859
- Bayesian methods for mixtures of experts
- MIT Press
- Waterhouse, S.; MacKay, D.; Robinson, T.: Bayesian Methods for Mixtures of Experts, NIPS 7, MIT Press, 1995, 351-357.
- (1995) NIPS , vol.7 , pp. 351-357
- Waterhouse, S.¹ MacKay, D.² Robinson, T.³

47
- 0003278032
- Inferring parameters and structure of latent variable models by variational bayes
- Attias, H.: Inferring parameters and structure of latent variable models by variational Bayes, in Proc. Uncertainty in Artificial Intelligence (UAI) 15, 1999, 21-30.
- (1999) Proc. Uncertainty in Artificial Intelligence (UAI) , vol.15 , pp. 21-30
- Attias, H.¹

48
- 0036887504
- Bayesian model search for mixture models based on optimizing variational bounds
- Ueda, N.; Ghahramani, Z.: Bayesian model search for mixture models based on optimizing variational bounds. Neural Netw., 15, (2002) 1223-1241.
- (2002) Neural Netw. , vol.15 , pp. 1223-1241
- Ueda, N.¹ Ghahramani, Z.²

49
- 84906884224
- Ph.D. thesis, Waseda University
- Watanabe, S.: Speech recognition based on a Bayesian approach, Ph.D. thesis, Waseda University, 2006.
- (2006) Speech Recognition Based on a Bayesian Approach
- Watanabe, S.¹

50
- 0036294874
- Application of variational bayesian PCA for speech feature extraction
- Kwon, O.; Lee, T.-W.; Chan, K.: Application of variational Bayesian PCA for speech feature extraction, in Proc. ICASSP 2002, 2002, vol. 1, 825-828.
- (2002) Proc. ICASSP 2002 , vol.1 , pp. 825-828
- Kwon, O.¹ Lee, T.-W.² Chan, K.³

51
- 33646768578
- Variational bayesian feature selection for gaussian mixture models
- Valente, F.; Wellekens, C.: Variational Bayesian feature selection for Gaussian mixture models, in Proc. ICASSP 2004, 2004, vol. 1, 513-516.
- (2004) Proc. ICASSP 2004 , vol.1 , pp. 513-516
- Valente, F.¹ Wellekens, C.²

52
- 84866851908
- Ph.D. thesis, Norwegian University of Science and Technology
- Pettersen, S.G.S.: Robust Speech Recognition in the Presence of Additive Noise, Ph.D. thesis, Norwegian University of Science and Technology, 2008.
- (2008) Robust Speech Recognition in the Presence of Additive Noise
- Pettersen, S.G.S.¹

53
- 78649271854
- Online unsupervised classification withmodel comparison in the variational bayes framework for voice activity detection
- Cournapeau, D.; Watanabe, S.; Nakamura, A.; Kawahara, T.: Online unsupervised classification withmodel comparison in the variational Bayes framework for voice activity detection. IEEE J. Sel. Top. Signal Process., 4, (6), (2010) 1071-1083.
- (2010) IEEE J. Sel. Top. Signal Process. , vol.4 , Issue.6 , pp. 1071-1083
- Cournapeau, D.¹ Watanabe, S.² Nakamura, A.³ Kawahara, T.⁴

54
- 70349205593
- An evidence framework for bayesian learning of continuous-density hidden markov models
- Zhang, Y.; Liu, P.; Chien, J.T.; Soong, F.: An evidence framework for Bayesian learning of continuous-density hidden Markov models, in Proc. ICASSP 2009, 2009, 3857-3860.
- (2009) Proc. ICASSP 2009 , pp. 3857-3860
- Zhang, Y.¹ Liu, P.² Chien, J.T.³ Soong, F.⁴

55
- 70349226870
- Bayesian large margin hiddenmarkov models for speech recognition
- Chen, J.C.; Chien, J.T.: Bayesian large margin hiddenMarkov models for speech recognition, in Proc. ICASSP 2009, 2009, pp. 3765-3768.
- (2009) Proc. ICASSP 2009 , pp. 3765-3768
- Chen, J.C.¹ Chien, J.T.²

56
- 4544286714
- Bayesian acoustic modeling for spontaneous speech recognition
- Watanabe, S.; Minami, Y.; Nakamura, A.; Ueda, N.: Bayesian acoustic modeling for spontaneous speech recognition, in Proc. SSPR 2003, 2003, 47-50.
- (2003) Proc. SSPR 2003 , pp. 47-50
- Watanabe, S.¹ Minami, Y.² Nakamura, A.³ Ueda, N.⁴

57
- 85009174866
- Variational bayesian GMM for speech recognition
- Valente, F.; Wellekens, C.: Variational Bayesian GMM for speech recognition, in Proc. Eurospeech 2003, 2003, 441-444.
- (2003) Proc. Eurospeech 2003 , pp. 441-444
- Valente, F.¹ Wellekens, C.²

58
- 51449089545
- Weighted distance measures for efficient reduction of gaussian mixture components in HMM-based acoustic model
- Ogawa, A.; Takahashi, S.: Weighted distance measures for efficient reduction of Gaussian mixture components in HMM-based acoustic model, in Proc. ICASSP'08, 2008, 4173-4176.
- (2008) Proc. ICASSP'08 , pp. 4173-4176
- Ogawa, A.¹ Takahashi, S.²

59
- 85009135071
- Acoustic model adaptation based on coarse-fine training of transfer vectors and its application to speaker adaptation task
- Watanabe, S.; Nakamura, A.: Acoustic model adaptation based on coarse-fine training of transfer vectors and its application to speaker adaptation task, in Proc. ICSLP 2004, 2004, vol. 4, pp. 2933-2936.
- (2004) Proc. ICSLP 2004 , vol.4 , pp. 2933-2936
- Watanabe, S.¹ Nakamura, A.²

60
- 33846221143
- Bayesian adaptation and adaptively trained systems
- Yu, K.; Gales, M.J.F.; Bayesian adaptation and adaptively trained systems, in Proc. Automatic Speech Recognition and Understanding Workshop (ASRU) 2005, 2005, pp. 209-214.
- (2005) Proc. Automatic Speech Recognition and Understanding Workshop (ASRU) 2005 , pp. 209-214
- Yu, K.¹ Gales, M.J.F.²

61
- 82455212515
- Bayesian linear regression for hidden markov model based on optimizing variational bounds
- Watanabe, S.; Nakamura, A.; Juang, B.H.: Bayesian linear regression for hidden Markov model based on optimizing variational bounds, ' in Proc. MLSP 2011, 2011, 1-6.
- (2011) Proc. MLSP 2011 , pp. 1-6
- Watanabe, S.¹ Nakamura, A.² Juang, B.H.³

62
- 84878411087
- Speaker adaptation using variational bayesian linear regression in normalized feature space
- Hahm, S.J.; Ogawa, A.; Fujimoto, M.;Hori, T.;Nakamura, A.: Speaker adaptation using variational Bayesian linear regression in normalized feature space, in Proc. of Interspeech'12, 2012.
- (2012) Proc. of Interspeech'12
- Hahm, S.J.¹ Ogawa, A.² Fujimoto, M.³ Hori, T.⁴ Nakamura, A.⁵

63
- 0141852571
- Constructing shared-state hidden Markov models based on a Bayesian approach
- Watanabe, S.; Minami, Y.; Nakamura, A.; Ueda, N.: Constructing shared-state hidden Markov models based on a Bayesian approach, in Proc. ICSLP 2002, 2002, vol. 4, 2669-2672.
- (2002) Proc. ICSLP 2002 , vol.4 , pp. 2669-2672
- Watanabe, S.¹ Minami, Y.² Nakamura, A.³ Ueda, N.⁴

64
- 4544253566
- Automatic generation of non-uniform HMM structures based on variational Bayesian approach
- Jitsuhiro, T.; Nakamura, S.: Automatic generation of non-uniform HMM structures based on variational Bayesian approach, in Proc. ICASSP 2004, 2004, vol. 1, 805-808.
- (2004) Proc. ICASSP 2004 , vol.1 , pp. 805-808
- Jitsuhiro, T.¹ Nakamura, S.²

65
- 33646418145
- Automatic determination of acoustic model topology using variational bayesian estimation and clustering for large vocabulary continuous speech recognition
- Watanabe, S.; Sako, A.; Nakamura, A.: Automatic determination of acoustic model topology using variational Bayesian estimation and clustering for large vocabulary continuous speech recognition. IEEE Trans. Audio Speech Lang. Process. 14, (2006) 855-872.
- (2006) IEEE Trans. Audio Speech Lang. Process. , vol.14 , pp. 855-872
- Watanabe, S.¹ Sako, A.² Nakamura, A.³

66
- 84867213785
- Bayesian context clustering using cross valid prior distribution for HMM based speech recognition
- Hashimoto, K.; Zen, H.; Nankaku, Y.; Lee, A.; Tokuda, K.: Bayesian context clustering using cross valid prior distribution for HMM based speech recognition, in Proc. Interspeech'08, 2008, 936-939.
- (2008) Proc. Interspeech'08 , pp. 936-939
- Hashimoto, K.¹ Zen, H.² Nankaku, Y.³ Lee, A.⁴ Tokuda, K.⁵

67
- 70450194713
- Deterministic annealing based training algorithm for bayesian speech recognition
- Shiota, S.; Hashimoto, K.; Nankaku, Y.; Tokuda, K.: Deterministic annealing based training algorithm for Bayesian speech recognition, in Proc. Interspeech' 09, 2009, 680-683.
- (2009) Proc. Interspeech' 09 , pp. 680-683
- Shiota, S.¹ Hashimoto, K.² Nankaku, Y.³ Tokuda, K.⁴

68
- 44949158124
- Infinite models for speaker clustering
- Valente, F.: Infinite models for speaker clustering, in Proc. Interspeech' 06, 2006, 1329-1332.
- (2006) Proc. Interspeech' 06 , pp. 1329-1332
- Valente, F.¹

69
- 78049394635
- Variational nonparametric Bayesian hidden Markov model
- Ding, N.;Ou, Z.:Variational nonparametric Bayesian hidden Markov model, in Proc. ICASSP'10, 2010, 2098-2101.
- (2010) Proc. ICASSP'10 , pp. 2098-2101
- Ding, N.¹ Ou, Z.²

70
- 85008550452
- Probabilistic speaker diarization with bag-of-words representations of speaker angle information
- Ishiguro, K.; Yamada, T.; Araki, S.; Nakatani, T.; Sawada, H.: Probabilistic speaker diarization with bag-of-words representations of speaker angle information. IEEE Trans. Audio Speech Lang. Process., 20, (2), (2012) 447-460.
- (2012) IEEE Trans. Audio Speech Lang. Process. , vol.20 , Issue.2 , pp. 447-460
- Ishiguro, K.¹ Yamada, T.² Araki, S.³ Nakatani, T.⁴ Sawada, H.⁵

71
- 84867626020
- Fully bayesian inference of multi-mixture gaussian model and its evaluation using speaker clustering
- Tawara, N.; Ogawa, T.; Watanabe, S.; Kobayashi, T.: Fully Bayesian inference of multi-mixture Gaussian model and its evaluation using speaker clustering, in Proc. ICASSP'12, 2012, 5253-5256.
- (2012) Proc. ICASSP'12 , pp. 5253-5256
- Tawara, N.¹ Ogawa, T.² Watanabe, S.³ Kobayashi, T.⁴

72
- 70349223889
- A bayesian approach to HMM-based speech synthesis
- Hashimoto, K.; Zen, H.; Nankaku, Y.; Masuko, T.; Tokuda, K.: A Bayesian approach to HMM-based speech synthesis, in Proc, ICASSP 2009, 2009, 4029-4032.
- (2009) Proc, ICASSP 2009 , pp. 4029-4032
- Hashimoto, K.¹ Zen, H.² Nankaku, Y.³ Masuko, T.⁴ Tokuda, K.⁵

73
- 34547516258
- Approximating the kullback leibler divergence between gaussian mixturemodels
- Hershey, J.R.; Olsen, P.A.: Approximating the Kullback Leibler divergence between Gaussian mixturemodels, in Proc. ICASSP 2007, 2007, pp. 317-320.
- (2007) Proc. ICASSP 2007 , pp. 317-320
- Hershey, J.R.¹ Olsen, P.A.²

74
- 79959828521
- A regularized discriminative training method of acoustic models derived by minimum relative entropy discrimination
- Kubo, Y.; Watanabe, S.; Nakamura, A.; Kobayashi, T.: A regularized discriminative training method of acoustic models derived by minimum relative entropy discrimination, in Proc. Interspeech 2010, 2010, 2954-2957.
- (2010) Proc. Interspeech 2010 , pp. 2954-2957
- Kubo, Y.¹ Watanabe, S.² Nakamura, A.³ Kobayashi, T.⁴

75
- 84860525845
- A fully bayesian approach toun super vised part-of-speech tagging
- Goldwater, S.; Griffiths, T.: A fully Bayesian approach toun super vised part-of-speech tagging, in Proc. ACL'07, 2007, 744-751.
- (2007) Proc. ACL'07 , pp. 744-751
- Goldwater, S.¹ Griffiths, T.²

76
- 84859895217
- Bayesian unsupervised word segmentation with nested pitman-yor language modeling
- Mochihashi, D.; Yamada, T.; Ueda, N.: Bayesian unsupervised word segmentation with nested Pitman-Yor language modeling, in Proc. ACL-IJCNLP, 2009, 100-108.
- (2009) Proc. ACL-IJCNLP , pp. 100-108
- Mochihashi, D.¹ Yamada, T.² Ueda, N.³

77
- 80051606569
- Gibbs sampling basedmulti-scale mixturemodel for speaker clustering
- Watanabe, S.;Mochihashi, D.;Hori, T.;Nakamura, A.:Gibbs sampling basedmulti-scale mixturemodel for speaker clustering, in ICASSP'11, 2011, 4524-4527.
- (2011) ICASSP'11 , pp. 4524-4527
- Watanabe, S.¹ Mochihashi, D.² Hori, T.³ Nakamura, A.⁴

78
- 0021518209
- Stochastic relaxation, gibbs distributions, and the bayesian restoration of images
- Geman, S.;Geman, D.: Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. IEEE Trans. Pattern Anal. Mach. Intell., 6, (6), (1984) 721-741.
- (1984) IEEE Trans. Pattern Anal. Mach. Intell. , vol.6 , Issue.6 , pp. 721-741
- Geman, S.¹ Geman, D.²

79
- 85008590333
- Lowlatency real-time meeting recognition and understanding using distant microphones and omni-directional camera
- Hori, T.; Araki, S.; Yoshioka, T.; Fujimoto, M.; Watanabe, S.; Oba, T.; Ogawa, A.; Otsuka, K.; Mikami, D.; Kinoshita, K.; et al., : Lowlatency real-time meeting recognition and understanding using distant microphones and omni-directional camera. IEEE Trans. Audio Speech Lang. Process., 20, (2), (2012) 499.
- (2012) IEEE Trans. Audio Speech Lang. Process. , vol.20 , Issue.2 , pp. 499
- Hori, T.¹ Araki, S.² Yoshioka, T.³ Fujimoto, M.⁴ Watanabe, S.⁵ Oba, T.⁶ Ogawa, A.⁷ Otsuka, K.⁸ Mikami, D.⁹ Kinoshita, K.¹⁰

80
- 47749152568
- The rich transcription 2007 meeting recognition evaluation
- Fiscus, J.; Ajot, J.; Garofolo, J.: The rich transcription 2007 meeting recognition evaluation. Multimodal Technol. Percept. Humans, 2009 373-389. http://www.springerlink.com/content/94w143777u0165v5/.
- (2009) Multimodal Technol. Percept. Humans , pp. 373-389
- Fiscus, J.¹ Ajot, J.² Garofolo, J.³

81
- 85032751458
- Deep neural networks for acoustic modeling in speech recognition
- Hinton, G.;Deng, L.; Yu, D.;Dahl, G.;Mohamed, A.; Jaitly, N.; Senior, A.; Van houcke, V.; Nguyen, P.; Sainath, T.; Kingsbury, B.: Deep neural networks for acoustic modeling in speech recognition. IEEE Signal Process. Mag., 29, (6), (2012), 82-97.
- (2012) IEEE Signal Process. Mag. , vol.29 , Issue.6 , pp. 82-97
- Hinton, G.¹ Deng, L.² Yu, D.³ Dahl, G.⁴ Mohamed, A.⁵ Jaitly, N.⁶ Senior, A.⁷ Van Houcke, V.⁸ Nguyen, P.⁹ Sainath, T.¹⁰ Kingsbury, B.¹¹

82
- 0001120413
- A Bayesian analysis of some nonparametric problems
- Ferguson, T.S.: A Bayesian analysis of some nonparametric problems. Ann. Stat., 1, (2) (1973) 209-230.
- (1973) Ann. Stat. , vol.1 , Issue.2 , pp. 209-230
- Ferguson, T.S.¹

83
- 33645039209
- Infinite latent feature models and the Indian buffet process
- Griffiths, T.; Ghahramani, Z.: Infinite latent feature models and the Indian buffet process. Tech. Rep., Gatsby Unit, 2005.
- (2005) Tech. Rep., Gatsby Unit
- Griffiths, T.¹ Ghahramani, Z.²

84
- 33749249312
- Hierarchical dirichlet processes
- Teh, Y.W.; Jordan, M.I.; Beal, M.J.; Blei, D.M.: Hierarchical Dirichlet processes. J. Am. Stat. Assoc., 101, (476), (2006) 1566-1581.
- (2006) J. Am. Stat. Assoc. , vol.101 , Issue.476 , pp. 1566-1581
- Teh, Y.W.¹ Jordan, M.I.² Beal, M.J.³ Blei, D.M.⁴

85
- 76849117578
- The nested chinese restaurant process and bayesian nonparametric inference of topic hierarchies
- Blei, D.M.; Griffiths, T.L.; Jordan, M.I.: The nested Chinese restaurant process and Bayesian nonparametric inference of topic hierarchies. J. ACM, 57, (2), (2010) 7.
- (2010) J. ACM , vol.57 , Issue.2 , pp. 7
- Blei, D.M.¹ Griffiths, T.L.² Jordan, M.I.³

86
- 84867809023
- A nonparametric bayesian approach to acoustic model discovery
- Lee, C. Y.; Glass, J.: A nonparametric Bayesian approach to acoustic model discovery, in Proc. ACL'12, 2012.
- (2012) Proc. ACL'12
- Lee, C.Y.¹ Glass, J.²

87
- 79959859627
- Learning a language model from continuous speech
- Neubig, G.; Mimura, M.; Mori, S.; Kawahara, T.: Learning a language model from continuous speech, in Proc. Interspeech'10, 2010, 1053-1056.
- (2010) Proc. Interspeech'10 , pp. 1053-1056
- Neubig, G.¹ Mimura, M.² Mori, S.³ Kawahara, T.⁴

88
- 84872741459
- Finding latent sources in recorded music with a shift-invariant HDP
- Hoffman, M.; Blei, D.; Cook, P.R.: Finding latent sources in recorded music with a shift-invariant HDP, in Proceedings of the International Conference on Digital Audio Effects (DAFx), 2009, 438-444.
- (2009) Proceedings of the International Conference on Digital Audio Effects (DAFx) , pp. 438-444
- Hoffman, M.¹ Blei, D.² Cook, P.R.³

89
- 84873586598
- Infinite latent harmonic allocation: A nonparametric Bayesian approach to multipitch analysis
- Yoshii, K.; Goto, M.: Infinite latent harmonic allocation: A nonparametric Bayesian approach to multipitch analysis, in Proc. 11th Int. Conf. Music Information Retrieval (ISMIR), 2010, 309-314.
- (2010) Proc. 11th Int. Conf. Music Information Retrieval (ISMIR) , pp. 309-314
- Yoshii, K.¹ Goto, M.²

90
- 83455246038
- Bayesian nonparametric spectrogram modeling based oninfinite factorial infinitehidden Markov model
- Nakano, M.; Le Roux, J.; Kameoka, H.; Nakamura, T.; Ono, N.; Sagayama, S.: Bayesian nonparametric spectrogram modeling based oninfinite factorial infinitehidden Markov model, in 2011 IEEE Workshop on, Applications of Signal Processing to Audio and Acoustics (WASPAA), 2011, 325-328.
- (2011) 2011 IEEE Workshop On, Applications of Signal Processing to Audio and Acoustics (WASPAA) , pp. 325-328
- Nakano, M.¹ Le Roux, J.² Kameoka, H.³ Nakamura, T.⁴ Ono, N.⁵ Sagayama, S.⁶

91
- 34547522070
- Discriminative training for large-vocabulary speech recognition using minimum classification error
- McDermott, E.; Hazen, T.J.; Le Roux, J.; Nakamura, A.; Katagiri, S.: "Discriminative training for large-vocabulary speech recognition using minimum classification error", IEEE Trans. Audio Speech Lang. Process., 15, (1), (2007) 203-223.
- (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , Issue.1 , pp. 203-223
- McDermott, E.¹ Hazen, T.J.² Le Roux, J.³ Nakamura, A.⁴ Katagiri, S.⁵

92
- 85032751545
- Structured discriminativemodels for speech recognition
- Gales, M.; Watanabe, S.; Fossler-Lussier, E.: Structured discriminativemodels for speech recognition. IEEE Signal Process.Mag., 29, (6), (2012), 70-81.
- (2012) IEEE Signal Process.Mag. , vol.29 , Issue.6 , pp. 70-81
- Gales, M.¹ Watanabe, S.² Fossler-Lussier, E.³

93
- 4344625659
- Springer
- Jebara, T.: Machine Learning: Discriminative and Generative, Springer, 2004.
- (2004) Machine Learning: Discriminative and Generative
- Jebara, T.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.