메뉴 건너뛰기




Volumn 1, Issue , 2012, Pages

Bayesian approaches to acoustic modeling: A review

Author keywords

Approximate bayesian inference; Bayesian approach; Machine learning; Speech processing

Indexed keywords

ARTIFICIAL INTELLIGENCE; BAYESIAN NETWORKS; INFERENCE ENGINES; LEARNING SYSTEMS; MARKOV PROCESSES; MAXIMUM LIKELIHOOD; MONTE CARLO METHODS; SPEECH; SPEECH PROCESSING; VARIATIONAL TECHNIQUES;

EID: 84887091716     PISSN: None     EISSN: 20487703     Source Type: Journal    
DOI: 10.1017/ATSIP.2012.6     Document Type: Review
Times cited : (4)

References (93)
  • 1
    • 0002629270 scopus 로고
    • Maximum likelihood from incomplete data via the EM algorithm
    • Dempster, A.P.; Laird, N.M.; Rubin, D.B.: Maximum likelihood from incomplete data via the EM algorithm. J. Roy. Stat. Soc. B, 39, (1976) 1-38.
    • (1976) J. Roy. Stat. Soc. B , vol.39 , pp. 1-38
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 2
    • 0016939124 scopus 로고
    • Continuous speech recognition by statistical methods
    • Jelinek, F.: Continuous speech recognition by statistical methods. Proc. IEEE, 64(4), (1976) 532-556.
    • (1976) Proc. IEEE , vol.64 , Issue.4 , pp. 532-556
    • Jelinek, F.1
  • 4
    • 70349227947 scopus 로고    scopus 로고
    • The application of hidden markov models in speech recognition
    • Gales, M.; Young, S.: The application of hidden Markov models in speech recognition. Signal Process., 1, (3), (2007) 195-304.
    • (2007) Signal Process , vol.1 , Issue.3 , pp. 195-304
    • Gales, M.1    Young, S.2
  • 5
    • 3042730370 scopus 로고    scopus 로고
    • Recent advances in spontaneous speech recognition and understanding
    • Furui, S.: Recent advances in spontaneous speech recognition and understanding. in Proc. SSPR 2003, 2003, 1-6.
    • (2003) Proc. SSPR 2003 , pp. 1-6
    • Furui, S.1
  • 10
    • 0026142334 scopus 로고
    • A study on speaker adaptation of the parameters of continuous density hidden markov models
    • Lee, C.-H.; Lin, C.H.; Juang, B-H.: A study on speaker adaptation of the parameters of continuous density hidden Markov models. IEEE Trans.Acoust. Speech Signal Process., 39, (1991) 806-814.
    • (1991) IEEE Trans.Acoust. Speech Signal Process. , vol.39 , pp. 806-814
    • Lee, C.-H.1    Lin, C.H.2    Juang, B.-H.3
  • 11
    • 85009065028 scopus 로고
    • Improved acoustic modeling with bayesian learning
    • Gauvain, J.L.; Lee, C.H.: Improved acoustic modeling with Bayesian learning. in ICASSP'92, 1, (1992) 481-484.
    • (1992) ICASSP'92 , vol.1 , pp. 481-484
    • Gauvain, J.L.1    Lee, C.H.2
  • 12
    • 0028419019 scopus 로고
    • Maximum a posteriori estimation for multivariate gaussian mixture observations of markov chains
    • Gauvain, J.-L.; Lee, C.-H.: Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains. IEEE Trans. Speech Audio Process., 2, (1994) 291-298.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , pp. 291-298
    • Gauvain, J.-L.1    Lee, C.-H.2
  • 13
    • 0033884858 scopus 로고    scopus 로고
    • Speaker verification using adapted gaussian mixture models
    • Reynolds, D.A.; Quatieri, T.F.; Dunn, R.B.: Speaker verification using adapted Gaussian mixture models. Digit. Signal Process., 10, (1-3), (2000) 19-41.
    • (2000) Digit. Signal Process. , vol.10 , Issue.1-3 , pp. 19-41
    • Reynolds, D.A.1    Quatieri, T.F.2    Dunn, R.B.3
  • 14
    • 0000120766 scopus 로고
    • Estimating the dimension of a model
    • Schwarz, G.: Estimating the dimension of a model. Ann. Stat., 6, (1978) 461-464.
    • (1978) Ann. Stat. , vol.6 , pp. 461-464
    • Schwarz, G.1
  • 15
    • 51849177370 scopus 로고
    • Likelihood and the bayes procedure
    • J.M.Bernardo, M. H. DeGroot, D. V. Lindley, andA. F.M. Smith, eds., University Press, Valencia, Spain
    • Akaike, H.: Likelihood and the Bayes procedure. in Bayesian Statistics, J.M.Bernardo, M. H. DeGroot, D. V. Lindley, andA. F.M. Smith, eds. 1980, 143-166, University Press, Valencia, Spain.
    • (1980) Bayesian Statistics , pp. 143-166
    • Akaike, H.1
  • 16
    • 0033906251 scopus 로고    scopus 로고
    • MDL-based context-dependent subword modeling for speech recognition
    • Shinoda, K.; Watanabe, T.: MDL-based context-dependent subword modeling for speech recognition. J. Acoust. Soc. Jpn. (E), 21, (2000) 79-86.
    • (2000) J. Acoust. Soc. Jpn. (E) , vol.21 , pp. 79-86
    • Shinoda, K.1    Watanabe, T.2
  • 17
    • 0032658258 scopus 로고    scopus 로고
    • Decision tree state tying based on penalized bayesian information criterion
    • Chou, W.; Reichl, W.: Decision tree state tying based on penalized Bayesian information criterion, in Proc. ICASSP 1999, 1, (1999) 345-348.
    • (1999) Proc. ICASSP 1999 , vol.1 , pp. 345-348
    • Chou, W.1    Reichl, W.2
  • 18
    • 0009685440 scopus 로고    scopus 로고
    • Model selection in acoustic modeling
    • Chen, S.; Gopinath, R.: Model selection in acoustic modeling. in Proc. Eurospeech 1999, 3, (1999) 1087-1090.
    • (1999) Proc. Eurospeech 1999 , vol.3 , pp. 1087-1090
    • Chen, S.1    Gopinath, R.2
  • 19
    • 0036305005 scopus 로고    scopus 로고
    • Efficient reduction of gaussian components using MDL criterion for HMM-based speech recognition
    • Shinoda, K.; Iso, K.: Efficient reduction of Gaussian components using MDL criterion for HMM-based speech recognition. in Proc. ICASSP 2001, 1, (2001) 869-872.
    • (2001) Proc. ICASSP 2001 , vol.1 , pp. 869-872
    • Shinoda, K.1    Iso, K.2
  • 20
    • 79957689964 scopus 로고    scopus 로고
    • Application of variational bayesian approach to speech recognition
    • MIT Press
    • Watanabe, S.; Minami, Y.; Nakamura, A.; Ueda, N.: Application of Variational Bayesian Approach to Speech Recognition, NIPS 2002, MIT Press, 2002, 1261-1268.
    • (2002) NIPS 2002 , pp. 1261-1268
    • Watanabe, S.1    Minami, Y.2    Nakamura, A.3    Ueda, N.4
  • 23
    • 84865792512 scopus 로고    scopus 로고
    • Speaker clustering based on utterance-oriented dirichlet process mixture model
    • Tawara, N.;Watanabe, S.; Ogawa, T.;Kobayashi, T.: Speaker clustering based on utterance-oriented Dirichlet process mixture model. in Proc. Interspeech'11, 2011, 2905-2908.
    • (2011) Proc. Interspeech'11 , pp. 2905-2908
    • Tawara, N.1    Watanabe, S.2    Ogawa, T.3    Kobayashi, T.4
  • 24
    • 0032122203 scopus 로고    scopus 로고
    • On-line adaptive learning of the correlated continuous density hidden markov models for speech recognition
    • Huo, Q.; Lee, C.-H.: On-line adaptive learning of the correlated continuous density hiddenMarkov models for speech recognition. IEEE Trans. Speech Audio Process., 6, (1998) 386-397.
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , pp. 386-397
    • Huo, Q.1    Lee, C.-H.2
  • 25
    • 85008538758 scopus 로고    scopus 로고
    • Predictor-corrector adaptation by using time evolution system with macroscopic time scale
    • Watanabe, S.;Nakamura, A.: Predictor-corrector adaptation by using time evolution system with macroscopic time scale. IEEE Trans. Audio Speech Lang. Process., 18, (2), (2010) 395-406.
    • (2010) IEEE Trans. Audio Speech Lang. Process. , vol.18 , Issue.2 , pp. 395-406
    • Watanabe, S.1    Nakamura, A.2
  • 26
    • 0035279111 scopus 로고    scopus 로고
    • A structural bayes approach to speaker adaptation
    • Shinoda, K.; Lee, C.-H.: A structural Bayes approach to speaker adaptation. IEEE Trans. Speech Audio Process., 9, (2001) 276-287.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , pp. 276-287
    • Shinoda, K.1    Lee, C.-H.2
  • 27
    • 0036461005 scopus 로고    scopus 로고
    • Structural maximuma posteriori linear regression for fas HMM adaptation
    • Siohan, O.;Myrvoll, T.A.; Lee, C.H.: Structural maximuma posteriori linear regression for fas HMM adaptation.Comput. Speech Lang., 16, (1), (2002) 5-24.
    • (2002) Comput. Speech Lang. , vol.16 , Issue.1 , pp. 5-24
    • Siohan, O.1    Myrvoll, T.A.2    Lee, C.H.3
  • 28
    • 0017553461 scopus 로고
    • A quasi-bayes unsupervised learning procedure for priors
    • Makov, U.E.; Smith, A.F.M.: A quasi-Bayes unsupervised learning procedure for priors. IEEE Trans. Inf. Theory, 23, (1977) 761-764.
    • (1977) IEEE Trans. Inf. Theory , vol.23 , pp. 761-764
    • Makov, U.E.1    Smith, A.F.M.2
  • 29
    • 0030105005 scopus 로고    scopus 로고
    • On-line adaptation of the SCHMM parameters based on the segmental quasi-bayes learning for speech recognition
    • Huo, Q.; Chan, C.; Lee, C.-H.: On-line adaptation of the SCHMM parameters based on the segmental quasi-Bayes learning for speech recognition. IEEE Trans. Speech Audio Process., 4, (1996) 141-144.
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , pp. 141-144
    • Huo, Q.1    Chan, C.2    Lee, C.-H.3
  • 30
    • 0036649879 scopus 로고    scopus 로고
    • Quasi-bayes linear regression for sequential learning of hidden markov models
    • Chien, J.T.: Quasi-Bayes linear regression for sequential learning of hidden Markov models. IEEE Trans. Speech Audio Process., 10, (2002) 268-278.
    • (2002) IEEE Trans. Speech Audio Process. , vol.10 , pp. 268-278
    • Chien, J.T.1
  • 32
    • 0036293559 scopus 로고    scopus 로고
    • The graphical models toolkit: An open source software system for speech and time-series processing
    • Bilmes, J.; Zweig, G.: The Graphical Models Toolkit: An open source software system for speech and time-series processing, in Proc. ICASSP'02, 2002, vol. 4, 3916-3919.
    • (2002) Proc. ICASSP'02 , vol.4 , pp. 3916-3919
    • Bilmes, J.1    Zweig, G.2
  • 34
    • 80051625262 scopus 로고    scopus 로고
    • Bayesian sensing hidden markov models for speech recognition
    • IEEE
    • Saon, G.; Chien, J.T.: Bayesian sensing hidden Markov models for speech recognition. in Proc. ICASSP'11. IEEE, 2011, 5056-5059.
    • (2011) Proc. ICASSP'11 , pp. 5056-5059
    • Saon, G.1    Chien, J.T.2
  • 35
    • 0000159105 scopus 로고    scopus 로고
    • On adaptive decision rules and decision parameter adaptation for automatic speech recognition
    • Lee, C.-H.; Huo, Q.: On adaptive decision rules and decision parameter adaptation for automatic speech recognition. in Proc. IEEE, 88, (2000) 1241-1269.
    • (2000) Proc. IEEE , vol.88 , pp. 1241-1269
    • Lee, C.-H.1    Huo, Q.2
  • 36
    • 85032752364 scopus 로고    scopus 로고
    • Graphical model architectures for speech recognition
    • Bilmes, J.; Bartels, C.: Graphical model architectures for speech recognition. IEEE Signal Process. Mag., 22, (5), (2005) 89-100.
    • (2005) IEEE Signal Process. Mag. , vol.22 , Issue.5 , pp. 89-100
    • Bilmes, J.1    Bartels, C.2
  • 37
    • 84906883334 scopus 로고    scopus 로고
    • Tutorial: Bayesian learning for speech and language processing T-10
    • Watanabe, S.; Chien, J.T.: Tutorial: Bayesian learning for speech and language processing T-10, ICASSP'12, 2012.
    • (2012) ICASSP'12
    • Watanabe, S.1    Chien, J.T.2
  • 38
    • 0023776398 scopus 로고
    • The DARPA 1000- word resource management database for continuous speech recognition
    • Price, P.; Fisher, W.M.; Bernstein, J.; Pallett, D.S.: The DARPA 1000- word resource management database for continuous speech recognition, in Proc. ICASSP'88, 1988, 651-654.
    • (1988) Proc. ICASSP'88 , pp. 651-654
    • Price, P.1    Fisher, W.M.2    Bernstein, J.3    Pallett, D.S.4
  • 41
    • 64549109650 scopus 로고    scopus 로고
    • Knowledge-based adaptive decision tree state tying for conversational speech recognition
    • Hu, R.; Zhao, Y.: Knowledge-based adaptive decision tree state tying for conversational speech recognition. IEEE Trans. Audio Speech Lang. Process., 15, (7), (2007) 2160-2168.
    • (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , Issue.7 , pp. 2160-2168
    • Hu, R.1    Zhao, Y.2
  • 43
    • 0032685060 scopus 로고    scopus 로고
    • Robust speech recognition based on a bayesian prediction approach
    • Jiang, H.; Hirose, K.; Huo, Q.: Robust speech recognition based on a Bayesian prediction approach. IEEE Trans. Speech Audio Process., 7, (1999) 426-440.
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , pp. 426-440
    • Jiang, H.1    Hirose, K.2    Huo, Q.3
  • 44
    • 0033900150 scopus 로고    scopus 로고
    • A bayesian predictive classification approach to robust speech recognition
    • Huo, Q.; Lee, C.-H.: A Bayesian predictive classification approach to robust speech recognition. IEEE Trans. Speech Audio Process., 8, (2000) 200-204.
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , pp. 200-204
    • Huo, Q.1    Lee, C.-H.2
  • 45
    • 0033225865 scopus 로고    scopus 로고
    • An introduction to variational methods for graphical models
    • Jordan, M.I.; Ghahramani, Z.; Jaakkola, T.S.; Saul, L.K.: An introduction to variational methods for graphical models. Mach. Learn., 37, (1997) 183-233.
    • (1997) Mach. Learn. , vol.37 , pp. 183-233
    • Jordan, M.I.1    Ghahramani, Z.2    Jaakkola, T.S.3    Saul, L.K.4
  • 46
    • 85156191859 scopus 로고
    • Bayesian methods for mixtures of experts
    • MIT Press
    • Waterhouse, S.; MacKay, D.; Robinson, T.: Bayesian Methods for Mixtures of Experts, NIPS 7, MIT Press, 1995, 351-357.
    • (1995) NIPS , vol.7 , pp. 351-357
    • Waterhouse, S.1    MacKay, D.2    Robinson, T.3
  • 47
    • 0003278032 scopus 로고    scopus 로고
    • Inferring parameters and structure of latent variable models by variational bayes
    • Attias, H.: Inferring parameters and structure of latent variable models by variational Bayes, in Proc. Uncertainty in Artificial Intelligence (UAI) 15, 1999, 21-30.
    • (1999) Proc. Uncertainty in Artificial Intelligence (UAI) , vol.15 , pp. 21-30
    • Attias, H.1
  • 48
    • 0036887504 scopus 로고    scopus 로고
    • Bayesian model search for mixture models based on optimizing variational bounds
    • Ueda, N.; Ghahramani, Z.: Bayesian model search for mixture models based on optimizing variational bounds. Neural Netw., 15, (2002) 1223-1241.
    • (2002) Neural Netw. , vol.15 , pp. 1223-1241
    • Ueda, N.1    Ghahramani, Z.2
  • 50
    • 0036294874 scopus 로고    scopus 로고
    • Application of variational bayesian PCA for speech feature extraction
    • Kwon, O.; Lee, T.-W.; Chan, K.: Application of variational Bayesian PCA for speech feature extraction, in Proc. ICASSP 2002, 2002, vol. 1, 825-828.
    • (2002) Proc. ICASSP 2002 , vol.1 , pp. 825-828
    • Kwon, O.1    Lee, T.-W.2    Chan, K.3
  • 51
    • 33646768578 scopus 로고    scopus 로고
    • Variational bayesian feature selection for gaussian mixture models
    • Valente, F.; Wellekens, C.: Variational Bayesian feature selection for Gaussian mixture models, in Proc. ICASSP 2004, 2004, vol. 1, 513-516.
    • (2004) Proc. ICASSP 2004 , vol.1 , pp. 513-516
    • Valente, F.1    Wellekens, C.2
  • 53
    • 78649271854 scopus 로고    scopus 로고
    • Online unsupervised classification withmodel comparison in the variational bayes framework for voice activity detection
    • Cournapeau, D.; Watanabe, S.; Nakamura, A.; Kawahara, T.: Online unsupervised classification withmodel comparison in the variational Bayes framework for voice activity detection. IEEE J. Sel. Top. Signal Process., 4, (6), (2010) 1071-1083.
    • (2010) IEEE J. Sel. Top. Signal Process. , vol.4 , Issue.6 , pp. 1071-1083
    • Cournapeau, D.1    Watanabe, S.2    Nakamura, A.3    Kawahara, T.4
  • 54
    • 70349205593 scopus 로고    scopus 로고
    • An evidence framework for bayesian learning of continuous-density hidden markov models
    • Zhang, Y.; Liu, P.; Chien, J.T.; Soong, F.: An evidence framework for Bayesian learning of continuous-density hidden Markov models, in Proc. ICASSP 2009, 2009, 3857-3860.
    • (2009) Proc. ICASSP 2009 , pp. 3857-3860
    • Zhang, Y.1    Liu, P.2    Chien, J.T.3    Soong, F.4
  • 55
    • 70349226870 scopus 로고    scopus 로고
    • Bayesian large margin hiddenmarkov models for speech recognition
    • Chen, J.C.; Chien, J.T.: Bayesian large margin hiddenMarkov models for speech recognition, in Proc. ICASSP 2009, 2009, pp. 3765-3768.
    • (2009) Proc. ICASSP 2009 , pp. 3765-3768
    • Chen, J.C.1    Chien, J.T.2
  • 56
    • 4544286714 scopus 로고    scopus 로고
    • Bayesian acoustic modeling for spontaneous speech recognition
    • Watanabe, S.; Minami, Y.; Nakamura, A.; Ueda, N.: Bayesian acoustic modeling for spontaneous speech recognition, in Proc. SSPR 2003, 2003, 47-50.
    • (2003) Proc. SSPR 2003 , pp. 47-50
    • Watanabe, S.1    Minami, Y.2    Nakamura, A.3    Ueda, N.4
  • 57
    • 85009174866 scopus 로고    scopus 로고
    • Variational bayesian GMM for speech recognition
    • Valente, F.; Wellekens, C.: Variational Bayesian GMM for speech recognition, in Proc. Eurospeech 2003, 2003, 441-444.
    • (2003) Proc. Eurospeech 2003 , pp. 441-444
    • Valente, F.1    Wellekens, C.2
  • 58
    • 51449089545 scopus 로고    scopus 로고
    • Weighted distance measures for efficient reduction of gaussian mixture components in HMM-based acoustic model
    • Ogawa, A.; Takahashi, S.: Weighted distance measures for efficient reduction of Gaussian mixture components in HMM-based acoustic model, in Proc. ICASSP'08, 2008, 4173-4176.
    • (2008) Proc. ICASSP'08 , pp. 4173-4176
    • Ogawa, A.1    Takahashi, S.2
  • 59
    • 85009135071 scopus 로고    scopus 로고
    • Acoustic model adaptation based on coarse-fine training of transfer vectors and its application to speaker adaptation task
    • Watanabe, S.; Nakamura, A.: Acoustic model adaptation based on coarse-fine training of transfer vectors and its application to speaker adaptation task, in Proc. ICSLP 2004, 2004, vol. 4, pp. 2933-2936.
    • (2004) Proc. ICSLP 2004 , vol.4 , pp. 2933-2936
    • Watanabe, S.1    Nakamura, A.2
  • 61
    • 82455212515 scopus 로고    scopus 로고
    • Bayesian linear regression for hidden markov model based on optimizing variational bounds
    • Watanabe, S.; Nakamura, A.; Juang, B.H.: Bayesian linear regression for hidden Markov model based on optimizing variational bounds, ' in Proc. MLSP 2011, 2011, 1-6.
    • (2011) Proc. MLSP 2011 , pp. 1-6
    • Watanabe, S.1    Nakamura, A.2    Juang, B.H.3
  • 62
    • 84878411087 scopus 로고    scopus 로고
    • Speaker adaptation using variational bayesian linear regression in normalized feature space
    • Hahm, S.J.; Ogawa, A.; Fujimoto, M.;Hori, T.;Nakamura, A.: Speaker adaptation using variational Bayesian linear regression in normalized feature space, in Proc. of Interspeech'12, 2012.
    • (2012) Proc. of Interspeech'12
    • Hahm, S.J.1    Ogawa, A.2    Fujimoto, M.3    Hori, T.4    Nakamura, A.5
  • 63
    • 0141852571 scopus 로고    scopus 로고
    • Constructing shared-state hidden Markov models based on a Bayesian approach
    • Watanabe, S.; Minami, Y.; Nakamura, A.; Ueda, N.: Constructing shared-state hidden Markov models based on a Bayesian approach, in Proc. ICSLP 2002, 2002, vol. 4, 2669-2672.
    • (2002) Proc. ICSLP 2002 , vol.4 , pp. 2669-2672
    • Watanabe, S.1    Minami, Y.2    Nakamura, A.3    Ueda, N.4
  • 64
    • 4544253566 scopus 로고    scopus 로고
    • Automatic generation of non-uniform HMM structures based on variational Bayesian approach
    • Jitsuhiro, T.; Nakamura, S.: Automatic generation of non-uniform HMM structures based on variational Bayesian approach, in Proc. ICASSP 2004, 2004, vol. 1, 805-808.
    • (2004) Proc. ICASSP 2004 , vol.1 , pp. 805-808
    • Jitsuhiro, T.1    Nakamura, S.2
  • 65
    • 33646418145 scopus 로고    scopus 로고
    • Automatic determination of acoustic model topology using variational bayesian estimation and clustering for large vocabulary continuous speech recognition
    • Watanabe, S.; Sako, A.; Nakamura, A.: Automatic determination of acoustic model topology using variational Bayesian estimation and clustering for large vocabulary continuous speech recognition. IEEE Trans. Audio Speech Lang. Process. 14, (2006) 855-872.
    • (2006) IEEE Trans. Audio Speech Lang. Process. , vol.14 , pp. 855-872
    • Watanabe, S.1    Sako, A.2    Nakamura, A.3
  • 66
    • 84867213785 scopus 로고    scopus 로고
    • Bayesian context clustering using cross valid prior distribution for HMM based speech recognition
    • Hashimoto, K.; Zen, H.; Nankaku, Y.; Lee, A.; Tokuda, K.: Bayesian context clustering using cross valid prior distribution for HMM based speech recognition, in Proc. Interspeech'08, 2008, 936-939.
    • (2008) Proc. Interspeech'08 , pp. 936-939
    • Hashimoto, K.1    Zen, H.2    Nankaku, Y.3    Lee, A.4    Tokuda, K.5
  • 67
    • 70450194713 scopus 로고    scopus 로고
    • Deterministic annealing based training algorithm for bayesian speech recognition
    • Shiota, S.; Hashimoto, K.; Nankaku, Y.; Tokuda, K.: Deterministic annealing based training algorithm for Bayesian speech recognition, in Proc. Interspeech' 09, 2009, 680-683.
    • (2009) Proc. Interspeech' 09 , pp. 680-683
    • Shiota, S.1    Hashimoto, K.2    Nankaku, Y.3    Tokuda, K.4
  • 68
    • 44949158124 scopus 로고    scopus 로고
    • Infinite models for speaker clustering
    • Valente, F.: Infinite models for speaker clustering, in Proc. Interspeech' 06, 2006, 1329-1332.
    • (2006) Proc. Interspeech' 06 , pp. 1329-1332
    • Valente, F.1
  • 69
    • 78049394635 scopus 로고    scopus 로고
    • Variational nonparametric Bayesian hidden Markov model
    • Ding, N.;Ou, Z.:Variational nonparametric Bayesian hidden Markov model, in Proc. ICASSP'10, 2010, 2098-2101.
    • (2010) Proc. ICASSP'10 , pp. 2098-2101
    • Ding, N.1    Ou, Z.2
  • 70
    • 85008550452 scopus 로고    scopus 로고
    • Probabilistic speaker diarization with bag-of-words representations of speaker angle information
    • Ishiguro, K.; Yamada, T.; Araki, S.; Nakatani, T.; Sawada, H.: Probabilistic speaker diarization with bag-of-words representations of speaker angle information. IEEE Trans. Audio Speech Lang. Process., 20, (2), (2012) 447-460.
    • (2012) IEEE Trans. Audio Speech Lang. Process. , vol.20 , Issue.2 , pp. 447-460
    • Ishiguro, K.1    Yamada, T.2    Araki, S.3    Nakatani, T.4    Sawada, H.5
  • 71
    • 84867626020 scopus 로고    scopus 로고
    • Fully bayesian inference of multi-mixture gaussian model and its evaluation using speaker clustering
    • Tawara, N.; Ogawa, T.; Watanabe, S.; Kobayashi, T.: Fully Bayesian inference of multi-mixture Gaussian model and its evaluation using speaker clustering, in Proc. ICASSP'12, 2012, 5253-5256.
    • (2012) Proc. ICASSP'12 , pp. 5253-5256
    • Tawara, N.1    Ogawa, T.2    Watanabe, S.3    Kobayashi, T.4
  • 73
    • 34547516258 scopus 로고    scopus 로고
    • Approximating the kullback leibler divergence between gaussian mixturemodels
    • Hershey, J.R.; Olsen, P.A.: Approximating the Kullback Leibler divergence between Gaussian mixturemodels, in Proc. ICASSP 2007, 2007, pp. 317-320.
    • (2007) Proc. ICASSP 2007 , pp. 317-320
    • Hershey, J.R.1    Olsen, P.A.2
  • 74
    • 79959828521 scopus 로고    scopus 로고
    • A regularized discriminative training method of acoustic models derived by minimum relative entropy discrimination
    • Kubo, Y.; Watanabe, S.; Nakamura, A.; Kobayashi, T.: A regularized discriminative training method of acoustic models derived by minimum relative entropy discrimination, in Proc. Interspeech 2010, 2010, 2954-2957.
    • (2010) Proc. Interspeech 2010 , pp. 2954-2957
    • Kubo, Y.1    Watanabe, S.2    Nakamura, A.3    Kobayashi, T.4
  • 75
    • 84860525845 scopus 로고    scopus 로고
    • A fully bayesian approach toun super vised part-of-speech tagging
    • Goldwater, S.; Griffiths, T.: A fully Bayesian approach toun super vised part-of-speech tagging, in Proc. ACL'07, 2007, 744-751.
    • (2007) Proc. ACL'07 , pp. 744-751
    • Goldwater, S.1    Griffiths, T.2
  • 76
    • 84859895217 scopus 로고    scopus 로고
    • Bayesian unsupervised word segmentation with nested pitman-yor language modeling
    • Mochihashi, D.; Yamada, T.; Ueda, N.: Bayesian unsupervised word segmentation with nested Pitman-Yor language modeling, in Proc. ACL-IJCNLP, 2009, 100-108.
    • (2009) Proc. ACL-IJCNLP , pp. 100-108
    • Mochihashi, D.1    Yamada, T.2    Ueda, N.3
  • 77
    • 80051606569 scopus 로고    scopus 로고
    • Gibbs sampling basedmulti-scale mixturemodel for speaker clustering
    • Watanabe, S.;Mochihashi, D.;Hori, T.;Nakamura, A.:Gibbs sampling basedmulti-scale mixturemodel for speaker clustering, in ICASSP'11, 2011, 4524-4527.
    • (2011) ICASSP'11 , pp. 4524-4527
    • Watanabe, S.1    Mochihashi, D.2    Hori, T.3    Nakamura, A.4
  • 78
    • 0021518209 scopus 로고
    • Stochastic relaxation, gibbs distributions, and the bayesian restoration of images
    • Geman, S.;Geman, D.: Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. IEEE Trans. Pattern Anal. Mach. Intell., 6, (6), (1984) 721-741.
    • (1984) IEEE Trans. Pattern Anal. Mach. Intell. , vol.6 , Issue.6 , pp. 721-741
    • Geman, S.1    Geman, D.2
  • 80
    • 47749152568 scopus 로고    scopus 로고
    • The rich transcription 2007 meeting recognition evaluation
    • Fiscus, J.; Ajot, J.; Garofolo, J.: The rich transcription 2007 meeting recognition evaluation. Multimodal Technol. Percept. Humans, 2009 373-389. http://www.springerlink.com/content/94w143777u0165v5/.
    • (2009) Multimodal Technol. Percept. Humans , pp. 373-389
    • Fiscus, J.1    Ajot, J.2    Garofolo, J.3
  • 82
    • 0001120413 scopus 로고
    • A Bayesian analysis of some nonparametric problems
    • Ferguson, T.S.: A Bayesian analysis of some nonparametric problems. Ann. Stat., 1, (2) (1973) 209-230.
    • (1973) Ann. Stat. , vol.1 , Issue.2 , pp. 209-230
    • Ferguson, T.S.1
  • 83
    • 33645039209 scopus 로고    scopus 로고
    • Infinite latent feature models and the Indian buffet process
    • Griffiths, T.; Ghahramani, Z.: Infinite latent feature models and the Indian buffet process. Tech. Rep., Gatsby Unit, 2005.
    • (2005) Tech. Rep., Gatsby Unit
    • Griffiths, T.1    Ghahramani, Z.2
  • 85
    • 76849117578 scopus 로고    scopus 로고
    • The nested chinese restaurant process and bayesian nonparametric inference of topic hierarchies
    • Blei, D.M.; Griffiths, T.L.; Jordan, M.I.: The nested Chinese restaurant process and Bayesian nonparametric inference of topic hierarchies. J. ACM, 57, (2), (2010) 7.
    • (2010) J. ACM , vol.57 , Issue.2 , pp. 7
    • Blei, D.M.1    Griffiths, T.L.2    Jordan, M.I.3
  • 86
    • 84867809023 scopus 로고    scopus 로고
    • A nonparametric bayesian approach to acoustic model discovery
    • Lee, C. Y.; Glass, J.: A nonparametric Bayesian approach to acoustic model discovery, in Proc. ACL'12, 2012.
    • (2012) Proc. ACL'12
    • Lee, C.Y.1    Glass, J.2
  • 89
    • 84873586598 scopus 로고    scopus 로고
    • Infinite latent harmonic allocation: A nonparametric Bayesian approach to multipitch analysis
    • Yoshii, K.; Goto, M.: Infinite latent harmonic allocation: A nonparametric Bayesian approach to multipitch analysis, in Proc. 11th Int. Conf. Music Information Retrieval (ISMIR), 2010, 309-314.
    • (2010) Proc. 11th Int. Conf. Music Information Retrieval (ISMIR) , pp. 309-314
    • Yoshii, K.1    Goto, M.2
  • 91
  • 92
    • 85032751545 scopus 로고    scopus 로고
    • Structured discriminativemodels for speech recognition
    • Gales, M.; Watanabe, S.; Fossler-Lussier, E.: Structured discriminativemodels for speech recognition. IEEE Signal Process.Mag., 29, (6), (2012), 70-81.
    • (2012) IEEE Signal Process.Mag. , vol.29 , Issue.6 , pp. 70-81
    • Gales, M.1    Watanabe, S.2    Fossler-Lussier, E.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.