-
1
-
-
0002629270
-
Maximum likelihood from incomplete data via the EM algorithm
-
Dempster, A.P.; Laird, N.M.; Rubin, D.B.: Maximum likelihood from incomplete data via the EM algorithm. J. Roy. Stat. Soc. B, 39, (1977) 1-38.
-
(1977)
J. Roy. Stat. Soc. B
, vol.39
, pp. 1-38
-
-
Dempster, A.P.1
Laird, N.M.2
Rubin, D.B.3
-
2
-
-
0016939124
-
Continuous speech recognition by statistical methods
-
Jelinek, F.: Continuous speech recognition by statistical methods. Proc. IEEE, 64(4), (1976) 532-556.
-
(1976)
Proc. IEEE
, vol.64
, Issue.4
, pp. 532-556
-
-
Jelinek, F.1
-
4
-
-
70349227947
-
The application of hidden Markov models in speech recognition
-
Gales, M.; Young, S.: The application of hidden Markov models in speech recognition. Found. Trends Signal Process., 1, (3), (2007) 195-304.
-
(2007)
Found. Trends Signal Process.
, vol.1
, Issue.3
, pp. 195-304
-
-
Gales, M.1
Young, S.2
-
5
-
-
3042730370
-
Recent advances in spontaneous speech recognition and understanding
-
Furui, S.: Recent advances in spontaneous speech recognition and understanding. in Proc. SSPR 2003, 2003, 1-6.
-
(2003)
Proc. SSPR 2003
, pp. 1-6
-
-
Furui, S.1
-
10
-
-
0026142334
-
A study on speaker adaptation of the parameters of continuous density hidden Markov models
-
Lee, C.-H.; Lin, C.H.; Juang, B.-H.: A study on speaker adaptation of the parameters of continuous density hidden Markov models. IEEE Trans. Acoust. Speech Signal Process., 39, (1991) 806-814.
-
(1991)
IEEE Trans. Acoust. Speech Signal Process.
, vol.39
, pp. 806-814
-
-
Lee, C.-H.1
Lin, C.H.2
Juang, B.-H.3
-
11
-
-
85009065028
-
Improved acoustic modeling with Bayesian learning
-
Gauvain, J.L.; Lee, C.H.: Improved acoustic modeling with Bayesian learning. in ICASSP'92, 1, (1992) 481-484.
-
(1992)
ICASSP'92
, vol.1
, pp. 481-484
-
-
Gauvain, J.L.1
Lee, C.H.2
-
12
-
-
0028419019
-
Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
-
Gauvain, J.-L.; Lee, C.-H.: Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains. IEEE Trans. Speech Audio Process., 2, (1994) 291-298.
-
(1994)
IEEE Trans. Speech Audio Process.
, vol.2
, pp. 291-298
-
-
Gauvain, J.-L.1
Lee, C.-H.2
-
13
-
-
0033884858
-
Speaker verification using adapted Gaussian mixture models
-
Reynolds, D.A.; Quatieri, T.F.; Dunn, R.B.: Speaker verification using adapted Gaussian mixture models. Digit. Signal Process., 10, (1-3), (2000) 19-41.
-
(2000)
Digit. Signal Process.
, vol.10
, Issue.1-3
, pp. 19-41
-
-
Reynolds, D.A.1
Quatieri, T.F.2
Dunn, R.B.3
-
14
-
-
0000120766
-
Estimating the dimension of a model
-
Schwarz, G.: Estimating the dimension of a model. Ann. Stat., 6, (1978) 461-464.
-
(1978)
Ann. Stat.
, vol.6
, pp. 461-464
-
-
Schwarz, G.1
-
15
-
-
51849177370
-
Likelihood and the Bayes procedure
-
J. M. Bernardo, M. H. DeGroot, D. V. Lindley, and A. F. M. Smith, eds., University Press, Valencia, Spain
-
Akaike, H.: Likelihood and the Bayes procedure. in Bayesian Statistics, J. M. Bernardo, M. H. DeGroot, D. V. Lindley, and A. F. M. Smith, eds. 1980, 143-166, University Press, Valencia, Spain.
-
(1980)
Bayesian Statistics
, pp. 143-166
-
-
Akaike, H.1
-
16
-
-
0033906251
-
MDL-based context-dependent subword modeling for speech recognition
-
Shinoda, K.; Watanabe, T.: MDL-based context-dependent subword modeling for speech recognition. J. Acoust. Soc. Jpn. (E), 21, (2000) 79-86.
-
(2000)
J. Acoust. Soc. Jpn. (E)
, vol.21
, pp. 79-86
-
-
Shinoda, K.1
Watanabe, T.2
-
17
-
-
0032658258
-
Decision tree state tying based on penalized Bayesian information criterion
-
Chou, W.; Reichl, W.: Decision tree state tying based on penalized Bayesian information criterion, in Proc. ICASSP 1999, 1, (1999) 345-348.
-
(1999)
Proc. ICASSP 1999
, vol.1
, pp. 345-348
-
-
Chou, W.1
Reichl, W.2
-
18
-
-
0009685440
-
Model selection in acoustic modeling
-
Chen, S.; Gopinath, R.: Model selection in acoustic modeling. in Proc. Eurospeech 1999, 3, (1999) 1087-1090.
-
(1999)
Proc. Eurospeech 1999
, vol.3
, pp. 1087-1090
-
-
Chen, S.1
Gopinath, R.2
-
19
-
-
0036305005
-
Efficient reduction of Gaussian components using MDL criterion for HMM-based speech recognition
-
Shinoda, K.; Iso, K.: Efficient reduction of Gaussian components using MDL criterion for HMM-based speech recognition. in Proc. ICASSP 2001, 1, (2001) 869-872.
-
(2001)
Proc. ICASSP 2001
, vol.1
, pp. 869-872
-
-
Shinoda, K.1
Iso, K.2
-
20
-
-
79957689964
-
Application of variational Bayesian approach to speech recognition
-
MIT Press
-
Watanabe, S.; Minami, Y.; Nakamura, A.; Ueda, N.: Application of Variational Bayesian Approach to Speech Recognition, NIPS 2002, MIT Press, 2002, 1261-1268.
-
(2002)
NIPS 2002
, pp. 1261-1268
-
-
Watanabe, S.1
Minami, Y.2
Nakamura, A.3
Ueda, N.4
-
21
-
-
3042741069
-
Variational Bayesian estimation and clustering for speech recognition
-
Watanabe, S.; Minami, Y.; Nakamura, A.; Ueda, N.: Variational Bayesian estimation and clustering for speech recognition. IEEE Trans. Speech Audio Process., 12, (2004) 365-381.
-
(2004)
IEEE Trans. Speech Audio Process.
, vol.12
, pp. 365-381
-
-
Watanabe, S.1
Minami, Y.2
Nakamura, A.3
Ueda, N.4
-
22
-
-
56449084167
-
An HDP-HMM for systems with state persistence
-
Fox, E.B.; Sudderth, E.B.; Jordan, M.I.; Willsky, A.S.: An HDP-HMM for systems with state persistence. in Proc. of ICML, 2008, 312-319.
-
(2008)
Proc. of ICML
, pp. 312-319
-
-
Fox, E.B.1
Sudderth, E.B.2
Jordan, M.I.3
Willsky, A.S.4
-
23
-
-
84865792512
-
Speaker clustering based on utterance-oriented Dirichlet process mixture model
-
Tawara, N.; Watanabe, S.; Ogawa, T.; Kobayashi, T.: Speaker clustering based on utterance-oriented Dirichlet process mixture model. in Proc. Interspeech'11, 2011, 2905-2908.
-
(2011)
Proc. Interspeech'11
, pp. 2905-2908
-
-
Tawara, N.1
Watanabe, S.2
Ogawa, T.3
Kobayashi, T.4
-
24
-
-
0032122203
-
On-line adaptive learning of the correlated continuous density hidden Markov models for speech recognition
-
Huo, Q.; Lee, C.-H.: On-line adaptive learning of the correlated continuous density hidden Markov models for speech recognition. IEEE Trans. Speech Audio Process., 6, (1998) 386-397.
-
(1998)
IEEE Trans. Speech Audio Process.
, vol.6
, pp. 386-397
-
-
Huo, Q.1
Lee, C.-H.2
-
25
-
-
85008538758
-
Predictor-corrector adaptation by using time evolution system with macroscopic time scale
-
Watanabe, S.; Nakamura, A.: Predictor-corrector adaptation by using time evolution system with macroscopic time scale. IEEE Trans. Audio Speech Lang. Process., 18, (2), (2010) 395-406.
-
(2010)
IEEE Trans. Audio Speech Lang. Process.
, vol.18
, Issue.2
, pp. 395-406
-
-
Watanabe, S.1
Nakamura, A.2
-
27
-
-
0036461005
-
Structural maximum a posteriori linear regression for fast HMM adaptation
-
Siohan, O.; Myrvoll, T.A.; Lee, C.H.: Structural maximum a posteriori linear regression for fast HMM adaptation. Comput. Speech Lang., 16, (1), (2002) 5-24.
-
(2002)
Comput. Speech Lang.
, vol.16
, Issue.1
, pp. 5-24
-
-
Siohan, O.1
Myrvoll, T.A.2
Lee, C.H.3
-
28
-
-
0017553461
-
A quasi-Bayes unsupervised learning procedure for priors
-
Makov, U.E.; Smith, A.F.M.: A quasi-Bayes unsupervised learning procedure for priors. IEEE Trans. Inf. Theory, 23, (1977) 761-764.
-
(1977)
IEEE Trans. Inf. Theory
, vol.23
, pp. 761-764
-
-
Makov, U.E.1
Smith, A.F.M.2
-
29
-
-
0030105005
-
On-line adaptation of the SCHMM parameters based on the segmental quasi-Bayes learning for speech recognition
-
Huo, Q.; Chan, C.; Lee, C.-H.: On-line adaptation of the SCHMM parameters based on the segmental quasi-Bayes learning for speech recognition. IEEE Trans. Speech Audio Process., 4, (1996) 141-144.
-
(1996)
IEEE Trans. Speech Audio Process.
, vol.4
, pp. 141-144
-
-
Huo, Q.1
Chan, C.2
Lee, C.-H.3
-
30
-
-
0036649879
-
Quasi-Bayes linear regression for sequential learning of hidden Markov models
-
Chien, J.T.: Quasi-Bayes linear regression for sequential learning of hidden Markov models. IEEE Trans. Speech Audio Process., 10, (2002) 268-278.
-
(2002)
IEEE Trans. Speech Audio Process.
, vol.10
, pp. 268-278
-
-
Chien, J.T.1
-
32
-
-
0036293559
-
The graphical models toolkit: An open source software system for speech and time-series processing
-
Bilmes, J.; Zweig, G.: The Graphical Models Toolkit: An open source software system for speech and time-series processing, in Proc. ICASSP'02, 2002, vol. 4, 3916-3919.
-
(2002)
Proc. ICASSP'02
, vol.4
, pp. 3916-3919
-
-
Bilmes, J.1
Zweig, G.2
-
33
-
-
85032751986
-
Single channel multi-talker speech recognition: Graphical modeling approaches
-
Rennie, S.; Hershey, J.R.; Olsen, P.A.: Single channel multi-talker speech recognition: Graphical modeling approaches. IEEE Signal Process. Mag. Spec. Issue Graph. Models, 27, (6), (2010) 66-80.
-
(2010)
IEEE Signal Process. Mag. Spec. Issue Graph. Models
, vol.27
, Issue.6
, pp. 66-80
-
-
Rennie, S.1
Hershey, J.R.2
Olsen, P.A.3
-
34
-
-
80051625262
-
Bayesian sensing hidden Markov models for speech recognition
-
IEEE
-
Saon, G.; Chien, J.T.: Bayesian sensing hidden Markov models for speech recognition. in Proc. ICASSP'11. IEEE, 2011, 5056-5059.
-
(2011)
Proc. ICASSP'11
, pp. 5056-5059
-
-
Saon, G.1
Chien, J.T.2
-
35
-
-
0000159105
-
On adaptive decision rules and decision parameter adaptation for automatic speech recognition
-
Lee, C.-H.; Huo, Q.: On adaptive decision rules and decision parameter adaptation for automatic speech recognition. Proc. IEEE, 88, (2000) 1241-1269.
-
(2000)
Proc. IEEE
, vol.88
, pp. 1241-1269
-
-
Lee, C.-H.1
Huo, Q.2
-
36
-
-
85032752364
-
Graphical model architectures for speech recognition
-
Bilmes, J.; Bartels, C.: Graphical model architectures for speech recognition. IEEE Signal Process. Mag., 22, (5), (2005) 89-100.
-
(2005)
IEEE Signal Process. Mag.
, vol.22
, Issue.5
, pp. 89-100
-
-
Bilmes, J.1
Bartels, C.2
-
37
-
-
84906883334
-
Tutorial: Bayesian learning for speech and language processing T-10
-
Watanabe, S.; Chien, J.T.: Tutorial: Bayesian learning for speech and language processing T-10, ICASSP'12, 2012.
-
(2012)
ICASSP'12
-
-
Watanabe, S.1
Chien, J.T.2
-
38
-
-
0023776398
-
The DARPA 1000-word resource management database for continuous speech recognition
-
Price, P.; Fisher, W.M.; Bernstein, J.; Pallett, D.S.: The DARPA 1000-word resource management database for continuous speech recognition, in Proc. ICASSP'88, 1988, 651-654.
-
(1988)
Proc. ICASSP'88
, pp. 651-654
-
-
Price, P.1
Fisher, W.M.2
Bernstein, J.3
Pallett, D.S.4
-
40
-
-
0002144369
-
Tree-based state tying for high accuracy acoustic modelling
-
Young, S.J.; Odell, J.J.; Woodland, P.C.: Tree-based state tying for high accuracy acoustic modelling, in Proc. Workshop on Human Language Technology, 1994, 307-312.
-
(1994)
Proc. Workshop on Human Language Technology
, pp. 307-312
-
-
Young, S.J.1
Odell, J.J.2
Woodland, P.C.3
-
41
-
-
64549109650
-
Knowledge-based adaptive decision tree state tying for conversational speech recognition
-
Hu, R.; Zhao, Y.: Knowledge-based adaptive decision tree state tying for conversational speech recognition. IEEE Trans. Audio Speech Lang. Process., 15, (7), (2007) 2160-2168.
-
(2007)
IEEE Trans. Audio Speech Lang. Process.
, vol.15
, Issue.7
, pp. 2160-2168
-
-
Hu, R.1
Zhao, Y.2
-
42
-
-
85008530405
-
Speaker diarization: A review of recent research
-
Anguera Miro, X.; Bozonnet, S.; Evans, N.; Fredouille, C.; Friedland, G.; Vinyals, O.: Speaker diarization: A review of recent research. IEEE Trans. Audio Speech Lang. Process., 20, (2), (2012) 356-370.
-
(2012)
IEEE Trans. Audio Speech Lang. Process.
, vol.20
, Issue.2
, pp. 356-370
-
-
Anguera Miro, X.1
Bozonnet, S.2
Evans, N.3
Fredouille, C.4
Friedland, G.5
Vinyals, O.6
-
43
-
-
0032685060
-
Robust speech recognition based on a Bayesian prediction approach
-
Jiang, H.; Hirose, K.; Huo, Q.: Robust speech recognition based on a Bayesian prediction approach. IEEE Trans. Speech Audio Process., 7, (1999) 426-440.
-
(1999)
IEEE Trans. Speech Audio Process.
, vol.7
, pp. 426-440
-
-
Jiang, H.1
Hirose, K.2
Huo, Q.3
-
44
-
-
0033900150
-
A Bayesian predictive classification approach to robust speech recognition
-
Huo, Q.; Lee, C.-H.: A Bayesian predictive classification approach to robust speech recognition. IEEE Trans. Speech Audio Process., 8, (2000) 200-204.
-
(2000)
IEEE Trans. Speech Audio Process.
, vol.8
, pp. 200-204
-
-
Huo, Q.1
Lee, C.-H.2
-
45
-
-
0033225865
-
An introduction to variational methods for graphical models
-
Jordan, M.I.; Ghahramani, Z.; Jaakkola, T.S.; Saul, L.K.: An introduction to variational methods for graphical models. Mach. Learn., 37, (1999) 183-233.
-
(1999)
Mach. Learn.
, vol.37
, pp. 183-233
-
-
Jordan, M.I.1
Ghahramani, Z.2
Jaakkola, T.S.3
Saul, L.K.4
-
46
-
-
85156191859
-
Bayesian methods for mixtures of experts
-
MIT Press
-
Waterhouse, S.; MacKay, D.; Robinson, T.: Bayesian Methods for Mixtures of Experts, NIPS 7, MIT Press, 1995, 351-357.
-
(1995)
NIPS
, vol.7
, pp. 351-357
-
-
Waterhouse, S.1
MacKay, D.2
Robinson, T.3
-
47
-
-
0003278032
-
Inferring parameters and structure of latent variable models by variational Bayes
-
Attias, H.: Inferring parameters and structure of latent variable models by variational Bayes, in Proc. Uncertainty in Artificial Intelligence (UAI) 15, 1999, 21-30.
-
(1999)
Proc. Uncertainty in Artificial Intelligence (UAI)
, vol.15
, pp. 21-30
-
-
Attias, H.1
-
48
-
-
0036887504
-
Bayesian model search for mixture models based on optimizing variational bounds
-
Ueda, N.; Ghahramani, Z.: Bayesian model search for mixture models based on optimizing variational bounds. Neural Netw., 15, (2002) 1223-1241.
-
(2002)
Neural Netw.
, vol.15
, pp. 1223-1241
-
-
Ueda, N.1
Ghahramani, Z.2
-
50
-
-
0036294874
-
Application of variational Bayesian PCA for speech feature extraction
-
Kwon, O.; Lee, T.-W.; Chan, K.: Application of variational Bayesian PCA for speech feature extraction, in Proc. ICASSP 2002, 2002, vol. 1, 825-828.
-
(2002)
Proc. ICASSP 2002
, vol.1
, pp. 825-828
-
-
Kwon, O.1
Lee, T.-W.2
Chan, K.3
-
51
-
-
33646768578
-
Variational Bayesian feature selection for Gaussian mixture models
-
Valente, F.; Wellekens, C.: Variational Bayesian feature selection for Gaussian mixture models, in Proc. ICASSP 2004, 2004, vol. 1, 513-516.
-
(2004)
Proc. ICASSP 2004
, vol.1
, pp. 513-516
-
-
Valente, F.1
Wellekens, C.2
-
53
-
-
78649271854
-
Online unsupervised classification with model comparison in the variational Bayes framework for voice activity detection
-
Cournapeau, D.; Watanabe, S.; Nakamura, A.; Kawahara, T.: Online unsupervised classification with model comparison in the variational Bayes framework for voice activity detection. IEEE J. Sel. Top. Signal Process., 4, (6), (2010) 1071-1083.
-
(2010)
IEEE J. Sel. Top. Signal Process.
, vol.4
, Issue.6
, pp. 1071-1083
-
-
Cournapeau, D.1
Watanabe, S.2
Nakamura, A.3
Kawahara, T.4
-
54
-
-
70349205593
-
An evidence framework for Bayesian learning of continuous-density hidden Markov models
-
Zhang, Y.; Liu, P.; Chien, J.T.; Soong, F.: An evidence framework for Bayesian learning of continuous-density hidden Markov models, in Proc. ICASSP 2009, 2009, 3857-3860.
-
(2009)
Proc. ICASSP 2009
, pp. 3857-3860
-
-
Zhang, Y.1
Liu, P.2
Chien, J.T.3
Soong, F.4
-
55
-
-
70349226870
-
Bayesian large margin hidden Markov models for speech recognition
-
Chen, J.C.; Chien, J.T.: Bayesian large margin hidden Markov models for speech recognition, in Proc. ICASSP 2009, 2009, pp. 3765-3768.
-
(2009)
Proc. ICASSP 2009
, pp. 3765-3768
-
-
Chen, J.C.1
Chien, J.T.2
-
56
-
-
4544286714
-
Bayesian acoustic modeling for spontaneous speech recognition
-
Watanabe, S.; Minami, Y.; Nakamura, A.; Ueda, N.: Bayesian acoustic modeling for spontaneous speech recognition, in Proc. SSPR 2003, 2003, 47-50.
-
(2003)
Proc. SSPR 2003
, pp. 47-50
-
-
Watanabe, S.1
Minami, Y.2
Nakamura, A.3
Ueda, N.4
-
57
-
-
85009174866
-
Variational Bayesian GMM for speech recognition
-
Valente, F.; Wellekens, C.: Variational Bayesian GMM for speech recognition, in Proc. Eurospeech 2003, 2003, 441-444.
-
(2003)
Proc. Eurospeech 2003
, pp. 441-444
-
-
Valente, F.1
Wellekens, C.2
-
58
-
-
51449089545
-
Weighted distance measures for efficient reduction of Gaussian mixture components in HMM-based acoustic model
-
Ogawa, A.; Takahashi, S.: Weighted distance measures for efficient reduction of Gaussian mixture components in HMM-based acoustic model, in Proc. ICASSP'08, 2008, 4173-4176.
-
(2008)
Proc. ICASSP'08
, pp. 4173-4176
-
-
Ogawa, A.1
Takahashi, S.2
-
59
-
-
85009135071
-
Acoustic model adaptation based on coarse-fine training of transfer vectors and its application to speaker adaptation task
-
Watanabe, S.; Nakamura, A.: Acoustic model adaptation based on coarse-fine training of transfer vectors and its application to speaker adaptation task, in Proc. ICSLP 2004, 2004, vol. 4, pp. 2933-2936.
-
(2004)
Proc. ICSLP 2004
, vol.4
, pp. 2933-2936
-
-
Watanabe, S.1
Nakamura, A.2
-
61
-
-
82455212515
-
Bayesian linear regression for hidden Markov model based on optimizing variational bounds
-
Watanabe, S.; Nakamura, A.; Juang, B.H.: Bayesian linear regression for hidden Markov model based on optimizing variational bounds, in Proc. MLSP 2011, 2011, 1-6.
-
(2011)
Proc. MLSP 2011
, pp. 1-6
-
-
Watanabe, S.1
Nakamura, A.2
Juang, B.H.3
-
62
-
-
84878411087
-
Speaker adaptation using variational Bayesian linear regression in normalized feature space
-
Hahm, S.J.; Ogawa, A.; Fujimoto, M.; Hori, T.; Nakamura, A.: Speaker adaptation using variational Bayesian linear regression in normalized feature space, in Proc. of Interspeech'12, 2012.
-
(2012)
Proc. of Interspeech'12
-
-
Hahm, S.J.1
Ogawa, A.2
Fujimoto, M.3
Hori, T.4
Nakamura, A.5
-
63
-
-
0141852571
-
Constructing shared-state hidden Markov models based on a Bayesian approach
-
Watanabe, S.; Minami, Y.; Nakamura, A.; Ueda, N.: Constructing shared-state hidden Markov models based on a Bayesian approach, in Proc. ICSLP 2002, 2002, vol. 4, 2669-2672.
-
(2002)
Proc. ICSLP 2002
, vol.4
, pp. 2669-2672
-
-
Watanabe, S.1
Minami, Y.2
Nakamura, A.3
Ueda, N.4
-
64
-
-
4544253566
-
Automatic generation of non-uniform HMM structures based on variational Bayesian approach
-
Jitsuhiro, T.; Nakamura, S.: Automatic generation of non-uniform HMM structures based on variational Bayesian approach, in Proc. ICASSP 2004, 2004, vol. 1, 805-808.
-
(2004)
Proc. ICASSP 2004
, vol.1
, pp. 805-808
-
-
Jitsuhiro, T.1
Nakamura, S.2
-
65
-
-
33646418145
-
Automatic determination of acoustic model topology using variational Bayesian estimation and clustering for large vocabulary continuous speech recognition
-
Watanabe, S.; Sako, A.; Nakamura, A.: Automatic determination of acoustic model topology using variational Bayesian estimation and clustering for large vocabulary continuous speech recognition. IEEE Trans. Audio Speech Lang. Process. 14, (2006) 855-872.
-
(2006)
IEEE Trans. Audio Speech Lang. Process.
, vol.14
, pp. 855-872
-
-
Watanabe, S.1
Sako, A.2
Nakamura, A.3
-
66
-
-
84867213785
-
Bayesian context clustering using cross valid prior distribution for HMM based speech recognition
-
Hashimoto, K.; Zen, H.; Nankaku, Y.; Lee, A.; Tokuda, K.: Bayesian context clustering using cross valid prior distribution for HMM based speech recognition, in Proc. Interspeech'08, 2008, 936-939.
-
(2008)
Proc. Interspeech'08
, pp. 936-939
-
-
Hashimoto, K.1
Zen, H.2
Nankaku, Y.3
Lee, A.4
Tokuda, K.5
-
67
-
-
70450194713
-
Deterministic annealing based training algorithm for Bayesian speech recognition
-
Shiota, S.; Hashimoto, K.; Nankaku, Y.; Tokuda, K.: Deterministic annealing based training algorithm for Bayesian speech recognition, in Proc. Interspeech'09, 2009, 680-683.
-
(2009)
Proc. Interspeech'09
, pp. 680-683
-
-
Shiota, S.1
Hashimoto, K.2
Nankaku, Y.3
Tokuda, K.4
-
68
-
-
44949158124
-
Infinite models for speaker clustering
-
Valente, F.: Infinite models for speaker clustering, in Proc. Interspeech'06, 2006, 1329-1332.
-
(2006)
Proc. Interspeech'06
, pp. 1329-1332
-
-
Valente, F.1
-
69
-
-
78049394635
-
Variational nonparametric Bayesian hidden Markov model
-
Ding, N.; Ou, Z.: Variational nonparametric Bayesian hidden Markov model, in Proc. ICASSP'10, 2010, 2098-2101.
-
(2010)
Proc. ICASSP'10
, pp. 2098-2101
-
-
Ding, N.1
Ou, Z.2
-
70
-
-
85008550452
-
Probabilistic speaker diarization with bag-of-words representations of speaker angle information
-
Ishiguro, K.; Yamada, T.; Araki, S.; Nakatani, T.; Sawada, H.: Probabilistic speaker diarization with bag-of-words representations of speaker angle information. IEEE Trans. Audio Speech Lang. Process., 20, (2), (2012) 447-460.
-
(2012)
IEEE Trans. Audio Speech Lang. Process.
, vol.20
, Issue.2
, pp. 447-460
-
-
Ishiguro, K.1
Yamada, T.2
Araki, S.3
Nakatani, T.4
Sawada, H.5
-
71
-
-
84867626020
-
Fully Bayesian inference of multi-mixture Gaussian model and its evaluation using speaker clustering
-
Tawara, N.; Ogawa, T.; Watanabe, S.; Kobayashi, T.: Fully Bayesian inference of multi-mixture Gaussian model and its evaluation using speaker clustering, in Proc. ICASSP'12, 2012, 5253-5256.
-
(2012)
Proc. ICASSP'12
, pp. 5253-5256
-
-
Tawara, N.1
Ogawa, T.2
Watanabe, S.3
Kobayashi, T.4
-
72
-
-
70349223889
-
A Bayesian approach to HMM-based speech synthesis
-
Hashimoto, K.; Zen, H.; Nankaku, Y.; Masuko, T.; Tokuda, K.: A Bayesian approach to HMM-based speech synthesis, in Proc. ICASSP 2009, 2009, 4029-4032.
-
(2009)
Proc. ICASSP 2009
, pp. 4029-4032
-
-
Hashimoto, K.1
Zen, H.2
Nankaku, Y.3
Masuko, T.4
Tokuda, K.5
-
73
-
-
34547516258
-
Approximating the Kullback Leibler divergence between Gaussian mixture models
-
Hershey, J.R.; Olsen, P.A.: Approximating the Kullback Leibler divergence between Gaussian mixture models, in Proc. ICASSP 2007, 2007, pp. 317-320.
-
(2007)
Proc. ICASSP 2007
, pp. 317-320
-
-
Hershey, J.R.1
Olsen, P.A.2
-
74
-
-
79959828521
-
A regularized discriminative training method of acoustic models derived by minimum relative entropy discrimination
-
Kubo, Y.; Watanabe, S.; Nakamura, A.; Kobayashi, T.: A regularized discriminative training method of acoustic models derived by minimum relative entropy discrimination, in Proc. Interspeech 2010, 2010, 2954-2957.
-
(2010)
Proc. Interspeech 2010
, pp. 2954-2957
-
-
Kubo, Y.1
Watanabe, S.2
Nakamura, A.3
Kobayashi, T.4
-
75
-
-
84860525845
-
A fully Bayesian approach to unsupervised part-of-speech tagging
-
Goldwater, S.; Griffiths, T.: A fully Bayesian approach to unsupervised part-of-speech tagging, in Proc. ACL'07, 2007, 744-751.
-
(2007)
Proc. ACL'07
, pp. 744-751
-
-
Goldwater, S.1
Griffiths, T.2
-
76
-
-
84859895217
-
Bayesian unsupervised word segmentation with nested Pitman-Yor language modeling
-
Mochihashi, D.; Yamada, T.; Ueda, N.: Bayesian unsupervised word segmentation with nested Pitman-Yor language modeling, in Proc. ACL-IJCNLP, 2009, 100-108.
-
(2009)
Proc. ACL-IJCNLP
, pp. 100-108
-
-
Mochihashi, D.1
Yamada, T.2
Ueda, N.3
-
77
-
-
80051606569
-
Gibbs sampling based multi-scale mixture model for speaker clustering
-
Watanabe, S.; Mochihashi, D.; Hori, T.; Nakamura, A.: Gibbs sampling based multi-scale mixture model for speaker clustering, in ICASSP'11, 2011, 4524-4527.
-
(2011)
ICASSP'11
, pp. 4524-4527
-
-
Watanabe, S.1
Mochihashi, D.2
Hori, T.3
Nakamura, A.4
-
78
-
-
0021518209
-
Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images
-
Geman, S.; Geman, D.: Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. IEEE Trans. Pattern Anal. Mach. Intell., 6, (6), (1984) 721-741.
-
(1984)
IEEE Trans. Pattern Anal. Mach. Intell.
, vol.6
, Issue.6
, pp. 721-741
-
-
Geman, S.1
Geman, D.2
-
79
-
-
85008590333
-
Low-latency real-time meeting recognition and understanding using distant microphones and omni-directional camera
-
Hori, T.; Araki, S.; Yoshioka, T.; Fujimoto, M.; Watanabe, S.; Oba, T.; Ogawa, A.; Otsuka, K.; Mikami, D.; Kinoshita, K.; et al.: Low-latency real-time meeting recognition and understanding using distant microphones and omni-directional camera. IEEE Trans. Audio Speech Lang. Process., 20, (2), (2012) 499.
-
(2012)
IEEE Trans. Audio Speech Lang. Process.
, vol.20
, Issue.2
, pp. 499
-
-
Hori, T.1
Araki, S.2
Yoshioka, T.3
Fujimoto, M.4
Watanabe, S.5
Oba, T.6
Ogawa, A.7
Otsuka, K.8
Mikami, D.9
Kinoshita, K.10
-
80
-
-
47749152568
-
The rich transcription 2007 meeting recognition evaluation
-
Fiscus, J.; Ajot, J.; Garofolo, J.: The rich transcription 2007 meeting recognition evaluation. Multimodal Technol. Percept. Humans, 2009 373-389. http://www.springerlink.com/content/94w143777u0165v5/.
-
(2009)
Multimodal Technol. Percept. Humans
, pp. 373-389
-
-
Fiscus, J.1
Ajot, J.2
Garofolo, J.3
-
81
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition
-
Hinton, G.; Deng, L.; Yu, D.; Dahl, G.; Mohamed, A.; Jaitly, N.; Senior, A.; Vanhoucke, V.; Nguyen, P.; Sainath, T.; Kingsbury, B.: Deep neural networks for acoustic modeling in speech recognition. IEEE Signal Process. Mag., 29, (6), (2012), 82-97.
-
(2012)
IEEE Signal Process. Mag.
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.4
Mohamed, A.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.10
Kingsbury, B.11
-
82
-
-
0001120413
-
A Bayesian analysis of some nonparametric problems
-
Ferguson, T.S.: A Bayesian analysis of some nonparametric problems. Ann. Stat., 1, (2) (1973) 209-230.
-
(1973)
Ann. Stat.
, vol.1
, Issue.2
, pp. 209-230
-
-
Ferguson, T.S.1
-
84
-
-
33749249312
-
Hierarchical Dirichlet processes
-
Teh, Y.W.; Jordan, M.I.; Beal, M.J.; Blei, D.M.: Hierarchical Dirichlet processes. J. Am. Stat. Assoc., 101, (476), (2006) 1566-1581.
-
(2006)
J. Am. Stat. Assoc.
, vol.101
, Issue.476
, pp. 1566-1581
-
-
Teh, Y.W.1
Jordan, M.I.2
Beal, M.J.3
Blei, D.M.4
-
85
-
-
76849117578
-
The nested Chinese restaurant process and Bayesian nonparametric inference of topic hierarchies
-
Blei, D.M.; Griffiths, T.L.; Jordan, M.I.: The nested Chinese restaurant process and Bayesian nonparametric inference of topic hierarchies. J. ACM, 57, (2), (2010) 7.
-
(2010)
J. ACM
, vol.57
, Issue.2
, pp. 7
-
-
Blei, D.M.1
Griffiths, T.L.2
Jordan, M.I.3
-
86
-
-
84867809023
-
A nonparametric Bayesian approach to acoustic model discovery
-
Lee, C.Y.; Glass, J.: A nonparametric Bayesian approach to acoustic model discovery, in Proc. ACL'12, 2012.
-
(2012)
Proc. ACL'12
-
-
Lee, C.Y.1
Glass, J.2
-
87
-
-
79959859627
-
Learning a language model from continuous speech
-
Neubig, G.; Mimura, M.; Mori, S.; Kawahara, T.: Learning a language model from continuous speech, in Proc. Interspeech'10, 2010, 1053-1056.
-
(2010)
Proc. Interspeech'10
, pp. 1053-1056
-
-
Neubig, G.1
Mimura, M.2
Mori, S.3
Kawahara, T.4
-
88
-
-
84872741459
-
Finding latent sources in recorded music with a shift-invariant HDP
-
Hoffman, M.; Blei, D.; Cook, P.R.: Finding latent sources in recorded music with a shift-invariant HDP, in Proceedings of the International Conference on Digital Audio Effects (DAFx), 2009, 438-444.
-
(2009)
Proceedings of the International Conference on Digital Audio Effects (DAFx)
, pp. 438-444
-
-
Hoffman, M.1
Blei, D.2
Cook, P.R.3
-
89
-
-
84873586598
-
Infinite latent harmonic allocation: A nonparametric Bayesian approach to multipitch analysis
-
Yoshii, K.; Goto, M.: Infinite latent harmonic allocation: A nonparametric Bayesian approach to multipitch analysis, in Proc. 11th Int. Conf. Music Information Retrieval (ISMIR), 2010, 309-314.
-
(2010)
Proc. 11th Int. Conf. Music Information Retrieval (ISMIR)
, pp. 309-314
-
-
Yoshii, K.1
Goto, M.2
-
90
-
-
83455246038
-
Bayesian nonparametric spectrogram modeling based on infinite factorial infinite hidden Markov model
-
Nakano, M.; Le Roux, J.; Kameoka, H.; Nakamura, T.; Ono, N.; Sagayama, S.: Bayesian nonparametric spectrogram modeling based on infinite factorial infinite hidden Markov model, in 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2011, 325-328.
-
(2011)
2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)
, pp. 325-328
-
-
Nakano, M.1
Le Roux, J.2
Kameoka, H.3
Nakamura, T.4
Ono, N.5
Sagayama, S.6
-
91
-
-
34547522070
-
Discriminative training for large-vocabulary speech recognition using minimum classification error
-
McDermott, E.; Hazen, T.J.; Le Roux, J.; Nakamura, A.; Katagiri, S.: Discriminative training for large-vocabulary speech recognition using minimum classification error. IEEE Trans. Audio Speech Lang. Process., 15, (1), (2007) 203-223.
-
(2007)
IEEE Trans. Audio Speech Lang. Process.
, vol.15
, Issue.1
, pp. 203-223
-
-
McDermott, E.1
Hazen, T.J.2
Le Roux, J.3
Nakamura, A.4
Katagiri, S.5
-
92
-
-
85032751545
-
Structured discriminative models for speech recognition
-
Gales, M.; Watanabe, S.; Fosler-Lussier, E.: Structured discriminative models for speech recognition. IEEE Signal Process. Mag., 29, (6), (2012), 70-81.
-
(2012)
IEEE Signal Process. Mag.
, vol.29
, Issue.6
, pp. 70-81
-
-
Gales, M.1
Watanabe, S.2
Fosler-Lussier, E.3