SCOPUS Information Retrieval Platform

IEEE Journal on Selected Topics in Signal Processing

Volume 4, Issue 3, 2010, Pages 540-547

Gaussian mixture optimization based on efficient cross-validation

Authors (3): Shinozaki, Takahiro (a); Furui, Sadaoki (a); Kawahara, Tatsuya (b)

(a) TOKYO INSTITUTE OF TECHNOLOGY (Japan)

(b) KYOTO UNIVERSITY (Japan)

Author keywords

Cross validation; Gaussian mixture; Hidden Markov model (HMM); Speech recognition; Sufficient statistics

Indexed keywords

COMPUTATIONAL COSTS; CROSS VALIDATION; EVALUATION ALGORITHM; GAUSSIAN MIXTURE; GAUSSIAN MIXTURES; GAUSSIANS; GENERALIZATION PERFORMANCE; MIXTURE COMPONENTS; MODEL COMPLEXITY; OBJECTIVE FUNCTIONS; OPTIMIZATION METHOD; ORAL PRESENTATIONS; OVERFITTING; RECOGNITION PERFORMANCE; SPEECH RECOGNITION PERFORMANCE; STEP-BY-STEP; SUFFICIENT STATISTICS; TERMINATION CRITERIA; TRAINING SETS

HIDDEN MARKOV MODELS; OPTIMIZATION

SPEECH RECOGNITION

EID: 77952624645
PISSN: 1932-4553
EISSN: None
Source Type: Journal
DOI: 10.1109/JSTSP.2010.2048235
Document Type: Article

Times cited: 7

References (30)

1
- 0033884858
- Speaker verification using adapted Gaussian mixture models
- D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, "Speaker verification using adapted Gaussian mixture models," Digital Signal Process., pp. 19-41, 2000.
- (2000) Digital Signal Process. , pp. 19-41
- Reynolds, D.A.¹ Quatieri, T.F.² Dunn, R.B.³

2
- 10044240378
- Improved adaptive Gaussian mixture model for background subtraction
- Z. Zivkovic, "Improved adaptive Gaussian mixture model for background subtraction," in Proc. ICPR, 2004, vol. 2, pp. 23-26.
- (2004) Proc. ICPR , vol.2 , pp. 23-26
- Zivkovic, Z.¹

3
- 34547504101
- Model complexity selection and cross-validation EM training for robust speaker diarization
- X. Anguera, T. Shinozaki, C. Wooters, and J. Hernando, "Model complexity selection and cross-validation EM training for robust speaker diarization," in Proc. ICASSP, 2007, vol. IV, pp. 273-276.
- (2007) Proc. ICASSP , vol.4 , pp. 273-276
- Anguera, X.¹ Shinozaki, T.² Wooters, C.³ Hernando, J.⁴

4
- 0002144369
- Tree-based state tying for high accuracy acoustic modelling
- S. Young, J. Odell, and P. Woodland, "Tree-based state tying for high accuracy acoustic modelling," in Proc. ARPA Workshop Human Lang. Technol., 1994, pp. 307-312.
- (1994) Proc. ARPA Workshop Human Lang. Technol. , pp. 307-312
- Young, S.¹ Odell, J.² Woodland, P.³

5
- 0016355478
- A new look at the statistical model identification
- H. Akaike, "A new look at the statistical model identification," IEEE Trans. Autom. Control, vol. AC-19, no. 6, pp. 716-723, Dec. 1974.
- (1974) IEEE Trans. Autom. Control , vol.AC-19 , Issue.6 , pp. 716-723
- Akaike, H.¹

6
- 0021466584
- Universal coding, information, prediction, and estimation
- J. Rissanen, "Universal coding, information, prediction, and estimation," IEEE Trans. Inf. Theory, vol. IT-30, no. 4, pp. 629-638, Jul. 1984.
- (1984) IEEE Trans. Inf. Theory , vol.IT-30 , Issue.4 , pp. 629-638
- Rissanen, J.¹

7
- 85135145174
- Acoustic modeling based on the MDL principle for speech recognition
- K. Shinoda and T. Watanabe, "Acoustic modeling based on the MDL principle for speech recognition," in Proc. Eurospeech, 1997, vol. 1, pp. 99-102.
- (1997) Proc. Eurospeech , vol.1 , pp. 99-102
- Shinoda, K.¹ Watanabe, T.²

8
- 34147153716
- A comparative evaluation of variance flooring techniques in HMM-based speaker verification
- H. Melin, J. W. Koolwaaij, J. Lindberg, and F. Bimbot, "A comparative evaluation of variance flooring techniques in HMM-based speaker verification," in Proc. ICSLP, Sydney, Australia, 1998, pp. 2379-2382.
- (1998) Proc. ICSLP , pp. 2379-2382
- Melin, H.¹ Koolwaaij, J.W.² Lindberg, J.³ Bimbot, F.⁴

9
- 84928746885
- Pattern Recognition: A Statistical Approach
- P. A. Devijver and J. Kittler, Pattern Recognition: A Statistical Approach. London, U.K.: Prentice-Hall, 1982.
- (1982) Pattern Recognition: A Statistical Approach
- Devijver, P.A.¹ Kittler, J.²

10
- 0003684449
- The Elements of Statistical Learning
- T. Hastie, R. Tibshirani, and J. Friedman, The Elements of Statistical Learning. New York: Springer-Verlag, 2001.
- (2001) The Elements of Statistical Learning
- Hastie, T.¹ Tibshirani, R.² Friedman, J.³

11
- 0017336301
- Asymptotics for and against cross-validation
- M. Stone, "Asymptotics for and against cross-validation," Biometrika, vol. 64, no. 1, pp. 29-35, Apr. 1977.
- (1977) Biometrika , vol.64 , Issue.1 , pp. 29-35
- Stone, M.¹

12
- 85135141040
- Automatic architecture design by likelihood-based context clustering with crossvalidation
- I. Rogina, "Automatic architecture design by likelihood-based context clustering with crossvalidation," in Proc. Eurospeech, Rhodes, Greece, 1997, pp. 1223-1226.
- (1997) Proc. Eurospeech , pp. 1223-1226
- Rogina, I.¹

13
- 33947650089
- HMM state clustering based on efficient cross-validation
- T. Shinozaki, "HMM state clustering based on efficient cross-validation," in Proc. ICASSP, Toulouse, France, 2006, vol. I, pp. 1157-1160.
- (2006) Proc. ICASSP , vol.1 , pp. 1157-1160
- Shinozaki, T.¹

14
- 44849119309
- Gaussian mixture optimization for HMM based on efficient cross-validation
- T. Shinozaki and T. Kawahara, "Gaussian mixture optimization for HMM based on efficient cross-validation," in Proc. Interspeech, 2007, pp. 2061-2064.
- (2007) Proc. Interspeech , pp. 2061-2064
- Shinozaki, T.¹ Kawahara, T.²

15
- 0030715097
- HMM topology design using maximum likelihood successive state splitting
- M. Ostendorf and H. Singer, "HMM topology design using maximum likelihood successive state splitting," Comput. Speech Lang., vol. 11, pp. 17-41, 1997.
- (1997) Comput. Speech Lang. , vol.11 , pp. 17-41
- Ostendorf, M.¹ Singer, H.²

16
- 33645797655
- Utterance-based selective training for the automatic creation of task-dependent acoustic models
- T. Cincarek, T. Toda, H. Saruwatari, and K. Shikano, "Utterance-based selective training for the automatic creation of task-dependent acoustic models," IEICE Trans. Inf. Syst., vol. E89-D, no. 3, pp. 962-969, 2006.
- (2006) IEICE Trans. Inf. Syst. , vol.E89-D , Issue.3 , pp. 962-969
- Cincarek, T.¹ Toda, T.² Saruwatari, H.³ Shikano, K.⁴

17
- 84867216283
- Aggregated cross-validation and its efficient application to Gaussian mixture optimization
- T. Shinozaki, S. Furui, and T. Kawahara, "Aggregated cross-validation and its efficient application to Gaussian mixture optimization," in Proc. Interspeech, 2008, pp. 2382-2385.
- (2008) Proc. Interspeech , pp. 2382-2385
- Shinozaki, T.¹ Furui, S.² Kawahara, T.³

18
- 0030211964
- Bagging predictors
- L. Breiman, "Bagging predictors," Mach. Learn., vol. 24, no. 2, pp. 123-140, 1996.
- (1996) Mach. Learn. , vol.24 , Issue.2 , pp. 123-140
- Breiman, L.¹

19
- 35549000218
- Cross-validation and aggregated EM training for robust parameter estimation
- T. Shinozaki and M. Ostendorf, "Cross-validation and aggregated EM training for robust parameter estimation," Comput. Speech Lang., vol. 22, no. 2, pp. 185-195, 2008.
- (2008) Comput. Speech Lang. , vol.22 , Issue.2 , pp. 185-195
- Shinozaki, T.¹ Ostendorf, M.²

20
- 51449090592
- GMM and HMM training by aggregated EM algorithm with increased ensemble sizes for robust parameter estimation
- T. Shinozaki and T. Kawahara, "GMM and HMM training by aggregated EM algorithm with increased ensemble sizes for robust parameter estimation," in Proc. ICASSP, 2008, pp. 4405-4408.
- (2008) Proc. ICASSP , pp. 4405-4408
- Shinozaki, T.¹ Kawahara, T.²

21
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc., Ser. B, vol. 39, no. 1, pp. 1-38, 1977.
- (1977) J. R. Statist. Soc., Ser. B , vol.39 , Issue.1 , pp. 1-38
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

22
- 3042854734
- Benchmark test for speech recognition using the Corpus of Spontaneous Japanese
- T. Kawahara, H. Nanjo, T. Shinozaki, and S. Furui, "Benchmark test for speech recognition using the Corpus of Spontaneous Japanese," in Proc. SSPR2003, 2003, pp. 135-138.
- (2003) Proc. SSPR2003 , pp. 135-138
- Kawahara, T.¹ Nanjo, H.² Shinozaki, T.³ Furui, S.⁴

23
- 0003822743
- The HTK Book
- S. Young et al., The HTK Book. Cambridge, U.K.: Cambridge Univ. Eng. Dept., 2005.
- (2005) The HTK Book
- Young, S.¹

24
- 85128340946
- An efficient two-pass search algorithm using word trellis index
- A. Lee, T. Kawahara, and S. Doshita, "An efficient two-pass search algorithm using word trellis index," in Proc. ICSLP, 1998, pp. 1831-1834.
- (1998) Proc. ICSLP , pp. 1831-1834
- Lee, A.¹ Kawahara, T.² Doshita, S.³

25
- 0024909979
- Some statistical issues in the comparison of speech recognition algorithms
- L. Gillick and S. Cox, "Some statistical issues in the comparison of speech recognition algorithms," in Proc. ICASSP, 1989, pp. 532-535.
- (1989) Proc. ICASSP , pp. 532-535
- Gillick, L.¹ Cox, S.²

26
- 0033225865
- An introduction to variational methods for graphical models
- M. I. Jordan, Z. Ghahramani, T. S. Jaakkola, and L. K. Saul, "An introduction to variational methods for graphical models," Mach. Learn., vol. 37, pp. 183-233, 1999.
- (1999) Mach. Learn. , vol.37 , pp. 183-233
- Jordan, M.I.¹ Ghahramani, Z.² Jaakkola, T.S.³ Saul, L.K.⁴

27
- 33646418145
- Automatic determination of acoustic model topology using variational Bayesian estimation and clustering for large vocabulary continuous speech recognition
- S. Watanabe, A. Sako, and A. Nakamura, "Automatic determination of acoustic model topology using variational Bayesian estimation and clustering for large vocabulary continuous speech recognition," IEEE Trans. Speech Audio Process., vol. 14, no. 3, pp. 855-872, May 2006.
- (2006) IEEE Trans. Speech Audio Process. , vol.14 , Issue.3 , pp. 855-872
- Watanabe, S.¹ Sako, A.² Nakamura, A.³

28
- 84867213785
- Bayesian context clustering using cross valid prior distribution for HMM-based speech recognition
- K. Hashimoto, H. Zen, Y. Nankaku, A. Lee, and K. Tokuda, "Bayesian context clustering using cross valid prior distribution for HMM-based speech recognition," in Proc. Interspeech, 2008, pp. 936-939.
- (2008) Proc. Interspeech , pp. 936-939
- Hashimoto, K.¹ Zen, H.² Nankaku, Y.³ Lee, A.⁴ Tokuda, K.⁵

29
- 0022890536
- Maximum mutual information estimation of hidden Markov model parameters for speech recognition
- L. R. Bahl, P. F. Brown, P. V. de Souza, and R. L. Mercer, "Maximum mutual information estimation of hidden Markov model parameters for speech recognition," in Proc. ICASSP, 1986, pp. 49-52.
- (1986) Proc. ICASSP , pp. 49-52
- Bahl, L.R.¹ Brown, P.F.² De Souza, P.V.³ Mercer, R.L.⁴

30
- 0036461035
- Large scale discriminative training of hidden Markov models for speech recognition
- P. C. Woodland and D. Povey, "Large scale discriminative training of hidden Markov models for speech recognition," Comput. Speech Lang., vol. 16, pp. 25-47, 2002.
- (2002) Comput. Speech Lang. , vol.16 , pp. 25-47
- Woodland, P.C.¹ Povey, D.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.