SCOPUS Information Retrieval Platform

IEICE Transactions on Information and Systems

Volume E93-D, Issue 9, 2010, Pages 2348-2362

Acoustic model adaptation for speech recognition

(1) Shinoda, Koichi a

a TOKYO INSTITUTE OF TECHNOLOGY (Japan)

Author keywords

Acoustic model adaptation; Hidden Markov models; Speech recognition

Indexed keywords

DEEP NEURAL NETWORKS; HIDDEN MARKOV MODELS; MARKOV PROCESSES; MAXIMUM LIKELIHOOD; MAXIMUM LIKELIHOOD ESTIMATION;

ACOUSTIC MODEL ADAPTATION; ADAPTATION TECHNIQUES; CONTINUOUS DENSITY HIDDEN MARKOV MODELS; INPUT DATAS; MAXIMUM A POSTERIORI ESTIMATION; MAXIMUM LIKELIHOOD LINEAR REGRESSION; RECOGNITION ACCURACY; TRAINING DATA;

SPEECH RECOGNITION;

EID: 77956865237 PISSN: 09168532 EISSN: 17451361 Source Type: Journal
DOI: 10.1587/transinf.E93.D.2348 Document Type: Article

Times cited : (15)

References (104)

1
- 0030362995
- A compact model for speaker-adaptive training
- T. Anastasakos, J. McDonough, R. Schwartz, and J. Makhoul, "A compact model for speaker-adaptive training," Proc. ICSLP96, vol.2, FrP2L1.3, 1996.
- (1996) Proc. ICSLP96 , vol.2
- Anastasakos, T.¹ McDonough, J.² Schwartz, R.³ Makhoul, J.⁴

2
- 0016067897
- Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
- B. Atal, "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," J. Acoust. Soc. Am., vol.55, pp.1304-1312, 1974.
- (1974) J. Acoust. Soc. Am. , vol.55 , pp. 1304-1312
- Atal, B.¹

3
- 0346008165
- Statistical language model adaptation: Review and perspectives
- J.R. Bellegarda, "Statistical language model adaptation: Review and perspectives," Speech Commun., vol.42, no.1, pp.93-108, 2004.
- (2004) Speech Commun. , vol.42 , Issue.1 , pp. 93-108
- Bellegarda, J.R.¹

4
- 0347899510
- α-Jacobian environmental adaptation
- C. Cerisara, L. Rigazio, and J.C. Junqua, "α-Jacobian environmental adaptation," Speech Commun., vol.42, no.1, pp.25-41, 2004.
- (2004) Speech Commun. , vol.42 , Issue.1 , pp. 25-41
- Cerisara, C.¹ Rigazio, L.² Junqua, J.C.³

5
- 85009097035
- Fast speaker adaptation using Eigenspace-based maximum likelihood linear regression
- K. Chen, W. Liau, H. Wang, and L. Lee, "Fast speaker adaptation using Eigenspace-based maximum likelihood linear regression," Proc. ICSLP-2000, 2000.
- (2000) Proc. ICSLP-2000
- Chen, K.¹ Liau, W.² Wang, H.³ Lee, L.⁴

6
- 85135272864
- Maximum a posteriori linear regression for hidden Markov model adaptation
- C. Chesta, O. Siohan, and C.-H. Lee, "Maximum a posteriori linear regression for hidden Markov model adaptation," Proc. EuroSpeech99, pp.211-214, 1999.
- (1999) Proc. EuroSpeech99 , pp. 211-214
- Chesta, C.¹ Siohan, O.² Lee, C.-H.³

7
- 0030643678
- Improved Bayesian learning of hidden Markov models for speaker adaptation
- J.-T. Chien, C.-H. Lee, and H.-C. Wang, "Improved Bayesian learning of hidden Markov models for speaker adaptation," Proc. ICASSP-97, pp.1027-1039, 1997.
- (1997) Proc. ICASSP-97 , pp. 1027-1039
- Chien, J.-T.¹ Lee, C.-H.² Wang, H.-C.³

8
- 0036649879
- Quasi-Bayes linear regression for sequential learning of hidden Markov models
- J.-T. Chien, "Quasi-Bayes linear regression for sequential learning of hidden Markov models," IEEE Trans. Speech Audio Process., vol.10, no.4, pp.268-278, 2002.
- (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.4 , pp. 268-278
- Chien, J.-T.¹

9
- 84874875877
- Maximum a posteriori linear regression with elliptically symmetric matrix variance priors
- W. Chou, "Maximum a posteriori linear regression with elliptically symmetric matrix variance priors," Proc. Eurospeech-99, vol.1, pp.1-4, 1999.
- (1999) Proc. Eurospeech-99 , vol.1 , pp. 1-4
- Chou, W.¹

10
- 0029209204
- Predictive speaker adaptation in speech recognition
- S. Cox, "Predictive speaker adaptation in speech recognition," Comput. Speech Lang., vol.9, pp.1-17, 1995.
- (1995) Comput. Speech Lang. , vol.9 , pp. 1-17
- Cox, S.¹

11
- 0036294872
- Efficient adaptation text design based on the Kullback-Leibler measure
- X. Cui and A. Alwan, "Efficient adaptation text design based on the Kullback-Leibler measure," Proc. ICASSP-2002, pp.I-613-616, 2002.
- (2002) Proc. ICASSP-2002
- Cui, X.¹ Alwan, A.²

12
- 64149094039
- Robust speaker adaptation by weighted model averaging based on the minimum description length criterion
- X. Cui and A. Alwan, "Robust speaker adaptation by weighted model averaging based on the minimum description length criterion," IEEE Trans. Audio, Speech, and Language Processing, vol.15, no.2, pp.652-660, 2007.
- (2007) IEEE Trans. Audio Speech and Language Processing , vol.15 , Issue.2 , pp. 652-660
- Cui, X.¹ Alwan, A.²

13
- 0005448921
- McGraw-Hill
- M.H. DeGroot, Statistical Decision Theory and Bayesian Analysis, McGraw-Hill, 1970.
- (1970) Statistical Decision Theory and Bayesian Analysis
- DeGroot, M.H.¹

14
- 0029375590
- Speaker adaptation using constrained estimation of Gaussian mixtures
- V.V. Digalakis, D. Rtischev, and L.G. Neumeyer, "Speaker adaptation using constrained estimation of Gaussian mixtures," IEEE Trans. Speech Audio Process., vol.3, no.5, pp.357-366, 1995.
- (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.5 , pp. 357-366
- Digalakis, V.V.¹ Rtischev, D.² Neumeyer, L.G.³

15
- 0030189744
- Speaker adaptation using combined transformation and Bayesian methods
- V.V. Digalakis and L.G. Neumeyer, "Speaker adaptation using combined transformation and Bayesian methods," IEEE Trans. Speech Audio Process., vol.4, no.4, pp.294-300, 1996.
- (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.4 , pp. 294-300
- Digalakis, V.V.¹ Neumeyer, L.G.²

16
- 51449115117
- Fast speaker adaptation using non-negative matrix factorization
- J. Duchateau, T. Leroy, K. Demuynck, and H. Van hamme, "Fast speaker adaptation using non-negative matrix factorization," ICASSP '08, pp.4269-4272, 2008.
- (2008) ICASSP '08 , pp. 4269-4272
- Duchateau, J.¹ Leroy, T.² Demuynck, K.³ Van hamme, H.⁴

17
- 0029725604
- A parametric approach to vocal tract length normalization
- E. Eide and H. Gish, "A parametric approach to vocal tract length normalization," Proc. ICASSP96, vol.1, pp.346-348, 1996.
- (1996) Proc. ICASSP96 , vol.1 , pp. 346-348
- Eide, E.¹ Gish, H.²

18
- 51449094035
- Rapid vocal tract length normalization using maximum likelihood estimation
- T. Emori and K. Shinoda, "Rapid vocal tract length normalization using maximum likelihood estimation," Proc. Eurospeech-2001, pp.1649-1652, 2001.
- (2001) Proc. Eurospeech-2001 , pp. 1649-1652
- Emori, T.¹ Shinoda, K.²

19
- 0019009839
- A training procedure for isolated word recognition systems
- S. Furui, "A training procedure for isolated word recognition systems," IEEE Trans. Acoust. Speech Signal Process., vol.ASSP-28, no.2, pp.129-136, 1980.
- (1980) IEEE Trans. Acoust. Speech Signal Process. , vol.28 , Issue.2 , pp. 129-136
- Furui, S.¹

20
- 0024899244
- Unsupervised speaker adaptation method based on hierarchical spectral clustering
- S. Furui, "Unsupervised speaker adaptation method based on hierarchical spectral clustering," Proc. ICASSP-89, pp.286-289, Glasgow, 1989.
- (1989) ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings , vol.1 , pp. 286-289
- Furui, S.¹

21
- 77949431571
- Generalization problem in ASR acoustic model training and adaptation
- Merano
- S. Furui, "Generalization Problem in ASR Acoustic Model Training and Adaptation," IEEE ASRU Workshop, Merano, pp.1-10, 2009.
- (2009) IEEE ASRU Workshop , pp. 1-10
- Furui, S.¹

22
- 0030263447
- Mean and covariance adaptation within MLLR framework
- M.J.F. Gales and P.C. Woodland, "Mean and covariance adaptation within MLLR framework," Comput. Speech Lang., vol.10, pp.249-264, 1996.
- (1996) Comput. Speech Lang. , vol.10 , pp. 249-264
- Gales, M.J.F.¹ Woodland, P.C.²

23
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- M.J.F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Comput. Speech Lang., vol.12, pp.75-98, 1998.
- (1998) Comput. Speech Lang. , vol.12 , pp. 75-98
- Gales, M.J.F.¹

24
- 0034227757
- Cluster adaptive training for hidden Markov models
- M.J.F. Gales, "Cluster adaptive training for hidden Markov models," IEEE Trans. Audio and Speech Processing, vol.8, no.4, pp.417-428, 2000.
- (2000) IEEE Trans. Audio and Speech Processing , vol.8 , Issue.4 , pp. 417-428
- Gales, M.J.F.¹

25
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
- April
- J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol.2, no.2, pp.291-298, April 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 291-298
- Gauvain, J.-L.¹ Lee, C.-H.²

26
- 55049094528
- Techniques in rapid unsupervised speaker adaptation based on HMM-sufficient statistics
- R. Gomez, T. Toda, H. Saruwatari, and K. Shikano, "Techniques in rapid unsupervised speaker adaptation based on HMM-sufficient statistics," Speech Commun., vol.51, pp.42-57, 2009.
- (2009) Speech Commun. , vol.51 , pp. 42-57
- Gomez, R.¹ Toda, T.² Saruwatari, H.³ Shikano, K.⁴

27
- 84867218967
- A fast speaker adaptation method using aspect model
- S. Hahm, A. Ito, S. Makino, and M. Suzuki, "A fast speaker adaptation method using aspect model," Interspeech '08, pp.1221-1224, 2008.
- (2008) Interspeech '08 , pp. 1221-1224
- Hahm, S.¹ Ito, A.² Makino, S.³ Suzuki, M.⁴

28
- 0009625231
- A comparison of novel techniques for rapid speaker adaptation
- T.J. Hazen, "A comparison of novel techniques for rapid speaker adaptation," Speech Commun., vol.31, pp.15-33, 2000.
- (2000) Speech Commun. , vol.31 , pp. 15-33
- Hazen, T.J.¹

29
- 85009113198
- Analysis of speaker variability
- C. Huang, T. Chen, S. Li, E. Chang, and J. Zhou, "Analysis of speaker variability," EuroSpeech-2001, 2001.
- (2001) EuroSpeech-2001
- Huang, C.¹ Chen, T.² Li, S.³ Chang, E.⁴ Zhou, J.⁵

30
- 0029377113
- Bayesian adaptive training of the parameters of hidden Markov model for speech recognition
- Q. Huo and C. Chan, "Bayesian adaptive training of the parameters of hidden Markov model for speech recognition," IEEE Trans. Speech Audio Process., vol.3, no.5, pp.334-345, 1995.
- (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.5 , pp. 334-345
- Huo, Q.¹ Chan, C.²

31
- 0031103160
- On-line adaptive learning of the continuous density hidden Markov model based on approximate recursive Bayes estimate
- March
- Q. Huo and C.-H. Lee, "On-line adaptive learning of the continuous density hidden Markov model based on approximate recursive Bayes estimate," IEEE Trans. Audio and Speech Processing, vol.5, no.2, pp.161-172, March 1997.
- (1997) IEEE Trans. Audio and Speech Processing , vol.5 , Issue.2 , pp. 161-172
- Huo, Q.¹ Lee, C.-H.²

32
- 0032122203
- On-line adaptive learning of the correlated continuous-density hidden Markov model for speech recognition
- Q. Huo and C.-H. Lee, "On-line adaptive learning of the correlated continuous-density hidden Markov model for speech recognition," IEEE Trans. Speech Audio Process., vol.6, no.4, pp.386-397, 1998.
- (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.4 , pp. 386-397
- Huo, Q.¹ Lee, C.-H.²

33
- 0013208309
- Modeling dependency in adaptation of acoustic models using multiscale tree processes
- A. Kannan and M. Ostendorf, "Modeling dependency in adaptation of acoustic models using multiscale tree processes," Proc. EuroSpeech-97, pp.1863-1866, 1997.
- (1997) Proc. EuroSpeech-97 , pp. 1863-1866
- Kannan, A.¹ Ostendorf, M.²

34
- 33846232075
- Bayesian adaptation revisited
- P. Kenny, G. Boulianne, and P. Dumouchel, "Bayesian adaptation revisited," Workshop ISCA ITRW ASR2000, pp.112-119, 2000.
- (2000) Workshop ISCA ITRW ASR2000 , pp. 112-119
- Kenny, P.¹ Boulianne, G.² Dumouchel, P.³

35
- 77956864267
- Inter-speaker correlations, intra-speaker correlations and Bayesian adaptation
- Sophia-Antipolis
- P. Kenny, G. Boulianne, and P. Dumouchel, "Inter-speaker correlations, intra-speaker correlations and Bayesian adaptation," Proc. ISCA ITR-Workshop 2001, Sophia-Antipolis, 2001.
- (2001) Proc. ISCA ITR-workshop 2001
- Kenny, P.¹ Boulianne, G.² Dumouchel, P.³

36
- 22544446112
- What is the best type of prior distribution for EMAP speaker adaptation
- P. Kenny, G. Boulianne, and P. Dumouchel, "What is the best type of prior distribution for EMAP speaker adaptation," EuroSpeech-2001, pp.1207-1210, 2001.
- (2001) EuroSpeech-2001 , pp. 1207-1210
- Kenny, P.¹ Boulianne, G.² Dumouchel, P.³

37
- 18744386134
- Eigenvoice modeling with sparse training data
- P. Kenny, G. Boulianne, and P. Dumouchel, "Eigenvoice modeling with sparse training data," IEEE Trans. Speech Audio Process., vol.13, no.3, pp.345-354, 2005.
- (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.3 , pp. 345-354
- Kenny, P.¹ Boulianne, G.² Dumouchel, P.³

38
- 0347789326
- Maximum a posteriori adaptation of HMM parameters based on probabilistic component analysis
- D.K. Kim and N.S. Kim, "Maximum a posteriori adaptation of HMM parameters based on probabilistic component analysis," Proc. ISCA ITR-Workshop 2001, pp.25-28, 2001.
- (2001) Proc. ISCA ITR-workshop 2001 , pp. 25-28
- Kim, D.K.¹ Kim, N.S.²

39
- 0346460276
- Maximum a posteriori adaptation of HMM parameters based on speaker space projection
- D.K. Kim and N.S. Kim, "Maximum a posteriori adaptation of HMM parameters based on speaker space projection," Speech Commun., vol.42, pp.59-73, 2004.
- (2004) Speech Commun. , vol.42 , pp. 59-73
- Kim, D.K.¹ Kim, N.S.²

40
- 14644409729
- Rapid online adaptation based on transformation space model evolution
- D.K. Kim and N.S. Kim, "Rapid online adaptation based on transformation space model evolution," IEEE Trans. Speech Audio Process., vol.13, no.2, pp.194-202, 2005.
- (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.2 , pp. 194-202
- Kim, D.K.¹ Kim, N.S.²

41
- 0025236073
- Application of the Karhunen-Loève procedure for the characterization of human faces
- M. Kirby and L. Sirovich, "Application of the Karhunen-Loève procedure for the characterization of human faces," IEEE Trans. Pattern Anal. Mach. Intell., vol.12, no.1, pp.103-108, 1990.
- (1990) IEEE Trans. Pattern Anal. Mach. Intell. , vol.12 , Issue.1 , pp. 103-108
- Kirby, M.¹ Sirovich, L.²

42
- 85009078667
- Tree-structured speaker clustering for fast speaker adaptation
- Adelaide
- T. Kosaka and S. Sagayama, "Tree-structured speaker clustering for fast speaker adaptation," ICASSP-94, vol.1, pp.245-248, Adelaide, 1994.
- (1994) ICASSP-94 , vol.1 , pp. 245-248
- Kosaka, T.¹ Sagayama, S.²

43
- 84871609195
- Eigenvoices for speaker adaptation
- R. Kuhn, P. Nguyen, J.-C. Junqua, L. Goldwasser, N. Niedzielski, S. Fincke, K. Field, and M. Contolini, "Eigenvoices for speaker adaptation," Proc. ICSLP-98, pp.1771-1774, 1998.
- (1998) Proc. ICSLP-98 , pp. 1771-1774
- Kuhn, R.¹ Nguyen, P.² Junqua, J.-C.³ Goldwasser, L.⁴ Niedzielski, N.⁵ Fincke, S.⁶ Field, K.⁷ Contolini, M.⁸

44
- 77956865520
- Fast speaker adaptation using a priori knowledge
- R. Kuhn, J.-C. Junqua, R. Boman, N. Niedzielski, S. Fincke, K. Field, and M. Contolini, "Fast speaker adaptation using a priori knowledge," Proc. ICASSP-99, 2001.
- (2001) Proc. ICASSP-99
- Kuhn, R.¹ Junqua, J.-C.² Boman, R.³ Niedzielski, N.⁴ Fincke, S.⁵ Field, K.⁶ Contolini, M.⁷

45
- 0034320005
- Rapid speaker adaptation in Eigenvoice space robust speech recognition
- R. Kuhn, J.-C. Junqua, P. Nguyen, and N. Niedzielski, "Rapid speaker adaptation in Eigenvoice space robust speech recognition," IEEE Trans. Speech Audio Process., vol.8, no.6, pp.695-707, 2000.
- (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.6 , pp. 695-707
- Kuhn, R.¹ Junqua, J.-C.² Nguyen, P.³ Niedzielski, N.⁴

46
- 0021458298
- A posteriori estimation of correlated jointly Gaussian mean vectors
- M.J. Lasry and R.M. Stern, "A posteriori estimation of correlated jointly Gaussian mean vectors," IEEE Trans. Pattern Anal. Mach. Intell., vol.6, no.4, pp.530-535, 1984.
- (1984) IEEE Trans. Pattern Anal. Mach. Intell. , vol.6 , Issue.4 , pp. 530-535
- Lasry, M.J.¹ Stern, R.M.²

47
- 0026142334
- A study on speaker adaptation of the parameters of continuous density hidden Markov models
- April
- C.-H. Lee, C.-H. Lin, and B.-H. Juang, "A study on speaker adaptation of the parameters of continuous density hidden Markov models," IEEE Trans. Signal Process., vol.39, no.4, pp.806-814, April 1991.
- (1991) IEEE Trans. Signal Process. , vol.39 , Issue.4 , pp. 806-814
- Lee, C.-H.¹ Lin, C.-H.² Juang, B.-H.³

48
- 0029747183
- Speaker normalization using efficient frequency warping procedures
- L. Lee and R.C. Rose, "Speaker normalization using efficient frequency warping procedures," Proc. ICASSP96, vol.1, pp.353-356, 1996.
- (1996) Proc. ICASSP96 , vol.1 , pp. 353-356
- Lee, L.¹ Rose, R.C.²

49
- 0000159105
- On adaptive decision rules and decision parameter adaptation for automatic speech recognition
- C.-H. Lee and Q. Huo, "On adaptive decision rules and decision parameter adaptation for automatic speech recognition," Proc. IEEE, vol.88, pp.1241-1269, 2000.
- (2000) Proc. IEEE , vol.88 , pp. 1241-1269
- Lee, C.-H.¹ Huo, Q.²

50
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous-density hidden Markov models
- C.J. Leggetter and P.C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous-density hidden Markov models," Comput. Speech Lang., vol.9, pp.171-185, 1995.
- (1995) Comput. Speech Lang. , vol.9 , pp. 171-185
- Leggetter, C.J.¹ Woodland, P.C.²

51
- 62249130045
- A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions
- J. Li, L. Deng, D. Yu, Y. Gong, and A. Acero, "A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions," Comput. Speech Lang., vol.23, pp.389-405, 2009.
- (2009) Comput. Speech Lang. , vol.23 , pp. 389-405
- Li, J.¹ Deng, L.² Yu, D.³ Gong, Y.⁴ Acero, A.⁵

52
- 84867224386
- Speaker adaptive training using Shift-MLLR
- Brisbane
- J. Loof, C. Gollan, and H. Ney, "Speaker adaptive training using Shift-MLLR," Proc. Interspeech2008, pp.1701-1704, Brisbane, 2008.
- (2008) Proc. Interspeech2008 , pp. 1701-1704
- Loof, J.¹ Gollan, C.² Ney, H.³

53
- 27644511614
- Kernel eigenvoice speaker adaptation
- B. Mak, J.T. Kwak, and S. Ho, "Kernel eigenvoice speaker adaptation," IEEE Trans. Audio, Speech, and Language Processing, vol.13, no.5, pp.984-992, 2005.
- (2005) IEEE Trans. Audio Speech and Language Processing , vol.13 , Issue.5 , pp. 984-992
- Mak, B.¹ Kwak, J.T.² Ho, S.³

54
- 34047246852
- Embedded kernel eigenvoice speaker adaptation and its implication to reference speaker weighting
- B.K.-W. Mak, R.W.-H. Hsiao, S.K.-L. Ho, and J.T. Kwak, "Embedded kernel eigenvoice speaker adaptation and its implication to reference speaker weighting," IEEE Trans. Audio, Speech, and Language Processing, vol.14, no.4, pp.1267-1280, 2006.
- (2006) IEEE Trans. Audio Speech and Language Processing , vol.14 , Issue.4 , pp. 1267-1280
- Mak, B.K.-W.¹ Hsiao, R.W.-H.² Ho, S.K.-L.³ Kwak, J.T.⁴

55
- 68549133255
- Maximum penalized likelihood kernel regression for fast adaptation
- B.K.-W. Mak, T.-C. Lai, I.W. Tsang, and J.T.-Y. Kwak, "Maximum penalized likelihood kernel regression for fast adaptation," IEEE Trans. Audio, Speech, and Language Processing, vol.17, no.7, pp.1372-1381, 2009.
- (2009) IEEE Trans. Audio Speech and Language Processing , vol.17 , Issue.7 , pp. 1372-1381
- Mak, B.K.-W.¹ Lai, T.-C.² Tsang, I.W.³ Kwak, J.T.-Y.⁴

56
- 53849127143
- Improving robustness of MLLR adaptation with speaker-clustered regression class trees
- A. Mandal, M. Ostendorf, and A. Stolcke, "Improving robustness of MLLR adaptation with speaker-clustered regression class trees," Comput. Speech Lang., vol.23, pp.176-199, 2009.
- (2009) Comput. Speech Lang. , vol.23 , pp. 176-199
- Mandal, A.¹ Ostendorf, M.² Stolcke, A.³

57
- 0031681725
- N-best-based unsupervised speaker adaptation for speech recognition
- T. Matsui and S. Furui, "N-best-based unsupervised speaker adaptation for speech recognition," Comput. Speech Lang., vol.12, pp.41-50, 1998.
- (1998) Comput. Speech Lang. , vol.12 , pp. 41-50
- Matsui, T.¹ Furui, S.²

58
- 0347269184
- Speaker adaptation with all-pass transforms
- J. McDonough, T. Schaaf, and A. Waibel, "Speaker adaptation with all-pass transforms," Speech Commun., vol.42, pp.75-91, 2004.
- (2004) Speech Commun. , vol.42 , pp. 75-91
- McDonough, J.¹ Schaaf, T.² Waibel, A.³

59
- 0029725301
- A vector Taylor series approach for environment-independent speech recognition
- P.J. Moreno, B. Raj, and R.M. Stern, "A vector Taylor series approach for environment-independent speech recognition," Proc. ICASSP '96, pp.733-736, 1996.
- (1996) Proc. ICASSP '96 , pp. 733-736
- Moreno, P.J.¹ Raj, B.² Stern, R.M.³

60
- 70450193702
- Speaker adaptation based on two-step active learning
- H. Murakami, K. Shinoda, and S. Furui, "Speaker adaptation based on two-step active learning," Interspeech '09, pp.576-579, 2009.
- (2009) Interspeech '09 , pp. 576-579
- Murakami, H.¹ Shinoda, K.² Furui, S.³

61
- 85009067865
- Speaker recognition by separating phonetic space and speaker space
- N. Nishida and Y. Ariki, "Speaker recognition by separating phonetic space and speaker space," Proc. Eurospeech-2001, 2001.
- (2001) Proc. Eurospeech-2001
- Nishida, N.¹ Ariki, Y.²

62
- 85135280100
- Maximum likelihood eigenspace and MLLR for speech recognition in noisy environment
- P. Nguyen, C. Wellekens, and J.-C. Junqua, "Maximum likelihood eigenspace and MLLR for speech recognition in noisy environment," Proc. Eurospeech-99, pp.2519-2522, 1999.
- (1999) Proc. Eurospeech-99 , pp. 2519-2522
- Nguyen, P.¹ Wellekens, C.² Junqua, J.-C.³

63
- 85009289284
- LU factorization for feature transformation
- P. Nguyen, L. Rigazio, C. Wellekens, and J.C. Junqua, "LU factorization for feature transformation," Proc. ICSLP-2002, pp.73-76, 2002.
- (2002) Proc. ICSLP-2002 , pp. 73-76
- Nguyen, P.¹ Rigazio, L.² Wellekens, C.³ Junqua, J.C.⁴

64
- 85135109228
- Speaker adaptation based on transfer vector field smoothing with continuous mixture density HMMs
- K. Ohkura, M. Sugiyama, and S. Sagayama, "Speaker adaptation based on transfer vector field smoothing with continuous mixture density HMMs," Proc. ICSLP '92, pp.369-372, 1992.
- (1992) Proc. ICSLP '92 , pp. 369-372
- Ohkura, K.¹ Sugiyama, M.² Sagayama, S.³

65
- 0141813799
- Speaker adaptation by hierarchical eigenvoice
- Y. Onishi and K. Iso, "Speaker adaptation by hierarchical eigenvoice," Proc. of ICASSP-2003, pp.1576-1579, 2003.
- (2003) Proc. of ICASSP-2003 , pp. 1576-1579
- Onishi, Y.¹ Iso, K.²

66
- 0031704151
- Speaker clustering and transformation for speaker adaptation in speech recognition systems
- M. Padmanabhan, L.R. Bahl, D. Nahamoo, and M.A. Picheny, "Speaker clustering and transformation for speaker adaptation in speech recognition systems," IEEE Trans. Speech Audio Process., vol.6, no.1, pp.71-77, 1998.
- (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.1 , pp. 71-77
- Padmanabhan, M.¹ Bahl, L.R.² Nahamoo, D.³ Picheny, M.A.⁴

67
- 0003778679
- Lattice-based unsupervised MLLR for speaker adaptation
- M. Padmanabhan, G. Saon, and G. Zweig, "Lattice-based unsupervised MLLR for speaker adaptation," Proc. ISCA ITR-Workshop 2000, pp.128-131, 2000.
- (2000) Proc. ISCA ITR-workshop 2000 , pp. 128-131
- Padmanabhan, M.¹ Saon, G.² Zweig, G.³

68
- 7544235206
- Maximum-likelihood nonlinear transformation for acoustic adaptation
- M. Padmanabhan and S. Dharanipragada, "Maximum-likelihood nonlinear transformation for acoustic adaptation," IEEE Trans. Speech Audio Process., vol.12, no.6, pp.572-578, 2004.
- (2004) IEEE Trans. Speech Audio Process. , vol.12 , Issue.6 , pp. 572-578
- Padmanabhan, M.¹ Dharanipragada, S.²

69
- 0022181749
- Some acoustic-phonetic correlates of speech produced in noise
- D.B. Pisoni, R.H. Bernacki, H.C. Nusbaum, and M. Yuchtman, "Some acoustic-phonetic correlates of speech produced in noise," Proc. ICASSP, pp.1581-1584, 1985.
- (1985) Proc. ICASSP , pp. 1581-1584
- Pisoni, D.B.¹ Bernacki, R.H.² Nusbaum, H.C.³ Yuchtman, M.⁴

70
- 85009089020
- Vocal tract normalization equals linear transformation in cepstral space
- M. Pitz, S. Molau, R. Schluter, and H. Ney, "Vocal tract normalization equals linear transformation in cepstral space," Proc. Eurospeech-2001, 2001.
- (2001) Proc. Eurospeech-2001
- Pitz, M.¹ Molau, S.² Schluter, R.³ Ney, H.⁴

71
- 0030672082
- Experiments in speaker normalization and adaptation for large vocabulary speech recognition
- D. Pye and P.C. Woodland, "Experiments in speaker normalization and adaptation for large vocabulary speech recognition," Proc. ICASSP97, vol.2, pp.1047-1050, 1997.
- (1997) Proc. ICASSP97 , vol.2 , pp. 1047-1050
- Pye, D.¹ Woodland, P.C.²

72
- 0029769867
- Signal bias removal by maximum likelihood estimation for robust telephone speech recognition
- M. Rahim and B.-H. Juang, "Signal bias removal by maximum likelihood estimation for robust telephone speech recognition," IEEE Trans. Speech Audio Process., vol.4, no.1, pp.19-30, 1996.
- (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.1 , pp. 19-30
- Rahim, M.¹ Juang, B.-H.²

73
- 0030649027
- Jacobian approach to fast acoustic model adaptation
- S. Sagayama, Y. Yamaguchi, S. Takahashi, and J. Takahashi, "Jacobian approach to fast acoustic model adaptation," Proc. ICASSP '97, pp.835-838, 1997.
- (1997) Proc. ICASSP '97 , pp. 835-838
- Sagayama, S.¹ Yamaguchi, Y.² Takahashi, S.³ Takahashi, J.⁴

74
- 0347159336
- Analytic methods for acoustic model adaptation: A review
- Sophia-Antipolis
- S. Sagayama, K. Shinoda, M. Nakai, and H. Shimodaira, "Analytic methods for acoustic model adaptation: A review," ISCA ITR-Workshop, pp.67-76, Sophia-Antipolis, 2001.
- (2001) ISCA ITR-workshop , pp. 67-76
- Sagayama, S.¹ Shinoda, K.² Nakai, M.³ Shimodaira, H.⁴

75
- 0030149866
- A maximum likelihood approach to stochastic matching for robust speech recognition
- A. Sankar and C.-H. Lee, "A maximum likelihood approach to stochastic matching for robust speech recognition," IEEE Trans. Speech Audio Process., vol.4, no.3, pp.190-202, 1996.
- (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.3 , pp. 190-202
- Sankar, A.¹ Lee, C.-H.²

76
- 0031103747
- A Markov random field approach to Bayesian speaker adaptation
- B.M. Shahshahani, "A Markov random field approach to Bayesian speaker adaptation," IEEE Trans. Speech Audio Process., vol.5, no.2, pp.183-191, 1997.
- (1997) IEEE Trans. Speech Audio Process. , vol.5 , Issue.2 , pp. 183-191
- Shahshahani, B.M.¹

77
- 0026384360
- Speaker adaptation for demi-syllable-based continuous-density HMM
- Toronto
- K. Shinoda, K. Iso, and T. Watanabe, "Speaker adaptation for demi-syllable-based continuous-density HMM," Proc. ICASSP-91, pp.857-860, Toronto, 1991.
- (1991) Proc. ICASSP-91 , pp. 857-860
- Shinoda, K.¹ Iso, K.² Watanabe, T.³

78
- 0002488301
- Speaker adaptation with autonomous control using tree structure
- K. Shinoda and T. Watanabe, "Speaker adaptation with autonomous control using tree structure," Proc. EuroSpeech-95, pp.1143-1146, 1995.
- (1995) Proc. EuroSpeech-95 , pp. 1143-1146
- Shinoda, K.¹ Watanabe, T.²

79
- 0029747193
- Speaker adaptation with autonomous model complexity control by MDL principle
- Atlanta
- K. Shinoda and T. Watanabe, "Speaker adaptation with autonomous model complexity control by MDL principle," Proc. ICASSP '96, pp.717-720, Atlanta, 1996.
- (1996) Proc. ICASSP '96 , pp. 717-720
- Shinoda, K.¹ Watanabe, T.²

80
- 0030640789
- Structural MAP speaker adaptation using hierarchical priors
- K. Shinoda and C.-H. Lee, "Structural MAP speaker adaptation using hierarchical priors," Proc. IEEE Workshop on Speech Recognition and Understanding, 1997.
- (1997) Proc. IEEE Workshop on Speech Recognition and Understanding
- Shinoda, K.¹ Lee, C.-H.²

81
- 0035279111
- A structural Bayes approach to speaker adaptation
- K. Shinoda and C.-H. Lee, "A structural Bayes approach to speaker adaptation," IEEE Trans. Speech Audio Process., vol.9, no.3, pp.276-287, 2001.
- (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.3 , pp. 276-287
- Shinoda, K.¹ Lee, C.-H.²

82
- 70349220091
- Unsupervised cross-validation adaptation algorithms for improved adaptation performance
- T. Shinozaki, Y. Kubota, and S. Furui, "Unsupervised cross-validation adaptation algorithms for improved adaptation performance," Proc. ICASSP '09, pp.4377-4380, 2009.
- (2009) Proc. ICASSP '09 , pp. 4377-4380
- Shinozaki, T.¹ Kubota, Y.² Furui, S.³

83
- 38149075469
- Structural maximum a posteriori linear regression for fast HMM adaptation
- O. Siohan, T.-A. Myrvoll, and C.-H. Lee, "Structural maximum a posteriori linear regression for fast HMM adaptation," Workshop ISCA ITRW ASR2000, 2000.
- (2000) Workshop ISCA ITRW ASR2000
- Siohan, O.¹ Myrvoll, T.-A.² Lee, C.-H.³

84
- 0023365939
- Dynamic speaker adaptation for feature-based isolated word recognition
- R.M. Stern and M.J. Lasry, "Dynamic speaker adaptation for feature-based isolated word recognition," IEEE Trans. Audio Speech Process., vol.35, no.6, pp.751-763, 1987.
- (1987) IEEE Trans. Audio Speech Process. , vol.35 , Issue.6 , pp. 751-763
- Stern, R.M.¹ Lasry, M.J.²

85
- 0001583797
- Non-linear compensation for stochastic matching
- A.C. Surendran, M. Rahim, and C.-H. Lee, "Non-linear compensation for stochastic matching," IEEE Trans. Audio Speech Process., vol.6, no.7, pp.643-655, 1999.
- (1999) IEEE Trans. Audio Speech Process. , vol.6 , Issue.7 , pp. 643-655
- Surendran, A.C.¹ Rahim, M.² Lee, C.-H.³

86
- 0034818518
- Transformation-based Bayesian prediction for adaptation of HMMs
- A.C. Surendran and C.-H. Lee, "Transformation-based Bayesian prediction for adaptation of HMMs," Speech Commun., vol.34, pp.159-174, 2001.
- (2001) Speech Commun. , vol.34 , pp. 159-174
- Surendran, A.C.¹ Lee, C.-H.²

87
- 64949158419
- Rapid speaker adaptation using clustered maximum-likelihood linear basis with sparse training data
- Y. Tang and R. Rose, "Rapid speaker adaptation using clustered maximum-likelihood linear basis with sparse training data," IEEE Trans. Audio, Speech, and Language Processing, vol.16, no.3, pp.607-616, 2008.
- (2008) IEEE Trans. Audio Speech and Language Processing , vol.16 , Issue.3 , pp. 607-616
- Tang, Y.¹ Rose, R.²

88
- 84867222885
- Improvement of Eigenvoice-based speaker adaptation by parameter space clustering
- S. Tanji, K. Shinoda, S. Furui, and A. Ortega, "Improvement of Eigenvoice-based speaker adaptation by parameter space clustering," Proc. Interspeech '08, pp.1229-1232, 2008.
- (2008) Proc. Interspeech '08 , pp. 1229-1232
- Tanji, S.¹ Shinoda, K.² Furui, S.³ Ortega, A.⁴

89
- 18744406714
- Discriminative linear transforms for feature normalization and speaker adaptation in HMM estimation
- S. Tsakalidis, V. Doumpiotis, and W. Byrne, "Discriminative linear transforms for feature normalization and speaker adaptation in HMM estimation," IEEE Trans. Audio, Speech, and Language Processing, vol.13, no.3, pp.367-376, 2005.
- (2005) IEEE Trans. Audio Speech and Language Processing , vol.13 , Issue.3 , pp. 367-376
- Tsakalidis, S.¹ Doumpiotis, V.² Byrne, W.³

90
- 18744411268
- Segmental eigenvoice with delicate eigenspace for improved speaker adaptation
- Y. Tsao, S.-M. Lee, and L.-S. Lee, "Segmental eigenvoice with delicate eigenspace for improved speaker adaptation," IEEE Trans. Audio, Speech, and Language Processing, vol.13, no.3, pp.399-411, 2005.
- (2005) IEEE Trans. Audio Speech and Language Processing , vol.13 , Issue.3 , pp. 399-411
- Tsao, Y.¹ Lee, S.-M.² Lee, L.-S.³

91
- 0028997002
- Speaker adaptation based on transfer vector field smoothing using maximum a posteriori probability estimation
- Detroit
- M. Tonomura, T. Kosaka, and S. Matsunaga, "Speaker adaptation based on transfer vector field smoothing using maximum a posteriori probability estimation," ICASSP-95, vol.1, pp.688-691, Detroit, 1995.
- (1995) ICASSP-95 , vol.1 , pp. 688-691
- Tonomura, M.¹ Kosaka, T.² Matsunaga, S.³

92
- 40149091397
- MPE-based discriminative linear transforms for speaker adaptation
- L. Wang and P.C. Woodland, "MPE-based discriminative linear transforms for speaker adaptation," Comput. Speech Lang., vol.22, pp.256-272, 2008.
- (2008) Comput. Speech Lang. , vol.22 , pp. 256-272
- Wang, L.¹ Woodland, P.C.²

93
- 51449104599
- A unified interpretation of adaptation techniques based on a macroscopic time evolution system with indirect/direct approaches
- S. Watanabe and A. Nakamura, "A unified interpretation of adaptation techniques based on a macroscopic time evolution system with indirect/direct approaches," Proc. ICASSP '08, pp.4285-4288, 2008.
- (2008) Proc. ICASSP '08 , pp. 4285-4288
- Watanabe, S.¹ Nakamura, A.²

94
- 0346528936
- Speaker adaptation for continuous density HMMs: A review
- Sophia-Antipolis
- P.C. Woodland, "Speaker adaptation for continuous density HMMs: A review," ISCA ITR-Workshop, pp.11-19, Sophia-Antipolis, 2001.
- (2001) ISCA ITR-Workshop , pp. 11-19
- Woodland, P.C.¹

95
- 58349123022
- A study of minimum classification error (MCE) linear regression for supervised adaptation of MCE-trained continuous-density hidden Markov models
- J. Wu and Q. Huo, "A study of minimum classification error (MCE) linear regression for supervised adaptation of MCE-trained continuous-density hidden Markov models," IEEE Trans. Audio, Speech, and Language Processing, vol.15, no.2, pp.478-488, 2007.
- (2007) IEEE Trans. Audio Speech and Language Processing , vol.15 , Issue.2 , pp. 478-488
- Wu, J.¹ Huo, Q.²

96
- 0034848875
- Unsupervised speaker adaptation based on the sufficient HMM statistics of selected speakers
- S. Yoshizawa, A. Baba, K. Matsunami, Y. Mera, M. Yamada, and K. Shikano, "Unsupervised speaker adaptation based on the sufficient HMM statistics of selected speakers," Proc. ICASSP2001, 2001.
- (2001) Proc. ICASSP2001
- Yoshizawa, S.¹ Baba, A.² Matsunami, K.³ Mera, Y.⁴ Yamada, M.⁵ Shikano, K.⁶

97
- 65249094352
- Unsupervised adaptation with discriminative mapping transforms
- K. Yu, M. Gales, and P.C. Woodland, "Unsupervised adaptation with discriminative mapping transforms," IEEE Trans. Audio, Speech, and Language Processing, vol.17, no.4, pp.714-723, 2009.
- (2009) IEEE Trans. Audio Speech and Language Processing , vol.17 , Issue.4 , pp. 714-723
- Yu, K.¹ Gales, M.² Woodland, P.C.³

98
- 45549093229
- Bayesian adaptive inference and adaptive training
- K. Yu and M.J.F. Gales, "Bayesian adaptive inference and adaptive training," IEEE Trans. Audio, Speech, and Language Processing, vol.15, no.6, pp.1932-1943, 2007.
- (2007) IEEE Trans. Audio Speech and Language Processing , vol.15 , Issue.6 , pp. 1932-1943
- Yu, K.¹ Gales, M.J.F.²

99
- 34047260093
- Discriminative cluster adaptive training
- K. Yu and M. Gales, "Discriminative cluster adaptive training," IEEE Trans. Audio, Speech, and Language Processing, vol.14, no.5, pp.1694-1703, 2006.
- (2006) IEEE Trans. Audio Speech and Language Processing , vol.14 , Issue.5 , pp. 1694-1703
- Yu, K.¹ Gales, M.²

100
- 0030705337
- Speaker normalization based on frequency warping
- P. Zhan and M. Westphal, "Speaker normalization based on frequency warping," Proc. ICASSP97, pp.1039-1042, 1997.
- (1997) Proc. ICASSP97 , pp. 1039-1042
- Zhan, P.¹ Westphal, M.²

101
- 0028996999
- Batch, incremental and instantaneous adaptation techniques for speech recognition
- Detroit, May
- G. Zavaliagkos, R. Schwartz, and J. Makhoul, "Batch, incremental and instantaneous adaptation techniques for speech recognition," Proc. ICASSP-95, pp.676-679, Detroit, May 1995.
- (1995) Proc. ICASSP-95 , pp. 676-679
- Zavaliagkos, G.¹ Schwartz, R.² Makhoul, J.³

102
- 0029745232
- Maximum a posteriori adaptation for large-scale HMM recognizers
- Atlanta, May
- G. Zavaliagkos, "Maximum a posteriori adaptation for large-scale HMM recognizers," Proc. ICASSP-96, pp.725-728, Atlanta, May 1996.
- (1996) Proc. ICASSP-96 , pp. 725-728
- Zavaliagkos, G.¹

103
- 0347899508
- Piecewise-linear transformation-based HMM adaptation for noisy speech
- Z. Zhang and S. Furui, "Piecewise-linear transformation-based HMM adaptation for noisy speech," Speech Commun., vol.42, pp.43-58, 2004.
- (2004) Speech Commun. , vol.42 , pp. 43-58
- Zhang, Z.¹ Furui, S.²

104
- 85009084294
- A novel algorithm for rapid speaker adaptation based on structural maximum likelihood eigenspace mapping
- Aalborg
- B. Zhou and J.H.L. Hansen, "A novel algorithm for rapid speaker adaptation based on structural maximum likelihood eigenspace mapping," Proc. Eurospeech-2001, pp.1215-1218, Aalborg, 2001.
- (2001) Proc. Eurospeech-2001 , pp. 1215-1218
- Zhou, B.¹ Hansen, J.H.L.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.