메뉴 건너뛰기




Volumn E93-D, Issue 9, 2010, Pages 2348-2362

Acoustic model adaptation for speech recognition

Author keywords

Acoustic model adaptation; Hidden Markov models; Speech recognition

Indexed keywords

DEEP NEURAL NETWORKS; HIDDEN MARKOV MODELS; MARKOV PROCESSES; MAXIMUM LIKELIHOOD; MAXIMUM LIKELIHOOD ESTIMATION;

EID: 77956865237     PISSN: 09168532     EISSN: 17451361     Source Type: Journal    
DOI: 10.1587/transinf.E93.D.2348     Document Type: Article
Times cited : (15)

References (104)
  • 2
    • 0016067897 scopus 로고
    • Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
    • B. Atal, "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," J. Acoust. Soc. Am., vol.55, pp.1304-1312, 1974.
    • (1974) J. Acoust. Soc. Am. , vol.55 , pp. 1304-1312
    • Atal, B.1
  • 3
    • 0346008165 scopus 로고    scopus 로고
    • Statistical language model adaptation: Review and perspectives
    • J.R. Bellegarda, "Statistical language model adaptation: Review and perspectives," Speech Commun., vol.42, no.1, pp.93-108, 2004.
    • (2004) Speech Commun. , vol.42 , Issue.1 , pp. 93-108
    • Bellegarda, J.R.1
  • 4
    • 0347899510 scopus 로고    scopus 로고
    • α-Jacobian environmental adaptation
    • C. Cerisara, L. Rigazio, and J.C. Janqua, "α-Jacobian environmental adaptation," Speech Commun., vol.42, no.1, pp.25-41, 2004.
    • (2004) Speech Commun. , vol.42 , Issue.1 , pp. 25-41
    • Cerisara, C.1    Rigazio, L.2    Janqua, J.C.3
  • 5
    • 85009097035 scopus 로고    scopus 로고
    • Fast speaker adaptation using Eigenspace-based maximum likelihood linear regression
    • K. Chen, W. Liau, H. Wang, and L. Lee, "Fast speaker adaptation using Eigenspace-based maximum likelihood linear regression," Proc. ICSLP-2000, 2000.
    • (2000) Proc. ICSLP-2000
    • Chen, K.1    Liau, W.2    Wang, H.3    Lee, L.4
  • 6
    • 85135272864 scopus 로고    scopus 로고
    • Maximum a posteriori linear regression for hidden Markov model adaptation
    • C. Chesta, O. Siohan, and C.-H. Lee, "Maximum a posteriori linear regression for hidden Markov model adaptation," Proc. EuroSpeech99, pp.211-214, 1999.
    • (1999) Proc. EuroSpeech99 , pp. 211-214
    • Chesta, C.1    Siohan, O.2    Lee, C.-H.3
  • 7
    • 0030643678 scopus 로고    scopus 로고
    • Improved Bayesian learning of hidden Markov models for speaker adaptation
    • J.-T. Chien, C.-H. Lee, and H.-C. Wang, "Improved Bayesian learning of hidden Markov models for speaker adaptation," Proc. ICASSP-97, pp.1027-1039, 1997.
    • (1997) Proc. ICASSP-97 , pp. 1027-1039
    • Chien, J.-T.1    Lee, C.-H.2    Wang, H.-C.3
  • 8
    • 0036649879 scopus 로고    scopus 로고
    • Quasi-Bayes linear regression for sequential learning of hidden Markov models
    • J.-T. Chien, "Quasi-Bayes linear regression for sequential learning of hidden Markov models," IEEE Trans. Speech Audio Process., vol.10, no.4, pp.268-278, 2002.
    • (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.4 , pp. 268-278
    • Chien, J.-T.1
  • 9
    • 84874875877 scopus 로고    scopus 로고
    • Maximum a posteriori linear regression with ellipticallysymmetric matrix variance priors
    • W. Chou, "Maximum a posteriori linear regression with ellipticallysymmetric matrix variance priors," Proc. Eurospeech-99, vol.1, pp.1-4, 1999.
    • (1999) Proc. Eurospeech-99 , vol.1 , pp. 1-4
    • Chou, W.1
  • 10
    • 0029209204 scopus 로고
    • Predictive speaker adaptation in speech recognition
    • S. Cox, "Predictive speaker adaptation in speech recognition," Comput. Speech Lang., vol.9, pp.1-17, 1995.
    • (1995) Comput. Speech Lang. , vol.9 , pp. 1-17
    • Cox, S.1
  • 11
    • 0036294872 scopus 로고    scopus 로고
    • Efficient adaptation text design based on the Kullback-Leibler measure
    • X. Cui and A. Alwan, "Efficient adaptation text design based on the Kullback-Leibler measure," Proc. ICASSP-2002, pp.I-613-616, 2002.
    • (2002) Proc. ICASSP-2002
    • Cui, X.1    Alwan, A.2
  • 12
    • 64149094039 scopus 로고    scopus 로고
    • Robust speaker adaptation by weighted model averaging based on the minimum description length criterion
    • X. Cui and A. Alwan, "Robust speaker adaptation by weighted model averaging based on the minimum description length criterion," IEEE Trans. Audio, Speech, and Language Processing, vol.15, no.2, pp.652-660, 2007.
    • (2007) IEEE Trans. Audio , vol.15 , Issue.2 , pp. 652-660
    • Cui, X.1    Alwan, A.2
  • 14
    • 0029375590 scopus 로고
    • Speaker adaptation using constrained estimation of Gaussian mixtures
    • V.V. Digalakis, D. Rtishev, and L.G. Neumeyer, "Speaker adaptation using constrained estimation of Gaussian mixtures," IEEE Trans. Speech Audio Process., vol.3, no.5, pp.357-366, 1995.
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.5 , pp. 357-366
    • Digalakis, V.V.1    Rtishev, D.2    Neumeyer, L.G.3
  • 15
    • 0030189744 scopus 로고    scopus 로고
    • Speaker adaptation using combined transformation and Bayesian methods
    • V.V. Digalakis and L.G. Neumeyer, "Speaker adaptation using combined transformation and Bayesian methods," IEEE Trans. Speech Audio Process., vol.4, no.4, pp.294-300, 1996.
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.4 , pp. 294-300
    • Digalakis, V.V.1    Neumeyer, L.G.2
  • 16
    • 51449115117 scopus 로고    scopus 로고
    • Fast speaker adaptation using non-negative matrix factorization
    • J. Duchateau, T. Leroy, K. Demuynck, and H. Van homme, "Fast speaker adaptation using non-negative matrix factorization," ICASSP '08, pp.4269-4272, 2008.
    • (2008) ICASSP '08 , pp. 4269-4272
    • Duchateau, J.1    Leroy, T.2    Demuynck, K.3    Van Homme, H.4
  • 17
    • 0029725604 scopus 로고    scopus 로고
    • A parametric approach to vocal tract length normalization
    • E. Eide and H. Gish, "A parametric approach to vocal tract length normalization," Proc. ICASSP96, vol.1, pp.346-3483, 1996.
    • (1996) Proc. ICASSP96 , vol.1 , pp. 346-3483
    • Eide, E.1    Gish, H.2
  • 18
    • 51449094035 scopus 로고    scopus 로고
    • Rapid vocal tract length normalization using maximum likelihood estimation
    • T. Emori and K. Shinoda, "Rapid vocal tract length normalization using maximum likelihood estimation," Proc. Eurospeech-2001, pp.1649-1652, 2001.
    • (2001) Proc. Eurospeech-2001 , pp. 1649-1652
    • Emori, T.1    Shinoda, K.2
  • 19
    • 0019009839 scopus 로고
    • A training procedure for isolated word recognition systems
    • S. Furui, "A training procedure for isolated word recognition systems," IEEE Trans. Acoust. Speech Signal Process., vol.ASSP-28, no.2, pp.129-136, 1980.
    • (1980) IEEE Trans. Acoust. Speech Signal Process. , vol.28 , Issue.2 , pp. 129-136
    • Furui, S.1
  • 21
    • 77949431571 scopus 로고    scopus 로고
    • Generalization problem in ASR acoustic model training and adaptation
    • Merano
    • S. Furui, "Generalization Problem in ASR Acoustic Model Training and Adaptation," IEEE ASRU Workshop, Merano, pp.1-10, 2009.
    • (2009) IEEE ASRU Workshop , pp. 1-10
    • Furui, S.1
  • 22
    • 0030263447 scopus 로고    scopus 로고
    • Mean and covariance adaptation within MLLR framework
    • M.J.F. Gales and P.C. Woodland, "Mean and covariance adaptation within MLLR framework," Comput. Speech Lang., vol.10, pp.249-264, 1996.
    • (1996) Comput. Speech Lang. , vol.10 , pp. 249-264
    • Gales, M.J.F.1    Woodland, P.C.2
  • 23
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M.J.F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Comput. Speech Lang., vol.12, pp.75-98, 1998.
    • (1998) Comput. Speech Lang. , vol.12 , pp. 75-98
    • Gales, M.J.F.1
  • 24
    • 0034227757 scopus 로고    scopus 로고
    • Cluster adaptive training for hidden Markov models
    • M.J.F. Gales, "Cluster adaptive training for hidden Markov models," IEEE Trans. Audio and Speech Processing, vol.8, no.4, pp.417-428, 2000.
    • (2000) IEEE Trans. Audio and Speech Processing , vol.8 , Issue.4 , pp. 417-428
    • Gales, M.J.F.1
  • 25
    • 0028419019 scopus 로고
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • April
    • J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol.2, no.2, pp.291-298, April 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 291-298
    • Gauvain, J.-L.1    Lee, C.-H.2
  • 26
    • 55049094528 scopus 로고    scopus 로고
    • Techniques in rapid unsupervised speaker adaptation based on HMM-sufficient statistics
    • R. Gomez, T. Toda, H. Saruwatari, and K. Shikano, "Techniques in rapid unsupervised speaker adaptation based on HMM-sufficient statistics," Speech Commun., vol.51, pp.42-57, 2004.
    • (2004) Speech Commun. , vol.51 , pp. 42-57
    • Gomez, R.1    Toda, T.2    Saruwatari, H.3    Shikano, K.4
  • 27
    • 84867218967 scopus 로고    scopus 로고
    • A fast speaker adaptation method using aspect model
    • S. Hahm, A. Ito, S. Makino, and M. Suzuki, "A fast speaker adaptation method using aspect model," Interspeech '08, pp.1221-1224, 2008.
    • (2008) Interspeech '08 , pp. 1221-1224
    • Hahm, S.1    Ito, A.2    Makino, S.3    Suzuki, M.4
  • 28
    • 0009625231 scopus 로고    scopus 로고
    • A comparison of novel techniques for rapid speaker adaptation
    • T.J. Hazen, "A comparison of novel techniques for rapid speaker adaptation," Speech Commun., vol.31, pp.15-33, 2000.
    • (2000) Speech Commun. , vol.31 , pp. 15-33
    • Hazen, T.J.1
  • 30
    • 0029377113 scopus 로고
    • Bayesian adaptive training of the parameters of hidden Markov model for speech recognition
    • Q. Huo and C. Chan, "Bayesian adaptive training of the parameters of hidden Markov model for speech recognition," IEEE Trans. Speech Audio Process., vol.3, no.5, pp.334-345, 1995.
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.5 , pp. 334-345
    • Huo, Q.1    Chan, C.2
  • 31
    • 0031103160 scopus 로고    scopus 로고
    • On-line adaptive learning of the continuous density hidden Markov model based on approximate recursive Bayes estimate
    • March
    • Q. Huo and C.-H. Lee, "On-line adaptive learning of the continuous density hidden Markov model based on approximate recursive Bayes estimate," IEEE Trans. Audio and Speech Processing, vol.5, no.2, pp.161-172, March 1997.
    • (1997) IEEE Trans. Audio and Speech Processing , vol.5 , Issue.2 , pp. 161-172
    • Huo, Q.1    Lee, C.-H.2
  • 32
    • 0032122203 scopus 로고    scopus 로고
    • On-line adaptive learning of the corre-lated continuous-density hidden Markov model for speech recognition
    • Q. Huo and C.-H. Lee, "On-line adaptive learning of the corre-lated continuous-density hidden Markov model for speech recognition," IEEE Trans. Speech Audio Process., vol.6, no.4, pp.386-397, 1998.
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.4 , pp. 386-397
    • Huo, Q.1    Lee, C.-H.2
  • 33
    • 0013208309 scopus 로고    scopus 로고
    • Modeling dependency in adaptation of acoustic models using multiscale tree processes
    • A. Kannan and M. Ostendorf, "Modeling dependency in adaptation of acoustic models using multiscale tree processes," Proc. EuroSpeech-97, pp.1863-1866, 1997.
    • (1997) Proc. EuroSpeech-97 , pp. 1863-1866
    • Kannan, A.1    Ostendorf, M.2
  • 35
    • 77956864267 scopus 로고    scopus 로고
    • Inter-speaker correlations, intra-speaker correlations and Bayesian adaptation
    • Sophia-Antipolis
    • P. Kenny, G. Boulianne, and P. Dumouchel, "Inter-speaker correlations, intra-speaker correlations and Bayesian adaptation," Proc. Isca ITR-Workshop2001, Sophia-Antipolis, 2001.
    • (2001) Proc. Isca ITR-workshop2001
    • Kenny, P.1    Boulianne, G.2    Dumouchel, P.3
  • 36
    • 22544446112 scopus 로고    scopus 로고
    • What is the best type of prior distribution for EMAP speaker adaptation
    • P. Kenny, G. Boulianne, and P. Dumouchel, "What is the best type of prior distribution for EMAP speaker adaptation," EuroSpeech-2001,pp.1207- 1210, 2001.
    • (2001) EuroSpeech-2001 , pp. 1207-1210
    • Kenny, P.1    Boulianne, G.2    Dumouchel, P.3
  • 38
    • 0347789326 scopus 로고    scopus 로고
    • Maximum a posteriori adaptation of HMM parameters based on probabilistic component analysis
    • D.K. Kim and N.S. Kim, "Maximum a posteriori adaptation of HMM parameters based on probabilistic component analysis," Proc. ISCA ITR-Workshop 2001, pp.25-28, 2001.
    • (2001) Proc. ISCA ITR-workshop 2001 , pp. 25-28
    • Kim, D.K.1    Kim, N.S.2
  • 39
    • 0346460276 scopus 로고    scopus 로고
    • Maximum a posteriori adaptation of HMM parameters based on speaker space projection
    • D.K. Kim and N.S. Kim, "Maximum a posteriori adaptation of HMM parameters based on speaker space projection," Speech Commun., vol.42, pp.59-73, 2004.
    • (2004) Speech Commun. , vol.42 , pp. 59-73
    • Kim, D.K.1    Kim, N.S.2
  • 40
    • 14644409729 scopus 로고    scopus 로고
    • Rapid online adaptation based on transformation space model evolution
    • D.K. Kim and N.S. Kim, "Rapid online adaptation based on transformation space model evolution," IEEE Trans. Speech Audio Process., vol.13, no.2, pp.194-202, 2005.
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.2 , pp. 194-202
    • Kim, D.K.1    Kim, N.S.2
  • 41
    • 0025236073 scopus 로고
    • Application of the Karhunen-Lo'eve procedure for the characterization of human faces
    • M. Kirby and L. Sirovich, "Application of the Karhunen-Lo'eve procedure for the characterization of human faces," IEEE Trans. Pattern Anal. Mach. Intell., vol.12, no.1, pp.103-108, 1990.
    • (1990) IEEE Trans. Pattern Anal. Mach. Intell. , vol.12 , Issue.1 , pp. 103-108
    • Kirby, M.1    Sirovich, L.2
  • 42
    • 85009078667 scopus 로고
    • Tree-structured speaker clustering for fast speaker adaptation
    • Adelaide
    • T. Kosaka and S. Sagayama, "Tree-structured speaker clustering for fast speaker adaptation," ICASSP-94, vol.1, pp.245-248, Adelaide, 1994.
    • (1994) ICASSP-94 , vol.1 , pp. 245-248
    • Kosaka, T.1    Sagayama, S.2
  • 45
    • 0034320005 scopus 로고    scopus 로고
    • Rapid speaker adaptation in Eigenvoice space robust speech recognition
    • R. Kuhn, J.-C. Janqua, P. Nguyen, and N. Niedzielski, "Rapid speaker adaptation in Eigenvoice space robust speech recognition," IEEE Trans. Speech Audio Process., vol.8, no.6, pp.695-707, 2000.
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.6 , pp. 695-707
    • Kuhn, R.1    Janqua, J.-C.2    Nguyen, P.3    Niedzielski, N.4
  • 46
    • 0021458298 scopus 로고
    • A posteriori estimation of correlated jointly Gaussian mean vectors
    • M.J. Lasry and R.M. Stern, "A posteriori estimation of correlated jointly Gaussian mean vectors," IEEE Trans. Pattern Anal. Mach. Intell., vol.6, no.4., pp.530-535, 1984.
    • (1984) IEEE Trans. Pattern Anal. Mach. Intell. , vol.6 , Issue.4 , pp. 530-535
    • Lasry, M.J.1    Stern, R.M.2
  • 47
    • 0026142334 scopus 로고
    • A study on speaker adaptation of the parameters of continuous density hidden Markov models
    • April
    • C.-H. Lee, C.-H. Lin, and B.-H. Juang, "A study on speaker adaptation of the parameters of continuous density hidden Markov models," IEEE Trans. Signal Process., vol.39, no.4, pp.806-814, April 1991.
    • (1991) IEEE Trans. Signal Process. , vol.39 , Issue.4 , pp. 806-814
    • Lee, C.-H.1    Lin, C.-H.2    Juang, B.-H.3
  • 48
    • 0029747183 scopus 로고    scopus 로고
    • Speaker normalization using efficient frequency warping procedures
    • L. Lee and R.C. Rose, "Speaker normalization using efficient frequency warping procedures," Proc. ICASSP96, vol.1, pp.353-356, 1996.
    • (1996) Proc. ICASSP96 , vol.1 , pp. 353-356
    • Lee, L.1    Rose, R.C.2
  • 49
    • 0000159105 scopus 로고    scopus 로고
    • On adaptive decision rules and decision parameter adaptation for automatic speech recognition
    • C.-H. Lee and Q. Huo, "On adaptive decision rules and decision parameter adaptation for automatic speech recognition," Proc. IEEE, vol.88, pp.1241-1269, 2000.
    • (2000) Proc. IEEE , vol.88 , pp. 1241-1269
    • Lee, C.-H.1    Huo, Q.2
  • 50
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous-density hidden Markov models
    • C.J. Leggetter and P.C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous-density hidden Markov models," Comput. Speech Lang., vol.9, pp.171-185, 1995.
    • (1995) Comput. Speech Lang. , vol.9 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 51
    • 62249130045 scopus 로고    scopus 로고
    • A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions
    • J. Li, L. Deng, D. Yu, Y. Gong, and A. Acero, "A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions," Comput. Speech Lang., vol.23, pp.389-405, 2009.
    • (2009) Comput. Speech Lang. , vol.23 , pp. 389-405
    • Li, J.1    Deng, L.2    Yu, D.3    Gong, Y.4    Acero, A.5
  • 52
    • 84867224386 scopus 로고    scopus 로고
    • Speaker adaptive training using Shift-MLLR
    • Brisbane
    • J. Loof, C. Gollan, and H. Ney, "Speaker adaptive training using Shift-MLLR," Proc. Interspeech2008, pp.1701-1704, Brisbane, 2008.
    • (2008) Proc. Interspeech2008 , pp. 1701-1704
    • Loof, J.1    Gollan, C.2    Ney, H.3
  • 53
    • 27644511614 scopus 로고    scopus 로고
    • Kernel eigenvoice speaker adaptation
    • B. Mak, J.T. Kwak, and S. Ho, "Kernel eigenvoice speaker adaptation," IEEE Trans. Audio, Speech, and Language Processing, vol.13, no.5, pp.984-992, 2005.
    • (2005) IEEE Trans. Audio , vol.13 , Issue.5 , pp. 984-992
    • Mak, B.1    Kwak, J.T.2    Ho, S.3
  • 54
    • 34047246852 scopus 로고    scopus 로고
    • Embedded kernel eigenvoice speaker adaptation and its implication to reference speaker weighting
    • B.K.-W. Mak, R.W.-H. Hsiao, S.K.-L. Ho, and J.T. Kwak, "Embedded kernel eigenvoice speaker adaptation and its implication to reference speaker weighting," IEEE Trans. Audio, Speech, and Language Processing, vol.14, no.4, pp.1267-1280, 2006.
    • (2006) IEEE Trans. Audio Speech and Language Processing , vol.14 , Issue.4 , pp. 1267-1280
    • Mak, B.K.-W.1    Hsiao, R.W.-H.2    Ho, S.K.-L.3    Kwak, J.T.4
  • 56
    • 53849127143 scopus 로고    scopus 로고
    • Improving robustness of MLLR adaptation with speaker-clustered regression class trees
    • A. Mandal, M. Ostendorf, and A. Stolcke, "Improving robustness of MLLR adaptation with speaker-clustered regression class trees," Comput. Speech Lang., vol.23, pp.176-199, 2009.
    • (2009) Comput. Speech Lang. , vol.23 , pp. 176-199
    • Mandal, A.1    Ostendorf, M.2    Stolcke, A.3
  • 57
    • 0031681725 scopus 로고    scopus 로고
    • N-best-based unsupervised speaker adaptation for speech recognition
    • T. Matsui and S. Furui, "N-best-based unsupervised speaker adaptation for speech recognition," Comput. Speech Lang., vol.12, pp.41-50, 1998.
    • (1998) Comput. Speech Lang. , vol.12 , pp. 41-50
    • Matsui, T.1    Furui, S.2
  • 58
    • 0347269184 scopus 로고    scopus 로고
    • Speaker adaptation with all-pass transforms
    • J. McDonough, T. Schaaf, and A. Waibel, "Speaker adaptation with all-pass transforms," Speech Commun., vol.42, pp.75-91, 2004.
    • (2004) Speech Commun. , vol.42 , pp. 75-91
    • McDonough, J.1    Schaaf, T.2    Waibel, A.3
  • 59
    • 0029725301 scopus 로고    scopus 로고
    • A vector Taylor series approach for environment-independent speech recognition
    • P.J. Moreno, B. Raj, and R.M. Stern, "A vector Taylor series approach for environment-independent speech recognition," Proc. ICASSP '96, pp.733-736, 1996.
    • (1996) Proc. ICASSP '96 , pp. 733-736
    • Moreno, P.J.1    Raj, B.2    Stern, R.M.3
  • 60
    • 70450193702 scopus 로고    scopus 로고
    • Speaker adaptation based on two-step active learning
    • H. Murakami, K. Shinoda, and S. Furui, "Speaker adaptation based on two-step active learning," Interspeech '09, pp.576-579, 2009.
    • (2009) Interspeech '09 , pp. 576-579
    • Murakami, H.1    Shinoda, K.2    Furui, S.3
  • 61
    • 85009067865 scopus 로고    scopus 로고
    • Speaker recognition by separating phonetic space and speaker space
    • N. Nishida and Y. Ariki, "Speaker recognition by separating phonetic space and speaker space," Proc. Eurospeech-2001, 2001.
    • (2001) Proc. Eurospeech-2001
    • Nishida, N.1    Ariki, Y.2
  • 62
    • 85135280100 scopus 로고    scopus 로고
    • Maximum likelihood eigenspace and MLLR for speech recognition in noisy environment
    • P. Nguyen, C. Wellekens, and J.-C. Janqua, "Maximum likelihood eigenspace and MLLR for speech recognition in noisy environment," Proc. Eurospeech-99, pp.2519-2522, 1999.
    • (1999) Proc. Eurospeech-99 , pp. 2519-2522
    • Nguyen, P.1    Wellekens, C.2    Janqua, J.-C.3
  • 64
    • 85135109228 scopus 로고    scopus 로고
    • Speaker adaptation based on transfer vector field smoothing with continuous mixture density HMMs
    • K. Ohkura, M. Sugiyama, and S. Sagayama, "Speaker adaptation based on transfer vector field smoothing with continuous mixture density HMMs," Proc. ICSLP '92, pp.369-372, 2002.
    • (2002) Proc. ICSLP '92 , pp. 369-372
    • Ohkura, K.1    Sugiyama, M.2    Sagayama, S.3
  • 65
    • 0141813799 scopus 로고    scopus 로고
    • Speaker adaptation by hierarchical eigen- voice
    • Y. Onishi and K. Iso, "Speaker adaptation by hierarchical eigen- voice," Proc. of ICASSP-2003, pp.1576-1579, 2003.
    • (2003) Proc. of ICASSP-2003 , pp. 1576-1579
    • Onishi, Y.1    Iso, K.2
  • 66
    • 0031704151 scopus 로고    scopus 로고
    • Speaker clustering and transformation for speaker adaptation in speech recognition systems
    • M. Padmanabhan, L.R. Bahl, D. Nahamoo, and M.A. Picheny, "Speaker clustering and transformation for speaker adaptation in speech recognition systems," IEEE Trans. Speech Audio Process., vol.6, no.1, pp.71-77, 1998.
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.1 , pp. 71-77
    • Padmanabhan, M.1    Bahl, L.R.2    Nahamoo, D.3    Picheny, M.A.4
  • 68
    • 7544235206 scopus 로고    scopus 로고
    • Maximum-likelihood nonlinear transformation for acoustic adaptation
    • M. Padmanabhan and S. Dharanipragada, "Maximum-likelihood nonlinear transformation for acoustic adaptation," IEEE Trans. Speech Audio Process., vol.12, no.6, pp.572-578, 2004.
    • (2004) IEEE Trans. Speech Audio Process. , vol.12 , Issue.6 , pp. 572-578
    • Padmanabhan, M.1    Dharanipragada, S.2
  • 69
    • 0022181749 scopus 로고
    • Some acoustic-phonetic correlates of speech produced in noise
    • D.B. Pisoni, R.H. Bernacki, H.C. Nusbaum, and M. Yuchtman, "Some acoustic-phonetic correlates of speech produced in noise," Proc. ICASSP, pp.1581-1584, 1985.
    • (1985) Proc. ICASSP , pp. 1581-1584
    • Pisoni, D.B.1    Bernacki, R.H.2    Nusbaum, H.C.3    Yuchtman, M.4
  • 70
    • 85009089020 scopus 로고    scopus 로고
    • Vocal tract normalization equals linear transformation in cepstral space
    • M. Pitz, S. Molau, R. Schluter, and H. Ney, "Vocal tract normalization equals linear transformation in cepstral space," Proc. Eurospeech-2001, 2001.
    • (2001) Proc. Eurospeech-2001
    • Pitz, M.1    Molau, S.2    Schluter, R.3    Ney, H.4
  • 71
    • 0030672082 scopus 로고    scopus 로고
    • Experiments in speaker normalization and adaptation for large vocabulary speech recognition
    • D. Pye and P.C. Woodland, "Experiments in speaker normalization and adaptation for large vocabulary speech recognition," Proc. ICASSP97, vol.2, pp.1047-1050, 1997.
    • (1997) Proc. ICASSP97 , vol.2 , pp. 1047-1050
    • Pye, D.1    Woodland, P.C.2
  • 72
    • 0029769867 scopus 로고    scopus 로고
    • Signal bias removal by maximum likelihood estimation for robust telephone speech recognition
    • M. Rahim and B.-H. Juang, "Signal bias removal by maximum likelihood estimation for robust telephone speech recognition," IEEE Trans. Speech Audio Process., vol.4, no.1, pp.19-30, 1996.
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.1 , pp. 19-30
    • Rahim, M.1    Juang, B.-H.2
  • 74
    • 0347159336 scopus 로고    scopus 로고
    • Analytic methods for acoustic model adaptation: A review
    • Sophia-Antiplois
    • S. Sagayama, K. Shinoda, M. Nakai, and H. Shimodaira, "Analytic methods for acoustic model adaptation: A review," ISCA ITR-Workshop, pp.67-76, Sophia-Antiplois, 2001.
    • (2001) ISCA ITR-workshop , pp. 67-76
    • Sagayama, S.1    Shinoda, K.2    Nakai, M.3    Shimodaira, H.4
  • 75
    • 0030149866 scopus 로고    scopus 로고
    • A maximum likelihood approach to stochastic matching for robust speech recognition
    • A. Sankar and C-.H. Lee, "A maximum likelihood approach to stochastic matching for robust speech recognition," IEEE Trans. Speech Audio Process., vol.4, no.3, pp.190-202, 1996.
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.3 , pp. 190-202
    • Sankar, A.1    Lee, C.H.2
  • 76
    • 0031103747 scopus 로고    scopus 로고
    • A Markov random field approach to Bayesian speaker adaptation
    • B.M. Shahshahani, "A Markov random field approach to Bayesian speaker adaptation," IEEE Trans. Speech Audio Process., vol.5, no.2, pp.183-191, 1997.
    • (1997) IEEE Trans. Speech Audio Process. , vol.5 , Issue.2 , pp. 183-191
    • Shahshahani, B.M.1
  • 77
    • 0026384360 scopus 로고
    • Speaker adaptation for demi-syllable-based continuous-density HMM
    • Toronto
    • K. Shinoda, K. Iso, and T. Watanabe, "Speaker adaptation for demi-syllable-based continuous-density HMM," Proc. ICASSP- 91, pp.857-860, Toronto, 1991.
    • (1991) Proc. ICASSP- 91 , pp. 857-860
    • Shinoda, K.1    Iso, K.2    Watanabe, T.3
  • 78
    • 0002488301 scopus 로고
    • Speaker adaptation with autonomous control using tree structure
    • K. Shinoda and T. Watanabe, "Speaker adaptation with autonomous control using tree structure," Proc. EuroSpeech-95, pp.1143-1146, 1995.
    • (1995) Proc. EuroSpeech-95 , pp. 1143-1146
    • Shinoda, K.1    Watanabe, T.2
  • 79
    • 0029747193 scopus 로고    scopus 로고
    • Speaker adaptation with autonomous model complexity control by MDL principle
    • Atlanta
    • K. Shinoda and T. Watanabe, "Speaker adaptation with autonomous model complexity control by MDL principle," Proc. ICASSP '96, pp.717-720, Atlanta, 1996.
    • (1996) Proc. ICASSP '96 , pp. 717-720
    • Shinoda, K.1    Watanabe, T.2
  • 81
    • 0035279111 scopus 로고    scopus 로고
    • A structural Bayes approach to speaker adaptation
    • K. Shinoda and C.-H. Lee, "A structural Bayes approach to speaker adaptation," IEEE Trans. Speech Audio Process., vol.9, no.3, pp.276-287, 2001.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.3 , pp. 276-287
    • Shinoda, K.1    Lee, C.-H.2
  • 82
    • 70349220091 scopus 로고    scopus 로고
    • Unsupervised cross- validation adaptation algorithms for Improved adaptation performance
    • T. Shinozaki, Y. Kubota, and S. Furui, "Unsupervised cross- validation adaptation algorithms for Improved adaptation performance," Proc. ICASSP '09, pp.4377-4380, 2009.
    • (2009) Proc. ICASSP '09 , pp. 4377-4380
    • Shinozaki, T.1    Kubota, Y.2    Furui, S.3
  • 83
    • 38149075469 scopus 로고    scopus 로고
    • Structural maximum a posteriori linear regression for fast HMM adaptation
    • O. Siohan, T.-A. Myrvoll, and C.-H. Lee, "Structural maximum a posteriori linear regression for fast HMM adaptation," Workshop ISCA ITRW ASR2000, 2000.
    • (2000) Workshop ISCA ITRW ASR2000
    • Siohan, O.1    Myrvoll, T.-A.2    Lee, C.-H.3
  • 84
    • 0023365939 scopus 로고
    • Dynamic speaker adaptation for feature-based isolated word recognition
    • R.M. Stern and M.J. Lasry, "Dynamic speaker adaptation for feature-based isolated word recognition," IEEE Trans. Audio Speech Process., vol.35, no.6, pp.751-763, 1987.
    • (1987) IEEE Trans. Audio Speech Process. , vol.35 , Issue.6 , pp. 751-763
    • Stern, R.M.1    Lasry, M.J.2
  • 86
    • 0034818518 scopus 로고    scopus 로고
    • Transformation-based Bayesian prediction for adaptation of HMMs
    • A.C. Surendran and C.-H. Lee, "Transformation-based Bayesian prediction for adaptation of HMMs," Speech Commun., vol.34, pp.159-174, 2001.
    • (2001) Speech Commun. , vol.34 , pp. 159-174
    • Surendran, A.C.1    Lee, C.-H.2
  • 87
    • 64949158419 scopus 로고    scopus 로고
    • Rapid speaker adaptation using clustered maximum-likelihood linear basis with sparse training data
    • Y. Tang and R. Rose, "Rapid speaker adaptation using clustered maximum-likelihood linear basis with sparse training data," IEEE Trans. Audio, Speech, and Language Processing, vol.16, no.3, pp.607-616, 2008.
    • (2008) IEEE Trans. Audio Speech and Language Processing , vol.16 , Issue.3 , pp. 607-616
    • Tang, Y.1    Rose, R.2
  • 88
    • 84867222885 scopus 로고    scopus 로고
    • Improvement of Eigenvoice-based speaker adaptation by parameter space clustering
    • S. Tanji, K. Shinoda, S. Furui, and A. Ortega, "Improvement of Eigenvoice-based speaker adaptation by parameter space clustering," Proc. Interspeech '08, pp.1229-1232, 2008.
    • (2008) Proc. Interspeech '08 , pp. 1229-1232
    • Tanji, S.1    Shinoda, K.2    Furui, S.3    Ortega, A.4
  • 89
    • 18744406714 scopus 로고    scopus 로고
    • Discriminative linear transforms for feature normalization and speaker adaptation in HMM estimation
    • S. Tsakalidis, V. Doumpiotis, and W. Byrne, "Discriminative linear transforms for feature normalization and speaker adaptation in HMM estimation," IEEE Trans. Audio, Speech, and Language Processing, vol.13, no.3, pp.367-376, 2005.
    • (2005) IEEE Trans. Audio Speech and Language Processing , vol.13 , Issue.3 , pp. 367-376
    • Tsakalidis, S.1    Doumpiotis, V.2    Byrne, W.3
  • 90
    • 18744411268 scopus 로고    scopus 로고
    • Segmental eigenvoice with delicate eigenspace for improved speaker adaptation
    • Y. Tsao, S.-M. Lee, and L.-S. Lee, "Segmental eigenvoice with delicate eigenspace for improved speaker adaptation," IEEE Trans. Audio, Speech, and Language Processing, vol.13, no.3, pp.399-411, 2005.
    • (2005) IEEE Trans. Audio Speech and Language Processing , vol.13 , Issue.3 , pp. 399-411
    • Tsao, Y.1    Lee, S.-M.2    Lee, L.-S.3
  • 91
    • 0028997002 scopus 로고
    • Speaker adaptation based on transfer vector field smoothing using maximum a posteriori probability estimation
    • Detroit
    • M. Tonomura, T. Kosaka, and S. Matsunaga, "Speaker adaptation based on transfer vector field smoothing using maximum a posteriori probability estimation," ICASSP-95, vol.1, pp.688-691, Detroit, 1995.
    • (1995) ICASSP-95 , vol.1 , pp. 688-691
    • Tonomura, M.1    Kosaka, T.2    Matsunaga, S.3
  • 92
    • 40149091397 scopus 로고    scopus 로고
    • MPE-based discriminative linear transforms for speaker adaptation
    • L. Wang and P.C. Woodland, "MPE-based discriminative linear transforms for speaker adaptation," Comput. Speech Lang., vol.22, pp.256-272, 2008.
    • (2008) Comput. Speech Lang. , vol.22 , pp. 256-272
    • Wang, L.1    Woodland, P.C.2
  • 93
    • 51449104599 scopus 로고    scopus 로고
    • A unified interpretation of adaptation techniques based on a macroscopic time evolution system with indirect/direct approaches
    • S. Watanabe and A. Nakamura, "A unified interpretation of adaptation techniques based on a macroscopic time evolution system with indirect/direct approaches," Proc. ICASSP '08, pp.4285-4288, 2008.
    • (2008) Proc. ICASSP '08 , pp. 4285-4288
    • Watanabe, S.1    Nakamura, A.2
  • 94
    • 0346528936 scopus 로고    scopus 로고
    • Speaker adaptation for continuous density HMMs: A review
    • Sophia-Antipolis
    • P.C. Woodland, "Speaker adaptation for continuous density HMMs: A review," ISCA ITR-Workshop, pp.11-19, Sophia-Antipolis, 2001.
    • (2001) ISCA ITR-workshop , pp. 11-19
    • Woodland, P.C.1
  • 95
    • 58349123022 scopus 로고    scopus 로고
    • A study of minimum classification error (MCE) linear regression for supervised adaptation of MCE-trained continuous-density hidden Markov models
    • J. Wu and Q. Huo, "A study of minimum classification error (MCE) linear regression for supervised adaptation of MCE-trained continuous-density hidden Markov models," IEEE Trans. Audio, Speech, and Language Processing, vol.15, no.2, pp.478-488, 2007.
    • (2007) IEEE Trans. Audio Speech and Language Processing , vol.15 , Issue.2 , pp. 478-488
    • Wu, J.1    Huo, Q.2
  • 100
    • 0030705337 scopus 로고    scopus 로고
    • Speaker normalization based on frequency warping
    • P. Zhan and M. Westohal, "Speaker normalization based on frequency warping," Proc. ICASSP97, pp.1039-1042, 1997.
    • (1997) Proc. ICASSP97 , pp. 1039-1042
    • Zhan, P.1    Westohal, M.2
  • 101
    • 0028996999 scopus 로고
    • Batch, incremental and instantaneous adaptation techniques for speech recognition
    • Detroit, May
    • G. Zavaliagkos, R. Schwartz, and J. Makhoul, "Batch, incremental and instantaneous adaptation techniques for speech recognition," Proc. ICASSP-95, pp.676-679, Detroit, May 1995.
    • (1995) Proc. ICASSP-95 , pp. 676-679
    • Zavaliagkos, G.1    Schwartz, R.2    Makhoul, J.3
  • 102
    • 0029745232 scopus 로고
    • Maximum a posteriori adaptation for large-scale HMM recognizers
    • Detroit, May
    • G. Zavaliagkos, "Maximum a posteriori adaptation for large-scale HMM recognizers," Proc. ICASSP-96, pp.725-728, Detroit, May 1995.
    • (1995) Proc. ICASSP-96 , pp. 725-728
    • Zavaliagkos, G.1
  • 103
    • 0347899508 scopus 로고    scopus 로고
    • Piecewise-linear transformation-based HMM adaptation for noisy speech
    • Z. Zhang and S. Furui, "Piecewise-linear transformation-based HMM adaptation for noisy speech," Speech Commun., vol.42, pp.43-58, 2004.
    • (2004) Speech Commun. , vol.42 , pp. 43-58
    • Zhang, Z.1    Furui, S.2
  • 104
    • 85009084294 scopus 로고    scopus 로고
    • A novel algorithm for rapid speaker adaptation based on structural maximum likelihood eigenspace mapping
    • Aalborg
    • B. Zhou and J.H.L. Hansen, "A novel algorithm for rapid speaker adaptation based on structural maximum likelihood eigenspace mapping," Proc. Eurospeech-2001, pp.1215-1218, Aalborg, 2001.
    • (2001) Proc. Eurospeech-2001 , pp. 1215-1218
    • Zhou, B.1    Hansen, J.H.L.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.