Volume 18, Issue 2, 2010, Pages 395-406

Predictor-Corrector Adaptation by Using Time Evolution System With Macroscopic Time Scale

Author keywords

Acoustic model; incremental adaptation; macroscopic time evolution; predictor corrector algorithm; speech recognition

EID: 85008538758     PISSN: 15587916     EISSN: 15587924     Source Type: Journal    
DOI: 10.1109/TASL.2009.2029717     Document Type: Article
Times cited: 4

References (36)
  • 1
    • 0000159105
    • On adaptive decision rules and decision parameter adaptation for automatic speech recognition
    • Aug.
    • C.-H. Lee and Q. Huo, “On adaptive decision rules and decision parameter adaptation for automatic speech recognition,” Proc. IEEE, vol. 88, no. 8, pp. 1241–1269, Aug. 2000.
    • (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1241-1269
    • Lee, C.-H.1    Huo, Q.2
  • 2
    • 11744314947
    • Speaker adaptation based on transfer vector field smoothing method with continuous mixture density HMMs
    • K. Ohkura, M. Sugiyama, and S. Sagayama, “Speaker adaptation based on transfer vector field smoothing method with continuous mixture density HMMs,” IEICE(D-II), vol. J76-D-II, pp. 2469–2476, 1993.
    • (1993) IEICE(D-II) , vol.J76-D-II , pp. 2469-2476
    • Ohkura, K.1    Sugiyama, M.2    Sagayama, S.3
  • 3
    • 0002488301
    • Speaker adaptation with autonomous control using tree structure
    • K. Shinoda and T. Watanabe, “Speaker adaptation with autonomous control using tree structure,” in Proc. Eurospeech'95, 1995, pp. 1143–1146.
    • (1995) Proc. Eurospeech'95 , pp. 1143-1146
    • Shinoda, K.1    Watanabe, T.2
  • 4
    • 0029769867
    • Signal bias removal by maximum likelihood estimation for robust telephone speech recognition
    • Jan.
    • M. Rahim and B.-H. Juang, “Signal bias removal by maximum likelihood estimation for robust telephone speech recognition,” IEEE Trans. Speech Audio Process., vol. 4, no. 1, pp. 19–30, Jan. 1996.
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.1 , pp. 19-30
    • Rahim, M.1    Juang, B.-H.2
  • 5
    • 0029288633
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C. J. Leggetter and P. C. Woodland, “Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models,” Comput. Speech Lang., vol. 9, pp. 171–185, 1995.
    • (1995) Comput. Speech Lang. , vol.9 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 6
    • 0029375590
    • Speaker adaptation using constrained estimation of Gaussian mixtures
    • Sep.
    • V. Digalakis, D. Rtischev, and L. Neumeyer, “Speaker adaptation using constrained estimation of Gaussian mixtures,” IEEE Trans. Speech Audio Process., vol. 3, no. 5, pp. 357–366, Sep. 1995.
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.5 , pp. 357-366
    • Digalakis, V.1    Rtischev, D.2    Neumeyer, L.3
  • 7
    • 85135272864
    • Maximum a posteriori linear regression for hidden Markov model adaptation
    • C. Chesta, O. Siohan, and C.-H. Lee, “Maximum a posteriori linear regression for hidden Markov model adaptation,” in Proc. Eurospeech'99, 1999, vol. 1, pp. 211–214.
    • (1999) Proc. Eurospeech'99 , vol.1 , pp. 211-214
    • Chesta, C.1    Siohan, O.2    Lee, C.-H.3
  • 8
    • 84874875877
    • Maximum a posterior linear regression with elliptically symmetric matrix variate priors
    • W. Chou, “Maximum a posterior linear regression with elliptically symmetric matrix variate priors,” in Proc. Eurospeech'99, 1999, vol. 1, pp. 1–4.
    • (1999) Proc. Eurospeech'99 , vol.1 , pp. 1-4
    • Chou, W.1
  • 9
    • 0035279111
    • A structural Bayes approach to speaker adaptation
    • Mar.
    • K. Shinoda and C.-H. Lee, “A structural Bayes approach to speaker adaptation,” IEEE Trans. Speech Audio Process., vol. 9, no. 3, pp. 276–287, Mar. 2001.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.3 , pp. 276-287
    • Shinoda, K.1    Lee, C.-H.2
  • 10
    • 0009623939
    • Flexible speaker adaptation using maximum likelihood linear regression
    • C. J. Leggetter and P. C. Woodland, “Flexible speaker adaptation using maximum likelihood linear regression,” in Proc. ARPA Spoken Lang. Technol. Workshop, 1995, pp. 104–109.
    • (1995) Proc. ARPA Spoken Lang. Technol. Workshop , pp. 104-109
    • Leggetter, C.J.1    Woodland, P.C.2
  • 11
    • 0032655793
    • Online adaptation of hidden Markov models using incremental estimation algorithms
    • May
    • V. Digalakis, “Online adaptation of hidden Markov models using incremental estimation algorithms,” IEEE Trans. Speech Audio Process., vol. 7, no. 3, pp. 253–261, May 1999.
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.3 , pp. 253-261
    • Digalakis, V.1
  • 12
    • 0001365754
    • Online hierarchical transformation of hidden Markov models for speech recognition
    • Nov.
    • J.-T. Chien, “Online hierarchical transformation of hidden Markov models for speech recognition,” IEEE Trans. Speech Audio Process., vol. 7, no. 6, pp. 656–667, Nov. 1999.
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.6 , pp. 656-667
    • Chien, J.-T.1
  • 13
    • 0036649879
    • Quasi-Bayes linear regression for sequential learning of hidden Markov models
    • Jul.
    • J. T. Chien, “Quasi-Bayes linear regression for sequential learning of hidden Markov models,” IEEE Trans. Speech Audio Process., vol. 10, no. 5, pp. 268–278, Jul. 2002.
    • (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.5 , pp. 268-278
    • Chien, J.T.1
  • 14
    • 45549093229
    • Bayesian adaptive inference and adaptive training
    • Aug.
    • K. Yu and M. J. F. Gales, “Bayesian adaptive inference and adaptive training,” IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 6, pp. 1932–1943, Aug. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.6 , pp. 1932-1943
    • Yu, K.1    Gales, M.J.F.2
  • 15
    • 0026142334
    • A study on speaker adaptation of the parameters of continuous density hidden Markov models
    • Apr.
    • C.-H. Lee, C. H. Lin, and B.-H. Juang, “A study on speaker adaptation of the parameters of continuous density hidden Markov models,” IEEE Trans. Signal Process., vol. 39, no. 4, pp. 806–814, Apr. 1991.
    • (1991) IEEE Trans. Signal Process. , vol.39 , Issue.4 , pp. 806-814
    • Lee, C.-H.1    Lin, C.H.2    Juang, B.-H.3
  • 16
    • 0028419019
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • Apr.
    • J.-L. Gauvain and C.-H. Lee, “Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains,” IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291–298, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 291-298
    • Gauvain, J.-L.1    Lee, C.-H.2
  • 17
    • 0040262048
    • A study of on-line Bayesian adaptation for HMM-based speech recognition
    • T. Matsuoka and C.-H. Lee, “A study of on-line Bayesian adaptation for HMM-based speech recognition,” in Proc. Eurospeech'93, 1993, pp. 815–818.
    • (1993) Proc. Eurospeech'93 , pp. 815-818
    • Matsuoka, T.1    Lee, C.-H.2
  • 18
    • 0028996999
    • Batch, incremental and instantaneous adaptation techniques for speech recognition
    • G. Zavaliagkos, R. Schwartz, and J. Makhoul, “Batch, incremental and instantaneous adaptation techniques for speech recognition,” in Proc. ICASSP'95, 1995, vol. 1, pp. 676–679.
    • (1995) Proc. ICASSP'95 , vol.1 , pp. 676-679
    • Zavaliagkos, G.1    Schwartz, R.2    Makhoul, J.3
  • 19
    • 0031103160
    • On-line adaptive learning of the continuous density hidden Markov model based on approximate recursive Bayes estimate
    • Sep.
    • Q. Huo and C.-H. Lee, “On-line adaptive learning of the continuous density hidden Markov model based on approximate recursive Bayes estimate,” IEEE Trans. Speech Audio Process., vol. 5, no. 5, pp. 161–172, Sep. 1997.
    • (1997) IEEE Trans. Speech Audio Process. , vol.5 , Issue.5 , pp. 161-172
    • Huo, Q.1    Lee, C.-H.2
  • 20
    • 0032122203
    • On-line adaptive learning of the correlated continuous density hidden Markov models for speech recognition
    • Jul.
    • Q. Huo and C.-H. Lee, “On-line adaptive learning of the correlated continuous density hidden Markov models for speech recognition,” IEEE Trans. Speech Audio Process., vol. 6, no. 4, pp. 386–397, Jul. 1998.
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.4 , pp. 386-397
    • Huo, Q.1    Lee, C.-H.2
  • 21
    • 34547552788
    • Incremental adaptation based on a macroscopic time evolution system
    • S. Watanabe and A. Nakamura, “Incremental adaptation based on a macroscopic time evolution system,” in Proc. ICASSP'07, 2007, vol. 4, pp. 769–772.
    • (2007) Proc. ICASSP'07 , vol.4 , pp. 769-772
    • Watanabe, S.1    Nakamura, A.2
  • 23
    • 51449104599
    • A unified interpretation of adaptation techniques based on a macroscopic time evolution system with indirect/direct approaches
    • S. Watanabe and A. Nakamura, “A unified interpretation of adaptation techniques based on a macroscopic time evolution system with indirect/direct approaches,” in Proc. ICASSP'08, 2008, pp. 4285–4288.
    • (2008) Proc. ICASSP'08 , pp. 4285-4288
    • Watanabe, S.1    Nakamura, A.2
  • 24
    • 0035341099
    • Online adaptive learning of continuous-density hidden Markov models based on multiple-stream prior evolution and posterior pooling
    • May
    • Q. Huo and B. Ma, “Online adaptive learning of continuous-density hidden Markov models based on multiple-stream prior evolution and posterior pooling,” IEEE Trans. Speech Audio Process., vol. 9, no. 4, pp. 388–398, May 2001.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.4 , pp. 388-398
    • Huo, Q.1    Ma, B.2
  • 25
    • 0031118076
    • Vector-field-smoothed Bayesian learning for fast and incremental speaker/telephone-channel adaptation
    • J. Takahashi and S. Sagayama, “Vector-field-smoothed Bayesian learning for fast and incremental speaker/telephone-channel adaptation,” Computer Speech and Language, vol. 11, pp. 127–146, 1997.
    • (1997) Computer Speech and Language , vol.11 , pp. 127-146
    • Takahashi, J.1    Sagayama, S.2
  • 26
    • 0030189744
    • Speaker adaptation using combined transformation and Bayesian methods
    • Jul.
    • V. Digalakis and L. Neumeyer, “Speaker adaptation using combined transformation and Bayesian methods,” IEEE Trans. Speech Audio Process., vol. 4, no. 4, pp. 294–300, Jul. 1996.
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.4 , pp. 294-300
    • Digalakis, V.1    Neumeyer, L.2
  • 27
    • 0035341086
    • Joint maximum a posteriori adaptation of transformation and HMM parameters
    • May
    • O. Siohan, C. Chesta, and C.-H. Lee, “Joint maximum a posteriori adaptation of transformation and HMM parameters,” IEEE Trans. Speech Audio Process., vol. 9, no. 4, pp. 417–428, May 2001.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.4 , pp. 417-428
    • Siohan, O.1    Chesta, C.2    Lee, C.-H.3
  • 28
    • 0002629270
    • Maximum likelihood from incomplete data via the EM algorithm
    • A. P. Dempster, N. M. Laird, and D. B. Rubin, “Maximum likelihood from incomplete data via the EM algorithm,” J. R. Statist. Soc. B, vol. 39, pp. 1–38, 1977.
    • (1977) J. R. Statist. Soc. B , vol.39 , pp. 1-38
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 31
    • 0036293703
    • A recognition method with parametric trajectory synthesized using direct relations between static and dynamic feature vector time series
    • Y. Minami, E. McDermott, A. Nakamura, and S. Katagiri, “A recognition method with parametric trajectory synthesized using direct relations between static and dynamic feature vector time series,” in Proc. ICASSP'02, 2002, vol. 1, pp. 957–960.
    • (2002) Proc. ICASSP'02 , vol.1 , pp. 957-960
    • Minami, Y.1    McDermott, E.2    Nakamura, A.3    Katagiri, S.4
  • 33
    • 33745207361
    • A Japanese national project on spontaneous speech corpus and processing technology
    • S. Furui, K. Maekawa, and M. H. Isahara, “A Japanese national project on spontaneous speech corpus and processing technology,” in Proc. ASR2000, 2000, pp. 244–248.
    • (2000) Proc. ASR2000 , pp. 244-248
    • Furui, S.1    Maekawa, K.2    Isahara, M.H.3
  • 34
    • 33645758265
    • NTT Speech recognizer with outlook on the next generation: SOLON
    • T. Hori, “NTT Speech recognizer with outlook on the next generation: SOLON,” in Proc. NTT Workshop Commun. Scene Anal., 2004, vol. 1, SP-6.
    • (2004) Proc. NTT Workshop Commun. Scene Anal. , vol.1 , pp. SP-6
    • Hori, T.1
  • 35
    • 70349213985
    • On-line adaptation and Bayesian detection of environmental changes based on a macroscopic time evolution system
    • S. Watanabe and A. Nakamura, “On-line adaptation and Bayesian detection of environmental changes based on a macroscopic time evolution system,” in Proc. ICASSP'09, 2009, pp. 4373–4376.
    • (2009) Proc. ICASSP'09 , pp. 4373-4376
    • Watanabe, S.1    Nakamura, A.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.