메뉴 건너뛰기




Volumn 14, Issue 3, 2006, Pages 855-872

Automatic determination of acoustic model topology using variational bayesian estimation and clustering for large vocabulary continuous speech recognition

Author keywords

Determination of acoustic model topologies; Speech recognition; Variational bayes; Variational bayesian estimation and clustering (VBEC)

Indexed keywords

ACOUSTIC SIGNAL PROCESSING; GAUSSIAN DISTRIBUTION; LEARNING SYSTEMS; MARKOV PROCESSES; PARAMETER ESTIMATION; SPEECH RECOGNITION;

EID: 33646418145     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2005.857791     Document Type: Article
Times cited : (14)

References (23)
  • 1
    • 0003805597 scopus 로고
    • The Use of Context in Large Vocabulary Speech Recognition,
    • Ph.D. dissertation, Cambridge Univ, Cambridge, U.K
    • J. Odell, "The Use of Context in Large Vocabulary Speech Recognition," Ph.D. dissertation, Cambridge Univ., Cambridge, U.K., 1995.
    • (1995)
    • Odell, J.1
  • 2
    • 0030715097 scopus 로고    scopus 로고
    • HMM topology design using maximum likelihood successive state splitting
    • M. Ostendorf and H. Singer, "HMM topology design using maximum likelihood successive state splitting," Comput. Speech Lang., vol. 11, pp. 17-41, 1997.
    • (1997) Comput. Speech Lang , vol.11 , pp. 17-41
    • Ostendorf, M.1    Singer, H.2
  • 4
    • 85135145174 scopus 로고    scopus 로고
    • Acoustic modeling based on the MDL criterion for speech recognition
    • K. Shinoda and T. Watanabe, "Acoustic modeling based on the MDL criterion for speech recognition," in Proc. Eurospeech 1997, vol. 1, 1997, pp. 99-102.
    • (1997) Proc. Eurospeech 1997 , vol.1 , pp. 99-102
    • Shinoda, K.1    Watanabe, T.2
  • 6
    • 0033906251 scopus 로고    scopus 로고
    • MDL-based context-dependent subword modeling for speech recognition
    • K. Shinoda and T. Watanabe, "MDL-based context-dependent subword modeling for speech recognition," J. Acoust. Soc. Jpn. E, vol. 21, pp. 79-86, 2000.
    • (2000) J. Acoust. Soc. Jpn. E , vol.21 , pp. 79-86
    • Shinoda, K.1    Watanabe, T.2
  • 7
    • 0009685440 scopus 로고    scopus 로고
    • Model selection in acoustic modeling
    • S. Chen and R. Gopinath, "Model selection in acoustic modeling," in 1 Proc. Eurospeech 1999, vol. 3, 1999, pp. 1087-1090.
    • (1999) 1 Proc. Eurospeech 1999 , vol.3 , pp. 1087-1090
    • Chen, S.1    Gopinath, R.2
  • 12
    • 0003278032 scopus 로고    scopus 로고
    • Inferring parameters and structure of latent variable models by variational Bayes
    • H. Attias, "Inferring parameters and structure of latent variable models by variational Bayes," in Proc. Uncertainty in Artificial Intelligence (UAI 15), 1999.
    • (1999) Proc. Uncertainty in Artificial Intelligence (UAI 15)
    • Attias, H.1
  • 13
    • 0036887504 scopus 로고    scopus 로고
    • Bayesian model search for mixture models based on optimizing variational bounds
    • N. Ueda and Z. Ghahramani, "Bayesian model search for mixture models based on optimizing variational bounds," Neural Networks, vol. 15, pp. 1223-1241, 2002.
    • (2002) Neural Networks , vol.15 , pp. 1223-1241
    • Ueda, N.1    Ghahramani, Z.2
  • 14
    • 85009237883 scopus 로고    scopus 로고
    • Speech modeling using variational Bayesian mixture of Gaussians
    • P. Somervuo, "Speech modeling using variational Bayesian mixture of Gaussians," in Proc. Int. Conf. Spoken Language Processing (ICSLP 2002), vol. 2, 2002, pp. 1245-1248.
    • (2002) Proc. Int. Conf. Spoken Language Processing (ICSLP , vol.2 , pp. 1245-1248
    • Somervuo, P.1
  • 16
    • 4544253566 scopus 로고    scopus 로고
    • T. Jitsuhiro and S. Nakamura, Automatic generation of nonuniform HMM structures based on variational Bayesian approach, in Proc. IEEE Int. Conf. Acoustics, Speech, & Signal Processing (ICASSP 2004), 1, 2004, pp. 805-808.
    • T. Jitsuhiro and S. Nakamura, "Automatic generation of nonuniform HMM structures based on variational Bayesian approach," in Proc. IEEE Int. Conf. Acoustics, Speech, & Signal Processing (ICASSP 2004), vol. 1, 2004, pp. 805-808.
  • 18
    • 0028419019 scopus 로고
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • Apr
    • J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Processing, vol. 2, pp. 291-298, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 291-298
    • Gauvain, J.-L.1    Lee, C.-H.2
  • 19
    • 0035279111 scopus 로고    scopus 로고
    • A structural Bayes approach to speaker adaptation
    • Mar
    • K. Shinoda and C.-H. Lee, "A structural Bayes approach to speaker adaptation," IEEE Trans. Speech Audio Processing, vol. 9, pp. 276-287, Mar. 2001.
    • (2001) IEEE Trans. Speech Audio Processing , vol.9 , pp. 276-287
    • Shinoda, K.1    Lee, C.-H.2
  • 21
    • 33645758265 scopus 로고    scopus 로고
    • NTT Speech recognizer with outlook on the next generation: SOLON
    • T. Hori, "NTT Speech recognizer with outlook on the next generation: SOLON," in Proc. NTT Workshop on Communication Scene Analysis, vol. 1, 2004.
    • (2004) Proc. NTT Workshop on Communication Scene Analysis , vol.1
    • Hori, T.1
  • 22
    • 34047255331 scopus 로고    scopus 로고
    • Japanese Dictation Toolkit, Free Software Repository for Automatic Speech Recognition
    • K. Shikano et al., Japanese Dictation Toolkit - Free Software Repository for Automatic Speech Recognition, 1999.
    • (1999)
    • Shikano, K.1
  • 23
    • 34047271693 scopus 로고    scopus 로고
    • Robustness of acoustic model topology determined by variational Bayesian estimation and clustering for speech recognition for different speech data sets
    • S. Watanabe and A. Nakamura, "Robustness of acoustic model topology determined by variational Bayesian estimation and clustering for speech recognition for different speech data sets," in Proc. lEICE Int. Workshop of Beyond HMM, SP2004-90, 2004, pp. 55-60.
    • (2004) Proc. lEICE Int. Workshop of Beyond HMM, SP2004-90 , pp. 55-60
    • Watanabe, S.1    Nakamura, A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.