메뉴 건너뛰기




Volumn 20, Issue 8, 2012, Pages 2252-2264

Hidden markov acoustic modeling with bootstrap and restructuring for low-resourced languages

Author keywords

Bagging; bootstrap and restructuring; hidden Markov model (HMM); large vocabulary continuous speech recognition (LVCSR); low resourced language

Indexed keywords

ACOUSTIC MODEL; ACOUSTIC MODELING; AUTOMATIC SPEECH RECOGNITION; BAGGING; BOOTSTRAP AND RESTRUCTURING; CLUSTERING CRITERIA; COVARIANCE MODELS; DATA SPARSITY; DECODING SPEED; GAUSSIANS; HIDDEN MARKOV MODELS (HMMS); LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION; LAST STAGE; LOW-RESOURCED LANGUAGE; MEMORY CONSUMPTION; MODEL REFINEMENT; MODEL SIZE; PREDICTION CAPABILITY; REAL-WORLD APPLICATION; RUNTIMES; SEQUENCE PREDICTION; STATISTICAL RELIABILITY; TRAINING DATA; TRAINING PROCEDURES;

EID: 84865265602     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2012.2199982     Document Type: Article
Times cited : (14)

References (44)
  • 1
    • 84865238845 scopus 로고    scopus 로고
    • Automatic speech recognition for an under-resourced language - Amharic
    • S. T. Abate and W. Menzel, "Automatic speech recognition for an under-resourced language - Amharic," in Proc. Interspeech, 2007, pp. 1541-1544.
    • (2007) Proc. Interspeech , pp. 1541-1544
    • Abate, S.T.1    Menzel, W.2
  • 2
    • 69249083744 scopus 로고    scopus 로고
    • Using phonetic features in unsupervised word decompounding for ASR with application to a less-represented language
    • T. Pellegrini and L. Lamel, "Using phonetic features in unsupervised word decompounding for ASR with application to a less-represented language," in Proc. Interspeech, 2007, pp. 1797-1800.
    • (2007) Proc. Interspeech , pp. 1797-1800
    • Pellegrini, T.1    Lamel, L.2
  • 3
    • 69249139569 scopus 로고    scopus 로고
    • Automatic speech recognition for under-resourced languages: Application to vietnamese language
    • Nov.
    • V.-B. Le and L. Besacier, "Automatic speech recognition for under-resourced languages: Application to Vietnamese language," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 8, pp. 1471-1482, Nov. 2009.
    • (2009) IEEE Trans. Audio, Speech, Lang. Process , vol.17 , Issue.8 , pp. 1471-1482
    • Le, V.-B.1    Besacier, L.2
  • 4
    • 0035426931 scopus 로고    scopus 로고
    • Language-independent and language-adaptive acoustic modeling for speech recognition
    • DOI 10.1016/S0167-6393(00)00094-7, PII S0167639300000947
    • T. Schultz and A. Waibel, "Language-independent and language-adaptive acoustic modeling for speech recognition," Speech Commun., vol. 35, pp. 31-51, 2001. (Pubitemid 32599645)
    • (2001) Speech Communication , vol.35 , Issue.1-2 , pp. 31-51
    • Schultz, T.1    Waibel, A.2
  • 5
    • 0036722707 scopus 로고    scopus 로고
    • Cross-language use of acoustic information for automatic speech recognition
    • DOI 10.1016/S0167-6393(01)00046-2, PII S0167639301000462
    • C. Nieuwoudt and E. C. Botha, "Cross-language use of acoustic information for automatic speech recognition," Speech Commun., vol. 38, pp. 101-113, 2002. (Pubitemid 34873601)
    • (2002) Speech Communication , vol.38 , Issue.1-2 , pp. 101-113
    • Nieuwoudt, C.1    Botha, E.C.2
  • 8
    • 0002344794 scopus 로고
    • Bootstrap methods: Another look at the jackknife
    • B. Efron, "Bootstrap methods: Another look at the jackknife," Ann. Statist., vol. 1, no. 1, pp. 1-26, 1979.
    • (1979) Ann. Statist. , vol.1 , Issue.1 , pp. 1-26
    • Efron, B.1
  • 9
    • 0001077032 scopus 로고
    • Nonparametric estimates of standard error: The jackknife, the bootstrap and other methods
    • B. Efron, "Nonparametric estimates of standard error: The jackknife, the bootstrap and other methods," Biometrika, vol. 68, no. 3, pp. 589-599, 1981.
    • (1981) Biometrika , vol.68 , Issue.3 , pp. 589-599
    • Efron, B.1
  • 13
    • 0030211964 scopus 로고    scopus 로고
    • Baggging predictors
    • L. Breiman, "Baggging predictors," Mach. Learn., vol. 24, no. 2, pp. 123-140, 1996.
    • (1996) Mach. Learn. , vol.24 , Issue.2 , pp. 123-140
    • Breiman, L.1
  • 14
    • 0030344230 scopus 로고    scopus 로고
    • Heuristics of instability and stabilization in model selection
    • L. Breiman, "Heuristics of instability and stabilization in model selection," Ann. Statist., vol. 24, no. 6, pp. 2350-2383, 1996.
    • (1996) Ann. Statist. , vol.24 , Issue.6 , pp. 2350-2383
    • Breiman, L.1
  • 15
    • 79959843187 scopus 로고    scopus 로고
    • Acoustic modeling with bootstrap and restructuring for low-resourced languages
    • X. Cui, J. Xue, P. L. Dognin, U. V. Chaudhari, and B. Zhou, "Acoustic modeling with bootstrap and restructuring for low-resourced languages," in Proc. Interspeech, 2010, pp. 2974-2977.
    • (2010) Proc. Interspeech , pp. 2974-2977
    • Cui, X.1    Xue, J.2    Dognin, P.L.3    Chaudhari, U.V.4    Zhou, B.5
  • 20
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Feb.
    • L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 21
    • 0043289776 scopus 로고    scopus 로고
    • Analyzing bagging
    • DOI 10.1214/aos/1031689014
    • P. Bühlmann and B. Yu, "Analyzing bagging," Ann. Statist., vol. 30, no. 4, pp. 927-961, 2002. (Pubitemid 37095335)
    • (2002) Annals of Statistics , vol.30 , Issue.4 , pp. 927-961
    • Buhlmann, P.1    Yu, B.2
  • 22
    • 76249101406 scopus 로고    scopus 로고
    • Effect of subsampling rate on subbagging and related ensembles of stable classifiers
    • F. Zaman and H. Hirose, "Effect of subsampling rate on subbagging and related ensembles of stable classifiers," in Proc. Int. Conf. Pattern Recogn. Mach. Intell., 2009, pp. 44-49.
    • (2009) Proc. Int. Conf. Pattern Recogn. Mach. Intell. , pp. 44-49
    • Zaman, F.1    Hirose, H.2
  • 23
  • 24
    • 64149085496 scopus 로고    scopus 로고
    • Automatic model complexity control using marginalized discriminative growth functions
    • May
    • X. Liu and M. Gales, "Automatic model complexity control using marginalized discriminative growth functions," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 4, pp. 1414-1424, May 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.4 , pp. 1414-1424
    • Liu, X.1    Gales, M.2
  • 25
    • 0033884712 scopus 로고    scopus 로고
    • Model complexity adaptation using a discriminant measure
    • DOI 10.1109/89.824707
    • M. Padmanabhan and L. R. Bahl, "Model complexity adaptation using a discriminant measure," IEEE Trans. Speech Audio Process., vol. 8, no. 2, pp. 205-208, Mar. 2000. (Pubitemid 30578375)
    • (2000) IEEE Transactions on Speech and Audio Processing , vol.8 , Issue.2 , pp. 205-208
    • Padmanabhan, M.1    Bahl, L.R.2
  • 26
    • 33745205656 scopus 로고    scopus 로고
    • Gaussian elimination algorithm for HMM complexity reduction in continuous speech recognition systems
    • 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
    • G. F. G. Yared, F. Violaro, and L. C. Sousa, "Gaussian elimination algorithm for HMM complexity reduction in continuous speech recognition systems," in Proc. Interspeech, 2005, pp. 377-380. (Pubitemid 43908078)
    • (2005) 9th European Conference on Speech Communication and Technology , pp. 377-380
    • Yared, G.F.G.1    Violaro, F.2    Sousa, L.C.3
  • 27
    • 0029747193 scopus 로고
    • Speaker adaptation with autonomous model complexity control by MDL principle
    • K. Shinoda and T. Watanabe, "Speaker adaptation with autonomous model complexity control by MDL principle," in Proc. Int. Conf. Acoust., Speech, Signal Process., 1995, pp. 717-720.
    • (1995) Proc. Int. Conf. Acoust., Speech, Signal Process , pp. 717-720
    • Shinoda, K.1    Watanabe, T.2
  • 28
    • 85009136681 scopus 로고    scopus 로고
    • Model complexity optimization for nonnative English speakers
    • X. He and Y. Zhao, "Model complexity optimization for nonnative English speakers," in Proc. Interspeech, 2001, pp. 1461-1464.
    • (2001) Proc. Interspeech , pp. 1461-1464
    • He, X.1    Zhao, Y.2
  • 29
    • 77955091542 scopus 로고    scopus 로고
    • Methods for merging Gaussian mixture components
    • C. Hennig, "Methods for merging gaussian mixture components," Adv. Data Anal. Classific., vol. 4, no. 1, pp. 3-34, 2010.
    • (2010) Adv. Data Anal. Classific. , vol.4 , Issue.1 , pp. 3-34
    • Hennig, C.1
  • 31
    • 66249107761 scopus 로고    scopus 로고
    • A new approach to merging Gaussian densities in large vocabulary continuous speech recognition
    • W. Xu, J. Duchateau, K. Demuynck, and I. Dologlou, "A new approach to merging Gaussian densities in large vocabulary continuous speech recognition," in Proc. IEEE Benelux Signal Process. Symp., 1998, pp. 231-234.
    • (1998) Proc. IEEE Benelux Signal Process. Symp. , pp. 231-234
    • Xu, W.1    Duchateau, J.2    Demuynck, K.3    Dologlou, I.4
  • 36
    • 70450191334 scopus 로고    scopus 로고
    • Refactoring acoustic models using variational expectation-maximization
    • P. L. Dognin, J. R. Hershey, V. Goel, and P. A. Olsen, "Refactoring acoustic models using variational Expectation-Maximization," in Proc. Interspeech, 2009, pp. 212-215.
    • (2009) Proc. Interspeech , pp. 212-215
    • Dognin, P.L.1    Hershey, J.R.2    Goel, V.3    Olsen, P.A.4
  • 40
    • 0002629270 scopus 로고
    • Maximum likelihood from incomplete data via the EM algorithm
    • A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc., Ser. B, vol. 39, no. 1, pp. 1-38, 1977.
    • (1977) J. R. Statist. Soc., Ser. B , vol.39 , Issue.1 , pp. 1-38
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 41
    • 84865800558 scopus 로고    scopus 로고
    • Acoustic modeling with bootstrap and restructuring based on full covariance
    • X. Cui, X. Chen, J. Xue, P. A. Olsen, J. R. Hershey, and B. Zhou, "Acoustic modeling with bootstrap and restructuring based on full covariance," in Proc. Interspeech, 2011, pp. 1697-1700.
    • (2011) Proc. Interspeech , pp. 1697-1700
    • Cui, X.1    Chen, X.2    Xue, J.3    Olsen, P.A.4    Hershey, J.R.5    Zhou, B.6


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.