메뉴 건너뛰기




Volumn 30, Issue 13, 2009, Pages 1228-1235

Training data selection for improving discriminative training of acoustic models

Author keywords

Acoustic models; Continuous speech recognition; Data selection; Discriminative training; Entropy; Phone accuracy

Indexed keywords

ACOUSTIC MODEL; ACOUSTIC MODELS; BROADCAST NEWS; DATA SELECTION; DISCRIMINATIVE ACOUSTIC MODEL; DISCRIMINATIVE TRAINING; GAUSSIANS; LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION; PHONE ACCURACY; POSTERIOR PROBABILITY; SPEECH TRANSCRIPTIONS; TRAINING DATA;

EID: 68149178821     PISSN: 01678655     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.patrec.2009.05.009     Document Type: Article
Times cited : (10)

References (35)
  • 1
    • 0036460898 scopus 로고    scopus 로고
    • An overview of decoding techniques for large vocabulary continuous speech recognition
    • Aubert X.L. An overview of decoding techniques for large vocabulary continuous speech recognition. Comput. Speech Language 16 (2002) 89-114
    • (2002) Comput. Speech Language , vol.16 , pp. 89-114
    • Aubert, X.L.1
  • 6
    • 0036475982 scopus 로고    scopus 로고
    • Maximum likelihood multiple subspace projections for hidden Markov models
    • Gales M.J.F. Maximum likelihood multiple subspace projections for hidden Markov models. IEEE Trans. Speech Audio Process. 10 2 (2002) 37-47
    • (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.2 , pp. 37-47
    • Gales, M.J.F.1
  • 7
    • 27644444018 scopus 로고    scopus 로고
    • A dynamic in-search data selection method with its applications to acoustic modeling and utterance verification
    • Jiang H., Soong F.K., and Lee C.H. A dynamic in-search data selection method with its applications to acoustic modeling and utterance verification. IEEE Trans. Speech Audio Process. 13 5 (2005) 945-955
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.5 , pp. 945-955
    • Jiang, H.1    Soong, F.K.2    Lee, C.H.3
  • 9
    • 0031139839 scopus 로고    scopus 로고
    • Minimum classification error rate methods for speech recognition
    • Juang B.H., Chou W., and Lee C.H. Minimum classification error rate methods for speech recognition. IEEE Trans. Speech Audio Process. 5 3 (1997) 257-265
    • (1997) IEEE Trans. Speech Audio Process. , vol.5 , Issue.3 , pp. 257-265
    • Juang, B.H.1    Chou, W.2    Lee, C.H.3
  • 11
    • 46449138280 scopus 로고    scopus 로고
    • An empirical study of word error minimization approaches for mandarin large vocabulary speech recognition
    • Kuo J.W., Liu S.H., Wang H.M., and Chen B. An empirical study of word error minimization approaches for mandarin large vocabulary speech recognition. Internat. J. Comput. Linguistic Chinese Language Process. 11 3 (2006) 201-222
    • (2006) Internat. J. Comput. Linguistic Chinese Language Process. , vol.11 , Issue.3 , pp. 201-222
    • Kuo, J.W.1    Liu, S.H.2    Wang, H.M.3    Chen, B.4
  • 14
    • 64149098818 scopus 로고    scopus 로고
    • Approximate test risk bound minimization through soft margin estimation
    • Li J., Ma B., and Lee C.H. Approximate test risk bound minimization through soft margin estimation. IEEE Trans. Audio, Speech Language Process. 15 8 (2007) 2393-2404
    • (2007) IEEE Trans. Audio, Speech Language Process. , vol.15 , Issue.8 , pp. 2393-2404
    • Li, J.1    Ma, B.2    Lee, C.H.3
  • 17
    • 68149125060 scopus 로고    scopus 로고
    • Improved minimum phone error based discriminative training of acoustic models for Mandarin large vocabulary continuous speech recognition
    • Liu S.H., Chu F.H., Lo Y.T., and Chen B. Improved minimum phone error based discriminative training of acoustic models for Mandarin large vocabulary continuous speech recognition. Internat. J. Comput. Linguistics Chinese Language Process. 3 (2008) 327-342
    • (2008) Internat. J. Comput. Linguistics Chinese Language Process. , Issue.3 , pp. 327-342
    • Liu, S.H.1    Chu, F.H.2    Lo, Y.T.3    Chen, B.4
  • 20
    • 33745205617 scopus 로고    scopus 로고
    • Spectral entropy feature in full-combination multi-stream for robust ASR
    • Speech Communication and Technology, pp
    • Misra, H., Bourlard, H., 2005. Spectral entropy feature in full-combination multi-stream for robust ASR. In: Proc. European Conf. Speech Communication and Technology, pp. 2633-2636.
    • (2005) Proc. European Conf , pp. 2633-2636
    • Misra, H.1    Bourlard, H.2
  • 21
    • 0020796537 scopus 로고
    • A decision theoretic formulation of a training problem in speech recognition and a comparison of training by unconditional versus conditional maximum likelihood
    • Nadas A. A decision theoretic formulation of a training problem in speech recognition and a comparison of training by unconditional versus conditional maximum likelihood. IEEE Trans. Acoustics, Speech, Signal Process. 31 4 (1983) 814-817
    • (1983) IEEE Trans. Acoustics, Speech, Signal Process. , vol.31 , Issue.4 , pp. 814-817
    • Nadas, A.1
  • 22
    • 68149178175 scopus 로고    scopus 로고
    • Normandin, Y. Hidden, Markov Models, Maximum Mutual Information Estimation, and the Speech Recognition Problems. Ph.D. Dissertation, McGill University
    • Normandin, Y. Hidden, Markov Models, Maximum Mutual Information Estimation, and the Speech Recognition Problems. Ph.D. Dissertation, McGill University.
  • 23
    • 0030719155 scopus 로고    scopus 로고
    • A word graph algorithm for large vocabulary continuous speech recognition
    • Ortmanns S., Ney H., and Aubert X. A word graph algorithm for large vocabulary continuous speech recognition. Comput. Speech Language 11 (1997) 43-72
    • (1997) Comput. Speech Language , vol.11 , pp. 43-72
    • Ortmanns, S.1    Ney, H.2    Aubert, X.3
  • 25
    • 0036461035 scopus 로고    scopus 로고
    • Large scale discriminative training of acoustic models for speech recognition
    • Povey D., and Woodland P.C. Large scale discriminative training of acoustic models for speech recognition. Comput. Speech Language 16 (2002) 25-47
    • (2002) Comput. Speech Language , vol.16 , pp. 25-47
    • Povey, D.1    Woodland, P.C.2
  • 27
    • 0035340902 scopus 로고    scopus 로고
    • Data-driven approach to designing compound words for continuous speech recognition
    • Saon G., and Padmanabhan M. Data-driven approach to designing compound words for continuous speech recognition. IEEE Trans. Speech Audio Process. 9 4 (2001) 327-332
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.4 , pp. 327-332
    • Saon, G.1    Padmanabhan, M.2
  • 29
    • 68149150369 scopus 로고    scopus 로고
    • Stolcke, A, 2000. SRI language modeling toolkit. Version 1.3.3, 2000
    • Stolcke, A., 2000. SRI language modeling toolkit. Version 1.3.3, 2000. .
  • 33
    • 47749150672 scopus 로고    scopus 로고
    • Large-margin discriminative training of hidden markov models for speech recognition
    • Yu, D., Deng, L., 2007. Large-margin discriminative training of hidden markov models for speech recognition. In: Proc. IEEE Internat. Conf. Semantic Computing, pp. 429-438.
    • (2007) Proc. IEEE Internat. Conf. Semantic Computing , pp. 429-438
    • Yu, D.1    Deng, L.2
  • 35
    • 42949105203 scopus 로고    scopus 로고
    • Large-margin minimum classification error training: A theoretical risk minimization perspective
    • Yu D., Deng L., He X., and Acero A. Large-margin minimum classification error training: A theoretical risk minimization perspective. Comput. Speech Language 22 4 (2008) 415-429
    • (2008) Comput. Speech Language , vol.22 , Issue.4 , pp. 415-429
    • Yu, D.1    Deng, L.2    He, X.3    Acero, A.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.