SCOPUS Information Retrieval Platform

Pattern Recognition Letters

Volume 30, Issue 13, 2009, Pages 1228-1235

Training data selection for improving discriminative training of acoustic models

(3) Chen, Berlin ᵃ; Liu, Shih Hung ᵃ; Chu, Fang Hui ᵃ

ᵃ National Taiwan Normal University (Taiwan)

Author keywords

Acoustic models; Continuous speech recognition; Data selection; Discriminative training; Entropy; Phone accuracy

Indexed keywords

ACOUSTIC MODEL; ACOUSTIC MODELS; BROADCAST NEWS; DATA SELECTION; DISCRIMINATIVE ACOUSTIC MODEL; DISCRIMINATIVE TRAINING; GAUSSIANS; LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION; PHONE ACCURACY; POSTERIOR PROBABILITY; SPEECH TRANSCRIPTIONS; TRAINING DATA;

ACOUSTICS; CONTINUOUS SPEECH RECOGNITION; ENTROPY; SPEECH TRANSMISSION; TELEPHONE SETS; TRANSCRIPTION; TURNAROUND TIME;

DATA REDUCTION;

EID: 68149178821 PISSN: 01678655 EISSN: None Source Type: Journal
DOI: 10.1016/j.patrec.2009.05.009 Document Type: Article

Times cited: (10)

References (35)

1
- 0036460898
- An overview of decoding techniques for large vocabulary continuous speech recognition
- Aubert X.L. An overview of decoding techniques for large vocabulary continuous speech recognition. Comput. Speech Language 16 (2002) 89-114
- (2002) Comput. Speech Language , vol.16 , pp. 89-114
- Aubert, X.L.¹

2
- 0022890536
- Maximum mutual information estimation of hidden Markov model parameters for speech recognition
- Bahl, L.R., Brown, P.F., de Souza, P.V., Mercer, R.L., 1986. Maximum mutual information estimation of hidden Markov model parameters for speech recognition. In: Proc. IEEE Internat. Conf. Acoustics, Speech, Signal Processing, pp. 49-52.
- (1986) Proc. IEEE Internat. Conf. Acoustics, Speech, Signal Processing , pp. 49-52
- Bahl, L.R.¹ Brown, P.F.² de Souza, P.V.³ Mercer, R.L.⁴

3
- 4544253838
- Improving broadcast news transcription by lightly supervised discriminative training
- Chan, H.Y., Woodland, P.C., 2004. Improving broadcast news transcription by lightly supervised discriminative training. In: Proc. IEEE Internat. Conf. Acoustics, Speech, Signal Processing, pp. 737-740.
- (2004) Proc. IEEE Internat. Conf. Acoustics, Speech, Signal Processing , pp. 737-740
- Chan, H.Y.¹ Woodland, P.C.²

4
- 4544302571
- Lightly supervised and data-driven approaches to Mandarin broadcast news transcription
- Chen, B., Kuo, J.W., Tsai, W.H., 2004. Lightly supervised and data-driven approaches to Mandarin broadcast news transcription. In: Proc. IEEE Internat. Conf. Acoustics, Speech, Signal Processing, pp. 777-780.
- (2004) Proc. IEEE Internat. Conf. Acoustics, Speech, Signal Processing , pp. 777-780
- Chen, B.¹ Kuo, J.W.² Tsai, W.H.³

5
- 34547505668
- Word topical mixture models for dynamic language model adaptation
- Chiu, H.S., Chen, B., 2007. Word topical mixture models for dynamic language model adaptation. In: Proc. IEEE Internat. Conf. Acoustics, Speech, Signal Processing, pp. 169-172.
- (2007) Proc. IEEE Internat. Conf. Acoustics, Speech, Signal Processing , pp. 169-172
- Chiu, H.S.¹ Chen, B.²

6
- 0036475982
- Maximum likelihood multiple subspace projections for hidden Markov models
- Gales M.J.F. Maximum likelihood multiple subspace projections for hidden Markov models. IEEE Trans. Speech Audio Process. 10 2 (2002) 37-47
- (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.2 , pp. 37-47
- Gales, M.J.F.¹

7
- 27644444018
- A dynamic in-search data selection method with its applications to acoustic modeling and utterance verification
- Jiang H., Soong F.K., and Lee C.H. A dynamic in-search data selection method with its applications to acoustic modeling and utterance verification. IEEE Trans. Speech Audio Process. 13 5 (2005) 945-955
- (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.5 , pp. 945-955
- Jiang, H.¹ Soong, F.K.² Lee, C.H.³

8
- 34047115134
- Large margin hidden Markov models for speech recognition
- Jiang H., Li X.W., and Liu C.J. Large margin hidden Markov models for speech recognition. IEEE Trans. Audio, Speech Language Process. 14 5 (2006) 1584-1595
- (2006) IEEE Trans. Audio, Speech Language Process. , vol.14 , Issue.5 , pp. 1584-1595
- Jiang, H.¹ Li, X.W.² Liu, C.J.³

9
- 0031139839
- Minimum classification error rate methods for speech recognition
- Juang B.H., Chou W., and Lee C.H. Minimum classification error rate methods for speech recognition. IEEE Trans. Speech Audio Process. 5 3 (1997) 257-265
- (1997) IEEE Trans. Speech Audio Process. , vol.5 , Issue.3 , pp. 257-265
- Juang, B.H.¹ Chou, W.² Lee, C.H.³

10
- 0003871508
- Ph.D. Dissertation, Johns Hopkins University
- Kumar, N., 1997. Investigation of Silicon-Auditory Models and Generalization of Linear Discriminant Analysis for Improved Speech Recognition. Ph.D. Dissertation, Johns Hopkins University.
- (1997) Investigation of Silicon-Auditory Models and Generalization of Linear Discriminant Analysis for Improved Speech Recognition
- Kumar, N.¹

11
- 46449138280
- An empirical study of word error minimization approaches for Mandarin large vocabulary speech recognition
- Kuo J.W., Liu S.H., Wang H.M., and Chen B. An empirical study of word error minimization approaches for Mandarin large vocabulary speech recognition. Internat. J. Comput. Linguistics Chinese Language Process. 11 3 (2006) 201-222
- (2006) Internat. J. Comput. Linguistics Chinese Language Process. , vol.11 , Issue.3 , pp. 201-222
- Kuo, J.W.¹ Liu, S.H.² Wang, H.M.³ Chen, B.⁴

12
- 68149124028
- Empirical error rate minimization based linear discriminant analysis
- Lee, H.S., Chen, B., 2009. Empirical error rate minimization based linear discriminant analysis. In: Proc. IEEE Internat. Conf. Acoustics, Speech, Signal Processing, pp. 1801-1804.
- (2009) Proc. IEEE Internat. Conf. Acoustics, Speech, Signal Processing , pp. 1801-1804
- Lee, H.S.¹ Chen, B.²

13
- 67651157825
- Ph.D. Dissertation, Georgia Institute of Technology
- Li, J., 2008. Soft Margin Estimation for Automatic Speech Recognition. Ph.D. Dissertation, Georgia Institute of Technology.
- (2008) Soft Margin Estimation for Automatic Speech Recognition
- Li, J.¹

14
- 64149098818
- Approximate test risk bound minimization through soft margin estimation
- Li J., Ma B., and Lee C.H. Approximate test risk bound minimization through soft margin estimation. IEEE Trans. Audio, Speech Language Process. 15 8 (2007) 2393-2404
- (2007) IEEE Trans. Audio, Speech Language Process. , vol.15 , Issue.8 , pp. 2393-2404
- Li, J.¹ Ma, B.² Lee, C.H.³

15
- 46449108334
- Investigating data selection for minimum phone error training of acoustic models
- Liu, S.H., Chu, F.H., Lin, S.H., Chen, B., 2007a. Investigating data selection for minimum phone error training of acoustic models. In: Proc. IEEE Internat. Conf. on Multimedia and Expo, pp. 348-351.
- (2007) Proc. IEEE Internat. Conf. on Multimedia and Expo , pp. 348-351
- Liu, S.H.¹ Chu, F.H.² Lin, S.H.³ Chen, B.⁴

16
- 44849114709
- Training data selection for improving discriminative training of acoustic models
- Liu, S.H., Chu, F.H., Lin, S.H., Lee, H.S., Chen, B., 2007b. Training data selection for improving discriminative training of acoustic models. In: Proc. IEEE Workshop on Automatic Speech Recognition and Understanding, pp. 284-289.
- (2007) Proc. IEEE Workshop on Automatic Speech Recognition and Understanding , pp. 284-289
- Liu, S.H.¹ Chu, F.H.² Lin, S.H.³ Lee, H.S.⁴ Chen, B.⁵

17
- 68149125060
- Improved minimum phone error based discriminative training of acoustic models for Mandarin large vocabulary continuous speech recognition
- Liu S.H., Chu F.H., Lo Y.T., and Chen B. Improved minimum phone error based discriminative training of acoustic models for Mandarin large vocabulary continuous speech recognition. Internat. J. Comput. Linguistics Chinese Language Process. 3 (2008) 327-342
- (2008) Internat. J. Comput. Linguistics Chinese Language Process. , Issue.3 , pp. 327-342
- Liu, S.H.¹ Chu, F.H.² Lo, Y.T.³ Chen, B.⁴

18
- 33646762098
- Discriminative training of acoustic models applied to domains with unreliable transcripts
- Mathias, L., Yegnanarayanan, G., Fritsch, J., 2005. Discriminative training of acoustic models applied to domains with unreliable transcripts. In: Proc. IEEE Internat. Conf. Acoustics, Speech, Signal Processing, pp. 109-112.
- (2005) Proc. IEEE Internat. Conf. Acoustics, Speech, Signal Processing , pp. 109-112
- Mathias, L.¹ Yegnanarayanan, G.² Fritsch, J.³

19
- 34547522070
- Discriminative training for large vocabulary speech recognition using minimum classification error
- McDermott E., Hazen T.J., Roux J.L., Nakamura A., and Katagiri S. Discriminative training for large vocabulary speech recognition using minimum classification error. IEEE Trans. Audio, Speech Language Process. 15 1 (2007) 203-223
- (2007) IEEE Trans. Audio, Speech Language Process. , vol.15 , Issue.1 , pp. 203-223
- McDermott, E.¹ Hazen, T.J.² Roux, J.L.³ Nakamura, A.⁴ Katagiri, S.⁵

20
- 33745205617
- Spectral entropy feature in full-combination multi-stream for robust ASR
- Misra, H., Bourlard, H., 2005. Spectral entropy feature in full-combination multi-stream for robust ASR. In: Proc. European Conf. Speech Communication and Technology, pp. 2633-2636.
- (2005) Proc. European Conf. Speech Communication and Technology , pp. 2633-2636
- Misra, H.¹ Bourlard, H.²

21
- 0020796537
- A decision theoretic formulation of a training problem in speech recognition and a comparison of training by unconditional versus conditional maximum likelihood
- Nadas A. A decision theoretic formulation of a training problem in speech recognition and a comparison of training by unconditional versus conditional maximum likelihood. IEEE Trans. Acoustics, Speech, Signal Process. 31 4 (1983) 814-817
- (1983) IEEE Trans. Acoustics, Speech, Signal Process. , vol.31 , Issue.4 , pp. 814-817
- Nadas, A.¹

22
- 68149178175
- Normandin, Y. Hidden Markov Models, Maximum Mutual Information Estimation, and the Speech Recognition Problems. Ph.D. Dissertation, McGill University
- Normandin, Y. Hidden Markov Models, Maximum Mutual Information Estimation, and the Speech Recognition Problems. Ph.D. Dissertation, McGill University.

23
- 0030719155
- A word graph algorithm for large vocabulary continuous speech recognition
- Ortmanns S., Ney H., and Aubert X. A word graph algorithm for large vocabulary continuous speech recognition. Comput. Speech Language 11 (1997) 43-72
- (1997) Comput. Speech Language , vol.11 , pp. 43-72
- Ortmanns, S.¹ Ney, H.² Aubert, X.³

24
- 4544265717
- Ph.D. Dissertation, Peterhouse, University of Cambridge
- Povey, D., 2004. Discriminative Training for Large Vocabulary Speech Recognition. Ph.D. Dissertation, Peterhouse, University of Cambridge.
- (2004) Discriminative Training for Large Vocabulary Speech Recognition
- Povey, D.¹

25
- 0036461035
- Large scale discriminative training of acoustic models for speech recognition
- Povey D., and Woodland P.C. Large scale discriminative training of acoustic models for speech recognition. Comput. Speech Language 16 (2002) 25-47
- (2002) Comput. Speech Language , vol.16 , pp. 25-47
- Povey, D.¹ Woodland, P.C.²

26
- 0036296863
- Minimum phone error and i-smoothing for improved discriminative training
- Povey, D., Woodland, P.C., 2002b. Minimum phone error and i-smoothing for improved discriminative training. In: Proc. IEEE Internat. Conf. Acoustics, Speech, Signal Processing, pp. 105-108.
- (2002) Proc. IEEE Internat. Conf. Acoustics, Speech, Signal Processing , pp. 105-108
- Povey, D.¹ Woodland, P.C.²

27
- 0035340902
- Data-driven approach to designing compound words for continuous speech recognition
- Saon G., and Padmanabhan M. Data-driven approach to designing compound words for continuous speech recognition. IEEE Trans. Speech Audio Process. 9 4 (2001) 327-332
- (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.4 , pp. 327-332
- Saon, G.¹ Padmanabhan, M.²

28
- 0033677121
- Maximum likelihood discriminant feature spaces
- Saon, G., Padmanabhan, M., Gopinath, R., Chen, S., 2000. Maximum likelihood discriminant feature spaces. In: Proc. IEEE Internat. Conf. Acoustics, Speech, Signal Processing, vol. 2, pp. 1129-1132.
- (2000) Proc. IEEE Internat. Conf. Acoustics, Speech, Signal Processing , vol.2 , pp. 1129-1132
- Saon, G.¹ Padmanabhan, M.² Gopinath, R.³ Chen, S.⁴

29
- 68149150369
- Stolcke, A., 2000. SRI language modeling toolkit. Version 1.3.3, 2000
- Stolcke, A., 2000. SRI language modeling toolkit. Version 1.3.3, 2000.

30
- 0012350788
- Ph.D. Dissertation, Peterhouse, University of Cambridge
- Valchev, V., 1995. Discriminative Methods in HMM-based Speech Recognition. Ph.D. Dissertation, Peterhouse, University of Cambridge.
- (1995) Discriminative Methods in HMM-based Speech Recognition
- Valchev, V.¹

31
- 0003450542
- Springer-Verlag, New York
- Vapnik V. The Nature of Statistical Learning Theory (1995), Springer-Verlag, New York
- (1995) The Nature of Statistical Learning Theory
- Vapnik, V.¹

32
- 33745184949
- MATBN: A Mandarin Chinese broadcast news corpus
- Wang H.M., Chen B., Kuo J.W., and Cheng S.S. MATBN: A Mandarin Chinese broadcast news corpus. Internat. J. Comput. Linguistics Chinese Language Process. 10 1 (2005) 219-235
- (2005) Internat. J. Comput. Linguistics Chinese Language Process. , vol.10 , Issue.1 , pp. 219-235
- Wang, H.M.¹ Chen, B.² Kuo, J.W.³ Cheng, S.S.⁴

33
- 47749150672
- Large-margin discriminative training of hidden Markov models for speech recognition
- Yu, D., Deng, L., 2007. Large-margin discriminative training of hidden Markov models for speech recognition. In: Proc. IEEE Internat. Conf. Semantic Computing, pp. 429-438.
- (2007) Proc. IEEE Internat. Conf. Semantic Computing , pp. 429-438
- Yu, D.¹ Deng, L.²

34
- 34547526577
- Large-margin minimum classification error training for large-scale speech recognition tasks
- Yu, D., Deng, L., He, X., Acero, A., 2007. Large-margin minimum classification error training for large-scale speech recognition tasks. In: Proc. IEEE Internat. Conf. Acoustics, Speech, Signal Processing, vol. 4, pp. 1137-1140.
- (2007) Proc. IEEE Internat. Conf. Acoustics, Speech, Signal Processing , vol.4 , pp. 1137-1140
- Yu, D.¹ Deng, L.² He, X.³ Acero, A.⁴

35
- 42949105203
- Large-margin minimum classification error training: A theoretical risk minimization perspective
- Yu D., Deng L., He X., and Acero A. Large-margin minimum classification error training: A theoretical risk minimization perspective. Comput. Speech Language 22 4 (2008) 415-429
- (2008) Comput. Speech Language , vol.22 , Issue.4 , pp. 415-429
- Yu, D.¹ Deng, L.² He, X.³ Acero, A.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.