SCOPUS Information Retrieval Platform

Speech Communication

Volume 30, Issue 4, 2000, Pages 273-293

Robust training algorithm for adverse speech recognition

(2) Hong, Wei Tyng (a); Chen, Sin Horng (b)

a INDUSTRIAL TECHNOLOGY RESEARCH INSTITUTE (Taiwan)

b NATIONAL CHIAO TUNG UNIVERSITY (Taiwan)

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTIC NOISE; COMMUNICATION CHANNELS (INFORMATION THEORY); ITERATIVE METHODS; LEARNING ALGORITHMS; LEARNING SYSTEMS; MARKOV PROCESSES; MATHEMATICAL MODELS; SPEECH ANALYSIS; SPEECH COMMUNICATION; HIDDEN MARKOV MODELS (HMM); SPEECH RECOGNITION

EID: 0033888153
PISSN: 01676393
EISSN: None
Source Type: Journal
DOI: 10.1016/S0167-6393(99)00057-6
Document Type: Article

Times cited : (17)

References (37)

1
- 0025628728
- Environmental robustness in automatic speech recognition
- Acero, A., Stern, R.M., 1990. Environmental robustness in automatic speech recognition. In: Proceedings of ICASSP-90, pp. 849-852.
- (1990) In: Proceedings of ICASSP-90 , pp. 849-852
- Acero, A.¹ Stern, R.M.²

2
- 0026385284
- Robust speech recognition by normalization of the acoustic space
- Acero, A., Stern, R.M., 1991. Robust speech recognition by normalization of the acoustic space. In: Proceedings of ICASSP-91, pp. 893-896.
- (1991) In: Proceedings of ICASSP-91 , pp. 893-896
- Acero, A.¹ Stern, R.M.²

3
- 0030677475
- Speaker adaptive training: A maximum likelihood approach to speaker normalization
- Anastasakos, T., McDonough, J., Makhoul, J., 1997. Speaker adaptive training: a maximum likelihood approach to speaker normalization. In: Proceedings of ICASSP-97, pp. 1043-1046.
- (1997) In: Proceedings of ICASSP-97 , pp. 1043-1046
- Anastasakos, T.¹ McDonough, J.² Makhoul, J.³

4
- 0027627368
- Discriminative analysis of distortion sequences in speech recognition
- Chang P.-C., Chen S.-H., Juang B.-H. Discriminative analysis of distortion sequences in speech recognition. IEEE Trans. Speech and Audio Process. 1:1993;326-333.
- (1993) IEEE Trans. Speech and Audio Process. , vol.1 , pp. 326-333
- Chang, P.-C.¹ Chen, S.-H.² Juang, B.-H.³

5
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- Dempster A., Laird N., Rubin D. Maximum likelihood from incomplete data via the EM algorithm. J. Roy. Statist. Soc. 39:1977;1-38.
- (1977) J. Roy. Statist. Soc. , vol.39 , pp. 1-38
- Dempster, A.¹ Laird, N.² Rubin, D.³

6
- 84948598244
- Statistical-model-based speech enhancement systems
- Ephraim Y. Statistical-model-based speech enhancement systems. Proc. IEEE. 80:1992;1526-1555.
- (1992) Proc. IEEE , vol.80 , pp. 1526-1555
- Ephraim, Y.¹

7
- 0015600423
- The Viterbi algorithm
- Forney G. The Viterbi algorithm. Proc. IEEE. 61:1973;268-278.
- (1973) Proc. IEEE , vol.61 , pp. 268-278
- Forney, G.¹

8
- 0012265819
- Toward robust speech recognition under adverse conditions
- Furui, S., 1992. Toward robust speech recognition under adverse conditions. In: Proceedings of the ESCA Workshop on Speech Processing in Adverse Conditions, pp. 31-42.
- (1992) In: Proceedings of the ESCA Workshop on Speech Processing in Adverse Conditions , pp. 31-42
- Furui, S.¹

9
- 0030263447
- Mean and variance adaptation within the MLLR framework
- Gales M.J.F., Woodland P.C. Mean and variance adaptation within the MLLR framework. Comput. Speech and Language. 10:1996;249-264.
- (1996) Comput. Speech and Language , vol.10 , pp. 249-264
- Gales, M.J.F.¹ Woodland, P.C.²

10
- 0027622731
- Cepstral parameter compensation for HMM recognition in noise
- Gales M.J.F., Young S.J. Cepstral parameter compensation for HMM recognition in noise. Speech Communication. 12:1993;231-239.
- (1993) Speech Communication , vol.12 , pp. 231-239
- Gales, M.J.F.¹ Young, S.J.²

11
- 0029390135
- Robust speech recognition in additive and convolutional noise using parallel model combination
- Gales M.J.F., Young S.J. Robust speech recognition in additive and convolutional noise using parallel model combination. Comput. Speech and Language. 9:1995;289-307.
- (1995) Comput. Speech and Language , vol.9 , pp. 289-307
- Gales, M.J.F.¹ Young, S.J.²

12
- 0030245128
- Robust continuous speech recognition using parallel model combination
- Gales M.J.F., Young S.J. Robust continuous speech recognition using parallel model combination. IEEE Trans. Speech and Audio Process. 4:1996;352-359.
- (1996) IEEE Trans. Speech and Audio Process. , vol.4 , pp. 352-359
- Gales, M.J.F.¹ Young, S.J.²

13
- 0029288202
- Speech recognition in noisy environments: A survey
- Gong Y. Speech recognition in noisy environments: A survey. Speech Communication. 16:1995;261-291.
- (1995) Speech Communication , vol.16 , pp. 261-291
- Gong, Y.¹

14
- 0347321460
- Source normalization training for HMM applied to noisy telephone speech recognition
- Gong, Y., 1997. Source normalization training for HMM applied to noisy telephone speech recognition. In: Proceedings of EuroSpeech-97, Vol. 3, pp. 1555-1558.
- (1997) In: Proceedings of EuroSpeech-97 , vol.3 , pp. 1555-1558
- Gong, Y.¹

15
- 0026135903
- Constrained iterative speech enhancement with application to speech recognition
- Hansen J.H.L., Clements M.A. Constrained iterative speech enhancement with application to speech recognition. IEEE Trans. Signal Process. 39:1991;795-805.
- (1991) IEEE Trans. Signal Process. , vol.39 , pp. 795-805
- Hansen, J.H.L.¹ Clements, M.A.²

16
- 0028517164
- RASTA processing of speech
- Hermansky H., Morgan N. RASTA processing of speech. IEEE Trans. Speech and Audio Process. 2:1994;578-589.
- (1994) IEEE Trans. Speech and Audio Process. , vol.2 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

17
- 33747947441
- A robust RNN-based pre-classification for Noisy Mandarin speech recognition
- Hong, W.-T., Chen, S.-H., 1997. A robust RNN-based pre-classification for Noisy Mandarin speech recognition. In: Proceedings of EuroSpeech-97, Vol. 3, pp. 1083-1086.
- (1997) In: Proceedings of EuroSpeech-97 , vol.3 , pp. 1083-1086
- Hong, W.-T.¹ Chen, S.-H.²

18
- 0343800873
- RNN-based speech segmentation and its applications to robust noisy Mandarin speech recognition
- revised
- Hong, W.-T., Liao, Y.-F., Wang, Y.-R., Chen, S.-H., 1999. RNN-based speech segmentation and its applications to robust noisy Mandarin speech recognition. J. Acoust. Soc. Amer., revised.
- (1999) J. Acoust. Soc. Amer.
- Hong, W.-T.¹ Liao, Y.-F.² Wang, Y.-R.³ Chen, S.-H.⁴

19
- 0026189808
- Speech recognition in adverse environment
- Juang B.-H. Speech recognition in adverse environment. Comput. Speech and Language. 5:1991;275-294.
- (1991) Comput. Speech and Language , vol.5 , pp. 275-294
- Juang, B.-H.¹

20
- 0025493667
- The segmental K-means algorithm for estimating parameters of hidden Markov models
- Juang B.-H., Rabiner L.R. The segmental K-means algorithm for estimating parameters of hidden Markov models. IEEE Trans. Acoust. Speech Signal Process. 38:1990;1639-1641.
- (1990) IEEE Trans. Acoust. Speech Signal Process. , vol.38 , pp. 1639-1641
- Juang, B.-H.¹ Rabiner, L.R.²

21
- 0003770709
- Boston, MA: Kluwer Academic Press
- Junqua J.-C., Haton J.-P. Robustness in Automatic Speech Recognition: Fundamentals and Applications. 1996;Kluwer Academic Press, Boston, MA.
- (1996) Robustness in Automatic Speech Recognition: Fundamentals and Applications
- Junqua, J.-C.¹ Haton, J.-P.²

22
- 0028461861
- A robust algorithm for word boundary detection in the presence of noise
- Junqua J.-C., Mak B., Reaves B. A robust algorithm for word boundary detection in the presence of noise. IEEE Trans. Speech and Audio Process. 2:1994;406-412.
- (1994) IEEE Trans. Speech and Audio Process. , vol.2 , pp. 406-412
- Junqua, J.-C.¹ Mak, B.² Reaves, B.³

23
- 0032140546
- On stochastic feature and model compensation approaches to robust speech recognition
- Lee C.-H. On stochastic feature and model compensation approaches to robust speech recognition. Speech Communication. 25:1998;29-47.
- (1998) Speech Communication , vol.25 , pp. 29-47
- Lee, C.-H.¹

24
- 0005122887
- A survey on automatic speech recognition with an illustrative example on continuous speech recognition of Mandarin
- Lee C.-H., Juang B.-H. A survey on automatic speech recognition with an illustrative example on continuous speech recognition of Mandarin. J. Comput. Linguist. Chinese Language Process. 1:1996;1-36.
- (1996) J. Comput. Linguist. Chinese Language Process. , vol.1 , pp. 1-36
- Lee, C.-H.¹ Juang, B.-H.²

25
- 0017980972
- All-pole modeling of degraded speech
- Lim J.S., Oppenheim A.V. All-pole modeling of degraded speech. IEEE Trans. Acoust. Speech Sig. Process. 26:1978;197-210.
- (1978) IEEE Trans. Acoust. Speech Sig. Process. , vol.26 , pp. 197-210
- Lim, J.S.¹ Oppenheim, A.V.²

26
- 0029748334
- Speech recognition on Mandarin call home: A large vocabulary, conversational and telephone speech corpus
- Liu, F.-H., Picheny, M., Srinivasa, P., Monkowski, M., Chen, J., 1996. Speech recognition on Mandarin call home: a large vocabulary, conversational and telephone speech corpus. In: Proceedings of ICASSP-96, Vol. 1, pp. 157-160.
- (1996) In: Proceedings of ICASSP-96 , vol.1 , pp. 157-160
- Liu, F.-H.¹ Picheny, M.² Srinivasa, P.³ Monkowski, M.⁴ Chen, J.⁵

27
- 0026882842
- Experiments with a Nonlinear Spectral Subtractor (NSS), Hidden Markov Models and the projection, for robust speech recognition in cars
- Lockwood P., Boudy J. Experiments with a Nonlinear Spectral Subtractor (NSS), Hidden Markov Models and the projection, for robust speech recognition in cars. Speech Communication. 11:1992;215-228.
- (1992) Speech Communication , vol.11 , pp. 215-228
- Lockwood, P.¹ Boudy, J.²

28
- 0029375754
- Automatic word recognition in cars
- Mokbel C.E., Chollet G.F.A. Automatic word recognition in cars. IEEE Trans. Speech and Audio Process. 3:1995;346-356.
- (1995) IEEE Trans. Speech and Audio Process. , vol.3 , pp. 346-356
- Mokbel, C.E.¹ Chollet, G.F.A.²

29
- 0029745435
- Adaptation method based on HMM composition and EM algorithm
- Minami, Y., Furui, S., 1996. Adaptation method based on HMM composition and EM algorithm. In: Proceedings of ICASSP-96, pp. 327-330.
- (1996) In: Proceedings of ICASSP-96 , pp. 327-330
- Minami, Y.¹ Furui, S.²

30
- 0029747581
- Noise and room acoustics distorted speech recognition by HMM composition
- Nakamura, S., Takiguchi, T., Shikano, K., 1996. Noise and room acoustics distorted speech recognition by HMM composition. In: Proceedings of ICASSP-96, Vol. 1, pp. 69-72.
- (1996) In: Proceedings of ICASSP-96 , vol.1 , pp. 69-72
- Nakamura, S.¹ Takiguchi, T.² Shikano, K.³

31
- 85135164500
- Evaluating features set performance using the F-ratio and J-measures
- Nicholson, S., Milner, B., Cox, S., 1997. Evaluating features set performance using the F-ratio and J-measures. In: Proceedings of EuroSpeech-97, Vol. 1, pp. 413-416.
- (1997) In: Proceedings of EuroSpeech-97 , vol.1 , pp. 413-416
- Nicholson, S.¹ Milner, B.² Cox, S.³

32
- 0029769867
- Signal bias removal by maximum likelihood estimation for robust telephone speech recognition
- Rahim M., Juang B.-H. Signal bias removal by maximum likelihood estimation for robust telephone speech recognition. IEEE Trans. Speech and Audio Process. 4:1996;19-30.
- (1996) IEEE Trans. Speech and Audio Process. , vol.4 , pp. 19-30
- Rahim, M.¹ Juang, B.-H.²

33
- 0030149866
- A maximum-likelihood approach to stochastic matching for robust speech recognition
- Sankar A., Lee C.-H. A maximum-likelihood approach to stochastic matching for robust speech recognition. IEEE Trans. Speech and Audio Process. 4:1996;190-202.
- (1996) IEEE Trans. Speech and Audio Process. , vol.4 , pp. 190-202
- Sankar, A.¹ Lee, C.-H.²

34
- 0027623210
- Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems
- Varga A., Steeneken H.J.M. Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems. Speech Communication. 12:1993;247-251.
- (1993) Speech Communication , vol.12 , pp. 247-251
- Varga, A.¹ Steeneken, H.J.M.²

35
- 0030779363
- Noise compensation methods for hidden Markov model speech recognition in adverse environments
- Vaseghi S.V., Milner B.P. Noise compensation methods for hidden Markov model speech recognition in adverse environments. IEEE Trans. Speech and Audio Process. 5:1997;11-21.
- (1997) IEEE Trans. Speech and Audio Process. , vol.5 , pp. 11-21
- Vaseghi, S.V.¹ Milner, B.P.²

36
- 0006498352
- Mandarin telephone speech recognition for automatic telephone number directory service
- Wang, Y.-R., Chen, S.-H., 1998. Mandarin telephone speech recognition for automatic telephone number directory service. In: Proceedings of ICASSP-98, Vol. 2, pp. 841-844.
- (1998) In: Proceedings of ICASSP-98 , vol.2 , pp. 841-844
- Wang, Y.-R.¹ Chen, S.-H.²

37
- 0029770844
- Self-learning speaker and channel adaptation based on spectral variation source decomposition
- Zhao Y. Self-learning speaker and channel adaptation based on spectral variation source decomposition. Speech Communication. 18:1996;65-77.
- (1996) Speech Communication , vol.18 , pp. 65-77
- Zhao, Y.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.