Volume 30, Issue 4, 2000, Pages 273-293

Robust training algorithm for adverse speech recognition

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTIC NOISE; COMMUNICATION CHANNELS (INFORMATION THEORY); ITERATIVE METHODS; LEARNING ALGORITHMS; LEARNING SYSTEMS; MARKOV PROCESSES; MATHEMATICAL MODELS; SPEECH ANALYSIS; SPEECH COMMUNICATION

EID: 0033888153     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/S0167-6393(99)00057-6     Document Type: Article
Times cited: 17

References (37)
  • 1
    • 0025628728
    • Environmental robustness in automatic speech recognition
    • Acero, A., Stern, R.M., 1990. Environmental robustness in automatic speech recognition. In: Proceedings of ICASSP-90, pp. 849-852.
    • (1990) In: Proceedings of ICASSP-90 , pp. 849-852
    • Acero, A.1    Stern, R.M.2
  • 2
    • 0026385284
    • Robust speech recognition by normalization of the acoustic space
    • Acero, A., Stern, R.M., 1991. Robust speech recognition by normalization of the acoustic space. In: Proceedings of ICASSP-91, pp. 893-896.
    • (1991) In: Proceedings of ICASSP-91 , pp. 893-896
    • Acero, A.1    Stern, R.M.2
  • 3
    • 0030677475
    • Speaker adaptive training: A maximum likelihood approach to speaker normalization
    • Anastasakos, T., McDonough, J., Makhoul, J., 1997. Speaker adaptive training: a maximum likelihood approach to speaker normalization. In: Proceedings of ICASSP-97, pp. 1043-1046.
    • (1997) In: Proceedings of ICASSP-97 , pp. 1043-1046
    • Anastasakos, T.1    McDonough, J.2    Makhoul, J.3
  • 5
    • 0002629270
    • Maximum likelihood from incomplete data via the EM algorithm
    • Dempster A., Laird N., Rubin D. Maximum likelihood from incomplete data via the EM algorithm. J. Roy. Statist. Soc. 39:1977;1-38.
    • (1977) J. Roy. Statist. Soc. , vol.39 , pp. 1-38
    • Dempster, A.1    Laird, N.2    Rubin, D.3
  • 6
    • 84948598244
    • Statistical-model-based speech enhancement systems
    • Ephraim Y. Statistical-model-based speech enhancement systems. Proc. IEEE. 80:1992;1526-1555.
    • (1992) Proc. IEEE , vol.80 , pp. 1526-1555
    • Ephraim, Y.1
  • 7
    • 0015600423
    • The Viterbi algorithm
    • Forney G. The Viterbi algorithm. Proc. IEEE. 61:1973;268-278.
    • (1973) Proc. IEEE , vol.61 , pp. 268-278
    • Forney, G.1
  • 9
    • 0030263447
    • Mean and variance adaptation within the MLLR framework
    • Gales M.J.F., Woodland P.C. Mean and variance adaptation within the MLLR framework. Comput. Speech and Language. 10:1996;249-264.
    • (1996) Comput. Speech and Language , vol.10 , pp. 249-264
    • Gales, M.J.F.1    Woodland, P.C.2
  • 10
    • 0027622731
    • Cepstral parameter compensation for HMM recognition in noise
    • Gales M.J.F., Young S.J. Cepstral parameter compensation for HMM recognition in noise. Speech Communication. 12:1993;231-239.
    • (1993) Speech Communication , vol.12 , pp. 231-239
    • Gales, M.J.F.1    Young, S.J.2
  • 11
    • 0029390135
    • Robust speech recognition in additive and convolutional noise using parallel model combination
    • Gales M.J.F., Young S.J. Robust speech recognition in additive and convolutional noise using parallel model combination. Comput. Speech and Language. 9:1995;289-307.
    • (1995) Comput. Speech and Language , vol.9 , pp. 289-307
    • Gales, M.J.F.1    Young, S.J.2
  • 12
    • 0030245128
    • Robust continuous speech recognition using parallel model combination
    • Gales M.J.F., Young S.J. Robust continuous speech recognition using parallel model combination. IEEE Trans. Speech and Audio Process. 5:1996;352-359.
    • (1996) IEEE Trans. Speech and Audio Process. , vol.5 , pp. 352-359
    • Gales, M.J.F.1    Young, S.J.2
  • 13
    • 0029288202
    • Speech recognition in noisy environments: A survey
    • Gong Y. Speech recognition in noisy environments: A survey. Speech Communication. 16:1995;261-291.
    • (1995) Speech Communication , vol.16 , pp. 261-291
    • Gong, Y.1
  • 14
    • 0347321460
    • Source normalization training for HMM applied to noisy telephone speech recognition
    • Gong, Y., 1997. Source normalization training for HMM applied to noisy telephone speech recognition. In: Proceedings of EuroSpeech-97, Vol. 3, pp. 1555-1558.
    • (1997) In: Proceedings of EuroSpeech-97 , vol.3 , pp. 1555-1558
    • Gong, Y.1
  • 15
    • 0026135903
    • Constrained iterative speech enhancement with application to speech recognition
    • Hansen J.H.L., Clements M.A. Constrained iterative speech enhancement with application to speech recognition. IEEE Trans. Signal Process. 39:1991;795-805.
    • (1991) IEEE Trans. Signal Process. , vol.39 , pp. 795-805
    • Hansen, J.H.L.1    Clements, M.A.2
  • 17
    • 33747947441
    • A robust RNN-based pre-classification for Noisy Mandarin speech recognition
    • Hong, W.-T., Chen, S.-H., 1997. A robust RNN-based pre-classification for Noisy Mandarin speech recognition. In: Proceedings of EuroSpeech-97, Vol. 3, pp. 1083-1086.
    • (1997) In: Proceedings of EuroSpeech-97 , vol.3 , pp. 1083-1086
    • Hong, W.-T.1    Chen, S.-H.2
  • 18
    • 0343800873
    • RNN-based speech segmentation and its applications to robust noisy Mandarin speech recognition
    • revised
    • Hong, W.-T., Liao, Y.-F., Wang, Y.-R., Chen, S.-H., 1999. RNN-based speech segmentation and its applications to robust noisy Mandarin speech recognition. J. Acoust. Soc. Amer., revised.
    • (1999) J. Acoust. Soc. Amer.
    • Hong, W.-T.1    Liao, Y.-F.2    Wang, Y.-R.3    Chen, S.-H.4
  • 19
    • 0026189808
    • Speech recognition in adverse environment
    • Juang B.-H. Speech recognition in adverse environment. Comput. Speech and Language. 5:1991;275-294.
    • (1991) Comput. Speech and Language , vol.5 , pp. 275-294
    • Juang, B.-H.1
  • 20
    • 0025493667
    • The segmental K-means algorithm for estimating parameters of hidden Markov models
    • Juang B.-H., Rabiner L.R. The segmental K-means algorithm for estimating parameters of hidden Markov models. IEEE Trans. Acoust. Speech Signal Process. 38:1990;1639-1641.
    • (1990) IEEE Trans. Acoust. Speech Signal Process. , vol.38 , pp. 1639-1641
    • Juang, B.-H.1    Rabiner, L.R.2
  • 22
    • 0028461861
    • A robust algorithm for word boundary detection in the presence of noise
    • Junqua J.S., Mak B., Reaves B. A robust algorithm for word boundary detection in the presence of noise. IEEE Trans. Speech and Audio Process. 2:1994;406-412.
    • (1994) IEEE Trans. Speech and Audio Process. , vol.2 , pp. 406-412
    • Junqua, J.S.1    Mak, B.2    Reaves, B.3
  • 23
    • 0032140546
    • On stochastic feature and model compensation approaches to robust speech recognition
    • Lee C.-H. On stochastic feature and model compensation approaches to robust speech recognition. Speech Communication. 25:1998;29-47.
    • (1998) Speech Communication , vol.25 , pp. 29-47
    • Lee, C.-H.1
  • 24
    • 0005122887
    • A survey on automatic speech recognition with an illustrative example on continuous speech recognition of Mandarin
    • Lee C.-H., Juang B.-H. A survey on automatic speech recognition with an illustrative example on continuous speech recognition of Mandarin. J. Comput. Linguist. Chinese Language Process. 1:1996;1-36.
    • (1996) J. Comput. Linguist. Chinese Language Process. , vol.1 , pp. 1-36
    • Lee, C.-H.1    Juang, B.-H.2
  • 26
    • 0029748334
    • Speech recognition on Mandarin call home: A large vocabulary, conversational and telephone speech corpus
    • Liu, F.-H., Picheny, M., Srinivasa, P., Monkowski, M., Chen, J., 1996. Speech recognition on Mandarin call home: a large vocabulary, conversational and telephone speech corpus. In: Proceedings of ICASSP-96, Vol. 1, pp. 157-160.
    • (1996) In: Proceedings of ICASSP-96 , vol.1 , pp. 157-160
    • Liu, F.-H.1    Picheny, M.2    Srinivasa, P.3    Monkowski, M.4    Chen, J.5
  • 27
    • 0026882842
    • Experiments with a Nonlinear Spectral Subtractor (NSS), Hidden Markov Models and the projection, for robust speech recognition in cars
    • Lockwood P., Boudy J. Experiments with a Nonlinear Spectral Subtractor (NSS), Hidden Markov Models and the projection, for robust speech recognition in cars. Speech Communication. 11:1992;215-228.
    • (1992) Speech Communication , vol.11 , pp. 215-228
    • Lockwood, P.1    Boudy, J.2
  • 29
    • 0029745435
    • Adaptation method based on HMM composition and EM algorithm
    • Minami, Y., Furui, S., 1996. Adaptation method based on HMM composition and EM algorithm. In: Proceedings of ICASSP-96, pp. 327-330.
    • (1996) In: Proceedings of ICASSP-96 , pp. 327-330
    • Minami, Y.1    Furui, S.2
  • 30
    • 0029747581
    • Noise and room acoustics distorted speech recognition by HMM composition
    • Nakamura, S., Takiguchi, T., Shikano, K., 1996. Noise and room acoustics distorted speech recognition by HMM composition. In: Proceedings of ICASSP-96, Vol. 1, pp. 69-72.
    • (1996) In: Proceedings of ICASSP-96 , vol.1 , pp. 69-72
    • Nakamura, S.1    Takiguchi, T.2    Shikano, K.3
  • 31
    • 85135164500
    • Evaluating features set performance using the F-ratio and J-measures
    • Nicholson, S., Milner, B., Cox, S., 1997. Evaluating features set performance using the F-ratio and J-measures. In: Proceedings of EuroSpeech-97, Vol. 1, pp. 413-416.
    • (1997) In: Proceedings of EuroSpeech-97 , vol.1 , pp. 413-416
    • Nicholson, S.1    Milner, B.2    Cox, S.3
  • 32
    • 0029769867
    • Signal bias removal by maximum likelihood estimation for robust telephone speech recognition
    • Rahim M., Juang B.-H. Signal bias removal by maximum likelihood estimation for robust telephone speech recognition. IEEE Trans. Speech and Audio Process. 4:1996;19-30.
    • (1996) IEEE Trans. Speech and Audio Process. , vol.4 , pp. 19-30
    • Rahim, M.1    Juang, B.-H.2
  • 33
    • 0030149866
    • A maximum-likelihood approach to stochastic matching for robust speech recognition
    • Sankar A., Lee C.-H. A maximum-likelihood approach to stochastic matching for robust speech recognition. IEEE Trans. Speech and Audio Process. 4:1996;190-202.
    • (1996) IEEE Trans. Speech and Audio Process. , vol.4 , pp. 190-202
    • Sankar, A.1    Lee, C.-H.2
  • 34
    • 0027623210
    • Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems
    • Varga A., Steeneken H.J.M. Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems. Speech Communication. 12:1993;247-251.
    • (1993) Speech Communication , vol.12 , pp. 247-251
    • Varga, A.1    Steeneken, H.J.M.2
  • 35
    • 0030779363
    • Noise compensation methods for hidden Markov model speech recognition in adverse environments
    • Vaseghi S.V., Milner B.P. Noise compensation methods for hidden Markov model speech recognition in adverse environments. IEEE Trans. Speech and Audio Process. 5:1997;11-21.
    • (1997) IEEE Trans. Speech and Audio Process. , vol.5 , pp. 11-21
    • Vaseghi, S.V.1    Milner, B.P.2
  • 36
    • 0006498352
    • Mandarin telephone speech recognition for automatic telephone number directory service
    • Wang, Y.-R., Chen, S.-H., 1998. Mandarin telephone speech recognition for automatic telephone number directory service. In: Proceedings of ICASSP-98, Vol. 2, pp. 841-844.
    • (1998) In: Proceedings of ICASSP-98 , vol.2 , pp. 841-844
    • Wang, Y.-R.1    Chen, S.-H.2
  • 37
    • 0029770844
    • Self-learning speaker and channel adaptation based on spectral variation source decomposition
    • Zhao Y. Self-learning speaker and channel adaptation based on spectral variation source decomposition. Speech Communication. 18:1996;65-77.
    • (1996) Speech Communication , vol.18 , pp. 65-77
    • Zhao, Y.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.