메뉴 건너뛰기




Volumn 16, Issue 2, 2002, Pages 205-223

Hidden markov model training with contaminated speech material for distant-talking speech recognition

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTIC NOISE; ACOUSTIC NOISE MEASUREMENT; ADAPTIVE ALGORITHMS; MARKOV PROCESSES; SPEECH INTELLIGIBILITY; SPEECH SYNTHESIS;

EID: 0036556170     PISSN: 08852308     EISSN: None     Source Type: Journal    
DOI: 10.1006/csla.2002.0191     Document Type: Article
Times cited : (41)

References (54)
  • 4
    • 0019565863 scopus 로고
    • Computer-generated pulse signal applied for sound measurement
    • Aoshima, M. (1981). Computer-generated pulse signal applied for sound measurement. Journal of the Acoustical Society of America, 69, 1484-1488.
    • (1981) Journal of the Acoustical Society of America , vol.69 , pp. 1484-1488
    • Aoshima, M.1
  • 5
    • 0016067897 scopus 로고
    • Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
    • Atal, B. S. (1974). Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification. Journal of the Acoustical Society of America, 55, 1304-1312.
    • (1974) Journal of the Acoustical Society of America , vol.55 , pp. 1304-1312
    • Atal, B.S.1
  • 6
    • 0003152968 scopus 로고
    • Speech enhancement in the 1980s: Noise suppression with pattern matching
    • (S. Furui and M. M. Sondhi, eds), chapter 10
    • Boll, S. (1992). Speech enhancement in the 1980s: noise suppression with pattern matching. In Advances in Speech Signal Processing, (S. Furui and M. M. Sondhi, eds), chapter 10.
    • (1992) Advances in Speech Signal Processing
    • Boll, S.1
  • 12
    • 0026881830 scopus 로고
    • Gain-adapted hidden Markov models for recognition of clean and noisy speech
    • Ephraim, Y. (1992). Gain-adapted hidden Markov models for recognition of clean and noisy speech. IEEE Transactions on Signal Processing, 40, 1303-1316.
    • (1992) IEEE Transactions on Signal Processing , vol.40 , pp. 1303-1316
    • Ephraim, Y.1
  • 17
    • 0003671941 scopus 로고
    • Model-based techniques for noise robust speech recognition
    • PhD Thesis, Cambridge University
    • Gales, M. J. F. (1995). Model-based techniques for noise robust speech recognition. PhD Thesis, Cambridge University.
    • (1995)
    • Gales, M.J.F.1
  • 18
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • Gales, M. J. F. (1998). Maximum likelihood linear transformations for HMM-based speech recognition. Computer Speech and Language, 12, 75-98.
    • (1998) Computer Speech and Language , vol.12 , pp. 75-98
    • Gales, M.J.F.1
  • 19
    • 0030263447 scopus 로고    scopus 로고
    • Mean and variance adaptation within the MLLR framework
    • Gales, M. J. F. & Woodland, P. C. (1996). Mean and variance adaptation within the MLLR framework. Computer Speech and Language, 10, 249-264.
    • (1996) Computer Speech and Language , vol.10 , pp. 249-264
    • Gales, M.J.F.1    Woodland, P.C.2
  • 22
    • 0028419019 scopus 로고
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • Gauvain, J. L. & Lee, C. H. (1994). Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains. IEEE Transactions on Speech and Audio Processing, 2, 291-298.
    • (1994) IEEE Transactions on Speech and Audio Processing , vol.2 , pp. 291-298
    • Gauvain, J.L.1    Lee, C.H.2
  • 23
    • 84908169893 scopus 로고    scopus 로고
    • Use of different microphone array configurations for hands-free speech recognition in noisy and reverberant environment
    • Giuliani, D., Matassoni, M., Omologo, M. & Svaizer, P. (1997). Use of different microphone array configurations for hands-free speech recognition in noisy and reverberant environment. Proceedings of the EUROSPEECH, volume 1, pp. 347-350.
    • (1997) Proceedings of the EUROSPEECH , vol.1 , pp. 347-350
    • Giuliani, D.1    Matassoni, M.2    Omologo, M.3    Svaizer, P.4
  • 26
    • 0029288202 scopus 로고
    • Speech recognition in noisy environments: A survey
    • Gong, Y. (1995). Speech recognition in noisy environments: a survey. Speech Communication, 16, 261-291.
    • (1995) Speech Communication , vol.16 , pp. 261-291
    • Gong, Y.1
  • 29
    • 0027465491 scopus 로고
    • The Lombard reflex and its role on human listeners and automatic speech recognizers
    • Junqua, J. C. (1993). The Lombard reflex and its role on human listeners and automatic speech recognizers. Journal of the Acoustical Society of America, 1, 510-524.
    • (1993) Journal of the Acoustical Society of America , vol.1 , pp. 510-524
    • Junqua, J.C.1
  • 32
    • 0032651723 scopus 로고    scopus 로고
    • Integrated bias removal techniques for robust speech recognition
    • Lawrence, C. & Rahim, M. (1999). Integrated bias removal techniques for robust speech recognition. Computer Speech and Language, 13, 283-298.
    • (1999) Computer Speech and Language , vol.13 , pp. 283-298
    • Lawrence, C.1    Rahim, M.2
  • 33
    • 0032140546 scopus 로고    scopus 로고
    • On stochastic feature and model compensation approaches to robust speech recognition
    • Lee, C. H. (1998). On stochastic feature and model compensation approaches to robust speech recognition. Speech Communication, 25, 29-47.
    • (1998) Speech Communication , vol.25 , pp. 29-47
    • Lee, C.H.1
  • 35
    • 85135194048 scopus 로고
    • Flexible speaker adaptation for large vocabulary speech recognition
    • Leggetter, C. J. & Woodland, P. C. (1995a). Flexible speaker adaptation for large vocabulary speech recognition. Proceedings of EUROSPEECH, volume, 2, pp. 1155-1158.
    • (1995) Proceedings of EUROSPEECH , vol.2 , pp. 1155-1158
    • Leggetter, C.J.1    Woodland, P.C.2
  • 36
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • Leggetter, C. J. & Woodland, P. C. (1995b). Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models. Computer Speech and Language, 9, 171-185.
    • (1995) Computer Speech and Language , vol.9 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 41
    • 0032142014 scopus 로고    scopus 로고
    • Environmental conditions and acoustic transduction in hands-free speech recognition
    • Omologo, M., Svaizer, P. & Matassoni, M. (1998) Environmental conditions and acoustic transduction in hands-free speech recognition. Speech Communication, 25, 75-95.
    • (1998) Speech Communication , vol.25 , pp. 75-95
    • Omologo, M.1    Svaizer, P.2    Matassoni, M.3
  • 44
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Rabiner, L. R. (1989). A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of IEEE, 77, 257-286.
    • (1989) Proceedings of IEEE , vol.77 , pp. 257-286
    • Rabiner, L.R.1
  • 46
    • 0029769867 scopus 로고    scopus 로고
    • Signal bias removal by maximum likelihood estimation for robust telephone speech recognition
    • Rahim, M. & Juang, B. H. (1996). Signal bias removal by maximum likelihood estimation for robust telephone speech recognition. IEEE Transactions on Speech and Audio Processing, 4, 19-30.
    • (1996) IEEE Transactions on Speech and Audio Processing , vol.4 , pp. 19-30
    • Rahim, M.1    Juang, B.H.2
  • 47
    • 0028420014 scopus 로고
    • Integrated models of signals and background noise with application to speaker identification in noise
    • Rose, R. C., Hofstetter, E. M. & Reynolds, D. A. (1994). Integrated models of signals and background noise with application to speaker identification in noise. IEEE Transactions on Speech and Audio Processing, 2, 245-257.
    • (1994) IEEE Transactions on Speech and Audio Processing , vol.2 , pp. 245-257
    • Rose, R.C.1    Hofstetter, E.M.2    Reynolds, D.A.3
  • 49
    • 0030149866 scopus 로고    scopus 로고
    • A maximum-likelihood approach to stochastic matching for robust speech recognition
    • Sankar, A. & Lee, C. H. (1996). A maximum-likelihood approach to stochastic matching for robust speech recognition. IEEE Transactions on Speech and Audio Processing, 4, 190-202.
    • (1996) IEEE Transactions on Speech and Audio Processing , vol.4 , pp. 190-202
    • Sankar, A.1    Lee, C.H.2
  • 50
    • 0028812427 scopus 로고
    • An optimum computer-generated pulse signal suitable for the measurement of very long impulse responses
    • Suzuki, Y., Asano, F., Kim, H. Y. & Sone, T. (1995). An optimum computer-generated pulse signal suitable for the measurement of very long impulse responses. Journal of the Acoustical Society of America, 97, 1119-1123.
    • (1995) Journal of the Acoustical Society of America , vol.97 , pp. 1119-1123
    • Suzuki, Y.1    Asano, F.2    Kim, H.Y.3    Sone, T.4
  • 54
    • 0001459635 scopus 로고    scopus 로고
    • Frequency-domain maximum likelihood estimation for automatic speech recognition in additive and convolutive noises
    • Zhao, Y. (2000). Frequency-domain maximum likelihood estimation for automatic speech recognition in additive and convolutive noises. IEEE Transactions on Speech and Audio Processing, 8, 255-266.
    • (2000) IEEE Transactions on Speech and Audio Processing , vol.8 , pp. 255-266
    • Zhao, Y.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.