Volume 6, Issue 6, 1998, Pages 524-537

A general joint additive and convolutive bias compensation approach applied to noisy Lombard speech recognition

Author keywords

Bias compensation; Continuous density HMM; Lombard speech; Noise

Indexed keywords

ALGORITHMS; DATABASE SYSTEMS; ERROR ANALYSIS; MARKOV PROCESSES; MATHEMATICAL MODELS; MAXIMUM LIKELIHOOD ESTIMATION;

EID: 0032203405     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/89.725319     Document Type: Article
Times cited: 29

References (34)
  • 2
    • 0030674098
    • A unified maximum likelihood approach to acoustic mismatch compensation: Application to noisy Lombard speech recognition
    • Munich, Germany, to be published.
    • M. Afify, Y. Gong, and J. P. Haton, "A unified maximum likelihood approach to acoustic mismatch compensation: Application to noisy Lombard speech recognition," in Proc. IEEE ICASSP'97, Munich, Germany, to be published.
    • Proc. IEEE ICASSP'97
    • Afify, M.1    Gong, Y.2    Haton, J.P.3
  • 3
    • 0027189327
    • Speech discrimination in adverse conditions using acoustic knowledge and selectively trained neural networks
    • Y. Anglade, D. Fohr, and J. C. Junqua, "Speech discrimination in adverse conditions using acoustic knowledge and selectively trained neural networks," in Proc. ICASSP'93, vol. 2, pp. 279-282.
    • Proc. ICASSP'93 , vol.2 , pp. 279-282
    • Anglade, Y.1    Fohr, D.2    Junqua, J.C.3
  • 4
    • 0023925221
    • Cepstral domain talker stress compensation for robust speech recognition
    • Y. Chen, "Cepstral domain talker stress compensation for robust speech recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. 36, pp. 433-439, Apr. 1988.
    • (1988) IEEE Trans. Acoust., Speech, Signal Processing , vol.36 , pp. 433-439
    • Chen, Y.1
  • 5
    • 0002629270
    • Maximum likelihood from incomplete data via the EM algorithm
    • A. Dempster, N. Laird, and D. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Stat. Soc., vol. 39, pp. 1-38, 1977.
    • (1977) J. R. Stat. Soc. , vol.39 , pp. 1-38
    • Dempster, A.1    Laird, N.2    Rubin, D.3
  • 6
    • 0029375590
    • Speaker adaptation using constrained estimation of Gaussian mixtures
    • V. Digalakis, D. Rtischev, and L. Neumeyer, "Speaker adaptation using constrained estimation of Gaussian mixtures," IEEE Trans. Speech Audio Processing, vol. 3, pp. 357-366, Sept. 1995.
    • (1995) IEEE Trans. Speech Audio Processing , vol.3 , pp. 357-366
    • Digalakis, V.1    Rtischev, D.2    Neumeyer, L.3
  • 7
    • 0030189744
    • Speaker adaptation using combined transformation and Bayesian methods
    • V. Digalakis and L. Neumeyer, "Speaker adaptation using combined transformation and Bayesian methods," IEEE Trans. Speech and Audio Processing, vol. 4, pp. 294-299, July 1996.
    • (1996) IEEE Trans. Speech and Audio Processing , vol.4 , pp. 294-299
    • Digalakis, V.1    Neumeyer, L.2
  • 8
  • 9
    • 0026881830
    • Gain-adapted hidden Markov models for recognition of clean and noisy speech
    • Y. Ephraim, "Gain-adapted hidden Markov models for recognition of clean and noisy speech," IEEE Trans. Signal Processing, vol. 40, pp. 1303-1316, June 1992.
    • (1992) IEEE Trans. Signal Processing , vol.40 , pp. 1303-1316
    • Ephraim, Y.1
  • 10
    • 0027622731
    • Cepstral parameter compensation for HMM recognition in noise
    • M. Gales and S. Young, "Cepstral parameter compensation for HMM recognition in noise," Speech Commun., vol. 12, pp. 231-239, July 1993.
    • (1993) Speech Commun. , vol.12 , pp. 231-239
    • Gales, M.1    Young, S.2
  • 11
    • 0028996863
    • A fast and flexible implementation of parallel model combination
    • "A fast and flexible implementation of parallel model combination," in Proc. ICASSP'95, vol. 1, pp. 133-136.
    • Proc. ICASSP'95 , vol.1 , pp. 133-136
  • 12
    • 0029390135
    • Robust speech recognition in additive and convolutive noise using parallel model combination
    • "Robust speech recognition in additive and convolutive noise using parallel model combination," Comput. Speech Lang., vol. 9, pp. 289-307, Oct. 1995.
    • (1995) Comput. Speech Lang. , vol.9 , pp. 289-307
  • 13
    • 0029288202
    • Speech recognition in noisy environments: A survey
    • Y. Gong, "Speech recognition in noisy environments: A survey," Speech Commun., vol. 16, pp. 261-291, June 1995.
    • (1995) Speech Commun. , vol.16 , pp. 261-291
    • Gong, Y.1
  • 14
    • 85106119047
    • Lombard effect compensation for robust automatic speech recognition in noise
    • J. H. L. Hansen and O. N. Bria, "Lombard effect compensation for robust automatic speech recognition in noise," in Proc. ICSLP'90, pp. 1125-1128.
    • Proc. ICSLP'90 , pp. 1125-1128
    • Hansen, J.H.L.1    Bria, O.N.2
  • 15
    • 0028516405
    • Morphological constrained feature enhancement with adaptive cepstral compensation for speech recognition in noise and Lombard effect
    • J. H. L. Hansen, "Morphological constrained feature enhancement with adaptive cepstral compensation for speech recognition in noise and Lombard effect," IEEE Trans. Speech Audio Processing, vol. 2, pp. 598-614, Oct. 1994.
    • (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 598-614
    • Hansen, J.H.L.1
  • 18
    • 0030196359
    • Feature analysis and neural network-based classification of speech under stress
    • J. H. L. Hansen and B. D. Womack, "Feature analysis and neural network-based classification of speech under stress," IEEE Trans. Speech Audio Processing, vol. 4, pp. 307-313, July 1996.
    • (1996) IEEE Trans. Speech Audio Processing , vol.4 , pp. 307-313
    • Hansen, J.H.L.1    Womack, B.D.2
  • 19
    • 0026982122
    • Discriminative learning for minimum error classification
    • B. H. Juang and S. Katagiri, "Discriminative learning for minimum error classification," IEEE Trans. Signal Processing, vol. 40, pp. 3043-3054, Dec. 1992.
    • (1992) IEEE Trans. Signal Processing , vol.40 , pp. 3043-3054
    • Juang, B.H.1    Katagiri, S.2
  • 20
    • 0027465491
    • The Lombard reflex and its role on human listeners and automatic speech recognizers
    • J. C. Junqua, "The Lombard reflex and its role on human listeners and automatic speech recognizers," J. Acoust. Soc. Amer., vol. 93, pp. 510-524, Jan. 1993.
    • (1993) J. Acoust. Soc. Amer. , vol.93 , pp. 510-524
    • Junqua, J.C.1
  • 21
    • 84864010278
    • Speaker adaptation of continuous density HMM's using multivariate linear regression
    • Yokohama, Japan
    • C. Leggetter and P. Woodland, "Speaker adaptation of continuous density HMM's using multivariate linear regression," in Proc. ICSLP'94, Yokohama, Japan, pp. 451-454.
    • Proc. ICSLP'94 , pp. 451-454
    • Leggetter, C.1    Woodland, P.2
  • 22
    • 0028996915
    • A maximum likelihood procedure for a universal adaptation method based on HMM composition
    • Y. Minami and S. Furui, "A maximum likelihood procedure for a universal adaptation method based on HMM composition," in Proc. ICASSP'95, vol. 1, pp. 129-132.
    • Proc. ICASSP'95 , vol.1 , pp. 129-132
    • Minami, Y.1    Furui, S.2
  • 24
    • 0028996864
    • Noisy speech recognition using robust inversion of hidden Markov models
    • S. Moon and J. Hwang, "Noisy speech recognition using robust inversion of hidden Markov models," in Proc. ICASSP'95, vol. 1, pp. 145-148.
    • Proc. ICASSP'95 , vol.1 , pp. 145-148
    • Moon, S.1    Hwang, J.2
  • 26
    • 0028516117
    • Training issues and channel equalization techniques for the construction of telephone acoustic models using high-quality speech corpus
    • L. G. Neumeyer, V. V. Digalakis, and M. Weintraub, "Training issues and channel equalization techniques for the construction of telephone acoustic models using high-quality speech corpus," IEEE Trans. Speech Audio Processing, vol. 2, pp. 590-597, Oct. 1994.
    • (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 590-597
    • Neumeyer, L.G.1    Digalakis, V.V.2    Weintraub, M.3
  • 27
    • 85079103466
    • Signal bias removal for robust telephone speech recognition
    • M. Rahim and B.-H. Juang, "Signal bias removal for robust telephone speech recognition," in Proc. ICASSP'94, vol. 1, pp. 445-448.
    • Proc. ICASSP'94 , vol.1 , pp. 445-448
    • Rahim, M.1    Juang, B.-H.2
  • 28
    • 0029769867
    • Signal bias removal by maximum likelihood estimation for robust telephone speech recognition
    • "Signal bias removal by maximum likelihood estimation for robust telephone speech recognition," IEEE Trans. Speech Audio Processing, vol. 4, pp. 19-30, Jan. 1996.
    • (1996) IEEE Trans. Speech Audio Processing , vol.4 , pp. 19-30
  • 29
    • 0028420014
    • Integrated models of signal and background with application to speaker identification in noise
    • R. Rose, E. Hofstetter, and D. Reynolds, "Integrated models of signal and background with application to speaker identification in noise," IEEE Trans. Speech Audio Processing, vol. 2, pp. 245-257, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 245-257
    • Rose, R.1    Hofstetter, E.2    Reynolds, D.3
  • 30
    • 0028996860
    • Robust speech recognition based on stochastic matching
    • A. Sankar and C. H. Lee, "Robust speech recognition based on stochastic matching," in Proc. ICASSP'95, vol. 1, pp. 121-124.
    • Proc. ICASSP'95 , vol.1 , pp. 121-124
    • Sankar, A.1    Lee, C.H.2
  • 31
    • 0030149866
    • A maximum likelihood approach to stochastic matching for robust speech recognition
    • "A maximum likelihood approach to stochastic matching for robust speech recognition," IEEE Trans. Speech Audio Processing, vol. 4, pp. 190-202, May 1996.
    • (1996) IEEE Trans. Speech Audio Processing , vol.4 , pp. 190-202
  • 33
    • 30244464907
    • Noise independent speech recognition for a variety of noise types
    • W. C. Treurniet and Y. Gong, "Noise independent speech recognition for a variety of noise types," in Proc. ICASSP'94, vol. 1, pp. 437-440.
    • Proc. ICASSP'94 , vol.1 , pp. 437-440
    • Treurniet, W.C.1    Gong, Y.2
  • 34
    • 0028460810
    • An acoustic-phonetic based speaker adaptation technique for improving speaker independent continuous speech recognition
    • Y. Zhao, "An acoustic-phonetic based speaker adaptation technique for improving speaker independent continuous speech recognition," IEEE Trans. Speech Audio Processing, vol. 2, pp. 380-394, July 1994.
    • (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 380-394
    • Zhao, Y.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.