SCOPUS 정보 검색 플랫폼

IEEE Transactions on Speech and Audio Processing

Volumn 6, Issue 6, 1998, Pages 524-537

A general joint additive and convolutive bias compensation approach applied to noisy lombard speech recognition

(1) Afify, Mohamed a

a LORIA (France)

Author keywords

Bias compensation; Continuous density hmm; Lombard speech; Noise

Indexed keywords

ALGORITHMS; DATABASE SYSTEMS; ERROR ANALYSIS; MARKOV PROCESSES; MATHEMATICAL MODELS; MAXIMUM LIKELIHOOD ESTIMATION;

ADDITIVE BIAS COMPENSATION ALGORITHMS; CONTINUOUS DENSITY HIDDEN MARKOV MODELS (CDHMM); EXPECTATION MAXIMIZATION (EM) METHOD; LOMBARD SPEECH; PARALLEL MODEL COMBINATION (PMC) TRANSFORMATIONS;

CONTINUOUS SPEECH RECOGNITION;

EID: 0032203405 PISSN: 10636676 EISSN: None Source Type: Journal
DOI: 10.1109/89.725319 Document Type: Article

Times cited : (29)

References (34)

1
- 0004319970
- A. Acero, Acoustical Environmental Robustness in Automatic Speech Recognition. Boston, MA: Kluwer, 1992.
- (1992) Acoustical Environmental Robustness in Automatic Speech Recognition. Boston, MA: Kluwer
- Acero, A.¹

2
- 0030674098
- A unified maximum likelihood approach to acoustic mismatch compensation: Application to noisy Lombard speech recognition, in
- Munich, Germany, to be published.
- M. Afify, Y. Gong, and J. P. Haton, "A unified maximum likelihood approach to acoustic mismatch compensation: Application to noisy Lombard speech recognition," in Proc. IEEE ICASSP'97, Munich, Germany, to be published.
- Proc. IEEE ICASSP'97
- Afify, M.¹ Gong, Y.² Haton, J.P.³

3
- 0027189327
- Speech discrimination in adverse conditions using acoustic knowledge and selectively trained neural networks, in
- Y. Anglade,.D. Fohr, and J. C. Junqua, "Speech discrimination in adverse conditions using acoustic knowledge and selectively trained neural networks," in Proc. ICASSP'93, vol. 2, pp. 279-282.
- Proc. ICASSP'93 , vol.2 , pp. 279-282
- Anglade, Y.¹ Fohr, D.² Junqua, J.C.³

4
- 0023925221
- Cepstral domain talker stress compensation for robust speech recognition
- Y. Chen, "Cepstral domain talker stress compensation for robust speech recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. 36, pp. 433-139, Apr. 1988.
- (1988) IEEE Trans. Acoust., Speech, Signal Processing , vol.36 , pp. 433-139
- Chen, Y.¹

5
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- A. Dempster, N. Laird, and D. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Stat. Soc., vol. 39, pp. 1-38, 1977.
- (1977) J. R. Stat. Soc. , vol.39 , pp. 1-38
- Dempster, A.¹ Laird, N.² Rubin, D.³

6
- 0029375590
- Speaker adaptation using constrained estimation of Gaussian mixtures
- V. Digalakis, D. Rtischev, and L. Neumeyer, "Speaker adaptation using constrained estimation of Gaussian mixtures," IEEE Trans. Speech Audio Processing, vol. 3, pp. 357-366, Sept. 1995.
- (1995) IEEE Trans. Speech Audio Processing , vol.3 , pp. 357-366
- Digalakis, V.¹ Rtischev, D.² Neumeyer, L.³

7
- 0030189744
- Speaker adaptation using combined transformation and Bayesian methods
- V. Digalakis and L. Neumeyer, "Speaker adaptation using combined transformation and Bayesian methods," IEEE Trans. Speech and Audio Processing, vol. 4, pp. 294-299, July 1996.
- (1996) IEEE Trans. Speech and Audio Processing , vol.4 , pp. 294-299
- Digalakis, V.¹ Neumeyer, L.²

8
- 0024909863
- On the application of hidden Markov models for enhancing noisy speech
- Y. Ephraim, D. Malah, and B. H. Juang, "On the application of hidden Markov models for enhancing noisy speech," IEEE Trans. Acoust., Speech, Signal Processing, vol. 37, pp. 1846-1856, Dec. 1989.
- (1989) IEEE Trans. Acoust., Speech, Signal Processing , vol.37 , pp. 1846-1856
- Ephraim, Y.¹ Malah, D.² Juang, B.H.³

9
- 0026881830
- Gain-adapted hidden Markov models for recognition of clean and noisy speech
- Y. Ephraim, "Gain-adapted hidden Markov models for recognition of clean and noisy speech," IEEE Trans. Signal Processing, vol. 40, pp. 1303-1316, June 1992.
- (1992) IEEE Trans. Signal Processing , vol.40 , pp. 1303-1316
- Ephraim, Y.¹

10
- 0027622731
- Cepstral parameter compensation for HMM recognition in noise
- M. Gales and S. Young, "Cepstral parameter compensation for HMM recognition in noise," Speech Commun., vol. 12, pp. 231-239, July 1993.
- (1993) Speech Commun. , vol.12 , pp. 231-239
- Gales, M.¹ Young, S.²

11
- 0028996863
- A fast and flexible implementation of parallel model combination
- A fast and flexible implementation of parallel model combination," in Proc. ICASSP'95, vol. 1, pp. 133-136.
- Proc. ICASSP'95 , vol.1 , pp. 133-136

12
- 0029390135
- Robust speech recognition in additive and convolutive noise using parallel model combination
- Oct.
- Robust speech recognition in additive and convolutive noise using parallel model combination," Comput. Speech Lang., vol. 9, pp. 289-307, Oct. 1995.
- (1995) Comput. Speech Lang. , vol.9 , pp. 289-307

13
- 0029288202
- Speech recognition in noisy environments: A survey
- Y. Gong, "Speech recognition in noisy environments: A survey," Speech Commun., vol. 16, pp. 261-291, June 1995.
- (1995) Speech Commun. , vol.16 , pp. 261-291
- Gong, Y.¹

14
- 85106119047
- Lombard effect compensation for robust automatic speech recognition in noise, in
- J. H. L. Hansen and O. N. Bria, "Lombard effect compensation for robust automatic speech recognition in noise," in Proc. ICSLP'90, pp. 1125-1128.
- Proc. ICSLP'90 , pp. 1125-1128
- Hansen, J.H.L.¹ Bria, O.N.²

15
- 0028516405
- Morphological constrained feature enhancement with adaptive cepstral compensation for speech recognition in noise and Lombard effect
- J. H. L. Hansen, "Morphological constrained feature enhancement with adaptive cepstral compensation for speech recognition in noise and Lombard effect," IEEE Trans. Speech Audio Processing, vol. 2, pp. 598-614, Oct. 1994.
- (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 598-614
- Hansen, J.H.L.¹

16
- 33646923064
- J. H. L. Hansen and M. A. Clements, "Source generator equalization and enhancement of spectral properties for robust speech recognition
- Source generator equalization and enhancement of spectral properties for robust speech recognition
- Hansen, J.H.L.¹ Clements, M.A.²

17
- 0029512832
- in noise and stress," IEEE Trans. Speech Audio Processing, vol. 3, pp. 407-415, Sept. 1995.
- (1995) IEEE Trans. Speech Audio Processing , vol.3 , pp. 407-415

18
- 0030196359
- Feature analysis and neural network-based classification of speech under stress
- J. H. L. Hansen and B. D. Womack, "Feature analysis and neural network-based classification of speech under stress," IEEE Trans. Speech Audio Processing, vol. 4, pp. 307-313, July 1996.
- (1996) IEEE Trans. Speech Audio Processing , vol.4 , pp. 307-313
- Hansen, J.H.L.¹ Womack, B.D.²

19
- 0026982122
- Discriminative learning for minimum error classification
- B. H. Juang and S. Katagiri, "Discriminative learning for minimum error classification," IEEE Trans. Signal Processing, vol. 40, pp. 3043-3054, Dec. 1992.
- (1992) IEEE Trans. Signal Processing , vol.40 , pp. 3043-3054
- Juang, B.H.¹ Katagiri, S.²

20
- 0027465491
- The Lombard reflex and its role on human listeners and automatic speech recognizers
- J. C. Junqua, "The Lombard reflex and its role on human listeners and automatic speech recognizers," J. Acoust. Soc. Amer., vol. 93, pp. 510-524, Jan. 1993.
- (1993) J. Acoust. Soc. Amer. , vol.93 , pp. 510-524
- Junqua, J.C.¹

21
- 84864010278
- Speaker adaptation of continuous density HMM's using multivariate linear regression, in
- Yokohma, Japan
- C. Leggetter and P. Woodland, "Speaker adaptation of continuous density HMM's using multivariate linear regression," in Proc. ICSLP'94, Yokohma, Japan, pp. 451-454.
- Proc. ICSLP'94 , pp. 451-454
- Leggetter, C.¹ Woodland, P.²

22
- 0028996915
- A maximum likelihood procedure for a universal adaptation method based on HMM composition, in
- Y. Minami and S. Furui, "A maximum likelihood procedure for a universal adaptation method based on HMM composition," in Proc. ICASSP'95, vol. 1, pp. 129-132.
- Proc. ICASSP'95 , vol.1 , pp. 129-132
- Minami, Y.¹ Furui, S.²

23
- 0029375754
- Automatic word recognition in cars
- C. Mokbel and G. Chollet, "Automatic word recognition in cars," IEEE Trans. Speech Audio Processing, vol. 3, pp. 346-356, Sept. 1995.
- (1995) IEEE Trans. Speech Audio Processing , vol.3 , pp. 346-356
- Mokbel, C.¹ Chollet, G.²

24
- 0028996864
- Noisy speech recognition using robust inversion of hidden Markov models, in
- S. Moon and J. Hwang, "Noisy speech recognition using robust inversion of hidden Markov models," in Proc. ICASSP'95, vol. 1, pp. 145-148.
- Proc. ICASSP'95 , vol.1 , pp. 145-148
- Moon, S.¹ Hwang, J.²

25
- 0024753593
- Speech recognition using noise adaptive prototypes
- A. Nadas, D. Nahamoo, and M. Pichney, "Speech recognition using noise adaptive prototypes," IEEE Trans. Acoust., Speech, Signal Processing, vol. 37, pp. 1495-1503, Oct. 1989.
- (1989) IEEE Trans. Acoust., Speech, Signal Processing , vol.37 , pp. 1495-1503
- Nadas, A.¹ Nahamoo, D.² Pichney, M.³

26
- 0028516117
- Training issues and channel equalization techniques for the construction of telephone acoustic models using high-quality speech corpus
- L. G. Neumeyer, V. V. Digalakis, and M. Weintraub, "Training issues and channel equalization techniques for the construction of telephone acoustic models using high-quality speech corpus," IEEE Trans. Speech Audio Processing, vol. 2, pp. 590-597, Oct. 1994.
- (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 590-597
- Neumeyer, L.G.¹ Digalakis, V.V.² Weintraub, M.³

27
- 85079103466
- Signal bias removal for robust telephone speech recognition, in
- M. Rahim and B.-H. Juang, "Signal bias removal for robust telephone speech recognition," in Proc. ICASSP'94, vol. 1, pp. 445-448.
- Proc. ICASSP'94 , vol.1 , pp. 445-448
- Rahim, M.¹ Juang, B.-H.²

28
- 0029769867
- Signal bias removal by maximum likelihood estimation for robust telephone speech recognition
- Signal bias removal by maximum likelihood estimation for robust telephone speech recognition," IEEE Trans. Speech Audio Processing, vol. 4, pp. 19-30, Jan. 1996.
- (1996) IEEE Trans. Speech Audio Processing , vol.4 , pp. 19-30

29
- 0028420014
- Integrated models of signal and background with application to speaker identification in noise
- R. Rose, E. Hofstetter, and D. Reynolds, "Integrated models of signal and background with application to speaker identification in noise," IEEE Trans. Speech Audio Processing, vol. 2, pp. 245-257, Apr. 1994.
- (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 245-257
- Rose, R.¹ Hofstetter, E.² Reynolds, D.³

30
- 0028996860
- Robust speech recognition based on stochastic matching, in
- A. Sankar and C. H. Lee, "Robust speech recognition based on stochastic matching," in Proc. ICASSP'95, vol. 1, pp. 121-124.
- Proc. ICASSP'95 , vol.1 , pp. 121-124
- Sankar, A.¹ Lee, C.H.²

31
- 0030149866
- A maximum likelihood approach to stochastic matching for robust speech recognition
- A maximum likelihood approach to stochastic matching for robust speech recognition," IEEE Trans. Speech Audio Processing, vol. 4, pp. 190-202, May 1996.
- (1996) IEEE Trans. Speech Audio Processing , vol.4 , pp. 190-202

32
- 33646938665
- Ph.D. dissertation, Univ. Nancy, France, Sept.
- O. Siohan, "Continuous speech recognition in noisy environments: Application to stochastic trajectory models," Ph.D. dissertation, Univ. Nancy, France, Sept. 1995.
- (1995) Continuous speech recognition in noisy environments: Application to stochastic trajectory models
- Siohan, O.¹

33
- 30244464907
- Noise independent speech recognition for a variety of noise types, in
- W. C. Treurniet and Y. Gong, "Noise independent speech recognition for a variety of noise types," in Proc. ICASSP'94, vol. 1, pp. 437-440.
- Proc. ICASSP'94 , vol.1 , pp. 437-440
- Treurniet, W.C.¹ Gong, Y.²

34
- 0028460810
- An acoustic-phonetic based speaker adaptation technique for improving speaker independent continuous speech recognition
- Y. Zhao, "An acoustic-phonetic based speaker adaptation technique for improving speaker independent continuous speech recognition," IEEE Trans. Speech Audio Processing, vol. 2, pp. 380-394, July 1994.
- (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 380-394
- Zhao, Y.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.