SCOPUS 정보 검색 플랫폼

IEEE Transactions on Speech and Audio Processing

Volumn 4, Issue 3, 1996, Pages 190-202

A maximum-likelihood approach to stochastic matching for robust speech recognition

(2) Sankar, Ananth a,b Lee, Chin Hui a

a LUCENT TECHNOLOGIES (United States)

b SRI INTERNATIONAL (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; ESTIMATION; FEATURE EXTRACTION; MARKOV PROCESSES; MATHEMATICAL MODELS; MATHEMATICAL TRANSFORMATIONS; RANDOM PROCESSES; SIGNAL DISTORTION; TRANSDUCERS;

EXPECTATION MAXIMIZATION ALGORITHM; HIDDEN MARKOV MODELS; INVERSE DISTORTION FUNCTION; SPEECH RECOGNITION SYSTEM; WORD ERROR RATE;

SPEECH RECOGNITION;

EID: 0030149866 PISSN: 10636676 EISSN: None Source Type: Journal
DOI: 10.1109/89.496215 Document Type: Article

Times cited : (308)

References (48)

1
- 0026189808
- Speech recognition in adverse environments
- B.-H. Juang, "Speech recognition in adverse environments," Comput. Speech Lang., vol. 5, pp. 275-294, 1991.
- (1991) Comput. Speech Lang. , vol.5 , pp. 275-294
- Juang, B.-H.¹

2
- 0023165215
- On the use of bandpass liftering in speech recognition
- July
- B.-H. Juang, L. R. Rabiner, and J. G. Wilpon, "On the use of bandpass liftering in speech recognition," IEEE Trans. Acoiist., Speech. Signal. Processing, vol. ASSP-35, pp. 947-954, July 1987.
- (1987) IEEE Trans. Acoiist., Speech. Signal. Processing, Vol. ASSP , vol.35 , pp. 947-954
- Juang, B.-H.¹ Rabiner, L.R.² Wilpon, J.G.³

3
- 0016067897
- Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
- June
- B. Atal, "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," J. Acoust. Soc. Amer., vol. 55, pp. 1304-1312, June 1974.
- (1974) J. Acoust. Soc. Amer. , vol.55 , pp. 1304-1312
- Atal, B.¹

4
- 85135377175
- Compensation for the effect of the communication channel in auditory-like analysis of speech (RASTA-PLP), in
- H. Hermansky, N. Morgan, A. Bayya, and P. Kohn, "Compensation for The effect of the communication channel in auditory-like analysis of speech (RASTA-PLP)," in Proc. EVROSPEECH, 1991, pp. 1367-1370.
- Proc. EVROSPEECH , vol.1991 , pp. 1367-1370
- Hermansky, H.¹ Morgan, N.² Bayya, A.³ Kohn, P.⁴

5
- 0000030810
- Auditory nerve representation as a basis for speech processing, in
- O. Ghitza, "Auditory nerve representation as a basis for speech processing," in Advances in Speech and Signal Processing, S. Furui and M. Sondhi, Eds. New York: Marcel Dekker, 1991, pp. 453-485.
- (1991) Advances in Speech and Signal Processing, S. Furui and M. Sondhi, Eds. New York: Marcel Dekker , pp. 453-485
- Ghitza, O.¹

6
- 84928837806
- A joint synchrony/mean-rate model of auditory speech processing
- S. Seneff, "A joint synchrony/mean-rate model of auditory speech processing," J. Phonetics, vol. 16, pp. 55-76, 1990.
- (1990) J. Phonetics , vol.16 , pp. 55-76
- Seneff, S.¹

7
- 0027646437
- On the use of a family of signal limiters for recognition of noisy speech
- C.-H. Lee and C.-H. Lin, "On the use of a family of signal limiters for recognition of noisy speech," Speech Commun., vol. 12, pp. 383-392, 1993.
- (1993) Speech Commun. , vol.12 , pp. 383-392
- Lee C-H¹ Lin C-H²

8
- 0027229711
- Influence of background noise and microphone on the performance of the IBM TANGORA speech recognition system, in Proc
- S. Das et al., "Influence of background noise and microphone on the performance of the IBM TANGORA speech recognition system," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, 1993, pp. 11-71.
- (1993) IEEE Int. Conf. Acoust., Speech, Signal Processing , pp. 11-71
- Das Et Al, S.¹

9
- 0024885488
- Spectral estimation using a log-distance error criterion applied to speech recognition, in
- D. Van Compernolle, "Spectral estimation using a log-distance error criterion applied to speech recognition," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing. 1989, pp. 258-261.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing. 1989 , pp. 258-261
- Van Compernolle, D.¹

10
- 0006923547
- Noise adaptation in a hidden Markov model speech recognition system
- _, "Noise adaptation in a hidden Markov model speech recognition system," Compiit. Speech Lang., vol. 3, pp. 151-167, 1989.
- (1989) Compiit. Speech Lang. , vol.3 , pp. 151-167

11
- 0021158675
- Optimal estimators for spectral restoration of noisy speech, in
- J. E. Porter and S. F. Boll, "Optimal estimators for spectral restoration of noisy speech," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, 1984, pp. 18.
- (1984) Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , pp. 18
- Porter, J.E.¹ Boll, S.F.²

12
- 0021645331
- Speech enhancement using a minimum mean square error short-time spectral amplitude estimator
- Dec.
- Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean square error short-time spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-32, pp. 1109-1121, Dec. 1984.
- (1984) IEEE Trans. Acoust., Speech, Signal Processing, Vol. ASSP , vol.32 , pp. 1109-1121
- Ephraim, Y.¹ Malah, D.²

13
- 0021892216
- Speech enhancement using a minimum mean square error log-spectral amplitude estimator
- Apr.
- -, "Speech enhancement using a minimum mean square error log-spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-33, pp. 443-445, Apr. 1985.
- (1985) IEEE Trans. Acoust., Speech, Signal Processing, Vol. ASSP , vol.33 , pp. 443-445

14
- 0025621983
- Estimation using log-spectral-distance criterion for noise-robust speech recognition, in
- A. Ereil and M. Weintraub, "Estimation using log-spectral-distance criterion for noise-robust speech recognition," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, 1990, pp. 853-856.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, 1990 , pp. 853-856
- Ereil, A.¹ Weintraub, M.²

15
- 0025587084
- A minimum mean square error approach for speech enhancement, in
- Y. Ephraim, "A minimum mean square error approach for speech enhancement," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, 1990, pp. 829-832.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , vol.1990 , pp. 829-832
- Ephraim, Y.¹

16
- 0001873457
- Filterbank-energy estimation using mixture and Markov models for recognition of noisy speech
- Jan.
- A. Ereil and M. Weintraub, "Filterbank-energy estimation using mixture and Markov models for recognition of noisy speech," IEEE Trans. Speech Audio Processing, vol. 1, pp. 68-76, Jan. 1993.
- (1993) IEEE Trans. Speech Audio Processing , vol.1 , pp. 68-76
- Ereil, A.¹ Weintraub, M.²

17
- 84944816135
- A digital filterbank for spectral matching, in
- D. Klatt, "A digital filterbank for spectral matching," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, 1976, pp. 573-576.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , vol.1976 , pp. 573-576
- Klatt, D.¹

18
- 0022883703
- Noise compensation for speech recognition using probabilistic models, in
- J. Holmes and N. Sedgwick, "Noise compensation for speech recognition using probabilistic models," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, pp. 14, 1986.
- (1986) Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , pp. 14
- Holmes, J.¹ Sedgwick, N.²

19
- 0023739211
- Speech recognition using noise-adaptive prototypes, in
- A. Nadas, D. Nahamoo, and M. Picheny, "Speech recognition using noise-adaptive prototypes," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing. 1988, pp. 517-520.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing. , vol.1988 , pp. 517-520
- Nadas, A.¹ Nahamoo, D.² Picheny, M.³

20
- 0023671987
- Noise compensation algorithms for use with hidden Markov model based speech recognition, in
- A. Varga, R. Moore, J. Bridle, K. Ponting, and M. Russell, "Noise compensation algorithms for use with hidden Markov model based speech recognition," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, 1988, pp. 481-484.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , vol.1988 , pp. 481-484
- Varga, A.¹ Moore, R.² Bridle, J.³ Ponting, K.⁴ Russell, M.⁵

21
- 0025681008
- Hidden Markov model decomposition of speech and noise, in
- A. Varga and R. Moore, "Hidden Markov model decomposition of speech and noise," in Proc. IEEE Int. Conf. Acousl., Speech, Signal Processing, 1990, pp. 845-848.
- (1990) Proc. IEEE Int. Conf. Acousl., Speech, Signal Processing , pp. 845-848
- Varga, A.¹ Moore, R.²

22
- 85006657791
- Speech recognition using hidden Markov model decomposition and a general background speech model, in
- M. Wang and S. Young, "Speech recognition using hidden Markov model decomposition and a general background speech model," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, 1992, pp. I-253-I-256.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , vol.1992
- Wang, M.¹ Young, S.²

23
- 85017310148
- An improved approach to the hidden Markov model decomposition of speech and noise, in
- M. Gales and S. Young, "An improved approach to the hidden Markov model decomposition of speech and noise," in Proc. IEEE Int. Conf. Acomt., Speech, Signal Processing, 1992, pp. I-233-I-236.
- Proc. IEEE Int. Conf. Acomt., Speech, Signal Processing , vol.1992
- Gales, M.¹ Young, S.²

24
- 0028420014
- R. Rose, E. Hofstetter, and D. Rey nolds"lntegrated models of speech and background with application to speaker identification in noise," IEEE Trans. Speech Audio Processing, vol. 2, pp. 245-257, Apr. 1994.
- (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 245-257
- Rose, R.¹ Hofstetter, E.²

25
- 0026881830
- Gain-adapted hidden Markov models for recognition of clean and noisy speech
- June
- Y. Ephraim, "Gain-adapted hidden Markov models for recognition of clean and noisy speech," IEEE Trans. Signal Processing, vol. 40, pp. 1303-1316, June 1992.
- (1992) IEEE Trans. Signal Processing , vol.40 , pp. 1303-1316
- Ephraim, Y.¹

26
- 0002671953
- A minimax classification approach with application to robust speech recognition
- Jan.
- N. Merhav and C.-H. Lee, "A minimax classification approach with application to robust speech recognition," IEEE Trans. Speech Audio Processing, vol. 1, pp. 90-100, Jan. 1993.
- (1993) IEEE Trans. Speech Audio Processing , vol.1 , pp. 90-100
- Merhav, N.¹ Lee C-H²

27
- 0024940640
- Unsupervised speaker adaptation by probabilistic spectrum fitting, in
- S. Cox and J. Bridle, "Unsupervised speaker adaptation by probabilistic spectrum fitting," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, 1989, pp. 294-297.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , vol.1989 , pp. 294-297
- Cox, S.¹ Bridle, J.²

28
- 0025587779
- Simultaneous speaker normalization and utterance labeling using Bayesian/neural net techniques, in
- -, "Simultaneous speaker normalization and utterance labeling using Bayesian/neural net techniques," in Proc. IEEE Int. Conf. Acousl., Speech, Signal Processing, 1990, pp. 161-164.
- Proc. IEEE Int. Conf. Acousl., Speech, Signal Processing , vol.1990 , pp. 161-164

29
- 0027167189
- A new speaker adaptation technique using very short calibration speech, in
- Y. Zhao, "A new speaker adaptation technique using very short calibration speech," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, 1993, pp. II-562-II-565.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , vol.1993
- Zhao, Y.¹

30
- 85079103466
- Signal bias removal for robust telephone speech recognition in adverse environments, in
- M. Rahim and B.-H. Juang, "Signal bias removal for robust telephone speech recognition in adverse environments," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, 1994.
- (1994) Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing
- Rahim, M.¹ Juang, B.-H.²

31
- 0025628728
- Environmental robustness in automatic speech recognition, in
- A. Acero and R. Stern, "Environmental robustness in automatic speech recognition," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, 1990, pp. 849-852.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , vol.1990 , pp. 849-852
- Acero, A.¹ Stern, R.²

32
- 0004319970
- A. Acero, Acoustical and Environmental Robustness in Automatic Speech Recognition. Boston, MA: Kluwer, 1992.
- (1992) Acoustical and Environmental Robustness in Automatic Speech Recognition. Boston, MA: Kluwer
- Acero, A.¹

33
- 0001013568
- Acoustic modeling for large vocabulary speech recognition
- Jan.
- C.-H. Lee, L. R. Rabiner, R. Pieraccini, and J. Wilpon, "Acoustic modeling for large vocabulary speech recognition," Comput. Speech Lang., vol. 4, pp. 127-165, Jan. 1990.
- (1990) Comput. Speech Lang. , vol.4 , pp. 127-165
- Lee C-H¹ Rabiner, L.R.² Pieraccini, R.³ Wilpon, J.⁴

34
- 0024610919
- A tutorial on hidden Markov models and selected applications in speech recognition
- Feb.
- L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, pp. 257-286, Feb. 1989.
- (1989) Proc. IEEE , vol.77 , pp. 257-286
- Rabiner, L.R.¹

35
- 0020719320
- A maximum likelihood approach to continuous speech recognition
- Mar.
- L. Bahl, F. Jelinek, and R. Mercer, "A maximum likelihood approach to continuous speech recognition," IEEE Trans. Pattern Anal. Machine Intell., vol. PAMI-5, pp. 179-190, Mar. 1983.
- (1983) IEEE Trans. Pattern Anal. Machine Intell., Vol. PAMI , vol.5 , pp. 179-190
- Bahl, L.¹ Jelinek, F.² Mercer, R.³

36
- 0018724280
- H. Sakoe, 'Two-level DP-matching-A dynamic programming-based pattern matching algorithm for connected word recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-27, pp. 588-595, Dec. 1979.
- (1979) IEEE Trans. Acoust., Speech, Signal Processing, Vol. ASSP , vol.27 , pp. 588-595

37
- 0019558276
- A level building dynamic time warping algorithm for connected word recognition
- Apr.
- C. Myers and L. Rabiner, "A level building dynamic time warping algorithm for connected word recognition," IEEE Trans. Acoust., Speech Signal Processing, vol. ASSP-29, pp. 284-297, Apr. 1981.
- (1981) IEEE Trans. Acoust., Speech Signal Processing, Vol. ASSP , vol.29 , pp. 284-297
- Myers, C.¹ Rabiner, L.²

38
- 0024769238
- A frame-synchronous network search algorithm for connected word recognition
- Nov.
- C.-H. Lee and L. Rabiner, "A frame-synchronous network search algorithm for connected word recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. 37, pp. 1649-1658, Nov. 1989.
- (1989) IEEE Trans. Acoust., Speech, Signal Processing , vol.37 , pp. 1649-1658
- Lee C-H¹ Rabiner, L.²

39
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- A. Dempster, N. Laird, and D. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. Royal Statist. Soc., vol. 39, pp. 1-38, 1977.
- (1977) J. Royal Statist. Soc. , vol.39 , pp. 1-38
- Dempster, A.¹ Laird, N.² Rubin, D.³

40
- 0000353178
- A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains
- L. Baum, T. Pétrie, G. Soûles, and N. Weiss, "A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains," Ann. Math. Statist., vol. 41, no. 1, pp. 164-171, 1970.
- (1970) Ann. Math. Statist. , vol.41 , Issue.1 , pp. 164-171
- Baum, L.¹ Pétrie, T.² Soûles, G.³ Weiss, N.⁴

41
- 0022097649
- Maximum-likelihood estimation for mixture multivariate stochastic observations of Markov chains
- B.-H. Juang, "Maximum-likelihood estimation for mixture multivariate stochastic observations of Markov chains," AT&T Tech. J., vol. 64, no. 6, pp. 1235-1249, 1985.
- (1985) AT&T Tech. J. , vol.64 , Issue.6 , pp. 1235-1249
- Juang, B.-H.¹

42
- 0022712081
- A segmental K-means training procedure for connected word recognition
- May
- L. R. Rabiner, J. Wilpon, and B.-H. Juang, "A segmental K-means training procedure for connected word recognition," AT&T Tech. J., vol. 64, pp. 21, May 1986.
- (1986) AT&T Tech. J. , vol.64 , pp. 21
- Rabiner, L.R.¹ Wilpon, J.² Juang, B.-H.³

43
- 0015600423
- The Viterbi algorithm
- Mar.
- G. Forney, "The Viterbi algorithm," Proc. IEEE, vol. 61, pp. 268-278, Mar. 1973.
- (1973) Proc. IEEE , vol.61 , pp. 268-278
- Forney, G.¹

44
- 0023776398
- A database for continuous speech recognition in a 1000-word domain, in
- P. Price, W. Fisher, J. Bernstein, and D. Pallet!, "A database for continuous speech recognition in a 1000-word domain," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, 1988, pp. 651-654.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , vol.1988 , pp. 651-654
- Price, P.¹ Fisher, W.² Bernstein, J.³ Pallet, D.⁴

45
- 0026854591
- Improved acoustic modeling for large vocabulary continuous speech recognition
- C.-H. Lee, E. Giachin, L. Rabiner, R. Pieraccini, and A. Rosenberg, "Improved acoustic modeling for large vocabulary continuous speech recognition," Comput. Speech Lang., vol. 6, pp. 103-127, 1992.
- (1992) Comput. Speech Lang. , vol.6 , pp. 103-127
- Lee C-H¹ Giachin, E.² Rabiner, L.³ Pieraccini, R.⁴ Rosenberg, A.⁵

46
- 0028419019
- Maximum a posterior estimation for multivariate Gaussian mixture observations of Markov chains
- Apr.
- J. Gauvain and C.-H. Lee, "Maximum a posterior estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Processing, vol. 2, pp. 291-298, Apr. 1994.
- (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 291-298
- Gauvain, J.¹ Lee C-H²

47
- 0027877970
- Large vocabulary speech recognition using subword units
- C.-H. Lee, J.-L. Gauvain, R. Pieraccini, and L. Rabiner, "Large vocabulary speech recognition using subword units," Speech Commun., vol. 13, pp. 263-279, 1993.
- (1993) Speech Commun. , vol.13 , pp. 263-279
- Lee, C.-H.¹ Gauvain, J.-L.² Pieraccini, R.³ Rabiner, L.⁴

48
- 0027192618
- Speaker adaptation based on MAP estimation of HMM parameters, in
- C.-H. Lee and J.-L. Gauvain, "Speaker adaptation based on MAP estimation of HMM parameters," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, 1993, pp. II-558-II-561.
- Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , vol.1993
- Lee C-H¹ Gauvain, J.-L.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.