SCOPUS Information Retrieval Platform

IEEE Transactions on Speech and Audio Processing

Volume 7, Issue 2, 1999, Pages 162-176

On second-order statistics and linear estimation of cepstral coefficients

(2) Ephraim, Yariv a Rahim, Mazin b

a Krasnow Institute for Advanced Study (United States)

b AT&T Labs Research (United States)

Author keywords

Cepstral statistics; Hidden Markov model; Speech recognition

Indexed keywords

ERROR ANALYSIS; MARKOV PROCESSES; MATHEMATICAL MODELS; MATRIX ALGEBRA; SIGNAL PROCESSING; STATISTICS;

CEPSTRAL STATISTICS; HIDDEN MARKOV MODELS (HMM);

SPEECH RECOGNITION;

EID: 0033099548 PISSN: 10636676 EISSN: None Source Type: Journal
DOI: 10.1109/89.748121 Document Type: Article

Times cited : (52)

References (41)

1
- 0003874959
- Berlin, Germany: Springer-Verlag
- J. D. Markel and A. H. Gray, Jr., Linear Prediction of Speech. Berlin, Germany: Springer-Verlag, 1976.
- (1976) Linear Prediction of Speech.
- Markel, J.D.¹ Gray Jr., A.H.²

2
- 0003793552
- Englewood Cliffs, NJ: Prentice-Hall
- A. V. Oppenheim and R. W. Schafer, Digital Signal Processing. Englewood Cliffs, NJ: Prentice-Hall, 1975.
- (1975) Digital Signal Processing.
- Oppenheim, A.V.¹ Schafer, R.W.²

3
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- Aug.
- S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-28, pp. 357-366, Aug. 1980.
- (1980) IEEE Trans. Acoust., Speech, Signal Processing , vol. ASSP-28 , pp. 357-366
- Davis, S.B.¹ Mermelstein, P.²

4
- 0024610919
- A tutorial on hidden Markov models and selected applications in speech recognition
- Feb.
- L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, pp. 257-286, Feb. 1989.
- (1989) Proc. IEEE , vol.77 , pp. 257-286
- Rabiner, L.R.¹

5
- 0004244302
- Englewood Cliffs, NJ: Prentice-Hall
- L. R. Rabiner and B.-H. Juang, Fundamentals of Speech Recognition. Englewood Cliffs, NJ: Prentice-Hall, 1993.
- (1993) Fundamentals of Speech Recognition.
- Rabiner, L.R.¹ Juang, B.-H.²

6
- 0029345416
- A comparison of signal processing front ends for automatic word recognition
- July
- C. R. Jankowski, Jr., H.-D. H. Vo, and R. P. Lippmann, "A comparison of signal processing front ends for automatic word recognition," IEEE Trans. Speech Audio Processing, vol. 3, pp. 286-293, July 1995.
- (1995) IEEE Trans. Speech Audio Processing , vol.3 , pp. 286-293
- Jankowski, C.R.¹ Vo, H.-D.H.² Lippmann, R.P.³

7
- 0003770711
- C.-H. Lee, F. K. Soong, and K. K. Paliwal, Eds., Boston, MA: Kluwer
- C.-H. Lee, F. K. Soong, and K. K. Paliwal, Eds., Automatic Speech and Speaker Recognition: Advanced Topics. Boston, MA: Kluwer, 1996.
- (1996) Automatic Speech and Speaker Recognition: Advanced Topics.

8
- 0003406402
- New York: Academic
- M. B. Priestley, Spectral Analysis and Time Series. New York: Academic, 1992.
- (1992) Spectral Analysis and Time Series
- Priestley, M.B.¹

9
- 0003757962
- Berlin, Germany: Springer-Verlag
- J. L. Flanagan, Speech Analysis, Synthesis, and Perception. Berlin, Germany: Springer-Verlag, 1972.
- (1972) Speech Analysis, Synthesis, and Perception.
- Flanagan, J.L.¹

10
- 0029769867
- Signal bias removal by maximum likelihood estimation for robust telephone speech recognition
- Jan.
- M. G. Rahim and B.-H. Juang, "Signal bias removal by maximum likelihood estimation for robust telephone speech recognition," IEEE Trans. Speech Audio Processing, vol. 4, pp. 19-30, Jan. 1996.
- (1996) IEEE Trans. Speech Audio Processing , vol.4 , pp. 19-30
- Rahim, M.G.¹ Juang, B.-H.²

11
- 0030149866
- A maximum likelihood approach to stochastic matching for robust speech recognition
- May
- A. Sankar and C. H. Lee, "A maximum likelihood approach to stochastic matching for robust speech recognition," IEEE Trans. Speech Audio Processing, vol. 4, pp. 190-202, May 1996.
- (1996) IEEE Trans. Speech Audio Processing , vol.4 , pp. 190-202
- Sankar, A.¹ Lee, C.²

12
- 0000353178
- A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains
- L. E. Baum, T. Petrie, G. Soules, and N. Weiss, "A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains," Ann. Math. Stat., vol. 41, pp. 164-171, 1970.
- (1970) Ann. Math. Stat. , vol.41 , pp. 164-171
- Baum, L.E.¹ Petrie, T.² Soules, G.³ Weiss, N.⁴

13
- 0001862769
- An inequality and associated maximization technique in statistical estimation of probabilistic functions of Markov processes
- L. E. Baum, "An inequality and associated maximization technique in statistical estimation of probabilistic functions of Markov processes," Inequalities, vol. 3, pp. 1-8, 1972.
- (1972) Inequalities , vol.3 , pp. 1-8
- Baum, L.E.¹

14
- 0027592532
- On the asymptotic statistical behavior of empirical cepstral coefficients
- May
- N. Merhav and C.-H. Lee, "On the asymptotic statistical behavior of empirical cepstral coefficients," IEEE Trans. Signal Processing, vol. 41, pp. 1990-1993, May 1993.
- (1993) IEEE Trans. Signal Processing , vol.41 , pp. 1990-1993
- Merhav, N.¹ Lee, C.-H.²

15
- 0023246158
- A speaker-stress resistant HMM isolated word recognizer
- Apr.
- D. B. Paul, "A speaker-stress resistant HMM isolated word recognizer," Int. Conf. Acoustics, Speech, Signal Processing, Apr. 1987, pp. 713-715.
- (1987) Int. Conf. Acoustics, Speech, Signal Processing , pp. 713-715
- Paul, D.B.¹

16
- 0023263708
- Multi-style training for robust isolated-word speech recognition
- Apr.
- R. P. Lippmann, E. A. Martin, and D. B. Paul, "Multi-style training for robust isolated-word speech recognition," in Proc. Int. Conf. Acoustics, Speech, Signal Processing, Apr. 1987, pp. 705-708.
- (1987) Proc. Int. Conf. Acoustics, Speech, Signal Processing , pp. 705-708
- Lippmann, R.P.¹ Martin, E.A.² Paul, D.B.³

17
- 0023246343
- Two-stage discriminant analysis for improved isolated-word recognition
- Apr.
- E. A. Martin, R. P. Lippmann, and D. B. Paul, "Two-stage discriminant analysis for improved isolated-word recognition," in Proc. Int. Conf. Acoustics, Speech, and Signal Processing, Apr. 1987, pp. 709-712.
- (1987) Proc. Int. Conf. Acoustics, Speech, and Signal Processing , pp. 709-712
- Martin, E.A.¹ Lippmann, R.P.² Paul, D.B.³

18
- 0023168987
- Cepstral domain stress compensation for robust speech recognition
- Apr.
- Y. Chen, "Cepstral domain stress compensation for robust speech recognition," in Proc. Int. Conf. Acoustics, Speech, and Signal Processing, Apr. 1987, pp. 717-720.
- (1987) Proc. Int. Conf. Acoustics, Speech, and Signal Processing , pp. 717-720
- Chen, Y.¹

19
- 0003690113
- New York: Holt, Rinehart
- D. R. Brillinger, Time Series-Data Analysis and Theory. New York: Holt, Rinehart, 1975.
- (1975) Time Series-Data Analysis and Theory.
- Brillinger, D.R.¹

20
- 0018032060
- Source coding of the discrete Fourier transform
- Nov.
- W. A. Pearlman and R. M. Gray, "Source coding of the discrete Fourier transform," IEEE Trans. Inform. Theory, vol. IT-24, pp. 683-692, Nov. 1978.
- (1978) IEEE Trans. Inform. Theory , vol. IT-24 , pp. 683-692
- Pearlman, W.A.¹ Gray, R.M.²

21
- 0018642851
- Enhancement and bandwidth compression of noisy speech
- Dec.
- J. S. Lim and A. V. Oppenheim, "Enhancement and bandwidth compression of noisy speech," Proc. IEEE, vol. 67, pp. 1586-1604, Dec. 1979.
- (1979) Proc. IEEE , vol.67 , pp. 1586-1604
- Lim, J.S.¹ Oppenheim, A.V.²

22
- 0019009880
- Speech enhancement using a soft-decision noise suppression filter
- Apr.
- R. J. McAulay and M. L. Malpass, "Speech enhancement using a soft-decision noise suppression filter," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-28, pp. 137-145, Apr. 1980.
- (1980) IEEE Trans. Acoust., Speech, Signal Processing , vol. ASSP-28 , pp. 137-145
- McAulay, R.J.¹ Malpass, M.L.²

23
- 0021645331
- Speech enhancement using a minimum mean square error short time spectral amplitude estimator
- Dec.
- Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean square error short time spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-32, pp. 1109-1121, Dec. 1984.
- (1984) IEEE Trans. Acoust., Speech, Signal Processing , vol. ASSP-32 , pp. 1109-1121
- Ephraim, Y.¹ Malah, D.²

24
- 0021892216
- Speech enhancement using a minimum mean square error log-spectral amplitude estimator
- Apr.
- "Speech enhancement using a minimum mean square error log-spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-33, pp. 443-445, Apr. 1985.
- (1985) IEEE Trans. Acoust., Speech, Signal Processing , vol. ASSP-33 , pp. 443-445

25
- 0003959189
- Boston, MA: Kluwer
- A. Gersho and R. M. Gray, Vector Quantization and Signal Compression. Boston, MA: Kluwer, 1991.
- (1991) Vector Quantization and Signal Compression.
- Gersho, A.¹ Gray, R.M.²

26
- 0026189808
- Speech recognition in adverse environments
- B.-H. Juang, "Speech recognition in adverse environments," Comput., Speech, Lang., vol. 5, pp. 275-294, 1991.
- (1991) Comput., Speech, Lang. , vol.5 , pp. 275-294
- Juang, B.-H.¹

27
- 84948598244
- Statistical model based speech enhancement system
- Oct.
- Y. Ephraim, "Statistical model based speech enhancement system," Proc. IEEE, vol. 80, pp. 1526-1555, Oct. 1992.
- (1992) Proc. IEEE , vol.80 , pp. 1526-1555
- Ephraim, Y.¹

28
- 0030245128
- Robust continuous speech recognition using parallel model combination
- Sept.
- M. J. F. Gales and S. J. Young, "Robust continuous speech recognition using parallel model combination," IEEE Trans. Speech Audio Processing, vol. 4, pp. 352-359, Sept. 1996.
- (1996) IEEE Trans. Speech Audio Processing , vol.4 , pp. 352-359
- Gales, M.J.F.¹ Young, S.J.²

29
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Stat. Soc. B, vol. 39, pp. 1-38, 1977.
- (1977) J. R. Stat. Soc. B , vol.39 , pp. 1-38
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

30
- 0003498504
- New York: Academic
- I. S. Gradshteyn and I. M. Ryzhik, Table of Integrals, Series, and Products. New York: Academic, 1979.
- (1979) Table of Integrals, Series, and Products.
- Gradshteyn, I.S.¹ Ryzhik, I.M.²

31
- 84947382512
- Automatic smoothing of the log periodogram
- Mar.
- G. Wahba, "Automatic smoothing of the log periodogram," J. Amer. Stat. Assoc., vol. 75, pp. 122-132, Mar. 1980.
- (1980) J. Amer. Stat. Assoc. , vol.75 , pp. 122-132
- Wahba, G.¹

32
- 0004149831
- Cambridge, MA: Cambridge Univ. Press
- S. Wolfram, The Mathematica Book. Cambridge, MA: Cambridge Univ. Press, 1996.
- (1996) The Mathematica Book.
- Wolfram, S.¹

33
- 0026382131
- Noisy speech recognition using hidden Markov state-based filtering
- V. L. Beattie and S. J. Young, "Noisy speech recognition using hidden Markov state-based filtering," in Proc. Int. Conf. Acoustics, Speech, Signal Processing, 1991, pp. 917-920.
- (1991) Proc. Int. Conf. Acoustics, Speech, Signal Processing , pp. 917-920
- Beattie, V.L.¹ Young, S.J.²

34
- 0006936809
- Hidden Markov model state-based cepstral noise compensation
- "Hidden Markov model state-based cepstral noise compensation," in Proc. ICSLP, 1992, pp. 519-522.
- (1992) Proc. ICSLP , pp. 519-522

35
- 0026384952
- A hypothesized Wiener filtering approach to noisy speech recognition
- A. D. Berstein and I. D. Shalom, "A hypothesized Wiener filtering approach to noisy speech recognition," in Proc. Int. Conf. Acoustics, Speech, and Signal Processing, 1991, pp. 913-916.
- (1991) Proc. Int. Conf. Acoustics, Speech, and Signal Processing , pp. 913-916
- Berstein, A.D.¹ Shalom, I.D.²

36
- 0003897447
- New York: Academic
- S. Ross, Introduction to Probability Models. New York: Academic, 1997.
- (1997) Introduction to Probability Models.
- Ross, S.¹

37
- 0015600423
- The Viterbi algorithm
- Mar.
- G. D. Forney, Jr., "The Viterbi algorithm," Proc. IEEE, vol. 61, pp. 268-278, Mar. 1973.
- (1973) Proc. IEEE , vol.61 , pp. 268-278
- Forney Jr., G.D.¹

38
- 0026223168
- Maximum likelihood hidden Markov modeling using a dominant sequence of states
- Sept.
- N. Merhav and Y. Ephraim, "Maximum likelihood hidden Markov modeling using a dominant sequence of states," IEEE Trans. Signal Processing, vol. 39, pp. 2111-2115, Sept. 1991.
- (1991) IEEE Trans. Signal Processing , vol.39 , pp. 2111-2115
- Merhav, N.¹ Ephraim, Y.²

39
- 0026242124
- Hidden Markov modeling using a dominant state sequence with application to speech recognition
- "Hidden Markov modeling using a dominant state sequence with application to speech recognition," Comput. Speech Lang., vol. 5, pp. 327-339, 1991.
- (1991) Comput. Speech Lang. , vol.5 , pp. 327-339

40
- 0024909863
- On the application of hidden Markov models for enhancing noisy speech
- Dec.
- Y. Ephraim, D. Malah, and B.-H. Juang, "On the application of hidden Markov models for enhancing noisy speech," IEEE Trans. Acoust., Speech, Signal Processing, vol. 37, pp. 1846-1856, Dec. 1989.
- (1989) IEEE Trans. Acoust., Speech, Signal Processing , vol.37 , pp. 1846-1856
- Ephraim, Y.¹ Malah, D.² Juang, B.-H.³

41
- 0029345417
- A signal subspace approach for speech enhancement
- July
- Y. Ephraim and H. L. Van Trees, "A signal subspace approach for speech enhancement," IEEE Trans. Speech Audio Processing, vol. 3, pp. 251-266, July 1995.
- (1995) IEEE Trans. Speech Audio Processing , vol.3 , pp. 251-266
- Ephraim, Y.¹ Van Trees, H.L.²

Author biographies

Yariv Ephraim (S'82-M'84-SM'90-F'94) received the D.Sc. degree in electrical engineering in 1984 from the Technion-Israel Institute of Technology, Haifa. He was a Rothschild Post-Doctoral Fellow at the Information Systems Laboratory, Stanford University, Stanford, CA, from 1984 to 1985. He was a Member of Technical Staff at the Information Principles Research Laboratory of AT&T Bell Laboratories, Murray Hill, NJ, from 1985 to 1993. In 1991, he joined George Mason University, Fairfax, VA, where he currently is an Associate Professor of electrical and computer engineering.

Mazin Rahim (S'86-M'91-SM'96) received the B.Eng. and Ph.D. degrees from the University of Liverpool, U.K., in 1987 and 1991, respectively. He is currently a Principal Technical Staff Member at AT&T Labs Research, Murray Hill, NJ, where he is pursuing research in the areas of robustness, acoustic modeling, and utterance verification for automatic speech recognition. Prior to joining AT&T, he was a Research Professor with the Center for Computer Aids for Industrial Productivity, Rutgers University, New Brunswick, NJ, where he was engaged in research in neural networks for speech and speaker recognition. He has over 40 publications in the area of speech processing and is the author of the book Artificial Neural Networks for Speech Analysis/Synthesis (London, U.K.: Chapman & Hall, 1994). Dr. Rahim is currently an associate editor for the IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING. He has been an invited guest at several speech processing workshops, including the U.S. government sponsored CAIP workshops in 1993 and 1994. He is a recipient of several professional awards, including two best paper awards, from IEE in 1989 and from ASA in 1992. He is a member of the British Institute of Electrical Engineers (IEE).

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.