메뉴 건너뛰기




Volumn 7, Issue 2, 1999, Pages 162-176

On second-order statistics and linear estimation of cepstral coefficients

Author keywords

Cepstral statistics; Hidden markov model; Speech recognition

Indexed keywords

ERROR ANALYSIS; MARKOV PROCESSES; MATHEMATICAL MODELS; MATRIX ALGEBRA; SIGNAL PROCESSING; STATISTICS;

EID: 0033099548     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/89.748121     Document Type: Article
Times cited : (51)

References (41)
  • 3
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Aug.
    • S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoiist., Speech, Signal Processing, vol. ASSP-28, pp. 357-366, Aug. 1980.
    • (1980) IEEE Trans. Acoiist., Speech, Signal Processing , vol.VOL. ASSP-28 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 4
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Feb.
    • L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, pp. 257-286, Feb. 1989.
    • (1989) Proc. IEEE , vol.77 , pp. 257-286
    • Rabiner, L.R.1
  • 6
    • 0029345416 scopus 로고
    • A comparison of signal processing front ends for automatic word recognition
    • July
    • C. R. Jankowski, Jr., H.-D. H. Vo, and R. P. Lippmann, "A comparison of signal processing front ends for automatic word recognition," IEEE Trans. Speech Audio Processing, vol. 3, pp. 286-293, July 1995.
    • (1995) IEEE Trans. Speech Audio Processing , vol.3 , pp. 286-293
    • Jankowski, C.R.1    Vo, H.-D.H.2    Lippmann, R.P.3
  • 10
    • 0029769867 scopus 로고    scopus 로고
    • Signal bias removal by maximum likelihood estimation for robust telephone speech recognition
    • Jan.
    • M. G. Rahim and B.-H. Juang, "Signal bias removal by maximum likelihood estimation for robust telephone speech recognition," IEEE Trans. Speech Audio Processing, vol. 4, pp. 19-30, Jan. 1996.
    • (1996) IEEE Trans. Speech Audio Processing , vol.4 , pp. 19-30
    • Rahim, M.G.1    Juang, B.-H.2
  • 11
    • 0030149866 scopus 로고    scopus 로고
    • A maximum likelihood approach to stochastic matching for robust speech recognition
    • May
    • A. Sankar and C. H. Lee, "A maximum likelihood approach to stochastic matching for robust speech recognition" IEEE Trans. Speech Audio Processing, vol. 4, pp. 190-202, May 1996.
    • (1996) IEEE Trans. Speech Audio Processing , vol.4 , pp. 190-202
    • Sankar, A.1    Lee, C.2
  • 12
    • 0000353178 scopus 로고
    • A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains
    • L. E. Baum, T. Pétrie, G. Soules, and N. Weiss, "A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains," Ann. Math. Stat., vol. 41, pp. 164-171, 1970.
    • (1970) Ann. Math. Stat. , vol.41 , pp. 164-171
    • Baum, L.E.1    Pétrie, T.2    Soules, G.3    Weiss, N.4
  • 13
    • 0001862769 scopus 로고
    • An inequality and associated maximization technique in statistical estimation of probabilistic functions of Markov processes
    • L. E. Baum, "An inequality and associated maximization technique in statistical estimation of probabilistic functions of Markov processes," Inequalities, vol. 3, pp. 1-8, 1972.
    • (1972) Inequalities , vol.3 , pp. 1-8
    • Baum, L.E.1
  • 14
    • 0027592532 scopus 로고
    • On the asymptotic statistical behavior of empirical cepstral coefficients
    • May
    • N. Merhav and C.-H. Lee, "On the asymptotic statistical behavior of empirical cepstral coefficients," IEEE Trans. Signal Processing, vol. 41, pp. 1990-1993, May 1993.
    • (1993) IEEE Trans. Signal Processing , vol.41 , pp. 1990-1993
    • Merhav, N.1    Lee, C.-H.2
  • 15
    • 0023246158 scopus 로고
    • A speaker-stress resistant HMM isolated word recognizer
    • Apr.
    • D. B. Paul, "A speaker-stress resistant HMM isolated word recognizer," Int. Co/iJ. Acoustics, Speech, Signal Processing, Apr. 1987, pp. 713-715.
    • (1987) Int. Co/iJ. Acoustics, Speech, Signal Processing , pp. 713-715
    • Paul, D.B.1
  • 18
    • 0023168987 scopus 로고
    • Cepstral domain stress compensation for robust speech recognition
    • Apr.
    • Y. Chen, "Cepstral domain stress compensation for robust speech recognition, in Proc. Int. Conf. Acoustics, Speech, and Signal Processing, Apr. 1987, pp. 717-720.
    • (1987) Proc. Int. Conf. Acoustics, Speech, and Signal Processing , pp. 717-720
    • Chen, Y.1
  • 20
    • 0018032060 scopus 로고
    • Source coding of the discrete Fourier transform
    • Nov.
    • W. A. Pearlman and R. M. Gray, "Source coding of the discrete Fourier transform, IEEE Trans. Inform. Theory, vol. IT-24, pp. 683-692, Nov. 1978.
    • (1978) IEEE Trans. Inform. Theory , vol.VOL. IT-24 , pp. 683-692
    • Pearlman, W.A.1    Gray, R.M.2
  • 21
    • 0018642851 scopus 로고
    • Enhancement and bandwidth compression of noisy speech
    • Dec.
    • J. S. Lim and A. V. Oppenheim, "Enhancement and bandwidth compression of noisy speech," Proc. IEEE, vol. 67, pp. 1586-1604, Dec. 1979.
    • (1979) Proc. IEEE , vol.67 , pp. 1586-1604
    • Lim, J.S.1    Oppenheim, A.V.2
  • 22
    • 0019009880 scopus 로고
    • Speech enhancement using a soft-decision noise suppression filter
    • Apr.
    • R. J. McAulay and M. L. Malpass, "Speech enhancement using a soft-decision noise suppression filter," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-28, pp. 137-145, Apr. 1980.
    • (1980) IEEE Trans. Acoust., Speech, Signal Processing , vol.VOL. ASSP-28 , pp. 137-145
    • McAulay, R.J.1    Malpass, M.L.2
  • 23
    • 0021645331 scopus 로고
    • Speech enhancement using a minimum mean square error short time spectral amplitude estimator
    • Dec.
    • Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean square error short time spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-32, pp. 1109-1121, Dec. 1984.
    • (1984) IEEE Trans. Acoust., Speech, Signal Processing , vol.VOL. ASSP-32 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2
  • 24
    • 0021892216 scopus 로고
    • Speech enhancement using a minimum mean square error Log-spectral amplitude estimator
    • Apr.
    • "Speech enhancement using a minimum mean square error Log-spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-33, pp. 443-445, Apr. 1985.
    • (1985) IEEE Trans. Acoust., Speech, Signal Processing , vol.VOL. ASSP-33 , pp. 443-445
  • 26
    • 0026189808 scopus 로고
    • Speech recognition in adverse environments
    • B.-H. Juang, "Speech recognition in adverse environments," Comput., Speech, Lang., vol. 5, pp. 275-294, 1991.
    • (1991) Comput., Speech, Lang. , vol.5 , pp. 275-294
    • Juang, B.-H.1
  • 27
    • 84948598244 scopus 로고
    • Statistical model based speech enhancement system
    • Oct.
    • Y. Ephraim, "Statistical model based speech enhancement system," Proc. IEEE, vol. 80, pp. 1526-1555, Oct. 1992.
    • (1992) Proc. IEEE , vol.80 , pp. 1526-1555
    • Ephraim, Y.1
  • 28
    • 0030245128 scopus 로고    scopus 로고
    • Robust continuous speech recognition using parallel model combination
    • Sept.
    • M. J. F. Gales and S. J. Young, "Robust continuous speech recognition using parallel model combination," IEEE Trans. Speech Audio Processing, vol. 4, pp. 352-359, Sept. 1996.
    • (1996) IEEE Trans. Speech Audio Processing , vol.4 , pp. 352-359
    • Gales, M.J.F.1    Young, S.J.2
  • 29
    • 0002629270 scopus 로고
    • Maximum likelihood from incomplete data via the EM algorithm
    • A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Stat. Soc. B, vol. 39, pp. 1-38, 1977.
    • (1977) J. R. Stat. Soc. B , vol.39 , pp. 1-38
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 31
    • 84947382512 scopus 로고
    • Automatic smoothing of the log periodogram
    • Mar.
    • G. Wahba, "Automatic smoothing of the log periodogram," J. Amer. Slat.Assoc., vol. 75, pp. 122-132, Mar. 1980.
    • (1980) J. Amer. Slat.Assoc. , vol.75 , pp. 122-132
    • Wahba, G.1
  • 32
    • 0004149831 scopus 로고    scopus 로고
    • Cambridge, MA: Cambridge Univ. Press
    • S. Wolfram, The Mathematica Book. Cambridge, MA: Cambridge Univ. Press, 1996.
    • (1996) The Mathematica Book.
    • Wolfram, S.1
  • 34
    • 0006936809 scopus 로고    scopus 로고
    • Hidden Markov model state-based cepstral noise compensation
    • "Hidden Markov model state-based cepstral noise compensation," in Proc. ICSLP,\992, pp. 519-522.
    • Proc. ICSLP,\992 , pp. 519-522
  • 37
    • 0015600423 scopus 로고
    • The Viterbi algorithm
    • Mar.
    • G. D. Foraey, Jr., "The Viterbi algorithm," Proc. IEEE, vol. 61, pp. 268-278, Mar. 1973.
    • (1973) Proc. IEEE , vol.61 , pp. 268-278
    • Foraey Jr., G.D.1
  • 38
    • 0026223168 scopus 로고
    • Maximum likelihood hidden Markov modeling using a dominant sequence of states
    • Sept
    • N. Merhav and Y. Ephraim, "Maximum likelihood hidden Markov modeling using a dominant sequence of states," IEEE Trans. Signal Processing, vol. 39, pp. 2111-2115, Sept. 1991.
    • (1991) IEEE Trans. Signal Processing , vol.39 , pp. 2111-2115
    • Merhav, N.1    Ephraim, Y.2
  • 39
    • 0026242124 scopus 로고
    • Hidden Markov modeling using a dominant state sequence with application to speech recognition
    • "Hidden Markov modeling using a dominant state sequence with application to speech recognition," Comput. Speech Lang., vol. 5, pp. 327-339, 1991.
    • (1991) Comput. Speech Lang. , vol.5 , pp. 327-339
  • 40
    • 0024909863 scopus 로고
    • On the application of hidden Markov models for enhancing noisy speech,"
    • Dec.
    • Y. Ephraim, D. Malah, and B.-H. Juang "On the application of hidden Markov models for enhancing noisy speech," IEEE Trans. Aconst., Speech, Signal Processing, vol. 37, pp. 1846-1856, Dec. 1989.
    • (1989) IEEE Trans. Aconst., Speech, Signal Processing , vol.37 , pp. 1846-1856
    • Ephraim, Y.1    Malah, D.2    Juang, B.-H.3
  • 41
    • 0029345417 scopus 로고
    • A signal subspace approach for speech enhancement
    • July
    • Y. Ephraim and H. L. Van Tress, "A signal subspace approach for speech enhancement," IEEE Trans. Speech Audio Processing, vol. 3, pp. 251-266, July 1995. Yariv Ephraim (S'82-M'84-SM'90-F'94) received the D.Sc. degree in electrical engineering in 1984 from the Technion-Israel Institute of Technology, Haifa. He was a Rothschild Post-Doctoral Fellow at the Information Systems Laboratory, Stanford University, Stanford, CA, from 1984 to 1985. He was a Member of Technical Staff at the Information Principles Research Laboratory of AT&T Bell Laboratories, Murray Hill, NJ, from 1985 to 1993. In 1991. he ioined Georce Mason University, Fairfax, VA, where he currently is an Associate Professor of electrical and computer engineering. Mazin Rahim (S'86-M'91-SM'96) received the B.Eng. and Ph.D. degrees from the University of Liverpool, U.K., in 1987 and 1991, respectively. He is currently a Principal Technical Staff Member at AT&T Labs Research, Murray Hill, NJ, where he is pursuing research in the areas of robustness, acoustic modeling, and utterance verification for automatic speech recognition. Prior to joining AT&T, he was a Research Professor with the Center for Computer Aids for Industrial Productivity, Rutgers University, New Brunswick, NJ, where he was engaged in research in neural networks for speech and speaker recognition. He has over 40 publications in the area of speech processing and is the author of the book Artificial Neural Neru-orks for Speech Analysis/Synthesis (London, U.K.: Chapman & Hall, 1994). Dr. Rahim is currently an associate editor for the IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING. He has been an invited guest at several speech processing workshops, including the U.S. government sponsored CAIP workshops in 1993 and 1994. He is a recipient of several professional awards, including two best papers from IEE in 1989 and from ASA in 1992. He is a member of the British Institute of Electrical Engineers (IEE).
    • (1995) IEEE Trans. Speech Audio Processing , vol.3 , pp. 251-266
    • Ephraim, Y.1    Van Tress, H.L.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.