메뉴 건너뛰기




Volumn 4, Issue 3, 1996, Pages 190-202

A maximum-likelihood approach to stochastic matching for robust speech recognition

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; ESTIMATION; FEATURE EXTRACTION; MARKOV PROCESSES; MATHEMATICAL MODELS; MATHEMATICAL TRANSFORMATIONS; RANDOM PROCESSES; SIGNAL DISTORTION; TRANSDUCERS;

EID: 0030149866     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/89.496215     Document Type: Article
Times cited : (308)

References (48)
  • 1
    • 0026189808 scopus 로고
    • Speech recognition in adverse environments
    • B.-H. Juang, "Speech recognition in adverse environments," Comput. Speech Lang., vol. 5, pp. 275-294, 1991.
    • (1991) Comput. Speech Lang. , vol.5 , pp. 275-294
    • Juang, B.-H.1
  • 3
    • 0016067897 scopus 로고
    • Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
    • June
    • B. Atal, "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," J. Acoust. Soc. Amer., vol. 55, pp. 1304-1312, June 1974.
    • (1974) J. Acoust. Soc. Amer. , vol.55 , pp. 1304-1312
    • Atal, B.1
  • 4
    • 85135377175 scopus 로고    scopus 로고
    • Compensation for the effect of the communication channel in auditory-like analysis of speech (RASTA-PLP), in
    • H. Hermansky, N. Morgan, A. Bayya, and P. Kohn, "Compensation for The effect of the communication channel in auditory-like analysis of speech (RASTA-PLP)," in Proc. EVROSPEECH, 1991, pp. 1367-1370.
    • Proc. EVROSPEECH , vol.1991 , pp. 1367-1370
    • Hermansky, H.1    Morgan, N.2    Bayya, A.3    Kohn, P.4
  • 6
    • 84928837806 scopus 로고
    • A joint synchrony/mean-rate model of auditory speech processing
    • S. Seneff, "A joint synchrony/mean-rate model of auditory speech processing," J. Phonetics, vol. 16, pp. 55-76, 1990.
    • (1990) J. Phonetics , vol.16 , pp. 55-76
    • Seneff, S.1
  • 7
    • 0027646437 scopus 로고
    • On the use of a family of signal limiters for recognition of noisy speech
    • C.-H. Lee and C.-H. Lin, "On the use of a family of signal limiters for recognition of noisy speech," Speech Commun., vol. 12, pp. 383-392, 1993.
    • (1993) Speech Commun. , vol.12 , pp. 383-392
    • Lee C-H1    Lin C-H2
  • 8
    • 0027229711 scopus 로고
    • Influence of background noise and microphone on the performance of the IBM TANGORA speech recognition system, in Proc
    • S. Das et al., "Influence of background noise and microphone on the performance of the IBM TANGORA speech recognition system," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, 1993, pp. 11-71.
    • (1993) IEEE Int. Conf. Acoust., Speech, Signal Processing , pp. 11-71
    • Das Et Al, S.1
  • 10
    • 0006923547 scopus 로고
    • Noise adaptation in a hidden Markov model speech recognition system
    • _, "Noise adaptation in a hidden Markov model speech recognition system," Compiit. Speech Lang., vol. 3, pp. 151-167, 1989.
    • (1989) Compiit. Speech Lang. , vol.3 , pp. 151-167
  • 12
    • 0021645331 scopus 로고
    • Speech enhancement using a minimum mean square error short-time spectral amplitude estimator
    • Dec.
    • Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean square error short-time spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-32, pp. 1109-1121, Dec. 1984.
    • (1984) IEEE Trans. Acoust., Speech, Signal Processing, Vol. ASSP , vol.32 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2
  • 13
    • 0021892216 scopus 로고
    • Speech enhancement using a minimum mean square error log-spectral amplitude estimator
    • Apr.
    • -, "Speech enhancement using a minimum mean square error log-spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-33, pp. 443-445, Apr. 1985.
    • (1985) IEEE Trans. Acoust., Speech, Signal Processing, Vol. ASSP , vol.33 , pp. 443-445
  • 16
    • 0001873457 scopus 로고
    • Filterbank-energy estimation using mixture and Markov models for recognition of noisy speech
    • Jan.
    • A. Ereil and M. Weintraub, "Filterbank-energy estimation using mixture and Markov models for recognition of noisy speech," IEEE Trans. Speech Audio Processing, vol. 1, pp. 68-76, Jan. 1993.
    • (1993) IEEE Trans. Speech Audio Processing , vol.1 , pp. 68-76
    • Ereil, A.1    Weintraub, M.2
  • 22
    • 85006657791 scopus 로고    scopus 로고
    • Speech recognition using hidden Markov model decomposition and a general background speech model, in
    • M. Wang and S. Young, "Speech recognition using hidden Markov model decomposition and a general background speech model," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, 1992, pp. I-253-I-256.
    • Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , vol.1992
    • Wang, M.1    Young, S.2
  • 23
    • 85017310148 scopus 로고    scopus 로고
    • An improved approach to the hidden Markov model decomposition of speech and noise, in
    • M. Gales and S. Young, "An improved approach to the hidden Markov model decomposition of speech and noise," in Proc. IEEE Int. Conf. Acomt., Speech, Signal Processing, 1992, pp. I-233-I-236.
    • Proc. IEEE Int. Conf. Acomt., Speech, Signal Processing , vol.1992
    • Gales, M.1    Young, S.2
  • 24
    • 0028420014 scopus 로고
    • R. Rose, E. Hofstetter, and D. Rey nolds"lntegrated models of speech and background with application to speaker identification in noise," IEEE Trans. Speech Audio Processing, vol. 2, pp. 245-257, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 245-257
    • Rose, R.1    Hofstetter, E.2
  • 25
    • 0026881830 scopus 로고
    • Gain-adapted hidden Markov models for recognition of clean and noisy speech
    • June
    • Y. Ephraim, "Gain-adapted hidden Markov models for recognition of clean and noisy speech," IEEE Trans. Signal Processing, vol. 40, pp. 1303-1316, June 1992.
    • (1992) IEEE Trans. Signal Processing , vol.40 , pp. 1303-1316
    • Ephraim, Y.1
  • 26
    • 0002671953 scopus 로고
    • A minimax classification approach with application to robust speech recognition
    • Jan.
    • N. Merhav and C.-H. Lee, "A minimax classification approach with application to robust speech recognition," IEEE Trans. Speech Audio Processing, vol. 1, pp. 90-100, Jan. 1993.
    • (1993) IEEE Trans. Speech Audio Processing , vol.1 , pp. 90-100
    • Merhav, N.1    Lee C-H2
  • 28
    • 0025587779 scopus 로고    scopus 로고
    • Simultaneous speaker normalization and utterance labeling using Bayesian/neural net techniques, in
    • -, "Simultaneous speaker normalization and utterance labeling using Bayesian/neural net techniques," in Proc. IEEE Int. Conf. Acousl., Speech, Signal Processing, 1990, pp. 161-164.
    • Proc. IEEE Int. Conf. Acousl., Speech, Signal Processing , vol.1990 , pp. 161-164
  • 29
    • 0027167189 scopus 로고    scopus 로고
    • A new speaker adaptation technique using very short calibration speech, in
    • Y. Zhao, "A new speaker adaptation technique using very short calibration speech," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, 1993, pp. II-562-II-565.
    • Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , vol.1993
    • Zhao, Y.1
  • 33
    • 0001013568 scopus 로고
    • Acoustic modeling for large vocabulary speech recognition
    • Jan.
    • C.-H. Lee, L. R. Rabiner, R. Pieraccini, and J. Wilpon, "Acoustic modeling for large vocabulary speech recognition," Comput. Speech Lang., vol. 4, pp. 127-165, Jan. 1990.
    • (1990) Comput. Speech Lang. , vol.4 , pp. 127-165
    • Lee C-H1    Rabiner, L.R.2    Pieraccini, R.3    Wilpon, J.4
  • 34
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Feb.
    • L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, pp. 257-286, Feb. 1989.
    • (1989) Proc. IEEE , vol.77 , pp. 257-286
    • Rabiner, L.R.1
  • 36
    • 0018724280 scopus 로고
    • H. Sakoe, 'Two-level DP-matching-A dynamic programming-based pattern matching algorithm for connected word recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-27, pp. 588-595, Dec. 1979.
    • (1979) IEEE Trans. Acoust., Speech, Signal Processing, Vol. ASSP , vol.27 , pp. 588-595
  • 37
    • 0019558276 scopus 로고
    • A level building dynamic time warping algorithm for connected word recognition
    • Apr.
    • C. Myers and L. Rabiner, "A level building dynamic time warping algorithm for connected word recognition," IEEE Trans. Acoust., Speech Signal Processing, vol. ASSP-29, pp. 284-297, Apr. 1981.
    • (1981) IEEE Trans. Acoust., Speech Signal Processing, Vol. ASSP , vol.29 , pp. 284-297
    • Myers, C.1    Rabiner, L.2
  • 38
    • 0024769238 scopus 로고
    • A frame-synchronous network search algorithm for connected word recognition
    • Nov.
    • C.-H. Lee and L. Rabiner, "A frame-synchronous network search algorithm for connected word recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. 37, pp. 1649-1658, Nov. 1989.
    • (1989) IEEE Trans. Acoust., Speech, Signal Processing , vol.37 , pp. 1649-1658
    • Lee C-H1    Rabiner, L.2
  • 39
    • 0002629270 scopus 로고
    • Maximum likelihood from incomplete data via the EM algorithm
    • A. Dempster, N. Laird, and D. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. Royal Statist. Soc., vol. 39, pp. 1-38, 1977.
    • (1977) J. Royal Statist. Soc. , vol.39 , pp. 1-38
    • Dempster, A.1    Laird, N.2    Rubin, D.3
  • 40
    • 0000353178 scopus 로고
    • A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains
    • L. Baum, T. Pétrie, G. Soûles, and N. Weiss, "A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains," Ann. Math. Statist., vol. 41, no. 1, pp. 164-171, 1970.
    • (1970) Ann. Math. Statist. , vol.41 , Issue.1 , pp. 164-171
    • Baum, L.1    Pétrie, T.2    Soûles, G.3    Weiss, N.4
  • 41
    • 0022097649 scopus 로고
    • Maximum-likelihood estimation for mixture multivariate stochastic observations of Markov chains
    • B.-H. Juang, "Maximum-likelihood estimation for mixture multivariate stochastic observations of Markov chains," AT&T Tech. J., vol. 64, no. 6, pp. 1235-1249, 1985.
    • (1985) AT&T Tech. J. , vol.64 , Issue.6 , pp. 1235-1249
    • Juang, B.-H.1
  • 42
    • 0022712081 scopus 로고
    • A segmental K-means training procedure for connected word recognition
    • May
    • L. R. Rabiner, J. Wilpon, and B.-H. Juang, "A segmental K-means training procedure for connected word recognition," AT&T Tech. J., vol. 64, pp. 21, May 1986.
    • (1986) AT&T Tech. J. , vol.64 , pp. 21
    • Rabiner, L.R.1    Wilpon, J.2    Juang, B.-H.3
  • 43
    • 0015600423 scopus 로고
    • The Viterbi algorithm
    • Mar.
    • G. Forney, "The Viterbi algorithm," Proc. IEEE, vol. 61, pp. 268-278, Mar. 1973.
    • (1973) Proc. IEEE , vol.61 , pp. 268-278
    • Forney, G.1
  • 45
    • 0026854591 scopus 로고
    • Improved acoustic modeling for large vocabulary continuous speech recognition
    • C.-H. Lee, E. Giachin, L. Rabiner, R. Pieraccini, and A. Rosenberg, "Improved acoustic modeling for large vocabulary continuous speech recognition," Comput. Speech Lang., vol. 6, pp. 103-127, 1992.
    • (1992) Comput. Speech Lang. , vol.6 , pp. 103-127
    • Lee C-H1    Giachin, E.2    Rabiner, L.3    Pieraccini, R.4    Rosenberg, A.5
  • 46
    • 0028419019 scopus 로고
    • Maximum a posterior estimation for multivariate Gaussian mixture observations of Markov chains
    • Apr.
    • J. Gauvain and C.-H. Lee, "Maximum a posterior estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Processing, vol. 2, pp. 291-298, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 291-298
    • Gauvain, J.1    Lee C-H2
  • 47
    • 0027877970 scopus 로고
    • Large vocabulary speech recognition using subword units
    • C.-H. Lee, J.-L. Gauvain, R. Pieraccini, and L. Rabiner, "Large vocabulary speech recognition using subword units," Speech Commun., vol. 13, pp. 263-279, 1993.
    • (1993) Speech Commun. , vol.13 , pp. 263-279
    • Lee, C.-H.1    Gauvain, J.-L.2    Pieraccini, R.3    Rabiner, L.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.