메뉴 건너뛰기




Volumn 5, Issue 1, 1997, Pages 33-44

Stochastic trajectory modeling and sentence searching for continuous speech recognition

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTIC SIGNAL PROCESSING; MARKOV PROCESSES; MATHEMATICAL MODELS; PROBABILITY DENSITY FUNCTION; SPEECH ANALYSIS; VECTORS;

EID: 0030784572     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/89.554267     Document Type: Article
Times cited : (34)

References (42)
  • 1
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Feb.
    • L. R. Rabiner, A tutorial on hidden Markov models and selected applications in speech recognition. Proc. IEEE, vol. 77, no. 2, pp. 257-285, Feb. 1989.
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-285
    • Rabiner, L.R.1
  • 2
    • 0001862769 scopus 로고    scopus 로고
    • An inequality and associated maximation technique in statistical estimation for probabilistic functions of Markov processes
    • O. Shisha, Ed., New York: Academic
    • L. E. Baum, An inequality and associated maximation technique in statistical estimation for probabilistic functions of Markov processes O. Shisha, Ed., Inequalities-Ill. New York: Academic, pp. 1-8.
    • Inequalities-Ill , pp. 1-8
    • Baum, L.E.1
  • 3
    • 0041769048 scopus 로고
    • Segmental phoneme recognition using piecewise linear regression
    • Adelaide, Australia, Apr.
    • S. Krishnan and P. V. S. Rao, Segmental phoneme recognition using piecewise linear regression Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, vol. 1, Adelaide, Australia, Apr. 1994, pp. 49-52.
    • (1994) Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , vol.1 , pp. 49-52
    • Krishnan, S.1    Rao, P.V.S.2
  • 4
    • 0022667694 scopus 로고
    • Speaker-independent isolated word recognition using dynamic features of speech spectrum
    • S. Furui, Speaker-independent isolated word recognition using dynamic features of speech spectrum, IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-34, no. 1, pp. 53-59, 1986.
    • (1986) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-34 , Issue.1 , pp. 53-59
    • Furui, S.1
  • 7
    • 0023211846 scopus 로고
    • Explicit time correlation in hidden Markov models for speech recognition
    • Dallas, TX
    • C. J. Wellekens, Explicit time correlation in hidden Markov models for speech recognition Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, Dallas, TX, 1987, pp. 384-387.
    • (1987) Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , pp. 384-387
    • Wellekens, C.J.1
  • 8
    • 18544404092 scopus 로고
    • Use of temporal correlation between successive frames in a hidden Markov model based speech recognizer
    • K. K. Paliwal, Use of temporal correlation between successive frames in a hidden Markov model based speech recognizer Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, 1993, vol. 2, pp. 215-218.
    • (1993) Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , vol.2 , pp. 215-218
    • Paliwal, K.K.1
  • 10
    • 0005840635 scopus 로고
    • A real-time recurrent error propagation network word recognition system
    • T. Robinson, A real-time recurrent error propagation network word recognition system Int. Conf. Acoust., Speech, Signal Processing, vol. 1, 1992, pp. 617-620.
    • (1992) Int. Conf. Acoust., Speech, Signal Processing , vol.1 , pp. 617-620
    • Robinson, T.1
  • 11
    • 0026821564 scopus 로고
    • Modeling acoustic transitions in speech by state-interpolation hidden Markov models
    • Feb.
    • L. Deng, P. Kenny, M. Lenning, and P. Mermelstein, Modeling acoustic transitions in speech by state-interpolation hidden Markov models, IEEE Trans. Signal Processing, vol. 40, no. 2, pp. 265-271, Feb. 1992.
    • (1992) IEEE Trans. Signal Processing , vol.40 , Issue.2 , pp. 265-271
    • Deng, L.1    Kenny, P.2    Lenning, M.3    Mermelstein, P.4
  • 13
    • 0024900279 scopus 로고
    • A stochastic segment model for phonemebased continuous speech recognition
    • Dec.
    • M. Ostendorf and S. Roucos, A stochastic segment model for phonemebased continuous speech recognition, IEEE Trans. Acoust, Speech, Signal Processing, vol. 37, no. 12, pp. 1857-1869, Dec. 1989.
    • (1989) IEEE Trans. Acoust, Speech, Signal Processing , vol.37 , Issue.12 , pp. 1857-1869
    • Ostendorf, M.1    Roucos, S.2
  • 15
    • 0026991192 scopus 로고
    • Fast algorithms for phone classification and recognition using segment-based models
    • Dec.
    • V. V. Digalakis, M. Ostendorf, and J.R. Rohlicek, Fast algorithms for phone classification and recognition using segment-based models, IEEE Trans. Signal Processing, vol.49, no. 12, pp. 2885-2896, Dec. 1992.
    • (1992) IEEE Trans. Signal Processing, Vol. , vol.49 , Issue.12 , pp. 2885-2896
    • Digalakis, V.V.1    Ostendorf, M.2    Rohlicek, J.R.3
  • 16
    • 0027681974 scopus 로고
    • ML estimation of a stochastic linear system with em algorithm and its application to speech recognition
    • Oct.
    • V. V. Digalakis, J. R. Rohlicek, and M. Ostendorf, ML estimation of a stochastic linear system with EM algorithm and its application to speech recognition, IEEE Trans. Speech Audio Processing, vol. 1, no. 4, pp. 431-442, Oct. 1993.
    • (1993) IEEE Trans. Speech Audio Processing , vol.1 , Issue.4 , pp. 431-442
    • Digalakis, V.V.1    Rohlicek, J.R.2    Ostendorf, M.3
  • 17
    • 0026854213 scopus 로고
    • A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal
    • L. Deng, A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal, Signal Processing, vol. 27, no. 1, pp. 65-78, 1992.
    • (1992) Signal Processing , vol.27 , Issue.1 , pp. 65-78
    • Deng, L.1
  • 18
    • 0028516022 scopus 로고
    • Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states
    • L. Deng, M. Asmanovic, D. Sun, and J. Wu, Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states, IEEE Trans. Speech Audio Processing, vol. 2, no. 4, pp. 507-520, 1994.
    • (1994) IEEE Trans. Speech Audio Processing , vol.2 , Issue.4 , pp. 507-520
    • Deng, L.1    Asmanovic, M.2    Sun, D.3    Wu, J.4
  • 19
    • 0028513410 scopus 로고
    • State-dependent time warping in the trended hidden Markov model
    • X. D. Sun, L. Deng, and C. F. J. Wu, State-dependent time warping in the trended hidden Markov model, Signal Processing, vol. 39, no. 3, pp. 263-275, 1994.
    • (1994) Signal Processing , vol.39 , Issue.3 , pp. 263-275
    • Sun, X.D.1    Deng, L.2    Wu, C.F.J.3
  • 20
    • 0027578207 scopus 로고
    • Hidden Markov models with templates as nonstationary states: An application to speech recognition
    • Apr.
    • O. Ghitza and M. M. Sondhi, Hidden Markov models with templates as nonstationary states: An application to speech recognition, Comput., Speech, Language, vol. 2, pp. 101-119, Apr. 1993.
    • (1993) Comput., Speech, Language , vol.2 , pp. 101-119
    • Ghitza, O.1    Sondhi, M.M.2
  • 21
    • 33646906381 scopus 로고
    • Phoneme-based continuous speech recognition without presegmentation
    • pp. Edinburgh, Scotland, Sept.
    • Y. Gong and J.-P. Haton, Phoneme-based continuous speech recognition without presegmentation Proc. Europ. Conf. Speech Technol, vol. 1, pp. Edinburgh, Scotland, Sept. 1987, pp. 121-124.
    • (1987) Proc. Europ. Conf. Speech Technol , vol.1 , pp. 121-124
    • Gong, Y.1    Haton, J.-P.2
  • 22
    • 0026124299 scopus 로고
    • Signal-to-string conversion based on high likelihood regions using embedded dynamic programming
    • Mar.
    • _, Signal-to-string conversion based on high likelihood regions using embedded dynamic programming, IEEE Trans. Pattern Anal. Machine Intell., vol. 13, no. 3, pp. 297-302, Mar. 1991.
    • (1991) IEEE Trans. Pattern Anal. Machine Intell. , vol.13 , Issue.3 , pp. 297-302
  • 24
    • 33646949590 scopus 로고
    • VINICS: A continuous speech recognizer based on a new robust formulation
    • Genova, Italy, Sept.
    • Y. Gong and J.-P. Haton, VINICS: A continuous speech recognizer based on a new robust formulation Proc. Europ. Conf. Speech Commun. Technol., vol. III, Genova, Italy, Sept. 1991, pp. 1221-1224.
    • (1991) Proc. Europ. Conf. Speech Commun. Technol. , vol.3 , pp. 1221-1224
    • Gong, Y.1    Haton, J.-P.2
  • 25
    • 0003384830 scopus 로고
    • Stochastic trajectory modeling for speech recognition
    • Adelaide, Australia, Apr.
    • _, Stochastic trajectory modeling for speech recognition Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, vol. I, Adelaide, Australia, Apr. 1994, pp. 57-60.
    • (1994) Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , vol.1 , pp. 57-60
  • 26
    • 33646907303 scopus 로고
    • Issues in acoustic modeling of speech for automatic speech recognition
    • H. Niemann, R. De Mon, and G. Hanrieder, Eds, INFIX: Sankt Augustin, Sept.
    • Y. Gong, J.-P. Haton, and J.-F. Mari, Issues in acoustic modeling of speech for automatic speech recognition H. Niemann, R. De Mon, and G. Hanrieder, Eds, Progress and Prospects of Speech Research and Technology. INFIX: Sankt Augustin, Sept. 1994.
    • (1994) Progress and Prospects of Speech Research and Technology
    • Gong, Y.1    Haton, J.-P.2    Mari, J.-F.3
  • 29
    • 0039492227 scopus 로고
    • Non-linear time alignment in stochastic trajectory models for speech recognition
    • Yokohama, Japan, Sept.
    • M. Afify, Y. Gong, and J.-P. Haton, Non-linear time alignment in stochastic trajectory models for speech recognition Proc. Int. Conf. Spoken Language Processing '94, vol. 1, Yokohama, Japan, Sept. 1994, pp. 291-293.
    • (1994) Proc. Int. Conf. Spoken Language Processing '94 , vol.1 , pp. 291-293
    • Afify, M.1    Gong, Y.2    Haton, J.-P.3
  • 30
    • 0000007140 scopus 로고
    • Recursive Bayesian estimation using Gaussian sums
    • H. W. Sorenson and D. L. Alspach, Recursive Bayesian estimation using Gaussian sums, Automatica, vol. 7, pp. 465-497, 1971.
    • (1971) Automatica , vol.7 , pp. 465-497
    • Sorenson, H.W.1    Alspach, D.L.2
  • 31
    • 0025807354 scopus 로고
    • Development of an acoustic-phonetic hidden Markov model for continuous speech recognition
    • Jan.
    • A. Ljolje and S. E. Levinson, Development of an acoustic-phonetic hidden Markov model for continuous speech recognition, IEEE Trans. Signal Processing, vol. 39, no. 1, pp. 29-39, Jan. 1991.
    • (1991) IEEE Trans. Signal Processing , vol.39 , Issue.1 , pp. 29-39
    • Ljolje, A.1    Levinson, S.E.2
  • 33
    • 0018918171 scopus 로고
    • An algorithm for the vector quantizer design
    • Jan
    • Y. Linde, A. Buzo, and R. M. Gray, An algorithm for the vector quantizer design, IEEE Trans. Commun., vol. COM-28, no. 1, pp. 84-95, Jan 1980.
    • (1980) IEEE Trans. Commun. , vol.COM-28 , Issue.1 , pp. 84-95
    • Linde, Y.1    Buzo, A.2    Gray, R.M.3
  • 35
    • 0347321459 scopus 로고
    • DTW-based phonetic labeling using explicit phoneme duration constraints
    • Banff, Canada, Oct.
    • Y. Gong and J.-P. Haton, DTW-based phonetic labeling using explicit phoneme duration constraints Proc. Int. Conf. Spoken Language Processing '92, vol. II, Banff, Canada, Oct. 1992, pp. 863-866.
    • (1992) Proc. Int. Conf. Spoken Language Processing '92 , vol.2 , pp. 863-866
    • Gong, Y.1    Haton, J.-P.2
  • 36
    • 0038899479 scopus 로고
    • Iterative transformation and alignment for speech labeling
    • Berlin, Germany, Sept.
    • _, Iterative transformation and alignment for speech labeling Proc. Europ. Conf. Speech Commun. Technol., vol. 3, Berlin, Germany, Sept. 1993, pp. 1759-1762.
    • (1993) Proc. Europ. Conf. Speech Commun. Technol. , vol.3 , pp. 1759-1762
  • 37
    • 0003483593 scopus 로고
    • HTK: Hidden Markov model toolkit V1.4 reference manual
    • Speech Group, Cambridge Univ. Eng. Dept., Cambridge, England, Sept.
    • S. J. Young, HTK: Hidden Markov model toolkit V1.4 reference manual, Tech. Rep., Speech Group, Cambridge Univ. Eng. Dept., Cambridge, England, Sept. 1992.
    • (1992) Tech. Rep.
    • Young, S.J.1
  • 38
    • 33646917089 scopus 로고
    • Modeling and search in continuous speech recognition
    • Berlin, Germany
    • H. Ney, Modeling and search in continuous speech recognition Proc. Etirop. Conf. Speech Technol, vol. 1, Berlin, Germany, 1993, pp. 491-498.
    • (1993) Proc. Etirop. Conf. Speech Technol , vol.1 , pp. 491-498
    • Ney, H.1
  • 41
    • 0022685753 scopus 로고
    • Continuously variable duration hidden Markov models for automatic speech recognition
    • S. E. Levinson, Continuously variable duration hidden Markov models for automatic speech recognition, Comput., Speech Language, vol. 1, no. 1, pp. 29-45, 1986.
    • (1986) Comput., Speech Language , vol.1 , Issue.1 , pp. 29-45
    • Levinson, S.E.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.