SCOPUS 정보 검색 플랫폼

IEEE Transactions on Speech and Audio Processing

Volumn 5, Issue 1, 1997, Pages 33-44

Stochastic trajectory modeling and sentence searching for continuous speech recognition

(1) Gong, Yifan a,b

a INRIA (France)

b TEXAS INSTRUMENTS (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTIC SIGNAL PROCESSING; MARKOV PROCESSES; MATHEMATICAL MODELS; PROBABILITY DENSITY FUNCTION; SPEECH ANALYSIS; VECTORS;

HIDDEN MARKOV MODELS (HMM); SENTENCE SEARCHING; STOCHASTIC TRAJECTORY MODELING; WORD ERROR RATE;

SPEECH RECOGNITION;

EID: 0030784572 PISSN: 10636676 EISSN: None Source Type: Journal
DOI: 10.1109/89.554267 Document Type: Article

Times cited : (34)

References (42)

1
- 0024610919
- A tutorial on hidden Markov models and selected applications in speech recognition
- Feb.
- L. R. Rabiner, A tutorial on hidden Markov models and selected applications in speech recognition. Proc. IEEE, vol. 77, no. 2, pp. 257-285, Feb. 1989.
- (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-285
- Rabiner, L.R.¹

2
- 0001862769
- An inequality and associated maximation technique in statistical estimation for probabilistic functions of Markov processes
- O. Shisha, Ed., New York: Academic
- L. E. Baum, An inequality and associated maximation technique in statistical estimation for probabilistic functions of Markov processes O. Shisha, Ed., Inequalities-Ill. New York: Academic, pp. 1-8.
- Inequalities-Ill , pp. 1-8
- Baum, L.E.¹

3
- 0041769048
- Segmental phoneme recognition using piecewise linear regression
- Adelaide, Australia, Apr.
- S. Krishnan and P. V. S. Rao, Segmental phoneme recognition using piecewise linear regression Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, vol. 1, Adelaide, Australia, Apr. 1994, pp. 49-52.
- (1994) Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , vol.1 , pp. 49-52
- Krishnan, S.¹ Rao, P.V.S.²

4
- 0022667694
- Speaker-independent isolated word recognition using dynamic features of speech spectrum
- S. Furui, Speaker-independent isolated word recognition using dynamic features of speech spectrum, IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-34, no. 1, pp. 53-59, 1986.
- (1986) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-34 , Issue.1 , pp. 53-59
- Furui, S.¹

5
- 0003539541
- Ph.D. dissertation, Carnegie-Mellon Univ., Pittsburgh, PA
- K. F. Lee, Large vocabulary speaker-independent continuous speech recognition: The SPHINX system, Ph.D. dissertation, Carnegie-Mellon Univ., Pittsburgh, PA, 1988.
- (1988) Large Vocabulary Speaker-independent Continuous Speech Recognition: the SPHINX System
- Lee, K.F.¹

6
- 0003572996
- Ph.D. dissertation, School Comput. Sei., Carnegie-Mellon Univ., Pittsburgh, PA
- P. F. Brown, The acoustic modeling problem in automatic speech recognition, Ph.D. dissertation, School Comput. Sei., Carnegie-Mellon Univ., Pittsburgh, PA, 1987.
- (1987) The Acoustic Modeling Problem in Automatic Speech Recognition
- Brown, P.F.¹

7
- 0023211846
- Explicit time correlation in hidden Markov models for speech recognition
- Dallas, TX
- C. J. Wellekens, Explicit time correlation in hidden Markov models for speech recognition Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, Dallas, TX, 1987, pp. 384-387.
- (1987) Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , pp. 384-387
- Wellekens, C.J.¹

8
- 18544404092
- Use of temporal correlation between successive frames in a hidden Markov model based speech recognizer
- K. K. Paliwal, Use of temporal correlation between successive frames in a hidden Markov model based speech recognizer Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, 1993, vol. 2, pp. 215-218.
- (1993) Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , vol.2 , pp. 215-218
- Paliwal, K.K.¹

9
- 0027309782
- Phoneme HMMS constrained by frame correlations
- S. Takahashi, T. Matsuoka, Y. Minami, and K. Shikano. Phoneme HMMS constrained by frame correlations Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, vol. 2, 1993, pp. 219-222.
- (1993) Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , vol.2 , pp. 219-222
- Takahashi, S.¹ Matsuoka, T.² Minami, Y.³ Shikano, K.⁴

10
- 0005840635
- A real-time recurrent error propagation network word recognition system
- T. Robinson, A real-time recurrent error propagation network word recognition system Int. Conf. Acoust., Speech, Signal Processing, vol. 1, 1992, pp. 617-620.
- (1992) Int. Conf. Acoust., Speech, Signal Processing , vol.1 , pp. 617-620
- Robinson, T.¹

11
- 0026821564
- Modeling acoustic transitions in speech by state-interpolation hidden Markov models
- Feb.
- L. Deng, P. Kenny, M. Lenning, and P. Mermelstein, Modeling acoustic transitions in speech by state-interpolation hidden Markov models, IEEE Trans. Signal Processing, vol. 40, no. 2, pp. 265-271, Feb. 1992.
- (1992) IEEE Trans. Signal Processing , vol.40 , Issue.2 , pp. 265-271
- Deng, L.¹ Kenny, P.² Lenning, M.³ Mermelstein, P.⁴

12
- 0022270364
- Mixture autoregressive hidden Markov models
- B. H. Juang and L. R. Rabiner, Mixture autoregressive hidden Markov models, IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-33, no. 6, pp. 1404-1413, 1985.
- (1985) IEEE Trans. Acoust., Speech, Signal Processing , vol.33 ASSP- , Issue.6 , pp. 1404-1413
- Juang, B.H.¹ Rabiner, L.R.²

13
- 0024900279
- A stochastic segment model for phonemebased continuous speech recognition
- Dec.
- M. Ostendorf and S. Roucos, A stochastic segment model for phonemebased continuous speech recognition, IEEE Trans. Acoust, Speech, Signal Processing, vol. 37, no. 12, pp. 1857-1869, Dec. 1989.
- (1989) IEEE Trans. Acoust, Speech, Signal Processing , vol.37 , Issue.12 , pp. 1857-1869
- Ostendorf, M.¹ Roucos, S.²

14
- 0023248924
- A stochastic segment model for phoneme-based continuous speech recognition
- Dallas, TX
- S. Roucos and M. O. Dunham, A stochastic segment model for phoneme-based continuous speech recognition Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, Dallas, TX, 1987, pp. 73-76.
- (1987) Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , pp. 73-76
- Roucos, S.¹ Dunham, M.O.²

15
- 0026991192
- Fast algorithms for phone classification and recognition using segment-based models
- Dec.
- V. V. Digalakis, M. Ostendorf, and J.R. Rohlicek, Fast algorithms for phone classification and recognition using segment-based models, IEEE Trans. Signal Processing, vol.49, no. 12, pp. 2885-2896, Dec. 1992.
- (1992) IEEE Trans. Signal Processing, Vol. , vol.49 , Issue.12 , pp. 2885-2896
- Digalakis, V.V.¹ Ostendorf, M.² Rohlicek, J.R.³

16
- 0027681974
- ML estimation of a stochastic linear system with em algorithm and its application to speech recognition
- Oct.
- V. V. Digalakis, J. R. Rohlicek, and M. Ostendorf, ML estimation of a stochastic linear system with EM algorithm and its application to speech recognition, IEEE Trans. Speech Audio Processing, vol. 1, no. 4, pp. 431-442, Oct. 1993.
- (1993) IEEE Trans. Speech Audio Processing , vol.1 , Issue.4 , pp. 431-442
- Digalakis, V.V.¹ Rohlicek, J.R.² Ostendorf, M.³

17
- 0026854213
- A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal
- L. Deng, A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal, Signal Processing, vol. 27, no. 1, pp. 65-78, 1992.
- (1992) Signal Processing , vol.27 , Issue.1 , pp. 65-78
- Deng, L.¹

18
- 0028516022
- Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states
- L. Deng, M. Asmanovic, D. Sun, and J. Wu, Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states, IEEE Trans. Speech Audio Processing, vol. 2, no. 4, pp. 507-520, 1994.
- (1994) IEEE Trans. Speech Audio Processing , vol.2 , Issue.4 , pp. 507-520
- Deng, L.¹ Asmanovic, M.² Sun, D.³ Wu, J.⁴

19
- 0028513410
- State-dependent time warping in the trended hidden Markov model
- X. D. Sun, L. Deng, and C. F. J. Wu, State-dependent time warping in the trended hidden Markov model, Signal Processing, vol. 39, no. 3, pp. 263-275, 1994.
- (1994) Signal Processing , vol.39 , Issue.3 , pp. 263-275
- Sun, X.D.¹ Deng, L.² Wu, C.F.J.³

20
- 0027578207
- Hidden Markov models with templates as nonstationary states: An application to speech recognition
- Apr.
- O. Ghitza and M. M. Sondhi, Hidden Markov models with templates as nonstationary states: An application to speech recognition, Comput., Speech, Language, vol. 2, pp. 101-119, Apr. 1993.
- (1993) Comput., Speech, Language , vol.2 , pp. 101-119
- Ghitza, O.¹ Sondhi, M.M.²

21
- 33646906381
- Phoneme-based continuous speech recognition without presegmentation
- pp. Edinburgh, Scotland, Sept.
- Y. Gong and J.-P. Haton, Phoneme-based continuous speech recognition without presegmentation Proc. Europ. Conf. Speech Technol, vol. 1, pp. Edinburgh, Scotland, Sept. 1987, pp. 121-124.
- (1987) Proc. Europ. Conf. Speech Technol , vol.1 , pp. 121-124
- Gong, Y.¹ Haton, J.-P.²

22
- 0026124299
- Signal-to-string conversion based on high likelihood regions using embedded dynamic programming
- Mar.
- _, Signal-to-string conversion based on high likelihood regions using embedded dynamic programming, IEEE Trans. Pattern Anal. Machine Intell., vol. 13, no. 3, pp. 297-302, Mar. 1991.
- (1991) IEEE Trans. Pattern Anal. Machine Intell. , vol.13 , Issue.3 , pp. 297-302

23
- 0026370313
- Continuous speech recognition based on high plausibility regions
- Toronto, Canada, May
- Y. Gong, J.-P. Haton, and F. Mouria, Continuous speech recognition based on high plausibility regions Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, vol. 1, Toronto, Canada, May 1991, pp. 725-728.
- (1991) Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , vol.1 , pp. 725-728
- Gong, Y.¹ Haton, J.-P.² Mouria, F.³

24
- 33646949590
- VINICS: A continuous speech recognizer based on a new robust formulation
- Genova, Italy, Sept.
- Y. Gong and J.-P. Haton, VINICS: A continuous speech recognizer based on a new robust formulation Proc. Europ. Conf. Speech Commun. Technol., vol. III, Genova, Italy, Sept. 1991, pp. 1221-1224.
- (1991) Proc. Europ. Conf. Speech Commun. Technol. , vol.3 , pp. 1221-1224
- Gong, Y.¹ Haton, J.-P.²

25
- 0003384830
- Stochastic trajectory modeling for speech recognition
- Adelaide, Australia, Apr.
- _, Stochastic trajectory modeling for speech recognition Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, vol. I, Adelaide, Australia, Apr. 1994, pp. 57-60.
- (1994) Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , vol.1 , pp. 57-60

26
- 33646907303
- Issues in acoustic modeling of speech for automatic speech recognition
- H. Niemann, R. De Mon, and G. Hanrieder, Eds, INFIX: Sankt Augustin, Sept.
- Y. Gong, J.-P. Haton, and J.-F. Mari, Issues in acoustic modeling of speech for automatic speech recognition H. Niemann, R. De Mon, and G. Hanrieder, Eds, Progress and Prospects of Speech Research and Technology. INFIX: Sankt Augustin, Sept. 1994.
- (1994) Progress and Prospects of Speech Research and Technology
- Gong, Y.¹ Haton, J.-P.² Mari, J.-F.³

27
- 0003663467
- New York: McGraw-Hill
- A. Papoulis. Probability, Random Variables, and Stochastic Processes, 3rd ed. New York: McGraw-Hill, 1991.
- (1991) Probability, Random Variables, and Stochastic Processes, 3rd Ed.
- Papoulis, A.¹

28
- 85135330883
- Statistical trajectory models for phonetic recognition
- Yokohama, Japan, Sept.
- W. D. Goldenthal and J. R. Glass, Statistical trajectory models for phonetic recognition Proc. Int. Conf. Spoken Language Processing '94, Yokohama, Japan, Sept. 1994, pp. 1871-1873.
- (1994) Proc. Int. Conf. Spoken Language Processing '94 , pp. 1871-1873
- Goldenthal, W.D.¹ Glass, J.R.²

29
- 0039492227
- Non-linear time alignment in stochastic trajectory models for speech recognition
- Yokohama, Japan, Sept.
- M. Afify, Y. Gong, and J.-P. Haton, Non-linear time alignment in stochastic trajectory models for speech recognition Proc. Int. Conf. Spoken Language Processing '94, vol. 1, Yokohama, Japan, Sept. 1994, pp. 291-293.
- (1994) Proc. Int. Conf. Spoken Language Processing '94 , vol.1 , pp. 291-293
- Afify, M.¹ Gong, Y.² Haton, J.-P.³

30
- 0000007140
- Recursive Bayesian estimation using Gaussian sums
- H. W. Sorenson and D. L. Alspach, Recursive Bayesian estimation using Gaussian sums, Automatica, vol. 7, pp. 465-497, 1971.
- (1971) Automatica , vol.7 , pp. 465-497
- Sorenson, H.W.¹ Alspach, D.L.²

31
- 0025807354
- Development of an acoustic-phonetic hidden Markov model for continuous speech recognition
- Jan.
- A. Ljolje and S. E. Levinson, Development of an acoustic-phonetic hidden Markov model for continuous speech recognition, IEEE Trans. Signal Processing, vol. 39, no. 1, pp. 29-39, Jan. 1991.
- (1991) IEEE Trans. Signal Processing , vol.39 , Issue.1 , pp. 29-39
- Ljolje, A.¹ Levinson, S.E.²

32
- 0003823974
- Englewood Cliffs, NJ: Prentice-Hall
- J. M. Mendel, Lessons in Digital Estimation Tlieoty. Englewood Cliffs, NJ: Prentice-Hall, 1987.
- (1987) Lessons in Digital Estimation Tlieoty
- Mendel, J.M.¹

33
- 0018918171
- An algorithm for the vector quantizer design
- Jan
- Y. Linde, A. Buzo, and R. M. Gray, An algorithm for the vector quantizer design, IEEE Trans. Commun., vol. COM-28, no. 1, pp. 84-95, Jan 1980.
- (1980) IEEE Trans. Commun. , vol.COM-28 , Issue.1 , pp. 84-95
- Linde, Y.¹ Buzo, A.² Gray, R.M.³

34
- 0017789489
- Dynamic programming, the Viterbi algorithm, and low cost speech recognition
- G. M. White, Dynamic programming, the Viterbi algorithm, and low cost speech recognition Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, 1978, pp. 413-417.
- (1978) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , pp. 413-417
- White, G.M.¹

35
- 0347321459
- DTW-based phonetic labeling using explicit phoneme duration constraints
- Banff, Canada, Oct.
- Y. Gong and J.-P. Haton, DTW-based phonetic labeling using explicit phoneme duration constraints Proc. Int. Conf. Spoken Language Processing '92, vol. II, Banff, Canada, Oct. 1992, pp. 863-866.
- (1992) Proc. Int. Conf. Spoken Language Processing '92 , vol.2 , pp. 863-866
- Gong, Y.¹ Haton, J.-P.²

36
- 0038899479
- Iterative transformation and alignment for speech labeling
- Berlin, Germany, Sept.
- _, Iterative transformation and alignment for speech labeling Proc. Europ. Conf. Speech Commun. Technol., vol. 3, Berlin, Germany, Sept. 1993, pp. 1759-1762.
- (1993) Proc. Europ. Conf. Speech Commun. Technol. , vol.3 , pp. 1759-1762

37
- 0003483593
- HTK: Hidden Markov model toolkit V1.4 reference manual
- Speech Group, Cambridge Univ. Eng. Dept., Cambridge, England, Sept.
- S. J. Young, HTK: Hidden Markov model toolkit V1.4 reference manual, Tech. Rep., Speech Group, Cambridge Univ. Eng. Dept., Cambridge, England, Sept. 1992.
- (1992) Tech. Rep.
- Young, S.J.¹

38
- 33646917089
- Modeling and search in continuous speech recognition
- Berlin, Germany
- H. Ney, Modeling and search in continuous speech recognition Proc. Etirop. Conf. Speech Technol, vol. 1, Berlin, Germany, 1993, pp. 491-498.
- (1993) Proc. Etirop. Conf. Speech Technol , vol.1 , pp. 491-498
- Ney, H.¹

39
- 0002585974
- Variable duration models for speech
- Princeton, NJ
- J. D. Ferguson, Variable duration models for speech Proc. Symp. Applic. Hidden Markov Models Text Speech, Princeton, NJ, 1980, pp. 143-179.
- (1980) Proc. Symp. Applic. Hidden Markov Models Text Speech , pp. 143-179
- Ferguson, J.D.¹

40
- 0022234383
- Explicit modeling of state occupancy in hidden Markov models for automatic speech recognition
- M. J. Russell and R. K. Moore, Explicit modeling of state occupancy in hidden Markov models for automatic speech recognition Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, 1985, pp. 5-8.
- (1985) Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , pp. 5-8
- Russell, M.J.¹ Moore, R.K.²

41
- 0022685753
- Continuously variable duration hidden Markov models for automatic speech recognition
- S. E. Levinson, Continuously variable duration hidden Markov models for automatic speech recognition, Comput., Speech Language, vol. 1, no. 1, pp. 29-45, 1986.
- (1986) Comput., Speech Language , vol.1 , Issue.1 , pp. 29-45
- Levinson, S.E.¹

42
- 0000009468
- New York: Springer-Verlag
- L. R. Rabiner, Mathematical Foundations of Hidden Markov Models, vol. F-46 of NATO ASl: Recent Advances in Speech Understanding and Dialog Systems. New York: Springer-Verlag, 1988, pp. 183-205.
- (1988) Mathematical Foundations of Hidden Markov Models, Vol. F-46 of NATO ASl: Recent Advances in Speech Understanding and Dialog Systems , pp. 183-205
- Rabiner, L.R.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.