SCOPUS Information Retrieval Platform

IEEE Transactions on Audio, Speech and Language Processing

Volume 15, Issue 1, 2007, Pages 246-256

Speech recognition using linear dynamic models

(2) Frankel, Joe (a); King, Simon (a)

(a) UNIVERSITY OF EDINBURGH (United Kingdom)

Author keywords

Automatic speech recognition (ASR); Linear dynamic models (LDMs); Stack decoding

Indexed keywords

ASYNCHRONOUS DECODING; AUTOMATIC SPEECH RECOGNITION (ASR); AUTOMATIC SPEECH RECOGNITION SYSTEMS; COVARIANCE MATRICES; DERIVATIVE INFORMATIONS; DYNAMIC STATE; FEATURE VECTORS; FIRST ORDERS; GAUSSIAN MIXTURES; LINEAR DYNAMIC MODELS (LDMS); LINEAR STATE-SPACE MODELS; OUTPUT DISTRIBUTIONS; PHONE RECOGNITION; SEGMENT MODELS; SPATIAL CORRELATIONS; STACK DECODING; STATIC MODELS; UNDERLYING DYNAMICS;

COVARIANCE MATRIX; DECODING; HIDDEN MARKOV MODELS; REMELTING; SPEECH ANALYSIS; SPEECH RECOGNITION; TELEPHONE SETS; TELEPHONE SYSTEMS;

DYNAMIC MODELS;

EID: 34547549792 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2006.876766 Document Type: Article

Times cited : (26)

References (28)

1
- 33846687633
- Linear Dynamic Models for Automatic Speech Recognition,
- Ph.D. dissertation, The Centre for Speech Technology Research, Univ. of Edinburgh, Edinburgh, U.K
- J. Frankel, "Linear Dynamic Models for Automatic Speech Recognition," Ph.D. dissertation, The Centre for Speech Technology Research, Univ. of Edinburgh, Edinburgh, U.K., 2003.
- (2003)
- Frankel, J.¹

2
- 0003938589
- Segment-Based Stochastic Models of Spectral Dynamics for Continuous Speech Recognition,
- Ph.D. dissertation, Boston Univ. Graduate School, Boston, MA
- V. Digalakis, "Segment-Based Stochastic Models of Spectral Dynamics for Continuous Speech Recognition," Ph.D. dissertation, Boston Univ. Graduate School, Boston, MA, 1992.
- (1992)
- Digalakis, V.¹

3
- 0033556862
- A unifying review of linear Gaussian models
- S. Roweis and Z. Ghahramani, "A unifying review of linear Gaussian models," Neural Comput., vol. 11, no. 2, 1999.
- (1999) Neural Comput , vol.11 , Issue.2
- Roweis, S.¹ Ghahramani, Z.²

4
- 64149125858
- Generalised Linear Gaussian Models Cambridge Univ. Engineering, Cambridge, U.K
- Tech. Rep. CUED/F-INFENG/ TR.420
- A. Rosti and M. Gales, Generalised Linear Gaussian Models Cambridge Univ. Engineering, Cambridge, U.K., Tech. Rep. CUED/F-INFENG/ TR.420, 2001.
- (2001)
- Rosti, A.¹ Gales, M.²

5
- 85024429815
- A new approach to linear filtering and prediction problems
- Mar
- R. Kalman, "A new approach to linear filtering and prediction problems," J. Basic Eng., vol. 82, pp. 35-44, Mar. 1960.
- (1960) J. Basic Eng , vol.82 , pp. 35-44
- Kalman, R.¹

6
- 84937741903
- Solutions to the linear smoothing problem
- H. E. Rauch, "Solutions to the linear smoothing problem," IEEE Trans. Automat. Contr., vol. 8, pp. 371-372, 1963.
- (1963) IEEE Trans. Automat. Contr , vol.8 , pp. 371-372
- Rauch, H.E.¹

7
- 0027681974
- ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition
- Oct
- V. Digalakis, J. Rohlicek, and M. Ostendorf, "ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition," IEEE Trans. Speech Audio Process., vol. 1, no. 4, pp. 431-442, Oct. 1993.
- (1993) IEEE Trans. Speech Audio Process , vol.1 , Issue.4 , pp. 431-442
- Digalakis, V.¹ Rohlicek, J.² Ostendorf, M.³

8
- 0027261926
- Speech recognition using dynamical model of speech production
- Minneapolis, MN
- K. Iso, "Speech recognition using dynamical model of speech production," in Proc. Int. Conf. Acoustics, Speech, and Signal Processing, Minneapolis, MN, 1993, vol. 2, pp. 283-286.
- (1993) Proc. Int. Conf. Acoustics, Speech, and Signal Processing , vol.2 , pp. 283-286
- Iso, K.¹

9
- 64149096831
- Center for Language and Speech Processing, Johns Hopkins Univ, Baltimore, MD, Tech. Rep
- J. Bridle, L. Deng, J. Picone, H. Richards, J. Ma, T. Kamm, M. Schuster, S. Pike, and R. Reagan, An Investigation of Segmental Hidden Dynamic Models of Speech Coarticulation for Automatic Speech Recognition Workshop on Language Engineering, Center for Language and Speech Processing, Johns Hopkins Univ., Baltimore, MD, 1998, Tech. Rep.
- (1998) An Investigation of Segmental Hidden Dynamic Models of Speech Coarticulation for Automatic Speech Recognition Workshop on Language Engineering
- Bridle, J.¹ Deng, L.² Picone, J.³ Richards, H.⁴ Ma, J.⁵ Kamm, T.⁶ Schuster, M.⁷ Pike, S.⁸ Reagan, R.⁹

10
- 0032639922
- Initial evaluation of hidden dynamic models on conversational speech
- Phoenix, AZ
- J. Picone, S. Pike, R. Regan, T. Kamm, J. Bridle, L. Deng, Z. Ma, H. Richards, and M. Schuster, "Initial evaluation of hidden dynamic models on conversational speech," in Proc. Int. Conf. Acoustics, Speech, and Signal Processing, Phoenix, AZ, 1999, vol. 1, pp. 109-112.
- (1999) Proc. Int. Conf. Acoustics, Speech, and Signal Processing , vol.1 , pp. 109-112
- Picone, J.¹ Pike, S.² Regan, R.³ Kamm, T.⁴ Bridle, J.⁵ Deng, L.⁶ Ma, Z.⁷ Richards, H.⁸ Schuster, M.⁹

11
- 0001523807
- A path-stack algorithm for optimizing dynamic regimes in a statistical hidden dynamic model of speech
- J. Ma and L. Deng, "A path-stack algorithm for optimizing dynamic regimes in a statistical hidden dynamic model of speech," Comput. Speech and Lang., vol. 14, no. 2, pp. 101-114, 2000.
- (2000) Comput. Speech and Lang , vol.14 , Issue.2 , pp. 101-114
- Ma, J.¹ Deng, L.²

12
- 0033623527
- Spontaneous speech recognition using a statistical coarticulatory model for the vocal-tract-resonance dynamics
- Dec
- L. Deng and J. Ma, "Spontaneous speech recognition using a statistical coarticulatory model for the vocal-tract-resonance dynamics," J. Acoust. Soc. Amer., vol. 108, no. 6, pp. 3036-3048, Dec. 2000.
- (2000) J. Acoust. Soc. Amer , vol.108 , Issue.6 , pp. 3036-3048
- Deng, L.¹ Ma, J.²

13
- 0742307392
- Target-directed mixture linear dynamic models for spontaneous speech recognition
- Jan
- J. Ma and L. Deng, "Target-directed mixture linear dynamic models for spontaneous speech recognition," IEEE Trans. Speech Audio Process., vol. 12, no. 1, pp. 47-58, Jan. 2004.
- (2004) IEEE Trans. Speech Audio Process , vol.12 , Issue.1 , pp. 47-58
- Ma, J.¹ Deng, L.²

14
- 0347761233
- A mixed-level switching dynamic system for continuous speech recognition
- J. Ma and L. Deng, "A mixed-level switching dynamic system for continuous speech recognition," Comput. Speech Lang., vol. 18, pp. 49-65, 2004.
- (2004) Comput. Speech Lang , vol.18 , pp. 49-65
- Ma, J.¹ Deng, L.²

15
- 0141702226
- Coarticulation modeling by embedding a target-directed hidden trajectory model into HMM-MAP decoding and evaluation
- Hong Kong, China
- F. Seide, J. Zhou, and L. Deng, "Coarticulation modeling by embedding a target-directed hidden trajectory model into HMM-MAP decoding and evaluation," in Proc. Int. Conf. Acoustics, Speech, and Signal Processing, Hong Kong, China, 2003, vol. 1, pp. 748-751.
- (2003) Proc. Int. Conf. Acoustics, Speech, and Signal Processing , vol.1 , pp. 748-751
- Seide, F.¹ Zhou, J.² Deng, L.³

16
- 15844394960
- Linear Gaussian Models for Speech Recognition,
- Ph.D. dissertation, Engineering Department, Cambridge Univ, Cambridge, U.K
- A.-V. I. Rosti, "Linear Gaussian Models for Speech Recognition," Ph.D. dissertation, Engineering Department, Cambridge Univ., Cambridge, U.K., 2004.
- (2004)
- Rosti, A.-V.I.¹

17
- 0002583871
- Speech database development: Design and analysis of the acoustic-phonetic corpus
- Palo Alto, CA, Feb
- L. Lamel, R. Kassel, and S. Seneff, "Speech database development: design and analysis of the acoustic-phonetic corpus," in Proc. Speech Recognition Workshop, Palo Alto, CA, Feb. 1986, pp. 100-109.
- (1986) Proc. Speech Recognition Workshop , pp. 100-109
- Lamel, L.¹ Kassel, R.² Seneff, S.³

18
- 0024768209
- Speaker-independent phone recognition using hidden Markov models
- Nov
- K. Lee and H. Hon, "Speaker-independent phone recognition using hidden Markov models," IEEE Trans. Acoust., Speech, Signal Process., vol. 37, no. 11, pp. 1641-1648, Nov. 1989.
- (1989) IEEE Trans. Acoust., Speech, Signal Process , vol.37 , Issue.11 , pp. 1641-1648
- Lee, K.¹ Hon, H.²

19
- 0004129646
- Cambridge, MA: The MIT Press
- K. Stevens, Acoustic Phonetics. Cambridge, MA: The MIT Press, 1998.
- (1998) Acoustic Phonetics
- Stevens, K.¹

20
- 84935113569
- A. Viterbi, "Error bounds for convolutional codes and an asymptotically optimal decoding algorithm," IEEE Trans. Inform. Theory, vol. 13, pp. 260-269, 1967.

21
- 64149088172
- S. Young, G. Evermann, D. Kershaw, G. Moore, J. Odell, D. Ollason, D. Povey, V. Valtchev, and P. Woodland, The HTK Book (for HTK Version 3.2). Cambridge, U.K.: Cambridge Univ., 2002.

22
- 0003663951
- New York: McGraw-Hill
- N. J. Nilsson, Problem-Solving Methods in Artificial Intelligence. New York: McGraw-Hill, 1971.
- (1971) Problem-Solving Methods in Artificial Intelligence
- Nilsson, N.J.¹

23
- 85017287102
- An efficient A* stack decoder algorithm for continuous speech recognition with a stochastic language model
- San Francisco, CA
- D. Paul, "An efficient A* stack decoder algorithm for continuous speech recognition with a stochastic language model," in Proc. ICASSP, San Francisco, CA, 1992, vol. 1, pp. 25-28.
- (1992) Proc. ICASSP , vol.1 , pp. 25-28
- Paul, D.¹

24
- 0010018471
- A*-admissible heuristics for rapid lexical access
- Jan
- P. Kenny, R. Hollan, V. Gupta, M. Lennig, P. Mermelstein, and D. O'Shaughnessy, "A*-admissible heuristics for rapid lexical access," IEEE Trans. Speech Audio Process., vol. 1, no. 1, pp. 49-58, Jan. 1993.
- (1993) IEEE Trans. Speech Audio Process , vol.1 , Issue.1 , pp. 49-58
- Kenny, P.¹ Hollan, R.² Gupta, V.³ Lennig, M.⁴ Mermelstein, P.⁵ O'Shaughnessy, D.⁶

25
- 64149121404
- S. Renals and M. Hochberg, Decoder Technology for Connectionist Large Vocabulary Speech Recognition Dept. Comput. Sci., Univ. Sheffield, Sheffield, U.K., Tech. Rep. +CS-95-17, 1995.

26
- 0003712010
- A General Method for Approximating Nonlinear Transformations of Probability Distributions Dept. Eng. Sci., Univ. Oxford, Oxford, U.K
- Tech. Rep
- S. Julier and J. Uhlmann, A General Method for Approximating Nonlinear Transformations of Probability Distributions Dept. Eng. Sci., Univ. Oxford, Oxford, U.K., 1996, Tech. Rep.
- (1996)
- Julier, S.¹ Uhlmann, J.²

27
- 0030263447
- Mean and variance adaptation within the MLLR framework
- M. Gales and P. Woodland, "Mean and variance adaptation within the MLLR framework," Comput., Speech and Lang., vol. 10, pp. 249-264, 1996.
- (1996) Comput., Speech and Lang , vol.10 , pp. 249-264
- Gales, M.¹ Woodland, P.²

28
- 0033677172
- Factored sparse inverse covariance matrices
- J. Bilmes, "Factored sparse inverse covariance matrices," in Proc. ICASSP 2000.
- Proc. ICASSP 2000
- Bilmes, J.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.