SCOPUS 정보 검색 플랫폼

IEEE Transactions on Speech and Audio Processing

Volumn 11, Issue 3, 2003, Pages 216-228

An enhanced dynamic time warping model for improved estimation of DTW parameters

(2) Yaniv, Ran a Burshtein, David a

a TEL AVIV UNIVERSITY (Israel)

Author keywords

Baum inequality; Dynamic time warping (DTW); Hidden Markov model (HMM); Speech recognition

Indexed keywords

ALGORITHMS; CONSTRAINT THEORY; ITERATIVE METHODS; MARKOV PROCESSES; MATHEMATICAL MODELS; MAXIMUM LIKELIHOOD ESTIMATION; PARAMETER ESTIMATION;

BAUM-WELCH ESTIMATION ALGORITHM; ENHANCED DYNAMIC TIME WARPING MODEL; HIDDEN MARKOV MODEL;

SPEECH RECOGNITION;

EID: 0038782382 PISSN: 10636676 EISSN: None Source Type: Journal
DOI: 10.1109/TSA.2003.811540 Document Type: Article

Times cited : (31)

References (27)

1
- 0005670423
- A dynamic programming approach to continous speech recognition
- H. Sakoe and S. Chiba, "A dynamic programming approach to continous speech recognition," in Proc. Int. Congress on Acoustics, Budapest, Hungary, 1971, 20 C-13.
- Proc. Int. Congress on Acoustics, Budapest, Hungary, 1971
- Sakoe, H.¹ Chiba, S.²

2
- 0017244215
- A statistical decision approach to the recognition of connected digits
- Dec.
- M. R. Sambur and L. R. Rabiner, "A statistical decision approach to the recognition of connected digits," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-24, pp. 550-558, Dec. 1976.
- (1976) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-24 , pp. 550-558
- Sambur, M.R.¹ Rabiner, L.R.²

3
- 56749117310
- Stochastic models and template matching: Some important relationships between two apparently different techniques for automatic speech recognition
- J. S. Bridle, "Stochastic models and template matching: Some important relationships between two apparently different techniques for automatic speech recognition," in Proc. Inst. Acoust., vol. 6, 1984, pp. 452a-452h.
- (1984) Proc. Inst. Acoust. , vol.6
- Bridle, J.S.¹

4
- 0038426396
- A segmental mixture model
- Ph.D., Univ. Wales, Swansea, U.K.
- R. Stapert, "A Segmental Mixture Model," Ph.D., Univ. Wales, Swansea, U.K., 2000.
- (2000)
- Stapert, R.¹

5
- 0017931304
- On creating reference templates for speaker independent recognition of isolated words
- Feb.
- L. R. Rabiner, "On creating reference templates for speaker independent recognition of isolated words," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-26, pp. 34-42, Feb. 1978.
- (1978) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-26 , pp. 34-42
- Rabiner, L.R.¹

6
- 0018320941
- Order dependence in templates for monosyllabic word identification
- S. B. Davis, "Order dependence in templates for monosyllabic word identification," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, Apr. 1979, pp. 570-573.
- Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, Apr. 1979 , pp. 570-573
- Davis, S.B.¹

7
- 0020141497
- Effects of speaker accent on the performance of a speaker-independent, isolated-word recognizer
- June
- V. Gupta and P. Mermelstein, "Effects of speaker accent on the performance of a speaker-independent, isolated-word recognizer," J. Acoust. Soc. Amer., vol. 71, no. 6, pp. 1581-1587, June 1982.
- (1982) J. Acoust. Soc. Amer. , vol.71 , Issue.6 , pp. 1581-1587
- Gupta, V.¹ Mermelstein, P.²

8
- 0032636332
- Segmental modeling using a continuous mixture of nonparametric models
- May
- J. Goldberger, D. Burshtein, and H. Franco, "Segmental modeling using a continuous mixture of nonparametric models," IEEE Trans. Speech Audio Processing, vol. 7, pp. 262-271, May 1999.
- (1999) IEEE Trans. Speech Audio Processing , vol.7 , pp. 262-271
- Goldberger, J.¹ Burshtein, D.² Franco, H.³

9
- 0038087678
- Locally constrained dynamic programming in automatic speech recognition
- R. K. Moore, M. J. Russell, and M. J. Tomlinson, "Locally constrained dynamic programming in automatic speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, vol. 2, 1982, pp. 1270-1273.
- (1982) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , vol.2 , pp. 1270-1273
- Moore, R.K.¹ Russell, M.J.² Tomlinson, M.J.³

10
- 0027579316
- Discriminative training of dynamic programming based speech recognizers
- Apr.
- P.-C. Chang and B.-H. Juang, "Discriminative training of dynamic programming based speech recognizers," IEEE Trans. Speech Audio Processing, vol. 1, pp. 135-143, Apr. 1993.
- (1993) IEEE Trans. Speech Audio Processing , vol.1 , pp. 135-143
- Chang, P.-C.¹ Juang, B.-H.²

11
- 0021494282
- On the hidden Markov model and dynamic time warping for speech recognition - A unified view
- Sept.
- B.-H. Juang, "On the hidden Markov model and dynamic time warping for speech recognition - A unified view," AT&T Bell Labs. Tech. J., vol. 63, no. 7, pp. 1213-1243, Sept. 1984.
- (1984) AT&T Bell Labs. Tech. J. , vol.63 , Issue.7 , pp. 1213-1243
- Juang, B.-H.¹

12
- 0018637484
- Considerations in applying clustering techniques to speaker independent word recognition
- Sept.
- L. R. Rabiner and J. G. Wilpon, "Considerations in applying clustering techniques to speaker independent word recognition," J. Acoust. Soc. Amer., vol. 66, no. 3, pp. 663-673, Sept. 1979.
- (1979) J. Acoust. Soc. Amer. , vol.66 , Issue.3 , pp. 663-673
- Rabiner, L.R.¹ Wilpon, J.G.²

13
- 0022082035
- A modified K-means clustering algorithm for use in isolated word recognition
- June
- J. G. Wilpon and L. R. Rabiner, "A modified K-means clustering algorithm for use in isolated word recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-33, pp. 587-594, June 1985.
- (1985) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-33 , pp. 587-594
- Wilpon, J.G.¹ Rabiner, L.R.²

14
- 0014704814
- A statistical method for estimation of speech spectral density and formant frequencies
- F. Itakura and S. Saito, "A statistical method for estimation of speech spectral density and formant frequencies," Electron. Commun. Jpn., vol. 53A, pp. 36-43, 1970.
- (1970) Electron. Commun. Jpn. , vol.53 A , pp. 36-43
- Itakura, F.¹ Saito, S.²

15
- 0016467604
- Minimum prediction residual principle applied to speech recognition
- Feb.
- F. Itakura, "Minimum prediction residual principle applied to speech recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-23, pp. 57-72, Feb. 1975.
- (1975) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-23 , pp. 57-72
- Itakura, F.¹

16
- 0000090514
- A weighted cepstral distance measure for speech recognition
- Oct.
- Y. Tohkura, "A weighted cepstral distance measure for speech recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-35, pp. 1414-1422, Oct. 1987.
- (1987) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-35 , pp. 1414-1422
- Tohkura, Y.¹

17
- 0017930815
- Dynamic programming optimization for spoken word recognition
- Feb.
- H. Sakoe and S. Chiba, "Dynamic programming optimization for spoken word recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-26, pp. 43-49, Feb. 1978.
- (1978) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-26 , pp. 43-49
- Sakoe, H.¹ Chiba, S.²

18
- 0004244302
- Englewood Cliffs, NJ: Prentice-Hall
- L. Rabiner and B.-H. Juang, Fundamentals of Speech Recognition. Englewood Cliffs, NJ: Prentice-Hall, 1993.
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.¹ Juang, B.-H.²

19
- 0019280090
- Performance tradeoffs in dynamic time warping algorithms for isolated word recognition
- Dec.
- C. Myers, L. R. Rabiner, and A. E. Rosenberg, "Performance tradeoffs in dynamic time warping algorithms for isolated word recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-28, pp. 623-635, Dec. 1980.
- (1980) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-28 , pp. 623-635
- Myers, C.¹ Rabiner, L.R.² Rosenberg, A.E.³

20
- 0001862769
- An inequality and associated maximization technique in statistical estimation for probabilistic functions of Markov processes
- L. E. Baum, "An inequality and associated maximization technique in statistical estimation for probabilistic functions of Markov processes," Inequalities, vol. 3, pp. 1-8, 1972.
- (1972) Inequalities , vol.3 , pp. 1-8
- Baum, L.E.¹

21
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc., vol. 39, no. 1, pp. 1-38, 1977.
- (1977) J. R. Statist. Soc. , vol.39 , Issue.1 , pp. 1-38
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

22
- 0025493667
- The segmental K-means algorithm for estimating parameters of hidden Markov models
- Sept.
- B.-H. Juang and L. R. Rabiner, "The segmental K-means algorithm for estimating parameters of hidden Markov models," IEEE Trans. Acoust., Speech, Signal Processing, vol. 38, pp. 1639-1641, Sept. 1990.
- (1990) IEEE Trans. Acoust., Speech, Signal Processing , vol.38 , pp. 1639-1641
- Juang, B.-H.¹ Rabiner, L.R.²

23
- 0038087682
- [Online]
- HTK Version 3.0 (2000). [Online]. Available: http://htk.eng.cam.ac.uk
- (2000) HTK Version 3.0

24
- 0038764328
- TI 46-word speaker-dependent isolated word corpus
- NIST, NIST Speech Disc 7-1.1
- "TI 46-Word Speaker-Dependent Isolated Word Corpus," NIST, NIST Speech Disc 7-1.1, 1991.
- (1991)

25
- 0003640523
- The ISOLET spoken letter database
- Dept. of Computer Science and Engineering, Oregon Graduate Inst. Sci. Technol., CS/E 90-004
- R. Cole, Y. Muthusamy, and M. Fanty, "The ISOLET Spoken Letter Database," Dept. of Computer Science and Engineering, Oregon Graduate Inst. Sci. Technol., CS/E 90-004, 1990.
- (1990)
- Cole, R.¹ Muthusamy, Y.² Fanty, M.³

26
- 0036299139
- Using SVMS and discriminative models for speech recognition
- N. D. Smith and M. J. F. Gales, "Using SVMS and discriminative models for speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, 2002.
- Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, 2002
- Smith, N.D.¹ Gales, M.J.F.²

27
- 84892187452
- Maximum likelihood modeling with Gaussian distributions for classification
- R. A. Gopinath, "Maximum likelihood modeling with Gaussian distributions for classification," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, 1998.
- Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, 1998
- Gopinath, R.A.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.