Volume 11, Issue 3, 2003, Pages 216-228

An enhanced dynamic time warping model for improved estimation of DTW parameters

Author keywords

Baum inequality; Dynamic time warping (DTW); Hidden Markov model (HMM); Speech recognition

Indexed keywords

ALGORITHMS; CONSTRAINT THEORY; ITERATIVE METHODS; MARKOV PROCESSES; MATHEMATICAL MODELS; MAXIMUM LIKELIHOOD ESTIMATION; PARAMETER ESTIMATION;

EID: 0038782382     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2003.811540     Document Type: Article
Times cited: 31

References (27)
  • 2
    • EID: 0017244215
    • M. R. Sambur and L. R. Rabiner, "A statistical decision approach to the recognition of connected digits," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-24, pp. 550-558, Dec. 1976.
  • 3
    • EID: 56749117310
    • J. S. Bridle, "Stochastic models and template matching: Some important relationships between two apparently different techniques for automatic speech recognition," in Proc. Inst. Acoust., vol. 6, 1984, pp. 452a-452h.
  • 4
    • EID: 0038426396
    • R. Stapert, "A Segmental Mixture Model," Ph.D., Univ. Wales, Swansea, U.K., 2000.
  • 5
    • EID: 0017931304
    • L. R. Rabiner, "On creating reference templates for speaker independent recognition of isolated words," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-26, pp. 34-42, Feb. 1978.
  • 7
    • EID: 0020141497
    • V. Gupta and P. Mermelstein, "Effects of speaker accent on the performance of a speaker-independent, isolated-word recognizer," J. Acoust. Soc. Amer., vol. 71, no. 6, pp. 1581-1587, June 1982.
  • 8
    • EID: 0032636332
    • J. Goldberger, D. Burshtein, and H. Franco, "Segmental modeling using a continuous mixture of nonparametric models," IEEE Trans. Speech Audio Processing, vol. 7, pp. 262-271, May 1999.
  • 10
    • EID: 0027579316
    • P.-C. Chang and B.-H. Juang, "Discriminative training of dynamic programming based speech recognizers," IEEE Trans. Speech Audio Processing, vol. 1, pp. 135-143, Apr. 1993.
  • 11
    • EID: 0021494282
    • B.-H. Juang, "On the hidden Markov model and dynamic time warping for speech recognition - A unified view," AT&T Bell Labs. Tech. J., vol. 63, no. 7, pp. 1213-1243, Sept. 1984.
  • 12
    • EID: 0018637484
    • L. R. Rabiner and J. G. Wilpon, "Considerations in applying clustering techniques to speaker independent word recognition," J. Acoust. Soc. Amer., vol. 66, no. 3, pp. 663-673, Sept. 1979.
  • 13
    • EID: 0022082035
    • J. G. Wilpon and L. R. Rabiner, "A modified K-means clustering algorithm for use in isolated word recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-33, pp. 587-594, June 1985.
  • 14
    • EID: 0014704814
    • F. Itakura and S. Saito, "A statistical method for estimation of speech spectral density and formant frequencies," Electron. Commun. Jpn., vol. 53A, pp. 36-43, 1970.
  • 15
    • EID: 0016467604
    • F. Itakura, "Minimum prediction residual principle applied to speech recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-23, pp. 57-72, Feb. 1975.
  • 16
    • EID: 0000090514
    • Y. Tohkura, "A weighted cepstral distance measure for speech recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-35, pp. 1414-1422, Oct. 1987.
  • 17
    • EID: 0017930815
    • H. Sakoe and S. Chiba, "Dynamic programming optimization for spoken word recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-26, pp. 43-49, Feb. 1978.
  • 19
    • EID: 0019280090
    • C. Myers, L. R. Rabiner, and A. E. Rosenberg, "Performance tradeoffs in dynamic time warping algorithms for isolated word recognition," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-28, pp. 623-635, Dec. 1980.
  • 20
    • EID: 0001862769
    • L. E. Baum, "An inequality and associated maximization technique in statistical estimation for probabilistic functions of Markov processes," Inequalities, vol. 3, pp. 1-8, 1972.
  • 21
    • EID: 0002629270
    • A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc., vol. 39, no. 1, pp. 1-38, 1977.
  • 22
    • EID: 0025493667
    • B.-H. Juang and L. R. Rabiner, "The segmental K-means algorithm for estimating parameters of hidden Markov models," IEEE Trans. Acoust., Speech, Signal Processing, vol. 38, pp. 1639-1641, Sept. 1990.
  • 23
    • EID: 0038087682
    • HTK Version 3.0 (2000). [Online]. Available: http://htk.eng.cam.ac.uk
  • 24
    • EID: 0038764328
    • "TI 46-Word Speaker-Dependent Isolated Word Corpus," NIST, NIST Speech Disc 7-1.1, 1991.
  • 25
    • EID: 0003640523
    • R. Cole, Y. Muthusamy, and M. Fanty, "The ISOLET Spoken Letter Database," Dept. of Computer Science and Engineering, Oregon Graduate Inst. Sci. Technol., CS/E 90-004, 1990.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.