메뉴 건너뛰기




Volumn 13, Issue 5, 2005, Pages 930-944

Vocal tract normalization equals linear transformation in cepstral space

Author keywords

Linear transformation; Speaker adaptive modeling and training; Speaker adaptive recognition; Speech recognition; Vocal tract (length) normalization

Indexed keywords

LINEAR TRANSFORMATION; SPEAKER ADAPTIVE MODELING AND TRAINING; SPEAKER ADAPTIVE RECOGNITION; VOCAL TRACT (LENGTH) NORMALIZATION;

EID: 27644522706     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2005.848881     Document Type: Article
Times cited : (135)

References (24)
  • 2
    • 0029725604 scopus 로고    scopus 로고
    • A parametric approach to vocal tract length normalization
    • Atlanta, GA, May
    • E. Eide and H. Gish, "A parametric approach to vocal tract length normalization," in IEEE Int. Conf. Acoustics, Speech, Signal Processing, vol. 1, Atlanta, GA, May 1996, pp. 346-349.
    • (1996) IEEE Int. Conf. Acoustics, Speech, Signal Processing , vol.1 , pp. 346-349
    • Eide, E.1    Gish, H.2
  • 3
    • 0029747183 scopus 로고    scopus 로고
    • Speaker normalization using efficient frequency warping procedures
    • Atlanta, GA, May
    • L. Lee and R. Rose, "Speaker normalization using efficient frequency warping procedures," in IEEE Int. Conf. Acoustics, Speech, Signal Processing, vol. 1, Atlanta, GA, May 1996, pp. 353-356.
    • (1996) IEEE Int. Conf. Acoustics, Speech, Signal Processing , vol.1 , pp. 353-356
    • Lee, L.1    Rose, R.2
  • 4
    • 0017482612 scopus 로고
    • Normalization of vowels by vocal tract length and its application to vowel identification
    • Apr.
    • H. Wakita, "Normalization of vowels by vocal tract length and its application to vowel identification," in IEEE Int. Conf. Acoustics, Speech, Signal Processing, vol. ASSP-25, Apr. 1977, pp. 183-192.
    • (1977) IEEE Int. Conf. Acoustics, Speech, Signal Processing , vol.ASSP-25 , pp. 183-192
    • Wakita, H.1
  • 9
    • 85009075256 scopus 로고    scopus 로고
    • Speaker normalization in the MFCC domain
    • Bejing, China, Oct.
    • S. Cox, "Speaker normalization in the MFCC domain," in Int. Conf. on Spoken Language Processing, vol. 2, Bejing, China, Oct. 2000, pp. 853-856.
    • (2000) Int. Conf. on Spoken Language Processing , vol.2 , pp. 853-856
    • Cox, S.1
  • 12
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Aug.
    • S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Speech Audio Process., vol. 28, pp. 357-366, Aug. 1980.
    • (1980) IEEE Trans. Speech Audio Process. , vol.28 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 13
    • 0025041264 scopus 로고
    • Perceptual linear predictive (PLP) analysis of speech
    • Jun.
    • H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech," J. Acoust. Soc. Amer., vol. 87, no. 4, pp. 1738-1752, Jun. 1990.
    • (1990) J. Acoust. Soc. Amer. , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 17
    • 0036753897 scopus 로고    scopus 로고
    • Speaker adaptive modeling by vocal tract normalization
    • Sep.
    • L. Welling, H. Ney, and S. Kanthak, "Speaker adaptive modeling by vocal tract normalization," IEEE Trans. Speech Audio Process., vol. 10, no. 6, pp. 415-426, Sep. 2002.
    • (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.6 , pp. 415-426
    • Welling, L.1    Ney, H.2    Kanthak, S.3
  • 18
    • 0029375590 scopus 로고
    • Speaker adaptation using constrained estimation of Gaussian mixtures
    • Sep.
    • V. Digalakis, D. Rtischev, and L. Neumeyer, "Speaker adaptation using constrained estimation of Gaussian mixtures," IEEE Trans. Speech Audio Process., vol. 3, no. 5, pp. 357-366, Sep. 1995.
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.5 , pp. 357-366
    • Digalakis, V.1    Rtischev, D.2    Neumeyer, L.3
  • 19
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • Apr.
    • M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Comput. Speech Lang., vol. 12, no. 2, pp. 75-98, Apr. 1998.
    • (1998) Comput. Speech Lang. , vol.12 , Issue.2 , pp. 75-98
    • Gales, M.J.F.1
  • 20
    • 0030149866 scopus 로고    scopus 로고
    • A maximum-likelihood approach to stochastic matching for robust speech recognition
    • May
    • A. Sankar and C.-H. Lee, "A maximum-likelihood approach to stochastic matching for robust speech recognition," IEEE Trans. Speech Audio Process., vol. 4, no. 3, pp. 190-202, May 1996.
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.3 , pp. 190-202
    • Sankar, A.1    Lee, C.-H.2
  • 21
    • 85009064348 scopus 로고    scopus 로고
    • Constrained maximum likelihood linear regression for speaker adaptation
    • Beijing, China, Oct.
    • M. Afify and O. Siohan, "Constrained maximum likelihood linear regression for speaker adaptation," in Proc. Int. Conf. Spoken Language Processing, vol. 3, Beijing, China, Oct. 2000, pp. 861-864.
    • (2000) Proc. Int. Conf. Spoken Language Processing , vol.3 , pp. 861-864
    • Afify, M.1    Siohan, O.2
  • 22
    • 84966252352 scopus 로고
    • Decay rates for inverse band matrices
    • Oct.
    • S. Demko, W. F. Moss, and P. W. Smith, "Decay rates for inverse band matrices," Math. Comput., vol. 43, no. 168, pp. 491-499, Oct. 1984.
    • (1984) Math. Comput. , vol.43 , Issue.168 , pp. 491-499
    • Demko, S.1    Moss, W.F.2    Smith, P.W.3
  • 23
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • Apr.
    • C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 9, no. 2, pp. 171-185, Apr. 1995.
    • (1995) Comput. Speech Lang. , vol.9 , Issue.2 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 24
    • 0036556171 scopus 로고    scopus 로고
    • From within-word model search to across-word model search in large vocabulary continuous speech recognition
    • May
    • A. Sixtus and H. Key, "From within-word model search to across-word model search in large vocabulary continuous speech recognition," Comput. Speech Lang., vol. 16, no. 2, pp. 245-271, May 2002.
    • (2002) Comput. Speech Lang. , vol.16 , Issue.2 , pp. 245-271
    • Sixtus, A.1    Key, H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.