Volume 20, Issue 1, 2006, Pages 107-123

Improved automatic speech recognition through speaker normalization

Author keywords

[No Author keywords available]

Indexed keywords

ERROR ANALYSIS; ESTIMATION; MARKOV PROCESSES; MATHEMATICAL MODELS; MATHEMATICAL TRANSFORMATIONS; PERSONNEL TRAINING;

EID: 27744595137     PISSN: 08852308     EISSN: 10958363     Source Type: Journal    
DOI: 10.1016/j.csl.2005.05.002     Document Type: Article
Times cited: 52

References (27)
  • 1
    • 0030362995
    • A compact model for speaker-adaptive training
    • Philadelphia, PA
    • Anastasakos, T., McDonough, J., Schwartz, R., Makhoul, J., 1996. A compact model for speaker-adaptive training. In: Proc. of ICSLP, Philadelphia, PA, pp. 1137-1140.
    • (1996) Proc. of ICSLP , pp. 1137-1140
    • Anastasakos, T.1    McDonough, J.2    Schwartz, R.3    Makhoul, J.4
  • 2
    • 0000353178
    • A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains
    • Baum, L.E., Petrie, T., Soules, G., Weiss, N., 1970. A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains. Ann. Math. Stat. 41, 167-171.
    • (1970) Ann. Math. Stat. , vol.41 , pp. 167-171
    • Baum, L.E.1    Petrie, T.2    Soules, G.3    Weiss, N.4
  • 4
    • 0029725604
    • A parametric approach to vocal tract length normalization
    • Atlanta, GA
    • Eide, E., Gish, H., 1996. A parametric approach to vocal tract length normalization. In: Proc. of ICASSP, Atlanta, GA, pp. 346-349.
    • (1996) Proc. of ICASSP , pp. 346-349
    • Eide, E.1    Gish, H.2
  • 5
    • 0032050110
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • Gales, M.J.F., 1998. Maximum likelihood linear transformations for HMM-based speech recognition. Computer Speech and Language 12, 75-98.
    • (1998) Computer Speech and Language , vol.12 , pp. 75-98
    • Gales, M.J.F.1
  • 6
    • 0034227757
    • Cluster adaptive training of hidden Markov models
    • Gales, M.J.F., 2000. Cluster adaptive training of hidden Markov models. IEEE Trans. Speech and Audio Process. 8 (4), 417-428.
    • (2000) IEEE Trans. Speech and Audio Process. , vol.8 , Issue.4 , pp. 417-428
    • Gales, M.J.F.1
  • 7
    • 0028419019
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • Gauvain, J.-L., Lee, C.-H., 1994. Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains. IEEE Trans. Speech and Audio Process. 2 (2), 291-298.
    • (1994) IEEE Trans. Speech and Audio Process. , vol.2 , Issue.2 , pp. 291-298
    • Gauvain, J.-L.1    Lee, C.-H.2
  • 8
    • 0024909979
    • Some statistical issues in the comparison of speech recognition algorithms
    • Glasgow, Scotland
    • Gillick, L., Cox, S., 1989. Some statistical issues in the comparison of speech recognition algorithms. In: Proc. of ICASSP, Glasgow, Scotland, pp. I-532-535.
    • (1989) Proc. of ICASSP
    • Gillick, L.1    Cox, S.2
  • 9
    • 0141702066
    • Investigating recognition of children's speech
    • Hong Kong, China
    • Giuliani, D., Gerosa, M., 2003. Investigating recognition of children's speech. In: Proc. of ICASSP, Hong Kong, China, pp. II-137-140.
    • (2003) Proc. of ICASSP
    • Giuliani, D.1    Gerosa, M.2
  • 10
    • 85009110337
    • Speaker normalization through constrained MLLR based transforms
    • Jeju Island, Korea
    • Giuliani, D., Gerosa, M., Brugnara, F., 2004. Speaker normalization through constrained MLLR based transforms. In: Proc. of INTERSPEECH/ICSLP, Jeju Island, Korea, pp. 2893-2897.
    • (2004) Proc. of INTERSPEECH/ICSLP , pp. 2893-2897
    • Giuliani, D.1    Gerosa, M.2    Brugnara, F.3
  • 11
    • 0027578837
    • On speaker-independent, speaker-dependent, and speaker-adaptive speech recognition
    • Huang, X., Lee, K.-F., 1993. On speaker-independent, speaker-dependent, and speaker-adaptive speech recognition. IEEE Trans. Speech and Audio Process. 1 (2), 150-157.
    • (1993) IEEE Trans. Speech and Audio Process. , vol.1 , Issue.2 , pp. 150-157
    • Huang, X.1    Lee, K.-F.2
  • 12
    • 0036124301
    • A robust compensation strategy against extraneous acoustic variations for spontaneous speech recognition
    • Jiang, H., Deng, L., 2002. A robust compensation strategy against extraneous acoustic variations for spontaneous speech recognition. IEEE Trans. Speech and Audio Process. 10 (1), 9-17.
    • (2002) IEEE Trans. Speech and Audio Process. , vol.10 , Issue.1 , pp. 9-17
    • Jiang, H.1    Deng, L.2
  • 13
    • 0025681028
    • Speaker adaptation from speaker-independent training corpus
    • Albuquerque, NM
    • Kubala, F., Schwartz, R., Barry, C., 1990. Speaker adaptation from speaker-independent training corpus. In: Proc. of ICASSP, Albuquerque, NM, pp. I-137-140.
    • (1990) Proc. of ICASSP
    • Kubala, F.1    Schwartz, R.2    Barry, C.3
  • 15
    • 0029747183
    • Speaker normalization using efficient frequency warping procedures
    • Atlanta, GA
    • Lee, L., Rose, R., 1996. Speaker normalization using efficient frequency warping procedures. In: Proc. of ICASSP, Atlanta, GA, pp. I-353-356.
    • (1996) Proc. of ICASSP
    • Lee, L.1    Rose, R.2
  • 16
    • 0032969462
    • Acoustics of children's speech: Developmental changes of temporal and spectral parameters
    • Lee, S., Potamianos, A., Narayanan, S., 1999. Acoustics of children's speech: Developmental changes of temporal and spectral parameters. J. Acoust. Soc. Am. 105 (3), 1455-1468.
    • (1999) J. Acoust. Soc. Am. , vol.105 , Issue.3 , pp. 1455-1468
    • Lee, S.1    Potamianos, A.2    Narayanan, S.3
  • 17
    • 0029288633
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • Leggetter, C.J., Woodland, P.C., 1995. Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models. Computer Speech and Language 9, 171-185.
    • (1995) Computer Speech and Language , vol.9 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 18
    • 85128432219
    • Speaker normalization with all-pass transforms
    • Sydney, Australia
    • McDonough, J., Byrne, W., Luo, X., 1998. Speaker normalization with all-pass transforms. In: Proc. of ICSLP, vol. VI, Sydney, Australia, pp. 2307-2310.
    • (1998) Proc. of ICSLP , vol.6 , pp. 2307-2310
    • McDonough, J.1    Byrne, W.2    Luo, X.3
  • 21
    • 85009174854
    • Vocal tract normalization as linear transformation of MFCC
    • Geneva, Switzerland
    • Pitz, M., Ney, H., 2003. Vocal tract normalization as linear transformation of MFCC. In: Proc. of EUROSPEECH, Geneva, Switzerland, pp. 1445-1448.
    • (2003) Proc. of EUROSPEECH , pp. 1445-1448
    • Pitz, M.1    Ney, H.2
  • 22
    • 0024610919
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Rabiner, L., 1989. A tutorial on hidden Markov models and selected applications in speech recognition. Proc. IEEE 77 (2), 257-285.
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-285
    • Rabiner, L.1
  • 23
    • 33646759965
    • Adaptive training using simple target models
    • Philadelphia, PA
    • Stemmer, G., Brugnara, F., Giuliani, D., 2005. Adaptive training using simple target models. In: Proc. of ICASSP, Philadelphia, PA.
    • (2005) Proc. of ICASSP
    • Stemmer, G.1    Brugnara, F.2    Giuliani, D.3
  • 24
    • 85135261079
    • An investigation into vocal tract length normalisation
    • Budapest, Hungary
    • Uebel, L., Woodland, P., 1999. An investigation into vocal tract length normalisation. In: Proc. of EUROSPEECH, Budapest, Hungary, pp. 2527-2530.
    • (1999) Proc. of EUROSPEECH , pp. 2527-2530
    • Uebel, L.1    Woodland, P.2
  • 25
    • 0029764708
    • Speaker normalisation on conversational telephone speech
    • Atlanta, GA
    • Wegmann, S., McAllaster, D., Orloff, J., Peskin, B., 1996. Speaker normalisation on conversational telephone speech. In: Proc. of ICASSP, Atlanta, GA, pp. I-339-341.
    • (1996) Proc. of ICASSP
    • Wegmann, S.1    McAllaster, D.2    Orloff, J.3    Peskin, B.4
  • 26
    • 0032629626
    • Improved methods for vocal tract normalization
    • Phoenix, AZ
    • Welling, L., Kanthak, S., Ney, H., 1999. Improved methods for vocal tract normalization. In: Proc. of ICASSP, vol. 2, Phoenix, AZ, pp. 761-764.
    • (1999) Proc. of ICASSP , vol.2 , pp. 761-764
    • Welling, L.1    Kanthak, S.2    Ney, H.3
  • 27
    • 0029747582
    • A study of speech recognition for children and the elderly
    • Atlanta, GA
    • Wilpon, J., Jacobsen, C., 1996. A study of speech recognition for children and the elderly. In: Proc. of ICASSP, Atlanta, GA, pp. I-349-352.
    • (1996) Proc. of ICASSP
    • Wilpon, J.1    Jacobsen, C.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.