메뉴 건너뛰기




Volumn 42, Issue 1, 2004, Pages 75-91

Speaker adaptation with all-pass transforms

Author keywords

Speaker adaptation; Speech recognition

Indexed keywords

ERROR COMPENSATION; MARKOV PROCESSES; MATHEMATICAL TRANSFORMATIONS; MAXIMUM LIKELIHOOD ESTIMATION; PARAMETER ESTIMATION; PROBABILITY DENSITY FUNCTION; REGRESSION ANALYSIS; SPEECH COMMUNICATION;

EID: 0347269184     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2003.09.005     Document Type: Conference Paper
Times cited : (17)

References (41)
  • 4
    • 0032657749 scopus 로고    scopus 로고
    • Correlation modeling of MLLR transform biases for rapid HMM adaptation to new speakers
    • Bocchieri, E., Digalakis, V., Corduneanu, A., Boulis, C., 1999. Correlation modeling of MLLR transform biases for rapid HMM adaptation to new speakers. In: Proc. ICASSP, Vol. II, pp. 773-776.
    • (1999) Proc. ICASSP , vol.2 , pp. 773-776
    • Bocchieri, E.1    Digalakis, V.2    Corduneanu, A.3    Boulis, C.4
  • 6
    • 0002629270 scopus 로고
    • Maximum likelihood from incomplete data via the EM algorithm
    • Dempster A.P., Laird N.M., Rubin D.B. Maximum likelihood from incomplete data via the EM algorithm. J. Roy. Statist. Soc. 39B:1977;1-38.
    • (1977) J. Roy. Statist. Soc. , vol.39 B , pp. 1-38
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 9
    • 85009236682 scopus 로고    scopus 로고
    • Implementing vocal tract length normalization in the MLLR framework
    • Ding, G.-H., Zhu, Y.-F., Li, C., Xu, B., 2002. Implementing vocal tract length normalization in the MLLR framework. In: ICSLP, pp. 1389-1392.
    • (2002) ICSLP , pp. 1389-1392
    • Ding, G.-H.1    Zhu, Y.-F.2    Li, C.3    Xu, B.4
  • 10
    • 0029725604 scopus 로고    scopus 로고
    • A parametric approach to vocal tract length normalization
    • Eide, E., Gish, H., 1996. A parametric approach to vocal tract length normalization. In: Proc. ICASSP, Vol. I, pp. 346-348.
    • (1996) Proc. ICASSP , vol.1 , pp. 346-348
    • Eide, E.1    Gish, H.2
  • 11
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • Gales M.J.F. Maximum likelihood linear transformations for HMM-based speech recognition. Computer Speech Language. 12:1998;75-98.
    • (1998) Computer Speech Language , vol.12 , pp. 75-98
    • Gales, M.J.F.1
  • 12
    • 0032638856 scopus 로고    scopus 로고
    • Semi-tied covariance matrices for hidden markov models
    • Gales M.J.F. Semi-tied covariance matrices for hidden markov models. IEEE Trans. Speech Audio Process. 7:1999;272-281.
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , pp. 272-281
    • Gales, M.J.F.1
  • 13
    • 0030263447 scopus 로고    scopus 로고
    • Mean and variance adaptation within the MLLR framework
    • Gales M.J.F., Woodland P.C. Mean and variance adaptation within the MLLR framework. Computer Speech Language. 10:1996;249-264.
    • (1996) Computer Speech Language , vol.10 , pp. 249-264
    • Gales, M.J.F.1    Woodland, P.C.2
  • 15
    • 0033708818 scopus 로고    scopus 로고
    • Robust estimation for rapid speaker adaptation using discounted likelihood techniques
    • 5-9 June 2000
    • Gunawardana, A., Byrne, W., 2000. Robust estimation for rapid speaker adaptation using discounted likelihood techniques, IEEE ICASSP, 5-9 June 2000, Vol. 2, pp. II985-II988.
    • (2000) IEEE ICASSP , vol.2
    • Gunawardana, A.1    Byrne, W.2
  • 16
    • 0025041264 scopus 로고
    • Perceptual linear predictive (PLP) analysis of speech
    • Hermansky H. Perceptual linear predictive (PLP) analysis of speech. J. Acoust. Soc. Am. 87(4):1990;1738-1752.
    • (1990) J. Acoust. Soc. Am. , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 17
    • 0032653576 scopus 로고    scopus 로고
    • Tree-structured models of parameter dependence for rapid adaptation in large vocabulary conversational speech recognition
    • Kannan, A., Khudanpur, S., 1996. Tree-structured models of parameter dependence for rapid adaptation in large vocabulary conversational speech recognition. In: Proc. ICASSP, Vol. II, pp. 769-772.
    • (1996) Proc. ICASSP , vol.2 , pp. 769-772
    • Kannan, A.1    Khudanpur, S.2
  • 19
    • 0032289099 scopus 로고    scopus 로고
    • Heteroscedastic discriminant analysis and reduced rank HMMs for improved speech recognition
    • Kumar N., Andreou A.G. Heteroscedastic discriminant analysis and reduced rank HMMs for improved speech recognition. Speech Commun. 26:1998;238-297.
    • (1998) Speech Commun. , vol.26 , pp. 238-297
    • Kumar, N.1    Andreou, A.G.2
  • 20
    • 0029747183 scopus 로고    scopus 로고
    • Speaker normalization using efficient frequency warping procedures
    • Lee, L., Rose, R.C., 1996. Speaker normalization using efficient frequency warping procedures. In: Proc. ICASSP, Vol. I, pp. 353-356.
    • (1996) Proc. ICASSP , vol.1 , pp. 353-356
    • Lee, L.1    Rose, R.C.2
  • 21
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • Leggetter C.J., Woodland P.C. Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models. Computer Speech Language. 9:1995;171-185.
    • (1995) Computer Speech Language , vol.9 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 23
    • 0347330345 scopus 로고
    • Bases in hilbert space related to the representation of stationary operators
    • Masry E., Stieglitz K., Liu B. Bases in hilbert space related to the representation of stationary operators. SIAM J. Appl. Math. 16:1968;552-562.
    • (1968) SIAM J. Appl. Math. , vol.16 , pp. 552-562
    • Masry, E.1    Stieglitz, K.2    Liu, B.3
  • 24
    • 0347960638 scopus 로고    scopus 로고
    • On the estimation of optimal regression classes for speaker adaptation
    • Center for Language and Speech Processing, The Johns Hopkins University
    • McDonough, J.W., 1998. On the estimation of optimal regression classes for speaker adaptation. Tech. Rep. 36, Center for Language and Speech Processing, The Johns Hopkins University.
    • (1998) Tech. Rep. , vol.36
    • McDonough, J.W.1
  • 25
    • 85017327824 scopus 로고    scopus 로고
    • The Homewood extensions
    • Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, MD
    • McDonough, J.W., 1999. The Homewood extensions. Tech. Rep. 39, Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, MD.
    • (1999) Tech. Rep. , vol.39
    • McDonough, J.W.1
  • 27
    • 0346699960 scopus 로고    scopus 로고
    • On maximum mutual information speaker-adapted training
    • Universität Karlsruhe
    • McDonough, J.W., 2001. On maximum mutual information speaker-adapted training. Tech. Rep. 103, Universität Karlsruhe.
    • (2001) Tech. Rep. , vol.103
    • McDonough, J.W.1
  • 28
    • 0346699959 scopus 로고    scopus 로고
    • Performance comparisons of all-pass transform adaptation with maximum likelihood linear regression
    • Universität Karlsruhe
    • McDonough, J., Waibel, A., 2003. Performance comparisons of all-pass transform adaptation with maximum likelihood linear regression. Tech. Rep. 102, Universität Karlsruhe.
    • (2003) Tech. Rep. , vol.102
    • McDonough, J.1    Waibel, A.2
  • 29
    • 84902047630 scopus 로고    scopus 로고
    • Single-pass adapted training with all-pass transforms
    • McDonough, J.W., Byrne, W., 1999. Single-pass adapted training with all-pass transforms. In: Proc. Eurospeech.
    • (1999) Proc. Eurospeech
    • McDonough, J.W.1    Byrne, W.2
  • 30
    • 85128432219 scopus 로고    scopus 로고
    • Speaker normalization with all-pass transforms
    • McDonough, J., Byrne, W., Luo, X., 1998. Speaker normalization with all-pass transforms. In: Proc. ICSLP.
    • (1998) Proc. ICSLP
    • McDonough, J.1    Byrne, W.2    Luo, X.3
  • 31
    • 0001052406 scopus 로고
    • Discrete-time representation of signals
    • Oppenheim A.V., Johnson D.H. Discrete-time representation of signals. Proc. IEEE. 60(6):1972;681-691.
    • (1972) Proc. IEEE , vol.60 , Issue.6 , pp. 681-691
    • Oppenheim, A.V.1    Johnson, D.H.2
  • 33
    • 0347960630 scopus 로고    scopus 로고
    • Vocal tract normalization equals linear transformation in cepstral space
    • Pitz, M., Molau, S., Schlüter, R., Ney, H., 2001. Vocal tract normalization equals linear transformation in cepstral space. In: Eurospeech, pp. 721-724.
    • (2001) Eurospeech , pp. 721-724
    • Pitz, M.1    Molau, S.2    Schlüter, R.3    Ney, H.4
  • 34
    • 0030672082 scopus 로고    scopus 로고
    • Experiments in speaker normalisation and adaptation for large vocabulary speech recognition
    • Pye, D., Woodland, P.C., 1997. Experiments in speaker normalisation and adaptation for large vocabulary speech recognition. In: Proc. ICASSP, Vol. II, pp. 1047-1050.
    • (1997) Proc. ICASSP , vol.2 , pp. 1047-1050
    • Pye, D.1    Woodland, P.C.2
  • 35
    • 0030149866 scopus 로고    scopus 로고
    • A maximum-likelihood approach to stochastic matching for robust speech recognition
    • Sankar A., Lee C.-H. A maximum-likelihood approach to stochastic matching for robust speech recognition. IEEE Trans. Speech Audio Process. 4(3):1996;190-201.
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.3 , pp. 190-201
    • Sankar, A.1    Lee, C.-H.2
  • 36
  • 37
    • 0003459982 scopus 로고
    • Evaluation of LPC spectral matching measures for phonetic unit recognition
    • Computer Science Department, Carnegie Mellon University, Pittsburgh, PA
    • Shikano, K., 1986. Evaluation of LPC spectral matching measures for phonetic unit recognition. Tech. Rep., Computer Science Department, Carnegie Mellon University, Pittsburgh, PA.
    • (1986) Tech. Rep.
    • Shikano, K.1
  • 38
    • 0029764708 scopus 로고    scopus 로고
    • Speaker normalization on conversational telephone speech
    • Wegmann, S., McAllaster, D., Orloff, J., Peskin, B., 1996. Speaker normalization on conversational telephone speech. In: Proc. ICASSP, Vol. I, pp. 339-341.
    • (1996) Proc. ICASSP , vol.1 , pp. 339-341
    • Wegmann, S.1    McAllaster, D.2    Orloff, J.3    Peskin, B.4
  • 41
    • 0347960637 scopus 로고
    • Translation of divers' speech using digital frequency warping
    • Res. Lab. Eltron., Massachusetts Institute of Technology, Cambridge, MA
    • Zue, V., 1971. Translation of divers' speech using digital frequency warping. Tech. Rep. 101, Res. Lab. Eltron., Massachusetts Institute of Technology, Cambridge, MA.
    • (1971) Tech. Rep. , vol.101
    • Zue, V.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.