Volume 19, Issue 3, 2011, Pages 505-515

Speaker verification with feature-space MAPLR parameters

Author keywords

Feature transform; maximum a posteriori; speaker recognition; support vector machine (SVM)

Indexed keywords

AFFINE TRANSFORM; BIAS VECTORS; FEATURE TRANSFORM; FIRST ORDER; GAUSSIAN MIXTURE MODEL; MAXIMUM A POSTERIORI; MAXIMUM A POSTERIORI ALGORITHM; NATIONAL INSTITUTE OF STANDARDS AND TECHNOLOGY; NUMERICAL PROBLEMS; REGRESSION FUNCTION; SECOND ORDERS; SPEAKER DEPENDENTS; SPEAKER RECOGNITION; SPEAKER VERIFICATION; SPEAKER VERIFICATION SYSTEM; SUFFICIENT STATISTICS; SUPPORT VECTOR MACHINE (SVM); TRANSFORM MATRICES;

EID: 78649983534     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2010.2051269     Document Type: Article
Times cited : (10)

References (34)
  • 1
    • 0033884858
    • Speaker verification using adapted Gaussian mixture models
    • D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, "Speaker verification using adapted Gaussian mixture models," Digital Signal Process., vol. 10, pp. 19-41, 2000.
    • (2000) Digital Signal Process. , vol.10 , pp. 19-41
    • Reynolds, D.A.1    Quatieri, T.F.2    Dunn, R.B.3
  • 2
    • 0036289656
    • Generalized linear discriminant sequence kernels for speaker recognition
    • W. M. Campbell, "Generalized linear discriminant sequence kernels for speaker recognition," in Proc. ICASSP, 2002, pp. 161-164.
    • (2002) Proc. ICASSP , pp. 161-164
    • Campbell, W.M.1
  • 4
    • 33947696754
    • SVM based speaker verification using a GMM supervector kernel and NAP variability compensation
    • W. M. Campbell, D. E. Sturim, D. A. Reynolds, and A. Solomonoff, "SVM based speaker verification using a GMM supervector kernel and NAP variability compensation," in Proc. ICASSP, 2006, pp. 97-100.
    • (2006) Proc. ICASSP , pp. 97-100
    • Campbell, W.M.1    Sturim, D.E.2    Reynolds, D.A.3    Solomonoff, A.4
  • 5
    • 51449111842
    • Speaker recognition with session variability normalization based on MLLR adaptation transforms
    • A. Stolcke, S. S. Kajarekar, L. Ferrer, and E. Shriberg, "Speaker recognition with session variability normalization based on MLLR adaptation transforms," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 7, pp. 1987-1998, Sep. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.7 , pp. 1987-1998
    • Stolcke, A.1    Kajarekar, S.S.2    Ferrer, L.3    Shriberg, E.4
  • 6
    • 0028419019
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291-298, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 291-298
    • Gauvain, J.-L.1    Lee, C.-H.2
  • 7
    • 0029288633
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 9, pp. 171-185, 1995.
    • (1995) Comput. Speech Lang. , vol.9 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 8
    • 0030263447
    • Mean and variance adaptation within the MLLR framework
    • M. J. F. Gales and P. C. Woodland, "Mean and variance adaptation within the MLLR framework," Comput. Speech Lang., vol. 10, pp. 249-264, 1996.
    • (1996) Comput. Speech Lang. , vol.10 , pp. 249-264
    • Gales, M.J.F.1    Woodland, P.C.2
  • 9
    • 51449096374
    • A new kernel for SVM MLLR based speaker recognition
    • Z. N. Karam and W. M. Campbell, "A new kernel for SVM MLLR based speaker recognition," in Proc. Interspeech, 2007, pp. 290-293.
    • (2007) Proc. Interspeech , pp. 290-293
    • Karam, Z.N.1    Campbell, W.M.2
  • 11
    • 0009623939
    • Flexible speaker adaptation using maximum likelihood linear regression
    • C. J. Leggetter and P. C. Woodland, "Flexible speaker adaptation using maximum likelihood linear regression," in Proc. ARPA SLS Technol. Workshop, 1995, pp. 110-115.
    • (1995) Proc. ARPA SLS Technol. Workshop , pp. 110-115
    • Leggetter, C.J.1    Woodland, P.C.2
  • 12
    • 84874875877
    • Maximum posterior linear regression with elliptically symmetric matrix variate priors
    • W. Chou, "Maximum posterior linear regression with elliptically symmetric matrix variate priors," in Proc. Eurospeech, 1999, pp. 1-4.
    • (1999) Proc. Eurospeech , pp. 1-4
    • Chou, W.1
  • 14
    • 44949145553
    • Robust feature space adaptation for telephony speech recognition
    • X. Lei, J. Hamaker, and X. He, "Robust feature space adaptation for telephony speech recognition," in Proc. ICSLP, 2006, pp. 773-776.
    • (2006) Proc. ICSLP , pp. 773-776
    • Lei, X.1    Hamaker, J.2    He, X.3
  • 15
    • 0032050110
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Comput. Speech Lang., vol. 12, pp. 75-98, 1998.
    • (1998) Comput. Speech Lang. , vol.12 , pp. 75-98
    • Gales, M.J.F.1
  • 16
    • 0029769867
    • Signal bias removal by maximum likelihood estimation for robust telephone speech recognition
    • M. Rahim and B.-H. Juang, "Signal bias removal by maximum likelihood estimation for robust telephone speech recognition," IEEE Trans. Speech Audio Process., vol. 4, no. 1, pp. 19-30, Jan. 1996.
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.1 , pp. 19-30
    • Rahim, M.1    Juang, B.-H.2
  • 17
    • 0030149866
    • A maximum likelihood approach to stochastic matching for robust speech recognition
    • A. Sankar and C.-H. Lee, "A maximum likelihood approach to stochastic matching for robust speech recognition," IEEE Trans. Speech Audio Process., vol. 4, no. 3, pp. 190-202, May 1996.
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.3 , pp. 190-202
    • Sankar, A.1    Lee, C.-H.2
  • 18
    • 70349488711
    • Robust speech recognition based on structured modeling, irrelevant variability normalization and unsupervised online adaptation
    • Q. Huo and D. Zhu, "Robust speech recognition based on structured modeling, irrelevant variability normalization and unsupervised online adaptation," in Proc. ICASSP, 2009, pp. 4637-4640.
    • (2009) Proc. ICASSP , pp. 4637-4640
    • Huo, Q.1    Zhu, D.2
  • 19
    • 84867218530
    • Using MAP estimation of feature transformation for speaker recognition
    • D. Zhu, B. Ma, and H. Li, "Using MAP estimation of feature transformation for speaker recognition," in Proc. Interspeech, 2008, pp. 849-852.
    • (2008) Proc. Interspeech , pp. 849-852
    • Zhu, D.1    Ma, B.2    Li, H.3
  • 20
    • 70349200796
    • Joint MAP adaptation of feature transformation and Gaussian mixture model for speaker recognition
    • D. Zhu, B. Ma, and H. Li, "Joint MAP adaptation of feature transformation and Gaussian mixture model for speaker recognition," in Proc. ICASSP, 2009, pp. 4045-4048.
    • (2009) Proc. ICASSP , pp. 4045-4048
    • Zhu, D.1    Ma, B.2    Li, H.3
  • 22
    • 0029375590
    • Speaker adaptation using constrained estimation of Gaussian mixtures
    • V. V. Digalakis, D. Rtischev, and L. G. Neumeyer, "Speaker adaptation using constrained estimation of Gaussian mixtures," IEEE Trans. Speech Audio Process., vol. 3, no. 5, pp. 357-366, Sep. 1995.
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.5 , pp. 357-366
    • Digalakis, V.V.1    Rtischev, D.2    Neumeyer, L.G.3
  • 23
    • 0033884858
    • Speaker verification using adapted Gaussian mixture models
    • D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, "Speaker verification using adapted Gaussian mixture models," Digital Signal Process., vol. 10, pp. 19-41, 2000.
    • (2000) Digital Signal Process. , vol.10 , pp. 19-41
    • Reynolds, D.A.1    Quatieri, T.F.2    Dunn, R.B.3
  • 25
    • 33645895387
    • Advances in channel compensation for SVM speaker recognition
    • A. Solomonoff, W. M. Campbell, and I. Boardman, "Advances in channel compensation for SVM speaker recognition," in Proc. ICASSP, 2005, pp. 629-632.
    • (2005) Proc. ICASSP , pp. 629-632
    • Solomonoff, A.1    Campbell, W.M.2    Boardman, I.3
  • 26
    • 44949114401
    • Within-class covariance normalization for SVM-based speaker recognition
    • A. O. Hatch, S. Kajarekar, and A. Stolcke, "Within-class covariance normalization for SVM-based speaker recognition," in Proc. Interspeech, 2006, pp. 1471-1474.
    • (2006) Proc. Interspeech , pp. 1471-1474
    • Hatch, A.O.1    Kajarekar, S.2    Stolcke, A.3
  • 28
    • 0033884857
    • Score normalization for text-independent speaker verification systems
    • R. Auckenthaler, M. Carey, and H. Lloyd-Thomas, "Score normalization for text-independent speaker verification systems," Digital Signal Process., vol. 10, pp. 42-54, 2000.
    • (2000) Digital Signal Process. , vol.10 , pp. 42-54
    • Auckenthaler, R.1    Carey, M.2    Lloyd-Thomas, H.3
  • 29
    • 0028420014
    • Integrated models of signal and background with application to speaker identification in noise
    • R. C. Rose, E. M. Hofstetter, and D. A. Reynolds, "Integrated models of signal and background with application to speaker identification in noise," IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 245-257, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 245-257
    • Rose, R.C.1    Hofstetter, E.M.2    Reynolds, D.A.3
  • 30
    • 0032203405
    • A general joint additive and convolutive bias compensation approach applied to noisy Lombard speech recognition
    • M. Afify, Y. Gong, and J.-P. Haton, "A general joint additive and convolutive bias compensation approach applied to noisy Lombard speech recognition," IEEE Trans. Speech Audio Process., vol. 6, no. 6, pp. 524-538, Nov. 1998.
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.6 , pp. 524-538
    • Afify, M.1    Gong, Y.2    Haton, J.-P.3
  • 31
    • 85009212133
    • A switching linear Gaussian hidden Markov model and its application to nonstationary noise compensation for robust speech recognition
    • J. Wu and Q. Huo, "A switching linear Gaussian hidden Markov model and its application to nonstationary noise compensation for robust speech recognition," in Proc. ICASSP, 2003, pp. 977-980.
    • (2003) Proc. ICASSP , pp. 977-980
    • Wu, J.1    Huo, Q.2
  • 32
    • 0000913324
    • SVMTorch: Support vector machines for large-scale regression problems
    • R. Collobert and S. Bengio, "SVMTorch: Support vector machines for large-scale regression problems," J. Mach. Learn. Res., vol. 1, pp. 143-160, 2001.
    • (2001) J. Mach. Learn. Res. , vol.1 , pp. 143-160
    • Collobert, R.1    Bengio, S.2
  • 33
    • 78649996628
    • NIST Speaker Recognition Evaluation, [Online]. Available: http://www.itl.nist.gov/iad/mig/tests/sre
  • 34
    • 51449085646
    • A multi-class MLLR kernel for SVM speaker recognition
    • Z. N. Karam and W. M. Campbell, "A multi-class MLLR kernel for SVM speaker recognition," in Proc. ICASSP, 2008, pp. 4117-4120.
    • (2008) Proc. ICASSP , pp. 4117-4120
    • Karam, Z.N.1    Campbell, W.M.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.