Volume 2004, Issue 4, 2004, Pages 452-465

Stochastic feature transformation with divergence-based out-of-handset rejection for robust speaker verification

Author keywords

Divergence; EM algorithm; Feature transformation; Handset distortion; Robust speaker verification

Indexed keywords

ACOUSTIC DISTORTION; ALGORITHMS; CURVE FITTING; ERROR ANALYSIS; GAUSSIAN NOISE (ELECTRONIC); INFORMATION ANALYSIS; PARAMETER ESTIMATION; RANDOM PROCESSES; TELEPHONE SETS; TREES (MATHEMATICS); VECTORS

EID: 2942532899     PISSN: 11108657     EISSN: None     Source Type: Journal    
DOI: 10.1155/S1110865704308048     Document Type: Article
Times cited: 11

References (28)
  • 1
    • 0016067897
    • Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
    • B. S. Atal, "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," Journal of the Acoustical Society of America, vol. 55, no. 6, pp. 1304-1312, 1974.
    • (1974) Journal of the Acoustical Society of America , vol.55 , Issue.6 , pp. 1304-1312
    • Atal, B.S.1
  • 2
    • 0029769867
    • Signal bias removal by maximum likelihood estimation for robust telephone speech recognition
    • M. G. Rahim and B. H. Juang, "Signal bias removal by maximum likelihood estimation for robust telephone speech recognition," IEEE Trans. Speech and Audio Processing, vol. 4, no. 1, pp. 19-30, 1996.
    • (1996) IEEE Trans. Speech and Audio Processing , vol.4 , Issue.1 , pp. 19-30
    • Rahim, M.G.1    Juang, B.H.2
  • 4
    • 0002127129
    • Probabilistic optimal filtering for robust speech recognition
    • Adelaide, Australia, April
    • L. Neumeyer and M. Weintraub, "Probabilistic optimal filtering for robust speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, vol. 1, pp. 417-420, Adelaide, Australia, April 1994.
    • (1994) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , vol.1 , pp. 417-420
    • Neumeyer, L.1    Weintraub, M.2
  • 5
    • 0030149866
    • A maximum-likelihood approach to stochastic matching for robust speech recognition
    • A. Sankar and C. H. Lee, "A maximum-likelihood approach to stochastic matching for robust speech recognition," IEEE Trans. Speech and Audio Processing, vol. 4, no. 3, pp. 190-202, 1996.
    • (1996) IEEE Trans. Speech and Audio Processing , vol.4 , Issue.3 , pp. 190-202
    • Sankar, A.1    Lee, C.H.2
  • 6
    • 0028420014
    • Integrated models of signal and background with application to speaker identification in noise
    • R. C. Rose, E. M. Hofstetter, and D. A. Reynolds, "Integrated models of signal and background with application to speaker identification in noise," IEEE Trans. Speech and Audio Processing, vol. 2, no. 2, pp. 245-257, 1994.
    • (1994) IEEE Trans. Speech and Audio Processing , vol.2 , Issue.2 , pp. 245-257
    • Rose, R.C.1    Hofstetter, E.M.2    Reynolds, D.A.3
  • 7
    • 0029288633
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Computer Speech and Language, vol. 9, no. 2, pp. 171-185, 1995.
    • (1995) Computer Speech and Language , vol.9 , Issue.2 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 8
    • 0029375590
    • Speaker adaptation using constrained reestimation of Gaussian mixtures
    • V. Digalakis, D. Rtischev, and L. Neumeyer, "Speaker adaptation using constrained reestimation of Gaussian mixtures," IEEE Trans. Speech and Audio Processing, vol. 3, no. 5, pp. 357-366, 1995.
    • (1995) IEEE Trans. Speech and Audio Processing , vol.3 , Issue.5 , pp. 357-366
    • Digalakis, V.1    Rtischev, D.2    Neumeyer, L.3
  • 9
    • 0032050110
    • Maximum-likelihood linear transformation for HMM-based speech recognition
    • M. J. F. Gales, "Maximum-likelihood linear transformation for HMM-based speech recognition," Computer Speech and Language, vol. 12, no. 2, pp. 75-98, 1998.
    • (1998) Computer Speech and Language , vol.12 , Issue.2 , pp. 75-98
    • Gales, M.J.F.1
  • 10
    • 0033100038
    • Maximum-likelihood stochastic-transformation adaptation of hidden Markov models
    • V. D. Diakoloukas and V. Digalakis, "Maximum-likelihood stochastic-transformation adaptation of hidden Markov models," IEEE Trans. Speech and Audio Processing, vol. 7, no. 2, pp. 177-187, 1999.
    • (1999) IEEE Trans. Speech and Audio Processing , vol.7 , Issue.2 , pp. 177-187
    • Diakoloukas, V.D.1    Digalakis, V.2
  • 12
    • 0031103160
    • On-line adaptive learning of the continuous density hidden Markov model based on approximate recursive bayes estimate
    • Q. Huo, C. Chan, and C. H. Lee, "On-line adaptive learning of the continuous density hidden Markov model based on approximate recursive bayes estimate," IEEE Trans. Speech and Audio Processing, vol. 5, no. 2, pp. 161-172, 1997.
    • (1997) IEEE Trans. Speech and Audio Processing , vol.5 , Issue.2 , pp. 161-172
    • Huo, Q.1    Chan, C.2    Lee, C.H.3
  • 13
    • 0026142334
    • A study on speaker adaptation of the parameters of continuous density hidden Markov models
    • C. H. Lee, C. H. Lin, and B. H. Juang, "A study on speaker adaptation of the parameters of continuous density hidden Markov models," IEEE Trans. Acoustics, Speech, and Signal Processing, vol. 39, no. 4, pp. 806-814, 1991.
    • (1991) IEEE Trans. Acoustics, Speech, and Signal Processing , vol.39 , Issue.4 , pp. 806-814
    • Lee, C.H.1    Lin, C.H.2    Juang, B.H.3
  • 14
    • 0035340712
    • Online adaptation of HMMs to real-life conditions: A unified framework
    • C. Mokbel, "Online adaptation of HMMs to real-life conditions: A unified framework," IEEE Trans. Speech and Audio Processing, vol. 9, no. 4, pp. 342-357, 2001.
    • (2001) IEEE Trans. Speech and Audio Processing , vol.9 , Issue.4 , pp. 342-357
    • Mokbel, C.1
  • 15
    • 0035341086
    • Joint maximum a posteriori adaptation of transformation and HMM parameters
    • O. Siohan, C. Chesta, and C. H. Lee, "Joint maximum a posteriori adaptation of transformation and HMM parameters," IEEE Trans. Speech and Audio Processing, vol. 9, no. 4, pp. 417-428, 2001.
    • (2001) IEEE Trans. Speech and Audio Processing , vol.9 , Issue.4 , pp. 417-428
    • Siohan, O.1    Chesta, C.2    Lee, C.H.3
  • 18
    • 0034274733
    • Estimation of handset nonlinearity with application to speaker recognition
    • T. F. Quatieri, D. A. Reynolds, and G. C. O'Leary, "Estimation of handset nonlinearity with application to speaker recognition," IEEE Trans. Speech and Audio Processing, vol. 8, no. 5, pp. 567-584, 2000.
    • (2000) IEEE Trans. Speech and Audio Processing , vol.8 , Issue.5 , pp. 567-584
    • Quatieri, T.F.1    Reynolds, D.A.2    O'Leary, G.C.3
  • 19
    • 0036297843
    • Combining stochastic feature transformation and handset identification for telephone-based speaker verification
    • Orlando, Fla, USA, May
    • M. W. Mak and S. Y. Kung, "Combining stochastic feature transformation and handset identification for telephone-based speaker verification," in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, pp. 1701-1704, Orlando, Fla, USA, May 2002.
    • (2002) Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing , vol.1 , pp. 1701-1704
    • Mak, M.W.1    Kung, S.Y.2
  • 20
    • 0141516869
    • Divergence-based out-of-class rejection for telephone handset identification
    • Denver, Colo, USA, September
    • C. L. Tsang, M. W. Mak, and S. Y. Kung, "Divergence-based out-of-class rejection for telephone handset identification," in Proc. International Conf. on Spoken Language Processing, pp. 2329-2332, Denver, Colo, USA, September 2002.
    • (2002) Proc. International Conf. on Spoken Language Processing , pp. 2329-2332
    • Tsang, C.L.1    Mak, M.W.2    Kung, S.Y.3
  • 21
    • 84946721819
    • A GMM-based handset selector for channel mismatch compensation with applications to speaker identification
    • Beijing, China, October
    • K. K. Yiu, M. W. Mak, and S. Y. Kung, "A GMM-based handset selector for channel mismatch compensation with applications to speaker identification," in Proc. 2nd IEEE Pacific-Rim Conference on Multimedia 2001, pp. 1132-1137, Beijing, China, October 2001.
    • (2001) Proc. 2nd IEEE Pacific-rim Conference on Multimedia 2001 , pp. 1132-1137
    • Yiu, K.K.1    Mak, M.W.2    Kung, S.Y.3
  • 22
    • 0030682302
    • HTIMIT and LLHDB: Speech corpora for the study of handset transducer effects
    • Munich, Germany, April
    • D. A. Reynolds, "HTIMIT and LLHDB: speech corpora for the study of handset transducer effects," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, vol. 2, pp. 1535-1538, Munich, Germany, April 1997.
    • (1997) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , vol.2 , pp. 1535-1538
    • Reynolds, D.A.1
  • 23
    • 0020126872
    • On the convexity of some divergence measures based on entropy functions
    • J. Burbea and C. R. Rao, "On the convexity of some divergence measures based on entropy functions," IEEE Transactions on Information Theory, vol. 28, no. 3, pp. 489-495, 1982.
    • (1982) IEEE Transactions on Information Theory , vol.28 , Issue.3 , pp. 489-495
    • Burbea, J.1    Rao, C.R.2
  • 24
  • 25
    • 0029209272
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. Speech and Audio Processing, vol. 3, no. 1, pp. 72-83, 1995.
    • (1995) IEEE Trans. Speech and Audio Processing , vol.3 , Issue.1 , pp. 72-83
    • Reynolds, D.A.1    Rose, R.C.2
  • 26
    • 0034227415
    • Estimation of elliptical basis function parameters by the EM algorithms with application to speaker verification
    • M. W. Mak and S. Y. Kung, "Estimation of elliptical basis function parameters by the EM algorithms with application to speaker verification," IEEE Transactions on Neural Networks, vol. 11, no. 4, pp. 961-969, 2000.
    • (2000) IEEE Transactions on Neural Networks , vol.11 , Issue.4 , pp. 961-969
    • Mak, M.W.1    Kung, S.Y.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.