SCOPUS 정보 검색 플랫폼

Eurasip Journal on Applied Signal Processing

Volumn 2004, Issue 4, 2004, Pages 452-465

Stochastic feature transformation with divergence-based out-of-handset rejection for robust speaker verification

(3) Mak, Man Wai a Tsang, Chi Leung a Kung, Sun Yuan b

a HONG KONG POLYTECHNIC UNIVERSITY (Hong Kong)

b Princeton University (United States)

Author keywords

Divergence; EM algorithm; Feature transformation; Handset distortion; Robust speaker verification

Indexed keywords

ACOUSTIC DISTORTION; ALGORITHMS; CURVE FITTING; ERROR ANALYSIS; GAUSSIAN NOISE (ELECTRONIC); INFORMATION ANALYSIS; PARAMETER ESTIMATION; RANDOM PROCESSES; TELEPHONE SETS; TREES (MATHEMATICS); VECTORS;

DIVERGENCE; EM ALGORITHMS; FEATURE TRANSFORMATION; HANDSET DISTORTION; ROBUST SPEAKER VERIFICATION;

SPEECH RECOGNITION;

EID: 2942532899 PISSN: 11108657 EISSN: None Source Type: Journal
DOI: 10.1155/S1110865704308048 Document Type: Article

Times cited : (11)

References (28)

1
- 0016067897
- Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
- B. S. Atal, "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," Journal of the Acoustical Society of America, vol. 55, no. 6, pp. 1304-1312, 1974.
- (1974) Journal of the Acoustical Society of America , vol.55 , Issue.6 , pp. 1304-1312
- Atal, B.S.¹

2
- 0029769867
- Signal bias removal by maximum likelihood estimation for robust telephone speech recognition
- M. G. Rahim and B. H. Juang, "Signal bias removal by maximum likelihood estimation for robust telephone speech recognition," IEEE Trans. Speech and Audio Processing, vol. 4, no. 1, pp. 19-30, 1996.
- (1996) IEEE Trans. Speech and Audio Processing , vol.4 , Issue.1 , pp. 19-30
- Rahim, M.G.¹ Juang, B.H.²

3
- 0004319970
- Kluwer Academic Publishers, Dordrecht, Netherlands
- A. Acero, Acoustical and Environmental Robustness in Automatic Speech Recognition, Kluwer Academic Publishers, Dordrecht, Netherlands, 1992.
- (1992) Acoustical and Environmental Robustness in Automatic Speech Recognition
- Acero, A.¹

4
- 0002127129
- Probabilistic optimal filtering for robust speech recognition
- Adelaide, Australia, April
- L. Neumeyer and M. Weintraub, "Probabilistic optimal filtering for robust speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, vol. 1, pp. 417-420, Adelaide, Australia, April 1994.
- (1994) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , vol.1 , pp. 417-420
- Neumeyer, L.¹ Weintraub, M.²

5
- 0030149866
- A maximum-likelihood approach to stochastic matching for robust speech recognition
- A. Sankar and C. H. Lee, "A maximum-likelihood approach to stochastic matching for robust speech recognition," IEEE Trans. Speech and Audio Processing, vol. 4, no. 3, pp. 190-202, 1996.
- (1996) IEEE Trans. Speech and Audio Processing , vol.4 , Issue.3 , pp. 190-202
- Sankar, A.¹ Lee, C.H.²

6
- 0028420014
- Integrated models of signal and background with application to speaker identification in noise
- R. C. Rose, E. M. Hofstetter, and D. A. Reynolds, "Integrated models of signal and background with application to speaker identification in noise," IEEE Trans. Speech and Audio Processing, vol. 2, no. 2, pp. 245-257, 1994.
- (1994) IEEE Trans. Speech and Audio Processing , vol.2 , Issue.2 , pp. 245-257
- Rose, R.C.¹ Hofstetter, E.M.² Reynolds, D.A.³

7
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Computer Speech and Language, vol. 9, no. 2, pp. 171-185, 1995.
- (1995) Computer Speech and Language , vol.9 , Issue.2 , pp. 171-185
- Leggetter, C.J.¹ Woodland, P.C.²

8
- 0029375590
- Speaker adaptation using constrained reestimation of Gaussian mixtures
- V. Digalakis, D. Rtischev, and L. Neumeyer, "Speaker adaptation using constrained reestimation of Gaussian mixtures," IEEE Trans. Speech and Audio Processing, vol. 3, no. 5, pp. 357-366, 1995.
- (1995) IEEE Trans. Speech and Audio Processing , vol.3 , Issue.5 , pp. 357-366
- Digalakis, V.¹ Rtischev, D.² Neumeyer, L.³

9
- 0032050110
- Maximum-likelihood linear transformation for HMM-based speech recognition
- M. J. F. Gales, "Maximum-likelihood linear transformation for HMM-based speech recognition," Computer Speech and Language, vol. 12, no. 2, pp. 75-98, 1998.
- (1998) Computer Speech and Language , vol.12 , Issue.2 , pp. 75-98
- Gales, M.J.F.¹

10
- 0033100038
- Maximum-likelihood stochastic-transformation adaptation of hidden Markov models
- V. D. Diakoloukas and V. Digalakis, "Maximum-likelihood stochastic-transformation adaptation of hidden Markov models," IEEE Trans. Speech and Audio Processing, vol. 7, no. 2, pp. 177-187, 1999.
- (1999) IEEE Trans. Speech and Audio Processing , vol.7 , Issue.2 , pp. 177-187
- Diakoloukas, V.D.¹ Digalakis, V.²

11
- 0001583797
- Nonlinear compensation for stochastic matching
- A. C. Surendran, C. H. Lee, and M. Rahim, "Nonlinear compensation for stochastic matching," IEEE Trans. Speech and Audio Processing, vol. 7, no. 6, pp. 643-655, 1999.
- (1999) IEEE Trans. Speech and Audio Processing , vol.7 , Issue.6 , pp. 643-655
- Surendran, A.C.¹ Lee, C.H.² Rahim, M.³

12
- 0031103160
- On-line adaptive learning of the continuous density hidden Markov model based on approximate recursive bayes estimate
- Q. Huo, C. Chan, and C. H. Lee, "On-line adaptive learning of the continuous density hidden Markov model based on approximate recursive bayes estimate," IEEE Trans. Speech and Audio Processing, vol. 5, no. 2, pp. 161-172, 1997.
- (1997) IEEE Trans. Speech and Audio Processing , vol.5 , Issue.2 , pp. 161-172
- Huo, Q.¹ Chan, C.² Lee, C.H.³

13
- 0026142334
- A study on speaker adaptation of the parameters of continuous density hidden Markov models
- C. H. Lee, C. H. Lin, and B. H. Juang, "A study on speaker adaptation of the parameters of continuous density hidden Markov models," IEEE Trans. Acoustics, Speech, and Signal Processing, vol. 39, no. 4, pp. 806-814, 1991.
- (1991) IEEE Trans. Acoustics, Speech, and Signal Processing , vol.39 , Issue.4 , pp. 806-814
- Lee, C.H.¹ Lin, C.H.² Juang, B.H.³

14
- 0035340712
- Online adaptation of HMMs to real-life conditions: A unified framework
- C. Mokbel, "Online adaptation of HMMs to real-life conditions: A unified framework," IEEE Trans. Speech and Audio Processing, vol. 9, no. 4, pp. 342-357, 2001.
- (2001) IEEE Trans. Speech and Audio Processing , vol.9 , Issue.4 , pp. 342-357
- Mokbel, C.¹

15
- 0035341086
- Joint maximum a posteriori adaptation of transformation and HMM parameters
- O. Siohan, C. Chesta, and C. H. Lee, "Joint maximum a posteriori adaptation of transformation and HMM parameters," IEEE Trans. Speech and Audio Processing, vol. 9, no. 4, pp. 417-428, 2001.
- (2001) IEEE Trans. Speech and Audio Processing , vol.9 , Issue.4 , pp. 417-428
- Siohan, O.¹ Chesta, C.² Lee, C.H.³

16
- 0028996949
- The effects of telephone transmission degradations on speaker recognition performance
- Detroit, Mich, USA, May
- D. A. Reynolds, M. A. Zissman, T. F. Quatieri, G. C. O'Leary, and B. Carlson, "The effects of telephone transmission degradations on speaker recognition performance," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, pp. 329-332, Detroit, Mich, USA, May 1995.
- (1995) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , pp. 329-332
- Reynolds, D.A.¹ Zissman, M.A.² Quatieri, T.F.³ O'Leary, G.C.⁴ Carlson, B.⁵

17
- 0009652961
- Robust speaker verification over the telephone by feature recuperation
- Hong Kong, May
- X. Li, M. W. Mak, and S. Y. Kung, "Robust speaker verification over the telephone by feature recuperation," in Proc. International Symposium on Intelligent Multimedia, Video and Speech Processing, pp. 433-436, Hong Kong, May 2001.
- (2001) Proc. International Symposium on Intelligent Multimedia, Video and Speech Processing , pp. 433-436
- Li, X.¹ Mak, M.W.² Kung, S.Y.³

18
- 0034274733
- Estimation of handset nonlinearity with application to speaker recognition
- T. F. Quatieri, D. A. Reynolds, and G. C. O'Leary, "Estimation of handset nonlinearity with application to speaker recognition," IEEE Trans. Speech and Audio Processing, vol. 8, no. 5, pp. 567-584, 2000.
- (2000) IEEE Trans. Speech and Audio Processing , vol.8 , Issue.5 , pp. 567-584
- Quatieri, T.F.¹ Reynolds, D.A.² O'Leary, G.C.³

19
- 0036297843
- Combining stochastic feature transformation and handset identification for telephone-based speaker verification
- Orlando, Fla, USA, May
- M. W. Mak and S. Y. Kung, "Combining stochastic feature transformation and handset identification for telephone-based speaker verification," in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, pp. 1701-1704, Orlando, Fla, USA, May 2002.
- (2002) Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing , vol.1 , pp. 1701-1704
- Mak, M.W.¹ Kung, S.Y.²

20
- 0141516869
- Divergence-based out-of-class rejection for telephone handset identification
- Denver, Colo, USA, September
- C. L. Tsang, M. W. Mak, and S. Y. Kung, "Divergence-based out-of-class rejection for telephone handset identification," in Proc. International Conf. on Spoken Language Processing, pp. 2329-2332, Denver, Colo, USA, September 2002.
- (2002) Proc. International Conf. on Spoken Language Processing , pp. 2329-2332
- Tsang, C.L.¹ Mak, M.W.² Kung, S.Y.³

21
- 84946721819
- A GMM-based handset selector for channel mismatch compensation with applications to speaker identification
- Beijing, China, October
- K. K. Yiu, M. W. Mak, and S. Y. Kung, "A GMM-based handset selector for channel mismatch compensation with applications to speaker identification," in Proc. 2nd IEEE Pacific-Rim Conference on Multimedia 2001, pp. 1132-1137, Beijing, China, October 2001.
- (2001) Proc. 2nd IEEE Pacific-rim Conference on Multimedia 2001 , pp. 1132-1137
- Yiu, K.K.¹ Mak, M.W.² Kung, S.Y.³

22
- 0030682302
- HTIMIT and LLHDB: Speech corpora for the study of handset transducer effects
- Munich, Germany, April
- D. A. Reynolds, "HTIMIT and LLHDB: speech corpora for the study of handset transducer effects," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, vol. 2, pp. 1535-1538, Munich, Germany, April 1997.
- (1997) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , vol.2 , pp. 1535-1538
- Reynolds, D.A.¹

23
- 0020126872
- On the convexity of some divergence measures based on entropy functions
- J. Burbea and C. R. Rao, "On the convexity of some divergence measures based on entropy functions," IEEE Transactions on Information Theory, vol. 28, no. 3, pp. 489-495, 1982.
- (1982) IEEE Transactions on Information Theory , vol.28 , Issue.3 , pp. 489-495
- Burbea, J.¹ Rao, C.R.²

24
- 0032654473
- On the use of some divergence measures in speaker recognition
- Phoenix, Ariz, USA, March
- R. Vergin and D. O'Shaughnessy, "On the use of some divergence measures in speaker recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, vol. 1, pp. 309-312, Phoenix, Ariz, USA, March 1999.
- (1999) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , vol.1 , pp. 309-312
- Vergin, R.¹ O'Shaughnessy, D.²

25
- 0029209272
- Robust text-independent speaker identification using Gaussian mixture speaker models
- D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. Speech and Audio Processing, vol. 3, no. 1, pp. 72-83, 1995.
- (1995) IEEE Trans. Speech and Audio Processing , vol.3 , Issue.1 , pp. 72-83
- Reynolds, D.A.¹ Rose, R.C.²

26
- 0034227415
- Estimation of elliptical basis function parameters by the EM algorithms with application to speaker verification
- M. W. Mak and S. Y. Kung, "Estimation of elliptical basis function parameters by the EM algorithms with application to speaker verification," IEEE Transactions on Neural Networks, vol. 11, no. 4, pp. 961-969, 2000.
- (2000) IEEE Transactions on Neural Networks , vol.11 , Issue.4 , pp. 961-969
- Mak, M.W.¹ Kung, S.Y.²

27
- 85046873967
- The DET curve in assessment of detection task performance
- Rhodes, Greece, September
- A. Martin, G. Doddington, T. Kamm, M. Ordowski, and M. Przybocki, "The DET curve in assessment of detection task performance," in Proc. 5th biennial European Conference on Speech Communication and Technology, vol. 4, pp. 1895-1898, Rhodes, Greece, September 1997.
- (1997) Proc. 5th Biennial European Conference on Speech Communication and Technology , vol.4 , pp. 1895-1898
- Martin, A.¹ Doddington, G.² Kamm, T.³ Ordowski, M.⁴ Przybocki, M.⁵

28
- 0003500248
- Morgan Kaufmann Publishers, San Mateo, Calif, USA
- J. R. Quinlan, C4.5: Programs for Machine Learning, Morgan Kaufmann Publishers, San Mateo, Calif, USA, 1993.
- (1993) C4.5: Programs for Machine Learning
- Quinlan, J.R.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.