SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 15, Issue 7, 2007, Pages 2072-2084

Fusion of heterogeneous speaker recognition systems in the STBU submission for the NIST Speaker Recognition Evaluation 2006

(10) Brümmer, Niko a,b Burget, Lukáš c Černocký, Jan Honza c Glembek, Ondřej c Grézl, František c Karafiát, Martin c Van Leeuwen, David A d Matějka, Pavel c Schwarz, Petr c Strasheim, Albert b

a Spescom DataVoice (South Africa)

b STELLENBOSCH UNIVERSITY (South Africa)

c BRNO UNIVERSITY OF TECHNOLOGY (Czech Republic)

d TNO (Netherlands)

Author keywords

Eigenchannel; Fusion; Gaussian mixture model (GMM); Nuisance attribute projection (NAP); Speaker recognition; Support vector machine (SVM)

Indexed keywords

EIGENCHANNEL; FUSION; GAUSSIAN MIXTURE MODEL (GMM); NUISANCE ATTRIBUTE PROJECTION (NAP); SPEAKER RECOGNITION; SUPPORT VECTOR MACHINE (SVM);

COMMUNICATION CHANNELS (INFORMATION THEORY); CONTINUOUS SPEECH RECOGNITION; IMAGE RETRIEVAL; MAGNETOSTRICTIVE DEVICES; MAXIMUM LIKELIHOOD; MIXTURES; OBJECT RECOGNITION; TRELLIS CODES; VECTORS;

SUPPORT VECTOR MACHINES;

EID: 51449086024 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2007.902870 Document Type: Article

Times cited : (221)

References (39)

1
- 34547500703
- P. Maťejka, L. Burget, P. Schwarz, O. Glembek, M. Karafiát, J. ̌ Cernocký, D. A. van Leeuwen, N. Brümmer, A. Strasheim, and F. Grézl, STBU system for the NIST 2006 speaker recognition evaluation, in Proc. ICASSP, 2007, pp. IV-221-IV-224.
- P. Maťejka, L. Burget, P. Schwarz, O. Glembek, M. Karafiát, J. ̌ Cernocký, D. A. van Leeuwen, N. Brümmer, A. Strasheim, and F. Grézl, "STBU system for the NIST 2006 speaker recognition evaluation," in Proc. ICASSP, 2007, pp. IV-221-IV-224.

2
- 0033738539
- The NIST speaker recognition evaluation-Overview, methodology, systems, results, perspective
- G. R. Doddington, M. A. Przybocki, A. F. Martin, and D. A. Reynolds, "The NIST speaker recognition evaluation-Overview, methodology, systems, results, perspective," Speech Commun., vol. 31, pp. 225-254, 2000.
- (2000) Speech Commun , vol.31 , pp. 225-254
- Doddington, G.R.¹ Przybocki, M.A.² Martin, A.F.³ Reynolds, D.A.⁴

3
- 29044433161
- NIST and TNO-NFI evaluations of automatic speaker recognition
- D. A. van Leeuwen, A. F. Martin, M. A. Przybocki, and J. S. Bouten, "NIST and TNO-NFI evaluations of automatic speaker recognition," Comput. Speech Lang., vol. 20, pp. 128-158, 2006.
- (2006) Comput. Speech Lang , vol.20 , pp. 128-158
- van Leeuwen, D.A.¹ Martin, A.F.² Przybocki, M.A.³ Bouten, J.S.⁴

4
- 0033884858
- Speaker verification using adapted gaussian mixture models
- D. Reynolds, T. Quatieri, and R. Dunn, "Speaker verification using adapted gaussian mixture models," Digital Signal Process., vol. 10, pp. 19-41, 2000.
- (2000) Digital Signal Process , vol.10 , pp. 19-41
- Reynolds, D.¹ Quatieri, T.² Dunn, R.³

5
- 0036289656
- Generalized linear discriminant sequence kernels for speaker recognition
- W. M. Campbell, "Generalized linear discriminant sequence kernels for speaker recognition," in Proc. ICASSP, 2002, pp. 161-164.
- (2002) Proc. ICASSP , pp. 161-164
- Campbell, W.M.¹

6
- 4544237515
- Disentangling speaker and channel effects in speaker verification
- P. Kenny and P. Dumouchel, "Disentangling speaker and channel effects in speaker verification," in Proc. ICASSP, 2004, pp. 37-40.
- (2004) Proc. ICASSP , pp. 37-40
- Kenny, P.¹ Dumouchel, P.²

7
- 33645895387
- Advances in channel compensation for SVM speaker recognition
- Philadelphia, PA, Mar
- A. Solomonoff, W. Campbell, and I. BoardmanCampbell, "Advances in channel compensation for SVM speaker recognition," in Proc. ICASSP, Philadelphia, PA, Mar. 2005, vol. I, pp. 629-632.
- (2005) Proc. ICASSP , vol.1 , pp. 629-632
- Solomonoff, A.¹ Campbell, W.² BoardmanCampbell, I.³

8
- 42749099051
- NIST speaker recognition evaluation chronicles-Part 2
- San Juan, Puerto Rico
- M. A. Przybocki, A. F. Martin, and A. N. Le, "NIST speaker recognition evaluation chronicles-Part 2," in Proc. Odyssey 2006 Speaker Lang. Recognition Workshop, San Juan, Puerto Rico, 2006.
- (2006) Proc. Odyssey 2006 Speaker Lang. Recognition Workshop
- Przybocki, M.A.¹ Martin, A.F.² Le, A.N.³

9
- 0028517164
- RASTA processing of speech
- Oct
- H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 578-589, Oct. 1994.
- (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.4 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

10
- 85073258179
- Feature warping for robust speaker verification
- Crete, Greece
- J. Pelecanos and S. Sridharan, "Feature warping for robust speaker verification," in Proc. Speaker Odyssey, Crete, Greece, 2001, pp. 213-218.
- (2001) Proc. Speaker Odyssey , pp. 213-218
- Pelecanos, J.¹ Sridharan, S.²

11
- 0003871508
- Investigation of silicon-auditory models and generalization of linear discriminant analysis for improved speech recognition,
- Ph.D. dissertation, John Hopkins Univ, Baltimore, MD
- N.Kumar, "Investigation of silicon-auditory models and generalization of linear discriminant analysis for improved speech recognition," Ph.D. dissertation, John Hopkins Univ., Baltimore, MD, 1997.
- (1997)
- Kumar, N.¹

12
- 33947643073
- Complementarity of speech recognition systems and system combination,
- Ph.D. dissertation, Brno Univ. Technol, Brno, Czech Republic
- L. Burget, "Complementarity of speech recognition systems and system combination," Ph.D. dissertation, Brno Univ. Technol., Brno, Czech Republic, 2004.
- (2004)
- Burget, L.¹

13
- 0141813506
- Channel robust speaker verification via feature mapping
- D. A. Reynolds, "Channel robust speaker verification via feature mapping," in Proc. ICASSP, 2003, pp. 53-56.
- (2003) Proc. ICASSP , pp. 53-56
- Reynolds, D.A.¹

14
- 58349102016
- Analysis of feature extraction and channel compensation in GMM speaker recognition system
- Sep
- L. Burget, P. Maťejka, O. Glembek, P. Schwarz, and J. H. ̌ Cernocký, "Analysis of feature extraction and channel compensation in GMM speaker recognition system," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 7, pp. 1979-1986, Sep. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.7 , pp. 1979-1986
- Burget, L.¹ Maťejka, P.² Glembek, O.³ Schwarz, P.⁴ Cernocký, J.H.⁵

15
- 33745207323
- Data-driven clustering for blind feature mapping in speaker verification
- Lisbon, Portugal, Sep
- M. Mason, R. Vogt, B. Baker, and S. Sridharan, "Data-driven clustering for blind feature mapping in speaker verification," in Proc. Eurospeech, Lisbon, Portugal, Sep. 2005, pp. 3109-3112.
- (2005) Proc. Eurospeech , pp. 3109-3112
- Mason, M.¹ Vogt, R.² Baker, B.³ Sridharan, S.⁴

16
- 34547496857
- Spescom DataVoice NIST 2004 system description
- Toledo, Spain, Jun
- N. Brümmer, "Spescom DataVoice NIST 2004 system description," in Proc. NIST Speaker Recognition Evaluation 2004, Toledo, Spain, Jun. 2004, pp. 1-8.
- (2004) Proc. NIST Speaker Recognition Evaluation 2004 , pp. 1-8
- Brümmer, N.¹

17
- 43249091937
- Speaker and session variability in GMM-based speaker verification
- May
- P. Kenny, G. Boulianne, P. Ouellet, and P. Dumouchel, "Speaker and session variability in GMM-based speaker verification," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 4, pp. 1448-1460, May 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.4 , pp. 1448-1460
- Kenny, P.¹ Boulianne, G.² Ouellet, P.³ Dumouchel, P.⁴

18
- 50249170027
- Joint factor analysis versus eigenchannels in speaker recognition
- May
- P. Kenny, G. Boulianne, P. Ouellet, and P. Dumouchel, "Joint factor analysis versus eigenchannels in speaker recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 4, pp. 1435-1447, May 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.4 , pp. 1435-1447
- Kenny, P.¹ Boulianne, G.² Ouellet, P.³ Dumouchel, P.⁴

19
- 33745210768
- Modelling session variability in text-independent speaker verification
- R. Vogt, B. Baker, and S. Sridharan, "Modelling session variability in text-independent speaker verification," in Proc. Interspeech, 2005, pp. 3117-3120.
- (2005) Proc. Interspeech , pp. 3117-3120
- Vogt, R.¹ Baker, B.² Sridharan, S.³

20
- 33947670889
- Experiments in session variability modelling for speaker verification
- Toulouse, France, May
- R. Vogt and S. Sridharan, "Experiments in session variability modelling for speaker verification," in Proc. ICASSP, Toulouse, France, May 2006, vol. 1, pp. 897-900.
- (2006) Proc. ICASSP , vol.1 , pp. 897-900
- Vogt, R.¹ Sridharan, S.²

21
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
- Apr
- J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol. 2, pp. 291-298, Apr. 1994.
- (1994) IEEE Trans. Speech Audio Process , vol.2 , pp. 291-298
- Gauvain, J.-L.¹ Lee, C.-H.²

22
- 0033884857
- Score normalization for text-independent speaker verification systems
- R. Auckenthaler, M. Carey, and H. Lloyd-Tomas, "Score normalization for text-independent speaker verification systems," Digital Signal Process., vol. 10, pp. 42-54, 2000.
- (2000) Digital Signal Process , vol.10 , pp. 42-54
- Auckenthaler, R.¹ Carey, M.² Lloyd-Tomas, H.³

23
- 33645887246
- Support vector machines using GMM supervectors for speaker verification
- May
- W. Campbell, D. Sturim, and D. Reynolds, "Support vector machines using GMM supervectors for speaker verification," IEEE Signal Process. Lett., vol. 13, no. 5, pp. 308-311, May 2006.
- (2006) IEEE Signal Process. Lett , vol.13 , Issue.5 , pp. 308-311
- Campbell, W.¹ Sturim, D.² Reynolds, D.³

24
- 33646718528
- A SVM/HMM system for speaker recognition
- Hong Kong, China, Apr
- W. Campbell, "A SVM/HMM system for speaker recognition," in Proc. ICASSP, Hong Kong, China, Apr. 2003, pp. 156-159.
- (2003) Proc. ICASSP , pp. 156-159
- Campbell, W.¹

25
- 33745216683
- MLLR transforms as features in speaker recognition
- Lisbon, Portugal, Sep
- A. Stolcke, L. Ferrer, S. Kajarekar, E. Shriberg, and A. Venkataraman, "MLLR transforms as features in speaker recognition," in Proc. Eurospeech, Lisbon, Portugal, Sep. 2005, pp. 2425-2428.
- (2005) Proc. Eurospeech , pp. 2425-2428
- Stolcke, A.¹ Ferrer, L.² Kajarekar, S.³ Shriberg, E.⁴ Venkataraman, A.⁵

26
- 33745536025
- The 2005 AMI system for the transcription of speech in meetings
- Heidelberg, Germany: Springer Berlin
- T. Hain, L. Burget, J. Dines, G. Garau, M. Karafiat, M. Lincoln, I. McCowan, D. Moore, V.Wan, R. Ordelman, and S. Renals, "The 2005 AMI system for the transcription of speech in meetings," in Lecture Notes in Computer Science. Heidelberg, Germany: Springer Berlin, 2006, vol. 3869, pp. 450-462.
- (2006) Lecture Notes in Computer Science , vol.3869 , pp. 450-462
- Hain, T.¹ Burget, L.² Dines, J.³ Garau, G.⁴ Karafiat, M.⁵ Lincoln, M.⁶ McCowan, I.⁷ Moore, D.⁸ Wan, V.⁹ Ordelman, R.¹⁰ Renals, S.¹¹

27
- 33947696754
- SVM based speaker verification using a GMM supervector kernel and nap variability compensation
- Toulouse, France
- W. Campbell, D. Sturim, D. Reynolds, and A. Solomonoff, "SVM based speaker verification using a GMM supervector kernel and nap variability compensation," in Proc. ICASSP, Toulouse, France, 2006, pp. 97-100.
- (2006) Proc. ICASSP , pp. 97-100
- Campbell, W.¹ Sturim, D.² Reynolds, D.³ Solomonoff, A.⁴

28
- 0003710380
- Online, Available
- C.-C. Chang and C.-J. Lin, "LIBSVM: A library for support vector machines." [Online]. Available: http://www.csie.ntu.edu.tw/̃cjlin/ libsvm2001
- LIBSVM: A library for support vector machines
- Chang, C.-C.¹ Lin, C.-J.²

29
- 0033902487
- Applying logistic regression to the fusion of the NIST'99 1-speaker submissions
- S. Pigeon, P. Druyts, and P. Verlinde, "Applying logistic regression to the fusion of the NIST'99 1-speaker submissions," Digital Signal Process., pp. 237-248, 2000.
- (2000) Digital Signal Process , pp. 237-248
- Pigeon, S.¹ Druyts, P.² Verlinde, P.³

30
- 80052047297
- Measuring, refining and calibrating speaker and language information extracted from speech,
- Ph.D. dissertation, Stellenbosch Univ, Stellenbosch, South Africa
- N. Brümmer, "Measuring, refining and calibrating speaker and language information extracted from speech," Ph.D. dissertation, Stellenbosch Univ., Stellenbosch, South Africa, 2007.
- (2007)
- Brümmer, N.¹

31
- 0003508724
- New York: Wiley
- D. W. Hosner and S. Lemeshow, Applied Logistic Regression. New York: Wiley, 1989.
- (1989) Applied Logistic Regression
- Hosner, D.W.¹ Lemeshow, S.²

32
- 36248952139
- D. A. van Leeuwen and N. Brümmer, An introduction to applicationindependent evaluation of speaker recognition systems, in Speaker Classification, ser. Lecture Notes in Computer Science/Artificial Intelligence, C. Müller, Ed. New York: Springer, 2007, 4343.
- D. A. van Leeuwen and N. Brümmer, "An introduction to applicationindependent evaluation of speaker recognition systems," in Speaker Classification, ser. Lecture Notes in Computer Science/Artificial Intelligence, C. Müller, Ed. New York: Springer, 2007, vol. 4343.

33
- 29044433376
- Application-independent evaluation of speaker detection
- N. Brümmer and J. du Preez, "Application-independent evaluation of speaker detection," Comput. Speech, Lang., vol. 20, pp. 230-275, 2006.
- (2006) Comput. Speech, Lang , vol.20 , pp. 230-275
- Brümmer, N.¹ du Preez, J.²

34
- 33745190064
- Unsupervised online adaptation for a speaker verification system over the telephone
- C. Barras, S. Meigner, and J. L. Gauvain, "Unsupervised online adaptation for a speaker verification system over the telephone," in Proc. Speaker Odyssey, 2004, pp. 1-4.
- (2004) Proc. Speaker Odyssey , pp. 1-4
- Barras, C.¹ Meigner, S.² Gauvain, J.L.³

35
- 85009238236
- An adaptive speaker verification system with speaker dependent a priori decision thresholds
- N. Mirghafori and L. Heck, "An adaptive speaker verification system with speaker dependent a priori decision thresholds," in Proc. ICSLP, 2002, pp. 589-592.
- (2002) Proc. ICSLP , pp. 589-592
- Mirghafori, N.¹ Heck, L.²

36
- 33745206506
- Speaker adaptation in the NIST speaker recognition evaluation 2004
- D. A. van Leeuwen, "Speaker adaptation in the NIST speaker recognition evaluation 2004," in Proc. Eurospeech, 2005, pp. 1981-1984.
- (2005) Proc. Eurospeech , pp. 1981-1984
- van Leeuwen, D.A.¹

37
- 37649033022
- Supervised and unsupervised speaker adaptation in the NIST 2005 Speaker Recognition Evaluation
- E. G. Hansen, R. E. Slyh, and T. R. Anderson, "Supervised and unsupervised speaker adaptation in the NIST 2005 Speaker Recognition Evaluation," in Proc. Odyssey 2006 Speaker Lang. Recognition Workshop, 2006.
- (2006) Proc. Odyssey 2006 Speaker Lang. Recognition Workshop
- Hansen, E.G.¹ Slyh, R.E.² Anderson, T.R.³

38
- 42749101101
- Speaker adaptation for factor analysis based speaker verification
- San Juan, Puerto Rico, Jun
- S.-C. Yin, P. Kenny, and R. Rose, "Speaker adaptation for factor analysis based speaker verification," in Proc. Odyssey 2006 Speaker and Language Recognition Workshop, San Juan, Puerto Rico, Jun. 2006.
- (2006) Proc. Odyssey 2006 Speaker and Language Recognition Workshop
- Yin, S.-C.¹ Kenny, P.² Rose, R.³

39
- 0033884857
- Score normalization for text-independent speaker verification systems
- R. Auckenthaler, M. Carey, and H. Lloyd-Thomas, "Score normalization for text-independent speaker verification systems," Digital Signal Process., vol. 10, pp. 42-54, 2000.
- (2000) Digital Signal Process , vol.10 , pp. 42-54
- Auckenthaler, R.¹ Carey, M.² Lloyd-Thomas, H.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.