메뉴 건너뛰기




Volumn 15, Issue 7, 2007, Pages 2072-2084

Fusion of heterogeneous speaker recognition systems in the STBU submission for the NIST Speaker Recognition Evaluation 2006

Author keywords

Eigenchannel; Fusion; Gaussian mixture model (GMM); Nuisance attribute projection (NAP); Speaker recognition; Support vector machine (SVM)

Indexed keywords

EIGENCHANNEL; FUSION; GAUSSIAN MIXTURE MODEL (GMM); NUISANCE ATTRIBUTE PROJECTION (NAP); SPEAKER RECOGNITION; SUPPORT VECTOR MACHINE (SVM);

EID: 51449086024     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2007.902870     Document Type: Article
Times cited : (221)

References (39)
  • 1
    • 34547500703 scopus 로고    scopus 로고
    • P. Maťejka, L. Burget, P. Schwarz, O. Glembek, M. Karafiát, J. ̌ Cernocký, D. A. van Leeuwen, N. Brümmer, A. Strasheim, and F. Grézl, STBU system for the NIST 2006 speaker recognition evaluation, in Proc. ICASSP, 2007, pp. IV-221-IV-224.
    • P. Maťejka, L. Burget, P. Schwarz, O. Glembek, M. Karafiát, J. ̌ Cernocký, D. A. van Leeuwen, N. Brümmer, A. Strasheim, and F. Grézl, "STBU system for the NIST 2006 speaker recognition evaluation," in Proc. ICASSP, 2007, pp. IV-221-IV-224.
  • 2
    • 0033738539 scopus 로고    scopus 로고
    • The NIST speaker recognition evaluation-Overview, methodology, systems, results, perspective
    • G. R. Doddington, M. A. Przybocki, A. F. Martin, and D. A. Reynolds, "The NIST speaker recognition evaluation-Overview, methodology, systems, results, perspective," Speech Commun., vol. 31, pp. 225-254, 2000.
    • (2000) Speech Commun , vol.31 , pp. 225-254
    • Doddington, G.R.1    Przybocki, M.A.2    Martin, A.F.3    Reynolds, D.A.4
  • 4
    • 0033884858 scopus 로고    scopus 로고
    • Speaker verification using adapted gaussian mixture models
    • D. Reynolds, T. Quatieri, and R. Dunn, "Speaker verification using adapted gaussian mixture models," Digital Signal Process., vol. 10, pp. 19-41, 2000.
    • (2000) Digital Signal Process , vol.10 , pp. 19-41
    • Reynolds, D.1    Quatieri, T.2    Dunn, R.3
  • 5
    • 0036289656 scopus 로고    scopus 로고
    • Generalized linear discriminant sequence kernels for speaker recognition
    • W. M. Campbell, "Generalized linear discriminant sequence kernels for speaker recognition," in Proc. ICASSP, 2002, pp. 161-164.
    • (2002) Proc. ICASSP , pp. 161-164
    • Campbell, W.M.1
  • 6
    • 4544237515 scopus 로고    scopus 로고
    • Disentangling speaker and channel effects in speaker verification
    • P. Kenny and P. Dumouchel, "Disentangling speaker and channel effects in speaker verification," in Proc. ICASSP, 2004, pp. 37-40.
    • (2004) Proc. ICASSP , pp. 37-40
    • Kenny, P.1    Dumouchel, P.2
  • 7
    • 33645895387 scopus 로고    scopus 로고
    • Advances in channel compensation for SVM speaker recognition
    • Philadelphia, PA, Mar
    • A. Solomonoff, W. Campbell, and I. BoardmanCampbell, "Advances in channel compensation for SVM speaker recognition," in Proc. ICASSP, Philadelphia, PA, Mar. 2005, vol. I, pp. 629-632.
    • (2005) Proc. ICASSP , vol.1 , pp. 629-632
    • Solomonoff, A.1    Campbell, W.2    BoardmanCampbell, I.3
  • 10
    • 85073258179 scopus 로고    scopus 로고
    • Feature warping for robust speaker verification
    • Crete, Greece
    • J. Pelecanos and S. Sridharan, "Feature warping for robust speaker verification," in Proc. Speaker Odyssey, Crete, Greece, 2001, pp. 213-218.
    • (2001) Proc. Speaker Odyssey , pp. 213-218
    • Pelecanos, J.1    Sridharan, S.2
  • 11
    • 0003871508 scopus 로고    scopus 로고
    • Investigation of silicon-auditory models and generalization of linear discriminant analysis for improved speech recognition,
    • Ph.D. dissertation, John Hopkins Univ, Baltimore, MD
    • N.Kumar, "Investigation of silicon-auditory models and generalization of linear discriminant analysis for improved speech recognition," Ph.D. dissertation, John Hopkins Univ., Baltimore, MD, 1997.
    • (1997)
    • Kumar, N.1
  • 12
    • 33947643073 scopus 로고    scopus 로고
    • Complementarity of speech recognition systems and system combination,
    • Ph.D. dissertation, Brno Univ. Technol, Brno, Czech Republic
    • L. Burget, "Complementarity of speech recognition systems and system combination," Ph.D. dissertation, Brno Univ. Technol., Brno, Czech Republic, 2004.
    • (2004)
    • Burget, L.1
  • 13
    • 0141813506 scopus 로고    scopus 로고
    • Channel robust speaker verification via feature mapping
    • D. A. Reynolds, "Channel robust speaker verification via feature mapping," in Proc. ICASSP, 2003, pp. 53-56.
    • (2003) Proc. ICASSP , pp. 53-56
    • Reynolds, D.A.1
  • 14
    • 58349102016 scopus 로고    scopus 로고
    • Analysis of feature extraction and channel compensation in GMM speaker recognition system
    • Sep
    • L. Burget, P. Maťejka, O. Glembek, P. Schwarz, and J. H. ̌ Cernocký, "Analysis of feature extraction and channel compensation in GMM speaker recognition system," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 7, pp. 1979-1986, Sep. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.7 , pp. 1979-1986
    • Burget, L.1    Maťejka, P.2    Glembek, O.3    Schwarz, P.4    Cernocký, J.H.5
  • 15
    • 33745207323 scopus 로고    scopus 로고
    • Data-driven clustering for blind feature mapping in speaker verification
    • Lisbon, Portugal, Sep
    • M. Mason, R. Vogt, B. Baker, and S. Sridharan, "Data-driven clustering for blind feature mapping in speaker verification," in Proc. Eurospeech, Lisbon, Portugal, Sep. 2005, pp. 3109-3112.
    • (2005) Proc. Eurospeech , pp. 3109-3112
    • Mason, M.1    Vogt, R.2    Baker, B.3    Sridharan, S.4
  • 16
    • 34547496857 scopus 로고    scopus 로고
    • Spescom DataVoice NIST 2004 system description
    • Toledo, Spain, Jun
    • N. Brümmer, "Spescom DataVoice NIST 2004 system description," in Proc. NIST Speaker Recognition Evaluation 2004, Toledo, Spain, Jun. 2004, pp. 1-8.
    • (2004) Proc. NIST Speaker Recognition Evaluation 2004 , pp. 1-8
    • Brümmer, N.1
  • 19
    • 33745210768 scopus 로고    scopus 로고
    • Modelling session variability in text-independent speaker verification
    • R. Vogt, B. Baker, and S. Sridharan, "Modelling session variability in text-independent speaker verification," in Proc. Interspeech, 2005, pp. 3117-3120.
    • (2005) Proc. Interspeech , pp. 3117-3120
    • Vogt, R.1    Baker, B.2    Sridharan, S.3
  • 20
    • 33947670889 scopus 로고    scopus 로고
    • Experiments in session variability modelling for speaker verification
    • Toulouse, France, May
    • R. Vogt and S. Sridharan, "Experiments in session variability modelling for speaker verification," in Proc. ICASSP, Toulouse, France, May 2006, vol. 1, pp. 897-900.
    • (2006) Proc. ICASSP , vol.1 , pp. 897-900
    • Vogt, R.1    Sridharan, S.2
  • 21
    • 0028419019 scopus 로고
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • Apr
    • J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol. 2, pp. 291-298, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Process , vol.2 , pp. 291-298
    • Gauvain, J.-L.1    Lee, C.-H.2
  • 22
    • 0033884857 scopus 로고    scopus 로고
    • Score normalization for text-independent speaker verification systems
    • R. Auckenthaler, M. Carey, and H. Lloyd-Tomas, "Score normalization for text-independent speaker verification systems," Digital Signal Process., vol. 10, pp. 42-54, 2000.
    • (2000) Digital Signal Process , vol.10 , pp. 42-54
    • Auckenthaler, R.1    Carey, M.2    Lloyd-Tomas, H.3
  • 23
    • 33645887246 scopus 로고    scopus 로고
    • Support vector machines using GMM supervectors for speaker verification
    • May
    • W. Campbell, D. Sturim, and D. Reynolds, "Support vector machines using GMM supervectors for speaker verification," IEEE Signal Process. Lett., vol. 13, no. 5, pp. 308-311, May 2006.
    • (2006) IEEE Signal Process. Lett , vol.13 , Issue.5 , pp. 308-311
    • Campbell, W.1    Sturim, D.2    Reynolds, D.3
  • 24
    • 33646718528 scopus 로고    scopus 로고
    • A SVM/HMM system for speaker recognition
    • Hong Kong, China, Apr
    • W. Campbell, "A SVM/HMM system for speaker recognition," in Proc. ICASSP, Hong Kong, China, Apr. 2003, pp. 156-159.
    • (2003) Proc. ICASSP , pp. 156-159
    • Campbell, W.1
  • 27
    • 33947696754 scopus 로고    scopus 로고
    • SVM based speaker verification using a GMM supervector kernel and nap variability compensation
    • Toulouse, France
    • W. Campbell, D. Sturim, D. Reynolds, and A. Solomonoff, "SVM based speaker verification using a GMM supervector kernel and nap variability compensation," in Proc. ICASSP, Toulouse, France, 2006, pp. 97-100.
    • (2006) Proc. ICASSP , pp. 97-100
    • Campbell, W.1    Sturim, D.2    Reynolds, D.3    Solomonoff, A.4
  • 29
    • 0033902487 scopus 로고    scopus 로고
    • Applying logistic regression to the fusion of the NIST'99 1-speaker submissions
    • S. Pigeon, P. Druyts, and P. Verlinde, "Applying logistic regression to the fusion of the NIST'99 1-speaker submissions," Digital Signal Process., pp. 237-248, 2000.
    • (2000) Digital Signal Process , pp. 237-248
    • Pigeon, S.1    Druyts, P.2    Verlinde, P.3
  • 30
    • 80052047297 scopus 로고    scopus 로고
    • Measuring, refining and calibrating speaker and language information extracted from speech,
    • Ph.D. dissertation, Stellenbosch Univ, Stellenbosch, South Africa
    • N. Brümmer, "Measuring, refining and calibrating speaker and language information extracted from speech," Ph.D. dissertation, Stellenbosch Univ., Stellenbosch, South Africa, 2007.
    • (2007)
    • Brümmer, N.1
  • 32
    • 36248952139 scopus 로고    scopus 로고
    • D. A. van Leeuwen and N. Brümmer, An introduction to applicationindependent evaluation of speaker recognition systems, in Speaker Classification, ser. Lecture Notes in Computer Science/Artificial Intelligence, C. Müller, Ed. New York: Springer, 2007, 4343.
    • D. A. van Leeuwen and N. Brümmer, "An introduction to applicationindependent evaluation of speaker recognition systems," in Speaker Classification, ser. Lecture Notes in Computer Science/Artificial Intelligence, C. Müller, Ed. New York: Springer, 2007, vol. 4343.
  • 33
    • 29044433376 scopus 로고    scopus 로고
    • Application-independent evaluation of speaker detection
    • N. Brümmer and J. du Preez, "Application-independent evaluation of speaker detection," Comput. Speech, Lang., vol. 20, pp. 230-275, 2006.
    • (2006) Comput. Speech, Lang , vol.20 , pp. 230-275
    • Brümmer, N.1    du Preez, J.2
  • 34
    • 33745190064 scopus 로고    scopus 로고
    • Unsupervised online adaptation for a speaker verification system over the telephone
    • C. Barras, S. Meigner, and J. L. Gauvain, "Unsupervised online adaptation for a speaker verification system over the telephone," in Proc. Speaker Odyssey, 2004, pp. 1-4.
    • (2004) Proc. Speaker Odyssey , pp. 1-4
    • Barras, C.1    Meigner, S.2    Gauvain, J.L.3
  • 35
    • 85009238236 scopus 로고    scopus 로고
    • An adaptive speaker verification system with speaker dependent a priori decision thresholds
    • N. Mirghafori and L. Heck, "An adaptive speaker verification system with speaker dependent a priori decision thresholds," in Proc. ICSLP, 2002, pp. 589-592.
    • (2002) Proc. ICSLP , pp. 589-592
    • Mirghafori, N.1    Heck, L.2
  • 36
    • 33745206506 scopus 로고    scopus 로고
    • Speaker adaptation in the NIST speaker recognition evaluation 2004
    • D. A. van Leeuwen, "Speaker adaptation in the NIST speaker recognition evaluation 2004," in Proc. Eurospeech, 2005, pp. 1981-1984.
    • (2005) Proc. Eurospeech , pp. 1981-1984
    • van Leeuwen, D.A.1
  • 39
    • 0033884857 scopus 로고    scopus 로고
    • Score normalization for text-independent speaker verification systems
    • R. Auckenthaler, M. Carey, and H. Lloyd-Thomas, "Score normalization for text-independent speaker verification systems," Digital Signal Process., vol. 10, pp. 42-54, 2000.
    • (2000) Digital Signal Process , vol.10 , pp. 42-54
    • Auckenthaler, R.1    Carey, M.2    Lloyd-Thomas, H.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.