



Volume 21, Issue 5, 2013, Pages 1012-1022

Boosting the performance of I-vector based speaker verification via utterance partitioning

Author keywords

I-vectors; linear discriminant analysis; speaker verification; support vector machines; utterance partitioning with acoustic vector resampling (UP-AVR)

Indexed keywords

CHANNEL COMPENSATION; COSINE DISTANCE SCORING; DECISION BOUNDARY; DISCRIMINATIVE POWER; I-VECTORS; LINEAR DISCRIMINANT ANALYSIS; REPEATED APPLICATION; REPRESENTATION POWER; RESAMPLING; SPEAKER CHARACTERISTICS; SPEAKER RECOGNITION EVALUATIONS; SPEAKER VERIFICATION; TARGET SPEAKER; TRANSFORMATION MATRICES; WITHIN-CLASS COVARIANCE;

EID: 84873907352     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2013.2243436     Document Type: Article
Times cited: 53

References (31)
  • 4
    • 44949114401
    • Within-class covariance normalization for SVM-based speaker recognition
    • Pittsburgh, PA, USA, Sep.
    • A. Hatch, S. Kajarekar, and A. Stolcke, "Within-class covariance normalization for SVM-based speaker recognition," in Proc. 9th Int. Conf. Spoken Lang. Process., Pittsburgh, PA, USA, Sep. 2006, pp. 1471-1474.
    • (2006) Proc. 9th Int. Conf. Spoken Lang. Process , pp. 1471-1474
    • Hatch, A.1    Kajarekar, S.2    Stolcke, A.3
  • 5
    • 84858973723
    • Bayesian speaker verification with heavy-tailed priors
    • Brno, Czech Republic, Jun.
    • P. Kenny, "Bayesian speaker verification with heavy-tailed priors," in Proc. Odyssey: Speaker Lang. Recognition Workshop, Brno, Czech Republic, Jun. 2010.
    • (2010) Proc. Odyssey: Speaker Lang. Recognition Workshop
    • Kenny, P.1
  • 6
    • 21844447839
    • Characterization of a family of algorithms for generalized discriminant analysis on undersampled problems
    • J. Ye, "Characterization of a family of algorithms for generalized discriminant analysis on undersampled problems," J. Mach. Learn. Res., vol. 6, no. 1, pp. 483-502, 2005.
    • (2005) J. Mach. Learn. Res. , vol.6 , Issue.1 , pp. 483-502
    • Ye, J.1
  • 7
    • 0034300875
    • A new LDA-based face recognition system which can solve the small sample size problem
    • L. F. Chen, H. Y. M. Liao, M. T. Ko, J. C. Lin, and G. J. Yu, "A new LDA-based face recognition system which can solve the small sample size problem," Pattern Recognit., vol. 33, pp. 1713-1726, 2000.
    • (2000) Pattern Recognit. , vol.33 , pp. 1713-1726
    • Chen, L.F.1    Liao, H.Y.M.2    Ko, M.T.3    Lin, J.C.4    Yu, G.J.5
  • 8
    • 85073246899
    • Utterance partitioning with acoustic vector resampling for i-vector based speaker verification
    • Singapore, Jun.
    • W. Rao and M. W. Mak, "Utterance partitioning with acoustic vector resampling for i-vector based speaker verification," in Proc. Odyssey: Speaker Lang. Recognit. Workshop, Singapore, Jun. 2012.
    • (2012) Proc. Odyssey: Speaker Lang. Recognit. Workshop
    • Rao, W.1    Mak, M.W.2
  • 9
    • 84865712637
    • Addressing the data-imbalance problem in kernel-based speaker verification via utterance partitioning and speaker comparison
    • Florence, Italy, Aug.
    • W. Rao and M. W. Mak, "Addressing the data-imbalance problem in kernel-based speaker verification via utterance partitioning and speaker comparison," in Proc. Interspeech '11, Florence, Italy, Aug. 2011, pp. 2717-2720.
    • (2011) Proc. Interspeech '11 , pp. 2717-2720
    • Rao, W.1    Mak, M.W.2
  • 10
    • 78649335505
    • Utterance partitioning with acoustic vector resampling for GMM-SVM speaker verification
    • Jan.
    • M. W. Mak and W. Rao, "Utterance partitioning with acoustic vector resampling for GMM-SVM speaker verification," Speech Commun., vol. 53, no. 1, pp. 119-130, Jan. 2011.
    • (2011) Speech Commun. , vol.53 , Issue.1 , pp. 119-130
    • Mak, M.W.1    Rao, W.2
  • 11
    • 84873912781
    • The NIST Year 2010 Speaker Recognition Evaluation Plan [Online]
    • The NIST Year 2010 Speaker Recognition Evaluation Plan [Online]. Available: http://www.itl.nist.gov/iad/mig/tests/sre/2010/index.html
  • 13
    • 33947696754
    • SVM based speaker verification using a GMM supervector kernel and NAP variability compensation
    • May.
    • W. M. Campbell, D. E. Sturim, D. A. Reynolds, and A. Solomonoff, "SVM based speaker verification using a GMM supervector kernel and NAP variability compensation," in Proc. ICASSP, Toulouse, France, May 2006, vol. 1, pp. 97-100.
    • (2006) Proc. ICASSP, Toulouse, France , vol.1 , pp. 97-100
    • Campbell, W.M.1    Sturim, D.E.2    Reynolds, D.A.3    Solomonoff, A.4
  • 14
    • 0033884858
    • Speaker verification using adapted Gaussian mixture models
    • Jan
    • D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, "Speaker verification using adapted Gaussian mixture models," Digital Signal Process., vol. 10, no. 1-3, pp. 19-41, Jan. 2000.
    • (2000) Digital Signal Process. , vol.10 , Issue.1-3 , pp. 19-41
    • Reynolds, D.A.1    Quatieri, T.F.2    Dunn, R.B.3
  • 15
    • 70450180849
    • Support vector machines versus fast scoring in the low-dimensional total variability space for speaker verification
    • Sep.
    • N. Dehak, R. Dehak, P. Kenny, N. Brummer, P. Ouellet, and P. Dumouchel, "Support vector machines versus fast scoring in the low-dimensional total variability space for speaker verification," in Proc. Interspeech '09, Sep. 2009, pp. 1559-1562.
    • (2009) Proc. Interspeech '09 , pp. 1559-1562
    • Dehak, N.1    Dehak, R.2    Kenny, P.3    Brummer, N.4    Ouellet, P.5    Dumouchel, P.6
  • 16
    • 0033884857
    • Score normalization for text-independent speaker verification systems
    • Jan
    • R. Auckenthaler, M. Carey, and H. Lloyd-Thomas, "Score normalization for text-independent speaker verification systems," Digital Signal Process., vol. 10, no. 1-3, pp. 42-54, Jan. 2000.
    • (2000) Digital Signal Process. , vol.10 , Issue.1-3 , pp. 42-54
    • Auckenthaler, R.1    Carey, M.2    Lloyd-Thomas, H.3
  • 17
    • 33845536164
    • The class imbalance problem: A systematic study
    • Oct.
    • N. Japkowicz and S. Stephen, "The class imbalance problem: A systematic study," Intell. Data Anal., vol. 6, no. 5, pp. 429-449, Oct. 2002.
    • (2002) Intell. Data Anal. , vol.6 , Issue.5 , pp. 429-449
    • Japkowicz, N.1    Stephen, S.2
  • 19
    • 84865791238
    • Comparison of voice activity detectors for interview speech in NIST speaker recognition evaluation
    • Florence, Italy, Aug.
    • H. Yu and M. W. Mak, "Comparison of voice activity detectors for interview speech in NIST speaker recognition evaluation," in Proc. Interspeech '11, Florence, Italy, Aug. 2011, pp. 2353-2356.
    • (2011) Proc. Interspeech '11 , pp. 2353-2356
    • Yu, H.1    Mak, M.W.2
  • 20
    • 84945737762
    • A leisurely look at bootstrap, the jackknife, and cross-validation
    • B. Efron and G. Gong, "A leisurely look at bootstrap, the jackknife, and cross-validation," Amer. Statist., vol. 37, no. 1, pp. 36-48, 1983.
    • (1983) Amer. Statist. , vol.37 , Issue.1 , pp. 36-48
    • Efron, B.1    Gong, G.2
  • 21
    • 84859066901
    • Analysis of large-scale SVM training algorithms for language and speaker recognition
    • Jul
    • S. Cumani and P. Laface, "Analysis of large-scale SVM training algorithms for language and speaker recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 5, pp. 1585-1596, Jul. 2012.
    • (2012) IEEE Trans. Audio, Speech, Lang. Process. , vol.20 , Issue.5 , pp. 1585-1596
    • Cumani, S.1    Laface, P.2
  • 22
    • 80051604674
    • Fast discriminative speaker verification in the i-vector space
    • Prague, Czech Republic, May
    • S. Cumani, N. Brummer, L. Burget, and P. Laface, "Fast discriminative speaker verification in the i-vector space," in Proc. ICASSP '11, Prague, Czech Republic, May 2011, pp. 4852-4855.
    • (2011) Proc. ICASSP '11 , pp. 4852-4855
    • Cumani, S.1    Brummer, N.2    Burget, L.3    Laface, P.4
  • 23
    • 0016067897
    • Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
    • Jun.
    • B. S. Atal, "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," J. Acoust. Soc. Amer., vol. 55, no. 6, pp. 1304-1312, Jun. 1974.
    • (1974) J. Acoust. Soc. Amer. , vol.55 , Issue.6 , pp. 1304-1312
    • Atal, B.S.1
  • 25
    • 84873932216
    • Joint Factor Analysis Matlab Demo
    • Joint Factor Analysis Matlab Demo [Online]. Available: http://speech.fit.vutbr.cz/software/joint-factor-analysis-matlab-demo
  • 26
    • 84873894051
    • Alleviating the small sample-size problem in i-vector based speaker verification
    • Hong Kong, Dec.
    • W. Rao and M. Mak, "Alleviating the small sample-size problem in i-vector based speaker verification," in Proc. Int. Symp. Chinese Spoken Lang. Process. (ISCSLP'12), Hong Kong, Dec. 2012, pp. 335-339.
    • (2012) Proc. Int. Symp. Chinese Spoken Lang. Process. (ISCSLP'12) , pp. 335-339
    • Rao, W.1    Mak, M.2
  • 28
    • 0032042805
    • On expected classification error of the Fisher linear classifier with pseudo-inverse covariance matrix
    • S. Raudys and R. P. W. Duin, "On expected classification error of the Fisher linear classifier with pseudo-inverse covariance matrix," Pattern Recognit. Lett., vol. 19, pp. 385-392, 1998.
    • (1998) Pattern Recognit. Lett. , vol.19 , pp. 385-392
    • Raudys, S.1    Duin, R.P.W.2
  • 29
    • 0031185845
    • Eigenfaces vs. Fisherfaces: Recognition using class specific linear projection
    • Jul
    • P. N. Belhumeur, J. P. Hespanha, and D. J. Kriegman, "Eigenfaces vs. Fisherfaces: Recognition using class specific linear projection," IEEE Trans. Pattern Anal. Mach. Intell., vol. 19, no. 7, pp. 711-720, Jul. 1997.
    • (1997) IEEE Trans. Pattern Anal. Mach. Intell. , vol.19 , Issue.7 , pp. 711-720
    • Belhumeur, P.N.1    Hespanha, J.P.2    Kriegman, D.J.3
  • 31
    • 85008544242
    • Source-normalized LDA for robust speaker recognition using i-vectors from multiple speech sources
    • Mar
    • M. McLaren and D. van Leeuwen, "Source-normalized LDA for robust speaker recognition using i-vectors from multiple speech sources," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 3, pp. 755-766, Mar. 2012.
    • (2012) IEEE Trans. Audio, Speech, Lang. Process. , vol.20 , Issue.3 , pp. 755-766
    • McLaren, M.1    Van Leeuwen, D.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.