SCOPUS 정보 검색 플랫폼

Speech Communication

Volumn 60, Issue , 2014, Pages 56-77

Text-dependent speaker verification: Classifiers, databases and RSR2015

(4) Larcher, Anthony a Lee, Kong Aik a Ma, Bin a Li, Haizhou a

a INSTITUTE FOR INFOCOMM RESEARCH (Singapore)

Author keywords

Database; Speaker recognition; Text dependent

Indexed keywords

CLASSIFICATION (OF INFORMATION); DATABASE SYSTEMS; MOBILE DEVICES; NETWORK ARCHITECTURE;

EVALUATION PROTOCOL; EVALUATION SCHEME; HUMAN LANGUAGE TECHNOLOGIES; RESEARCH COMMUNITIES; SPEAKER RECOGNITION; SPEAKER VERIFICATION; SPEAKER VERIFICATION SYSTEM; TEXT-DEPENDENT;

SPEECH RECOGNITION;

EID: 84897385841 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/j.specom.2014.03.001 Document Type: Article

Times cited : (293)

References (136)

1
- 60349089517
- Speaker-dependent characteristics of the nasals
- K. Amino, and T. Arai Speaker-dependent characteristics of the nasals Forensic Sci. Int. 185 2009 21 28
- (2009) Forensic Sci. Int. , vol.185 , pp. 21-28
- Amino, K.¹ Arai, T.²

2
- 85073226884
- Text-dependent speaker verification using a small development set
- Aronowitz, H.; 2012. Text-dependent speaker verification using a small development set. In: Odyssey Speaker and Language Recognition Workshop.
- (2012) Odyssey Speaker and Language Recognition Workshop
- Aronowitz, H.¹

3
- 79959833152
- Exploring subsegmental and suprasegmental features for a text-dependent speaker verification in distant speech signals
- Avinash, B.; Guruprasad, S.; Ygnannarayana, B.; 2010. Exploring subsegmental and suprasegmental features for a text-dependent speaker verification in distant speech signals. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 1073-1076.
- (2010) Annual Conference of the International Speech Communication Association (Interspeech) , pp. 1073-1076
- Avinash, B.¹ Guruprasad, S.² Ygnannarayana, B.³

4
- 35248819751
- The BANCA database and evaluation protocol
- E. Bailly-Bailliere, S. Bengio, F. Bimbot, M. Hamouz, J. Kittler, J. Mariethoz, J. Matas, K. Messer, V. Popovici, and F. Poree et al. The BANCA database and evaluation protocol Lect. Notes Comput. Sci. (LNCS) 2688 2003 625 638
- (2003) Lect. Notes Comput. Sci. (LNCS) , vol.2688 , pp. 625-638
- Bailly-Bailliere, E.¹ Bengio, S.² Bimbot, F.³ Hamouz, M.⁴ Kittler, J.⁵ Mariethoz, J.⁶ Matas, J.⁷ Messer, K.⁸ Popovici, V.⁹ Poree, F.¹⁰

5
- 33746432558
- User-customized password speaker verification using multiple reference and background models
- DOI 10.1016/j.specom.2005.08.008, PII S016763930600046X
- M.F. BenZeghiba, and H. Bourlard User-customized password speaker verification using multiple reference and background models Speech Commun. 48 2006 1200 1213 (Pubitemid 44128618)
- (2006) Speech Communication , vol.48 , Issue.9 , pp. 1200-1213
- BenZeghiba, M.F.¹ Bourlard, H.²

6
- 33646774519
- Text-constrained speaker recognition on a text-independent task
- Boakye, K.; Peskin, B.; 2004. Text-constrained speaker recognition on a text-independent task. In: Odyssey Speaker and Language Recognition Workshop, pp. 1-6.
- (2004) Odyssey Speaker and Language Recognition Workshop , pp. 1-6
- Boakye, K.¹ Peskin, B.²

7
- 84897459077
- Study on the effect of lexical mismatch in text-dependent speaker verification
- Boies, D.; Hébert, M.; Heck, L.P.; 2004. Study on the effect of lexical mismatch in text-dependent speaker verification. In: Odyssey Speaker and Language Recognition Workshop, pp. 1-5.
- (2004) Odyssey Speaker and Language Recognition Workshop , pp. 1-5
- Boies, D.¹ Hébert, M.² Heck, L.P.³

8
- 84955726258
- Gaussian dynamic warping (GDW) method applied to text-dependent speaker detection and verification
- Bonastre, J.F.; Morin, P.; Junqua, J.C.; 2003. Gaussian dynamic warping (GDW) method applied to text-dependent speaker detection and verification. In: European Conference on Speech Communication and Technology (Eurospeech), pp. 2013-2016.
- (2003) European Conference on Speech Communication and Technology (Eurospeech) , pp. 2013-2016
- Bonastre, J.F.¹ Morin, P.² Junqua, J.C.³

9
- 84865753339
- Intersession compensation and scoring methods in the i-vectors space for speaker recognition
- Bousquet, P.M.; Matrouf, D.; Bonastre, J.F.; 2011. Intersession compensation and scoring methods in the i-vectors space for speaker recognition. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 485-488.
- (2011) Annual Conference of the International Speech Communication Association (Interspeech) , pp. 485-488
- Bousquet, P.M.¹ Matrouf, D.² Bonastre, J.F.³

10
- 85073229756
- Variance-spectra based normalization for I-vector standard and probabilistic linear discriminant analysis
- Bousquet, P.M.; Larcher, A.; Matrouf, D.; Bonastre, J.F.; Plchot, O.; 2012. Variance-spectra based normalization for I-vector standard and probabilistic linear discriminant analysis. In: Odyssey Speaker and Language Recognition Workshop, pp. 1-8.
- (2012) Odyssey Speaker and Language Recognition Workshop , pp. 1-8
- Bousquet, P.M.¹ Larcher, A.² Matrouf, D.³ Bonastre, J.F.⁴ Plchot, O.⁵

11
- 85073103063
- The speaker partitioning problem
- Brümmer, N.; de Villiers, E.; 2010. The speaker partitioning problem. In: Odyssey Speaker and Language Recognition Workshop, pp. 1-8.
- (2010) Odyssey Speaker and Language Recognition Workshop , pp. 1-8
- Brümmer, N.¹ De Villiers, E.²

12
- 0028996937
- Testing with the YOHO CD-ROM voice verification corpus
- Campbell, J.P.; 1995. Testing with the YOHO CD-ROM voice verification corpus. in: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, pp. 341-344.
- (1995) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP , pp. 341-344
- Campbell, J.P.¹

13
- 77952748173
- LCD website
- Campbell, J.; Higgins, A.L.; 1994. A YOHO speaker verification corpus LDC94s16. Available on LCD website: .
- (1994) A YOHO Speaker Verification Corpus LDC94s16
- Campbell, J.¹ Higgins, A.L.²

14
- 0032674167
- Corpora for the evaluation of speaker recognition systems
- Campbell, J.P.; Reynolds, D.A.; 1999. Corpora for the evaluation of speaker recognition systems. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, pp. 829-832.
- (1999) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP , pp. 829-832
- Campbell, J.P.¹ Reynolds, D.A.²

15
- 85032751904
- Forensic speaker recognition
- J.P. Campbell, W. Shen, W.M. Campbell, R. Schwartz, J.F. Bonastre, and D. Matrouf Forensic speaker recognition IEEE Signal Process. Mag. 26 2009 95 103
- (2009) IEEE Signal Process. Mag. , vol.26 , pp. 95-103
- Campbell, J.P.¹ Shen, W.² Campbell, W.M.³ Schwartz, R.⁴ Bonastre, J.F.⁵ Matrouf, D.⁶

16
- 0031220765
- Optimizing feature set for speaker verification
- PII S0167865597000640
- D. Charlet, and D. Jouvet Optimizing feature set for speaker verification Pattern Recognit. Lett. 18 1997 873 879 (Pubitemid 127411230)
- (1997) Pattern Recognition Letters , vol.18 , Issue.9 , pp. 873-879
- Charlet, D.¹ Jouvet, D.²

17
- 0033738353
- An alternative normalization scheme in HMM-based text-dependent speaker verification
- D. Charlet, D. Jouvet, and O. Collin An alternative normalization scheme in HMM-based text-dependent speaker verification Speech Commun. 31 2000 113 120
- (2000) Speech Commun. , vol.31 , pp. 113-120
- Charlet, D.¹ Jouvet, D.² Collin, O.³

18
- 60349089170
- A Robust to outliers hidden markov model with application in text-dependent speaker identification
- Chatzis, S.; Varvarigou, T.; 2007. A Robust to outliers hidden markov model with application in text-dependent speaker identification. In: International Conference on Signal Processing and Communications, pp. 804-807.
- (2007) International Conference on Signal Processing and Communications , pp. 804-807
- Chatzis, S.¹ Varvarigou, T.²

19
- 0029765817
- An HMM approach to text-prompted speaker verification
- Che, C.W.; Lin, Q.; Yuk, D.S.; 1996. An HMM approach to text-prompted speaker verification. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, pp. 673-676.
- (1996) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP , pp. 673-676
- Che, C.W.¹ Lin, Q.² Yuk, D.S.³

20
- 0030244499
- A modified HME architecture for text-dependent speaker identification
- PII S1045922796066167
- K. Chen, D. Xie, and H. Chi A modified HME architecture for text-dependent speaker identification IEEE Trans. Neural Networks 7 1996 1309 1313 (Pubitemid 126776401)
- (1996) IEEE Transactions on Neural Networks , vol.7 , Issue.5 , pp. 1309-1313
- Chen, K.¹ Xie, D.² Chi, H.³

21
- 84872130198
- GMM-UBM for text-dependent speaker recognition
- IEEE
- W. Chen, Q. Hong, and X. Li GMM-UBM for text-dependent speaker recognition International Conference on Audio, Language and Image Processing (ICALIP) 2012 IEEE 432 435
- (2012) International Conference on Audio, Language and Image Processing (ICALIP) , pp. 432-435
- Chen, W.¹ Hong, Q.² Li, X.³

22
- 0010571638
- Swiss French PolyPhone and PolyVar: Telephone Speech Databases to Model Inter- and Intra-Speaker Variability
- Chollet, G.; Cochard, J.L.; Constantinescu, A.; Jaboulet, C.; Langlais, P.; 1996. Swiss French PolyPhone and PolyVar: Telephone Speech Databases to Model Inter- and Intra-Speaker Variability. Technical Report. IDIAP.
- (1996) Technical Report. IDIAP
- Chollet, G.¹ Cochard, J.L.² Constantinescu, A.³ Jaboulet, C.⁴ Langlais, P.⁵

23
- 85078515740
- The CSLU speaker recognition corpus
- Cole, R.; Noel, M.; Noel, V.; 1998. The CSLU speaker recognition corpus. In: Proceedings International Conference on Spoken Language Processing, ICSLP, pp. 3167-3170.
- (1998) Proceedings International Conference on Spoken Language Processing, ICSLP , pp. 3167-3170
- Cole, R.¹ Noel, M.² Noel, V.³

24
- 84867615142
- Gender independent discriminative speaker recognition in i-vector space
- Cumani, S.; Glembek, O.; Brummer, N.; de Villiers, E.; Laface, P.; 2012. Gender independent discriminative speaker recognition in i-vector space. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, pp. 4361-4364.
- (2012) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP , pp. 4361-4364
- Cumani, S.¹ Glembek, O.² Brummer, N.³ De Villiers, E.⁴ Laface, P.⁵

25
- 84890465306
- Probabilistic linear discriminant analysis of I-vector posterior distribution
- Cumani, S.; Plchot, O.; Laface, P.; 2013. Probabilistic linear discriminant analysis of I-vector posterior distribution. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, pp. 7644-7647.
- (2013) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP , pp. 7644-7647
- Cumani, S.¹ Plchot, O.² Laface, P.³

26
- 84897430194
- Direct modeling of spoken passwords for text-dependent speaker recognition by compressed time-feature representations
- Das, A.; Tapaswi, M.; 2010. Direct modeling of spoken passwords for text-dependent speaker recognition by compressed time-feature representations. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, pp. 4510-4513.
- (2010) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP , pp. 4510-4513
- Das, A.¹ Tapaswi, M.²

27
- 85028060103
- Cosine similarity scoring without score normalization techniques
- Odyssey
- Dehak, N.; Dehak, R.; Glass, J.; Reynolds, D.; Kenny, P.; 2010. Cosine similarity scoring without score normalization techniques. In: Odyssey Speaker and Language Recognition Workshop, Odyssey, pp. 1-5.
- (2010) Odyssey Speaker and Language Recognition Workshop , pp. 1-5
- Dehak, N.¹ Dehak, R.² Glass, J.³ Reynolds, D.⁴ Kenny, P.⁵

28
- 79951609039
- Front-end factor analysis for speaker verification
- N. Dehak, P. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet Front-end factor analysis for speaker verification IEEE Trans. Audio Speech Lang. Process. 19 2011 788 798
- (2011) IEEE Trans. Audio Speech Lang. Process. , vol.19 , pp. 788-798
- Dehak, N.¹ Kenny, P.² Dehak, R.³ Dumouchel, P.⁴ Ouellet, P.⁵

29
- 84865750857
- Language recognition via i-vectors and dimensionality reduction
- Dehak, N.; Torres-Carrasquillo, P.A.; Reynolds, D.A.; Dehak, R.; 2011b. Language recognition via i-vectors and dimensionality reduction. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 857-860.
- (2011) Annual Conference of the International Speech Communication Association (Interspeech) , pp. 857-860
- Dehak, N.¹ Torres-Carrasquillo, P.A.² Reynolds, D.A.³ Dehak, R.⁴

30
- 33947392865
- Multimodal biometrics for identity documents
- D. Dessimoz, J. Richiardi, C. Champod, and A. Drygajlo Multimodal biometrics for identity documents Forensic Sci. Int. 167 2008 154 159
- (2008) Forensic Sci. Int. , vol.167 , pp. 154-159
- Dessimoz, D.¹ Richiardi, J.² Champod, C.³ Drygajlo, A.⁴

31
- 84897384500
- Dialogues Spotlight Technology The Center for Communication Interface Research, University of Edinburgh
- Dialogues Spotlight Technology, 2000. Large Scale Evaluation of Automatic Speaker Verification Technology. Technical Report. The Center for Communication Interface Research, University of Edinburgh.
- (2000) Large Scale Evaluation of Automatic Speaker Verification Technology. Technical Report

32
- 0040035169
- Speaker recognition evaluation methodology - An overview and perspective
- Doddington, G.R.; 1998. Speaker recognition evaluation methodology - an overview and perspective. In: Workshop on Speaker Recognition and its Commercial and Forensic Applications (RLA2C), pp. 20-23.
- (1998) Workshop on Speaker Recognition and Its Commercial and Forensic Applications (RLA2C) , pp. 20-23
- Doddington, G.R.¹

33
- 85073255206
- The effect of target/non-target age difference on speaker recognition performance
- Doddington, G.; 2012. The effect of target/non-target age difference on speaker recognition performance. In: Odyssey Speaker and Language Recognition Workshop, pp. 1-5.
- (2012) Odyssey Speaker and Language Recognition Workshop , pp. 1-5
- Doddington, G.¹

34
- 85084013548
- Support vector machines based text dependent speaker verification using HMM superverctors
- Dong, C.; Dong, Y.; Li, J.; Wang, H.; 2008. Support vector machines based text dependent speaker verification using HMM superverctors. In: Odyssey Speaker and Language Recognition Workshop, pp. 1-7.
- (2008) Odyssey Speaker and Language Recognition Workshop , pp. 1-7
- Dong, C.¹ Dong, Y.² Li, J.³ Wang, H.⁴

35
- 34547221720
- MyIdea - Multimodal biometrics database, description of acquisition protocols
- B. Dumas, C. Pugin, J. Hennebert, D. Petrovska-Delacrétaz, A. Humm, F. Evéquoz, R. Ingold, and D.V. Rotz MyIdea - multimodal biometrics database, description of acquisition protocols Biometrics Internet 275 2005 59 62
- (2005) Biometrics Internet , vol.275 , pp. 59-62
- Dumas, B.¹ Pugin, C.² Hennebert, J.³ Petrovska-Delacrétaz, D.⁴ Humm, A.⁵ Evéquoz, F.⁶ Ingold, R.⁷ Rotz, D.V.⁸

36
- 52049089205
- Text dependent speaker identification based on spectrograms
- Dutta, T.; 2007. Text dependent speaker identification based on spectrograms. In: Image and Vision Computing, pp. 238-243.
- (2007) Image and Vision Computing , pp. 238-243
- Dutta, T.¹

37
- 52049109385
- Dynamic time warping based approach to text-dependent speaker identification using spectrograms
- Dutta, T.; 2008. Dynamic time warping based approach to text-dependent speaker identification using spectrograms. In: Congress on Image and Signal Processing, pp. 354-360.
- (2008) Congress on Image and Signal Processing , pp. 354-360
- Dutta, T.¹

38
- 84897388969
- S0050, RUSTEN: Russian switched telephone network speech database (STC)
- ELDA - Evaluations and Language resources Distribution Agency, 2003. S0050, RUSTEN: Russian switched telephone network speech database (STC).
- (2003) ELDA - Evaluations and Language Resources Distribution Agency

39
- 0028996904
- Text-dependent speaker verification using data fusion
- Institute of Electrical Engineers Inc (IEE)
- K.R. Farrell Text-dependent speaker verification using data fusion IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1995 Institute of Electrical Engineers Inc (IEE) 349 352
- (1995) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP , pp. 349-352
- Farrell, K.R.¹

40
- 0002525996
- An analysis of data fusion methods for speaker verification
- Farrell, K.R.; Ramachandran, R.P.; Mammone, R.J.; 1998. An analysis of data fusion methods for speaker verification. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, pp. 1129-1132.
- (1998) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP , pp. 1129-1132
- Farrell, K.R.¹ Ramachandran, R.P.² Mammone, R.J.³

41
- 33749525191
- Multimodal biometric databases: An overview
- 1703234
- M. Faundez-Zanuy, J. Fierrez-Aguilar, J. Ortega-Garcia, and J. Gonzalez-Rodriguez Multimodal biometric databases: an overview IEEE Aerosp. Electron. Syst. Mag. 21 2006 29 37 (Pubitemid 44523682)
- (2006) IEEE Aerospace and Electronic Systems Magazine , vol.21 , Issue.8 , pp. 29-37
- Faundez-Zanuy, M.¹ Fierrez-Aguilar, J.² Ortega-Garcia, J.³ Gonzalez-Rodriguez, J.⁴

42
- 78651239704
- (Ph.D. thesis). School of Engineering Swansea University
- Fauve, B.; 2009. Tackling Variabilities in Speaker Verification with a Focus on Short Durations (Ph.D. thesis). School of Engineering Swansea University.
- (2009) Tackling Variabilities in Speaker Verification with A Focus on Short Durations
- Fauve, B.¹

43
- 33845432328
- Biosec baseline corpus: A multimodal biometric database
- DOI 10.1016/j.patcog.2006.10.014, PII S0031320306004304
- J. Fierrez, J. Ortega-Garcia, D. Torre Toledano, and J. Gonzalez-Rodriguez Biosec baseline corpus: a multimodal biometric database Pattern Recognit. 40 2007 1389 1392 (Pubitemid 44894487)
- (2007) Pattern Recognition , vol.40 , Issue.4 , pp. 1389-1392
- Fierrez, J.¹ Ortega-Garcia, J.² Torre Toledano, D.³ Gonzalez-Rodriguez, J.⁴

44
- 77951880527
- BiosecurID: A multimodal biometric database
- J. Fierrez, J. Galbally, J. Ortega-Garcia, M. Freire, F. Alonso-Fernandez, D. Ramos, D. Toledano, J. Gonzalez-Rodriguez, J. Siguenza, and J. Garrido-Salas et al. BiosecurID: a multimodal biometric database Pattern Anal. Appl. 13 2010 235 246
- (2010) Pattern Anal. Appl. , vol.13 , pp. 235-246
- Fierrez, J.¹ Galbally, J.² Ortega-Garcia, J.³ Freire, M.⁴ Alonso-Fernandez, F.⁵ Ramos, D.⁶ Toledano, D.⁷ Gonzalez-Rodriguez, J.⁸ Siguenza, J.⁹ Garrido-Salas, J.¹⁰

45
- 0029768209
- Comparison of multilayer and radial basis function neural networks for text-dependent speaker recognition
- IEEE
- R. Finan, A. Sapeluk, and R. Damper Comparison of multilayer and radial basis function neural networks for text-dependent speaker recognition IEEE International Conference on Neural Networks 1996 IEEE 1992 1997
- (1996) IEEE International Conference on Neural Networks , pp. 1992-1997
- Finan, R.¹ Sapeluk, A.² Damper, R.³

46
- 0029354680
- Discriminating observation probability (DOP) HMM for speaker verification
- M. Forsyth Discriminating observation probability (DOP) HMM for speaker verification Speech Commun. 17 1995 117 129
- (1995) Speech Commun. , vol.17 , pp. 117-129
- Forsyth, M.¹

47
- 26444562315
- The Realistic multi-modal valid database and visual speaker identification comparison experiments
- New York, USA
- Fox, N.A.; O'Mullane, B.A.; Reilly, R.B.; 2005. The Realistic multi-modal valid database and visual speaker identification comparison experiments. In: International Conference of Audio and Video-Based Person Authentication, AVBPA, New York, USA, pp. 777-786.
- (2005) International Conference of Audio and Video-Based Person Authentication, AVBPA , pp. 777-786
- Fox, N.A.¹ O'Mullane, B.A.² Reilly, R.B.³

48
- 0019555090
- Cepstral analysis technique for automatic speaker verification
- S. Furui Cepstral analysis technique for automatic speaker verification IEEE Trans. Acoust. Speech Signal Process. (see also IEEE Trans. Signal Process.) 29 1981 254 272 (Pubitemid 11495877)
- (1981) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.ASSP-29 , Issue.2 , pp. 254-272
- Furui Sadaoki¹

49
- 0019583902
- Comparison of speaker recognition methods using statistical features and dynamic features
- S. Furui Comparison of speaker recognition methods using statistical features and dynamic features IEEE Trans. Acoust. Speech Signal Process. 29 1981 342 350 (Pubitemid 11520516)
- (1981) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.ASSP-29 , Issue.3 PART 1 , pp. 342-350
- Furui Sadaoki¹

50
- 84865733857
- Analysis of i-vector length normalization in speaker recognition systems
- Garcia-Romero, D.; Espy-Wilson, C.Y.; 2011. Analysis of i-vector length normalization in speaker recognition systems. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 249-252.
- (2011) Annual Conference of the International Speech Communication Association (Interspeech) , pp. 249-252
- Garcia-Romero, D.¹ Espy-Wilson, C.Y.²

51
- 22844433950
- BIOMET: A multimodal person authentication database including face, voice, fingerprint, hand and signature modalities
- Springer; Berlin, Heidelberg
- Garcia-Salicetti, S.; Beumier, C.; Chollet, G.; Dorizzi, B.; Jardins, J.; Lunter, J.; Ni, Y.; Petrovska-Delacretaz, D. 2003. BIOMET: A multimodal person authentication database including face, voice, fingerprint, hand and signature modalities. Audio-and Video-Based Biometric Person Authentication. Springer; Berlin, Heidelberg.
- (2003) Audio-and Video-Based Biometric Person Authentication
- Garcia-Salicetti, S.¹ Beumier, C.² Chollet, G.³ Dorizzi, B.⁴ Jardins, J.⁵ Lunter, J.⁶ Ni, Y.⁷ Petrovska-Delacretaz, D.⁸

52
- 0003548585
- Philadelphia, PA
- Garofolo, J.S.; Lamel, L.F.; Fisher, W.M.; Fiscus, J.G.; Pallett, D.S.; Dahlgren, N.; Zue, V.; 1993. Timit Acoustic-Phonetic Continuous Speech Corpus Linguistic Data Consortium. Philadelphia, PA, p. 1.
- (1993) Timit Acoustic-Phonetic Continuous Speech Corpus Linguistic Data Consortium , pp. 1
- Garofolo, J.S.¹ Lamel, L.F.² Fisher, W.M.³ Fiscus, J.G.⁴ Pallett, D.S.⁵ Dahlgren, N.⁶ Zue, V.⁷

53
- 85009088413
- An implementation and evaluation of an on-line speaker verification system for field trials
- Gu, Y.; Thomas, T.; 1998. An implementation and evaluation of an on-line speaker verification system for field trials. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 125-128.
- (1998) Annual Conference of the International Speech Communication Association (Interspeech) , pp. 125-128
- Gu, Y.¹ Thomas, T.²

54
- 84890472775
- Duration mismatch compensation for I-vector based speaker recognition systems
- Hasan, T.; Saeidi, R.; Hansen, J.H.L.; van Leeuwen, D.A.; 2013. Duration mismatch compensation for I-vector based speaker recognition systems. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, pp. 7663-7667.
- (2013) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP , pp. 7663-7667
- Hasan, T.¹ Saeidi, R.² Hansen, J.H.L.³ Van Leeuwen, D.A.⁴

55
- 85024895429
- Text-dependent speaker recognition
- Springer-Verlag Heidelberg (Chapter)
- M. Hébert Text-dependent speaker recognition Handbook of Speech Processing 2008 Springer-Verlag Heidelberg 743 762 (Chapter)
- (2008) Handbook of Speech Processing , pp. 743-762
- Hébert, M.¹

56
- 33646797748
- T-norm for text-dependent commercial speaker verification applications: Effect of lexical mismatch
- Hébert, M.; Boies, D.; 2005. T-norm for text-dependent commercial speaker verification applications: Effect of lexical mismatch. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, pp. 729-732.
- (2005) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP , pp. 729-732
- Hébert, M.¹ Boies, D.²

57
- 85009230334
- Phonetic class-based speaker verification
- Geneva
- Hebert, M.; Heck, L.P.; 2003. Phonetic class-based speaker verification. In: European Conference on Speech Communication and Technology (Eurospeech), Geneva, pp. 1665-1668.
- (2003) European Conference on Speech Communication and Technology (Eurospeech) , pp. 1665-1668
- Hebert, M.¹ Heck, L.P.²

58
- 85009131570
- Integrating speaker and speech recognizers: Automatic identity claim capture for speaker verification
- Heck, L.; Genoud, D.; 2001. Integrating speaker and speech recognizers: automatic identity claim capture for speaker verification. In: Odyssey Speaker and Language Recognition Workshop, pp. 249-254.
- (2001) Odyssey Speaker and Language Recognition Workshop , pp. 249-254
- Heck, L.¹ Genoud, D.²

59
- 0033729411
- POLYCOST: A telephone-speech database for speaker recognition
- J. Hennebert, H. Melin, D. Petrovska, and D. Genoud POLYCOST: a telephone-speech database for speaker recognition Speech Commun. 31 2000 265 270
- (2000) Speech Commun. , vol.31 , pp. 265-270
- Hennebert, J.¹ Melin, H.² Petrovska, D.³ Genoud, D.⁴

60
- 84878413073
- PLDA modeling in I-vector and supervector space for speaker verification
- Jiang, Y.; Lee, K.A.; Tang, Z.; Ma, B.; Larcher, A.; Li, H.; 2012. PLDA modeling in I-vector and supervector space for speaker verification. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 1680-1683.
- (2012) Annual Conference of the International Speech Communication Association (Interspeech) , pp. 1680-1683
- Jiang, Y.¹ Lee, K.A.² Tang, Z.³ Ma, B.⁴ Larcher, A.⁵ Li, H.⁶

61
- 85073110747
- Intra-speaker variability effects on speaker verification performance
- Kahn, J.; Audibert, N.; Rossato, S.; Bonastre, J.F.; 2010. Intra-speaker variability effects on speaker verification performance. In: Odyssey Speaker and Language Recognition Workshop, pp. 109-116.
- (2010) Odyssey Speaker and Language Recognition Workshop , pp. 109-116
- Kahn, J.¹ Audibert, N.² Rossato, S.³ Bonastre, J.F.⁴

62
- 84897430426
- Inter and intra-speaker variability in French: An analysis of oral vowels and its implication for automatic speaker verification
- Kahn, J.; Audibert, N.; Bonastre, J.F.; Rossato, S.; 2011. Inter and intra-speaker variability in French: an analysis of oral vowels and its implication for automatic speaker verification. In: International Congress of Phonetic Sciences (ICPhS), pp. 1002-1005.
- (2011) International Congress of Phonetic Sciences (ICPhS) , pp. 1002-1005
- Kahn, J.¹ Audibert, N.² Bonastre, J.F.³ Rossato, S.⁴

63
- 84865718184
- I-vector based speaker recognition on short utterances
- Kanagasundaram, A.; Vogt, R.; Dean, D.; Sridharan, S.; Mason, M.; 2011. I-vector based speaker recognition on short utterances. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 2341-2344.
- (2011) Annual Conference of the International Speech Communication Association (Interspeech) , pp. 2341-2344
- Kanagasundaram, A.¹ Vogt, R.² Dean, D.³ Sridharan, S.⁴ Mason, M.⁵

64
- 80052216683
- Graph relational features for speaker recognition and mining
- IEEE
- Z.N. Karam, W.M. Campbell, and N. Dehak Graph relational features for speaker recognition and mining Statistical Signal Processing Workshop (SSP) 2011 IEEE 525 528
- (2011) Statistical Signal Processing Workshop (SSP) , pp. 525-528
- Karam, Z.N.¹ Campbell, W.M.² Dehak, N.³

65
- 84889853754
- Within-speaker variability in the VeriVox database
- I. Karlsson Within-speaker variability in the VeriVox database Gothenburg Papers Theor. Ling. 1999 93 96
- (1999) Gothenburg Papers Theor. Ling. , pp. 93-96
- Karlsson, I.¹

66
- 0033748243
- Speaker verification with elicited speaking styles in the VeriVox project
- I. Karlsson, T. Banziger, J. Dankovicová, T. Johnstone, J. Lindberg, H. Melin, F. Nolan, and K. Scherer Speaker verification with elicited speaking styles in the VeriVox project Speech Commun. 31 2000 121 129
- (2000) Speech Commun. , vol.31 , pp. 121-129
- Karlsson, I.¹ Banziger, T.² Dankovicová, J.³ Johnstone, T.⁴ Lindberg, J.⁵ Melin, H.⁶ Nolan, F.⁷ Scherer, K.⁸

67
- 0141702109
- Improved speaker verification over the cellular phone network using phoneme-balanced and digit-sequence-preserving connected digit patterns
- Kato, T.; Shimizu, T.; 2003. Improved speaker verification over the cellular phone network using phoneme-balanced and digit-sequence-preserving connected digit patterns. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, pp. 57-60.
- (2003) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP , pp. 57-60
- Kato, T.¹ Shimizu, T.²

68
- 79958764380
- Performance comparison of 2-D DCT on full/block spectrogram and 1-D DCT on row mean of spectrogram for speaker identification
- H. Kekre, T. Sarode, S. Natu, and P. Natu Performance comparison of 2-D DCT on full/block spectrogram and 1-D DCT on row mean of spectrogram for speaker identification Int. J. Biometrics Bioinf. (IJBB) 4 2010 100
- (2010) Int. J. Biometrics Bioinf. (IJBB) , vol.4 , pp. 100
- Kekre, H.¹ Sarode, T.² Natu, S.³ Natu, P.⁴

69
- 79952919345
- Effects of long-term ageing on speaker verification
- Springer
- F. Kelly, and N. Harte Effects of long-term ageing on speaker verification Biometrics and Id Management 2011 Springer 113 124
- (2011) Int. J. Biometrics Bioinf. (IJBB) , pp. 113-124
- Kelly, F.¹ Harte, N.²

70
- 84866792733
- Speaker verification with long-term ageing data
- Kelly, F.; Drygajlo, A.; Harte, N.; 2012. Speaker verification with long-term ageing data. In: International Conference on Biometrics (ICB), pp. 478-483.
- (2012) International Conference on Biometrics (ICB) , pp. 478-483
- Kelly, F.¹ Drygajlo, A.² Harte, N.³

71
- 4544237515
- Disentangling speaker and channel effects in speaker verification
- Kenny, P.; Dumouchel, P.; 2004. Disentangling speaker and channel effects in speaker verification. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, pp. 37-40.
- (2004) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP , pp. 37-40
- Kenny, P.¹ Dumouchel, P.²

72
- 50249170027
- Joint factor analysis versus eigenchannels in speaker recognition
- P. Kenny, G. Boulianne, P. Ouellet, and P. Dumouchel Joint factor analysis versus eigenchannels in speaker recognition IEEE Trans. Audio Speech Lang. Process. 15 2007 1435 1447
- (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , pp. 1435-1447
- Kenny, P.¹ Boulianne, G.² Ouellet, P.³ Dumouchel, P.⁴

73
- 84890536185
- PLDA for Speaker Verification with Utterances of Arbitrary Duration
- Kenny, P.; Stafylakis, T.; Ouellet, P.; Alam, J.; Dumouchel, P.; 2013. PLDA for Speaker Verification with Utterances of Arbitrary Duration. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, pp. 7649-7653.
- (2013) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP , pp. 7649-7653
- Kenny, P.¹ Stafylakis, T.² Ouellet, P.³ Alam, J.⁴ Dumouchel, P.⁵

74
- 70350125882
- An overview of text-independent speaker recognition: From features to supervectors
- T. Kinnunen, and H. Li An overview of text-independent speaker recognition: from features to supervectors Speech Commun. 52 2010 12 40
- (2010) Speech Commun. , vol.52 , pp. 12-40
- Kinnunen, T.¹ Li, H.²

75
- 84865708702
- Reinforced temporal structure information for embedded utterance-based speaker recognition
- Larcher, A.; Bonastre, J.F.; Mason, J.S.D.; 2008. Reinforced temporal structure information for embedded utterance-based speaker recognition. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 371-374.
- (2008) Annual Conference of the International Speech Communication Association (Interspeech) , pp. 371-374
- Larcher, A.¹ Bonastre, J.F.² Mason, J.S.D.³

76
- 84867598363
- I-vectors in the context of phonetically-constrained short utterances for speaker verification
- Larcher, A.; Bousquet, P.M.; Lee, K.A.; Matrouf, D.; Li, H.; Bonastre, J.F.; 2012a. I-vectors in the context of phonetically-constrained short utterances for speaker verification. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, pp. 4773-4776.
- (2012) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP , pp. 4773-4776
- Larcher, A.¹ Bousquet, P.M.² Lee, K.A.³ Matrouf, D.⁴ Li, H.⁵ Bonastre, J.F.⁶

77
- 84878465724
- The RSR2015: Database for text-dependent speaker verification using multiple pass-phrases
- Larcher, A.; Lee, K.A.; Ma, B.; Li, H.; 2012b. The RSR2015: database for text-dependent speaker verification using multiple pass-phrases. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 1580-1583.
- (2012) Annual Conference of the International Speech Communication Association (Interspeech) , pp. 1580-1583
- Larcher, A.¹ Lee, K.A.² Ma, B.³ Li, H.⁴

78
- 84896111913
- ALIZE 3.0 - Open source toolkit for state-of-the-art speaker recognition
- Larcher, A.; Bonastre, J.F.; Fauve, B.; Lee, K.A.; Lévy, C.; Li, H.; Mason, J.S.; Parfait, J.Y.; 2013a. ALIZE 3.0 - open source toolkit for state-of-the-art speaker recognition. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 2768-2773.
- (2013) Annual Conference of the International Speech Communication Association (Interspeech) , pp. 2768-2773
- Larcher, A.¹ Bonastre, J.F.² Fauve, B.³ Lee, K.A.⁴ Lévy, C.⁵ Li, H.⁶ Mason, J.S.⁷ Parfait, J.Y.⁸

79
- 84884910090
- Reinforced temporal structure of acoustic models for speaker recognition
- A. Larcher, J.F. Bonastre, and J.S. Mason Reinforced temporal structure of acoustic models for speaker recognition Digital Signal Process. 23 2013 1910 1917
- (2013) Digital Signal Process. , vol.23 , pp. 1910-1917
- Larcher, A.¹ Bonastre, J.F.² Mason, J.S.³

80
- 84890536710
- Phonetically-constrained PLDA modeling for text-dependent speaker verification with multiple short utterances
- Larcher, A.; Lee, K.A.; Ma, B.; Li, H.; 2013c. Phonetically-constrained PLDA modeling for text-dependent speaker verification with multiple short utterances. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, pp. 7673-7677.
- (2013) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP , pp. 7673-7677
- Larcher, A.¹ Lee, K.A.² Ma, B.³ Li, H.⁴

81
- 70450211393
- Long term examination of intra-session and inter-session speaker variability
- Lawson, A.D.; Staufer, A.; Smolenski, B.; Pokines, B.; Leoanrd, M.; Cupples, E.; 2009. Long term examination of intra-session and inter-session speaker variability. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 2899-2902.
- (2009) Annual Conference of the International Speech Communication Association (Interspeech) , pp. 2899-2902
- Lawson, A.D.¹ Staufer, A.² Smolenski, B.³ Pokines, B.⁴ Leoanrd, M.⁵ Cupples, E.⁶

82
- 84865727618
- Joint application of speech and speaker recognition for automation and security in smart home
- Lee, K.A.; Larcher, A.; Thai, H.; Ma, B.; Li, H.; 2011. Joint application of speech and speaker recognition for automation and security in smart home. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 3317-3318.
- (2011) Annual Conference of the International Speech Communication Association (Interspeech) , pp. 3317-3318
- Lee, K.A.¹ Larcher, A.² Thai, H.³ Ma, B.⁴ Li, H.⁵

83
- 84897395377
- Multi-session PLDA scoring of I-vector for partially open-set speaker detection
- Lee, K.A.; Larcher, A.; You, C.H.; Ma, B.; Li, H.; 2013a. Multi-session PLDA scoring of I-vector for partially open-set speaker detection. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 3651-3655.
- (2013) Annual Conference of the International Speech Communication Association (Interspeech) , pp. 3651-3655
- Lee, K.A.¹ Larcher, A.² You, C.H.³ Ma, B.⁴ Li, H.⁵

84
- 84893339015
- Speaker verification makes its debut in smartphone
- Lee, K.A.; Ma, B.; Li, H.; 2013b. Speaker verification makes its debut in smartphone. In: SLTC Newsletter.
- (2013) SLTC Newsletter
- Lee, K.A.¹ Ma, B.² Li, H.³

85
- 84897405905
- The distribution of calibrated likelihood-ratios in speaker recognition
- van Leeuwen, D.A.; Brümmer, N.; 2013. The distribution of calibrated likelihood-ratios in speaker recognition. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 1619-1623.
- (2013) Annual Conference of the International Speech Communication Association (Interspeech) , pp. 1619-1623
- Van Leeuwen, D.A.¹ Brümmer, N.²

86
- 70450167547
- The role of age in factor analysis for speaker identification
- Lei, Y.; Hansen, J.H.; 2009. The role of age in factor analysis for speaker identification. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 2371-2374.
- (2009) Annual Conference of the International Speech Communication Association (Interspeech) , pp. 2371-2374
- Lei, Y.¹ Hansen, J.H.²

87
- 0036508040
- Robust endpoint detection and energy normalization for real-time speech and speaker recognition
- DOI 10.1109/TSA.2002.1001979, PII S106366760203972X
- Q. Li, J. Zheng, A. Tsai, and Q. Zhou Robust endpoint detection and energy normalization for real-time speech and speaker recognition IEEE Trans. Speech Audio Process. 10 2002 146 157 (Pubitemid 34692538)
- (2002) IEEE Transactions on Speech and Audio Processing , vol.10 , Issue.3 , pp. 146-157
- Li, Q.¹ Zheng, J.² Tsai, A.³ Zhou, Q.⁴

88
- 84876676725
- Spoken language recognition: From fundamentals to practice
- H. Li, B. Ma, and K.A. Lee Spoken language recognition: from fundamentals to practice Proc. IEEE 101 2013 1136 1159
- (2013) Proc. IEEE , vol.101 , pp. 1136-1159
- Li, H.¹ Ma, B.² Lee, K.A.³

89
- 42749107507
- Template compression and distance normalization for reliable text-dependent speaker verification
- IEEE
- J. Luan, J. Hao, T. Kakino, and T. Ikumi Template compression and distance normalization for reliable text-dependent speaker verification Odyssey Speaker and Language Recognition Workshop 2006 IEEE 1 4
- (2006) Odyssey Speaker and Language Recognition Workshop , pp. 1-4
- Luan, J.¹ Hao, J.² Kakino, T.³ Ikumi, T.⁴

90
- 84865721310
- Evaluation of i-vector speaker recognition systems for forensic application
- Mandasari, M.I.; McLaren, M.; van Leeuwen, D.; 2011. Evaluation of i-vector speaker recognition systems for forensic application. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 21-24.
- (2011) Annual Conference of the International Speech Communication Association (Interspeech) , pp. 21-24
- Mandasari, M.I.¹ McLaren, M.² Van Leeuwen, D.³

91
- 78650827060
- On the results of the first mobile biometry (MOBIO) face and speaker verification evaluation
- S. Marcel, C. McCool, P. Matejka, J. Cernocky, J. Kittler, O. Glembek, O. Plchot, Z. Jancik, A. Larcher, and C. Levy On the results of the first mobile biometry (MOBIO) face and speaker verification evaluation Lect. Notes Comput. Sci. 2010 2010 210 225
- (2010) Lect. Notes Comput. Sci. , vol.2010 , pp. 210-225
- Marcel, S.¹ McCool, C.² Matejka, P.³ Cernocky, J.⁴ Kittler, J.⁵ Glembek, O.⁶ Plchot, O.⁷ Jancik, Z.⁸ Larcher, A.⁹ Levy, C.¹⁰

92
- 70450183182
- NIST 2008 speaker recognition evaluation: Performance across telephone and room microphone channels
- Martin, A.F.; Greenberg, C.S.; 2009. NIST 2008 speaker recognition evaluation: performance across telephone and room microphone channels. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 2579-2582.
- (2009) Annual Conference of the International Speech Communication Association (Interspeech) , pp. 2579-2582
- Martin, A.F.¹ Greenberg, C.S.²

93
- 79959850251
- The NIST 2010 speaker recognition evaluation
- Martin, A.F.; Greenberg, C.S.; 2010. The NIST 2010 speaker recognition evaluation. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 2726-2729.
- (2010) Annual Conference of the International Speech Communication Association (Interspeech) , pp. 2726-2729
- Martin, A.F.¹ Greenberg, C.S.²

94
- 84865769863
- Language recognition in i-vectors space
- Martinez, D.; Plchot, O.; Burget, L.; Glembek, O.; Matejka, P.; 2011. Language recognition in i-vectors space. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 861-864.
- (2011) Annual Conference of the International Speech Communication Association (Interspeech) , pp. 861-864
- Martinez, D.¹ Plchot, O.² Burget, L.³ Glembek, O.⁴ Matejka, P.⁵

95
- 84878445298
- Project: DAVID (Digital Audio Visual Integrated Database)
- University of Wales Swansea
- Mason, J.S.; Deravi, F.; Chibelushi, C.C.; Gandon, S.; 1996. Project: DAVID (Digital Audio Visual Integrated Database). Technical Report. Department of Electrical and Electronic Engineering, University of Wales Swansea.
- (1996) Technical Report. Department of Electrical and Electronic Engineering
- Mason, J.S.¹ Deravi, F.² Chibelushi, C.C.³ Gandon, S.⁴

96
- 0027311607
- Concatenated phoneme models for text-variable speaker recognition
- Matsui, T.; Furui, S.; 1993. Concatenated phoneme models for text-variable speaker recognition. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, pp. 391-394.
- (1993) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP , pp. 391-394
- Matsui, T.¹ Furui, S.²

97
- 84897465710
- The multi-biometric, multi-device and multilingual (m3) corpus
- Meng, H.; Ching, P.; Lee, T.; Mak, M.W.; Mak, B.; Moon, Y.; Siu, X.; Tang, M.H.; Tang, X.; Hui, H.P.; Lee, A.; et al.; 2006. The multi-biometric, multi-device and multilingual (m3) corpus. In: International Workshop on Multimodal User Authentication, pp. 1-8.
- (2006) International Workshop on Multimodal User Authentication , pp. 1-8
- Meng, H.¹ Ching, P.² Lee, T.³ Mak, M.W.⁴ Mak, B.⁵ Moon, Y.⁶ Siu, X.⁷ Tang, M.H.⁸ Tang, X.⁹ Hui, H.P.¹⁰ Lee, A.¹¹

98
- 0001935972
- XM2VTSDB: The extended M2VTS database
- Messer, K.; Matas, J.; Kittler, J.; Luettin, J.; Maitre, G.; 1999. XM2VTSDB: the extended M2VTS database. In: International Conference of Audio and Video-Based Person Authentication, AVBPA, pp. 965-966.
- (1999) International Conference of Audio and Video-Based Person Authentication, AVBPA , pp. 965-966
- Messer, K.¹ Matas, J.² Kittler, J.³ Luettin, J.⁴ Maitre, G.⁵

99
- 0031623943
- Model adaptation methods for speaker verification
- IEEE
- W. Mistretta, and K. Farrell Model adaptation methods for speaker verification IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1998 IEEE 113 116
- (1998) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP , pp. 113-116
- Mistretta, W.¹ Farrell, K.²

100
- 4544375026
- Text-independent speaker recognition by combining speaker-specific GMM with speaker adapted syllable-based HMM
- Montreal (Canada)
- Nakagawa, S.; Wei, Z.; Takahashi, M.; 2004. Text-independent speaker recognition by combining speaker-specific GMM with speaker adapted syllable-based HMM. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, Montreal (Canada), pp. I-81.
- (2004) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP
- Nakagawa, S.¹ Wei, Z.² Takahashi, M.³

101
- 77953915852
- A segment selection technique for speaker verification
- M. Nosratighods, E. Ambikairajah, J. Epps, and M.J. Carey A segment selection technique for speaker verification Speech Commun. 52 2010 753 761
- (2010) Speech Commun. , vol.52 , pp. 753-761
- Nosratighods, M.¹ Ambikairajah, E.² Epps, J.³ Carey, M.J.⁴

102
- 0033738983
- AHUMADA: A large speech corpus in Spanish for speaker characterization and identification
- J. Ortega-Garcia, J. Gonzalez-Rodriguez, and V. Marrero-Aguiar AHUMADA: a large speech corpus in Spanish for speaker characterization and identification Speech Commun. 31 2000 255 264
- (2000) Speech Commun. , vol.31 , pp. 255-264
- Ortega-Garcia, J.¹ Gonzalez-Rodriguez, J.² Marrero-Aguiar, V.³

103
- 77951623770
- The multiscenario multienvironment biosecure multimodal database (bmdb)
- J. Ortega-Garcia, J. Fierrez, F. Alonso-Fernandez, J. Galbally, M.R. Freire, J. Gonzalez-Rodriguez, C. Garcia-Mateo, J.L. Alba-Castro, E. Gonzalez-Agulla, and E. Otero-Muras et al. The multiscenario multienvironment biosecure multimodal database (bmdb) IEEE Trans. Pattern Anal. Mach. Intell. 32 2010 1097 1111
- (2010) IEEE Trans. Pattern Anal. Mach. Intell. , vol.32 , pp. 1097-1111
- Ortega-Garcia, J.¹ Fierrez, J.² Alonso-Fernandez, F.³ Galbally, J.⁴ Freire, M.R.⁵ Gonzalez-Rodriguez, J.⁶ Garcia-Mateo, C.⁷ Alba-Castro, J.L.⁸ Gonzalez-Agulla, E.⁹ Otero-Muras, E.¹⁰

104
- 0006184263
- The M2VTS multimodal face database (release 1.00)
- Springer; Berlin, Heidelberg
- Pigeon, Stéphane, and Luc Vandendorpe. 1997. The M2VTS multimodal face database (release 1.00). Audio-and Video-Based Biometric Person Authentication. Springer; Berlin, Heidelberg.
- (1997) Audio-and Video-Based Biometric Person Authentication
- Pigeon, S.¹ Luc, V.²

105
- 82955196715
- Speaker diarization using PLDA-based speaker clustering
- Prazak, J.; Silovsky, J.; 2011. Speaker diarization using PLDA-based speaker clustering. In: International Conference on Intelligent Data Acquisition and Advanced Computing Systems, pp. 347-350.
- (2011) International Conference on Intelligent Data Acquisition and Advanced Computing Systems , pp. 347-350
- Prazak, J.¹ Silovsky, J.²

106
- 50649094277
- Probabilistic linear discriminant analysis for inferences about identity
- IEEE
- S.J. Prince, and J.H. Elder Probabilistic linear discriminant analysis for inferences about identity International Conference on Computer Vision 2007 IEEE 1 8
- (2007) International Conference on Computer Vision , pp. 1-8
- Prince, S.J.¹ Elder, J.H.²

107
- 42749099051
- NIST speaker recognition evaluation chronicles - Part 2
- Przybocki, M.A.; Martin, A.F.; Le, A.N.; 2006. NIST speaker recognition evaluation chronicles - Part 2. In: Odyssey Speaker and Language Recognition Workshop, pp. 1-6.
- (2006) Odyssey Speaker and Language Recognition Workshop , pp. 1-6
- Przybocki, M.A.¹ Martin, A.F.² Le, A.N.³

108
- 33947706790
- Text-dependent speaker-recognition using one-pass dynamic programming algorithm
- Ramasubramanian, V.; Das, A.; Kumar, V.; 2006. Text-dependent speaker-recognition using one-pass dynamic programming algorithm. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, p. 1.
- (2006) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP , pp. 1
- Ramasubramanian, V.¹ Das, A.² Kumar, V.³

109
- 0033884858
- Speaker verification using adapted Gaussian mixture models
- DOI 10.1006/dspr.1999.0361
- D.A. Reynolds, T.F. Quatieri, and R.B. Dunn Speaker verification using adapted gaussian mixture models Digital Signal Process. 10 2000 19 41 (Pubitemid 30592166)
- (2000) Digital Signal Processing: A Review Journal , vol.10 , Issue.1 , pp. 19-41
- Reynolds, D.A.¹ Quatieri, T.F.² Dunn, R.B.³

110
- 0026401050
- Connected word talker verification using whole word Hidden Markov models
- Rosenberg, A.E.; Lee, C.; Gokcen, S.; 1991. Connected word talker verification using whole word Hidden Markov models. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, pp. 381-384.
- (1991) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP , pp. 381-384
- Rosenberg, A.E.¹ Lee, C.² Gokcen, S.³

111
- 0033729692
- Small group speaker identification with common password phrases
- A.E. Rosenberg, O. Siohan, and S. Parthasarathy Small group speaker identification with common password phrases Speech Commun. 31 2000 131 140
- (2000) Speech Commun. , vol.31 , pp. 131-140
- Rosenberg, A.E.¹ Siohan, O.² Parthasarathy, S.³

112
- 0029748333
- Speaker identification via support vector classifiers
- Schmidt, M.; Gish, H.; 1996. Speaker identification via support vector classifiers. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, pp. 105-108.
- (1996) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP , pp. 105-108
- Schmidt, M.¹ Gish, H.²

113
- 84865783736
- Mixture of PLDA models in i-vector space for gender independent speaker recognition
- Senoussaoui, M.; Kenny, P.; Brummer, N.; de Villiers, E.; Dumouchel, P.; 2011. Mixture of PLDA models in i-vector space for gender independent speaker recognition. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 25-28.
- (2011) Annual Conference of the International Speech Communication Association (Interspeech) , pp. 25-28
- Senoussaoui, M.¹ Kenny, P.² Brummer, N.³ De Villiers, E.⁴ Dumouchel, P.⁵

114
- 84865779725
- PLDA-based clustering for speaker diarization of broadcast streams
- Silovsky, J.; Prazak, J.; Cerva, P.; Zdansky, J.; Nouza, J.; 2011. PLDA-based clustering for speaker diarization of broadcast streams. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 2909-2912.
- (2011) Annual Conference of the International Speech Communication Association (Interspeech) , pp. 2909-2912
- Silovsky, J.¹ Prazak, J.² Cerva, P.³ Zdansky, J.⁴ Nouza, J.⁵

115
- 84897378628
- Text-dependent speaker recognition using PLDA with uncertainty propagation
- Stafylakis, T.; Kenny, P.; Ouellet, P.; Perez, J.; Kockmann, M.; Dumouchel, P.; 2013. Text-dependent speaker recognition using PLDA with uncertainty propagation. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 3684-3688.
- (2013) Annual Conference of the International Speech Communication Association (Interspeech) , pp. 3684-3688
- Stafylakis, T.¹ Kenny, P.² Ouellet, P.³ Perez, J.⁴ Kockmann, M.⁵ Dumouchel, P.⁶

116
- 9444244480
- Development of user-state conventions for the multimodal corpus in SmartKom
- Las Palmas, Spain
- Steininger, S.; Rabold, S.; Dioubina, O.; Schiel, F.; 2002. Development of user-state conventions for the multimodal corpus in SmartKom. In: LREC Workshop on Multimodal Resources, Las Palmas, Spain.
- (2002) LREC Workshop on Multimodal Resources
- Steininger, S.¹ Rabold, S.² Dioubina, O.³ Schiel, F.⁴

117
- 84867626060
- Speaker Recognition with Region-Constrained MLLR Transforms
- Stolcke, A.; Mandal, A.; Shriberg, E.; 2012. Speaker Recognition with Region-Constrained MLLR Transforms. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, pp. 4397-4400.
- (2012) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP , pp. 4397-4400
- Stolcke, A.¹ Mandal, A.² Shriberg, E.³

118
- 17344377138
- Speaker verification using text-constrained Gaussian mixture models
- IEEE
- D. Sturim, D. Reynolds, R. Dunn, and T. Quatieri Speaker verification using text-constrained Gaussian mixture models IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 1999 2002 IEEE 677 680
- (2002) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 1999 , pp. 677-680
- Sturim, D.¹ Reynolds, D.² Dunn, R.³ Quatieri, T.⁴

119
- 34547525967
- A generative-discriminative framework using ensemble methods for text-dependent speaker verification
- Subramanya, A.; Zhang, Z.; Surendran, A.C.; Nguyen, P.; Narasimhan, M.; Acero, A.; 2007. A generative-discriminative framework using ensemble methods for text-dependent speaker verification. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, pp. 4-25.
- (2007) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP , pp. 4-25
- Subramanya, A.¹ Zhang, Z.² Surendran, A.C.³ Nguyen, P.⁴ Narasimhan, M.⁵ Acero, A.⁶

120
- 84897453885
- BioSec multimodal biometric database in text-dependent speaker recognition
- Toledano, D.T.; Hernandez-Lopez, D.; Esteve-Elizalde, C.; Fierrez, J.; Ortega-Garcia, J.; Ramos, D.; Gonzalez-Rodriguez, J.; 2008. BioSec multimodal biometric database in text-dependent speaker recognition. In: LREC.
- (2008) LREC
- Toledano, D.T.¹ Hernandez-Lopez, D.² Esteve-Elizalde, C.³ Fierrez, J.⁴ Ortega-Garcia, J.⁵ Ramos, D.⁶ Gonzalez-Rodriguez, J.⁷

121
- 84865788305
- Towards goat detection in text-dependent speaker verification
- Toledo-Ronen, O.; Aronowitz, H.; Hoory, R.; Pelecanos, J.; Nahamoo, D.; 2011. Towards goat detection in text-dependent speaker verification. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 9-12.
- (2011) Annual Conference of the International Speech Communication Association (Interspeech) , pp. 9-12
- Toledo-Ronen, O.¹ Aronowitz, H.² Hoory, R.³ Pelecanos, J.⁴ Nahamoo, D.⁵

122
- 34548248573
- Explicit modelling of session variability for speaker verification
- DOI 10.1016/j.csl.2007.05.003, PII S0885230807000277
- R. Vogt, and S. Sridharan Explicit modelling of session variability for speaker verification Comput. Speech Lang. 22 2008 17 38 (Pubitemid 47333032)
- (2008) Computer Speech and Language , vol.22 , Issue.1 , pp. 17-38
- Vogt, R.¹ Sridharan, S.²

123
- 85084016571
- Factor analysis modelling for speaker verification with short utterances
- IEEE
- R.J. Vogt, C.J. Lustri, and S. Sridharan Factor analysis modelling for speaker verification with short utterances Odyssey Speaker and Language Recognition Workshop 2008 IEEE 1 4
- (2008) Odyssey Speaker and Language Recognition Workshop , pp. 1-4
- Vogt, R.J.¹ Lustri, C.J.² Sridharan, S.³

124
- 70450169291
- Within-session variability modelling for factor analysis speaker verification
- Vogt, R.J.; Pelecanos, J.; Scheffer, N.; Kajarekar, S.; Sridharan, S.; 2009. Within-session variability modelling for factor analysis speaker verification. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 1563-1566.
- (2009) Annual Conference of the International Speech Communication Association (Interspeech) , pp. 1563-1566
- Vogt, R.J.¹ Pelecanos, J.² Scheffer, N.³ Kajarekar, S.⁴ Sridharan, S.⁵

125
- 42749099895
- An evaluation of commercial off-the-shelf speaker verification systems
- Wagner, M.; Summerfield, C.; Dunstone, T.; Summerfield, R.; Moss, J.; 2006. An evaluation of commercial off-the-shelf speaker verification systems. In: Odyssey Speaker and Language Recognition Workshop, pp. 1-8.
- (2006) Odyssey Speaker and Language Recognition Workshop , pp. 1-8
- Wagner, M.¹ Summerfield, C.² Dunstone, T.³ Summerfield, R.⁴ Moss, J.⁵

126
- 79960251538
- A new multi-purpose audio-visual UNMC-VIER database with multiple variabilities
- Y.W. Wong, S.I. Chang, K.P. Seng, L.M. Ang, S.W. Chin, W.J. Chew, and K.H. Lim A new multi-purpose audio-visual UNMC-VIER database with multiple variabilities Pattern Recognit. Lett. 32 2011 1503 1510
- (2011) Pattern Recognit. Lett. , vol.32 , pp. 1503-1510
- Wong, Y.W.¹ Chang, S.I.² Seng, K.P.³ Ang, L.M.⁴ Chin, S.W.⁵ Chew, W.J.⁶ Lim, K.H.⁷

127
- 0034427815
- Text-dependent speaker recognition using the fuzzy ARTMAP neural network
- TENCON, Kuala Lumpur (Malaysia)
- Woo, S.C.; Lim, C.P.; Osman, R.; 2000. Text-dependent speaker recognition using the fuzzy ARTMAP neural network. In: Proceedings of IEEE Region 10 International Conference on Electrical and Electronic Technology, TENCON, Kuala Lumpur (Malaysia).
- (2000) Proceedings of IEEE Region 10 International Conference on Electrical and Electronic Technology
- Woo, S.C.¹ Lim, C.P.² Osman, R.³

128
- 42749101361
- The MIT mobile device speaker verification corpus: Data collection and preliminary experiments
- Woo, R.H.; Park, A.; Hazen, T.J.; 2006. The MIT mobile device speaker verification corpus: data collection and preliminary experiments. In: Odyssey Speaker and Language Recognition Workshop.
- (2006) Odyssey Speaker and Language Recognition Workshop
- Woo, R.H.¹ Park, A.² Hazen, T.J.³

129
- 84884901384
- I-Tech, Vienna, Austria
- Wu, D.; BaojieLi, Jiang, H.; 2008. Speech recognition, technologies and applications - Normalization and transformation techniques for robust speaker recognition. I-Tech, Vienna, Austria.
- (2008) Speech Recognition, Technologies and Applications - Normalization and Transformation Techniques for Robust Speaker Recognition
- Wu, D.¹ Li, B.² Jiang, H.³

130
- 84865722206
- An i-vector based approach to acoustic sniffing for irrelevant variability normalization based acoustic model training and speech recognition
- Xu, J.; Zhang, Y.; Yan, Z.J.; Huo, Q.; 2011. An i-vector based approach to acoustic sniffing for irrelevant variability normalization based acoustic model training and speech recognition. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 1701-1704.
- (2011) Annual Conference of the International Speech Communication Association (Interspeech) , pp. 1701-1704
- Xu, J.¹ Zhang, Y.² Yan, Z.J.³ Huo, Q.⁴

131
- 22544440896
- Combining evidence from source, suprasegmental and spectral features for a fixed-text speaker verification system
- DOI 10.1109/TSA.2005.848892
- B. Yegnanarayana, S.M. Prasanna, J.M. Zachariah, and C.S. Gupta Combining evidence from source, suprasegmental and spectral features for a fixed-text speaker verification system IEEE Trans. Speech Audio Process. 13 2005 575 582 (Pubitemid 41013160)
- (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.4 , pp. 575-582
- Yegnanarayana, B.¹ Prasanna, S.R.M.² Zachariah, J.M.³ Gupta, C.S.⁴

132
- 0036722744
- Robust speaker verification with state duration modeling
- DOI 10.1016/S0167-6393(01)00044-9, PII S0167639301000449
- N.B. Yoma, and T.F. Pegoraro Robust speaker verification with state duration modeling Speech Commun. 38 2002 77 88 (Pubitemid 34867608)
- (2002) Speech Communication , vol.38 , Issue.1-2 , pp. 77-88
- Yoma, N.B.¹ Pegoraro, T.F.²

133
- 77955790894
- GMM-SVM kernel with a Bhattacharyya-based distance for speaker recognition
- C. You, K.A. Lee, and H. Li GMM-SVM kernel with a Bhattacharyya-based distance for speaker recognition IEEE Trans. Audio Speech Lang. Process. 18 2010 1300 1312
- (2010) IEEE Trans. Audio Speech Lang. Process. , vol.18 , pp. 1300-1312
- You, C.¹ Lee, K.A.² Li, H.³

134
- 85009234518
- The general use of tying in phoneme-based HMM speech recognisers
- Young, S.J.; 1992. The general use of tying in phoneme-based HMM speech recognisers. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, pp. 569-572.
- (1992) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP , pp. 569-572
- Young, S.J.¹

135
- 85075927145
- HMMs and related speech recognition technologies
- Springer Verlag
- S.J. Young HMMs and related speech recognition technologies Springer Handbook of Speech Processing 2008 Springer Verlag
- (2008) Springer Handbook of Speech Processing
- Young, S.J.¹

136
- 84897398363
- The voiceprint recognition activities over China
- Zheng, T.F.; 2005. The voiceprint recognition activities over China. In: Oriental COCOSDA.
- (2005) Oriental COCOSDA
- Zheng, T.F.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.