Volume 60, 2014, Pages 56-77

Text-dependent speaker verification: Classifiers, databases and RSR2015

Author keywords

Database; Speaker recognition; Text dependent

Indexed keywords

CLASSIFICATION (OF INFORMATION); DATABASE SYSTEMS; MOBILE DEVICES; NETWORK ARCHITECTURE;

EID: 84897385841     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2014.03.001     Document Type: Article
Times cited : (293)

References (136)
  • 1
    • 60349089517
    • Speaker-dependent characteristics of the nasals
    • K. Amino, and T. Arai Speaker-dependent characteristics of the nasals Forensic Sci. Int. 185 2009 21 28
    • (2009) Forensic Sci. Int. , vol.185 , pp. 21-28
    • Amino, K.1    Arai, T.2
  • 5
    • 33746432558
    • User-customized password speaker verification using multiple reference and background models
    • DOI 10.1016/j.specom.2005.08.008, PII S016763930600046X
    • M.F. BenZeghiba, and H. Bourlard User-customized password speaker verification using multiple reference and background models Speech Commun. 48 2006 1200 1213 (Pubitemid 44128618)
    • (2006) Speech Communication , vol.48 , Issue.9 , pp. 1200-1213
    • BenZeghiba, M.F.1    Bourlard, H.2
  • 16
    • 0031220765
    • Optimizing feature set for speaker verification
    • PII S0167865597000640
    • D. Charlet, and D. Jouvet Optimizing feature set for speaker verification Pattern Recognit. Lett. 18 1997 873 879 (Pubitemid 127411230)
    • (1997) Pattern Recognition Letters , vol.18 , Issue.9 , pp. 873-879
    • Charlet, D.1    Jouvet, D.2
  • 17
    • 0033738353
    • An alternative normalization scheme in HMM-based text-dependent speaker verification
    • D. Charlet, D. Jouvet, and O. Collin An alternative normalization scheme in HMM-based text-dependent speaker verification Speech Commun. 31 2000 113 120
    • (2000) Speech Commun. , vol.31 , pp. 113-120
    • Charlet, D.1    Jouvet, D.2    Collin, O.3
  • 20
    • 0030244499
    • A modified HME architecture for text-dependent speaker identification
    • PII S1045922796066167
    • K. Chen, D. Xie, and H. Chi A modified HME architecture for text-dependent speaker identification IEEE Trans. Neural Networks 7 1996 1309 1313 (Pubitemid 126776401)
    • (1996) IEEE Transactions on Neural Networks , vol.7 , Issue.5 , pp. 1309-1313
    • Chen, K.1    Xie, D.2    Chi, H.3
  • 26
    • 84897430194
    • Direct modeling of spoken passwords for text-dependent speaker recognition by compressed time-feature representations
    • Das, A.; Tapaswi, M.; 2010. Direct modeling of spoken passwords for text-dependent speaker recognition by compressed time-feature representations. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, pp. 4510-4513.
    • (2010) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP , pp. 4510-4513
    • Das, A.1    Tapaswi, M.2
  • 33
    • 85073255206
    • The effect of target/non-target age difference on speaker recognition performance
    • Doddington, G.; 2012. The effect of target/non-target age difference on speaker recognition performance. In: Odyssey Speaker and Language Recognition Workshop, pp. 1-5.
    • (2012) Odyssey Speaker and Language Recognition Workshop , pp. 1-5
    • Doddington, G.1
  • 34
    • 85084013548
    • Support vector machines based text dependent speaker verification using HMM supervectors
    • Dong, C.; Dong, Y.; Li, J.; Wang, H.; 2008. Support vector machines based text dependent speaker verification using HMM supervectors. In: Odyssey Speaker and Language Recognition Workshop, pp. 1-7.
    • (2008) Odyssey Speaker and Language Recognition Workshop , pp. 1-7
    • Dong, C.1    Dong, Y.2    Li, J.3    Wang, H.4
  • 36
    • 52049089205
    • Text dependent speaker identification based on spectrograms
    • Dutta, T.; 2007. Text dependent speaker identification based on spectrograms. In: Image and Vision Computing, pp. 238-243.
    • (2007) Image and Vision Computing , pp. 238-243
    • Dutta, T.1
  • 37
    • 52049109385
    • Dynamic time warping based approach to text-dependent speaker identification using spectrograms
    • Dutta, T.; 2008. Dynamic time warping based approach to text-dependent speaker identification using spectrograms. In: Congress on Image and Signal Processing, pp. 354-360.
    • (2008) Congress on Image and Signal Processing , pp. 354-360
    • Dutta, T.1
  • 43
    • 33845432328
    • Biosec baseline corpus: A multimodal biometric database
    • DOI 10.1016/j.patcog.2006.10.014, PII S0031320306004304
    • J. Fierrez, J. Ortega-Garcia, D. Torre Toledano, and J. Gonzalez-Rodriguez Biosec baseline corpus: a multimodal biometric database Pattern Recognit. 40 2007 1389 1392 (Pubitemid 44894487)
    • (2007) Pattern Recognition , vol.40 , Issue.4 , pp. 1389-1392
    • Fierrez, J.1    Ortega-Garcia, J.2    Torre Toledano, D.3    Gonzalez-Rodriguez, J.4
  • 45
    • 0029768209
    • Comparison of multilayer and radial basis function neural networks for text-dependent speaker recognition
    • IEEE
    • R. Finan, A. Sapeluk, and R. Damper Comparison of multilayer and radial basis function neural networks for text-dependent speaker recognition IEEE International Conference on Neural Networks 1996 IEEE 1992 1997
    • (1996) IEEE International Conference on Neural Networks , pp. 1992-1997
    • Finan, R.1    Sapeluk, A.2    Damper, R.3
  • 46
    • 0029354680
    • Discriminating observation probability (DOP) HMM for speaker verification
    • M. Forsyth Discriminating observation probability (DOP) HMM for speaker verification Speech Commun. 17 1995 117 129
    • (1995) Speech Commun. , vol.17 , pp. 117-129
    • Forsyth, M.1
  • 48
    • 0019555090
    • Cepstral analysis technique for automatic speaker verification
    • S. Furui Cepstral analysis technique for automatic speaker verification IEEE Trans. Acoust. Speech Signal Process. (see also IEEE Trans. Signal Process.) 29 1981 254 272 (Pubitemid 11495877)
    • (1981) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.ASSP-29 , Issue.2 , pp. 254-272
    • Furui, S.1
  • 49
    • 0019583902
    • Comparison of speaker recognition methods using statistical features and dynamic features
    • S. Furui Comparison of speaker recognition methods using statistical features and dynamic features IEEE Trans. Acoust. Speech Signal Process. 29 1981 342 350 (Pubitemid 11520516)
    • (1981) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.ASSP-29 , Issue.3 PART 1 , pp. 342-350
    • Furui, S.1
  • 55
    • 85024895429
    • Text-dependent speaker recognition
    • Springer-Verlag Heidelberg (Chapter)
    • M. Hébert Text-dependent speaker recognition Handbook of Speech Processing 2008 Springer-Verlag Heidelberg 743 762 (Chapter)
    • (2008) Handbook of Speech Processing , pp. 743-762
    • Hébert, M.1
  • 58
    • 85009131570
    • Integrating speaker and speech recognizers: Automatic identity claim capture for speaker verification
    • Heck, L.; Genoud, D.; 2001. Integrating speaker and speech recognizers: automatic identity claim capture for speaker verification. In: Odyssey Speaker and Language Recognition Workshop, pp. 249-254.
    • (2001) Odyssey Speaker and Language Recognition Workshop , pp. 249-254
    • Heck, L.1    Genoud, D.2
  • 59
    • 0033729411
    • POLYCOST: A telephone-speech database for speaker recognition
    • J. Hennebert, H. Melin, D. Petrovska, and D. Genoud POLYCOST: a telephone-speech database for speaker recognition Speech Commun. 31 2000 265 270
    • (2000) Speech Commun. , vol.31 , pp. 265-270
    • Hennebert, J.1    Melin, H.2    Petrovska, D.3    Genoud, D.4
  • 62
    • 84897430426
    • Inter and intra-speaker variability in French: An analysis of oral vowels and its implication for automatic speaker verification
    • Kahn, J.; Audibert, N.; Bonastre, J.F.; Rossato, S.; 2011. Inter and intra-speaker variability in French: an analysis of oral vowels and its implication for automatic speaker verification. In: International Congress of Phonetic Sciences (ICPhS), pp. 1002-1005.
    • (2011) International Congress of Phonetic Sciences (ICPhS) , pp. 1002-1005
    • Kahn, J.1    Audibert, N.2    Bonastre, J.F.3    Rossato, S.4
  • 65
    • 84889853754
    • Within-speaker variability in the VeriVox database
    • I. Karlsson Within-speaker variability in the VeriVox database Gothenburg Papers Theor. Ling. 1999 93 96
    • (1999) Gothenburg Papers Theor. Ling. , pp. 93-96
    • Karlsson, I.1
  • 67
    • 0141702109
    • Improved speaker verification over the cellular phone network using phoneme-balanced and digit-sequence-preserving connected digit patterns
    • Kato, T.; Shimizu, T.; 2003. Improved speaker verification over the cellular phone network using phoneme-balanced and digit-sequence-preserving connected digit patterns. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, pp. 57-60.
    • (2003) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP , pp. 57-60
    • Kato, T.1    Shimizu, T.2
  • 68
    • 79958764380
    • Performance comparison of 2-D DCT on full/block spectrogram and 1-D DCT on row mean of spectrogram for speaker identification
    • H. Kekre, T. Sarode, S. Natu, and P. Natu Performance comparison of 2-D DCT on full/block spectrogram and 1-D DCT on row mean of spectrogram for speaker identification Int. J. Biometrics Bioinf. (IJBB) 4 2010 100
    • (2010) Int. J. Biometrics Bioinf. (IJBB) , vol.4 , pp. 100
    • Kekre, H.1    Sarode, T.2    Natu, S.3    Natu, P.4
  • 69
    • 79952919345
    • Effects of long-term ageing on speaker verification
    • Springer
    • F. Kelly, and N. Harte Effects of long-term ageing on speaker verification Biometrics and Id Management 2011 Springer 113 124
    • (2011) Biometrics and ID Management , pp. 113-124
    • Kelly, F.1    Harte, N.2
  • 74
    • 70350125882
    • An overview of text-independent speaker recognition: From features to supervectors
    • T. Kinnunen, and H. Li An overview of text-independent speaker recognition: from features to supervectors Speech Commun. 52 2010 12 40
    • (2010) Speech Commun. , vol.52 , pp. 12-40
    • Kinnunen, T.1    Li, H.2
  • 79
    • 84884910090
    • Reinforced temporal structure of acoustic models for speaker recognition
    • A. Larcher, J.F. Bonastre, and J.S. Mason Reinforced temporal structure of acoustic models for speaker recognition Digital Signal Process. 23 2013 1910 1917
    • (2013) Digital Signal Process. , vol.23 , pp. 1910-1917
    • Larcher, A.1    Bonastre, J.F.2    Mason, J.S.3
  • 84
    • 84893339015
    • Speaker verification makes its debut in smartphone
    • Lee, K.A.; Ma, B.; Li, H.; 2013b. Speaker verification makes its debut in smartphone. In: SLTC Newsletter.
    • (2013) SLTC Newsletter
    • Lee, K.A.1    Ma, B.2    Li, H.3
  • 87
    • 0036508040
    • Robust endpoint detection and energy normalization for real-time speech and speaker recognition
    • DOI 10.1109/TSA.2002.1001979, PII S106366760203972X
    • Q. Li, J. Zheng, A. Tsai, and Q. Zhou Robust endpoint detection and energy normalization for real-time speech and speaker recognition IEEE Trans. Speech Audio Process. 10 2002 146 157 (Pubitemid 34692538)
    • (2002) IEEE Transactions on Speech and Audio Processing , vol.10 , Issue.3 , pp. 146-157
    • Li, Q.1    Zheng, J.2    Tsai, A.3    Zhou, Q.4
  • 88
    • 84876676725
    • Spoken language recognition: From fundamentals to practice
    • H. Li, B. Ma, and K.A. Lee Spoken language recognition: from fundamentals to practice Proc. IEEE 101 2013 1136 1159
    • (2013) Proc. IEEE , vol.101 , pp. 1136-1159
    • Li, H.1    Ma, B.2    Lee, K.A.3
  • 89
    • 42749107507
    • Template compression and distance normalization for reliable text-dependent speaker verification
    • IEEE
    • J. Luan, J. Hao, T. Kakino, and T. Ikumi Template compression and distance normalization for reliable text-dependent speaker verification Odyssey Speaker and Language Recognition Workshop 2006 IEEE 1 4
    • (2006) Odyssey Speaker and Language Recognition Workshop , pp. 1-4
    • Luan, J.1    Hao, J.2    Kakino, T.3    Ikumi, T.4
  • 102
    • 0033738983
    • AHUMADA: A large speech corpus in Spanish for speaker characterization and identification
    • J. Ortega-Garcia, J. Gonzalez-Rodriguez, and V. Marrero-Aguiar AHUMADA: a large speech corpus in Spanish for speaker characterization and identification Speech Commun. 31 2000 255 264
    • (2000) Speech Commun. , vol.31 , pp. 255-264
    • Ortega-Garcia, J.1    Gonzalez-Rodriguez, J.2    Marrero-Aguiar, V.3
  • 104
  • 106
  • 111
    • 0033729692
    • Small group speaker identification with common password phrases
    • A.E. Rosenberg, O. Siohan, and S. Parthasarathy Small group speaker identification with common password phrases Speech Commun. 31 2000 131 140
    • (2000) Speech Commun. , vol.31 , pp. 131-140
    • Rosenberg, A.E.1    Siohan, O.2    Parthasarathy, S.3
  • 122
    • 34548248573
    • Explicit modelling of session variability for speaker verification
    • DOI 10.1016/j.csl.2007.05.003, PII S0885230807000277
    • R. Vogt, and S. Sridharan Explicit modelling of session variability for speaker verification Comput. Speech Lang. 22 2008 17 38 (Pubitemid 47333032)
    • (2008) Computer Speech and Language , vol.22 , Issue.1 , pp. 17-38
    • Vogt, R.1    Sridharan, S.2
  • 128
  • 130
    • 84865722206
    • An i-vector based approach to acoustic sniffing for irrelevant variability normalization based acoustic model training and speech recognition
    • Xu, J.; Zhang, Y.; Yan, Z.J.; Huo, Q.; 2011. An i-vector based approach to acoustic sniffing for irrelevant variability normalization based acoustic model training and speech recognition. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 1701-1704.
    • (2011) Annual Conference of the International Speech Communication Association (Interspeech) , pp. 1701-1704
    • Xu, J.1    Zhang, Y.2    Yan, Z.J.3    Huo, Q.4
  • 131
    • 22544440896
    • Combining evidence from source, suprasegmental and spectral features for a fixed-text speaker verification system
    • DOI 10.1109/TSA.2005.848892
    • B. Yegnanarayana, S.R.M. Prasanna, J.M. Zachariah, and C.S. Gupta Combining evidence from source, suprasegmental and spectral features for a fixed-text speaker verification system IEEE Trans. Speech Audio Process. 13 2005 575 582 (Pubitemid 41013160)
    • (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.4 , pp. 575-582
    • Yegnanarayana, B.1    Prasanna, S.R.M.2    Zachariah, J.M.3    Gupta, C.S.4
  • 132
    • 0036722744
    • Robust speaker verification with state duration modeling
    • DOI 10.1016/S0167-6393(01)00044-9, PII S0167639301000449
    • N.B. Yoma, and T.F. Pegoraro Robust speaker verification with state duration modeling Speech Commun. 38 2002 77 88 (Pubitemid 34867608)
    • (2002) Speech Communication , vol.38 , Issue.1-2 , pp. 77-88
    • Yoma, N.B.1    Pegoraro, T.F.2
  • 133
    • 77955790894
    • GMM-SVM kernel with a Bhattacharyya-based distance for speaker recognition
    • C. You, K.A. Lee, and H. Li GMM-SVM kernel with a Bhattacharyya-based distance for speaker recognition IEEE Trans. Audio Speech Lang. Process. 18 2010 1300 1312
    • (2010) IEEE Trans. Audio Speech Lang. Process. , vol.18 , pp. 1300-1312
    • You, C.1    Lee, K.A.2    Li, H.3
  • 135
    • 85075927145
    • HMMs and related speech recognition technologies
    • Springer Verlag
    • S.J. Young HMMs and related speech recognition technologies Springer Handbook of Speech Processing 2008 Springer Verlag
    • (2008) Springer Handbook of Speech Processing
    • Young, S.J.1
  • 136
    • 84897398363
    • The voiceprint recognition activities over China
    • Zheng, T.F.; 2005. The voiceprint recognition activities over China. In: Oriental COCOSDA.
    • (2005) Oriental COCOSDA
    • Zheng, T.F.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.