



Volume 66, 2015, Pages 130-153

Spoofing and countermeasures for speaker verification: A survey

Author keywords

Anti Spoofing; Automatic speaker verification; Countermeasure; Security; Spoofing attack

Indexed keywords

BIOMETRICS; SPEECH PROCESSING; SPEECH SYNTHESIS; SURVEYS;

EID: 84919922238     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2014.10.005     Document Type: Review
Times cited : (659)

References (163)
  • 7
    • 84906244272
    • A new speaker verification spoofing countermeasure based on local binary patterns
    • Alegre, F.; Vipperla, R.; Amehraye, A.; Evans, N.; 2013c. A new speaker verification spoofing countermeasure based on local binary patterns. In: Proc. Interspeech.
    • (2013) Proc. Interspeech
    • Alegre, F.1    Vipperla, R.2    Amehraye, A.3    Evans, N.4
  • 9
    • 84878412793
    • Spoofing countermeasures for the protection of automatic speaker recognition systems against attacks with artificial signals
    • Alegre, F.; Vipperla, R.; Evans, N.; et al.; 2012b. Spoofing countermeasures for the protection of automatic speaker recognition systems against attacks with artificial signals. In: Proc. Interspeech.
    • (2012) Proc. Interspeech
    • Alegre, F.1    Vipperla, R.2    Evans, N.3
  • 10
    • 84995407786
    • Detecting voice disguise from speech variability: Analysis of three glottal and vocal tract measures
    • 4068-4068
    • T.B. Amin, J.S. German, and P. Marziliano Detecting voice disguise from speech variability: analysis of three glottal and vocal tract measures J. Acoust. Soc. Am. 134 2013 4068-4068
    • (2013) J. Acoust. Soc. Am. , vol.134
    • Amin, T.B.1    German, J.S.2    Marziliano, P.3
  • 11
    • 84896907805
    • Glottal and vocal tract characteristics of voice impersonators
    • T.B. Amin, P. Marziliano, and J.S. German Glottal and vocal tract characteristics of voice impersonators IEEE Trans. Multimedia 16 2014 668 678
    • (2014) IEEE Trans. Multimedia , vol.16 , pp. 668-678
    • Amin, T.B.1    Marziliano, P.2    German, J.S.3
  • 15
    • 44949232373
    • CLUSTERGEN: A statistical parametric synthesizer using trajectory modeling
    • Black, A.W.; 2006. CLUSTERGEN: A statistical parametric synthesizer using trajectory modeling. In: Proc. Interspeech.
    • (2006) Proc. Interspeech
    • Black, A.W.1
  • 16
    • 84890478111
    • Speaker verification scores and acoustic analysis of a professional impersonator
    • Blomberg, M.; Elenius, D.; Zetterholm, E.; 2004. Speaker verification scores and acoustic analysis of a professional impersonator. In: Proc. FONETIK.
    • (2004) Proc. FONETIK
    • Blomberg, M.1    Elenius, D.2    Zetterholm, E.3
  • 17
    • 84974570703
    • Praat: Doing phonetics by computer
    • retrieved 12 February 2014
    • Boersma, P.; Weenink, D.; 2014. Praat: doing phonetics by computer. Computer program. Version 5.3.64, retrieved 12 February 2014 from < http://www.praat.org/ >.
    • (2014) Computer Program. Version 5.3.64
    • Boersma, P.1    Weenink, D.2
  • 18
    • 65349113532
    • Artificial impostor voice transformation effects on false acceptance rates
    • Bonastre, J.F.; Matrouf, D.; Fredouille, C.; 2007. Artificial impostor voice transformation effects on false acceptance rates. In: Proc. Interspeech.
    • (2007) Proc. Interspeech
    • Bonastre, J.F.1    Matrouf, D.2    Fredouille, C.3
  • 23
    • 33645887246
    • Support vector machines using GMM supervectors for speaker verification
    • W.M. Campbell, D.E. Sturim, and D.A. Reynolds Support vector machines using GMM supervectors for speaker verification IEEE Signal Process. Lett. 13 2006 308 311
    • (2006) IEEE Signal Process. Lett. , vol.13 , pp. 308-311
    • Campbell, W.M.1    Sturim, D.E.2    Reynolds, D.A.3
  • 24
    • 0031233424
    • Speaker recognition: A tutorial
    • J.P. Campbell Jr. Speaker recognition: a tutorial Proc. IEEE 85 1997 1437 1462
    • (1997) Proc. IEEE , vol.85 , pp. 1437-1462
    • Campbell Jr., J.P.1
  • 25
    • 84906225084
    • Joint spectral distribution modeling using restricted Boltzmann machines for voice conversion
    • Chen, L.H.; Ling, Z.H.; Song, Y.; Dai, L.R.; 2013. Joint spectral distribution modeling using restricted Boltzmann machines for voice conversion. In: Proc. Interspeech.
    • (2013) Proc. Interspeech
    • Chen, L.H.1    Ling, Z.H.2    Song, Y.3    Dai, L.R.4
  • 35
    • 84878402831
    • Synthetic speech discrimination using pitch pattern statistics derived from image analysis
    • De Leon, P.L.; Stewart, B.; Yamagishi, J.; 2012b. Synthetic speech discrimination using pitch pattern statistics derived from image analysis. In: Proc. Interspeech.
    • (2012) Proc. Interspeech
    • De Leon, P.L.1    Stewart, B.2    Yamagishi, J.3
  • 36
    • 64249101047
    • Modeling prosodic features with joint factor analysis for speaker verification
    • N. Dehak, P. Dumouchel, and P. Kenny Modeling prosodic features with joint factor analysis for speaker verification IEEE Trans. Audio Speech Language Process. 15 2007 2095 2103
    • (2007) IEEE Trans. Audio Speech Language Process. , vol.15 , pp. 2095-2103
    • Dehak, N.1    Dumouchel, P.2    Kenny, P.3
  • 43
    • 0015069185
    • Voice spectrograms as a function of age, voice disguise, and voice imitation
    • W. Endres, W. Bambach, and G. Flösser Voice spectrograms as a function of age, voice disguise, and voice imitation J. Acoust. Soc. Am. 49 1971 1842 1848
    • (1971) J. Acoust. Soc. Am. , vol.49 , pp. 1842-1848
    • Endres, W.1    Bambach, W.2    Flösser, G.3
  • 46
    • 84872177757
    • Parametric voice conversion based on bilinear frequency warping plus amplitude scaling
    • D. Erro, E. Navas, and I. Hernaez Parametric voice conversion based on bilinear frequency warping plus amplitude scaling IEEE Trans. Audio Speech Language Process. 21 2013 556 566
    • (2013) IEEE Trans. Audio Speech Language Process. , vol.21 , pp. 556-566
    • Erro, D.1    Navas, E.2    Hernaez, I.3
  • 48
    • 84906263293
    • Spoofing and countermeasures for automatic speaker verification
    • Evans, N.; Kinnunen, T.; Yamagishi, J.; 2013. Spoofing and countermeasures for automatic speaker verification. In: Proc. Interspeech.
    • (2013) Proc. Interspeech
    • Evans, N.1    Kinnunen, T.2    Yamagishi, J.3
  • 51
    • 77956826012
    • Automatic speaker recognition as a measurement of voice imitation and conversion
    • M. Farrús, M. Wagner, D. Erro, and J. Hernando Automatic speaker recognition as a measurement of voice imitation and conversion Int. J. Speech Language Law 17 2010 119 142
    • (2010) Int. J. Speech Language Law , vol.17 , pp. 119-142
    • Farrús, M.1    Wagner, M.2    Erro, D.3    Hernando, J.4
  • 53
    • 33751542948
    • Speaker verification security improvement by means of speech watermarking
    • M. Faundez-Zanuy, M. Hagmüller, and G. Kubin Speaker verification security improvement by means of speech watermarking Speech Commun. 48 2006 1608 1619
    • (2006) Speech Commun. , vol.48 , pp. 1608-1619
    • Faundez-Zanuy, M.1    Hagmüller, M.2    Kubin, G.3
  • 56
  • 58
    • 84865733857
    • Analysis of i-vector length normalization in speaker recognition systems
    • Garcia-Romero, D.; Espy-Wilson, C.Y.; 2011. Analysis of i-vector length normalization in speaker recognition systems. In: Proc. Interspeech.
    • (2011) Proc. Interspeech
    • Garcia-Romero, D.1    Espy-Wilson, C.Y.2
  • 59
    • 0028419019
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • J.L. Gauvain, and C.H. Lee Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains IEEE Trans. Speech Audio Process. 2 1994 291 298
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , pp. 291-298
    • Gauvain, J.L.1    Lee, C.H.2
  • 60
    • 29844455876
    • Pitch extraction and fundamental frequency: History and current techniques
    • Department of Computer Science, University of Regina
    • Gerhard, D.; 2003. Pitch extraction and fundamental frequency: History and current techniques. Technical Report TR-CS 2003-06, Department of Computer Science, University of Regina.
    • (2003) Technical Report TR-CS 2003-06
    • Gerhard, D.1
  • 62
    • 84857498745
    • Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora
    • E. Godoy, O. Rosec, and T. Chonavel Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora IEEE Trans. Audio Speech Language Process. 20 2012 1313 1323
    • (2012) IEEE Trans. Audio Speech Language Process. , vol.20 , pp. 1313-1323
    • Godoy, E.1    Rosec, O.2    Chonavel, T.3
  • 75
    • 33947637189
    • Joint factor analysis of speaker and session variability: Theory and algorithms
    • Kenny, P.; 2006. Joint factor analysis of speaker and session variability: theory and algorithms. Technical report CRIM-06/08-14.
    • (2006) Technical Report CRIM-06/08-14
    • Kenny, P.1
  • 78
    • 70350125882
    • An overview of text-independent speaker recognition: From features to supervectors
    • T. Kinnunen, and H. Li An overview of text-independent speaker recognition: from features to supervectors Speech Commun. 52 2010 12 40
    • (2010) Speech Commun. , vol.52 , pp. 12-40
    • Kinnunen, T.1    Li, H.2
  • 80
    • 84867211827
    • Acoustic analysis of imitated voice produced by a professional impersonator
    • Kitamura, T.; 2008. Acoustic analysis of imitated voice produced by a professional impersonator. In: Proc. Interspeech.
    • (2008) Proc. Interspeech
    • Kitamura, T.1
  • 81
    • 0018986665
    • Software for a cascade/parallel formant synthesizer
    • D.H. Klatt Software for a cascade/parallel formant synthesizer J. Acoust. Soc. Am. 67 1980 971 995
    • (1980) J. Acoust. Soc. Am. , vol.67 , pp. 971-995
    • Klatt, D.H.1
  • 83
    • 84906234851
    • Voice transformation-based spoofing of text-dependent speaker verification systems
    • Kons, Z.; Aronowitz, H.; 2013. Voice transformation-based spoofing of text-dependent speaker verification systems. In: Proc. Interspeech.
    • (2013) Proc. Interspeech
    • Kons, Z.1    Aronowitz, H.2
  • 85
    • 84878465724
    • RSR2015: Database for text-dependent speaker verification using multiple pass-phrases
    • Larcher, A.; Lee, K.A.; Ma, B.; Li, H.; 2012. RSR2015: database for text-dependent speaker verification using multiple pass-phrases. In: Proc. Interspeech.
    • (2012) Proc. Interspeech
    • Larcher, A.1    Lee, K.A.2    Ma, B.3    Li, H.4
  • 87
    • 84897385841
    • Text-dependent speaker verification: Classifiers, databases and RSR2015
    • A. Larcher, K.A. Lee, B. Ma, and H. Li Text-dependent speaker verification: classifiers, databases and RSR2015 Speech Commun. 60 2014 56 77
    • (2014) Speech Commun. , vol.60 , pp. 56-77
    • Larcher, A.1    Lee, K.A.2    Ma, B.3    Li, H.4
  • 91
    • 0029288633
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C.J. Leggetter, and P.C. Woodland Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models Comput. Speech Language 9 1995 171 185
    • (1995) Comput. Speech Language , vol.9 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 93
    • 85032751399
    • Techware: Speaker and spoken language recognition resources [best of the web]
    • H. Li, and B. Ma Techware: speaker and spoken language recognition resources [best of the web] IEEE Signal Process. Mag. 27 2010 139 142
    • (2010) IEEE Signal Process. Mag. , vol.27 , pp. 139-142
    • Li, H.1    Ma, B.2
  • 94
    • 84876676725
    • Spoken language recognition: From fundamentals to practice
    • H. Li, B. Ma, and K.A. Lee Spoken language recognition: from fundamentals to practice Proc. IEEE 101 2013 1136 1159
    • (2013) Proc. IEEE , vol.101 , pp. 1136-1159
    • Li, H.1    Ma, B.2    Lee, K.A.3
  • 97
    • 84901237776
    • Modeling spectral envelopes using restricted Boltzmann machines and deep belief networks for statistical parametric speech synthesis
    • Z.H. Ling, L. Deng, and D. Yu Modeling spectral envelopes using restricted Boltzmann machines and deep belief networks for statistical parametric speech synthesis IEEE Trans. Audio Speech Language Process. 21 2013 2129 2139
    • (2013) IEEE Trans. Audio Speech Language Process. , vol.21 , pp. 2129-2139
    • Ling, Z.H.1    Deng, L.2    Yu, D.3
  • 101
    • 84929157442
    • Combining a vector space representation of linguistic context with a deep neural network for text-to-speech synthesis
    • Lu, H.; King, S.; Watts, O.; 2013. Combining a vector space representation of linguistic context with a deep neural network for text-to-speech synthesis. In: Proc. the 8th ISCA Speech Synthesis Workshop.
    • (2013) Proc. The 8th ISCA Speech Synthesis Workshop
    • Lu, H.1    King, S.2    Watts, O.3
  • 102
    • 84919943783
    • Spoofing and anti-spoofing in biometrics: Lessons learned from the tabula rasa project
    • Retrieved 26 February 2014
    • Marcel, S.; 2013. Spoofing and anti-spoofing in biometrics: Lessons learned from the tabula rasa project. Tutorial. Retrieved 26 February 2014 from < http://www.idiap.ch/marcel/professional/BTAS-2013.html >.
    • (2013) Tutorial
    • Marcel, S.1
  • 106
    • 1942512336
    • Imposture using synthetic speech against speaker verification based on spectrum and pitch
    • Masuko, T.; Tokuda, K.; Kobayashi, T.; 2000. Imposture using synthetic speech against speaker verification based on spectrum and pitch. In: Proc. Interspeech.
    • (2000) Proc. Interspeech
    • Masuko, T.1    Tokuda, K.2    Kobayashi, T.3
  • 110
    • 0029355724
    • Likelihood normalization for speaker verification using a phoneme- and speaker-independent model
    • T. Matsui, and S. Furui Likelihood normalization for speaker verification using a phoneme- and speaker-independent model Speech Commun. 17 1995 109 116
    • (1995) Speech Commun. , vol.17 , pp. 109-116
    • Matsui, T.1    Furui, S.2
  • 111
    • 0025543906
    • Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
    • E. Moulines, and F. Charpentier Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones Speech Commun. 9 1990 453 467
    • (1990) Speech Commun. , vol.9 , pp. 453-467
    • Moulines, E.1    Charpentier, F.2
  • 113
    • 84919943779
    • Nuance
    • Nuance, 2013. Nuance vocalpassword. In: < http://www.nuance.com/landing-pages/products/voicebiometrics/vocalpassword.asp >.
    • (2013) Nuance Vocalpassword
  • 114
    • 27544482501
    • Discrimination method of synthetic speech using pitch frequency against synthetic speech falsification
    • A. Ogihara, H. Unno, and A. Shiozaki Discrimination method of synthetic speech using pitch frequency against synthetic speech falsification IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 88 2005 280 286
    • (2005) IEICE Trans. Fundam. Electron. Commun. Comput. Sci. , vol.88 , pp. 280-286
    • Ogihara, A.1    Unno, H.2    Shiozaki, A.3
  • 115
    • 84919943778
    • Finding impostors in the crowd: The use of crowdsourcing to attack biometric systems
    • Bell Labs India
    • Panjwani, S.; Prakash, A.; 2014. Finding impostors in the crowd: the use of crowdsourcing to attack biometric systems. Unpublished manuscript, Bell Labs India.
    • (2014) Unpublished Manuscript
    • Panjwani, S.1    Prakash, A.2
  • 121
    • 85008039410
    • Improved prosody generation by maximizing joint probability of state and longer units
    • Y. Qian, Z. Wu, B. Gao, and F.K. Soong Improved prosody generation by maximizing joint probability of state and longer units IEEE Trans. Audio Speech Language Process. 19 2011 1702 1710
    • (2011) IEEE Trans. Audio Speech Language Process. , vol.19 , pp. 1702-1710
    • Qian, Y.1    Wu, Z.2    Gao, B.3    Soong, F.K.4
  • 123
    • 0034809453
    • Enhancing security and privacy in biometrics-based authentication systems
    • N.K. Ratha, J.H. Connell, and R.M. Bolle Enhancing security and privacy in biometrics-based authentication systems IBM Syst. J. 40 2001 614 634
    • (2001) IBM Syst. J. , vol.40 , pp. 614-634
    • Ratha, N.K.1    Connell, J.H.2    Bolle, R.M.3
  • 125
    • 0033884858
    • Speaker verification using adapted Gaussian mixture models
    • D. Reynolds, T. Quatieri, and R. Dunn Speaker verification using adapted Gaussian mixture models Digital Signal Process. 10 2000 19 41
    • (2000) Digital Signal Process. , vol.10 , pp. 19-41
    • Reynolds, D.1    Quatieri, T.2    Dunn, R.3
  • 126
    • 0029209272
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • D. Reynolds, and R. Rose Robust text-independent speaker identification using Gaussian mixture speaker models IEEE Trans. Speech Audio Process. 3 1995 72 83
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , pp. 72-83
    • Reynolds, D.1    Rose, R.2
  • 128
    • 67349227385
    • Robustness of multimodal biometric fusion methods against spoof attacks
    • R.N. Rodrigues, L.L. Ling, and V. Govindaraju Robustness of multimodal biometric fusion methods against spoof attacks J. Visual Languages Comput. 20 2009 169 179
    • (2009) J. Visual Languages Comput. , vol.20 , pp. 169-179
    • Rodrigues, R.N.1    Ling, L.L.2    Govindaraju, V.3
  • 129
    • 84898068800
    • I4U submission to NIST SRE 2012: A large-scale collaborative effort for noise-robust speaker verification
    • Saeidi, R.; et al.; 2013. I4U submission to NIST SRE 2012: a large-scale collaborative effort for noise-robust speaker verification. In: Proc. Interspeech.
    • (2013) Proc. Interspeech
    • Saeidi, R.1
  • 141
    • 57749193836
    • Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
    • T. Toda, A.W. Black, and K. Tokuda Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory IEEE Trans. Audio Speech Language Process. 15 2007 2222 2235
    • (2007) IEEE Trans. Audio Speech Language Process. , vol.15 , pp. 2222-2235
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 143
    • 79958818321
    • An overview of speaker identification: Accuracy and robustness issues
    • R. Togneri, and D. Pullella An overview of speaker identification: accuracy and robustness issues IEEE Circ. Syst. Mag. 11 2011 23 61
    • (2011) IEEE Circ. Syst. Mag. , vol.11 , pp. 23-61
    • Togneri, R.1    Pullella, D.2
  • 144
    • 38549096029
    • A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
    • T. Toda, and K. Tokuda A speech parameter generation algorithm considering global variance for HMM-based speech synthesis IEICE Trans. Inform. Syst. 90 2007 816 824
    • (2007) IEICE Trans. Inform. Syst. , vol.90 , pp. 816-824
    • Toda, T.1    Tokuda, K.2
  • 145
    • 84867605072
    • Speaker verification performance degradation against spoofing and tampering attacks
    • Villalba, J.; Lleida, E.; 2010. Speaker verification performance degradation against spoofing and tampering attacks. In: FALA 10 workshop, pp. 131-134.
    • (2010) FALA 10 Workshop , pp. 131-134
    • Villalba, J.1    Lleida, E.2
  • 146
    • 79952940570
    • Detecting replay attacks from far-field recordings on speaker verification systems
    • C. Vielhauer, J. Dittmann, A. Drygajlo, N. Juul, M. Fairhurst (Eds.), Lecture Notes in Computer Science, Springer
    • J. Villalba, and E. Lleida Detecting replay attacks from far-field recordings on speaker verification systems C. Vielhauer, J. Dittmann, A. Drygajlo, N. Juul, M. Fairhurst (Eds.), Biometrics and ID Management, Lecture Notes in Computer Science 2011 Springer 274 285
    • (2011) Biometrics and ID Management, Lecture Notes in Computer Science , pp. 274-285
    • Villalba, J.1    Lleida, E.2
  • 150
  • 151
    • 84878410960
    • Detecting converted speech and natural speech for anti-spoofing attack in speaker recognition
    • Wu, Z.; Chng, E.S.; Li, H.; 2012a. Detecting converted speech and natural speech for anti-spoofing attack in speaker recognition. In: Proc. Interspeech 2012.
    • (2012) Proc. Interspeech 2012
    • Wu, Z.1    Chng, E.S.2    Li, H.3
  • 154
    • 84906276055
    • Exemplar-based unit selection for voice conversion utilizing temporal information
    • Wu, Z.; Virtanen, T.; Kinnunen, T.; Chng, E.S.; Li, H.; 2013a. Exemplar-based unit selection for voice conversion utilizing temporal information. In: Proc. Interspeech.
    • (2013) Proc. Interspeech
    • Wu, Z.1    Virtanen, T.2    Kinnunen, T.3    Chng, E.S.4    Li, H.5
  • 156
    • 79959842826
    • Text-independent F0 transformation with non-parallel data for voice conversion
    • Wu, Z.Z.; Kinnunen, T.; Chng, E.S.; Li, H.; 2010. Text-independent F0 transformation with non-parallel data for voice conversion. In: Proc. Interspeech.
    • (2010) Proc. Interspeech
    • Wu, Z.Z.1    Kinnunen, T.2    Chng, E.S.3    Li, H.4
  • 158
    • 67650854725
    • Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm
    • J. Yamagishi, T. Kobayashi, Y. Nakano, K. Ogata, and J. Isogai Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm IEEE Trans. Audio Speech Language Process. 17 2009 66 83
    • (2009) IEEE Trans. Audio Speech Language Process. , vol.17 , pp. 66-83
    • Yamagishi, J.1    Kobayashi, T.2    Nakano, Y.3    Ogata, K.4    Isogai, J.5
  • 161
    • 33846405723
    • Details of the Nitech HMM-based speech synthesis system for the Blizzard Challenge 2005
    • H. Zen, T. Toda, M. Nakamura, and K. Tokuda Details of the Nitech HMM-based speech synthesis system for the Blizzard Challenge 2005 IEICE Trans. Inform. Syst. 2007 325 333
    • (2007) IEICE Trans. Inform. Syst. , pp. 325-333
    • Zen, H.1    Toda, T.2    Nakamura, M.3    Tokuda, K.4
  • 162
    • 67651002140
    • Statistical parametric speech synthesis
    • H. Zen, K. Tokuda, and A.W. Black Statistical parametric speech synthesis Speech Commun. 51 2009 1039 1064
    • (2009) Speech Commun. , vol.51 , pp. 1039-1064
    • Zen, H.1    Tokuda, K.2    Black, A.W.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.