SCOPUS 정보 검색 플랫폼

IEEE/ACM Transactions on Audio Speech and Language Processing

Volumn 24, Issue 4, 2016, Pages 768-783

Anti-spoofing for text-independent speaker verification: An initial database, comparison of countermeasures, and human performance

(11) Wu, Zhizheng a De Leon, Phillip L b Demiroglu, Cenk c Khodabakhsh, Ali c King, Simon a Ling, Zhen Hua d Saito, Daisuke e Stewart, Bryan b Toda, Tomoki f Wester, Mirjam a Yamagish, Junichi a

a UNIVERSITY OF EDINBURGH (United Kingdom)

b Horeshoe Cir Jett Hall (United States)

c OZYEGIN UNIVERSITY (Turkey)

d UNIVERSITY OF SCIENCE AND TECHNOLOGY OF CHINA (China)

e UNIVERSITY OF TOKYO (Japan)

f NAGOYA UNIVERSITY (Japan)

Author keywords

Anti spoofing; Countermeasure; Security; Speaker verification; Speech synthesis; Spoofing attack; Voice conversion

Indexed keywords

BENCHMARKING; SPEECH PROCESSING; SPEECH SYNTHESIS;

ANTI-SPOOFING; COUNTERMEASURE; SECURITY; SPEAKER VERIFICATION; SPOOFING ATTACKS; VOICE CONVERSION;

SPEECH RECOGNITION;

EID: 84962901047 PISSN: 23299290 EISSN: None Source Type: Journal
DOI: 10.1109/TASLP.2016.2526653 Document Type: Article

Times cited : (99)

References (78)

1
- 84946070918
- SAS: A speaker verification spoofing database containing diverse attacks
- Z. Wu et al., "SAS: A speaker verification spoofing database containing diverse attacks, " in Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP), 2015.
- (2015) Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP)
- Wu, Z.¹

2
- 84959177524
- Human vs machine spoofing detection on wideband and narrowband data
- M. Wester, Z. Wu, and J. Yamagishi, "Human vs machine spoofing detection on wideband and narrowband data, " in Proc. Interspeech, 2015.
- (2015) Proc. Interspeech
- Wester, M.¹ Wu, Z.² Yamagishi, J.³

3
- 84862007811
- Voice biometrics-the Asia pacific experience
- P. Golden, "Voice biometrics-The Asia Pacific experience, " Biom. Technol. Today, vol. 2012, no. 4, pp. 10-11, 2012.
- (2012) Biom. Technol. Today , vol.2012 , Issue.4 , pp. 10-11
- Golden, P.¹

4
- 84875163582
- Talking passwords: Voice biometrics for data access and security
- M. Khitrov, "Talking passwords: Voice biometrics for data access and security, " Biom. Technol. Today, vol. 2013, no. 2, pp. 9-11, 2013.
- (2013) Biom. Technol. Today , vol.2013 , Issue.2 , pp. 9-11
- Khitrov, M.¹

5
- 84880875127
- Voice biometrics: Success stories, success factors and what's next
- B. Beranek, "Voice biometrics: Success stories, success factors and what's next, " Biom. Technol. Today, vol. 2013, no. 7, pp. 9-11, 2013.
- (2013) Biom. Technol. Today , vol.2013 , Issue.7 , pp. 9-11
- Beranek, B.¹

6
- 84893339015
- Speaker verification makes its debut in smartphone
- Feb
- K. A. Lee, B. Ma, and H. Li, "Speaker verification makes its debut in smartphone, " in Proc. IEEE Signal Process. Soc. Speech Lang. Tech. Committee Newsl., Feb. 2013.
- (2013) Proc. IEEE Signal Process. Soc. Speech Lang. Tech. Committee Newsl.
- Lee, K.A.¹ Ma, B.² Li, H.³

7
- 84929796028
- Surveying the development of biometric user authentication on mobile phones
- 3rd Quart
- W. Meng, D. Wong, S. Furnell, and J. Zhou, "Surveying the development of biometric user authentication on mobile phones, " IEEE Commun. Surv. Tuts., vol. 17, no. 3, pp. 1268-1293, 3rd Quart. 2015.
- (2015) IEEE Commun. Surv. Tuts. , vol.17 , Issue.3 , pp. 1268-1293
- Meng, W.¹ Wong, D.² Furnell, S.³ Zhou, J.⁴

8
- 84919922238
- Spoofing and countermeasures for speaker verification: A survey
- Z. Wu, N. Evans, T. Kinnunen, J. Yamagishi, F. Alegre, and H. Li, "Spoofing and countermeasures for speaker verification: A survey, " Speech Commun., vol. 66, pp. 130-153, 2015.
- (2015) Speech Commun. , vol.66 , pp. 130-153
- Wu, Z.¹ Evans, N.² Kinnunen, T.³ Yamagishi, J.⁴ Alegre, F.⁵ Li, H.⁶

9
- 14544274085
- Vulnerability of speaker verification to voice mimicking
- Y.W. Lau, M. Wagner, and D. Tran, "Vulnerability of speaker verification to voice mimicking, " in Proc. Int. Symp. Intell. Multimedia, Video Speech Process., 2004.
- (2004) Proc. Int. Symp. Intell. Multimedia, Video Speech Process
- Lau, Y.W.¹ Wagner, M.² Tran, D.³

10
- 84906213805
- I-vectors meet imitators: On vulnerability of speaker verification systems against voice mimicry
- R. G. Hautamäki, T. Kinnunen, V. Hautamäki, T. Leino, and A.- M. Laukkanen, "I-vectors meet imitators: On vulnerability of speaker verification systems against voice mimicry, " in Proc. Interspeech, 2013.
- (2013) Proc. Interspeech
- Hautamäki, R.G.¹ Kinnunen, T.² Hautamäki, V.³ Leino, T.⁴ Laukkanen, A.M.⁵

11
- 84929611508
- Automatic versus human speaker verification: The case of voice mimicry
- R. G. Hautamäki, T. Kinnunen, V. Hautamäki, and A.-M. Laukkanen, "Automatic versus human speaker verification: The case of voice mimicry, " Speech Commun., vol. 72, pp. 13-31, 2015.
- (2015) Speech Commun. , vol.72 , pp. 13-31
- Hautamäki, R.G.¹ Kinnunen, T.² Hautamäki, V.³ Laukkanen, A.-M.⁴

12
- 84455211532
- Preventing replay attacks on speaker verification systems
- J. Villalba and E. Lleida, "Preventing replay attacks on speaker verification systems, " in Proc. IEEE Int. Carnahan Conf. Secur. Technol. (ICCST), 2011.
- (2011) Proc. IEEE Int. Carnahan Conf. Secur. Technol. (ICCST)
- Villalba, J.¹ Lleida, E.²

13
- 84949924182
- A study on replay attack and anti-spoofing for text-dependent speaker verification
- Z. Wu, S. Gao, E. S. Chng, and H. Li, "A study on replay attack and anti-spoofing for text-dependent speaker verification, " in Proc. Asia-Pac. Signal Inf. Process. Assoc. Annu. Summit Conf. (APSIPA ASC), 2014.
- (2014) Proc. Asia-Pac. Signal Inf. Process. Assoc. Annu. Summit Conf. (APSIPA ASC)
- Wu, Z.¹ Gao, S.² Chng, E.S.³ Li, H.⁴

14
- 84949494025
- On the study of replay and voice conversion attacks to text-dependent speaker verification
- Z. Wu and H. Li, "On the study of replay and voice conversion attacks to text-dependent speaker verification, " Multimedia Tools Appl., 2015, doi:10.1007/s11042-015-3080-9.
- (2015) Multimedia Tools Appl.
- Wu, Z.¹ Li, H.²

15
- 84906233506
- Evaluation of the vulnerability of speaker verification to synthetic speech
- P. L. De Leon, M. Pucher, and J. Yamagishi, "Evaluation of the vulnerability of speaker verification to synthetic speech, " in Proc. Odyssey: Speaker Lang. Recognit. Workshop, 2010.
- (2010) Proc. Odyssey: Speaker Lang. Recognit. Workshop
- De Leon, P.L.¹ Pucher, M.² Yamagishi, J.³

16
- 84865369980
- Evaluation of speaker verification security and detection of hmm-based synthetic speech
- Oct
- P. L. De Leon, M. Pucher, J. Yamagishi, I. Hernaez, and I. Saratxaga, "Evaluation of speaker verification security and detection of HMM-based synthetic speech, " IEEE Trans. Audio Speech Lang. Process., vol. 20, no. 8, pp. 2280-2290, Oct. 2012.
- (2012) IEEE Trans. Audio Speech Lang. Process , vol.20 , Issue.8 , pp. 2280-2290
- De Leon, P.L.¹ Pucher, M.² Yamagishi, J.³ Hernaez, I.⁴ Saratxaga, I.⁵

17
- 65349113532
- Artificial impostor voice transformation effects on false acceptance rates
- J.-F. Bonastre, D. Matrouf, and C. Fredouille, "Artificial impostor voice transformation effects on false acceptance rates, " in Proc. Interspeech, 2007.
- (2007) Proc. Interspeech
- Bonastre, J.-F.¹ Matrouf, D.² Fredouille, C.³

18
- 84867600098
- Vulnerability of speaker verification systems against voice conversion spoofing attacks: The case of telephone speech
- T. Kinnunen, Z.-Z. Wu, K. A. Lee, F. Sedlak, E. S. Chng, and H. Li, "Vulnerability of speaker verification systems against voice conversion spoofing attacks: The case of telephone speech, " in Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP), 2012.
- (2012) Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP)
- Kinnunen, T.¹ Wu, Z.-Z.² Lee, K.A.³ Sedlak, F.⁴ Chng, E.S.⁵ Li, H.⁶

19
- 84874448812
- A study on spoofing attack in state-of-the-art speaker verification: The telephone speech case
- Z. Wu, T. Kinnunen, E. S. Chng, H. Li, and E. Ambikairajah, "A study on spoofing attack in state-of-the-art speaker verification: The telephone speech case, " in Proc. Asia-Pac. Signal Inf. Process. Assoc. Annu. Summit Conf. (APSIPA ASC), 2012.
- (2012) Proc. Asia-Pac. Signal Inf. Process. Assoc. Annu. Summit Conf. (APSIPA ASC)
- Wu, Z.¹ Kinnunen, T.² Chng, E.S.³ Li, H.⁴ Ambikairajah, E.⁵

20
- 84906234851
- Voice transformation-based spoofing of text dependent speaker verification systems
- Z. Kons and H. Aronowitz, "Voice transformation-based spoofing of text dependent speaker verification systems, " in Proc. Interspeech, 2013.
- (2013) Proc. Interspeech
- Kons, Z.¹ Aronowitz, H.²

21
- 84956723787
- Voice conversion versus speaker verification: An overview
- Z. Wu and H. Li, "Voice conversion versus speaker verification: An overview, " APSIPA Trans. Signal Inf. Process., vol. 3, p. e17, 2014.
- (2014) APSIPA Trans. Signal Inf. Process , vol.3 , pp. e17
- Wu, Z.¹ Li, H.²

22
- 85135274466
- On the security of hmm-based speaker verification systems against imposture using synthetic speech
- T. Masuko, T. Hitotsumatsu, K. Tokuda, and T. Kobayashi, "On the security of HMM-based speaker verification systems against imposture using synthetic speech, " in Proc. Eur. Conf. Speech Commun. Technol. (Eurospeech), 1999.
- (1999) Proc. Eur. Conf. Speech Commun. Technol. (Eurospeech)
- Masuko, T.¹ Hitotsumatsu, T.² Tokuda, K.³ Kobayashi, T.⁴

23
- 1942512336
- Imposture using synthetic speech against speaker verification based on spectrum and pitch
- T. Masuko, K. Tokuda, and T. Kobayashi, "Imposture using synthetic speech against speaker verification based on spectrum and pitch, " in Proc. Interspeech, 2000.
- (2000) Proc. Interspeech
- Masuko, T.¹ Tokuda, K.² Kobayashi, T.³

24
- 0012330750
- The design for the wall street journal-based CSR corpus
- D. B. Paul and J. M. Baker, "The design for the wall street journal-based CSR corpus, " in Proc. Workshop Speech Nat. Lang., 1992, pp. 357-362.
- (1992) Proc. Workshop Speech Nat. Lang. , pp. 357-362
- Paul, D.B.¹ Baker, J.M.²

25
- 0032664931
- An experimental study of speaker verification sensitivity to computer voice-altered imposters
- B. L. Pellom and J. H. Hansen, "An experimental study of speaker verification sensitivity to computer voice-altered imposters, " in Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP), 1999.
- (1999) Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP)
- Pellom, B.L.¹ Hansen, J.H.²

26
- 33646787422
- Voice forgery using ALISP: Indexation in a client memory
- P. Perrot, G. Aversano, R. Blouet, M. Charbit, and G. Chollet, "Voice forgery using ALISP: Indexation in a client memory, " in Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP), 2005.
- (2005) Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP)
- Perrot, P.¹ Aversano, G.² Blouet, R.³ Charbit, M.⁴ Chollet, G.⁵

27
- 33947714703
- Effect of speech transformation on impostor acceptance
- D. Matrouf, J.-F. Bonastre, and C. Fredouille, "Effect of speech transformation on impostor acceptance, " in Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP), 2006.
- (2006) Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP)
- Matrouf, D.¹ Bonastre, J.-F.² Fredouille, C.³

28
- 85009119461
- A robust speaker verification system against imposture using an HMM-based speech synthesis system
- T. Satoh, T. Masuko, T. Kobayashi, and K. Tokuda, "A robust speaker verification system against imposture using an HMM-based speech synthesis system, " in Proc. Eur. Conf. Speech Commun. Technol. (Eurospeech), 2001.
- (2001) Proc. Eur. Conf. Speech Commun. Technol. (Eurospeech)
- Satoh, T.¹ Masuko, T.² Kobayashi, T.³ Tokuda, K.⁴

29
- 84878402831
- Synthetic speech discrimination using pitch pattern statistics derived from image analysis
- P. L. De Leon, B. Stewart, and J. Yamagishi, "Synthetic speech discrimination using pitch pattern statistics derived from image analysis, " in Proc. Interspeech, 2012.
- (2012) Proc. Interspeech
- De Leon, P.L.¹ Stewart, B.² Yamagishi, J.³

30
- 84905216554
- Performance of ivector speaker verification and the detection of synthetic speech
- R. D. McClanahan, B. Stewart, and P. L. De Leon, "Performance of Ivector speaker verification and the detection of synthetic speech, " in Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP), 2014.
- (2014) Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP)
- McClanahan, R.D.¹ Stewart, B.² De Leon, P.L.³

31
- 84890543945
- Synthetic speech detection using temporal modulation feature
- Z. Wu, X. Xiao, E. S. Chng, and H. Li, "Synthetic speech detection using temporal modulation feature, " in Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP), 2013.
- (2013) Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP)
- Wu, Z.¹ Xiao, X.² Chng, E.S.³ Li, H.⁴

32
- 33947167478
- Face description with local binary patterns: Application to face recognition
- Dec
- T. Ahonen, A. Hadid, and M. Pietikainen, "Face description with local binary patterns: Application to face recognition, " IEEE Trans. Pattern Anal. Mach. Intell., vol. 28, no. 12, pp. 2037-2041, Dec. 2006.
- (2006) IEEE Trans. Pattern Anal. Mach. Intell. , vol.28 , Issue.12 , pp. 2037-2041
- Ahonen, T.¹ Hadid, A.² Pietikainen, M.³

33
- 84906244272
- A new speaker verification spoofing countermeasure based on local binary patterns
- F. Alegre, R. Vipperla, A. Amehraye, and N. Evans, "A new speaker verification spoofing countermeasure based on local binary patterns, " in Proc. Interspeech, 2013.
- (2013) Proc. Interspeech
- Alegre, F.¹ Vipperla, R.² Amehraye, A.³ Evans, N.⁴

34
- 84893797780
- A one-class classification approach to generalised speaker verification spoofing countermeasures using local binary patterns
- F. Alegre, A. Amehraye, and N. Evans, "A one-class classification approach to generalised speaker verification spoofing countermeasures using local binary patterns, " in Proc. Int. Conf. Biom.: Theory Appl. Syst. (BTAS), 2013.
- (2013) Proc. Int. Conf. Biom.: Theory Appl. Syst. (BTAS)
- Alegre, F.¹ Amehraye, A.² Evans, N.³

35
- 84878410960
- Detecting converted speech and natural speech for anti-spoofing attack in speaker recognition
- Z. Wu, E. S. Chng, and H. Li, "Detecting converted speech and natural speech for anti-spoofing attack in speaker recognition, " in Proc. Interspeech, 2012.
- (2012) Proc. Interspeech
- Wu, Z.¹ Chng, E.S.² Li, H.³

36
- 84910072494
- A crossvocoder study of speaker independent synthetic speech detection using phase information
- J. Sanchez, I. Saratxaga, I. Hernaez, E. Navas, and D. Erro, "A crossvocoder study of speaker independent synthetic speech detection using phase information, " in Proc. Interspeech, 2014.
- (2014) Proc. Interspeech
- Sanchez, J.¹ Saratxaga, I.² Hernaez, I.³ Navas, E.⁴ Erro, D.⁵

37
- 84926346726
- Toward a universal synthetic speech spoofing detection using phase information
- Apr
- J. Sanchez, I. Saratxaga, I. Hernaez, E. Navas, D. Erro, and T. Raitio, "Toward a universal synthetic speech spoofing detection using phase information, " IEEE Trans. Inf. Forensics Secur., vol. 10, no. 4, pp. 810- 820, Apr. 2015.
- (2015) IEEE Trans. Inf. Forensics Secur. , vol.10 , Issue.4 , pp. 810-820
- Sanchez, J.¹ Saratxaga, I.² Hernaez, I.³ Navas, E.⁴ Erro, D.⁵ Raitio, T.⁶

38
- 84890542394
- Spoofing countermeasures to protect automatic speaker verification from voice conversion
- F. Alegre, A. Amehraye, and N. Evans, "Spoofing countermeasures to protect automatic speaker verification from voice conversion, " in Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP), 2013.
- (2013) Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP)
- Alegre, F.¹ Amehraye, A.² Evans, N.³

39
- 84910058696
- Introducing I-vectors for joint anti-spoofing and speaker verification
- E. Khoury, T. Kinnunen, A. Sizov, Z. Wu, and S. Marcel, "Introducing I-vectors for joint anti-spoofing and speaker verification, " in Proc. Interspeech, 2014.
- (2014) Proc. Interspeech
- Khoury, E.¹ Kinnunen, T.² Sizov, A.³ Wu, Z.⁴ Marcel, S.⁵

40
- 84959103968
- Joint speaker verification and antispoofing in the i-vector space
- Apr
- A. Sizov, E. Khoury, T. Kinnunen, Z. Wu, and S. Marcel, "Joint speaker verification and antispoofing in the I-vector space, " IEEE Trans. Inf. Forensics Secur., vol. 10, no. 4, pp. 821-832, Apr. 2015.
- (2015) IEEE Trans. Inf. Forensics Secur. , vol.10 , Issue.4 , pp. 821-832
- Sizov, A.¹ Khoury, E.² Kinnunen, T.³ Wu, Z.⁴ Marcel, S.⁵

41
- 84959130948
- Online]. Available
- Z. Wu, T. Kinnunen, N. Evans, and J. Yamagishi. (2014). ASVspoof 2015: Automatic Speaker Verification Spoofing and Countermeasures Challenge Evaluation Plan [Online]. Available: http://www.zhizheng.org/papers/asvSpoof-eval-plan.pdf.
- (2014) ASVspoof 2015: Automatic Speaker Verification Spoofing and Countermeasures Challenge Evaluation Plan
- Wu, Z.¹ Kinnunen, T.² Evans, N.³ Yamagishi, J.⁴

42
- 84959130948
- ASVs poof 2015: The first automatic speaker verification spoofing and countermeasures challenge
- Z. Wu et al., "ASVs poof 2015: The first automatic speaker verification spoofing and countermeasures challenge, " in Proc. Interspeech, 2015.
- (2015) Proc. Interspeech
- Wu, Z.¹

43
- 85073095401
- Human assisted speaker recognition in NIST SRE10
- C. S. Greenberg et al., "Human assisted speaker recognition in NIST SRE10." in Proc. Odyssey: Speaker Lang. Recognit. Workshop, 2010.
- (2010) Proc. Odyssey: Speaker Lang. Recognit. Workshop
- Greenberg, C.S.¹

44
- 67651002140
- Statistical parametric speech synthesis
- H. Zen, K. Tokuda, and A. W. Black, "Statistical parametric speech synthesis, " Speech Commun., vol. 51, no. 11, pp. 1039-1064, 2009.
- (2009) Speech Commun , vol.51 , Issue.11 , pp. 1039-1064
- Zen, H.¹ Tokuda, K.² Black, A.W.³

45
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A. Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds, " Speech Commun., vol. 27, pp. 187-207, 1999.
- (1999) Speech Commun. , vol.27 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² Cheveigné, A.³

46
- 84865777002
- The CSTR/EMIME HTS system for blizzard challenge 2010
- Kyoto, Japan, Sep
- J. Yamagishi and O. Watts, "The CSTR/EMIME HTS system for blizzard challenge 2010, " in Proc. Blizzard Challenge, Kyoto, Japan, Sep. 2010.
- (2010) Proc. Blizzard Challenge
- Yamagishi, J.¹ Watts, O.²

47
- 84897393748
- Structural Bayesian linear regression for hidden Markov models
- S. Watanabe, A. Nakamura, and B.-H. Juang, "Structural Bayesian linear regression for hidden Markov models, " J. Signal Process. Syst., vol. 74, no. 3, pp. 341-358, 2014.
- (2014) J. Signal Process. Syst. , vol.74 , Issue.3 , pp. 341-358
- Watanabe, S.¹ Nakamura, A.² Juang, B.-H.³

48
- 38549096029
- A speech parameter generation algorithm considering global variance for hmm-based speech synthesis
- May
- T. Toda and K. Tokuda, "A speech parameter generation algorithm considering global variance for HMM-based speech synthesis, " IEICE Trans. Inf. Syst., vol. E90-D, no. 5, pp. 816-824, May 2007.
- (2007) IEICE Trans. Inf. Syst. , vol.E90-D , Issue.5 , pp. 816-824
- Toda, T.¹ Tokuda, K.²

49
- 0025543906
- Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
- E. Moulines and F. Charpentier, "Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones, " Speech Commun., vol. 9, no. 5-6, pp. 453-468, 1990.
- (1990) Speech Commun. , vol.9 , Issue.5-6 , pp. 453-468
- Moulines, E.¹ Charpentier, F.²

50
- 85016140477
- An adaptive algorithm for mel-cepstral analysis of speech
- Mar
- T. Fukada, K. Tokuda, T. Kobayashi, and S. Imai, "An adaptive algorithm for Mel-cepstral analysis of speech, " in Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP), Mar. 1992, pp. 137-140.
- (1992) Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP) , pp. 137-140
- Fukada, T.¹ Tokuda, K.² Kobayashi, T.³ Imai, S.⁴

51
- 78049403515
- Simple methods for improving speakersimilarity of HMM-based speech synthesis
- J. Yamagishi and S. King, "Simple methods for improving speakersimilarity of HMM-based speech synthesis, " in Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP), 2010.
- (2010) Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP)
- Yamagishi, J.¹ King, S.²

52
- 0142247093
- The German text-to-speech synthesis system Mary: A tool for research, development and teaching
- M. Schröder and J. Trouvain, "The German text-to-speech synthesis system MARY: A tool for research, development and teaching, " Int. J. Speech Technol., vol. 6, no. 4, pp. 365-377, 2003.
- (2003) Int. J. Speech Technol. , vol.6 , Issue.4 , pp. 365-377
- Schröder, M.¹ Trouvain, J.²

53
- 85131821539
- Mel-generalized cepstral analysis-a unified approach to speech spectral estimation
- K. Tokuda, T. Kobayashi, T. Masuko, and S. Imai, "Mel-generalized cepstral analysis-a unified approach to speech spectral estimation, " in Proc. Int. Conf. Spoken Lang. Process. (ICSLP), 1994.
- (1994) Proc. Int. Conf. Spoken Lang. Process. (ICSLP)
- Tokuda, K.¹ Kobayashi, T.² Masuko, T.³ Imai, S.⁴

54
- 78049398713
- Non-parallel training for many-to-many eigenvoice conversion
- Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Non-parallel training for many-to-many eigenvoice conversion, " in Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP), 2010, pp. 4822-4825.
- (2010) Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP) , pp. 4822-4825
- Ohtani, Y.¹ Toda, T.² Saruwatari, H.³ Shikano, K.⁴

55
- 0025475528
- Atr Japanese speech database as a tool of speech recognition and synthesis
- A. Kurematsu, K. Takeda, Y. Sagisaka, H. Katagiri, S. Kuwabara, and K. Shikano, "ATR Japanese speech database as a tool of speech recognition and synthesis, " Speech Commun., vol. 9, pp. 357-363, 1990.
- (1990) Speech Commun. , vol.9 , pp. 357-363
- Kurematsu, A.¹ Takeda, K.² Sagisaka, Y.³ Katagiri, H.⁴ Kuwabara, S.⁵ Shikano, K.⁶

56
- 84878378722
- Effects of speaker adaptive training on tensor-based arbitrary speaker conversion
- D. Saito, N. Minematsu, and K. Hirose, "Effects of speaker adaptive training on tensor-based arbitrary speaker conversion, " in Proc. Interspeech, 2012.
- (2012) Proc. Interspeech
- Saito, D.¹ Minematsu, N.² Hirose, K.³

57
- 57749193836
- Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
- Nov
- T. Toda, A. W. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory, " IEEE Trans. Audio Speech Lang. Process., vol. 15, no. 8, pp. 2222-2235, Nov. 2007.
- (2007) IEEE Trans. Audio Speech Lang. Process , vol.15 , Issue.8 , pp. 2222-2235
- Toda, T.¹ Black, A.W.² Tokuda, K.³

58
- 84906276055
- Exemplarbased unit selection for voice conversion utilizing temporal information
- Z. Wu, T. Virtanen, T. Kinnunen, E. S. Chng, and H. Li, "Exemplarbased unit selection for voice conversion utilizing temporal information, " in Proc. Interspeech, 2013.
- (2013) Proc. Interspeech
- Wu, Z.¹ Virtanen, T.² Kinnunen, T.³ Chng, E.S.⁴ Li, H.⁵

59
- 84878390910
- Implementation of computationally efficient real-time voice conversion
- T. Toda, T. Muramatsu, and H. Banno, "Implementation of computationally efficient real-time voice conversion, " in Proc. Interspeech, 2012.
- (2012) Proc. Interspeech
- Toda, T.¹ Muramatsu, T.² Banno, H.³

60
- 84856141218
- Voice conversion using dynamic kernel partial least squares regression
- E. Helander, H. Silén, T. Virtanen, and M. Gabbouj, "Voice conversion using dynamic kernel partial least squares regression, " IEEE Trans. Audio Speech Lang. Process., vol. 20, no. 3, pp. 806-817, 2012.
- (2012) IEEE Trans. Audio Speech Lang. Process , vol.20 , Issue.3 , pp. 806-817
- Helander, E.¹ Silén, H.² Virtanen, T.³ Gabbouj, M.⁴

61
- 0033884858
- Speaker verification using adapted Gaussian mixture models
- D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, "Speaker verification using adapted Gaussian mixture models, " Digit. Signal Process., vol. 10, no. 1, pp. 19-41, 2000.
- (2000) Digit. Signal Process , vol.10 , Issue.1 , pp. 19-41
- Reynolds, D.A.¹ Quatieri, T.F.² Dunn, R.B.³

62
- 50249170027
- Joint factor analysis versus eigenchannels in speaker recognition
- May
- P. Kenny, G. Boulianne, P. Ouellet, and P. Dumouchel, "Joint factor analysis versus eigenchannels in speaker recognition, " IEEE Trans. Audio Speech Lang. Process., vol. 15, no. 4, pp. 1435-1447, May 2007.
- (2007) IEEE Trans. Audio Speech Lang. Process , vol.15 , Issue.4 , pp. 1435-1447
- Kenny, P.¹ Boulianne, G.² Ouellet, P.³ Dumouchel, P.⁴

63
- 81855205043
- Probabilistic models for inference about identity
- Jan
- P. Li, Y. Fu, U. Mohammed, J. H. Elder, and S. J. Prince, "Probabilistic models for inference about identity, " IEEE Trans. Pattern Anal. Mach. Intell., vol. 34, no. 1, pp. 144-157, Jan. 2012.
- (2012) IEEE Trans. Pattern Anal. Mach. Intell. , vol.34 , Issue.1 , pp. 144-157
- Li, P.¹ Fu, Y.² Mohammed, U.³ Elder, J.H.⁴ Prince, S.J.⁵

64
- 84864277561
- Audioseg: Audio segmentation toolkit, release 1.2
- France, Jan
- G. Gravier, M. Betser, and M. Ben, "AudioSeg: Audio segmentation toolkit, release 1.2, " IRISA, France, Jan. 2010.
- (2010) IRISA
- Gravier, G.¹ Betser, M.² Ben, M.³

65
- 84910024698
- MSR identity toolbox v1. 0: A MATLAB toolbox for speaker recognition research
- Soc. Speech Lang. Tech. Committee Newsl. November
- S. O. Sadjadi, M. Slaney, and L. Heck, "MSR identity toolbox v1. 0: A MATLAB toolbox for speaker recognition research, " in Proc. IEEE Signal Process. Soc. Speech Lang. Tech. Committee Newsl., November 2013.
- (2013) Proc. IEEE Signal Process
- Sadjadi, S.O.¹ Slaney, M.² Heck, L.³

66
- 84901846660
- From single to multiple enrollment I-vectors: Practical PLDA scoring variants for speaker verification
- p
- P. Rajan, A. Afanasyev, V. Hautamäki, and T. Kinnunen, "From single to multiple enrollment I-vectors: Practical PLDA scoring variants for speaker verification, " Digit. Signal Process., vol. 31, pp. 93-101, 2014.
- (2014) Digit. Signal Process , vol.31 , pp. 93-101
- Rajan, P.¹ Afanasyev, A.² Hautamäki, V.³ Kinnunen, T.⁴

67
- 4544290141
- Usefulness of phase spectrum in human speech perception
- K. K. Paliwal and L. D. Alsteris, "Usefulness of phase spectrum in human speech perception, " in Proc. Interspeech, 2003.
- (2003) Proc. Interspeech
- Paliwal, K.K.¹ Alsteris, L.D.²

68
- 51849100937
- Significance of the modified group delay feature in speech recognition
- Jan
- R. M. Hegde et al., "Significance of the modified group delay feature in speech recognition, " IEEE Trans. Audio Speech Lang. Process., vol. 15, no. 1, pp. 190-202, Jan. 2007.
- (2007) IEEE Trans. Audio Speech Lang. Process , vol.15 , Issue.1 , pp. 190-202
- Hegde, R.M.¹

69
- 70450194107
- Robustness of phase based features for speaker recognition
- P. Rajan, S. H. K. Parthasarathi, and H. A. Murthy, "Robustness of phase based features for speaker recognition, " in Proc. Interspeech, 2009.
- (2009) Proc. Interspeech
- Rajan, P.¹ Parthasarathi, S.H.K.² Murthy, H.A.³

70
- 79955702502
- LIBSVM: A library for support vector machines
- C.-C. Chang and C.-J. Lin, "LIBSVM: A library for support vector machines, " ACM Trans. Intell. Syst. Technol. (TIST), vol. 2, no. 3, p. 27, 2011.
- (2011) ACM Trans. Intell. Syst. Technol. (TIST) , vol.2 , Issue.3 , pp. 27
- Chang, C.-C.¹ Lin, C.-J.²

71
- 84925160976
- Cambridge, U.K.: Cambridge Univ. Press
- P. Taylor, Text-to-Speech Synthesis. Cambridge, U.K.: Cambridge Univ. Press, 2009.
- (2009) Text-to-Speech Synthesis
- Taylor, P.¹

72
- 27544482501
- Discrimination method of synthetic speech using pitch frequency against synthetic speech falsification
- Jan
- A. Ogihara, H. Unno, and A. Shiozakai, "Discrimination method of synthetic speech using pitch frequency against synthetic speech falsification, " IEICE Trans. Fundam. Electron. Commun. Comput. Sci., vol. 88, no. 1, pp. 280-286, Jan. 2005.
- (2005) IEICE Trans. Fundam. Electron. Commun. Comput. Sci. , vol.88 , Issue.1 , pp. 280-286
- Ogihara, A.¹ Unno, H.² Shiozakai, A.³

73
- 51449086024
- Fusion of heterogeneous speaker recognition systems in the STBU submission for the NIST speaker recognition evaluation 2006
- Sep
- N. Brummer et al., "Fusion of heterogeneous speaker recognition systems in the STBU submission for the NIST speaker recognition evaluation 2006, " IEEE Trans. Audio Speech Lang. Process., vol. 15, no. 7, pp. 2072-2084, Sep. 2007.
- (2007) IEEE Trans. Audio Speech Lang. Process , vol.15 , Issue.7 , pp. 2072-2084
- Brummer, N.¹

74
- 84877743396
- Sparse classifier fusion for speaker verification
- Aug
- V. Hautamaki, T. Kinnunen, F. Sedlák, K. A. Lee, B. Ma, and H. Li, "Sparse classifier fusion for speaker verification, " IEEE Trans. Audio Speech Lang. Process., vol. 21, no. 8, pp. 1622-1631, Aug. 2013.
- (2013) IEEE Trans. Audio Speech Lang. Process , vol.21 , Issue.8 , pp. 1622-1631
- Hautamaki, V.¹ Kinnunen, T.² Sedlák, F.³ Lee, K.A.⁴ Ma, B.⁵ Li, H.⁶

75
- 0033889739
- Speaker verification by human listeners: Experiments comparing human and machine performance using the NIST 1998 speaker evaluation data
- A. Schmidt-Nielsen and T. H. Crystal, "Speaker verification by human listeners: Experiments comparing human and machine performance using the NIST 1998 speaker evaluation data, " Digit. Signal Process., vol. 10, no. 1, pp. 249-266, 2000.
- (2000) Digit. Signal Process , vol.10 , Issue.1 , pp. 249-266
- Schmidt-Nielsen, A.¹ Crystal, T.H.²

76
- 79959816522
- Approaching human listener accuracy with modern speaker verification
- V. Hautamäki, T. Kinnunen, M. Nosratighods, K. A. Lee, B. Ma, and H. Li, "Approaching human listener accuracy with modern speaker verification, " in Proc. Interspeech, 2010.
- (2010) Proc. Interspeech
- Hautamäki, V.¹ Kinnunen, T.² Nosratighods, M.³ Lee, K.A.⁴ Ma, B.⁵ Li, H.⁶

77
- 77956899791
- Discontinuity detection in concatenated speech synthesis based on nonlinear speech analysis
- Y. Pantazis, Y. Stylianou, and E. Klabbers, "Discontinuity detection in concatenated speech synthesis based on nonlinear speech analysis, " in Proc. Interspeech, 2005.
- (2005) Proc. Interspeech
- Pantazis, Y.¹ Stylianou, Y.² Klabbers, E.³

78
- 84962920323
- Available
- Z. Wu et al. (2015). Spoofing and Anti-Spoofing (SAS) corpus v1.0 [Online]. Available: http://dx.doi.org/10.7488/ds/252.
- (2015) Spoofing and Anti-Spoofing (SAS) Corpus V1.0 [Online]
- Wu, Z.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.