SCOPUS Information Search Platform

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volume 2015-August, 2015, Pages 4440-4444

SAS: A speaker verification spoofing database containing diverse attacks

(7) Wu, Zhizheng (a); Khodabakhsh, Ali (b); Demiroglu, Cenk (b); Yamagishi, Junichi (a, c); Saito, Daisuke (d); Toda, Tomoki (e); King, Simon (a)

a UNIVERSITY OF EDINBURGH (United Kingdom)

b OZYEGIN UNIVERSITY (Turkey)

c NATIONAL INSTITUTE OF INFORMATICS (Japan)

d UNIVERSITY OF TOKYO (Japan)

e NARA INSTITUTE OF SCIENCE AND TECHNOLOGY (Japan)

Author keywords

Database; security; speaker verification; speech synthesis; spoofing attack; voice conversion

Indexed keywords

DATABASE SYSTEMS; SPEECH COMMUNICATION; SPEECH PROCESSING; SPEECH RECOGNITION; SPEECH SYNTHESIS;

ANTI-SPOOFING; SECURITY; SPEAKER VERIFICATION; SPOOFING ATTACKS; TWO-STATE; VOICE CONVERSION;

AUDIO SIGNAL PROCESSING;

EID: 84946070918 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2015.7178810 Document Type: Conference Paper

Times cited : (76)

References (42)

1
- 84946081587
- Speaker verification makes its debut in smartphone
- Kong Aik Lee, Bin Ma, and Haizhou Li, Speaker verification makes its debut in smartphone, in IEEE Signal Processing Society Speech and Language Technical Committee Newsletter, 2013
- (2013) IEEE Signal Processing Society Speech and Language Technical Committee Newsletter
- Lee, K.A.¹ Ma, B.² Li, H.³

2
- 84905231482
- Speaker recognition anti-spoofing
- N. Evans, T. Kinnunen, J. Yamagishi, Z. Wu, F. Alegre, and P. DeLeon, Speaker recognition anti-spoofing, in Handbook of biometric anti-spoofing, S. Marcel, S. Z. Li, and M. Nixon, Eds. Springer, 2014
- (2014) Handbook of Biometric Anti-spoofing
- Evans, N.¹ Kinnunen, T.² Yamagishi, J.³ Wu, Z.⁴ Alegre, F.⁵ DeLeon, P.⁶

3
- 84919922238
- Spoofing and countermeasures for speaker verification: A survey
- Zhizheng Wu, Nicholas Evans, Tomi Kinnunen, Junichi Yamagishi, Federico Alegre, and Haizhou Li, Spoofing and countermeasures for speaker verification: a survey, Speech Communication, vol. 66, pp. 130-153, 2015
- (2015) Speech Communication , vol.66 , pp. 130-153
- Wu, Z.¹ Evans, N.² Kinnunen, T.³ Yamagishi, J.⁴ Alegre, F.⁵ Li, H.⁶

4
- 84956723787
- Voice conversion versus speaker verification: An overview
- Zhizheng Wu and Haizhou Li, Voice conversion versus speaker verification: an overview, APSIPA Transactions on Signal and Information Processing, vol. 3, 2014
- (2014) APSIPA Transactions on Signal and Information Processing , vol.3
- Wu, Z.¹ Li, H.²

5
- 14544274085
- Vulnerability of speaker verification to voice mimicking
- Yee Wah Lau, Michael Wagner, and Dat Tran, Vulnerability of speaker verification to voice mimicking, in Proc. Int. Symposium on Intelligent Multimedia, Video and Speech Processing, 2004
- (2004) Proc. Int. Symposium on Intelligent Multimedia, Video and Speech Processing
- Lau, Y.W.¹ Wagner, M.² Tran, D.³

6
- 84906213805
- I-vectors meet imitators: On vulnerability of speaker verification systems against voice mimicry
- R. Gonzalez Hautamäki, T. Kinnunen, V. Hautamäki, T. Leino, and A.-M. Laukkanen, I-vectors meet imitators: on vulnerability of speaker verification systems against voice mimicry, in Proc. Interspeech, 2013
- (2013) Proc. Interspeech
- Gonzalez Hautamäki, R.¹ Kinnunen, T.² Hautamäki, V.³ Leino, T.⁴ Laukkanen, A.-M.⁵

7
- 84949924182
- A study on replay attack and anti-spoofing for text-dependent speaker verification
- Zhizheng Wu, Sheng Gao, Eng Siong Chng, and Haizhou Li, A study on replay attack and anti-spoofing for text-dependent speaker verification, in Proc. Asia-Pacific Signal Information Processing Association Annual Summit and Conference (APSIPA ASC), 2014
- (2014) Proc. Asia-Pacific Signal Information Processing Association Annual Summit and Conference (APSIPA ASC)
- Wu, Z.¹ Gao, S.² Chng, E.S.³ Li, H.⁴

8
- 84906233506
- Evaluation of the vulnerability of speaker verification to synthetic speech
- Phillip L Leon, Michael Pucher, and Junichi Yamagishi, Evaluation of the vulnerability of speaker verification to synthetic speech, in Proc. Odyssey: the Speaker and Language Recognition Workshop, 2010
- (2010) Proc. Odyssey: The Speaker and Language Recognition Workshop
- Leon, P.L.¹ Pucher, M.² Yamagishi, J.³

9
- 84865369980
- Evaluation of speaker verification security and detection of HMM-based synthetic speech
- P. L. Leon, M. Pucher, J. Yamagishi, I. Hernaez, and I. Saratxaga, Evaluation of speaker verification security and detection of HMM-based synthetic speech, IEEE Trans. Audio, Speech and Language Processing, vol. 20, no. 8, pp. 2280-2290, 2012
- (2012) IEEE Trans. Audio, Speech and Language Processing , vol.20 , Issue.8 , pp. 2280-2290
- Leon, P.L.¹ Pucher, M.² Yamagishi, J.³ Hernaez, I.⁴ Saratxaga, I.⁵

10
- 65349113532
- Artificial impostor voice transformation effects on false acceptance rates
- Jean-François Bonastre, Driss Matrouf, and Corinne Fredouille, Artificial impostor voice transformation effects on false acceptance rates, in Proc. Interspeech, 2007
- (2007) Proc. Interspeech
- Bonastre, J.¹ Matrouf, D.² Fredouille, C.³

11
- 84867600098
- Vulnerability of speaker verification systems against voice conversion spoofing attacks: The case of telephone speech
- Tomi Kinnunen, Zhi-Zheng Wu, Kong Aik Lee, Filip Sedlak, Eng Siong Chng, and Haizhou Li, Vulnerability of speaker verification systems against voice conversion spoofing attacks: The case of telephone speech, in Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2012
- (2012) Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
- Kinnunen, T.¹ Wu, Z.-Z.² Lee, K.A.³ Sedlak, F.⁴ Chng, E.S.⁵ Li, H.⁶

12
- 84874448812
- A study on spoofing attack in state-of-the-art speaker verification: The telephone speech case
- Zhizheng Wu, Tomi Kinnunen, Eng Siong Chng, Haizhou Li, and Eliathamby Ambikairajah, A study on spoofing attack in state-of-the-art speaker verification: the telephone speech case, in Proc. Asia-Pacific Signal Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012
- (2012) Proc. Asia-Pacific Signal Information Processing Association Annual Summit and Conference (APSIPA ASC)
- Wu, Z.¹ Kinnunen, T.² Chng, E.S.³ Li, H.⁴ Ambikairajah, E.⁵

13
- 84878412793
- Spoofing countermeasures for the protection of automatic speaker recognition systems against attacks with artificial signals
- Federico Alegre, Ravichander Vipperla, Nicholas Evans, et al., Spoofing countermeasures for the protection of automatic speaker recognition systems against attacks with artificial signals, in Proc. Interspeech, 2012
- (2012) Proc. Interspeech
- Alegre, F.¹ Vipperla, R.² Evans, N.³

14
- 84906234851
- Voice transformation-based spoofing of text-dependent speaker verification systems
- Zvi Kons and Hagai Aronowitz, Voice transformation-based spoofing of text-dependent speaker verification systems, in Proc. Interspeech, 2013
- (2013) Proc. Interspeech
- Kons, Z.¹ Aronowitz, H.²

15
- 84946054709
- Spoofing and countermeasures for speaker verification: A need for standard corpora, protocols and metrics
- Nicholas W D Evans, Junichi Yamagishi, and Tomi Kinnunen, Spoofing and countermeasures for speaker verification: a need for standard corpora, protocols and metrics, in IEEE Signal Processing Society Speech and Language Technical Committee Newsletter, 2013
- (2013) IEEE Signal Processing Society Speech and Language Technical Committee Newsletter
- Evans, N.W.D.¹ Yamagishi, J.² Kinnunen, T.³

16
- 0028996937
- Testing with the YOHO CD-ROM voice verification corpus
- J. P. Campbell Jr, Testing with the YOHO CD-ROM voice verification corpus, in Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 1995
- (1995) Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
- Campbell, J.P.¹

17
- 84893797780
- A one-class classification approach to generalised speaker verification spoofing countermeasures using local binary patterns
- Federico Alegre, Asmaa Amehraye, and Nicholas Evans, A one-class classification approach to generalised speaker verification spoofing countermeasures using local binary patterns, in Proc. Int. Conf. on Biometrics: Theory, Applications and Systems (BTAS), 2013
- (2013) Proc. Int. Conf. on Biometrics: Theory, Applications and Systems (BTAS)
- Alegre, F.¹ Amehraye, A.² Evans, N.³

18
- 84906275384
- Vulnerability evaluation of speaker verification under voice conversion spoofing: The effect of text constraints
- Zhizheng Wu, Anthony Larcher, Kong Aik Lee, Eng Siong Chng, Tomi Kinnunen, and Haizhou Li, Vulnerability evaluation of speaker verification under voice conversion spoofing: the effect of text constraints, in Proc. Interspeech, 2013
- (2013) Proc. Interspeech
- Wu, Z.¹ Larcher, A.² Lee, K.A.³ Chng, E.S.⁴ Kinnunen, T.⁵ Li, H.⁶

19
- 84897385841
- Text-dependent speaker verification: Classifiers, databases and RSR2015
- A. Larcher, K. A. Lee, B. Ma, and H. Li, Text-dependent speaker verification: Classifiers, databases and RSR2015, Speech Communication, vol. 60, pp. 56-77, 2014
- (2014) Speech Communication , vol.60 , pp. 56-77
- Larcher, A.¹ Lee, K.A.² Ma, B.³ Li, H.⁴

20
- 67651002140
- Statistical parametric speech synthesis
- H. Zen, K. Tokuda, and A. W. Black, Statistical parametric speech synthesis, Speech Communication, vol. 51, no. 11, pp. 1039-1064, 2009
- (2009) Speech Communication , vol.51 , Issue.11 , pp. 1039-1064
- Zen, H.¹ Tokuda, K.² Black, A.W.³

21
- 85133720638
- The HMM-based speech synthesis system (HTS) version 2.0
- H. Zen, T. Nose, J. Yamagishi, S. Sako, and K. Tokuda, The HMM-based speech synthesis system (HTS) version 2.0, in Proceedings of Sixth ISCA Workshop on Speech Synthesis, Aug. 2007, pp. 294-299
- (2007) Proceedings of Sixth ISCA Workshop on Speech Synthesis , pp. 294-299
- Zen, H.¹ Nose, T.² Yamagishi, J.³ Sako, S.⁴ Tokuda, K.⁵

22
- 79952258981
- K. Tokuda, H. Zen, J. Yamagishi, T. Masuko, S. Sako, A.B. Black, and T. Nose, The HMM-based speech synthesis system (HTS) Version 2.2, 2011, http://hts.sp.nitech.ac.jp
- (2011) The HMM-based Speech Synthesis System (HTS) Version 2.2
- Tokuda, K.¹ Zen, H.² Yamagishi, J.³ Masuko, T.⁴ Sako, S.⁵ Black, A.B.⁶ Nose, T.⁷

23
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A. Cheveigné, Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds, Speech Commun., vol. 27, pp. 187-207, 1999
- (1999) Speech Commun , vol.27 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² Cheveigné, A.³

24
- 84865777002
- The CSTR/EMIME HTS system for Blizzard Challenge 2010
- J. Yamagishi and O. Watts, The CSTR/EMIME HTS system for Blizzard Challenge 2010, in Proc. Blizzard Challenge 2010, Kyoto, Japan, Sept. 2010
- (2010) Proc. Blizzard Challenge 2010
- Yamagishi, J.¹ Watts, O.²

25
- 44449177634
- A hidden semi-Markov model-based speech synthesis system
- H. Zen, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, A hidden semi-Markov model-based speech synthesis system, IEICE Trans. Inf. & Syst., vol. E90-D, no. 5, pp. 825-834, May 2007
- (2007) IEICE Trans. Inf. & Syst , vol.E90-D , Issue.5 , pp. 825-834
- Zen, H.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

26
- 84894152556
- The voice bank corpus: Design, collection and data analysis of a large regional accent speech database
- Christophe Veaux, Junichi Yamagishi, and Simon King, The voice bank corpus: Design, collection and data analysis of a large regional accent speech database, in Proc. Int. Conf. Oriental COCOSDA, 2013
- (2013) Proc. Int. Conf. Oriental COCOSDA
- Veaux, C.¹ Yamagishi, J.² King, S.³

27
- 84897393748
- Structural Bayesian linear regression for hidden Markov models
- Shinji Watanabe, Atsushi Nakamura, and Biing-Hwang (Fred) Juang, Structural Bayesian linear regression for hidden Markov models, Journal of Signal Processing Systems, vol. 74, no. 3, pp. 341-358, 2014
- (2014) Journal of Signal Processing Systems , vol.74 , Issue.3 , pp. 341-358
- Watanabe, S.¹ Nakamura, A.² Juang, B.-H.³

28
- 38549096029
- A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
- T. Toda and K. Tokuda, A speech parameter generation algorithm considering global variance for HMM-based speech synthesis, IEICE Trans. Inf. & Syst., vol. E90-D, no. 5, pp. 816-824, May 2007
- (2007) IEICE Trans. Inf. & Syst , vol.E90-D , Issue.5 , pp. 816-824
- Toda, T.¹ Tokuda, K.²

29
- 0025543906
- Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
- E. Moulines and F. Charpentier, Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones, Speech Commun., vol. 9, no. 5-6, pp. 453-468, 1990
- (1990) Speech Commun , vol.9 , Issue.5-6 , pp. 453-468
- Moulines, E.¹ Charpentier, F.²

30
- 85016140477
- An adaptive algorithm for Mel-cepstral analysis of speech
- T. Fukada, K. Tokuda, T. Kobayashi, and S. Imai, An adaptive algorithm for Mel-cepstral analysis of speech, in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), Mar. 1992, pp. 137-140
- (1992) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 137-140
- Fukada, T.¹ Tokuda, K.² Kobayashi, T.³ Imai, S.⁴

31
- 57749193836
- Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
- T. Toda, A. W. Black, and K. Tokuda, Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory, IEEE Trans. Audio, Speech and Language Processing, vol. 15, no. 8, pp. 2222-2235, 2007
- (2007) IEEE Trans. Audio, Speech and Language Processing , vol.15 , Issue.8 , pp. 2222-2235
- Toda, T.¹ Black, A.W.² Tokuda, K.³

32
- 84878390910
- Implementation of computationally efficient real-time voice conversion
- T. Toda, T. Muramatsu, and H. Banno, Implementation of computationally efficient real-time voice conversion, in Proc. Interspeech, 2012
- (2012) Proc. Interspeech
- Toda, T.¹ Muramatsu, T.² Banno, H.³

33
- 84856141218
- Voice conversion using dynamic kernel partial least squares regression
- Elina Helander, Hanna Silén, Tuomas Virtanen, and Moncef Gabbouj, Voice conversion using dynamic kernel partial least squares regression, IEEE Trans. Audio, Speech and Language Processing, vol. 20, no. 3, pp. 806-817, 2012
- (2012) IEEE Trans. Audio, Speech and Language Processing , vol.20 , Issue.3 , pp. 806-817
- Helander, E.¹ Silén, H.² Virtanen, T.³ Gabbouj, M.⁴

34
- 78049398713
- Non-parallel training for many-to-many eigenvoice conversion
- Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, Non-parallel training for many-to-many eigenvoice conversion, in Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2010, pp. 4822-4825
- (2010) Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP) , pp. 4822-4825
- Ohtani, Y.¹ Toda, T.² Saruwatari, H.³ Shikano, K.⁴

35
- 0025475528
- ATR Japanese speech database as a tool of speech recognition and synthesis
- A. Kurematsu, K. Takeda, Y. Sagisaka, H. Katagiri, S. Kuwabara, and K. Shikano, ATR Japanese speech database as a tool of speech recognition and synthesis, Speech Communication, vol. 9, pp. 357-363, 1990
- (1990) Speech Communication , vol.9 , pp. 357-363
- Kurematsu, A.¹ Takeda, K.² Sagisaka, Y.³ Katagiri, H.⁴ Kuwabara, S.⁵ Shikano, K.⁶

36
- 84878378722
- Effects of speaker adaptive training on tensor-based arbitrary speaker conversion
- D. Saito, N. Minematsu, and K. Hirose, Effects of speaker adaptive training on tensor-based arbitrary speaker conversion, in Proc. Interspeech, 2012
- (2012) Proc. Interspeech
- Saito, D.¹ Minematsu, N.² Hirose, K.³

37
- 84906276055
- Exemplar-based unit selection for voice conversion utilizing temporal information
- Zhizheng Wu, Tuomas Virtanen, Tomi Kinnunen, Eng Siong Chng, and Haizhou Li, Exemplar-based unit selection for voice conversion utilizing temporal information, in Proc. Interspeech, 2013
- (2013) Proc. Interspeech
- Wu, Z.¹ Virtanen, T.² Kinnunen, T.³ Chng, E.S.⁴ Li, H.⁵

38
- 50249170027
- Joint factor analysis versus eigenchannels in speaker recognition
- Patrick Kenny, Gilles Boulianne, Pierre Ouellet, and Pierre Dumouchel, Joint factor analysis versus eigenchannels in speaker recognition, IEEE Trans. Audio, Speech and Language Processing, vol. 15, no. 4, pp. 1435-1447, 2007
- (2007) IEEE Trans. Audio, Speech and Language Processing , vol.15 , Issue.4 , pp. 1435-1447
- Kenny, P.¹ Boulianne, G.² Ouellet, P.³ Dumouchel, P.⁴

39
- 81855205043
- Probabilistic models for inference about identity
- Peng Li, Yun Fu, Umar Mohammed, James H. Elder, and Simon J.D. Prince, Probabilistic models for inference about identity, IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 34, no. 1, pp. 144-157, January 2012
- (2012) IEEE Trans. on Pattern Analysis and Machine Intelligence , vol.34 , Issue.1 , pp. 144-157
- Li, P.¹ Fu, Y.² Mohammed, U.³ Elder, J.H.⁴ Prince, S.J.D.⁵

40
- 84864277561
- Audioseg: Audio segmentation toolkit, release 1.2
- G. Gravier, M. Betser, and M. Ben, audioseg: Audio segmentation toolkit, release 1.2, IRISA, January 2010
- (2010) IRISA
- Gravier, G.¹ Betser, M.² Ben, M.³

41
- 84865733857
- Analysis of i-vector length normalization in speaker recognition systems
- Daniel Garcia-Romero and Carol Y Espy-Wilson, Analysis of i-vector length normalization in speaker recognition systems, in Proc. Interspeech, 2011
- (2011) Proc. Interspeech
- Garcia-Romero, D.¹ Espy-Wilson, C.Y.²

42
- 79951609039
- Front-end factor analysis for speaker verification
- N. Dehak, P.J. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet, Front-end factor analysis for speaker verification, IEEE Trans. Audio, Speech and Language Processing, vol. 19, no. 4, pp. 788-798, May 2011
- (2011) IEEE Trans. Audio, Speech and Language Processing , vol.19 , Issue.4 , pp. 788-798
- Dehak, N.¹ Kenny, P.J.² Dehak, R.³ Dumouchel, P.⁴ Ouellet, P.⁵

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.