Volume 2015-August, 2015, Pages 4440-4444

SAS: A speaker verification spoofing database containing diverse attacks

Author keywords

Database; security; speaker verification; speech synthesis; spoofing attack; voice conversion

Indexed keywords

DATABASE SYSTEMS; SPEECH COMMUNICATION; SPEECH PROCESSING; SPEECH RECOGNITION; SPEECH SYNTHESIS

EID: 84946070918     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2015.7178810     Document Type: Conference Paper
Times cited: 76

References (42)
  • 3
    • 84919922238
    • Spoofing and countermeasures for speaker verification: A survey
    • Zhizheng Wu, Nicholas Evans, Tomi Kinnunen, Junichi Yamagishi, Federico Alegre, and Haizhou Li, Spoofing and countermeasures for speaker verification: a survey, Speech Communication, vol. 66, pp. 130-153, 2015
    • (2015) Speech Communication , vol.66 , pp. 130-153
    • Wu, Z.1    Evans, N.2    Kinnunen, T.3    Yamagishi, J.4    Alegre, F.5    Li, H.6
  • 10
    • 65349113532
    • Artificial impostor voice transformation effects on false acceptance rates
    • Jean-François Bonastre, Driss Matrouf, and Corinne Fredouille, Artificial impostor voice transformation effects on false acceptance rates, in Proc. Interspeech, 2007
    • (2007) Proc. Interspeech
    • Bonastre, J.1    Matrouf, D.2    Fredouille, C.3
  • 13
    • 84878412793
    • Spoofing countermeasures for the protection of automatic speaker recognition systems against attacks with artificial signals
    • Federico Alegre, Ravichander Vipperla, Nicholas Evans, et al., Spoofing countermeasures for the protection of automatic speaker recognition systems against attacks with artificial signals, in Proc. Interspeech, 2012
    • (2012) Proc. Interspeech
    • Alegre, F.1    Vipperla, R.2    Evans, N.3
  • 14
    • 84906234851
    • Voice transformation-based spoofing of text-dependent speaker verification systems
    • Zvi Kons and Hagai Aronowitz, Voice transformation-based spoofing of text-dependent speaker verification systems, in Proc. Interspeech, 2013
    • (2013) Proc. Interspeech
    • Kons, Z.1    Aronowitz, H.2
  • 18
    • 84906275384
    • Vulnerability evaluation of speaker verification under voice conversion spoofing: The effect of text constraints
    • Zhizheng Wu, Anthony Larcher, Kong Aik Lee, Eng Siong Chng, Tomi Kinnunen, and Haizhou Li, Vulnerability evaluation of speaker verification under voice conversion spoofing: the effect of text constraints, in Proc. Interspeech, 2013
    • (2013) Proc. Interspeech
    • Wu, Z.1    Larcher, A.2    Lee, K.A.3    Chng, E.S.4    Kinnunen, T.5    Li, H.6
  • 19
    • 84897385841
    • Text-dependent speaker verification: Classifiers, databases and RSR2015
    • A. Larcher, K. A. Lee, B. Ma, and H. Li, Text-dependent speaker verification: Classifiers, databases and RSR2015, Speech Communication, vol. 60, pp. 56-77, 2014
    • (2014) Speech Communication , vol.60 , pp. 56-77
    • Larcher, A.1    Lee, K.A.2    Ma, B.3    Li, H.4
  • 20
    • 67651002140
    • Statistical parametric speech synthesis
    • H. Zen, K. Tokuda, and A. W. Black, Statistical parametric speech synthesis, Speech Communication, vol. 51, no. 11, pp. 1039-1064, 2009
    • (2009) Speech Communication , vol.51 , Issue.11 , pp. 1039-1064
    • Zen, H.1    Tokuda, K.2    Black, A.W.3
  • 23
    • 0032673049
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds, Speech Commun., vol. 27, pp. 187-207, 1999
    • (1999) Speech Commun , vol.27 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    de Cheveigné, A.3
  • 24
    • 84865777002
    • The CSTR/EMIME HTS system for Blizzard Challenge 2010
    • J. Yamagishi and O. Watts, The CSTR/EMIME HTS system for Blizzard Challenge 2010, in Proc. Blizzard Challenge 2010, Kyoto, Japan, Sept. 2010
    • (2010) Proc. Blizzard Challenge 2010
    • Yamagishi, J.1    Watts, O.2
  • 25
    • 44449177634
    • A hidden semi-Markov model-based speech synthesis system
    • H. Zen, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, A hidden semi-Markov model-based speech synthesis system, IEICE Trans. Inf. & Syst., vol. E90-D, no. 5, pp. 825-834, May 2007
    • (2007) IEICE Trans. Inf. & Syst , vol.E90-D , Issue.5 , pp. 825-834
    • Zen, H.1    Tokuda, K.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 26
    • 84894152556
    • The voice bank corpus: Design, collection and data analysis of a large regional accent speech database
    • Christophe Veaux, Junichi Yamagishi, and Simon King, The voice bank corpus: Design, collection and data analysis of a large regional accent speech database, in Proc. Int. Conf. Oriental COCOSDA, 2013
    • (2013) Proc. Int. Conf. Oriental COCOSDA
    • Veaux, C.1    Yamagishi, J.2    King, S.3
  • 27
    • 84897393748
    • Structural Bayesian linear regression for hidden Markov models
    • Shinji Watanabe, Atsushi Nakamura, and Biing-Hwang (Fred) Juang, Structural Bayesian linear regression for hidden Markov models, Journal of Signal Processing Systems, vol. 74, no. 3, pp. 341-358, 2014
    • (2014) Journal of Signal Processing Systems , vol.74 , Issue.3 , pp. 341-358
    • Watanabe, S.1    Nakamura, A.2    Juang, B.-H.3
  • 28
    • 38549096029
    • A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
    • T. Toda and K. Tokuda, A speech parameter generation algorithm considering global variance for HMM-based speech synthesis, IEICE Trans. Inf. & Syst., vol. E90-D, no. 5, pp. 816-824, May 2007
    • (2007) IEICE Trans. Inf. & Syst , vol.E90-D , Issue.5 , pp. 816-824
    • Toda, T.1    Tokuda, K.2
  • 29
    • 0025543906
    • Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
    • E. Moulines and F. Charpentier, Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones, Speech Commun., vol. 9, no. 5-6, pp. 453-468, 1990
    • (1990) Speech Commun , vol.9 , Issue.5-6 , pp. 453-468
    • Moulines, E.1    Charpentier, F.2
  • 31
    • 57749193836
    • Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
    • T. Toda, A. W. Black, and K. Tokuda, Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory, IEEE Trans. Audio, Speech and Language Processing, vol. 15, no. 8, pp. 2222-2235, 2007
    • (2007) IEEE Trans. Audio, Speech and Language Processing , vol.15 , Issue.8 , pp. 2222-2235
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 32
    • 84878390910
    • Implementation of computationally efficient real-time voice conversion
    • T. Toda, T. Muramatsu, and H. Banno, Implementation of computationally efficient real-time voice conversion, in Proc. Interspeech, 2012
    • (2012) Proc. Interspeech
    • Toda, T.1    Muramatsu, T.2    Banno, H.3
  • 36
    • 84878378722
    • Effects of speaker adaptive training on tensor-based arbitrary speaker conversion
    • D. Saito, N. Minematsu, and K. Hirose, Effects of speaker adaptive training on tensor-based arbitrary speaker conversion, in Proc. Interspeech, 2012
    • (2012) Proc. Interspeech
    • Saito, D.1    Minematsu, N.2    Hirose, K.3
  • 37
    • 84906276055
    • Exemplar-based unit selection for voice conversion utilizing temporal information
    • Zhizheng Wu, Tuomas Virtanen, Tomi Kinnunen, Eng Siong Chng, and Haizhou Li, Exemplar-based unit selection for voice conversion utilizing temporal information, in Proc. Interspeech, 2013
    • (2013) Proc. Interspeech
    • Wu, Z.1    Virtanen, T.2    Kinnunen, T.3    Chng, E.S.4    Li, H.5
  • 40
    • 84864277561
    • Audioseg: Audio segmentation toolkit, release 1.2
    • G. Gravier, M. Betser, and M. Ben, audioseg: Audio segmentation toolkit, release 1.2, IRISA, January 2010
    • (2010) IRISA
    • Gravier, G.1    Betser, M.2    Ben, M.3
  • 41
    • 84865733857
    • Analysis of i-vector length normalization in speaker recognition systems
    • Daniel Garcia-Romero and Carol Y. Espy-Wilson, Analysis of i-vector length normalization in speaker recognition systems, in Proc. Interspeech, 2011
    • (2011) Proc. Interspeech
    • Garcia-Romero, D.1    Espy-Wilson, C.Y.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.