메뉴 건너뛰기




Volumn , Issue , 2010, Pages 151-158

Evaluation of the vulnerability of speaker verification to synthetic speech

Author keywords

[No Author keywords available]

Indexed keywords

SPEECH SYNTHESIS; SYSTEMS ANALYSIS;

EID: 84906233506     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (42)

References (35)
  • 2
    • 85135274466 scopus 로고    scopus 로고
    • On the security of HMM-based speaker verification systems against imposture using synthetic speech
    • T. Masuko, T. Hitotsumatsu, K. Tokuda, and T. Kobayashi, “On the security of HMM-based speaker verification systems against imposture using synthetic speech,” in Proc. EUROSPEECH, 1999.
    • (1999) Proc. EUROSPEECH
    • Masuko, T.1    Hitotsumatsu, T.2    Tokuda, K.3    Kobayashi, T.4
  • 3
    • 0029355724 scopus 로고
    • Likelihood normalization for speaker verification using a phoneme- And speaker-independent model
    • Aug
    • T. Matsui and S. Furui, “Likelihood normalization for speaker verification using a phoneme- and speaker-independent model,” Speech Commun., vol. 17, no. 1-2, pp. 109-116, Aug. 1995.
    • (1995) Speech Commun , vol.17 , Issue.1-2 , pp. 109-116
    • Matsui, T.1    Furui, S.2
  • 5
    • 85009077529 scopus 로고    scopus 로고
    • Imposture using synthetic speech against speaker verification based on spectrum and pitch
    • T. Masuko, K. Tokuda, and T. Kobayashi, “Imposture using synthetic speech against speaker verification based on spectrum and pitch,” in Proc. ICSLP, 2000.
    • (2000) Proc. ICSLP
    • Masuko, T.1    Tokuda, K.2    Kobayashi, T.3
  • 6
    • 0033884858 scopus 로고    scopus 로고
    • Speaker verification using adapted Gaussian mixture models
    • D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, “Speaker verification using adapted gaussian mixture models,” Dig. Sig. Process., vol. 10, pp. 19-41, 2000.
    • (2000) Dig. Sig. Process. , vol.10 , pp. 19-41
    • Reynolds, D.A.1    Quatieri, T.F.2    Dunn, R.B.3
  • 8
    • 65249096207 scopus 로고    scopus 로고
    • Combining derivative and parametric kernels for speaker verification
    • May
    • C. Longworth and M.L.F. Gales, “Combining derivative and parametric kernels for speaker verification,” IEEE Trans. Audio, Speech, and Language Process., vol. 17, no. 4, pp. 748-757, May 2009.
    • (2009) IEEE Trans. Audio, Speech, and Language Process. , vol.17 , Issue.4 , pp. 748-757
    • Longworth, C.1    Gales, M.L.F.2
  • 9
    • 67651002140 scopus 로고    scopus 로고
    • Statistical parametric speech synthesis
    • Nov
    • H. Zen, K. Tokuda, and A. W. Black, “Statistical parametric speech synthesis,” Speech Communication, vol. 51, no. 11, pp. 1039-1064, Nov. 2009.
    • (2009) Speech Communication , vol.51 , Issue.11 , pp. 1039-1064
    • Zen, H.1    Tokuda, K.2    Black, A.W.3
  • 11
    • 84867223798 scopus 로고    scopus 로고
    • Robustness of HMM-based speech synthesis
    • Brisbane, Australia, Sept
    • J. Yamagishi, Z.-H. Ling, and S. King, “Robustness of HMM-based speech synthesis,” in Proc. Interspeech 2008, Brisbane, Australia, Sept. 2008, pp. 581-584.
    • (2008) Proc. Interspeech 2008 , pp. 581-584
    • Yamagishi, J.1    Ling, Z.-H.2    King, S.3
  • 14
    • 67650819492 scopus 로고    scopus 로고
    • The HTS-2008 system: Yet another evaluation of the speaker-adaptive HMM-based speech synthesis system in the 2008 blizzard challenge
    • Sept
    • J. Yamagishi, H. Zen, Y.-J. Wu, T. Toda, and K. Tokuda, “The HTS-2008 system: Yet another evaluation of the speaker-adaptive HMM-based speech synthesis system in the 2008 Blizzard Challenge,” in Proc. Blizzard Challenge 2008, Sept. 2008.
    • (2008) Proc. Blizzard Challenge 2008
    • Yamagishi, J.1    Zen, H.2    Wu, Y.-J.3    Toda, T.4    Tokuda, K.5
  • 17
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. Cheveigné, “Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds,” Speech Communication, vol. 27, pp. 187-207, 1999.
    • (1999) Speech Communication , vol.27 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    Cheveigné, A.3
  • 18
    • 33846405723 scopus 로고    scopus 로고
    • Details of Nitech HMM-based speech synthesis system for the blizzard challenge 2005
    • Jan
    • H. Zen, T. Toda, M. Nakamura, and K. Tokuda, “Details of Nitech HMM-based speech synthesis system for the Blizzard Challenge 2005,” IEICE Trans. Inf. & Syst., vol. E90-D, no. 1, pp. 325-333, Jan. 2007.
    • (2007) IEICE Trans. Inf. & Syst. , vol.90 , Issue.1 , pp. 325-333
    • Zen, H.1    Toda, T.2    Nakamura, M.3    Tokuda, K.4
  • 19
    • 44449177634 scopus 로고    scopus 로고
    • A hidden semi-Markov model-based speech synthesis system
    • May
    • H. Zen, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, “A hidden semi-Markov model-based speech synthesis system,” IEICE Trans. Inf. & Syst., vol. E90-D, no. 5, pp. 825-834, May 2007.
    • (2007) IEICE Trans. Inf. & Syst. , vol.90 , Issue.5 , pp. 825-834
    • Zen, H.1    Tokuda, K.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 21
    • 0033906251 scopus 로고    scopus 로고
    • MDL-based context-dependent subword modeling for speech recognition
    • Mar
    • K. Shinoda and T. Watanabe, “MDL-based context-dependent subword modeling for speech recognition,” J. Acoust. Soc. Japan (E), vol. 21, pp. 79-86, Mar. 2000.
    • (2000) J. Acoust. Soc. Japan (E) , vol.21 , pp. 79-86
    • Shinoda, K.1    Watanabe, T.2
  • 23
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M.J.F. Gales, “Maximum likelihood linear transformations for HMM-based speech recognition,” Computer Speech and Language, vol. 12, no. 2, pp. 75-98, 1998.
    • (1998) Computer Speech and Language , vol.12 , Issue.2 , pp. 75-98
    • Gales, M.J.F.1
  • 24
    • 67650854725 scopus 로고    scopus 로고
    • Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm
    • 1
    • J. Yamagishi, T. Kobayashi, Y. Nakano, K. Ogata, and J. Isogai, “Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm,” IEEE Trans. Speech, Audio & Language Process., vol. 17, no. 1, pp. 66-83, 1 2009.
    • (2009) IEEE Trans. Speech, Audio & Language Process. , vol.17 , Issue.1 , pp. 66-83
    • Yamagishi, J.1    Kobayashi, T.2    Nakano, Y.3    Ogata, K.4    Isogai, J.5
  • 25
    • 33847129573 scopus 로고    scopus 로고
    • Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training
    • Feb
    • J. Yamagishi and T. Kobayashi, “Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training,” IEICE Trans. Inf. & Syst., vol. E90-D, no. 2, pp. 533-543, Feb. 2007.
    • (2007) IEICE Trans. Inf. & Syst. , vol.90 , Issue.2 , pp. 533-543
    • Yamagishi, J.1    Kobayashi, T.2
  • 26
    • 38549096029 scopus 로고    scopus 로고
    • A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
    • May
    • T. Toda and K. Tokuda, “A speech parameter generation algorithm considering global variance for HMM-based speech synthesis,” IEICE Trans. Inf. & Syst., vol. E90-D, no. 5, pp. 816-824, May 2007.
    • (2007) IEICE Trans. Inf. & Syst. , vol.90 , Issue.5 , pp. 816-824
    • Toda, T.1    Tokuda, K.2
  • 27
    • 0025543906 scopus 로고
    • Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
    • E. Moulines and F. Charpentier, “Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones,” Speech Communication, vol. 9, no. 5-6, pp. 453-468, 1990.
    • (1990) Speech Communication , vol.9 , Issue.5-6 , pp. 453-468
    • Moulines, E.1    Charpentier, F.2
  • 28
    • 85016140477 scopus 로고
    • An adaptive algorithm for mel-cepstral analysis of speech
    • Mar
    • T. Fukada, K. Tokuda, T. Kobayashi, and S. Imai, “An adaptive algorithm for mel-cepstral analysis of speech,” in Proc. ICASSP-92, Mar. 1992, pp. 137-140.
    • (1992) Proc. ICASSP-92 , pp. 137-140
    • Fukada, T.1    Tokuda, K.2    Kobayashi, T.3    Imai, S.4
  • 29
    • 85073258179 scopus 로고    scopus 로고
    • Feature warping for robust speaker verification
    • J. Pelecanos and S. Sridharan, “Feature warping for robust speaker verification,” in Proc. ODYSSEY, 2001.
    • (2001) Proc. ODYSSEY
    • Pelecanos, J.1    Sridharan, S.2
  • 33
    • 0023704929 scopus 로고
    • Normalizations and selection of speech segments for speaker recognition scoring
    • April
    • K. P. Li and J. E. Porter, “Normalizations and selection of speech segments for speaker recognition scoring,” Proc. IEEE. Int. Conf. Acoustics, Speech and Signal Processing, vol. 1, pp. 595-598, April 1988.
    • (1988) Proc. IEEE. Int. Conf. Acoustics, Speech and Signal Processing , vol.1 , pp. 595-598
    • Li, K.P.1    Porter, J.E.2
  • 34
    • 0033884857 scopus 로고    scopus 로고
    • Score normalization for test-independent speaker verification system
    • R. Auckenthaler, M. Carey, and H. Lloyd-Thomas, “Score normalization for test-independent speaker verification system,” Digital Signal Processing, vol. 10, no. 1, pp. 42-54, 2000.
    • (2000) Digital Signal Processing , vol.10 , Issue.1 , pp. 42-54
    • Auckenthaler, R.1    Carey, M.2    Lloyd-Thomas, H.3
  • 35
    • 85009119461 scopus 로고    scopus 로고
    • A robust speaker verification system against imposture using an HMM-based speech synthesis system
    • T. Satoh, T. Masuko, T. Kobayashi, and K. Tokuda, “A robust speaker verification system against imposture using an HMM-based speech synthesis system,” in Proc. Eurospeech, 2001.
    • (2001) Proc. Eurospeech
    • Satoh, T.1    Masuko, T.2    Kobayashi, T.3    Tokuda, K.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.