SCOPUS 정보 검색 플랫폼

2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2013

Volumn , Issue , 2013, Pages

Voice conversion and spoofing attack on speaker verification systems

(2) Wu, Zhizheng a Li, Haizhou a,b

a NANYANG TECHNOLOGICAL UNIVERSITY (Singapore)

b INSTITUTE FOR INFOCOMM RESEARCH (Singapore)

Author keywords

[No Author keywords available]

Indexed keywords

ANTI-SPOOFING; MARKET ADOPTION; ONLINE COMMERCE; SPEAKER VERIFICATION; SPEAKER VERIFICATION SYSTEM; SPOOFING ATTACKS; USER AUTHENTICATION; VOICE CONVERSION;

AUTHENTICATION; DATA PROCESSING; SPEECH PROCESSING;

SPEECH RECOGNITION;

EID: 84893302435 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/APSIPA.2013.6694344 Document Type: Conference Paper

Times cited : (57)

References (81)

1
- 84893339015
- Speaker verification makes its debut in smartphone
- February
- Kong Aik Lee, Bin Ma, and Haizhou Li, "Speaker verification makes its debut in smartphone," in IEEE Signal Processing Society Speech and language Technical Committee Newsletter, February 2013.
- (2013) IEEE Signal Processing Society Speech and Language Technical Committee Newsletter
- Lee, K.A.¹ Ma, B.² Li, H.³

2
- 33751542948
- Speaker verification security improvement by means of speech watermarking
- DOI 10.1016/j.specom.2006.06.010, PII S0167639306000653
- Marcos Faundez-Zanuy, Martin Hagmüller, and Gernot Kubin, "Speaker verification security improvement by means of speech watermarking," Speech communication, vol. 48, no. 12, pp. 1608-1619, 2006. (Pubitemid 44829871)
- (2006) Speech Communication , vol.48 , Issue.12 , pp. 1608-1619
- Faundez-Zanuy, M.¹ Hagmuller, M.² Kubin, G.³

3
- 85135261394
- Vulnerability in speaker verification-A study of technical impostor techniques
- Johan Lindberg, Mats Blomberg, et al., "Vulnerability in speaker verification-a study of technical impostor techniques," in Proc. The European Conference on Speech Communication and Technology, 1999.
- (1999) Proc. The European Conference on Speech Communication and Technology
- Lindberg, J.¹ Blomberg, M.²

4
- 84867605072
- Speaker verification performance degradation against spoofing and tampering attacks
- Jesús Villalba and Eduardo Lleida, "Speaker verification performance degradation against spoofing and tampering attacks," in FALA 10 workshop, 2010.
- (2010) FALA 10 Workshop
- Villalba, J.¹ Lleida, E.²

5
- 14544274085
- Vulnerability of speaker verification to voice mimicking
- Yee Wah Lau, Michael Wagner, and Dat Tran, "Vulnerability of speaker verification to voice mimicking," in Proc. International Symposium on Intelligent Multimedia, Video and Speech Processing, 2004.
- (2004) Proc. International Symposium on Intelligent Multimedia, Video and Speech Processing
- Wah Lau, Y.¹ Wagner, M.² Tran, D.³

6
- 85084013778
- How vulnerable are prosodic features to professional imitators?
- Mireia Farrús, Michael Wagner, Jan Anguita, and Javier Hernando, "How vulnerable are prosodic features to professional imitators?," in The Speaker and Language Recognition Workshop (Odyssey 2008), 2008.
- (2008) The Speaker and Language Recognition Workshop (Odyssey 2008)
- Farrús, M.¹ Wagner, M.² Anguita, J.³ Hernando, J.⁴

7
- 84893295904
- I-vectors meet imitators: On vulnerability of speaker verification systems against voice mimicry
- Rosa González Hautamäki, Tomi Kinnunen, Ville Hautamäki, Timo Leino, and Anne-Maria Laukkanen, "I-vectors meet imitators: on vulnerability of speaker verification systems against voice mimicry," in Proc. Interspeech.
- Proc. Interspeech
- Hautamäki, R.G.¹ Kinnunen, T.² Hautamäki, V.³ Leino, T.⁴ Laukkanen, A.-M.⁵

8
- 0029765811
- Unit selection in a concatenative speech synthesis system using a large speech database
- Andrew J Hunt and A.W. Black, "Unit selection in a concatenative speech synthesis system using a large speech database," in Proc. ICASSP, 1996.
- (1996) Proc. ICASSP
- Hunt, A.J.¹ Black, A.W.²

9
- 67651002140
- Statistical parametric speech synthesis
- Heiga Zen, Keiichi Tokuda, and A.W. Black, "Statistical parametric speech synthesis," Speech Communication, vol. 51, no. 11, pp. 1039-1064, 2009.
- (2009) Speech Communication , vol.51 , Issue.11 , pp. 1039-1064
- Zen, H.¹ Tokuda, K.² Black, A.W.³

10
- 84871382567
- A unified trajectory tiling approach to high quality speech rendering
- Yao Qian, Frank K Soong, and Zhi-Jie Yan, "A unified trajectory tiling approach to high quality speech rendering," IEEE transactions on audio, speech, and language processing, vol. 21, no. 1-2, pp. 280-290, 2013.
- (2013) IEEE Transactions on Audio, Speech, and Language Processing , vol.21 , Issue.1-2 , pp. 280-290
- Qian, Y.¹ Soong, F.K.² Yan, Z.-J.³

11
- 85135274466
- On the security of HMM-based speaker verification systems against imposture using synthetic speech
- Takashi Masuko, Takafumi Hitotsumatsu, Keiichi Tokuda, and Takao Kobayashi, "On the security of HMM-based speaker verification systems against imposture using synthetic speech," in Proc. EUROSPEECH, 1999.
- (1999) Proc. Eurospeech
- Masuko, T.¹ Hitotsumatsu, T.² Tokuda, K.³ Kobayashi, T.⁴

12
- 85009077529
- Imposture using synthetic speech against speaker verification based on spectrum and pitch
- Takashi Masuko, Keiichi Tokuda, and Takao Kobayashi, "Imposture using synthetic speech against speaker verification based on spectrum and pitch," in Proc. ICSLP, 2000.
- (2000) Proc. ICSLP
- Masuko, T.¹ Tokuda, K.² Kobayashi, T.³

13
- 85009119461
- A robust speaker verification system against imposture using an HMMbased speech synthesis system
- Takayuki Satoh, Takashi Masuko, Takao Kobayashi, and Keiichi Tokuda, "A robust speaker verification system against imposture using an HMMbased speech synthesis system," in Proc. Eurospeech, 2001.
- (2001) Proc. Eurospeech
- Satoh, T.¹ Masuko, T.² Kobayashi, T.³ Tokuda, K.⁴

14
- 84865369980
- Evaluation of speaker verification security and detection of HMM-based synthetic speech
- Phillip L De Leon, Michael Pucher, Junichi Yamagishi, Inma Hernaez, and Ibon Saratxaga, "Evaluation of speaker verification security and detection of HMM-based synthetic speech," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 8, pp. 2280-2290, 2012.
- (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , Issue.8 , pp. 2280-2290
- De Leon, P.L.¹ Pucher, M.² Yamagishi, J.³ Hernaez, I.⁴ Saratxaga, I.⁵

15
- 67650854725
- Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm
- Junichi Yamagishi, Takao Kobayashi, Yuji Nakano, Katsumi Ogata, and Juri Isogai, "Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm," IEEE Transactions on Audio, Speech, and Language Processing, vol. 17, no. 1, pp. 66-83, 2009.
- (2009) IEEE Transactions on Audio, Speech, and Language Processing , vol.17 , Issue.1 , pp. 66-83
- Yamagishi, J.¹ Kobayashi, T.² Nakano, Y.³ Ogata, K.⁴ Isogai, J.⁵

16
- 70350125882
- An overview of text-independent speaker recognition: From features to supervectors
- January
- Tomi Kinnunen and Haizhou Li, "An overview of text-independent speaker recognition: from features to supervectors," Speech Communication, vol. 52, no. 1, pp. 12-40, January 2010.
- (2010) Speech Communication , vol.52 , Issue.1 , pp. 12-40
- Kinnunen, T.¹ Li, H.²

17
- 0033884858
- Speaker verification using adapted Gaussian mixture models
- DOI 10.1006/dspr.1999.0361
- Douglas A Reynolds, Thomas F Quatieri, and Robert B Dunn, "Speaker verification using adapted gaussian mixture models," Digital signal processing, vol. 10, no. 1, pp. 19-41, 2000. (Pubitemid 30592166)
- (2000) Digital Signal Processing: A Review Journal , vol.10 , Issue.1 , pp. 19-41
- Reynolds, D.A.¹ Quatieri, T.F.² Dunn, R.B.³

18
- 33645887246
- Support vector machines using GMM supervectors for speaker verification
- William M Campbell, Douglas E Sturim, and Douglas A Reynolds, "Support vector machines using GMM supervectors for speaker verification," IEEE Signal Processing Letters, vol. 13, no. 5, pp. 308-311, 2006.
- (2006) IEEE Signal Processing Letters , vol.13 , Issue.5 , pp. 308-311
- Campbell, W.M.¹ Sturim, D.E.² Reynolds, D.A.³

19
- 84872169719
- Advances in channel compensation for SVM speaker recognition
- Alex Solomonoff, William M Campbell, and Ian Boardman, "Advances in channel compensation for SVM speaker recognition," in Proc. ICASSP.
- Proc. ICASSP
- Solomonoff, A.¹ Campbell, W.M.² Boardman, I.³

20
- 58349102016
- Analysis of feature extraction and channel compensation in a GMM speaker recognition system
- Lukas Burget, Pavel Matejka, Petr Schwarz, Ondrej Glembek, and Jan Cernocky, "Analysis of feature extraction and channel compensation in a GMM speaker recognition system," IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 7, pp. 1979-1986, 2007.
- (2007) IEEE Transactions on Audio, Speech, and Language Processing , vol.15 , Issue.7 , pp. 1979-1986
- Burget, L.¹ Matejka, P.² Schwarz, P.³ Glembek, O.⁴ Cernocky, J.⁵

21
- 44949114401
- Within-class covariance normalization for SVM-based speaker recognition
- Andrew O Hatch, Sachin Kajarekar, and Andreas Stolcke, "Within-class covariance normalization for SVM-based speaker recognition," in Proc. ICSLP, 2006.
- (2006) Proc. ICSLP
- Hatch, A.O.¹ Kajarekar, S.² Stolcke, A.³

22
- 33947637189
- Joint factor analysis of speaker and session variability: Theory and algorithms
- P. Kenny, "Joint factor analysis of speaker and session variability: theory and algorithms," technical report CRIM-06/08-14, 2006.
- (2006) Technical Report CRIM-06/08-14
- Kenny, P.¹

23
- 43249091937
- Speaker and session variability in GMM-based speaker verification
- May
- P. Kenny, G. Boulianne, P. Ouellet, and P. Dumouchel, "Speaker and session variability in GMM-based speaker verification," IEEE Transactions on Audio, Speech and Language Processing, vol. 15, no. 4, pp. 1448-1460, May 2007.
- (2007) IEEE Transactions on Audio, Speech and Language Processing , vol.15 , Issue.4 , pp. 1448-1460
- Kenny, P.¹ Boulianne, G.² Ouellet, P.³ Dumouchel, P.⁴

24
- 84886367915
- Bayesian speaker verification with heavy tailed priors
- Patrick Kenny, "Bayesian speaker verification with heavy tailed priors," in Speaker and Language Recognition Workshop (IEEE Odyssey), 2010.
- (2010) Speaker and Language Recognition Workshop ( IEEE Odyssey)
- Kenny, P.¹

25
- 79951609039
- Frontend factor analysis for speaker verification
- May
- N. Dehak, P.J. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet, "Frontend factor analysis for speaker verification," IEEE Transactions on Audio, Speech and Language Processing, vol. 19, no. 4, pp. 788-798, May 2011.
- (2011) IEEE Transactions on Audio, Speech and Language Processing , vol.19 , Issue.4 , pp. 788-798
- Dehak, N.¹ Kenny, P.J.² Dehak, R.³ Dumouchel, P.⁴ Ouellet, P.⁵

26
- 84877743396
- Sparse classifier fusion for speaker verification
- V. Hautamaki, T. Kinnunen, F. Sedlak, Kong Aik Lee, Bin Ma, and Haizhou Li, "Sparse classifier fusion for speaker verification," IEEE Transactions on Audio, Speech, and Language Processing, vol. 21, no. 8, pp. 1622-1631, 2013.
- (2013) IEEE Transactions on Audio, Speech, and Language Processing , vol.21 , Issue.8 , pp. 1622-1631
- Hautamaki, V.¹ Kinnunen, T.² Sedlak, F.³ Lee, K.A.⁴ Ma, B.⁵ Li, H.⁶

27
- 84890540706
- CRSS systems for 2012 nist speaker recognition evaluation
- Taufiq Hasan, Seyed Omid Sadjadi, Gang Liu, Navid Shokouhi, Hynek Boril, and John HL Hansen, "CRSS systems for 2012 nist speaker recognition evaluation," in Proc. ICASSP, 2013.
- (2013) Proc. ICASSP
- Hasan, T.¹ Sadjadi, S.O.² Liu, G.³ Shokouhi, N.⁴ Boril, H.⁵ Hansen, J.H.L.⁶

28
- 84890510678
- Improving speaker identification robustness to highly channel-degraded speech through multiple system fusion
- Mitchell McLaren, Nicolas Scheffer, Martin Graciarena, Luciana Ferrer, and Yun Lei, "Improving speaker identification robustness to highly channel-degraded speech through multiple system fusion," in Proc. ICASSP, 2013.
- (2013) Proc. ICASSP
- McLaren, M.¹ Scheffer, N.² Graciarena, M.³ Ferrer, L.⁴ Lei, Y.⁵

29
- 0029254176
- Transformation of formants for voice conversion using artificial neural networks
- M. Narendranath, H.A. Murthy, S. Rajendran, and B. Yegnanarayana, "Transformation of formants for voice conversion using artificial neural networks," Speech communication, vol. 16, no. 2, pp. 207-216, 1995.
- (1995) Speech Communication , vol.16 , Issue.2 , pp. 207-216
- Narendranath, M.¹ Murthy, H.A.² Rajendran, S.³ Yegnanarayana, B.⁴

30
- 0031623661
- Spectral voice conversion for text-to-speech synthesis
- Alexander Kain and Michael W Macon, "Spectral voice conversion for text-to-speech synthesis," in Proc. ICASSP, 1998.
- (1998) Proc. ICASSP
- Kain, A.¹ Macon, M.W.²

31
- 0032026483
- Continuous probabilistic transform for voice conversion
- PII S1063667698017386
- Yannis Stylianou, Olivier Cappé, and Eric Moulines, "Continuous probabilistic transform for voice conversion," IEEE Transactions on Speech and Audio Processing, vol. 6, no. 2, pp. 131-142, 1998. (Pubitemid 128720639)
- (1998) IEEE Transactions on Speech and Audio Processing , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappe, O.² Moulines, E.³

32
- 57749193836
- Voice conversion based on maximum-likelihood estimation of spectral parameter trajec tory
- Tomoki Toda, A.W. Black, and Keiichi Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajec tory," IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 8, pp. 2222-2235, 2007.
- (2007) IEEE Transactions on Audio, Speech, and Language Processing , vol.15 , Issue.8 , pp. 2222-2235
- Toda, T.¹ Black, A.W.² Tokuda, K.³

33
- 85009212516
- Transforming F0 contours
- B Gillet and S King, "Transforming F0 contours," in Proc. Eurospeech, 2003.
- (2003) Proc. Eurospeech
- Gillet, B.¹ King, S.²

34
- 34047247202
- Voice conversion using duration-embedded Bi-HMMs for expressive speech synthesis
- DOI 10.1109/TASL.2006.876112
- Chung-Hsien Wu, Chi-Chun Hsia, Te-Hsien Liu, and Jhing-Fa Wang, "Voice conversion using duration-embedded bi-HMMs for expressive speech synthesis," IEEE Transactions on Audio, Speech, and Language Processing, vol. 14, no. 4, pp. 1109-1116, 2006. (Pubitemid 46547608)
- (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.4 , pp. 1109-1116
- Wu, C.-H.¹ Hsia, C.-C.² Liu, T.-H.³ Wang, J.-F.⁴

35
- 34547520011
- A novel method for prosody prediction in voice conversion
- Elina E Helander and Jani Nurminen, "A novel method for prosody prediction in voice conversion," in ICASSP, 2007.
- (2007) ICASSP
- Helander, E.E.¹ Nurminen, J.²

36
- 79959842826
- Text-independent F0 transformation with non-parallel data for voice conversion
- Z.-Z. Wu, Tomi Kinnunen, E.S. Chng, and Haizhou Li, "Text- independent F0 transformation with non-parallel data for voice conversion," in Proc. Interspeech, 2010.
- (2010) Proc. Interspeech
- Wu, Z.-Z.¹ Kinnunen, T.² Chng, E.S.³ Li, H.⁴

37
- 77953726259
- Pitch and duration transformation with non-parallel data
- Damien Lolive, Nelly Barbot, and Olivier Boeffard, "Pitch and duration transformation with non-parallel data," Proc. Speech Prosody, pp. 111-114, 2008.
- (2008) Proc. Speech Prosody , pp. 111-114
- Lolive, D.¹ Barbot, N.² Boeffard, O.³

38
- 84867198185
- On the impact of alignment on voice conversion performance
- Elina Helander, Jan Schwarz, Jani Nurminen, Hanna Silen, and Moncef Gabbouj, "On the impact of alignment on voice conversion performance," in Proc. Interspeech, 2008.
- (2008) Proc. Interspeech
- Helander, E.¹ Schwarz, J.² Nurminen, J.³ Silen, H.⁴ Gabbouj, M.⁵

39
- 77953725318
- INCA algorithm for training voice conversion systems from nonparallel corpora
- Daniel Erro, Asunción Moreno, and Antonio Bonafonte, "INCA algorithm for training voice conversion systems from nonparallel corpora," IEEE Transactions on Audio, Speech, and Language Processing, vol. 18, no. 5, pp. 944-953, 2010.
- (2010) IEEE Transactions on Audio, Speech, and Language Processing , vol.18 , Issue.5 , pp. 944-953
- Erro, D.¹ Moreno, A.² Bonafonte, A.³

40
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- Hideki Kawahara, Ikuyo Masuda-Katsuse, and Alain de Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech communication, vol. 27, no. 3, pp. 187-207, 1999.
- (1999) Speech Communication , vol.27 , Issue.3 , pp. 187-207
- Kawahara, H.¹ Katsuse, I.M.² De Cheveigné, A.³

41
- 0035127703
- Applying the harmonic plus noise model in concatenative speech synthesis
- DOI 10.1109/89.890068
- Yannis Stylianou, "Applying the harmonic plus noise model in concatenative speech synthesis," IEEE Transactions on Speech and Audio Processing, vol. 9, no. 1, pp. 21-29, 2001. (Pubitemid 32130684)
- (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , Issue.1 , pp. 21-29
- Stylianou, Y.¹

42
- 77953712499
- Voice conversion using partial least squares regression
- Elina Helander, Tuomas Virtanen, Jani Nurminen, and Moncef Gabbouj, "Voice conversion using partial least squares regression," IEEE Transactions on Audio, Speech, and Language Processing, vol. 18, no. 5, pp. 912-921, 2010.
- (2010) IEEE Transactions on Audio, Speech, and Language Processing , vol.18 , Issue.5 , pp. 912-921
- Helander, E.¹ Virtanen, T.² Nurminen, J.³ Gabbouj, M.⁴

43
- 78149260085
- Continuous stochastic feature mapping based on trajectory HMMs
- Heiga Zen, Yoshihiko Nankaku, and Keiichi Tokuda, "Continuous stochastic feature mapping based on trajectory HMMs," IEEE Transactions on Audio, Speech, and Language Processing, vol. 19, no. 2, pp. 417-430, 2011.
- (2011) IEEE Transactions on Audio, Speech, and Language Processing , vol.19 , Issue.2 , pp. 417-430
- Zen, H.¹ Nankaku, Y.² Tokuda, K.³

44
- 84859768504
- Statistical voice conversion based on noisy channel model
- Daisuke Saito, Shinji Watanabe, Atsushi Nakamura, and Nobuaki Minematsu, "Statistical voice conversion based on noisy channel model," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 6, pp. 1784-1794, 2012.
- (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , Issue.6 , pp. 1784-1794
- Saito, D.¹ Watanabe, S.² Nakamura, A.³ Minematsu, N.⁴

45
- 85131821539
- Mel-generalized cepstral analysis-A unified approach to speech spectral estimation
- Keiichi Tokuda, Takao Kobayashi, Takashi Masuko, and Satoshi Imai, "Mel-generalized cepstral analysis-a unified approach to speech spectral estimation.," in Proc. ICSLP, 1994.
- (1994) Proc. ICSLP
- Tokuda, K.¹ Kobayashi, T.² Masuko, T.³ Imai, S.⁴

46
- 51449107658
- LSF mapping for voice conversion with very small training sets
- Elina Helander, Jani Nurminen, and Moncef Gabbouj, "LSF mapping for voice conversion with very small training sets," in Proc. ICASSP, 2008.
- (2008) Proc. ICASSP
- Helander, E.¹ Nurminen, J.² Gabbouj, M.³

47
- 84867594339
- Local linear transformation for voice conversion
- Victor Popa, Hanna Silen, Jani Nurminen, and Moncef Gabbouj, "Local linear transformation for voice conversion," in Proc. ICASSP, 2012.
- (2012) Proc. ICASSP
- Popa, V.¹ Silen, H.² Nurminen, J.³ Gabbouj, M.⁴

48
- 0021157408
- Line spectrum pair (LSP) and speech data compression
- Frank Soong and Biing-Hwang Juang, "Line spectrum pair (LSP) and speech data compression," in Proc. ICASSP, 1984.
- (1984) Proc. ICASSP
- Soong, F.¹ Juang, B.-H.²

49
- 0023739214
- Voice conversion through vector quantization
- Masanobu Abe, Satoshi Nakamura, Kiyohiro Shikano, and Hisao Kuwabara, "Voice conversion through vector quantization," in Proc. ICASSP, 1988.
- (1988) Proc. ICASSP
- Abe, M.¹ Nakamura, S.² Shikano, K.³ Kuwabara, H.⁴

50
- 0033154052
- Speaker transformation algorithm using segmental codebooks (STASC)
- Levent M Arslan, "Speaker transformation algorithm using segmental codebooks (STASC)," Speech Communication, vol. 28, no. 3, pp. 211-226, 1999.
- (1999) Speech Communication , vol.28 , Issue.3 , pp. 211-226
- Arslan, L.M.¹

51
- 0026394044
- Speaker adaptation and voice conversion by codebook mapping
- K Shikano, S Nakamura, and M Abe, "Speaker adaptation and voice conversion by codebook mapping," in Proc. IEEE International Sympoisum on Circuits and Systems, 1991.
- (1991) Proc. IEEE International Sympoisum on Circuits and Systems
- Shikano, K.¹ Nakamura, S.² Abe, M.³

52
- 84905560807
- Voice conversion with smoothed GMM and MAP adaptation
- Yining Chen, Min Chu, Eric Chang, Jia Liu, and Runsheng Liu, "Voice conversion with smoothed GMM and MAP adaptation," in Proc. Eurospeech, 2003.
- (2003) Proc. Eurospeech
- Chen, Y.¹ Chu, M.² Chang, E.³ Liu, J.⁴ Liu, R.⁵

53
- 84878415076
- A study of mutual information for GMM-based spectral conversion
- Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Yih-Ru Wang, and Sin-Horng Chen, "A study of mutual information for GMM-based spectral conversion," in Proc. Interspeech, 2012.
- (2012) Proc. Interspeech
- Hwang, H.-T.¹ Tsao, Y.² Wang, H.-M.³ Wang, Y.-R.⁴ Chen, S.-H.⁵

54
- 84865737668
- Gaussian process experts for voice conversion
- Nicholas CV Pilkington, Heiga Zen, and Mark JF Gales, "Gaussian process experts for voice conversion," in Proc. Interspeech, 2011.
- (2011) Proc. Interspeech
- Pilkington, N.C.V.¹ Zen, H.² Gales, M.J.F.³

55
- 84869384026
- Mixture of factor analyzers using priors from non-parallel speech for voice conversion
- Zhizheng Wu, Tomi Kinnunen, E.S. Chng, and Haizhou Li, "Mixture of factor analyzers using priors from non-parallel speech for voice conversion," IEEE SIGNAL PROCESSING LETTERS, vol. 19, no. 12, pp. 914-917, 2012.
- (2012) IEEE Signal Processing Letters , vol.19 , Issue.12 , pp. 914-917
- Wu, Z.¹ Kinnunen, T.² Chng, E.S.³ Li, H.⁴

56
- 70349197691
- Voice conversion using artificial neural networks
- Srinivas Desai, E Veera Raghavendra, B Yegnanarayana, A.W. Black, and Kishore Prahallad, "Voice conversion using artificial neural networks," in Proc. ICASSP, 2009.
- (2009) Proc. ICASSP
- Desai, S.¹ Raghavendra, E.V.² Yegnanarayana, B.³ Black, A.W.⁴ Prahallad, K.⁵

57
- 80053068819
- Voice conversion using support vector regression
- P Song, YQ Bao, L Zhao, and CR Zou, "Voice conversion using support vector regression," Electronics letters, vol. 47, no. 18, pp. 1045-1046, 2011.
- (2011) Electronics Letters , vol.47 , Issue.18 , pp. 1045-1046
- Song, P.¹ Bao, Y.Q.² Zhao, L.³ Zou, C.R.⁴

58
- 84856141218
- Voice conversion using dynamic kernel partial least squares regression
- Elina Helander, Hanna Silén, Tuomas Virtanen, and Moncef Gabbouj, "Voice conversion using dynamic kernel partial least squares regression," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 3, pp. 806-817, 2012.
- (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , Issue.3 , pp. 806-817
- Helander, E.¹ Silén, H.² Virtanen, T.³ Gabbouj, M.⁴

59
- 84889579519
- Conditional restricted boltzmann machine for voice conversion
- Zhizheng Wu, E.S. Chng, and Haizhou Li, "Conditional restricted boltzmann machine for voice conversion," in the first IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP), 2013.
- (2013) The First IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)
- Wu, Z.¹ Chng, E.S.² Li, H.³

60
- 84901803470
- Exemplar-based voice conversion using non-negative spectrogram deconvolution
- Zhizheng Wu, Tuomas Virtanen, Tomi Kinnunen, E.S. Chng, and Haizhou Li, "Exemplar-based voice conversion using non-negative spectrogram deconvolution," in the 8th ISCA Speech Synthesis Workshop, 2013.
- (2013) The 8th ISCA Speech Synthesis Workshop
- Wu, Z.¹ Virtanen, T.² Kinnunen, T.³ Chng, E.S.⁴ Li, H.⁵

61
- 84948175540
- VTLN-based voice conversion
- David Sundermann and Hermann Ney, "VTLN-based voice conversion," in Proc. The 3rd IEEE International Symposium on Signal Processing and Information Technology, 2003.
- (2003) Proc. The 3rd IEEE International Symposium on Signal Processing and Information Technology
- Sundermann, D.¹ Ney, H.²

62
- 77953727123
- Voice conversion based on weighted frequency warping
- Daniel Erro, Asunción Moreno, and Antonio Bonafonte, "Voice conversion based on weighted frequency warping," IEEE Transactions on Audio, Speech, and Language Processing, vol. 18, no. 5, pp. 922-931, 2010.
- (2010) IEEE Transactions on Audio, Speech, and Language Processing , vol.18 , Issue.5 , pp. 922-931
- Erro, D.¹ Moreno, A.² Bonafonte, A.³

63
- 84857498745
- Rosec, and thierry chonavel, voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora
- Elizabeth Godoy, Olivier Rosec, and Thierry Chonavel, "Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 4, pp. 1313-1323, 2012.
- (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , Issue.4 , pp. 1313-1323
- Olivier, E.G.¹

64
- 84872177757
- Parametric voice conversion based on bilinear frequency warping plus amplitude scaling
- Daniel Erro, Eva Navas, and Inma Hernaez, "Parametric voice conversion based on bilinear frequency warping plus amplitude scaling," IEEE Transactions on Audio, Speech, and Language Processing, vol. 21, no. 3, pp. 556-566, 2013.
- (2013) IEEE Transactions on Audio, Speech, and Language Processing , vol.21 , Issue.3 , pp. 556-566
- Erro, D.¹ Navas, E.² Hernaez, I.³

65
- 84890475857
- Transmutative voice conversion
- S.H. Mohammadi and Alexander Kain, "Transmutative voice conversion," in Proc. ICASSP, 2013.
- (2013) Proc. ICASSP
- Mohammadi, S.H.¹ Kain, A.²

66
- 33947623206
- Text-independent voice conversion based on unit selection
- David Sundermann, Harald Hoge, Antonio Bonafonte, Hermann Ney, Alan Black, and Shri Narayanan, "Text-independent voice conversion based on unit selection," in Proc. ICASSP, 2006.
- (2006) Proc. ICASSP
- Sundermann, D.¹ Hoge, H.² Bonafonte, A.³ Ney, H.⁴ Black, A.⁵ Narayanan, S.⁶

67
- 34547496196
- Towards a voice conversion system based on frame selection
- Thierry Dutoit, A Holzapfel, Matthieu Jottrand, Alexis Moinet, Javier Perez, and Y Stylianou, "Towards a voice conversion system based on frame selection," in Proc. ICASSP, 2007.
- (2007) Proc. ICASSP
- Dutoit, T.¹ Holzapfel, A.² Jottrand, M.³ Moinet, A.⁴ Perez, J.⁵ Stylianou, Y.⁶

68
- 84906276055
- Exemplar-based unit selection for voice conversion utilizing temporal information
- Zhizheng Wu, Tuomas Virtanen, Tomi Kinnunen, E.S. Chng, and Haizhou Li, "Exemplar-based unit selection for voice conversion utilizing temporal information," in Proc. Interspeech, 2013.
- (2013) Proc. Interspeech
- Wu, Z.¹ Virtanen, T.² Kinnunen, T.³ Chng, E.S.⁴ Li, H.⁵

69
- 84874448812
- A study on spoofing attack in state-of-the-art speaker verification: The telephone speech case
- Zhizheng Wu, Tomi Kinnunen, E.S. Chng, Haizhou Li, and Eliathamby Ambikairajah, "A study on spoofing attack in state-of-the-art speaker verification: the telephone speech case," in Asia-Pacific Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012, pp. 1-5.
- (2012) Asia-Pacific Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC) , pp. 1-5
- Wu, Z.¹ Kinnunen, T.² Chng, E.S.³ Li, H.⁴ Ambikairajah, E.⁵

70
- 84906275384
- Vulnerability evaluation of speaker verification under voice conversion spoofing: The effect of text constraints
- Zhizheng Wu, Anthony Larcher, Kong Aik Lee, E.S. Chng, Tomi Kinnunen, and Haizhou Li, "Vulnerability evaluation of speaker verification under voice conversion spoofing: the effect of text constraints," in Proc. Interspeech, 2013.
- (2013) Proc. Interspeech
- Wu, Z.¹ Larcher, A.² Aik Lee, K.³ Chng, E.S.⁴ Kinnunen, T.⁵ Li, H.⁶

71
- 65349113532
- Artificial impostor voice transformation effects on false acceptance rates
- Jean-Francois Bonastre, Driss Matrouf, and Corinne Fredouille, "Artificial impostor voice transformation effects on false acceptance rates," in Proc. Interspeech, 2007.
- (2007) Proc. Interspeech
- Bonastre, J.-F.¹ Matrouf, D.² Fredouille, C.³

72
- 84869773314
- On the vulnerability of automatic speaker recognition to spoofing attacks with artificial signals
- Federico Alegre, Ravichander Vipperla, Nicholas Evans, and Benoit Fauve, "On the vulnerability of automatic speaker recognition to spoofing attacks with artificial signals," in Proc. European Signal Processing Conference (EUSIPCO), 2012.
- (2012) Proc. European Signal Processing Conference (EUSIPCO)
- Alegre, F.¹ Vipperla, R.² Evans, N.³ Fauve, B.⁴

73
- 84867600098
- Vulnerability of speaker verification systems against voice conversion spoofing attacks: The case of telephone speech
- Tomi Kinnunen, Z.-Z. Wu, Kong Aik Lee, Filip Sedlak, E.S. Chng, and Haizhou Li, "Vulnerability of speaker verification systems against voice conversion spoofing attacks: The case of telephone speech," in Proc. ICASSP, 2012.
- (2012) Proc. ICASSP
- Kinnunen, T.¹ Wu, Z.-Z.² Lee, K.A.³ Sedlak, F.⁴ Chng, E.S.⁵ Li, H.⁶

74
- 84906234851
- Voice transformation-based spoofing of text-dependent speaker verification systems
- Zvi Kons and Hagai Aronowitz, "Voice transformation-based spoofing of text-dependent speaker verification systems," in Proc. Interspeech, 2013.
- (2013) Proc. Interspeech
- Kons, Z.¹ Aronowitz, H.²

75
- 84865745406
- New developments in voice biometrics for user authentication
- Hagai Aronowitz, Ron Hoory, Jason Pelecanos, and David Nahamoo, "New developments in voice biometrics for user authentication," in Proc. Interspeech, 2011.
- (2011) Proc. Interspeech
- Aronowitz, H.¹ Hoory, R.² Pelecanos, J.³ Nahamoo, D.⁴

76
- 84878465724
- RSR2015: Database for text-dependent speaker verification using multiple passphrases
- Anthony Larcher, Kong-Aik Lee, Bin Ma, and Haizhou Li, "RSR2015: Database for text-dependent speaker verification using multiple passphrases.," in Proc. Interspeech, 2012.
- (2012) Proc. Interspeech
- Larcher, A.¹ Lee, K.-A.² Ma, B.³ Li, H.⁴

77
- 33947714703
- Effect of speech transformation on impostor acceptance
- Driss Matrouf, J-F Bonastre, and Corinne Fredouille, "Effect of speech transformation on impostor acceptance," in Proc. ICASSP, 2006.
- (2006) Proc. ICASSP
- Matrouf, D.¹ Bonastre, J.-F.² Fredouille, C.³

78
- 84878410960
- Detecting converted speech and natural speech for anti-spoofing attack in speaker recognition
- Zhizheng Wu, E.S. Chng, and Haizhou Li, "Detecting converted speech and natural speech for anti-spoofing attack in speaker recognition," in Proc. Interspeech, 2012.
- (2012) Proc. Interspeech
- Wu, Z.¹ Chng, E.S.² Li, H.³

79
- 84878412793
- Spoofing countermeasures for the protection of automatic speaker recognition systems against attacks with artificial signals
- Federico Alegre, Ravichander Vipperla, Nicholas Evans, et al., "Spoofing countermeasures for the protection of automatic speaker recognition systems against attacks with artificial signals," in Proc. Interspeech, 2012.
- (2012) Proc. Interspeech
- Alegre, F.¹ Vipperla, R.² Evans, N.³

80
- 84890543945
- Synthetic speech detection using temporal modulation feature
- Zhizheng Wu, Xiong Xiao, E.S. Chng, and Haizhou Li, "Synthetic speech detection using temporal modulation feature," in Proc. ICASSP, 2013.
- (2013) Proc. ICASSP
- Wu, Z.¹ Xiao, X.² Chng, E.S.³ Li, H.⁴

81
- 84890542394
- Spoofing countermeasures to protect automatic speker verification from voice conversion
- Federico Alegre, Asmaa Amehraye, and Nicholas Evans, "Spoofing countermeasures to protect automatic speker verification from voice conversion," in Proc. ICASSP, 2013.
- (2013) Proc. ICASSP
- Alegre, F.¹ Amehraye, A.² Evans, N.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.