SCOPUS Information Retrieval Platform

Speech Communication

Volume 58, 2014, Pages 124-138

Voice conversion based on Gaussian processes by coherent and asymmetric training with limited training data

Authors (6): Xu, Ning a,b; Tang, Yibing a; Bao, Jingyi c; Jiang, Aiming a; Liu, Xiaofeng a; Yang, Zhen d

a HOHAI UNIVERSITY (China)

b NANJING UNIVERSITY OF POSTS AND TELECOMMUNICATIONS (China)

c CHANGZHOU INSTITUTE OF TECHNOLOGY (China)

d NONE (China)

Author keywords

Asymmetric training; Coherent training; Gaussian mixture model; Gaussian processes; Voice conversion

Indexed keywords

COMPUTATIONAL COSTS; GAUSSIAN MIXTURE MODEL; GAUSSIAN PROCESSES; LIMITED TRAINING DATA; NONLINEAR MAPPINGS; SPECTRAL FEATURE; TRAINING STRATEGY; VOICE CONVERSION;

GAUSSIAN DISTRIBUTION; GAUSSIAN NOISE (ELECTRONIC); MAPPING;

SPEECH PROCESSING;

EID: 84890539284 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/j.specom.2013.11.005 Document Type: Article

Times cited : (31)

References (27)

1
- 0023739214
- Voice conversion through vector quantization
- Speech Signal Process., New York, USA
- Abe, M., Nakamura, S., Shikano, K., Kuwabara, H., 1988. Voice conversion through vector quantization. In: Proc. IEEE Int. Conf. Acoust. Speech Signal Process., New York, USA, pp. 655-658.
- (1988) Proc. IEEE Int. Conf. Acoust , pp. 655-658
- Abe, M.¹ Nakamura, S.² Shikano, K.³ Kuwabara, H.⁴

2
- 0033154052
- Speaker transformation algorithm using segmental codebooks (STASC)
- L.M. Arslan Speaker transformation algorithm using segmental codebooks (STASC) Speech Commun. 28 1999 211 226
- (1999) Speech Commun. , vol.28 , pp. 211-226
- Arslan, L.M.¹

3
- 33846516584
- Springer
- C.M. Bishop Pattern Recognition and Machine Learning 2006 Springer
- (2006) Pattern Recognition and Machine Learning
- Bishop, C.M.¹

4
- 84876887691
- Nonparametric mixtures of Gaussian processes with power-law behavior
- S.P. Chatzis, and Y. Demiris Nonparametric mixtures of Gaussian processes with power-law behavior IEEE Trans. Neural Networks Learn. Syst. 23 2012 1862 1871
- (2012) IEEE Trans. Neural Networks Learn. Syst. , vol.23 , pp. 1862-1871
- Chatzis, S.P.¹ Demiris, Y.²

5
- 84905560807
- Voice conversion with smoothed GMM and MAP adaptation
- Geneva, Switzerland
- Chen, Y., Chu, M., Chang, E., Liu, J., Liu, R., 2003. Voice conversion with smoothed GMM and MAP adaptation. In: Proc. Interspeech. Geneva, Switzerland, pp. 2413-2416.
- (2003) Proc. Interspeech , pp. 2413-2416
- Chen, Y.¹ Chu, M.² Chang, E.³ Liu, J.⁴ Liu, R.⁵

6
- 77953707533
- Spectral mapping using artificial neural networks for voice conversion
- S. Desai, A.W. Black, B. Yegnanarayana, and K. Prahallad Spectral mapping using artificial neural networks for voice conversion IEEE Trans. Audio Speech Lang. Process. 18 2010 954 964
- (2010) IEEE Trans. Audio Speech Lang. Process. , vol.18 , pp. 954-964
- Desai, S.¹ Black, A.W.² Yegnanarayana, B.³ Prahallad, K.⁴

7
- 77953697940
- (Ph.D. thesis). Universitat Politècnica de Catalunya
- Erro, D., 2008. Intra-lingual and cross-lingual voice conversion using harmonic plus stochastic models (Ph.D. thesis). Universitat Politècnica de Catalunya.
- (2008) Intra-lingual and Cross-lingual Voice Conversion Using Harmonic Plus Stochastic Models
- Erro, D.¹

8
- 51449124416
- Flexible harmonic/stochastic speech synthesis
- Bonn, Germany
- Erro, D., Moreno, A., Bonafonte, A., 2007. Flexible harmonic/stochastic speech synthesis. In: Proc. ISCA Workshop Speech Synth., Bonn, Germany, pp. 194-199.
- (2007) Proc. ISCA Workshop Speech Synth , pp. 194-199
- Erro, D.¹ Moreno, A.² Bonafonte, A.³

9
- 77953727123
- Voice conversion based on weighted frequency warping
- D. Erro, A. Moreno, and A. Bonafonte Voice conversion based on weighted frequency warping IEEE Trans. Audio Speech Lang. Process. 18 2010 922 931
- (2010) IEEE Trans. Audio Speech Lang. Process. , vol.18 , pp. 922-931
- Erro, D.¹ Moreno, A.² Bonafonte, A.³

10
- 51449107658
- LSF mapping for voice conversion with very small training sets
- Speech Signal Process
- Helander, E., Nurminen, J., Gabbouj, M., 2008. LSF mapping for voice conversion with very small training sets. In: Proc. IEEE Int. Conf. Acoust. Speech Signal Process., pp. 4669-4672.
- (2008) Proc. IEEE Int. Conf. Acoust , pp. 4669-4672
- Helander, E.¹ Nurminen, J.² Gabbouj, M.³

11
- 84856141218
- Voice conversion using dynamic kernel partial least squares regression
- E. Helander, H. Silén, T. Virtanen, and M. Gabbouj Voice conversion using dynamic kernel partial least squares regression IEEE Trans. Audio Speech Lang. Process. 20 2012 806 817
- (2012) IEEE Trans. Audio Speech Lang. Process. , vol.20 , pp. 806-817
- Helander, E.¹ Silén, H.² Virtanen, T.³ Gabbouj, M.⁴

12
- 4444285698
- (Ph.D. thesis). Oregon Health and Sci University, Rockford, USA
- Kain, A., 2001. High resolution voice transformation (Ph.D. thesis). Oregon Health and Sci University, Rockford, USA.
- (2001) High Resolution Voice Transformation
- Kain, A.¹

13
- 38149065136
- Statistical approach for voice personality transformation
- K.S. Lee Statistical approach for voice personality transformation IEEE Trans. Audio Speech Lang. Process. 15 2007 641 651
- (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , pp. 641-651
- Lee, K.S.¹

14
- 0016495091
- Linear prediction: A tutorial review
- J. Makhoul Linear prediction: a tutorial review Proc. IEEE 63 1975 561 580
- (1975) Proc. IEEE , vol.63 , pp. 561-580
- Makhoul, J.¹

15
- 84890474765
- Voice conversion based on variational Bayes method
- Marume, M., Nankaku, Y., Sako, S., Tokuda, K., Kitamura, T., 2007. Voice conversion based on variational Bayes method. Technical Report of IEICE, vol. 107, pp. 103-108.
- (2007) Technical Report of IEICE , vol.107 , pp. 103-108
- Marume, M.¹ Nankaku, Y.² Sako, S.³ Tokuda, K.⁴ Kitamura, T.⁵

16
- 0029254176
- Transformation of formants for voice conversion using artificial neural networks
- M. Narendranath, H.A. Murthy, S. Rajendran, and B. Yegnanarayana Transformation of formants for voice conversion using artificial neural networks Speech Commun. 16 1995 207 216
- (1995) Speech Commun. , vol.16 , pp. 207-216
- Narendranath, M.¹ Murthy, H.A.² Rajendran, S.³ Yegnanarayana, B.⁴

17
- 84865737668
- Gaussian process experts for voice conversion
- Florence, Italy
- Pilkington, N.C.V., Zen, H., Gales, M.J.F., 2011. Gaussian process experts for voice conversion. In: Proc. Interspeech. Florence, Italy, pp. 2761-2764.
- (2011) Proc. Interspeech , pp. 2761-2764
- Pilkington, N.C.V.¹ Zen, H.² Gales, M.J.F.³

18
- 84873313607
- Prentice Hall
- L.R. Rabiner, and R.W. Schafer Theory and Applications of Digital Speech Processing 2009 Prentice Hall
- (2009) Theory and Applications of Digital Speech Processing
- Rabiner, L.R.¹ Schafer, R.W.²

19
- 25444448065
- MIT Press Cambridge
- C.E. Rasmussen, and C.K.I. Williams Gaussian Processes for Machine Learning 2006 MIT Press Cambridge
- (2006) Gaussian Processes for Machine Learning
- Rasmussen, C.E.¹ Williams, C.K.I.²

20
- 77953753547
- (Ph.D. thesis). University of London
- Snelson, E.L., 2007. Flexible and efficient Gaussian process for machine learning (Ph.D. thesis). University of London.
- (2007) Flexible and Efficient Gaussian Process for Machine Learning
- Snelson, E.L.¹

21
- 0003447548
- for speech and speaker modification (Ph.D. thesis). École Nationale Supérieure des Télécommunications
- Stylianou, Y., 1996. Harmonic plus noise models for speech, combined with statistical methods, for speech and speaker modification (Ph.D. thesis). École Nationale Supérieure des Télécommunications.
- (1996) Harmonic Plus Noise Models for Speech, Combined with Statistical Methods
- Stylianou, Y.¹

22
- 70349197715
- Voice transformation: A survey
- Taipei, Taiwan
- Stylianou, Y., 2009. Voice transformation: a survey. In: Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Taipei, Taiwan, pp. 3585-3588.
- (2009) Proc. IEEE Int. Conf. Acoust. Speech Signal Process , pp. 3585-3588
- Stylianou, Y.¹

23
- 0032026483
- Continuous probabilistic transform for voice conversion
- PII S1063667698017386
- Y. Stylianou, O. Cappe, and E. Moulines Continuous probabilistic transform for voice conversion IEEE Trans. Speech Audio Process. 6 1998 131 142 (Pubitemid 128720639)
- (1998) IEEE Transactions on Speech and Audio Processing , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappe, O.² Moulines, E.³

24
- 57749193836
- Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
- T. Toda, A.W. Black, and K. Tokuda Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory IEEE Trans. Audio Speech Lang. Process. 15 2007 2222 2235
- (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , pp. 2222-2235
- Toda, T.¹ Black, A.W.² Tokuda, K.³

25
- 38649140222
- Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model
- DOI 10.1016/j.specom.2007.09.001, PII S0167639307001495
- T. Toda, A.W. Black, and K. Tokuda Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model Speech Commun. 50 2008 215 227 (Pubitemid 351172471)
- (2008) Speech Communication , vol.50 , Issue.3 , pp. 215-227
- Toda, T.¹ Black, A.W.² Tokuda, K.³

26
- 78650194302
- A voice conversion algorithm in the context of sparse training data
- N. Xu, and Z. Yang A voice conversion algorithm in the context of sparse training data J. Nanjing Univ. Posts Telecommun. 30 2010 1 7
- (2010) J. Nanjing Univ. Posts Telecommun. , vol.30 , pp. 1-7
- Xu, N.¹ Yang, Z.²

27
- 34047254509
- Quality-enhanced voice morphing using maximum likelihood transformations
- DOI 10.1109/TSA.2005.860839
- H. Ye, and S. Young Quality enhanced voice morphing using maximum likelihood transformations IEEE Trans. Audio Speech Lang. Process. 14 2006 1301 1312 (Pubitemid 46547625)
- (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.4 , pp. 1301-1312
- Ye, H.¹ Young, S.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.