SCOPUS Information Search Platform

2013 IEEE China Summit and International Conference on Signal and Information Processing, ChinaSIP 2013 - Proceedings

Volume , Issue , 2013, Pages 104-108

Conditional restricted Boltzmann machine for voice conversion

(3) Wu, Zhizheng a,b; Chng, Eng Siong a,b; Li, Haizhou a,b,c

a NANYANG TECHNOLOGICAL UNIVERSITY (Singapore)

b NANYANG TECHNOLOGICAL UNIVERSITY (Singapore)

c INSTITUTE FOR INFOCOMM RESEARCH (Singapore)

Author keywords

conditional restricted Boltzmann machine; Speech synthesis; voice conversion

Indexed keywords

CONDITIONAL RESTRICTED BOLTZMANN MACHINES; CORRELATION COEFFICIENT; EXPERIMENTAL VALIDATIONS; GAUSSIAN MIXTURE MODEL; NON-LINEAR TRANSFORMATIONS; STATISTICAL MODELING; TRANSFORMATION FUNCTIONS; VOICE CONVERSION;

DATA PROCESSING; PARAMETER ESTIMATION; SPEECH COMMUNICATION; SPEECH SYNTHESIS;

SPEECH PROCESSING;

EID: 84889579519 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ChinaSIP.2013.6625307 Document Type: Conference Paper

Times cited : (77)

References (17)

1
- 0032026483
- Continuous probabilistic transform for voice conversion
- Y. Stylianou, O. Cappe, and E. Moulines, "Continuous probabilistic transform for voice conversion, " IEEE Transactions on Speech and Audio Processing, vol. 6, no. 2, pp. 131-142, 1998.
- (1998) IEEE Transactions on Speech and Audio Processing , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappe, O.² Moulines, E.³

2
- 77953712499
- Voice conversion using partial least squares regression
- E. Helander, T. Virtanen, J. Nurminen, and M. Gabbouj, "Voice conversion using partial least squares regression, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 18, no. 5, pp. 912-921, 2010.
- (2010) IEEE Transactions on Audio, Speech, and Language Processing , vol.18 , Issue.5 , pp. 912-921
- Helander, E.¹ Virtanen, T.² Nurminen, J.³ Gabbouj, M.⁴

3
- 84869384026
- Mixture of factor analyzers using priors from non-parallel speech for voice conversion
- Z. Wu, T. Kinnunen, E. Chng, and H. Li, "Mixture of factor analyzers using priors from non-parallel speech for voice conversion, " Signal Processing Letters, IEEE, vol. 19, no. 12, pp. 914-917, 2012.
- (2012) Signal Processing Letters, IEEE , vol.19 , Issue.12 , pp. 914-917
- Wu, Z.¹ Kinnunen, T.² Chng, E.³ Li, H.⁴

4
- 84867594339
- Local linear transformation for voice conversion
- V. Popa, H. Silen, J. Nurminen, and M. Gabbouj, "Local linear transformation for voice conversion, " in ICASSP 2012.
- (2012) ICASSP
- Popa, V.¹ Silen, H.² Nurminen, J.³ Gabbouj, M.⁴

5
- 0029254176
- Transformation of formants for voice conversion using artificial neural networks
- M. Narendranath, H. A. Murthy, S. Rajendran, and B. Yegnanarayana, "Transformation of formants for voice conversion using artificial neural networks, " Speech communication, vol. 16, no. 2, pp. 207-216, 1995.
- (1995) Speech Communication , vol.16 , Issue.2 , pp. 207-216
- Narendranath, M.¹ Murthy, H.A.² Rajendran, S.³ Yegnanarayana, B.⁴

6
- 77953707533
- Spectral mapping using artificial neural networks for voice conversion
- S. Desai, A. W. Black, B. Yegnanarayana, and K. Prahallad, "Spectral mapping using artificial neural networks for voice conversion, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 18, no. 5, pp. 954-964, 2010.
- (2010) IEEE Transactions on Audio, Speech, and Language Processing , vol.18 , Issue.5 , pp. 954-964
- Desai, S.¹ Black, A.W.² Yegnanarayana, B.³ Prahallad, K.⁴

7
- 84856141218
- Voice conversion using dynamic kernel partial least squares regression
- E. Helander, H. Silen, T. Virtanen, and M. Gabbouj, "Voice conversion using dynamic kernel partial least squares regression, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 3, pp. 806-817, 2012.
- (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , Issue.3 , pp. 806-817
- Helander, E.¹ Silen, H.² Virtanen, T.³ Gabbouj, M.⁴

8
- 57749193836
- Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
- T. Toda, A. W. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 8, pp. 2222-2235, 2007.
- (2007) IEEE Transactions on Audio, Speech, and Language Processing , vol.15 , Issue.8 , pp. 2222-2235
- Toda, T.¹ Black, A.W.² Tokuda, K.³

9
- 84859768504
- Statistical voice conversion based on noisy channel model
- D. Saito, S. Watanabe, A. Nakamura, and N. Minematsu, "Statistical voice conversion based on noisy channel model, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 20, no. 6, pp. 1784-1794, 2012.
- (2012) Audio, Speech, and Language Processing, IEEE Transactions on , vol.20 , Issue.6 , pp. 1784-1794
- Saito, D.¹ Watanabe, S.² Nakamura, A.³ Minematsu, N.⁴

10
- 84905560807
- Voice conversion with smoothed GMM and MAP adaptation
- Y. Chen, M. Chu, E. Chang, J. Liu, and R. Liu, "Voice conversion with smoothed GMM and MAP adaptation, " in Eurospeech-2003, 2003, pp. 2413-2416.
- (2003) Eurospeech-2003 , pp. 2413-2416
- Chen, Y.¹ Chu, M.² Chang, E.³ Liu, J.⁴ Liu, R.⁵

11
- 78049409973
- Phone recognition using restricted Boltzmann machines
- A. R. Mohamed and G. Hinton, "Phone recognition using restricted Boltzmann machines, " in Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on. IEEE, 2010, pp. 4354-4357.
- (2010) Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference On. IEEE , pp. 4354-4357
- Mohamed, A.R.¹ Hinton, G.²

12
- 0031623661
- Spectral voice conversion for text-to-speech synthesis
- Alexander Kain and Michael W. Macon, "Spectral voice conversion for text-to-speech synthesis, " in Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on. IEEE, 1998, vol. 1, pp. 285-288.
- (1998) Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference On. IEEE , vol.1 , pp. 285-288
- Kain, A.¹ Macon, M.W.²

13
- 84864026688
- Modeling human motion using binary latent variables
- G. W. Taylor, G. E. Hinton, and S. T. Roweis, "Modeling human motion using binary latent variables, " Advances in neural information processing systems, vol. 19, pp. 1345, 2007.
- (2007) Advances in Neural Information Processing Systems , vol.19 , pp. 1345
- Taylor, G.W.¹ Hinton, G.E.² Roweis, S.T.³

14
- 0013344078
- Training products of experts by minimizing contrastive divergence
- G. E. Hinton, "Training products of experts by minimizing contrastive divergence, " Neural computation, vol. 14, no. 8, pp. 1771-1800, 2002.
- (2002) Neural Computation , vol.14 , Issue.8 , pp. 1771-1800
- Hinton, G.E.¹

15
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigne, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds, " Speech communication, vol. 27, no. 3, pp. 187-207, 1999.
- (1999) Speech Communication , vol.27 , Issue.3 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² De Cheveigne, A.³

16
- 79959842826
- Text-independent f0 transformation with non-parallel data for voice conversion
- Z. Z. Wu, T. Kinnunen, E. S. Chng, and H. Li, "Text-independent f0 transformation with non-parallel data for voice conversion, " Proc. Interspeech 2010, pp. 1732-1735, 2010.
- (2010) Proc. Interspeech 2010 , pp. 1732-1735
- Wu, Z.Z.¹ Kinnunen, T.² Chng, E.S.³ Li, H.⁴

17
- 85008039410
- Improved prosody generation by maximizing joint probability of state and longer units
- Y. Qian, Z. Wu, B. Gao, and F. K. Soong, "Improved prosody generation by maximizing joint probability of state and longer units, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 19, no. 6, pp. 1702-1710, 2011.
- (2011) Audio, Speech, and Language Processing, IEEE Transactions on , vol.19 , Issue.6 , pp. 1702-1710
- Qian, Y.¹ Wu, Z.² Gao, B.³ Soong, F.K.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.