SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2014, Pages 7884-7888

Using bidirectional associative memories for joint spectral envelope modeling in voice conversion

(4) Liu, Li Juan a Chen, Ling Hui a Ling, Zhen Hua a Dai, Li Rong a

a National Engineering Laboratory for Speech and Language Information Processing (China)

Author keywords

bidirectional associative memory; contrastive divergence; Spectral envelope modeling; voice conversion

Indexed keywords

ASSOCIATIVE PROCESSING; ASSOCIATIVE STORAGE; GAUSSIAN DISTRIBUTION; SIGNAL PROCESSING;

BI-DIRECTIONAL ASSOCIATIVE MEMORY; CONTRASTIVE DIVERGENCE; HIGH-DIMENSIONAL; MODELING ABILITIES; NATURAL REPRESENTATION; SPECTRAL ENVELOPE MODELING; SPECTRAL ENVELOPES; VOICE CONVERSION;

SPEECH PROCESSING;

EID: 84905223323 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2014.6855135 Document Type: Conference Paper

Times cited : (15)

References (13)

1
- 0031623661
- Spectral voice conversion for textto-speech synthesis
- A. Kain and M. W. Macon, "Spectral voice conversion for textto-speech synthesis, " in Proc. ICASSP, 1998, pp. 285-288.
- (1998) Proc. ICASSP , pp. 285-288
- Kain, A.¹ Macon, M.W.²

2
- 0034842552
- Voice conversion algorithm based on gaussian mixture model with dynamic frequency warping of straight spectrum
- T. Toda, H. Saruwatari, and K. Shikano, "Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum, " in Acoustics, Speech, and Signal Processing, 2001, vol. 2, pp. 841-844.
- (2001) Acoustics, Speech, and Signal Processing , vol.2 , pp. 841-844
- Toda, T.¹ Saruwatari, H.² Shikano, K.³

3
- 57749193836
- Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
- nov
- T. Toda, A. W. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory, " IEEE Trans. Audio, Speech, and Lang. Process, vol. 15, no. 8, pp. 2222-2235, nov. 2007.
- (2007) IEEE Trans. Audio, Speech, and Lang. Process , vol.15 , Issue.8 , pp. 2222-2235
- Toda, T.¹ Black, A.W.² Tokuda, K.³

4
- 84901237776
- Modeling spectral envelopes using restricted Boltzmann machines and deep belief networks for statistical parametric speech synthesis
- Z. H. Ling, L. Deng, and D. Yu, "Modeling spectral envelopes using restricted Boltzmann machines and deep belief networks for statistical parametric speech synthesis, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 21, no. 10, pp. 2129-2139, 2013.
- (2013) Audio, Speech, and Language Processing, IEEE Transactions on , vol.21 , Issue.10 , pp. 2129-2139
- Ling, Z.H.¹ Deng, L.² Yu, D.³

5
- 84905560807
- Voice conversion with smoothed GMM and MAP adaptation
- Yining Chen, Min Chu, Eric Chang, Jia Liu, and Runsheng Liu, "Voice conversion with smoothed GMM and MAP adaptation., " in Eurospeech, 2003, pp. 2413-2416.
- (2003) Eurospeech , pp. 2413-2416
- Chen, Y.¹ Chu, M.² Chang, E.³ Liu, J.⁴ Liu, R.⁵

6
- 78149260085
- Continuous stochastic feature mapping based on trajectory HMMs
- H. Zen, Y. Nankaku, and K. Tokuda, "Continuous stochastic feature mapping based on trajectory HMMs, " IEEE Trans. Audio, Speech, and Lang. Process, vol. 19, no. 2, pp. 417-430, 2011.
- (2011) IEEE Trans. Audio, Speech, and Lang. Process , vol.19 , Issue.2 , pp. 417-430
- Zen, H.¹ Nankaku, Y.² Tokuda, K.³

7
- 78651276374
- Ph. D. Thesis, University of Toronto
- R. Salakhutdinov, Learning deep generative models, Ph. D. Thesis, University of Toronto, 2009.
- (2009) Learning Deep Generative Models
- Salakhutdinov, R.¹

8
- 84906225084
- Joint spectral distribution modeling using restricted Boltzmann machines for voice conversion
- L. H. Chen, Z. H. Ling, Y. Song, and L. R. Dai, "Joint spectral distribution modeling using restricted Boltzmann machines for voice conversion, " in Proc. InterSpeech, 2013, pp. 3052-3056.
- (2013) Proc. InterSpeech , pp. 3052-3056
- Chen, L.H.¹ Ling, Z.H.² Song, Y.³ Dai, L.R.⁴

9
- 0013344078
- Training products of experts by minimizing contrastive divergence
- G. E Hinton, "Training products of experts by minimizing contrastive divergence, " Neural Computation, vol. 12, no. 14, pp. 1711-1800, 2002.
- (2002) Neural Computation , vol.12 , Issue.14 , pp. 1711-1800
- Hinton, G.E.¹

10
- 0023861743
- Bidirectional associative memories
- B. Kosko, "Bidirectional associative memories, " IEEE Trans. Systems, Man and Cybernetics, vol. 18, no. 1, pp. 49-60, 1988.
- (1988) IEEE Trans. Systems, Man and Cybernetics , vol.18 , Issue.1 , pp. 49-60
- Kosko, B.¹

11
- 84878387361
- PLDA using Gaussian restricted Boltzmann machines with application to speaker verification
- T. Stafylakis, P. Kenny, M. Senoussaoui, and et al, "PLDA using Gaussian restricted Boltzmann machines with application to speaker verification, " in INTERSPEECH, 2012.
- (2012) INTERSPEECH
- Stafylakis, T.¹ Kenny, P.² Senoussaoui, M.³

12
- 84861125212
- A practical guide to training restricted Boltzmann machines
- G. E Hinton, "A practical guide to training restricted Boltzmann machines, " Momentum, vol. 9, no. 1, 2010.
- (2010) Momentum , vol.9 , Issue.1
- Hinton, G.E.¹

13
- 0032673049
- Restructuring speech representations using a pitch-adaptive timefrequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using a pitch-adaptive timefrequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds, " Speech Communication, vol. 27, no. 3, pp. 187-208, 1999.
- (1999) Speech Communication , vol.27 , Issue.3 , pp. 187-208
- Kawahara, H.¹ Masuda-Katsuse, I.² De Cheveigné, A.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.