SCOPUS 정보 검색 플랫폼

2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2013

Volumn , Issue , 2013, Pages

Incorporating global variance in the training phase of GMM-based voice conversion

(5) Hwang, Hsin Te a,c Tsao, Yu b Wang, Hsin Min c Wang, Yih Ru a Chen, Sin Horng a

a NATIONAL CHIAO TUNG UNIVERSITY (Taiwan)

b RESEARCH CENTER FOR INFORMATION TECHNOLOGY INNOVATION (Taiwan)

c INSTITUTE OF INFORMATION SCIENCE (Taiwan)

Author keywords

[No Author keywords available]

Indexed keywords

CLOSED FORM SOLUTIONS; COMPUTATIONAL COSTS; CONVERSION PROCESS; GAUSSIAN MIXTURES; ITERATIVE PROCESS; SPEECH QUALITY; TRAINING PHASE; VOICE CONVERSION;

DATA PROCESSING; ITERATIVE METHODS; MAPPING; SPEECH PROCESSING; TRAJECTORIES;

COST REDUCTION;

EID: 84893234191 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/APSIPA.2013.6694179 Document Type: Conference Paper

Times cited : (20)

References (13)

1
- 0032026483
- Continuous probabilistic transform for voice conversion
- PII S1063667698017386
- Y. Stylianou, O. Cappé, and E. Moulines, "Continuous probabilistic transform for voice conversion," IEEE Trans. Speech Audio Process., vol. 6, no. 2, pp.131-142, Mar. 1998. (Pubitemid 128720639)
- (1998) IEEE Transactions on Speech and Audio Processing , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappe, O.² Moulines, E.³

2
- 0031623661
- Spectral voice conversion for textto- speech synthesis
- A. Kain, and M. W. Macon, "Spectral voice conversion for textto- speech synthesis," Proc. ICASSP, 1998, vol. 1, pp. 285-288.
- (1998) Proc. ICASSP , vol.1 , pp. 285-288
- Kain, A.¹ Macon, M.W.²

3
- 57749193836
- Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
- Nov.
- T. Toda, A.W. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory," IEEE Trans. Audio, Speech, Lang., Process., vol. 15, no. 8, pp. 2222-2235, Nov. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang., Process. , vol.15 , Issue.8 , pp. 2222-2235
- Toda, T.¹ Black, A.W.² Tokuda, K.³

4
- 77953727123
- Voice conversion based on weighted frequency warping
- July.
- D. Erro, A. Moreno, and A. Bonafonte, "Voice conversion based on weighted frequency warping," IEEE Trans. Audio, Speech, Lang., Process., vol. 18, no. 5, pp. 922-931, July. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang., Process. , vol.18 , Issue.5 , pp. 922-931
- Erro, D.¹ Moreno, A.² Bonafonte, A.³

5
- 84857498745
- Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora
- May.
- E. Godoy, O. Rosec, and T. Chonavel, "Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora," IEEE Trans. Audio, Speech, Lang., Process, vol. 20, no. 4, pp. 1313-1323, May. 2012.
- (2012) IEEE Trans. Audio, Speech, Lang., Process , vol.20 , Issue.4 , pp. 1313-1323
- Godoy, E.¹ Rosec, O.² Chonavel, T.³

6
- 84878415076
- A study of mutual information for GMM-based spectral conversion
- H. T. Hwang, Y. Tsao, H. M. Wang, Y. R. Wang and S. H. Chen, "A study of mutual information for GMM-based spectral conversion," Proc. INTERSPEECH, 2012.
- (2012) Proc. Interspeech
- Hwang, H.T.¹ Tsao, Y.² Wang, H.M.³ Wang, Y.R.⁴ Chen, S.H.⁵

7
- 84874485325
- Exploring mutual information for GMM-based spectral conversion
- H. T. Hwang, Y. Tsao, H. M. Wang, Y. R. Wang and S. H. Chen, "Exploring mutual information for GMM-based spectral conversion," Proc. ISCSLP, 2012, pp. 50-54.
- (2012) Proc. ISCSLP , pp. 50-54
- Hwang, H.T.¹ Tsao, Y.² Wang, H.M.³ Wang, Y.R.⁴ Chen, S.H.⁵

8
- 84906281888
- Alleviating the over-smoothing problem in GMMBased voice conversion with discriminative training
- H. T. Hwang, Y. Tsao, H. M. Wang, Y. R. Wang and S. H. Chen, "Alleviating the Over-Smoothing Problem in GMMBased Voice Conversion with Discriminative Training," Proc. INTERSPEECH, 2013.
- (2013) Proc. Interspeech
- Hwang, H.T.¹ Tsao, Y.² Wang, H.M.³ Wang, Y.R.⁴ Chen, S.H.⁵

9
- 84865754815
- Voice conversion using GMM with enhanced global variance
- H. Benisty and D. Malah, "Voice conversion using GMM with enhanced global variance", Proc. INTERSPEECH, 2011, pp. 669-672.
- (2011) Proc. Interspeech , pp. 669-672
- Benisty, H.¹ Malah, D.²

10
- 78149260085
- Continuous stochastic feature mapping based on trajectory HMMs
- Feb.
- H. Zen, Y. Nankaku, and K. Tokuda, "Continuous stochastic feature mapping based on trajectory HMMs," IEEE Trans. Audio, Speech, Lang., Process., vol. 19, no. 2, pp. 417-430, Feb. 2011.
- (2011) IEEE Trans. Audio, Speech, Lang., Process. , vol.19 , Issue.2 , pp. 417-430
- Zen, H.¹ Nankaku, Y.² Tokuda, K.³

11
- 84859768504
- Statistical voice conversion based on noisy channel model
- Aug.
- D. Saito, S. Watanabe, A. Nakamura, and N. Minematsu, "Statistical voice conversion based on noisy channel model," IEEE Trans. Audio, Speech, Lang., Process., vol. 20, no. 6, pp. 1784-1794, Aug. 2012.
- (2012) IEEE Trans. Audio, Speech, Lang., Process. , vol.20 , Issue.6 , pp. 1784-1794
- Saito, D.¹ Watanabe, S.² Nakamura, A.³ Minematsu, N.⁴

12
- 67650826181
- Trajectory Training considering global variance for HMM-based speech synthesis
- T. Toda, and S. Young, "Trajectory Training considering global variance for HMM-based speech synthesis," Proc. ICASSP, 2009, pp. 4025-4028.
- (2009) Proc. ICASSP , pp. 4025-4028
- Toda, T.¹ Young, S.²

13
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequencybased F0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequencybased F0 extraction: Possible role of a repetitive structure in sounds," Speech Commun., vol. 27, no. 3-4, pp.187-207, 1999.
- (1999) Speech Commun. , vol.27 , Issue.3-4 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² De Cheveigné, A.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.