SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn , Issue , 2013, Pages 3062-3066

Alleviating the over-smoothing problem in GMM-based voice conversion with discriminative training

(5) Hwang, Hsin Te a,c Tsao, Yu b Wang, Hsin Min c Wang, Yih Ru a Chen, Sin Horng a

a NATIONAL CHIAO TUNG UNIVERSITY (Taiwan)

b RESEARCH CENTER FOR INFORMATION TECHNOLOGY INNOVATION (Taiwan)

c INSTITUTE OF INFORMATION SCIENCE (Taiwan)

Author keywords

Discriminative training; GMM; Voice conversion

Indexed keywords

ACOUSTIC WAVE EFFECTS;

DISCRIMINATIVE POWER; DISCRIMINATIVE TRAINING; GAUSSIAN MIXTURE MODEL; GMM; JOINT DENSITIES; OBJECTIVE EVALUATION; SUBJECTIVE EVALUATIONS; VOICE CONVERSION;

SPEECH PROCESSING;

EID: 84906281888 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (8)

References (15)

1
- 0032026483
- Continuous probabilistic transform for voice conversion
- Mar
- Y. Stylianou, O. Cappé, and E. Moulines, "Continuous probabilistic transform for voice conversion, " IEEE Trans. Speech Audio Process., vol. 6, no. 2, pp.131-142, Mar. 1998.
- (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappé, O.² Moulines, E.³

2
- 0031623661
- Spectral voice conversion for text to-speech synthesis
- A. Kain, and M. W. Macon, "Spectral voice conversion for textto-speech synthesis, " Proc. ICASSP, 1998, vol. 1, pp. 285-288.
- (1998) Proc. ICASSP , vol.1 , pp. 285-288
- Kain, A.¹ Macon, M.W.²

3
- 57749193836
- Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
- Nov
- T. Toda, A.W. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory, " IEEE Trans. Audio, Speech, Lang., Process., vol. 15, no. 8, pp. 2222-2235, Nov. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang., Process , vol.15 , Issue.8 , pp. 2222-2235
- Toda, T.¹ Black, A.W.² Tokuda, K.³

4
- 77953727123
- Voice conversion based on weighted frequency warping
- July
- D. Erro, A. Moreno, and A. Bonafonte, "Voice conversion based on weighted frequency warping, " IEEE Trans. Audio, Speech, Lang., Process., vol. 18, no. 5, pp. 922-931, July. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang., Process. , vol.18 , Issue.5 , pp. 922-931
- Erro, D.¹ Moreno, A.² Bonafonte, A.³

5
- 84857498745
- Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora
- May
- E. Godoy, O. Rosec, and T. Chonavel, "Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora, " IEEE Trans. Audio, Speech, Lang., Process, vol. 20, no. 4, pp. 1313-1323, May. 2012.
- (2012) IEEE Trans. Audio, Speech, Lang., Process , vol.20 , Issue.4 , pp. 1313-1323
- Godoy, E.¹ Rosec, O.² Chonavel, T.³

6
- 84878415076
- A study of mutual information for GMM-based spectral conversion
- H. T. Hwang, Y. Tsao, H. M. Wang, Y. R. Wang and S. H. Chen, "A study of mutual information for GMM-based spectral conversion, " Proc. INTERSPEECH, 2012.
- (2012) Proc. INTERSPEECH
- Hwang, H.T.¹ Tsao, Y.² Wang, H.M.³ Wang, Y.R.⁴ Chen, S.H.⁵

7
- 84874485325
- Exploring mutual information for GMM-based spectral conversion
- H. T. Hwang, Y. Tsao, H. M. Wang, Y. R. Wang and S. H. Chen, "Exploring mutual information for GMM-based spectral conversion, " Proc. ISCSLP, 2012, pp. 50-54.
- (2012) Proc. ISCSLP , pp. 50-54
- Hwang, H.T.¹ Tsao, Y.² Wang, H.M.³ Wang, Y.R.⁴ Chen, S.H.⁵

8
- 84865754815
- Voice conversion using GMM with enhanced global variance
- H. Benisty and D. Malah, "Voice conversion using GMM with enhanced global variance", Proc. INTERSPEECH, 2011, pp. 669-672.
- (2011) Proc. INTERSPEECH , pp. 669-672
- Benisty, H.¹ Malah, D.²

9
- 78149260085
- Continuous stochastic feature mapping based on trajectory HMMs
- Feb
- H. Zen, Y. Nankaku, and K. Tokuda, "Continuous stochastic feature mapping based on trajectory HMMs, " IEEE Trans. Audio, Speech, Lang., Process., vol. 19, no. 2, pp. 417-430, Feb. 2011.
- (2011) IEEE Trans. Audio, Speech, Lang., Process. , vol.19 , Issue.2 , pp. 417-430
- Zen, H.¹ Nankaku, Y.² Tokuda, K.³

10
- 84859768504
- Statistical voice conversion based on noisy channel model
- Aug
- D. Saito, S. Watanabe, A. Nakamura, and N. Minematsu, "Statistical voice conversion based on noisy channel model, " IEEE Trans. Audio, Speech, Lang., Process., vol. 20, no. 6, pp. 1784-1794, Aug. 2012.
- (2012) IEEE Trans. Audio, Speech, Lang., Process. , vol.20 , Issue.6 , pp. 1784-1794
- Saito, D.¹ Watanabe, S.² Nakamura, A.³ Minematsu, N.⁴

11
- 34547552192
- Conditional vector quantization for voice conversion
- A. Mouchtaris, Y. Agiomyrgiannakis, and Y. Stylianou, "Conditional vector quantization for voice conversion, " Proc. ICASSP, 2007, vol. 4, pp. 505-508.
- (2007) Proc. ICASSP , vol.4 , pp. 505-508
- Mouchtaris, A.¹ Agiomyrgiannakis, Y.² Stylianou, Y.³

12
- 70450186582
- Alleviating the one-tomany mapping problem in voice conversion with contextdependent modeling
- E. Godoy, O. Rosec, and T. Chonavel, "Alleviating the one-tomany mapping problem in voice conversion with contextdependent modeling", Proc. INTERSPEECH, 2009, pp. 1627-1630.
- (2009) Proc. INTERSPEECH , pp. 1627-1630
- Godoy, E.¹ Rosec, O.² Chonavel, T.³

13
- 0022890536
- Maximum mutual information estimation of hidden Markov model parameters for speech recognition
- L. R. Bahl, P.F. Brown, P. V. De Souza, and L. R., Mercer, "Maximum mutual information estimation of hidden Markov model parameters for speech recognition, " Proc. ICASSP, 1986, vol. 11, pp. 49-52.
- (1986) Proc. ICASSP , vol.11 , pp. 49-52
- Bahl, L.R.¹ Brown, P.F.² De Souza, P.V.³ Mercer, R.L.⁴

14
- 0031139839
- Minimum classification error rate methods for speech recognition
- May
- B. H. Juang, W. Chou, and C. H. Lee, "Minimum classification error rate methods for speech recognition, " IEEE Trans. Speech Audio Process., vol. 5, no. 3, pp. 257-265, May. 1997.
- (1997) IEEE Trans. Speech Audio Process. , vol.5 , Issue.3 , pp. 257-265
- Juang, B.H.¹ Chou, W.² Lee, C.H.³

15
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds, " Speech Commun., vol. 27, no. 3-4, pp.187-207, 1999.
- (1999) Speech Commun. , vol.27 , Issue.3-4 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² De Cheveigné, A.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.