Volume , Issue , 2013, Pages

Incorporating global variance in the training phase of GMM-based voice conversion

Author keywords

[No Author keywords available]

Indexed keywords

CLOSED FORM SOLUTIONS; COMPUTATIONAL COSTS; CONVERSION PROCESS; GAUSSIAN MIXTURES; ITERATIVE PROCESS; SPEECH QUALITY; TRAINING PHASE; VOICE CONVERSION

EID: 84893234191     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/APSIPA.2013.6694179     Document Type: Conference Paper
Times cited : (20)

References (13)
  • 2
    • 0031623661
    • Spectral voice conversion for text-to-speech synthesis
    • A. Kain and M. W. Macon, "Spectral voice conversion for text-to-speech synthesis," Proc. ICASSP, 1998, vol. 1, pp. 285-288.
    • (1998) Proc. ICASSP , vol.1 , pp. 285-288
    • Kain, A.1    Macon, M.W.2
  • 3
    • 57749193836
    • Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
    • T. Toda, A. W. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 8, pp. 2222-2235, Nov. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.8 , pp. 2222-2235
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 5
    • 84857498745
    • Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora
    • E. Godoy, O. Rosec, and T. Chonavel, "Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 4, pp. 1313-1323, May 2012.
    • (2012) IEEE Trans. Audio, Speech, Lang. Process. , vol.20 , Issue.4 , pp. 1313-1323
    • Godoy, E.1    Rosec, O.2    Chonavel, T.3
  • 7
    • 84874485325
    • Exploring mutual information for GMM-based spectral conversion
    • H. T. Hwang, Y. Tsao, H. M. Wang, Y. R. Wang and S. H. Chen, "Exploring mutual information for GMM-based spectral conversion," Proc. ISCSLP, 2012, pp. 50-54.
    • (2012) Proc. ISCSLP , pp. 50-54
    • Hwang, H.T.1    Tsao, Y.2    Wang, H.M.3    Wang, Y.R.4    Chen, S.H.5
  • 8
    • 84906281888
    • Alleviating the over-smoothing problem in GMM-based voice conversion with discriminative training
    • H. T. Hwang, Y. Tsao, H. M. Wang, Y. R. Wang and S. H. Chen, "Alleviating the Over-Smoothing Problem in GMM-Based Voice Conversion with Discriminative Training," Proc. INTERSPEECH, 2013.
    • (2013) Proc. Interspeech
    • Hwang, H.T.1    Tsao, Y.2    Wang, H.M.3    Wang, Y.R.4    Chen, S.H.5
  • 9
    • 84865754815
    • Voice conversion using GMM with enhanced global variance
    • H. Benisty and D. Malah, "Voice conversion using GMM with enhanced global variance," Proc. INTERSPEECH, 2011, pp. 669-672.
    • (2011) Proc. Interspeech , pp. 669-672
    • Benisty, H.1    Malah, D.2
  • 10
    • 78149260085
    • Continuous stochastic feature mapping based on trajectory HMMs
    • H. Zen, Y. Nankaku, and K. Tokuda, "Continuous stochastic feature mapping based on trajectory HMMs," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 2, pp. 417-430, Feb. 2011.
    • (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.2 , pp. 417-430
    • Zen, H.1    Nankaku, Y.2    Tokuda, K.3
  • 12
    • 67650826181
    • Trajectory training considering global variance for HMM-based speech synthesis
    • T. Toda and S. Young, "Trajectory training considering global variance for HMM-based speech synthesis," Proc. ICASSP, 2009, pp. 4025-4028.
    • (2009) Proc. ICASSP , pp. 4025-4028
    • Toda, T.1    Young, S.2
  • 13
    • 0032673049
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Commun., vol. 27, no. 3-4, pp. 187-207, 1999.
    • (1999) Speech Commun. , vol.27 , Issue.3-4 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    De Cheveigné, A.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.