메뉴 건너뛰기




Volumn , Issue , 2013, Pages 3062-3066

Alleviating the over-smoothing problem in GMM-based voice conversion with discriminative training

Author keywords

Discriminative training; GMM; Voice conversion

Indexed keywords

ACOUSTIC WAVE EFFECTS;

EID: 84906281888     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (8)

References (15)
  • 1
    • 0032026483 scopus 로고    scopus 로고
    • Continuous probabilistic transform for voice conversion
    • Mar
    • Y. Stylianou, O. Cappé, and E. Moulines, "Continuous probabilistic transform for voice conversion, " IEEE Trans. Speech Audio Process., vol. 6, no. 2, pp.131-142, Mar. 1998.
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.2 , pp. 131-142
    • Stylianou, Y.1    Cappé, O.2    Moulines, E.3
  • 2
    • 0031623661 scopus 로고    scopus 로고
    • Spectral voice conversion for text to-speech synthesis
    • A. Kain, and M. W. Macon, "Spectral voice conversion for textto-speech synthesis, " Proc. ICASSP, 1998, vol. 1, pp. 285-288.
    • (1998) Proc. ICASSP , vol.1 , pp. 285-288
    • Kain, A.1    Macon, M.W.2
  • 3
    • 57749193836 scopus 로고    scopus 로고
    • Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
    • Nov
    • T. Toda, A.W. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory, " IEEE Trans. Audio, Speech, Lang., Process., vol. 15, no. 8, pp. 2222-2235, Nov. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang., Process , vol.15 , Issue.8 , pp. 2222-2235
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 5
    • 84857498745 scopus 로고    scopus 로고
    • Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora
    • May
    • E. Godoy, O. Rosec, and T. Chonavel, "Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora, " IEEE Trans. Audio, Speech, Lang., Process, vol. 20, no. 4, pp. 1313-1323, May. 2012.
    • (2012) IEEE Trans. Audio, Speech, Lang., Process , vol.20 , Issue.4 , pp. 1313-1323
    • Godoy, E.1    Rosec, O.2    Chonavel, T.3
  • 7
    • 84874485325 scopus 로고    scopus 로고
    • Exploring mutual information for GMM-based spectral conversion
    • H. T. Hwang, Y. Tsao, H. M. Wang, Y. R. Wang and S. H. Chen, "Exploring mutual information for GMM-based spectral conversion, " Proc. ISCSLP, 2012, pp. 50-54.
    • (2012) Proc. ISCSLP , pp. 50-54
    • Hwang, H.T.1    Tsao, Y.2    Wang, H.M.3    Wang, Y.R.4    Chen, S.H.5
  • 8
    • 84865754815 scopus 로고    scopus 로고
    • Voice conversion using GMM with enhanced global variance
    • H. Benisty and D. Malah, "Voice conversion using GMM with enhanced global variance", Proc. INTERSPEECH, 2011, pp. 669-672.
    • (2011) Proc. INTERSPEECH , pp. 669-672
    • Benisty, H.1    Malah, D.2
  • 9
    • 78149260085 scopus 로고    scopus 로고
    • Continuous stochastic feature mapping based on trajectory HMMs
    • Feb
    • H. Zen, Y. Nankaku, and K. Tokuda, "Continuous stochastic feature mapping based on trajectory HMMs, " IEEE Trans. Audio, Speech, Lang., Process., vol. 19, no. 2, pp. 417-430, Feb. 2011.
    • (2011) IEEE Trans. Audio, Speech, Lang., Process. , vol.19 , Issue.2 , pp. 417-430
    • Zen, H.1    Nankaku, Y.2    Tokuda, K.3
  • 11
    • 34547552192 scopus 로고    scopus 로고
    • Conditional vector quantization for voice conversion
    • A. Mouchtaris, Y. Agiomyrgiannakis, and Y. Stylianou, "Conditional vector quantization for voice conversion, " Proc. ICASSP, 2007, vol. 4, pp. 505-508.
    • (2007) Proc. ICASSP , vol.4 , pp. 505-508
    • Mouchtaris, A.1    Agiomyrgiannakis, Y.2    Stylianou, Y.3
  • 12
    • 70450186582 scopus 로고    scopus 로고
    • Alleviating the one-tomany mapping problem in voice conversion with contextdependent modeling
    • E. Godoy, O. Rosec, and T. Chonavel, "Alleviating the one-tomany mapping problem in voice conversion with contextdependent modeling", Proc. INTERSPEECH, 2009, pp. 1627-1630.
    • (2009) Proc. INTERSPEECH , pp. 1627-1630
    • Godoy, E.1    Rosec, O.2    Chonavel, T.3
  • 13
    • 0022890536 scopus 로고
    • Maximum mutual information estimation of hidden Markov model parameters for speech recognition
    • L. R. Bahl, P.F. Brown, P. V. De Souza, and L. R., Mercer, "Maximum mutual information estimation of hidden Markov model parameters for speech recognition, " Proc. ICASSP, 1986, vol. 11, pp. 49-52.
    • (1986) Proc. ICASSP , vol.11 , pp. 49-52
    • Bahl, L.R.1    Brown, P.F.2    De Souza, P.V.3    Mercer, R.L.4
  • 14
    • 0031139839 scopus 로고    scopus 로고
    • Minimum classification error rate methods for speech recognition
    • May
    • B. H. Juang, W. Chou, and C. H. Lee, "Minimum classification error rate methods for speech recognition, " IEEE Trans. Speech Audio Process., vol. 5, no. 3, pp. 257-265, May. 1997.
    • (1997) IEEE Trans. Speech Audio Process. , vol.5 , Issue.3 , pp. 257-265
    • Juang, B.H.1    Chou, W.2    Lee, C.H.3
  • 15
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds, " Speech Commun., vol. 27, no. 3-4, pp.187-207, 1999.
    • (1999) Speech Commun. , vol.27 , Issue.3-4 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    De Cheveigné, A.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.