메뉴 건너뛰기




Volumn 58, Issue , 2014, Pages 124-138

Voice conversion based on Gaussian processes by coherent and asymmetric training with limited training data

Author keywords

Asymmetric training; Coherent training; Gaussian mixture model; Gaussian processes; Voice conversion

Indexed keywords

COMPUTATIONAL COSTS; GAUSSIAN MIXTURE MODEL; GAUSSIAN PROCESSES; LIMITED TRAINING DATA; NONLINEAR MAPPINGS; SPECTRAL FEATURE; TRAINING STRATEGY; VOICE CONVERSION;

EID: 84890539284     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2013.11.005     Document Type: Article
Times cited : (31)

References (27)
  • 1
    • 0023739214 scopus 로고    scopus 로고
    • Voice conversion through vector quantization
    • Speech Signal Process., New York, USA
    • Abe, M., Nakamura, S., Shikano, K., Kuwabara, H., 1998. Voice conversion through vector quantization. In: Proc. IEEE Int. Conf. Acoust. Speech Signal Process., New York, USA, pp. 655-658.
    • (1998) Proc. IEEE Int. Conf. Acoust , pp. 655-658
    • Abe, M.1    Nakamura, S.2    Shikano, K.3    Kuwabara, H.4
  • 2
    • 0033154052 scopus 로고    scopus 로고
    • Speaker transformation algorithm using segmental codebooks (STASC)
    • L.M. Arslan Speaker transformation algorithm using segmental codebooks (STASC) Speech Commun. 28 1999 211 226
    • (1999) Speech Commun. , vol.28 , pp. 211-226
    • Arslan, L.M.1
  • 4
    • 84876887691 scopus 로고    scopus 로고
    • Nonparametric mixtures of Gaussian processes with power-law behavior
    • S.P. Chatzis, and Y. Demiris Nonparametric mixtures of Gaussian processes with power-law behavior IEEE Trans. Neural Networks Learn. Syst. 23 2012 1862 1871
    • (2012) IEEE Trans. Neural Networks Learn. Syst. , vol.23 , pp. 1862-1871
    • Chatzis, S.P.1    Demiris, Y.2
  • 5
    • 84905560807 scopus 로고    scopus 로고
    • Voice conversion with smoothed GMM and MAP adaptation
    • Geneva, Switzerland
    • Chen, Y., Chu, M., Chang, E., Liu, J., Liu, R., 2003. Voice conversion with smoothed GMM and MAP adaptation. In: Proc. Interspeech. Geneva, Switzerland, pp. 2413-2416.
    • (2003) Proc. Interspeech , pp. 2413-2416
    • Chen, Y.1    Chu, M.2    Chang, E.3    Liu, J.4    Liu, R.5
  • 10
    • 51449107658 scopus 로고    scopus 로고
    • LSF mapping for voice conversion with very small training sets
    • Speech Signal Process
    • Helander, E., Nurminen, J., Gabbouj, M., 2008. LSF mapping for voice conversion with very small training sets. In: Proc. IEEE Int. Conf. Acoust. Speech Signal Process., pp. 4669-4672.
    • (2008) Proc. IEEE Int. Conf. Acoust , pp. 4669-4672
    • Helander, E.1    Nurminen, J.2    Gabbouj, M.3
  • 12
    • 4444285698 scopus 로고    scopus 로고
    • (Ph.D. thesis). Oregon Health and Sci University, Rockford, USA
    • Kain, A., 2001. High resolution voice transformation (Ph.D. thesis). Oregon Health and Sci University, Rockford, USA.
    • (2001) High Resolution Voice Transformation
    • Kain, A.1
  • 13
    • 38149065136 scopus 로고    scopus 로고
    • Statistical approach for voice personality transformation
    • K.S. Lee Statistical approach for voice personality transformation IEEE Trans. Audio Speech Lang. Process. 15 2007 641 651
    • (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , pp. 641-651
    • Lee, K.S.1
  • 14
    • 0016495091 scopus 로고
    • Linear prediction: A tutorial review
    • J. Makhoul Linear prediction: a tutorial review Proc. IEEE 63 1975 561 580
    • (1975) Proc. IEEE , vol.63 , pp. 561-580
    • Makhoul, J.1
  • 16
    • 0029254176 scopus 로고
    • Transformation of formants for voice conversion using artificial neural networks
    • M. Narendranath, H.A. Murthy, S. Rajendran, and B. Yegnanarayana Transformation of formants for voice conversion using artificial neural networks Speech Commun. 16 1995 207 216
    • (1995) Speech Commun. , vol.16 , pp. 207-216
    • Narendranath, M.1    Murthy, H.A.2    Rajendran, S.3    Yegnanarayana, B.4
  • 17
    • 84865737668 scopus 로고    scopus 로고
    • Gaussian process experts for voice conversion
    • Florence, Italy
    • Pilkington, N.C.V., Zen, H., Gales, M.J.F., 2011. Gaussian process experts for voice conversion. In: Proc. Interspeech. Florence, Italy, pp. 2761-2764.
    • (2011) Proc. Interspeech , pp. 2761-2764
    • Pilkington, N.C.V.1    Zen, H.2    Gales, M.J.F.3
  • 24
    • 57749193836 scopus 로고    scopus 로고
    • Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
    • T. Toda, A.W. Black, and K. Tokuda Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory IEEE Trans. Audio Speech Lang. Process. 15 2007 2222 2235
    • (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , pp. 2222-2235
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 25
    • 38649140222 scopus 로고    scopus 로고
    • Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model
    • DOI 10.1016/j.specom.2007.09.001, PII S0167639307001495
    • T. Toda, A.W. Black, and K. Tokuda Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model Speech Commun. 50 2008 215 227 (Pubitemid 351172471)
    • (2008) Speech Communication , vol.50 , Issue.3 , pp. 215-227
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 26
    • 78650194302 scopus 로고    scopus 로고
    • A voice conversion algorithm in the context of sparse training data
    • N. Xu, and Z. Yang A voice conversion algorithm in the context of sparse training data J. Nanjing Univ. Posts Telecommun. 30 2010 1 7
    • (2010) J. Nanjing Univ. Posts Telecommun. , vol.30 , pp. 1-7
    • Xu, N.1    Yang, Z.2
  • 27
    • 34047254509 scopus 로고    scopus 로고
    • Quality-enhanced voice morphing using maximum likelihood transformations
    • DOI 10.1109/TSA.2005.860839
    • H. Ye, and S. Young Quality enhanced voice morphing using maximum likelihood transformations IEEE Trans. Audio Speech Lang. Process. 14 2006 1301 1312 (Pubitemid 46547625)
    • (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.4 , pp. 1301-1312
    • Ye, H.1    Young, S.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.