



2013, Pages 104-108

Conditional restricted Boltzmann machine for voice conversion

Author keywords

Conditional restricted Boltzmann machine; speech synthesis; voice conversion

Indexed keywords

CONDITIONAL RESTRICTED BOLTZMANN MACHINES; CORRELATION COEFFICIENT; EXPERIMENTAL VALIDATIONS; GAUSSIAN MIXTURE MODEL; NON-LINEAR TRANSFORMATIONS; STATISTICAL MODELING; TRANSFORMATION FUNCTIONS; VOICE CONVERSION

EID: 84889579519; PISSN: None; EISSN: None; Source Type: Conference Proceeding
DOI: 10.1109/ChinaSIP.2013.6625307; Document Type: Conference Paper
Times cited: 77

References (17)
  • 3
    • 84869384026
    • Mixture of factor analyzers using priors from non-parallel speech for voice conversion
    • Z. Wu, T. Kinnunen, E. Chng, and H. Li, "Mixture of factor analyzers using priors from non-parallel speech for voice conversion," Signal Processing Letters, IEEE, vol. 19, no. 12, pp. 914-917, 2012.
    • (2012) Signal Processing Letters, IEEE, vol. 19, issue 12, pp. 914-917
    • Wu, Z.; Kinnunen, T.; Chng, E.; Li, H.
  • 4
    • 84867594339
    • Local linear transformation for voice conversion
    • V. Popa, H. Silen, J. Nurminen, and M. Gabbouj, "Local linear transformation for voice conversion," in ICASSP 2012.
    • (2012) ICASSP
    • Popa, V.; Silen, H.; Nurminen, J.; Gabbouj, M.
  • 5
    • 0029254176
    • Transformation of formants for voice conversion using artificial neural networks
    • M. Narendranath, H. A. Murthy, S. Rajendran, and B. Yegnanarayana, "Transformation of formants for voice conversion using artificial neural networks," Speech Communication, vol. 16, no. 2, pp. 207-216, 1995.
    • (1995) Speech Communication, vol. 16, issue 2, pp. 207-216
    • Narendranath, M.; Murthy, H.A.; Rajendran, S.; Yegnanarayana, B.
  • 8
    • 57749193836
    • Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
    • T. Toda, A. W. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory," IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 8, pp. 2222-2235, 2007.
    • (2007) IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, issue 8, pp. 2222-2235
    • Toda, T.; Black, A.W.; Tokuda, K.
  • 10
    • 84905560807
    • Voice conversion with smoothed GMM and MAP adaptation
    • Y. Chen, M. Chu, E. Chang, J. Liu, and R. Liu, "Voice conversion with smoothed GMM and MAP adaptation," in Eurospeech-2003, 2003, pp. 2413-2416.
    • (2003) Eurospeech-2003, pp. 2413-2416
    • Chen, Y.; Chu, M.; Chang, E.; Liu, J.; Liu, R.
  • 14
    • 0013344078
    • Training products of experts by minimizing contrastive divergence
    • G. E. Hinton, "Training products of experts by minimizing contrastive divergence," Neural Computation, vol. 14, no. 8, pp. 1771-1800, 2002.
    • (2002) Neural Computation, vol. 14, issue 8, pp. 1771-1800
    • Hinton, G.E.
  • 15
    • 0032673049
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigne, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Communication, vol. 27, no. 3, pp. 187-207, 1999.
    • (1999) Speech Communication, vol. 27, issue 3, pp. 187-207
    • Kawahara, H.; Masuda-Katsuse, I.; De Cheveigne, A.
  • 16
    • 79959842826
    • Text-independent F0 transformation with non-parallel data for voice conversion
    • Z. Z. Wu, T. Kinnunen, E. S. Chng, and H. Li, "Text-independent F0 transformation with non-parallel data for voice conversion," Proc. Interspeech 2010, pp. 1732-1735, 2010.
    • (2010) Proc. Interspeech 2010, pp. 1732-1735
    • Wu, Z.Z.; Kinnunen, T.; Chng, E.S.; Li, H.
  • 17
    • 85008039410
    • Improved prosody generation by maximizing joint probability of state and longer units
    • Y. Qian, Z. Wu, B. Gao, and F. K. Soong, "Improved prosody generation by maximizing joint probability of state and longer units," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 19, no. 6, pp. 1702-1710, 2011.
    • (2011) Audio, Speech, and Language Processing, IEEE Transactions on, vol. 19, issue 6, pp. 1702-1710
    • Qian, Y.; Wu, Z.; Gao, B.; Soong, F.K.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.