메뉴 건너뛰기




Volumn , Issue , 2014, Pages 7884-7888

Using bidirectional associative memories for joint spectral envelope modeling in voice conversion

Author keywords

bidirectional associative memory; contrastive divergence; Spectral envelope modeling; voice conversion

Indexed keywords

ASSOCIATIVE PROCESSING; ASSOCIATIVE STORAGE; GAUSSIAN DISTRIBUTION; SIGNAL PROCESSING;

EID: 84905223323     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2014.6855135     Document Type: Conference Paper
Times cited : (15)

References (13)
  • 1
    • 0031623661 scopus 로고    scopus 로고
    • Spectral voice conversion for textto-speech synthesis
    • A. Kain and M. W. Macon, "Spectral voice conversion for textto-speech synthesis, " in Proc. ICASSP, 1998, pp. 285-288.
    • (1998) Proc. ICASSP , pp. 285-288
    • Kain, A.1    Macon, M.W.2
  • 2
    • 0034842552 scopus 로고    scopus 로고
    • Voice conversion algorithm based on gaussian mixture model with dynamic frequency warping of straight spectrum
    • T. Toda, H. Saruwatari, and K. Shikano, "Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum, " in Acoustics, Speech, and Signal Processing, 2001, vol. 2, pp. 841-844.
    • (2001) Acoustics, Speech, and Signal Processing , vol.2 , pp. 841-844
    • Toda, T.1    Saruwatari, H.2    Shikano, K.3
  • 3
    • 57749193836 scopus 로고    scopus 로고
    • Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
    • nov
    • T. Toda, A. W. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory, " IEEE Trans. Audio, Speech, and Lang. Process, vol. 15, no. 8, pp. 2222-2235, nov. 2007.
    • (2007) IEEE Trans. Audio, Speech, and Lang. Process , vol.15 , Issue.8 , pp. 2222-2235
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 4
    • 84901237776 scopus 로고    scopus 로고
    • Modeling spectral envelopes using restricted Boltzmann machines and deep belief networks for statistical parametric speech synthesis
    • Z. H. Ling, L. Deng, and D. Yu, "Modeling spectral envelopes using restricted Boltzmann machines and deep belief networks for statistical parametric speech synthesis, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 21, no. 10, pp. 2129-2139, 2013.
    • (2013) Audio, Speech, and Language Processing, IEEE Transactions on , vol.21 , Issue.10 , pp. 2129-2139
    • Ling, Z.H.1    Deng, L.2    Yu, D.3
  • 5
    • 84905560807 scopus 로고    scopus 로고
    • Voice conversion with smoothed GMM and MAP adaptation
    • Yining Chen, Min Chu, Eric Chang, Jia Liu, and Runsheng Liu, "Voice conversion with smoothed GMM and MAP adaptation., " in Eurospeech, 2003, pp. 2413-2416.
    • (2003) Eurospeech , pp. 2413-2416
    • Chen, Y.1    Chu, M.2    Chang, E.3    Liu, J.4    Liu, R.5
  • 6
    • 78149260085 scopus 로고    scopus 로고
    • Continuous stochastic feature mapping based on trajectory HMMs
    • H. Zen, Y. Nankaku, and K. Tokuda, "Continuous stochastic feature mapping based on trajectory HMMs, " IEEE Trans. Audio, Speech, and Lang. Process, vol. 19, no. 2, pp. 417-430, 2011.
    • (2011) IEEE Trans. Audio, Speech, and Lang. Process , vol.19 , Issue.2 , pp. 417-430
    • Zen, H.1    Nankaku, Y.2    Tokuda, K.3
  • 8
    • 84906225084 scopus 로고    scopus 로고
    • Joint spectral distribution modeling using restricted Boltzmann machines for voice conversion
    • L. H. Chen, Z. H. Ling, Y. Song, and L. R. Dai, "Joint spectral distribution modeling using restricted Boltzmann machines for voice conversion, " in Proc. InterSpeech, 2013, pp. 3052-3056.
    • (2013) Proc. InterSpeech , pp. 3052-3056
    • Chen, L.H.1    Ling, Z.H.2    Song, Y.3    Dai, L.R.4
  • 9
    • 0013344078 scopus 로고    scopus 로고
    • Training products of experts by minimizing contrastive divergence
    • G. E Hinton, "Training products of experts by minimizing contrastive divergence, " Neural Computation, vol. 12, no. 14, pp. 1711-1800, 2002.
    • (2002) Neural Computation , vol.12 , Issue.14 , pp. 1711-1800
    • Hinton, G.E.1
  • 10
    • 0023861743 scopus 로고
    • Bidirectional associative memories
    • B. Kosko, "Bidirectional associative memories, " IEEE Trans. Systems, Man and Cybernetics, vol. 18, no. 1, pp. 49-60, 1988.
    • (1988) IEEE Trans. Systems, Man and Cybernetics , vol.18 , Issue.1 , pp. 49-60
    • Kosko, B.1
  • 11
    • 84878387361 scopus 로고    scopus 로고
    • PLDA using Gaussian restricted Boltzmann machines with application to speaker verification
    • T. Stafylakis, P. Kenny, M. Senoussaoui, and et al, "PLDA using Gaussian restricted Boltzmann machines with application to speaker verification, " in INTERSPEECH, 2012.
    • (2012) INTERSPEECH
    • Stafylakis, T.1    Kenny, P.2    Senoussaoui, M.3
  • 12
    • 84861125212 scopus 로고    scopus 로고
    • A practical guide to training restricted Boltzmann machines
    • G. E Hinton, "A practical guide to training restricted Boltzmann machines, " Momentum, vol. 9, no. 1, 2010.
    • (2010) Momentum , vol.9 , Issue.1
    • Hinton, G.E.1
  • 13
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive timefrequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using a pitch-adaptive timefrequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds, " Speech Communication, vol. 27, no. 3, pp. 187-208, 1999.
    • (1999) Speech Communication , vol.27 , Issue.3 , pp. 187-208
    • Kawahara, H.1    Masuda-Katsuse, I.2    De Cheveigné, A.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.