-
1
-
-
0031623661
-
Spectral voice conversion for textto-speech synthesis
-
A. Kain and M. W. Macon, "Spectral voice conversion for textto-speech synthesis, " in Proc. ICASSP, 1998, pp. 285-288.
-
(1998)
Proc. ICASSP
, pp. 285-288
-
-
Kain, A.1
Macon, M.W.2
-
2
-
-
0034842552
-
Voice conversion algorithm based on gaussian mixture model with dynamic frequency warping of straight spectrum
-
T. Toda, H. Saruwatari, and K. Shikano, "Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum, " in Acoustics, Speech, and Signal Processing, 2001, vol. 2, pp. 841-844.
-
(2001)
Acoustics, Speech, and Signal Processing
, vol.2
, pp. 841-844
-
-
Toda, T.1
Saruwatari, H.2
Shikano, K.3
-
3
-
-
57749193836
-
Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
-
nov
-
T. Toda, A. W. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory, " IEEE Trans. Audio, Speech, and Lang. Process, vol. 15, no. 8, pp. 2222-2235, nov. 2007.
-
(2007)
IEEE Trans. Audio, Speech, and Lang. Process
, vol.15
, Issue.8
, pp. 2222-2235
-
-
Toda, T.1
Black, A.W.2
Tokuda, K.3
-
4
-
-
84901237776
-
Modeling spectral envelopes using restricted Boltzmann machines and deep belief networks for statistical parametric speech synthesis
-
Z. H. Ling, L. Deng, and D. Yu, "Modeling spectral envelopes using restricted Boltzmann machines and deep belief networks for statistical parametric speech synthesis, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 21, no. 10, pp. 2129-2139, 2013.
-
(2013)
Audio, Speech, and Language Processing, IEEE Transactions on
, vol.21
, Issue.10
, pp. 2129-2139
-
-
Ling, Z.H.1
Deng, L.2
Yu, D.3
-
5
-
-
84905560807
-
Voice conversion with smoothed GMM and MAP adaptation
-
Yining Chen, Min Chu, Eric Chang, Jia Liu, and Runsheng Liu, "Voice conversion with smoothed GMM and MAP adaptation., " in Eurospeech, 2003, pp. 2413-2416.
-
(2003)
Eurospeech
, pp. 2413-2416
-
-
Chen, Y.1
Chu, M.2
Chang, E.3
Liu, J.4
Liu, R.5
-
6
-
-
78149260085
-
Continuous stochastic feature mapping based on trajectory HMMs
-
H. Zen, Y. Nankaku, and K. Tokuda, "Continuous stochastic feature mapping based on trajectory HMMs, " IEEE Trans. Audio, Speech, and Lang. Process, vol. 19, no. 2, pp. 417-430, 2011.
-
(2011)
IEEE Trans. Audio, Speech, and Lang. Process
, vol.19
, Issue.2
, pp. 417-430
-
-
Zen, H.1
Nankaku, Y.2
Tokuda, K.3
-
8
-
-
84906225084
-
Joint spectral distribution modeling using restricted Boltzmann machines for voice conversion
-
L. H. Chen, Z. H. Ling, Y. Song, and L. R. Dai, "Joint spectral distribution modeling using restricted Boltzmann machines for voice conversion, " in Proc. InterSpeech, 2013, pp. 3052-3056.
-
(2013)
Proc. InterSpeech
, pp. 3052-3056
-
-
Chen, L.H.1
Ling, Z.H.2
Song, Y.3
Dai, L.R.4
-
9
-
-
0013344078
-
Training products of experts by minimizing contrastive divergence
-
G. E Hinton, "Training products of experts by minimizing contrastive divergence, " Neural Computation, vol. 12, no. 14, pp. 1711-1800, 2002.
-
(2002)
Neural Computation
, vol.12
, Issue.14
, pp. 1711-1800
-
-
Hinton, G.E.1
-
10
-
-
0023861743
-
Bidirectional associative memories
-
B. Kosko, "Bidirectional associative memories, " IEEE Trans. Systems, Man and Cybernetics, vol. 18, no. 1, pp. 49-60, 1988.
-
(1988)
IEEE Trans. Systems, Man and Cybernetics
, vol.18
, Issue.1
, pp. 49-60
-
-
Kosko, B.1
-
11
-
-
84878387361
-
PLDA using Gaussian restricted Boltzmann machines with application to speaker verification
-
T. Stafylakis, P. Kenny, M. Senoussaoui, and et al, "PLDA using Gaussian restricted Boltzmann machines with application to speaker verification, " in INTERSPEECH, 2012.
-
(2012)
INTERSPEECH
-
-
Stafylakis, T.1
Kenny, P.2
Senoussaoui, M.3
-
12
-
-
84861125212
-
A practical guide to training restricted Boltzmann machines
-
G. E Hinton, "A practical guide to training restricted Boltzmann machines, " Momentum, vol. 9, no. 1, 2010.
-
(2010)
Momentum
, vol.9
, Issue.1
-
-
Hinton, G.E.1
-
13
-
-
0032673049
-
Restructuring speech representations using a pitch-adaptive timefrequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
-
H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using a pitch-adaptive timefrequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds, " Speech Communication, vol. 27, no. 3, pp. 187-208, 1999.
-
(1999)
Speech Communication
, vol.27
, Issue.3
, pp. 187-208
-
-
Kawahara, H.1
Masuda-Katsuse, I.2
De Cheveigné, A.3
|