-
1
-
-
0032026483
-
Continu-ous probabilistic transform for voice conversion
-
Y. Stylianou, O. Cappe, and E. Moulines, "Continu-ous probabilistic transform for voice conversion, " IEEE Transactions on Speech and Audio Processing, vol. 6, no. 2, pp. 131-142, 1998.
-
(1998)
IEEE Transactions on Speech and Audio Processing
, vol.6
, Issue.2
, pp. 131-142
-
-
Stylianou, Y.1
Cappe, O.2
Moulines, E.3
-
2
-
-
77953712499
-
Voice conversion using partial least squares re-gression
-
E. Helander, T. Virtanen, J. Nurminen, and M. Gab-bouj, "Voice conversion using partial least squares re-gression, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 18, no. 5, pp. 912-921, 2010.
-
(2010)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.18
, Issue.5
, pp. 912-921
-
-
Helander, E.1
Virtanen, T.2
Nurminen, J.3
Gab-Bouj, M.4
-
3
-
-
84869384026
-
Mixture of factor analyzers using priors from non-parallel speech for voice conversion
-
Z. Wu, T. Kinnunen, E. Chng, and H. Li, "Mixture of factor analyzers using priors from non-parallel speech for voice conversion, " Signal Processing Letters, IEEE, vol. 19, no. 12, pp. 914-917, 2012.
-
(2012)
Signal Processing Letters, IEEE
, vol.19
, Issue.12
, pp. 914-917
-
-
Wu, Z.1
Kinnunen, T.2
Chng, E.3
Li, H.4
-
5
-
-
0029254176
-
Transformation of formants for voice conversion using artificial neural networks
-
M. Narendranath, H. A. Murthy, S. Rajendran, and B. Yegnanarayana, "Transformation of formants for voice conversion using artificial neural networks, " Speech communication, vol. 16, no. 2, pp. 207-216, 1995.
-
(1995)
Speech Communication
, vol.16
, Issue.2
, pp. 207-216
-
-
Narendranath, M.1
Murthy, H.A.2
Rajendran, S.3
Yegnanarayana, B.4
-
6
-
-
77953707533
-
Spectral mapping using artificial neural net-works for voice conversion
-
S. Desai, A. W. Black, B. Yegnanarayana, and K. Pra-hallad, "Spectral mapping using artificial neural net-works for voice conversion, " IEEE Transactions on Au-dio, Speech, and Language Processing, vol. 18, no. 5, pp. 954-964, 2010.
-
(2010)
IEEE Transactions on Au-dio, Speech, and Language Processing
, vol.18
, Issue.5
, pp. 954-964
-
-
Desai, S.1
Black, A.W.2
Yegnanarayana, B.3
Pra-Hallad, K.4
-
7
-
-
84856141218
-
Voice conversion using dynamic kernel partial least squares regression
-
E. Helander, H. Silen, T. Virtanen, and M. Gabbouj, "Voice conversion using dynamic kernel partial least squares regression, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 3, pp. 806-817, 2012.
-
(2012)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.20
, Issue.3
, pp. 806-817
-
-
Helander, E.1
Silen, H.2
Virtanen, T.3
Gabbouj, M.4
-
8
-
-
57749193836
-
Voice conver-sion based on maximum-likelihood estimation of spec-tral parameter trajectory
-
T. Toda, A. W. Black, and K. Tokuda, "Voice conver-sion based on maximum-likelihood estimation of spec-tral parameter trajectory, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 8, pp. 2222-2235, 2007.
-
(2007)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.15
, Issue.8
, pp. 2222-2235
-
-
Toda, T.1
Black, A.W.2
Tokuda, K.3
-
9
-
-
84859768504
-
Statistical voice conversion based on noisy channel model
-
D. Saito, S. Watanabe, A. Nakamura, and N. Mine-matsu, "Statistical voice conversion based on noisy channel model, " Audio, Speech, and Language Process-ing, IEEE Transactions on, vol. 20, no. 6, pp. 1784-1794, 2012.
-
(2012)
Audio, Speech, and Language Process-ing, IEEE Transactions on
, vol.20
, Issue.6
, pp. 1784-1794
-
-
Saito, D.1
Watanabe, S.2
Nakamura, A.3
Mine-Matsu, N.4
-
10
-
-
84905560807
-
Voice conversion with smoothed gmm and map adaptation
-
Y. Chen, M. Chu, E. Chang, J. Liu, and R. Liu, "Voice conversion with smoothed gmm and map adaptation, " in Eurospeech-2003, 2003, pp. 2413-2416.
-
(2003)
Eurospeech-2003
, pp. 2413-2416
-
-
Chen, Y.1
Chu, M.2
Chang, E.3
Liu, J.4
Liu, R.5
-
12
-
-
0031623661
-
Spectral voice conversion for text-to-speech synthesis
-
Alexander Kain and MichaelWMacon, "Spectral voice conversion for text-to-speech synthesis, " in Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on. IEEE, 1998, vol. 1, pp. 285-288.
-
(1998)
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference On. IEEE
, vol.1
, pp. 285-288
-
-
Kain, A.1
Michaelwmacon2
-
13
-
-
84864026688
-
Modeling human motion using binary latent variables
-
G. W. Taylor, G. E. Hinton, and S. T. Roweis, "Modeling human motion using binary latent variables, " Advances in neural information processing systems, vol. 19, pp. 1345, 2007.
-
(2007)
Advances in Neural Information Processing Systems
, vol.19
, pp. 1345
-
-
Taylor, G.W.1
Hinton, G.E.2
Roweis, S.T.3
-
14
-
-
0013344078
-
Training products of experts by minimiz-ing contrastive divergence
-
G. E. Hinton, "Training products of experts by minimiz-ing contrastive divergence, " Neural computation, vol. 14, no. 8, pp. 1771-1800, 2002.
-
(2002)
Neural Computation
, vol.14
, Issue.8
, pp. 1771-1800
-
-
Hinton, G.E.1
-
15
-
-
0032673049
-
Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possi-ble role of a repetitive structure in sounds
-
H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigne, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possi-ble role of a repetitive structure in sounds, " Speech communication, vol. 27, no. 3, pp. 187-207, 1999.
-
(1999)
Speech Communication
, vol.27
, Issue.3
, pp. 187-207
-
-
Kawahara, H.1
Masuda-Katsuse, I.2
De Cheveigne, A.3
-
16
-
-
79959842826
-
Text-independent f0 transformation with non-parallel data for voice conversion
-
Z. Z. Wu, T. Kinnunen, E. S. Chng, and H. Li, "Text-independent f0 transformation with non-parallel data for voice conversion, " Proc. Interspeech 2010, pp. 1732-1735, 2010.
-
(2010)
Proc. Interspeech 2010
, pp. 1732-1735
-
-
Wu, Z.Z.1
Kinnunen, T.2
Chng, E.S.3
Li, H.4
-
17
-
-
85008039410
-
Improved prosody generation by maximizing joint probability of state and longer units
-
Y. Qian, Z. Wu, B. Gao, and F. K. Soong, "Improved prosody generation by maximizing joint probability of state and longer units, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 19, no. 6, pp. 1702-1710, 2011.
-
(2011)
Audio, Speech, and Language Processing, IEEE Transactions on
, vol.19
, Issue.6
, pp. 1702-1710
-
-
Qian, Y.1
Wu, Z.2
Gao, B.3
Soong, F.K.4
|