-
1
-
-
0032026483
-
Continuous probabilistic transform for voice conversion
-
Mar
-
Y. Stylianou, O. Cappé, and E. Moulines, "Continuous probabilistic transform for voice conversion," IEEE Trans. Speech Audio Process., vol. 6, no. 2, pp.131-142, Mar. 1998.
-
(1998)
IEEE Trans. Speech Audio Process
, vol.6
, Issue.2
, pp. 131-142
-
-
Stylianou, Y.1
Cappé, O.2
Moulines, E.3
-
2
-
-
57749193836
-
Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
-
Nov
-
T. Toda, A. W. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory," IEEE Trans. Audio, Speech, Lang., Process., vol. 15, no. 8, pp. 2222-2235, Nov. 2007.
-
(2007)
IEEE Trans. Audio, Speech, Lang., Process
, vol.15
, Issue.8
, pp. 2222-2235
-
-
Toda, T.1
Black, A.W.2
Tokuda, K.3
-
3
-
-
84946033919
-
Modulation spectrum-constrained trajectory training algorithm for GMM-based voice conversion
-
S. Takamichi, T. Toda, A. W. Black, and S. Nakamura, "Modulation spectrum-constrained trajectory training algorithm for GMM-based voice conversion," Proc. ICASSP, 2015.
-
(2015)
Proc. ICASSP
-
-
Takamichi, S.1
Toda, T.2
Black, A.W.3
Nakamura, S.4
-
4
-
-
84893234191
-
Incorporating global variance in the training phase of GMMbased voice conversion
-
H. T. Hwang, Y. Tsao, H. M. Wang, Y. R. Wang and S. H. Chen, "Incorporating global variance in the training phase of GMMbased voice conversion," Proc. APSIPA, 2013.
-
(2013)
Proc. APSIPA
-
-
Hwang, H.T.1
Tsao, Y.2
Wang, H.M.3
Wang, Y.R.4
Chen, S.H.5
-
5
-
-
77953727123
-
Voice conversion based on weighted frequency warping
-
July
-
D. Erro, A. Moreno, and A. Bonafonte, "Voice conversion based on weighted frequency warping," IEEE Trans. Audio, Speech, Lang., Process., vol. 18, no. 5, pp. 922-931, July. 2010.
-
(2010)
IEEE Trans. Audio, Speech, Lang., Process
, vol.18
, Issue.5
, pp. 922-931
-
-
Erro, D.1
Moreno, A.2
Bonafonte, A.3
-
6
-
-
84857498745
-
Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora
-
May
-
E. Godoy, O. Rosec, and T. Chonavel, "Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora," IEEE Trans. Audio, Speech, Lang., Process, vol. 20, no. 4, pp. 1313-1323, May. 2012.
-
(2012)
IEEE Trans. Audio, Speech, Lang., Process
, vol.20
, Issue.4
, pp. 1313-1323
-
-
Godoy, E.1
Rosec, O.2
Chonavel, T.3
-
7
-
-
77953707533
-
Spectral mapping using artificial neural networks for voice conversion
-
S. Desai, A. W. Black, B. Yegnanarayana, and K. Prahallad, "Spectral mapping using artificial neural networks for voice conversion," IEEE Trans. Audio, Speech, Lang., Process., vol. 18, no. 5, pp. 954-964, 2010.
-
(2010)
IEEE Trans. Audio, Speech, Lang., Process
, vol.18
, Issue.5
, pp. 954-964
-
-
Desai, S.1
Black, A.W.2
Yegnanarayana, B.3
Prahallad, K.4
-
8
-
-
84921735339
-
Voice conversion using deep neural networks with layer-wise generative training
-
L. H. Chen, Z. H. Ling, L. J. Liu, and L. R. Dai, "Voice conversion using deep neural networks with layer-wise generative training," IEEE/ACM Trans. Audio, Speech, Lang., Process., vol. 22, no. 12, pp.1859-1872, 2014.
-
(2014)
IEEE/ACM Trans. Audio, Speech, Lang., Process
, vol.22
, Issue.12
, pp. 1859-1872
-
-
Chen, L.H.1
Ling, Z.H.2
Liu, L.J.3
Dai, L.R.4
-
9
-
-
84906280857
-
Voice conversion in high-order eigen space using deep belief nets
-
T. Nakashika, R. Takashima, T. Takiguchi, and Y. Ariki, "Voice conversion in high-order eigen space using deep belief nets," Proc. INTERSPEEH, 2013.
-
(2013)
Proc. INTERSPEEH
-
-
Nakashika, T.1
Takashima, R.2
Takiguchi, T.3
Ariki, Y.4
-
10
-
-
84986185211
-
A probabilistic interpretation for artificial neural network-based voice conversion
-
H. T. Hwang, Y. Tsao, H. M. Wang, Y. R. Wang and S. H. Chen, "A Probabilistic Interpretation for Artificial Neural Network-based Voice Conversion," Proc. APSIPA, 2015.
-
(2015)
Proc. APSIPA
-
-
Hwang, H.T.1
Tsao, Y.2
Wang, H.M.3
Wang, Y.R.4
Chen, S.H.5
-
12
-
-
84901803470
-
Exemplar based voice conversion using non-negative spectrogram deconvolution
-
Z. Wu, T. Virtanen, T. Kinnunen, E. S. Chng, and H. Li, "Exemplar based voice conversion using non-negative spectrogram deconvolution," Proc. 8th ISCA Speech Synth. Workshop (SSW8), 2013.
-
(2013)
Proc. 8th ISCA Speech Synth. Workshop (SSW8
-
-
Wu, Z.1
Virtanen, T.2
Kinnunen, T.3
Chng, E.S.4
Li, H.5
-
13
-
-
84911369131
-
Exemplar-based sparse representation with residual compensation for voice conversion
-
Z. Wu, T. Virtanen, E. S. Chng, and H. Li, "Exemplar-based sparse representation with residual compensation for voice conversion," IEEE/ACM Trans. Audio, Speech, Lang., Process., vol. 22, no. 10, pp.1506-1521, 2014.
-
(2014)
IEEE/ACM Trans. Audio, Speech, Lang., Process
, vol.22
, Issue.10
, pp. 1506-1521
-
-
Wu, Z.1
Virtanen, T.2
Chng, E.S.3
Li, H.4
-
15
-
-
84879854889
-
Representation learning: A review and new perspectives
-
Y. Bengio, A. Courville, and P. Vincent, P. "Representation learning: A review and new perspectives," Pattern Analysis and Machine Intelligence, IEEE Transactions on, 35(8), pp. 1798-1828, 2013.
-
(2013)
Pattern Analysis and Machine Intelligence, IEEE Transactions on
, vol.35
, Issue.8
, pp. 1798-1828
-
-
Bengio, Y.1
Courville, A.2
Vincent P, P.3
-
17
-
-
0034704229
-
A global geometric framework for nonlinear dimensionality reduction
-
J.B. Tenenbaum, V. De Silva, and J.C. Langford, "A global geometric framework for nonlinear dimensionality reduction," Science, vol. 290, no. 5500, pp. 2319-2323, 2000.
-
(2000)
Science
, vol.290
, Issue.5500
, pp. 2319-2323
-
-
Tenenbaum, J.B.1
De Silva, V.2
Langford, J.C.3
-
18
-
-
0043278893
-
Laplacian eigenmaps and spectral techniques for embedding and clustering
-
M. Belkin and P. Niyogi, "Laplacian eigenmaps and spectral techniques for embedding and clustering," Advances in neural information processing systems, vol. 14, pp. 585-591, 2001.
-
(2001)
Advances in Neural Information Processing Systems
, vol.14
, pp. 585-591
-
-
Belkin, M.1
Niyogi, P.2
-
19
-
-
0034704222
-
Nonlinear dimensionality reduction by locally linear embedding
-
S.T. Roweis and L.K. Saul, "Nonlinear dimensionality reduction by locally linear embedding," Science, vol. 290, no. 5500, pp. 2323-2326, 2000.
-
(2000)
Science
, vol.290
, Issue.5500
, pp. 2323-2326
-
-
Roweis, S.T.1
Saul, L.K.2
-
21
-
-
0033708106
-
Speech parameter generation algorithms for HMMbased speech synthesis
-
K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. kitamura, "Speech parameter generation algorithms for HMMbased speech synthesis," Proc. ICASSP, 2000.
-
(2000)
Proc. ICASSP
-
-
Tokuda, K.1
Yoshimura, T.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
22
-
-
84878384520
-
Ways to implement global variance in statistical speech synthesis
-
H. Silén, E. Helander, J. Nurminen, M. Gabbouj, "Ways to implement global variance in statistical speech synthesis," Proc. INTERSPEECH, 2012.
-
(2012)
Proc. INTERSPEECH
-
-
Silén, H.1
Helander, E.2
Nurminen, J.3
Gabbouj, M.4
-
24
-
-
0032673049
-
Restructuring speech representations using a pitch-adaptive timefrequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
-
H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using a pitch-adaptive timefrequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Commun., vol. 27, no. 3-4, pp.187-207, 1999.
-
(1999)
Speech Commun
, vol.27
, Issue.3-4
, pp. 187-207
-
-
Kawahara, H.1
Masuda-Katsuse, I.2
De Cheveigné, A.3
-
25
-
-
84994338928
-
-
Festvox. Available: http://www.festvox.org/download.html.
-
Festvox
-
-
-
26
-
-
84994361374
-
The voice conversion challenge 2016
-
T. Toda, L. H. Chen, D. Saito, F. Villavicencio, M. Wester, Z. Wu and J. Yamagishi, "The Voice Conversion Challenge 2016," Proc. INTERSPEECH, 2016.
-
(2016)
Proc. INTERSPEECH
-
-
Toda, T.1
Chen, L.H.2
Saito, D.3
Villavicencio, F.4
Wester, M.5
Wu, Z.6
Yamagishi, J.7
-
27
-
-
84994351528
-
Analysis of the voice conversion challenge 2016 evaluation results
-
M. Wester, Z. Wu and J. Yamagishi, "Analysis of the Voice Conversion Challenge 2016 Evaluation Results," Proc. INTERSPEECH, 2016.
-
(2016)
Proc. INTERSPEECH
-
-
Wester, M.1
Wu, Z.2
Yamagishi, J.3
|