-
1
-
-
0032026483
-
Continuousprobabilistic transform for voice conversion
-
Y. Stylianou, O. Cappé, and E. Moulines, "Continuousprobabilistic transform for voice conversion, " IEEETransactions on Speech and Audio Processing, vol. 6, no. 2, pp. 131-142, 1998.
-
(1998)
IEEETransactions on Speech and Audio Processing
, vol.6
, Issue.2
, pp. 131-142
-
-
Stylianou, Y.1
Cappé, O.2
Moulines, E.3
-
2
-
-
0031623661
-
Spectral voice conversionfor text-to-speech synthesis
-
A. Kain and M. W. Macon, "Spectral voice conversionfor text-to-speech synthesis, " in IEEE InternationalConference on Acoustics, Speech and Signal Processing(ICASSP), vol. 1, 1998, pp. 285-288.
-
(1998)
IEEE InternationalConference on Acoustics, Speech and Signal Processing(ICASSP)
, vol.1
, pp. 285-288
-
-
Kain, A.1
Macon, M.W.2
-
3
-
-
77953712499
-
Voice conversion using partial least squares regression
-
E. Heland er, T. Virtanen, J. Nurminen, and M. Gabbouj, "Voice conversion using partial least squares regression, "IEEE Transactions on Audio, Speech, and Language Processing, vol. 18, no. 5, pp. 912-921, 2010.
-
(2010)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.18
, Issue.5
, pp. 912-921
-
-
Heland Er, E.1
Virtanen, T.2
Nurminen, J.3
Gabbouj, M.4
-
4
-
-
70349197691
-
Voice conversion using artificialneural networks
-
S. Desai, E. V. Raghavendra, B. Yegnanarayana, A. W. Black, and K. Prahallad, "Voice conversion using artificialneural networks, " in IEEE International Conferenceon Acoustics, Speech and Signal Processing (ICASSP), 2009, pp. 3893-3896.
-
(2009)
IEEE International Conferenceon Acoustics, Speech and Signal Processing (ICASSP)
, pp. 3893-3896
-
-
Desai, S.1
Raghavendra, E.V.2
Yegnanarayana, B.3
Black, A.W.4
Prahallad, K.5
-
5
-
-
84921735339
-
Voiceconversion using deep neural networks with layer-wisegenerative training
-
L.-H. Chen, Z.-H. Ling, L.-J. Liu, and L.-R. Dai, "Voiceconversion using deep neural networks with layer-wisegenerative training, " IEEE Transactions on Speech and Audio Processing, vol. 22, no. 12, pp. 1859-1872, 2014.
-
(2014)
IEEE Transactions on Speech and Audio Processing
, vol.22
, Issue.12
, pp. 1859-1872
-
-
Chen, L.-H.1
Ling, Z.-H.2
Liu, L.-J.3
Dai, L.-R.4
-
6
-
-
84910087395
-
Sequenceerror (SE) minimization training of neural networkfor voice conversion
-
F.-L. Xie, Y. Qian, Y. Fan, F. K. Soong, and H. Li, "Sequenceerror (SE) minimization training of neural networkfor voice conversion, " in INTERSPEECH, 2014.
-
(2014)
INTERSPEECH
-
-
Xie, F.-L.1
Qian, Y.2
Fan, Y.3
Soong, F.K.4
Li, H.5
-
7
-
-
84856141218
-
Voice conversion using dynamic kernel partial leastsquares regression
-
E. Heland er, H. Silén, T. Virtanen, and M. Gabbouj, "Voice conversion using dynamic kernel partial leastsquares regression, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 3, pp. 806-817, 2012.
-
(2012)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.20
, Issue.3
, pp. 806-817
-
-
Heland Er, E.1
Silén, H.2
Virtanen, T.3
Gabbouj, M.4
-
8
-
-
57749193836
-
Voice conversionbased on maximum-likelihood estimation of spectral parametertrajectory
-
T. Toda, A. W. Black, and K. Tokuda, "Voice conversionbased on maximum-likelihood estimation of spectral parametertrajectory, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 8, pp. 2222-2235, 2007.
-
(2007)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.15
, Issue.8
, pp. 2222-2235
-
-
Toda, T.1
Black, A.W.2
Tokuda, K.3
-
9
-
-
84865754815
-
Voice conversion using GMMwith enhanced global variance
-
H. Benisty and D. Malah, "Voice conversion using GMMwith enhanced global variance, " in INTERSPEECH, 2011, pp. 669-672.
-
(2011)
INTERSPEECH
, pp. 669-672
-
-
Benisty, H.1
Malah, D.2
-
10
-
-
84901803470
-
Exemplar-based voice conversion using non-negativespectrogram deconvolution
-
Z. Wu, T. Virtanen, T. Kinnunen, E. S. Chng, and H. Li, "Exemplar-based voice conversion using non-negativespectrogram deconvolution, " in 8th ISCA Speech SynthesisWorkshop, 2013.
-
(2013)
8th ISCA Speech SynthesisWorkshop
-
-
Wu, Z.1
Virtanen, T.2
Kinnunen, T.3
Chng, E.S.4
Li, H.5
-
11
-
-
84874248255
-
Exemplarbasedvoice conversion in noisy environment
-
R. Takashima, T. Takiguchi, and Y. Ariki, "Exemplarbasedvoice conversion in noisy environment, " in SpokenLanguage Technology workshop (SLT), 2012, pp. 313-317.
-
(2012)
SpokenLanguage Technology Workshop (SLT)
, pp. 313-317
-
-
Takashima, R.1
Takiguchi, T.2
Ariki, Y.3
-
12
-
-
84911369131
-
Exemplarbasedsparse representation with residual compensationfor voice conversion
-
Z. Wu, T. Virtanen, E. S. Chng, and H. Li, "Exemplarbasedsparse representation with residual compensationfor voice conversion, " IEEE Transactions on Speech and Audio Processing, vol. 22, no. 10, pp. 1506-1521, 2014.
-
(2014)
IEEE Transactions on Speech and Audio Processing
, vol.22
, Issue.10
, pp. 1506-1521
-
-
Wu, Z.1
Virtanen, T.2
Chng, E.S.3
Li, H.4
-
14
-
-
84946753271
-
VTLN-basedcross-language voice conversion
-
D. Sundermann, H. Ney, and H. Hoge, "VTLN-basedcross-language voice conversion, " in IEEE Workshopon Automatic Speech Recognition and Understand ing(ASRU), 2003, pp. 676-681.
-
(2003)
IEEE Workshopon Automatic Speech Recognition and Understand Ing(ASRU)
, pp. 676-681
-
-
Sundermann, D.1
Ney, H.2
Hoge, H.3
-
15
-
-
77953727123
-
Voice conversionbased on weighted frequency warping
-
D. Erro, A. Moreno, and A. Bonafonte, "Voice conversionbased on weighted frequency warping, " IEEE Transactionson Audio, Speech, and Language Processing, vol. 18, no. 5, pp. 922-931, 2010.
-
(2010)
IEEE Transactionson Audio, Speech, and Language Processing
, vol.18
, Issue.5
, pp. 922-931
-
-
Erro, D.1
Moreno, A.2
Bonafonte, A.3
-
16
-
-
84872177757
-
Parametric voice conversionbased on bilinear frequency warping plus amplitudescaling
-
D. Erro, E. Navas, and I. Hernaez, "Parametric voice conversionbased on bilinear frequency warping plus amplitudescaling, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 21, no. 3, pp. 556-566, 2013.
-
(2013)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.21
, Issue.3
, pp. 556-566
-
-
Erro, D.1
Navas, E.2
Hernaez, I.3
-
17
-
-
84912079352
-
Correlationbasedfrequency warping for voice conversion
-
X. Tian, Z. Wu, S. W. Lee, and E. S. Chng, "Correlationbasedfrequency warping for voice conversion, " in 9th InternationalSymposium on Chinese Spoken Language Processing(ISCSLP), 2014, pp. 211-215.
-
(2014)
9th InternationalSymposium on Chinese Spoken Language Processing(ISCSLP)
, pp. 211-215
-
-
Tian, X.1
Wu, Z.2
Lee, S.W.3
Chng, E.S.4
-
18
-
-
84946020861
-
Sparse representation for frequency warpingbased voice conversion
-
to appear
-
X. Tian, Z. Wu, S. W. Lee, N. Q. Hy, E. S. Chng, and M. Dong, "Sparse representation for frequency warpingbased voice conversion, " in IEEE International Conferenceon Acoustics, Speech, and Signal Processing(ICASSP) to appear, 2015.
-
(2015)
IEEE International Conferenceon Acoustics, Speech, and Signal Processing(ICASSP)
-
-
Tian, X.1
Wu, Z.2
Lee, S.W.3
Hy, N.Q.4
Chng, E.S.5
Dong, M.6
-
19
-
-
84857498745
-
Voice conversionusing dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora
-
E. Godoy, O. Rosec, and T. Chonavel, "Voice conversionusing dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora, " IEEE Transactions onAudio, Speech, and Language Processing, vol. 20, no. 4, pp. 1313-1323, 2012.
-
(2012)
IEEE Transactions OnAudio, Speech, and Language Processing
, vol.20
, Issue.4
, pp. 1313-1323
-
-
Godoy, E.1
Rosec, O.2
Chonavel, T.3
-
20
-
-
0030245128
-
Robust continuous speechrecognition using parallel model combination
-
M. J. Gales and S. J. Young, "Robust continuous speechrecognition using parallel model combination, " IEEETransactions on Speech and Audio Processing, vol. 4, no. 5, pp. 352-359, 1996.
-
(1996)
IEEETransactions on Speech and Audio Processing
, vol.4
, Issue.5
, pp. 352-359
-
-
Gales, M.J.1
Young, S.J.2
-
21
-
-
51449086024
-
Fusion of heterogeneousspeaker recognition systems in the stbu submissionfor the nist speaker recognition evaluation 2006
-
N. Brummer, L. Burget, J. H. Cernocky, O. Glembek, F. Grezl, M. Karafiat, D. A. Van Leeuwen, P. Matejka, P. Schwarz, and A. Strasheim, "Fusion of heterogeneousspeaker recognition systems in the stbu submissionfor the nist speaker recognition evaluation 2006, " IEEETransactions on Audio, Speech, and Language Processing, vol. 15, no. 7, pp. 2072-2084, 2007.
-
(2007)
IEEETransactions on Audio, Speech, and Language Processing
, vol.15
, Issue.7
, pp. 2072-2084
-
-
Brummer, N.1
Burget, L.2
Cernocky, J.H.3
Glembek, O.4
Grezl, F.5
Karafiat, M.6
Van Leeuwen, D.A.7
Matejka, P.8
Schwarz, P.9
Strasheim, A.10
-
22
-
-
85008525798
-
Productof experts for statistical parametric speech synthesis
-
H. Zen, M. J. Gales, Y. Nankaku, and K. Tokuda, "Productof experts for statistical parametric speech synthesis, "IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 3, pp. 794-805, 2012.
-
(2012)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.20
, Issue.3
, pp. 794-805
-
-
Zen, H.1
Gales, M.J.2
Nankaku, Y.3
Tokuda, K.4
-
23
-
-
0026880275
-
Voice transformationusing PSOLA technique
-
H. Valbret, E. Moulines, and J.-P. Tubach, "Voice transformationusing PSOLA technique, " Speech Communication, vol. 11, no. 2, pp. 175-187, 1992.
-
(1992)
Speech Communication
, vol.11
, Issue.2
, pp. 175-187
-
-
Valbret, H.1
Moulines, E.2
Tubach, J.-P.3
-
24
-
-
0001481529
-
Bark and ERB bilinear transforms
-
J. O. Smith and J. S. Abel, "Bark and ERB bilinear transforms, "IEEE Transactions on Speech and Audio Processing, vol. 7, no. 6, pp. 697-708, 1999.
-
(1999)
IEEE Transactions on Speech and Audio Processing
, vol.7
, Issue.6
, pp. 697-708
-
-
Smith, J.O.1
Abel, J.S.2
-
26
-
-
0032673049
-
Restructuring speech representations using a pitchadaptivetime-frequency smoothing and an instantaneousfrequency-based F0 extraction: Possible role of a repetitivestructure in sounds
-
H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using a pitchadaptivetime-frequency smoothing and an instantaneousfrequency-based F0 extraction: Possible role of a repetitivestructure in sounds, " Speech communication, vol. 27, no. 3, pp. 187-207, 1999.
-
(1999)
Speech Communication
, vol.27
, Issue.3
, pp. 187-207
-
-
Kawahara, H.1
Masuda-Katsuse, I.2
De Cheveigné, A.3
-
27
-
-
84878390910
-
Implementationof computationally efficient real-time voice conversion
-
T. Toda, T. Muramatsu, and H. Banno, "Implementationof computationally efficient real-time voice conversion. "in INTERSPEECH, 2012.
-
(2012)
INTERSPEECH
-
-
Toda, T.1
Muramatsu, T.2
Banno, H.3
-
28
-
-
4544284652
-
High quality voice morphing
-
H. Ye and S. Young, "High quality voice morphing, " inIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, 2004, pp. 1-9.
-
(2004)
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
, vol.1
, pp. 1-9
-
-
Ye, H.1
Young, S.2
|