-
1
-
-
4444251929
-
Voice conversion: State of the art and perspectives
-
E. Moulines and Y. Sagisaka Eds. Elsevier
-
E. Moulines and Y. Sagisaka, Eds., "Voice conversion: State of the art and perspectives," Special Iss. Speech Commun., vol.16(2), 1995, Elsevier.
-
(1995)
Special Iss. Speech Commun.
, vol.16
, Issue.2
-
-
-
2
-
-
0023739214
-
Voice conversion through vector quantization
-
M. Abe, S. Nakamura, K. Shikano, and H. Kuwabara, "Voice conversion through vector quantization," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1988, pp. 655-658.
-
(1988)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process.
, pp. 655-658
-
-
Abe, M.1
Nakamura, S.2
Shikano, K.3
Kuwabara, H.4
-
3
-
-
0026394044
-
Speaker adaptation and voice conversion by codebook mapping
-
K. Shikano, S. Nakamura, and M. Abe, "Speaker adaptation and voice conversion by codebook mapping," in Proc. IEEE Int. Symp. Circuits Syst., 1991, vol.1, pp. 594-597.
-
(1991)
Proc. IEEE Int. Symp. Circuits Syst.
, vol.1
, pp. 594-597
-
-
Shikano, K.1
Nakamura, S.2
Abe, M.3
-
4
-
-
33646900967
-
Voice conversion based on piecewise linear conversion rules of formant frequency and spectrum tilt
-
H. Mizuno and M. Abe, "Voice conversion based on piecewise linear conversion rules of formant frequency and spectrum tilt," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1994, vol.1, pp. 469-472.
-
(1994)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process.
, vol.1
, pp. 469-472
-
-
Mizuno, H.1
Abe, M.2
-
5
-
-
0033154052
-
Speaker transformation algorithm using segmental codebooks (STASC)
-
L. M. Arslan, "Speaker transformation algorithm using segmental codebooks (STASC)," Speech Commun., no.28, 1999.
-
(1999)
Speech Commun.
, Issue.28
-
-
Arslan, L.M.1
-
6
-
-
33746653351
-
Robust processing techniques for voice conversion
-
O. Turk and L. M. Arslan, "Robust processing techniques for voice conversion," Comput. Speech Lang., vol.20, no.4, pp. 441-467, 2006.
-
(2006)
Comput. Speech Lang.
, vol.20
, Issue.4
, pp. 441-467
-
-
Turk, O.1
Arslan, L.M.2
-
7
-
-
85010815133
-
Voice transformation using PSOLA technique
-
H. Valbret, E. Moulines, and J. P. Tubach, "Voice transformation using PSOLA technique," Speech Commun., vol.1, pp. 145-148, 1992.
-
(1992)
Speech Commun.
, vol.1
, pp. 145-148
-
-
Valbret, H.1
Moulines, E.2
Tubach, J.P.3
-
9
-
-
4544361661
-
Voice conversion through transformation of spectral and intonation features
-
D. Rentzos, S. Vaseghi, Q. Yan, and C. H. Ho, "Voice conversion through transformation of spectral and intonation features," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2004, vol.1, pp. 21-24.
-
(2004)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process.
, vol.1
, pp. 21-24
-
-
Rentzos, D.1
Vaseghi, S.2
Yan, Q.3
Ho, C.H.4
-
10
-
-
34547507542
-
Frequency warping based on mapping formant parameters
-
Z. W. Shuang, R. Bakis, S. Shechtman, D. Chazan, and Y. Qin, "Frequency warping based on mapping formant parameters," in Proc. Int. Conf. Spoken Lang. Process., 2006.
-
(2006)
Proc. Int. Conf. Spoken Lang. Process.
-
-
Shuang, Z.W.1
Bakis, R.2
Shechtman, S.3
Chazan, D.4
Qin, Y.5
-
11
-
-
85064715894
-
Speech spectrum transformation by speaker interpolation
-
N. Iwahashi and Y. Sagisaka, "Speech spectrum transformation by speaker interpolation," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1994, vol.1, pp. 461-464.
-
(1994)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process.
, vol.1
, pp. 461-464
-
-
Iwahashi, N.1
Sagisaka, Y.2
-
12
-
-
0029254176
-
Transformation of formants for voice conversion using artificial neural networks
-
M. Narendranath, H. A. Murthy, S. Rajendran, and B. Yegnanarayana, "Transformation of formants for voice conversion using artificial neural networks," Speech Commun., vol.16, no.2, pp. 207-216, 1995.
-
(1995)
Speech Commun.
, vol.16
, Issue.2
, pp. 207-216
-
-
Narendranath, M.1
Murthy, H.A.2
Rajendran, S.3
Yegnanarayana, B.4
-
13
-
-
0003447548
-
-
Ph.D. dissertation, École Nationale Supérieure des Télécommunications, Paris, France
-
Y. Stylianou, "Harmonic plus noise models for speech, combined with statistical methods, for speech and speaker modification," Ph.D. dissertation, École Nationale Supérieure des Télécommunications, Paris, France, 1996.
-
(1996)
Harmonic Plus Noise Models for Speech, Combined with Statistical Methods, for Speech and Speaker Modification
-
-
Stylianou, Y.1
-
14
-
-
0032026483
-
Continuous probabilistic transform for voice conversion
-
Y. Stylianou, O. Cappé, and E. Moulines, "Continuous probabilistic transform for voice conversion," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1998, vol.6, pp. 131-142.
-
(1998)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process.
, vol.6
, pp. 131-142
-
-
Stylianou, Y.1
Cappé, O.2
Moulines, E.3
-
15
-
-
4444285698
-
-
Ph.D. dissertation, OGI School of Sci. and Eng., Beaverton, OR
-
A. Kain, "High resolution voice transformation," Ph.D. dissertation, OGI School of Sci. and Eng., Beaverton, OR, 2001.
-
(2001)
High Resolution Voice Transformation
-
-
Kain, A.1
-
16
-
-
33846972308
-
Residual prediction
-
D. Sündermann, H. Höge, A. Bonafonte, and H. Duxans, "Residual prediction," in Proc. IEEE Symp. Signal Process. Inf. Technol., 2005, pp. 512-516.
-
(2005)
Proc. IEEE Symp. Signal Process. Inf. Technol.
, pp. 512-516
-
-
Sündermann, D.1
Höge, H.2
Bonafonte, A.3
Duxans, H.4
-
17
-
-
34047254509
-
Quality-enhanced voice morphing using maximum likelihood transformations
-
Jul.
-
H. Ye and S. Young, "Quality-enhanced voice morphing using maximum likelihood transformations," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.4, pp. 1301-1312, Jul. 2006.
-
(2006)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.14
, Issue.4
, pp. 1301-1312
-
-
Ye, H.1
Young, S.2
-
18
-
-
44949143155
-
Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation
-
Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation," in Proc. Interspeech, 2006.
-
(2006)
Proc. Interspeech
-
-
Ohtani, Y.1
Toda, T.2
Saruwatari, H.3
Shikano, K.4
-
19
-
-
84867216755
-
The linear transformation of LF glottal waveforms for voice conversion
-
A. del Pozo and S. Young, "The linear transformation of LF glottal waveforms for voice conversion," in Proc. Interspeech, 2008, pp. 1457-1460.
-
(2008)
Proc. Interspeech
, pp. 1457-1460
-
-
Del Pozo, A.1
Young, S.2
-
20
-
-
0034842552
-
Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum
-
T. Toda, H. Saruwatari, and K. Shikano, "Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2001, pp. 841-844.
-
(2001)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process.
, pp. 841-844
-
-
Toda, T.1
Saruwatari, H.2
Shikano, K.3
-
21
-
-
33646779506
-
Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter
-
T. Toda, A. W. Black, and K. Tokuda, "Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2005, vol.1, pp. 9-12.
-
(2005)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process.
, vol.1
, pp. 9-12
-
-
Toda, T.1
Black, A.W.2
Tokuda, K.3
-
22
-
-
57749193836
-
Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
-
Nov.
-
T. Toda, A. W. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory," IEEE Trans. Audio, Speech, Lang. Process., vol.15, no.8, pp. 2222-2235, Nov. 2007.
-
(2007)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.15
, Issue.8
, pp. 2222-2235
-
-
Toda, T.1
Black, A.W.2
Tokuda, K.3
-
23
-
-
0029725605
-
Speech synthesis using HMMs with dynamic features
-
T. Masuko, K. Tokuda, T. Kobayashi, and S. Imai, "Speech synthesis using HMMs with dynamic features," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1996, pp. 389-392.
-
(1996)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process.
, pp. 389-392
-
-
Masuko, T.1
Tokuda, K.2
Kobayashi, T.3
Imai, S.4
-
24
-
-
0030696416
-
Voice characteristics conversion for HMM-based speech synthesis system
-
T. Masuko, K. Tokuda, T. Kobayashi, and S. Imai, "Voice characteristics conversion for HMM-based speech synthesis system," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1997, pp. 1611-1614.
-
(1997)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process.
, pp. 1611-1614
-
-
Masuko, T.1
Tokuda, K.2
Kobayashi, T.3
Imai, S.4
-
25
-
-
67650854725
-
Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm
-
Jan.
-
J. Yamagishi, T. Kobayashi, Y. Nakano, K. Ogata, and J. Isogai, "Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm," IEEE Trans. Audio, Speech, Lang. Process., vol.17, no.1, pp. 66-83, Jan. 2009.
-
(2009)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.17
, Issue.1
, pp. 66-83
-
-
Yamagishi, J.1
Kobayashi, T.2
Nakano, Y.3
Ogata, K.4
Isogai, J.5
-
26
-
-
67651002140
-
Statistical parametric speech synthesis
-
H. Zen, K. Tokuda, and A. W. Black, "Statistical parametric speech synthesis," Speech Commun., vol.51, no.11, pp. 1039-1064, 2009.
-
(2009)
Speech Commun.
, vol.51
, Issue.11
, pp. 1039-1064
-
-
Zen, H.1
Tokuda, K.2
Black, A.W.3
-
28
-
-
77953697940
-
-
Ph.D. dissertation, Univ. Politècnica de Catalunya, Barcelona, Spain
-
D. Erro, "Intra-lingual and cross-lingual voice conversion using harmonic plus stochastic models," Ph.D. dissertation, Univ. Politècnica de Catalunya, Barcelona, Spain, 2008.
-
(2008)
Intra-lingual and Cross-lingual Voice Conversion Using Harmonic Plus Stochastic Models
-
-
Erro, D.1
-
29
-
-
85068458327
-
Weighted frequency warping for voice conversion
-
D. Erro and A. Moreno, "Weighted frequency warping for voice conversion," in Proc. Interspeech, 2007, pp. 1965-1968.
-
(2007)
Proc. Interspeech
, pp. 1965-1968
-
-
Erro, D.1
Moreno, A.2
-
31
-
-
56149126421
-
Voice conversion of non-aligned data using unit selection
-
H. Duxans, D. Erro, J. Pérez, F. Diego, A. Bonafonte, and A. Moreno, "Voice conversion of non-aligned data using unit selection," in Proc. TC-STAR Workshop Speech to Speech Transl., 2006.
-
(2006)
Proc. TC-STAR Workshop Speech to Speech Transl.
-
-
Duxans, H.1
Erro, D.2
Pérez, J.3
Diego, F.4
Bonafonte, A.5
Moreno, A.6
-
32
-
-
0026106454
-
Discrete all-pole modeling
-
Feb.
-
A. El-Jaroudi and J. Makhoul, "Discrete all-pole modeling," IEEE Trans. Signal Process., vol.39, no.2, pp. 411-423, Feb. 1991.
-
(1991)
IEEE Trans. Signal Process.
, vol.39
, Issue.2
, pp. 411-423
-
-
El-Jaroudi, A.1
Makhoul, J.2
-
33
-
-
79961212205
-
TC-STAR: Specifications of language resources and evaluation for speech synthesis
-
A. Bonafonte, H. Höge, I. Kiss, A. Moreno, U. Ziegenhain, H. van den Heuvel, H. U. Hain, X. S. Wang, and M. N. Garcia, "TC-STAR: Specifications of language resources and evaluation for speech synthesis," in Proc. Int. Conf. Lang. Resources Eval., 2006.
-
(2006)
Proc. Int. Conf. Lang. Resources Eval.
-
-
Bonafonte, A.1
Höge, H.2
Kiss, I.3
Moreno, A.4
Ziegenhain, U.5
Van Den Heuvel, H.6
Hain, H.U.7
Wang, X.S.8
Garcia, M.N.9
-
34
-
-
77953708737
-
The UPC TTS system description for the 2007 Blizzard Challenge
-
A. Bonafonte, J. Adell, P. D. Agüero, D. Erro, I. Esquerra, A. Moreno, J. Pérez, and T. Polyakova, "The UPC TTS system description for the 2007 Blizzard Challenge," in Proc. 6th ISCA Workshop Speech Synth., 2007.
-
(2007)
Proc. 6th ISCA Workshop Speech Synth.
-
-
Bonafonte, A.1
Adell, J.2
Agüero, P.D.3
Erro, D.4
Esquerra, I.5
Moreno, A.6
Pérez, J.7
Polyakova, T.8