-
1
-
-
0023739214
-
Voice conversion through vector quantization
-
New York, Apr
-
M. Abe, S. Nakamura, K. Shikano, and H. Kuwabara, "Voice conversion through vector quantization," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, New York, Apr. 1988, pp. 565-568.
-
(1988)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing
, pp. 565-568
-
-
Abe, M.1
Nakamura, S.2
Shikano, K.3
Kuwabara, H.4
-
2
-
-
77953707533
-
Spectral mapping using artificial neural networks for voice conversion
-
Jul
-
S. Desai, A. Black, B. Yegnanarayana, and K. Prahallad, "Spectral mapping using artificial neural networks for voice conversion," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 5, pp. 954-964, Jul. 2010.
-
(2010)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.18
, Issue.5
, pp. 954-964
-
-
Desai, S.1
Black, A.2
Yegnanarayana, B.3
Prahallad, K.4
-
3
-
-
85010815133
-
Voice transformation using PSOLA technique
-
Mar
-
H. Valbret, E. Moulines, and J. Tubach, "Voice transformation using PSOLA technique," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, Mar. 1992, vol. 1, pp. 145-148.
-
(1992)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing
, vol.1
, pp. 145-148
-
-
Valbret, H.1
Moulines, E.2
Tubach, J.3
-
4
-
-
0033154052
-
Speaker transformation algorithm using segmental codebooks (STASC)
-
Jun
-
L. Arslan, "Speaker transformation algorithm using segmental codebooks (STASC)," Speech Commun., vol. 28, no. 3, pp. 211-226, Jun. 1999.
-
(1999)
Speech Commun.
, vol.28
, Issue.3
, pp. 211-226
-
-
Arslan, L.1
-
5
-
-
79959836789
-
Maximum a posteriori voice conversion using sequential Monte Carlo methods
-
Sep
-
E. Helander, H. Silen, J. Miguez, and M. Gabbouj, "Maximum a posteriori voice conversion using sequential Monte Carlo methods," in Proc. Interspeech, Sep. 2010, pp. 1716-1719.
-
(2010)
Proc. Interspeech
, pp. 1716-1719
-
-
Helander, E.1
Silen, H.2
Miguez, J.3
Gabbouj, M.4
-
6
-
-
0032026483
-
Continuous probabilistic transform for voice conversion
-
PII S1063667698017386
-
Y. Stylianou, O. Cappe, and E. Moulines, "Continuous probabilistic transform for voice conversion," IEEE Trans. Speech Audio Process., vol. 6, no. 2, pp. 131-142, Mar. 1998. (Pubitemid 128720639)
-
(1998)
IEEE Transactions on Speech and Audio Processing
, vol.6
, Issue.2
, pp. 131-142
-
-
Stylianou, Y.1
Cappe, O.2
Moulines, E.3
-
7
-
-
0031623661
-
Spectral voice conversion for text-tospeech synthesis
-
Seattle, WA, May
-
A. Kain and M. W. Macon, "Spectral voice conversion for text-tospeech synthesis," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Seattle, WA, May 1998, vol. 1, pp. 285-288.
-
(1998)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process.
, vol.1
, pp. 285-288
-
-
Kain, A.1
MacOn, M.W.2
-
8
-
-
77953712499
-
Voice conversion using partial least squares regression
-
Jul
-
E. Helander, T.Virtanen, J. Nurminen, and M. Gabbouj, "Voice conversion using partial least squares regression," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 5, pp. 912-921, Jul. 2010.
-
(2010)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.18
, Issue.5
, pp. 912-921
-
-
Helander, E.1
Virtanen, T.2
Nurminen, J.3
Gabbouj, M.4
-
9
-
-
57749193836
-
Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
-
Nov
-
T. Toda, A. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 8, pp. 2222-2235, Nov. 2007.
-
(2007)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.15
, Issue.8
, pp. 2222-2235
-
-
Toda, T.1
Black, A.2
Tokuda, K.3
-
10
-
-
84966348891
-
An HMM-based speech synthesis system applied to English
-
Sep
-
K. Tokuda, H. Zen, and A. W. Black, "An HMM-based speech synthesis system applied to English," in Proc. IEEE Workshop Speech Synth., Sep. 2002, pp. 227-230.
-
(2002)
Proc. IEEE Workshop Speech Synth.
, pp. 227-230
-
-
Tokuda, K.1
Zen, H.2
Black, A.W.3
-
11
-
-
34547496175
-
One-to-many and many-to-one voice conversion based on eigenvoices
-
DOI 10.1109/ICASSP.2007.367303, 4218334, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07
-
T. Toda, Y. Ohtani, and K. Shikano, "One-to-many and many-to-one voice conversion based on eigenvoices," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Apr. 2007, vol. 4, pp. IV-1249-IV-1252. (Pubitemid 47178603)
-
(2007)
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
, vol.4
-
-
Toda, T.1
Ohtani, Y.2
Shikano, K.3
-
12
-
-
84905560807
-
Voice conversion with smoothedGMM and MAP adaptation
-
Y. Chen, M. Chu, E. Chang, J. Liu, and R. Liu, "Voice conversion with smoothedGMM and MAP adaptation," in Proc. Eurospeech, 2003, pp. 2413-2416.
-
(2003)
Proc. Eurospeech
, pp. 2413-2416
-
-
Chen, Y.1
Chu, M.2
Chang, E.3
Liu, J.4
Liu, R.5
-
13
-
-
77953727123
-
Voice conversion based on weighted frequency warping
-
Jul
-
D. Erro, A. Moreno, and A. Bonafonte, "Voice conversion based on weighted frequency warping," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 5, pp. 922-931, Jul. 2010.
-
(2010)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.18
, Issue.5
, pp. 922-931
-
-
Erro, D.1
Moreno, A.2
Bonafonte, A.3
-
14
-
-
33745171182
-
-
New York: Wiley, , ch. Introduction to Scientific Data Mining: Direct Kernel Methods & Applications
-
M. J. Embrechts and B. Szymanski, Computationally Intelligent Hybrid Systems. New York: Wiley, 2005, ch. Introduction to Scientific Data Mining: Direct Kernel Methods & Applications, pp. 317-365.
-
(2005)
Computationally Intelligent Hybrid Systems
, pp. 317-365
-
-
Embrechts, M.J.1
Szymanski, B.2
-
15
-
-
0032673049
-
Restructuring speech representations using a pitch-adaptive time-frequency smoothing and a instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
-
Apr
-
H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and a instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Commun., vol. 27, no. 3-4, pp. 187-207, Apr. 1999.
-
(1999)
Speech Commun.
, vol.27
, Issue.3-4
, pp. 187-207
-
-
Kawahara, H.1
Masuda-Katsuse, I.2
De Cheveigné, A.3
-
16
-
-
77953708096
-
Thousands of voices for HMM-based speech synthesis-Analysis and application of TTS systems built on various ASR corpora
-
Jul
-
J. Yamagishi, B. Usabaev, S. King, O.Watts, J. Dines, J. Tian, Y. Guan, R. Hu, K. Oura, Y.-J. Wu, K. Tokuda, R. Karhila, and M. Kurimo, "Thousands of voices for HMM-based speech synthesis-Analysis and application of TTS systems built on various ASR corpora," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 5, pp. 984-1004, Jul. 2010.
-
(2010)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.18
, Issue.5
, pp. 984-1004
-
-
Yamagishi, J.1
Usabaev, B.2
King, S.3
Watts, O.4
Dines, J.5
Tian, J.6
Guan, Y.7
Hu, R.8
Oura, K.9
Wu, Y.-J.10
Tokuda, K.11
Karhila, R.12
Kurimo, M.13
-
18
-
-
18144401294
-
A novel kernel method for clustering
-
May
-
F. Camastra and A.Verri, "A novel kernel method for clustering," IEEE Trans. Pattern Anal. Mach. Intell., vol. 27, no. 5, pp. 801-804, May 2005.
-
(2005)
IEEE Trans. Pattern Anal. Mach. Intell.
, vol.27
, Issue.5
, pp. 801-804
-
-
Camastra, F.1
Verri, A.2
-
19
-
-
0347243182
-
Nonlinear Component Analysis as a Kernel Eigenvalue Problem
-
B. Schölkopf, A. J. Smola, and K.-R. Müller, "Nonlinear component analysis as a kernel eigenvalue problem," Neural Comput., vol. 10, no. 5, pp. 1299-1319, 1998. (Pubitemid 128463674)
-
(1998)
Neural Computation
, vol.10
, Issue.5
, pp. 1299-1319
-
-
Scholkopf, B.1
Smola, A.2
Muller, K.-R.3
-
20
-
-
2442514721
-
-
ser. NATO Science Series. Series III: Computer and Systems Sciences. Amsterdam, The Netherlands: IOS Press , ch. An Optimization Perspective on Kernel Partial Least Squares Regression
-
K. P. Bennett and M. J. Embrechts, Advances in Learning Theory: Methods, Models and Applications, ser. NATO Science Series. Series III: Computer and Systems Sciences. Amsterdam, The Netherlands: IOS Press, 2003, vol. 190, ch. An Optimization Perspective on Kernel Partial Least Squares Regression, pp. 227-250.
-
(2003)
Advances in Learning Theory: Methods, Models and Applications
, vol.190
, pp. 227-250
-
-
Bennett, K.P.1
Embrechts, M.J.2
-
21
-
-
0027530250
-
SIMPLS: An alternative approach to partial least squares regression
-
Mar
-
S. de Jong, "SIMPLS: An alternative approach to partial least squares regression," Chemometrics Intell. Lab. Syst., vol. 18, no. 3, pp. 251-263, Mar. 1993.
-
(1993)
Chemometrics Intell. Lab. Syst.
, vol.18
, Issue.3
, pp. 251-263
-
-
De Jong, S.1
-
22
-
-
0038259120
-
Kernel partial least squares regression in reproducing kernel Hilbert space
-
Dec
-
R. Rosipal and L. Trejo, "Kernel partial least squares regression in reproducing kernel Hilbert space," J. Mach. Learn. Res., vol. 2, pp. 97-123, Dec. 2001.
-
(2001)
J. Mach. Learn. Res.
, vol.2
, pp. 97-123
-
-
Rosipal, R.1
Trejo, L.2
-
23
-
-
33846405723
-
Details of the nitech HMM-based speech synthesis system for the blizzard challenge 2005
-
DOI 10.1093/ietisy/e90-1.1.325
-
H. Zen, T. Toda, M. Nakamura, and K. Tokuda, "Details of the Nitech HMM-based speech synthesis system for the Blizzard challenge 2005," IEICE Trans. Inf. Syst., vol. E90-D, no. 1, pp. 325-333, Jan. 2007. (Pubitemid 46145336)
-
(2007)
IEICE Transactions on Information and Systems
, vol.E90-D
, Issue.1
, pp. 325-333
-
-
Zen, H.1
Toda, T.2
Nakamura, M.3
Tokuda, K.4
-
24
-
-
44949143155
-
Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation
-
Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation," in Proc. Interspeech, 2006, pp. 2266-2269.
-
(2006)
Proc. Interspeech
, pp. 2266-2269
-
-
Ohtani, Y.1
Toda, T.2
Saruwatari, H.3
Shikano, K.4
-
25
-
-
0028996842
-
CELP coding based on mel-cepstral analysis
-
May
-
K. Koishida, K. Tokuda, T. Kobayashi, and S. Imai, "CELP coding based on mel-cepstral analysis," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., May 1995, vol. 1, pp. 33-36.
-
(1995)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process.
, vol.1
, pp. 33-36
-
-
Koishida, K.1
Tokuda, K.2
Kobayashi, T.3
Imai, S.4
-
26
-
-
33746653351
-
Robust processing techniques for voice conversion
-
DOI 10.1016/j.csl.2005.06.001, PII S088523080500029X
-
O. Turk and L. Arslan, "Robust processing techniques for voice conversion," Comput. Speech Lang., vol. 4, no. 20, pp. 441-467, Oct. 2006. (Pubitemid 44150541)
-
(2006)
Computer Speech and Language
, vol.20
, Issue.4
, pp. 441-467
-
-
Turk, O.1
Arslan, L.M.2
-
27
-
-
85009097254
-
Mixed excitation for HMM-based speech synthesis
-
T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Mixed excitation for HMM-based speech synthesis," in Proc. Eurospeech, 2001, pp. 2263-2266.
-
(2001)
Proc. Eurospeech
, pp. 2263-2266
-
-
Yoshimura, T.1
Tokuda, K.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
28
-
-
70349218136
-
Voice conversion based on simultaneous modeling of spectrum and F0
-
May
-
K. Yutani, Y. Uto, Y. Nankaku, A. Lee, and K. Tokuda, "Voice conversion based on simultaneous modeling of spectrum and F0," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., May 2009, pp. 3897-3900.
-
(2009)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process.
, pp. 3897-3900
-
-
Yutani, K.1
Uto, Y.2
Nankaku, Y.3
Lee, A.4
Tokuda, K.5
-
29
-
-
34547520011
-
A novel method for prosody prediction in voice conversion
-
DOI 10.1109/ICASSP.2007.366961, 4218149, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07
-
E. Helander and J. Nurminen, "A novel method for prosody prediction in voice conversion," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Apr. 2007, vol. 4, pp. IV-509-IV-512. (Pubitemid 47178423)
-
(2007)
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
, vol.4
-
-
Helander, E.E.1
Nurminen, J.2
-
30
-
-
5444243681
-
Speaker-specific pitch contour modelling and modification
-
Seattle, WA, May
-
D. Chapell and J. Hansen, "Speaker-specific pitch contour modelling and modification," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Seattle, WA, May 1998, pp. 885-888.
-
(1998)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process.
, pp. 885-888
-
-
Chapell, D.1
Hansen, J.2
-
32
-
-
0029209272
-
Robust text-independent speaker identification using Gaussian mixture speaker models
-
Jan
-
D. Reynolds and R. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. Speech Audio Process., vol. 3, no. 1, pp. 72-83, Jan. 1995.
-
(1995)
IEEE Trans. Speech Audio Process.
, vol.3
, Issue.1
, pp. 72-83
-
-
Reynolds, D.1
Rose, R.2
-
33
-
-
77956826012
-
Automatic speaker recognition as a measurement of voice imitation and conversion
-
M. Farrus, M. Wagner, D. Erro, and J. Hernando, "Automatic speaker recognition as a measurement of voice imitation and conversion," Int. J. Speech Lang. Law, vol. 17, no. 1, 2010.
-
(2010)
Int. J. Speech Lang. Law
, vol.17
, Issue.1
-
-
Farrus, M.1
Wagner, M.2
Erro, D.3
Hernando, J.4
|