-
1
-
-
0023739214
-
Voice conversion through vector quantization
-
M. Abe, S. Nakamura, K. Shikano, and H. Kuwabara, "Voice conversion through vector quantization," Proc. ICASSP, pp. 655-658, 1998.
-
(1998)
Proc. ICASSP
, pp. 655-658
-
-
Abe, M.1
Nakamura, S.2
Shikano, K.3
Kuwabara, H.4
-
2
-
-
85009266993
-
Transformation of spectral envelope for voice conversion based on radial basis function networks
-
T. Watanabe, T. Murakami, M. Namba, T. Hoya, and Y. Ishida, "Transformation of spectral envelope for voice conversion based on radial basis function networks," Proc. ICSLP, pp. 285-288, 2002.
-
(2002)
Proc. ICSLP
, pp. 285-288
-
-
Watanabe, T.1
Murakami, T.2
Namba, M.3
Hoya, T.4
Ishida, Y.5
-
3
-
-
85135141647
-
Hidden Markov model based voice conversion using dynamic characteristics of speaker
-
E. K. Kim, S. Lee, and Y. H. Oh, "Hidden Markov model based voice conversion using dynamic characteristics of speaker," Proc. Eurospeech, pp. 2519-2522, 1997.
-
(1997)
Proc. Eurospeech
, pp. 2519-2522
-
-
Kim, E.K.1
Lee, S.2
Oh, Y.H.3
-
4
-
-
0032026483
-
Continuous probabilistic transform for voice conversion
-
Y. Stylianou, O. Cappe, and E. Moulines, "Continuous probabilistic transform for voice conversion," Proc. IEEE Trans. Speech Audio, vol. 6, pp. 131-142, 1998.
-
(1998)
Proc. IEEE Trans. Speech Audio
, vol.6
, pp. 131-142
-
-
Stylianou, Y.1
Cappe, O.2
Moulines, E.3
-
5
-
-
0031623661
-
Spectral voice conversion for text-to-speech synthesis
-
A. Kain and M. W. Macon, "Spectral voice conversion for text-to-speech synthesis," Proc. ICASSP, pp. 285-288, 1998.
-
(1998)
Proc. ICASSP
, pp. 285-288
-
-
Kain, A.1
Macon, M.W.2
-
6
-
-
0034842552
-
Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum
-
T. Toda, H. Saruwatari, and K. Shikano, "Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum," Proc. ICASSP, pp. 841-844, 2001.
-
(2001)
Proc. ICASSP
, pp. 841-844
-
-
Toda, T.1
Saruwatari, H.2
Shikano, K.3
-
7
-
-
84905560807
-
Voice conversion with smoothed GMM and MAP adaptation
-
Y. Chen, M. Chu, E. Chang, J. Liu, and R. Liu, "Voice conversion with smoothed GMM and MAP adaptation," Proc. Eurospeech, pp. 2413-2416, 2003.
-
(2003)
Proc. Eurospeech
, pp. 2413-2416
-
-
Chen, Y.1
Chu, M.2
Chang, E.3
Liu, J.4
Liu, R.5
-
8
-
-
0141702280
-
Using phone and diphone based acoustic models for voice conversion: A step towards creating voice fonts
-
A. Kumar and A. Verma, "Using phone and diphone based acoustic models for voice conversion: A step towards creating voice fonts," Proc. ICASSP, pp. 720-723, 2003.
-
(2003)
Proc. ICASSP
, pp. 720-723
-
-
Kumar, A.1
Verma, A.2
-
9
-
-
84994241109
-
Including dynamic and phonetic information in voice conversion systems
-
H. Duxans, A. Bonafonte, A. Kain, and J. van Santen, "Including dynamic and phonetic information in voice conversion systems," Proc. ICSLP, pp. 1193-1196, 2004.
-
(2004)
Proc. ICSLP
, pp. 1193-1196
-
-
Duxans, H.1
Bonafonte, A.2
Kain, A.3
van Santen, J.4
-
10
-
-
34047254509
-
Quality-enhanced voice morphing using maximum likelihood transformations
-
H. Ye and S. Young, "Quality-enhanced voice morphing using maximum likelihood transformations," IEEE Trans. on Audio, Speech and lang. Proc., pp. 1301-1312, 2006.
-
(2006)
IEEE Trans. on Audio, Speech and lang. Proc
, pp. 1301-1312
-
-
Ye, H.1
Young, S.2
-
11
-
-
51549090536
-
High quality voice conversion through combining modified GMM and formant mapping for Mandarin
-
K. Liu, J. Zhang, and Y. Yan, "High quality voice conversion through combining modified GMM and formant mapping for Mandarin," Proc. ICDT, p. 10, 2007.
-
(2007)
Proc. ICDT
, pp. 10
-
-
Liu, K.1
Zhang, J.2
Yan, Y.3
-
12
-
-
85068458327
-
Weighted frequency warping for voice conversion
-
D. Erro and A. Moreno, "Weighted frequency warping for voice conversion," Proc. Interspeech, pp. 1965-1968, 2007.
-
(2007)
Proc. Interspeech
, pp. 1965-1968
-
-
Erro, D.1
Moreno, A.2
-
13
-
-
51549106452
-
Control of spectral dynamics using temporal decomposition in voice conversion and concatenative speech synthesis
-
B. P. Nguyen and M. Akagi, "Control of spectral dynamics using temporal decomposition in voice conversion and concatenative speech synthesis," Proc. NCSP, pp. 279-282, 2008.
-
(2008)
Proc. NCSP
, pp. 279-282
-
-
Nguyen, B.P.1
Akagi, M.2
-
14
-
-
0028997012
-
Spectral dynamics is more important than spectral distortion
-
H. P. Knagenhjelm and W. B. Kleijn, "Spectral dynamics is more important than spectral distortion," Proc. ICASSP, pp. 732-735, 1995.
-
(1995)
Proc. ICASSP
, pp. 732-735
-
-
Knagenhjelm, H.P.1
Kleijn, W.B.2
-
15
-
-
0020602364
-
Efficient coding of LPC parameters by temporal decomposition
-
B. S. Atal, "Efficient coding of LPC parameters by temporal decomposition," Proc. ICASSP, pp. 81-84, 1983.
-
(1983)
Proc. ICASSP
, pp. 81-84
-
-
Atal, B.S.1
-
16
-
-
0038719980
-
Modified restricted temporal decomposition and its application to low bit rate speech coding
-
P. C. Nguyen, T. Ochi, and M. Akagi, "Modified restricted temporal decomposition and its application to low bit rate speech coding," IEICE Transactions on Information and Systems, vol. E86-D, pp. 397-405, 2003.
-
(2003)
IEICE Transactions on Information and Systems
, vol.E86-D
, pp. 397-405
-
-
Nguyen, P.C.1
Ochi, T.2
Akagi, M.3
-
17
-
-
0032673049
-
Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous frequency-based F0 extraction: Possible role of a repetitive structure in sounds
-
H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Journal of Speech Communication, vol. 27, pp. 187-207, 1999.
-
(1999)
Journal of Speech Communication
, vol.27
, pp. 187-207
-
-
Kawahara, H.1
Masuda-Katsuse, I.2
de Cheveigné, A.3
-
18
-
-
0002629270
-
Maximum likelihood from incomplete data via the EM algorithm
-
A. Dempster, N. Laird, and D. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," Journal of the Royal Statistical Society Series B, vol. 39, pp. 1-38, 1977.
-
(1977)
Journal of the Royal Statistical Society Series B
, vol.39
, pp. 1-38
-
-
Dempster, A.1
Laird, N.2
Rubin, D.3
-
19
-
-
0141703296
-
Temporal decomposition: A promising approach to VQ-based speaker identification
-
P. C. Nguyen, M. Akagi, and T. B. Ho, "Temporal decomposition: A promising approach to VQ-based speaker identification," Proc. ICASSP, pp. 184-187, 2003.
-
(2003)
Proc. ICASSP
, pp. 184-187
-
-
Nguyen, P.C.1
Akagi, M.2
Ho, T.B.3
-
20
-
-
51549087731
-
A study on voice conversion method for synthesizing stimuli to perform gender perception experiments of speech
-
T. Shibata and M. Akagi, "A study on voice conversion method for synthesizing stimuli to perform gender perception experiments of speech," Proc. NCSP, pp. 180-183, 2008.
-
(2008)
Proc. NCSP
, pp. 180-183
-
-
Shibata, T.1
Akagi, M.2
-
22
-
-
51549089733
-
Voice conversion Matlab toolbox,
-
Technical Report, Siemens Corporate Technology, Munich, Germany
-
D. Suendermann, "Voice conversion Matlab toolbox," Technical Report, Siemens Corporate Technology, Munich, Germany, 2007.
-
(2007)
-
-
Suendermann, D.1
|