-
1
-
-
84876497245
-
GMM-based voice conversion applied to emotional speech synthesis
-
H. Kawanami, Y. Iwami, T. Toda, H. Saruwatari, and K. Shikano, "GMM-based voice conversion applied to emotional speech synthesis," in Proc. INTERSPEECH, 2003, pp. 2401-2404.
-
(2003)
Proc. INTERSPEECH
, pp. 2401-2404
-
-
Kawanami, H.1
Iwami, Y.2
Toda, T.3
Saruwatari, H.4
Shikano, K.5
-
2
-
-
84865747520
-
Intonation conversion from neutral to expressive speech
-
C. Veaux and X. Robet, "Intonation conversion from neutral to expressive speech," in Proc. INTERSPEECH, 2011, pp. 2765-2768.
-
(2011)
Proc. INTERSPEECH
, pp. 2765-2768
-
-
Veaux, C.1
Robet, X.2
-
3
-
-
80052698826
-
Speakingaid systems using gmm-based voice conversion for electrolaryngeal speech
-
K. Nakamura, T. Toda, H. Saruwatari, and K. Shikano, "Speakingaid systems using gmm-based voice conversion for electrolaryngeal speech," Speech Communication, vol. 54, no. 1, pp. 134-146, 2012.
-
(2012)
Speech Communication
, vol.54
, Issue.1
, pp. 134-146
-
-
Nakamura, K.1
Toda, T.2
Saruwatari, H.3
Shikano, K.4
-
4
-
-
77956795483
-
Esophageal speech enhancement based on statistical voice conversion with Gaussian mixture models
-
H. Doi, K. Nakamura, T. Toda, H. Saruwatari, and K. Shikano, "Esophageal speech enhancement based on statistical voice conversion with Gaussian mixture models," IEICE Trans. Information and Systems, vol. E93-D, no. 9, pp. 2472-2482, 2010.
-
(2010)
IEICE Trans. Information and Systems
, vol.E93-D
, Issue.9
, pp. 2472-2482
-
-
Doi, H.1
Nakamura, K.2
Toda, T.3
Saruwatari, H.4
Shikano, K.5
-
5
-
-
0023739214
-
Voice conversion through vector quantization
-
M. Abe, S. Nakamura, K. Shikano, and H. Kuwabara, "Voice conversion through vector quantization," in Proc. ICASSP, 1988, pp. 655-658.
-
(1988)
Proc. ICASSP
, pp. 655-658
-
-
Abe, M.1
Nakamura, S.2
Shikano, K.3
Kuwabara, H.4
-
6
-
-
0026880275
-
Voice transformation using PSOLA technique
-
H. Valbret, E. Moulines, and J. P. Tubach, "Voice transformation using PSOLA technique," Speech Communication, vol. 11, no. 2-3, pp. 175-187, 1992.
-
(1992)
Speech Communication
, vol.11
, Issue.2-3
, pp. 175-187
-
-
Valbret, H.1
Moulines, E.2
Tubach, J. P.3
-
7
-
-
0032026483
-
Continuous probabilistic transform for voice conversion
-
Y. Stylianou, O. Cappe, and E. Moulines, "Continuous probabilistic transform for voice conversion," IEEE Trans. Speech and Audio Processing, vol. 6, no. 2, pp. 131-142, 1998.
-
(1998)
IEEE Trans. Speech and Audio Processing
, vol.6
, Issue.2
, pp. 131-142
-
-
Stylianou, Y.1
Cappe, O.2
Moulines, E.3
-
8
-
-
57749193836
-
Voice conversion based on maximum likelihood estimation of spectral parameter trajectory
-
T. Toda, A. Black, and K. Tokuda, "Voice conversion based on maximum likelihood estimation of spectral parameter trajectory," IEEE Trans. Audio, Speech and Language Processing, vol. 15, no. 8, pp. 2222-2235, 2007.
-
(2007)
IEEE Trans. Audio, Speech and Language Processing
, vol.15
, Issue.8
, pp. 2222-2235
-
-
Toda, T.1
Black, A.2
Tokuda, K.3
-
9
-
-
77953712499
-
Voice conversion using partial least squares regression
-
E. Helander, T. Virtanen, J. Nurminen, and M. Gabbouj, "Voice conversion using partial least squares regression," IEEE Trans. Audio, Speech and Language Processing, vol. 18, no. 5, pp. 912-921, 2010.
-
(2010)
IEEE Trans. Audio, Speech and Language Processing
, vol.18
, Issue.5
, pp. 912-921
-
-
Helander, E.1
Virtanen, T.2
Nurminen, J.3
Gabbouj, M.4
-
10
-
-
44949210554
-
Map-based adaptation for speech conversion using adaptation data selection and non-parallel training
-
C. H. Lee and C. H. Wu, "Map-based adaptation for speech conversion using adaptation data selection and non-parallel training," in Proc. INTERSPEECH, 2006, pp. 2254-2257.
-
(2006)
Proc. INTERSPEECH
, pp. 2254-2257
-
-
Lee, C. H.1
Wu, C. H.2
-
11
-
-
34547512822
-
Eigenvoice conversion based on gaussian mixture model
-
T. Toda, Y. Ohtani, and K. Shikano, "Eigenvoice conversion based on gaussian mixture model," in Proc. INTERSPEECH, 2006, pp. 2446-2449.
-
(2006)
Proc. INTERSPEECH
, pp. 2446-2449
-
-
Toda, T.1
Ohtani, Y.2
Shikano, K.3
-
12
-
-
84865798483
-
One-tomany voice conversion based on tensor representation of speaker space
-
D. Saito, K. Yamamoto, N. Minematsu, and K. Hirose, "One-tomany voice conversion based on tensor representation of speaker space," in Proc. INTERSPEECH, 2011, pp. 653-656.
-
(2011)
Proc. INTERSPEECH
, pp. 653-656
-
-
Saito, D.1
Yamamoto, K.2
Minematsu, N.3
Hirose, K.4
-
14
-
-
50249152311
-
Monaural sound source separation by non-negative matrix factorization with temporal continuity and sparseness criteria
-
T. Virtanen, "Monaural sound source separation by non-negative matrix factorization with temporal continuity and sparseness criteria," IEEE Trans. Audio, Speech and Language Processing, vol. 15, no. 3, pp. 1066-1074, 2007.
-
(2007)
IEEE Trans. Audio, Speech and Language Processing
, vol.15
, Issue.3
, pp. 1066-1074
-
-
Virtanen, T.1
-
15
-
-
44949110218
-
Single-channel speech separation using sparse non-negative matrix factorization
-
M. N. Schmidt and R. K. Olsson, "Single-channel speech separation using sparse non-negative matrix factorization," in Proc. INTERSPEECH, 2006, pp. 2614-2617.
-
(2006)
Proc. INTERSPEECH
, pp. 2614-2617
-
-
Schmidt, M. N.1
Olsson, R. K.2
-
16
-
-
79960657803
-
Exemplarbased sparse representations for noise robust automatic speech recognition
-
J. F. Gemmeke, T. Viratnen, and A. Hurmalainen, "Exemplarbased sparse representations for noise robust automatic speech recognition," IEEE Trans. Audio, Speech and Language Processing, vol. 19, no. 7, pp. 2067-2080, 2011.
-
(2011)
IEEE Trans. Audio, Speech and Language Processing
, vol.19
, Issue.7
, pp. 2067-2080
-
-
Gemmeke, J. F.1
Viratnen, T.2
Hurmalainen, A.3
-
17
-
-
84874248255
-
Exemplar-based voice conversion in noisy environment
-
R. Takashima, T. Takiguchi, and Y. Ariki, "Exemplar-based voice conversion in noisy environment," in Proc. SLT, 2012, pp. 313-317.
-
(2012)
Proc. SLT
, pp. 313-317
-
-
Takashima, R.1
Takiguchi, T.2
Ariki, Y.3
-
18
-
-
0032673049
-
Restructuring speech representations using a pitch-adaptive timefrequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
-
H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigne, "Restructuring speech representations using a pitch-adaptive timefrequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Communication, vol. 27, pp. 187-207, 1999.
-
(1999)
Speech Communication
, vol.27
, pp. 187-207
-
-
Kawahara, H.1
Masuda-Katsuse, I.2
de Cheveigne, A.3
|