-
1
-
-
0032026483
-
Continuous probabilistictransform for voice conversion
-
Y. Stylianou, O. Cappe, and E. Moilines, "Continuous probabilistictransform for voice conversion, " IEEE Trans. Speech and AudioProcessing, vol. 6, no. 2, pp. 131-142, 1998.
-
(1998)
IEEE Trans. Speech and AudioProcessing
, vol.6
, Issue.2
, pp. 131-142
-
-
Stylianou, Y.1
Cappe, O.2
Moilines, E.3
-
2
-
-
80052698826
-
Speakingaidsystems using GMM-based voice conversion for electrolaryngealspeech
-
K. Nakamura, T. Toda, H. Saruwatari, and K. Shikano, "Speakingaidsystems using GMM-based voice conversion for electrolaryngealspeech, " Speech Communication, vol. 54, no. 1, pp. 134-146, 2012.
-
(2012)
Speech Communication
, vol.54
, Issue.1
, pp. 134-146
-
-
Nakamura, K.1
Toda, T.2
Saruwatari, H.3
Shikano, K.4
-
3
-
-
0031623661
-
Spectral voice conversion for text-tospeechsynthesis
-
A. Kain and M. W. Macon, "Spectral voice conversion for text-tospeechsynthesis, " in Proc. ICASSP, vol. 1, pp. 285-288, 1998.
-
(1998)
Proc. ICASSP
, vol.1
, pp. 285-288
-
-
Kain, A.1
Macon, M.W.2
-
4
-
-
84910069658
-
A mel-cepstral analysis technique restoring high frequencycomponents from low-sampling-rate speech
-
K. Nakamura, K. Hashimoto, K. Oura, Y. Nankaku, and K. Tokuda, "A mel-cepstral analysis technique restoring high frequencycomponents from low-sampling-rate speech, " in Proc. Interspeech, pp. 2494-2498, 2014.
-
(2014)
Proc. Interspeech
, pp. 2494-2498
-
-
Nakamura, K.1
Hashimoto, K.2
Oura, K.3
Nankaku, Y.4
Tokuda, K.5
-
5
-
-
84910024857
-
GMM-basedband width extension using sub-band basis spectrum model
-
Y. Ohtani, M. Tamura, M. Morita, and M. Akamine, "GMM-basedband width extension using sub-band basis spectrum model, " inProc. Interspeech, pp. 2489-2493, 2014.
-
(2014)
Proc. Interspeech
, pp. 2489-2493
-
-
Ohtani, Y.1
Tamura, M.2
Morita, M.3
Akamine, M.4
-
6
-
-
0023739214
-
Esophageal speech enhancement based on statistical voice conversionwith Gaussian mixture models
-
M. Abe, S. Nakamura, K. Shikano, and H. Kuwabara, "Esophageal speech enhancement based on statistical voice conversionwith Gaussian mixture models, " in Proc. ICASSP, pp. 655-658, 1988.
-
(1988)
Proc. ICASSP
, pp. 655-658
-
-
Abe, M.1
Nakamura, S.2
Shikano, K.3
Kuwabara, H.4
-
7
-
-
0026880275
-
Voice transformationusing PSOLA technique
-
H. Valbret, E. Moulines, and J. P. Tubach, "Voice transformationusing PSOLA technique, " Speech Communication, vol. 11, no. 2-3, pp. 175-187, 1992.
-
(1992)
Speech Communication
, vol.11
, Issue.2-3
, pp. 175-187
-
-
Valbret, H.1
Moulines, E.2
Tubach, J.P.3
-
8
-
-
57749193836
-
Voice conversion based onmaximum likelihood estimation of spectral parameter trajectory
-
T. Toda, A. Black, and K. Tokuda, "Voice conversion based onmaximum likelihood estimation of spectral parameter trajectory, "IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 8, pp. 2222-2235, 2007.
-
(2007)
IEEE Trans. Audio, Speech, Lang. Process
, vol.15
, Issue.8
, pp. 2222-2235
-
-
Toda, T.1
Black, A.2
Tokuda, K.3
-
9
-
-
77953712499
-
Voiceconversion using partial least squares regression
-
E. Heland er, T. Virtanen, J. Nurminen, and M. Gabbouj, "Voiceconversion using partial least squares regression, " IEEE Trans. Audio, Speech, Lang. Process., vol. 18, Issue: 5, pp. 912-921, 2010.
-
(2010)
IEEE Trans. Audio, Speech, Lang. Process
, vol.18
, Issue.5
, pp. 912-921
-
-
Heland Er, E.1
Virtanen, T.2
Nurminen, J.3
Gabbouj, M.4
-
10
-
-
84874248255
-
Exemplar-based voiceconversion in noisy environment
-
R. Takashima, T. Takiguchi, and Y. Ariki, "Exemplar-based voiceconversion in noisy environment, " in Proc. SLT, pp. 313-317, 2012.
-
(2012)
Proc. SLT
, pp. 313-317
-
-
Takashima, R.1
Takiguchi, T.2
Ariki, Y.3
-
11
-
-
84911369131
-
Exemplar-basedsparse representation with residual compensation for voice conversion
-
Z. Wu, T. Virtanen, E. S. Chng, and H. Li, "Exemplar-basedsparse representation with residual compensation for voice conversion, "IEEE Trans. Audio, Speech, Lang. Process., vol. 22, no. 10, pp. 1506-1521, 2014.
-
(2014)
IEEE Trans. Audio, Speech, Lang. Process
, vol.22
, Issue.10
, pp. 1506-1521
-
-
Wu, Z.1
Virtanen, T.2
Chng, E.S.3
Li, H.4
-
12
-
-
84901806271
-
Noiserobustvoice conversion based on sparse spectral mapping usingnon-negative matrix factorization
-
R. Aihara, R. Takashima, T. Takiguchi, and Y. Ariki, "Noiserobustvoice conversion based on sparse spectral mapping usingnon-negative matrix factorization, " IEICE Transactions on Informationand Systems, vol. E97-D, no. 6, pp. 1411-1418, 2014.
-
(2014)
IEICE Transactions on Informationand Systems
, vol.E97-D
, Issue.6
, pp. 1411-1418
-
-
Aihara, R.1
Takashima, R.2
Takiguchi, T.3
Ariki, Y.4
-
14
-
-
44949110218
-
Single-channel speech separationusing sparse non-negative matrix factorization
-
M. N. Schmidt and R. K. Olsson, "Single-channel speech separationusing sparse non-negative matrix factorization, " in Proc. Interspeech, 2006.
-
(2006)
Proc. Interspeech
-
-
Schmidt, M.N.1
Olsson, R.K.2
-
15
-
-
50249152311
-
Monaural sound source separation by non-negativematrix factorization with temporal continuity and sparseness criteria
-
T. Virtanen, "Monaural sound source separation by non-negativematrix factorization with temporal continuity and sparseness criteria, "IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 1066-1074, 2007.
-
(2007)
IEEE Trans. Audio, Speech, Lang. Process
, vol.15
, Issue.3
, pp. 1066-1074
-
-
Virtanen, T.1
-
16
-
-
79960657803
-
Exemplarbasedsparse representations for noise robust automatic speechrecognition
-
J. F. Gemmeke, T. Viratnen, and A. Hurmalainen, "Exemplarbasedsparse representations for noise robust automatic speechrecognition, " IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 7, pp. 2067-2080, 2011.
-
(2011)
IEEE Trans. Audio, Speech, Lang. Process
, vol.19
, Issue.7
, pp. 2067-2080
-
-
Gemmeke, J.F.1
Viratnen, T.2
Hurmalainen, A.3
-
17
-
-
84905227265
-
Voiceconversion based on non-negative matrix factorization usingphoneme-categorized dictionary
-
R. Aihara, T. Nakashika, T. Takiguchi, and Y. Ariki, "Voiceconversion based on non-negative matrix factorization usingphoneme-categorized dictionary, " in Proc. ICASSP, pp. 7944-7948, 2014.
-
(2014)
Proc. ICASSP
, pp. 7944-7948
-
-
Aihara, R.1
Nakashika, T.2
Takiguchi, T.3
Ariki, Y.4
-
18
-
-
84901801701
-
A preliminarydemonstration of exemplar-based voice conversion for articulationdisorders using an individuality-preserving dictionary
-
R. Aihara, R. Takashima, T. Takiguchi, and Y. Ariki, "A preliminarydemonstration of exemplar-based voice conversion for articulationdisorders using an individuality-preserving dictionary, "EURASIP Journal on Audio, Speech, and Music Processing, vol. 2014: 5, doi: 10. 1186/1687-4722-2014-5, 2014.
-
(2014)
EURASIP Journal on Audio, Speech, and Music Processing
, vol.2014
, pp. 5
-
-
Aihara, R.1
Takashima, R.2
Takiguchi, T.3
Ariki, Y.4
-
19
-
-
84910091291
-
Multimodalexemplar-based voice conversion using lip features in noisy environments
-
K. Masaka, R. Aihara, T. Takiguchi, and Y. Ariki, "Multimodalexemplar-based voice conversion using lip features in noisy environments, "in Proc. INTERSPEECH, vol. 1159-1163, 2014.
-
(2014)
Proc. INTERSPEECH
, vol.1159-1163
-
-
Masaka, K.1
Aihara, R.2
Takiguchi, T.3
Ariki, Y.4
-
20
-
-
44949210554
-
MAP-based adaptation for speech conversionusing adaptation data selection and non-parallel training
-
C. H. Lee and C. H. Wu, "MAP-based adaptation for speech conversionusing adaptation data selection and non-parallel training, "in Proc. INTERSPEECH, pp. 2254-2257, 2006.
-
(2006)
Proc. INTERSPEECH
, pp. 2254-2257
-
-
Lee, C.H.1
Wu, C.H.2
-
21
-
-
34047245444
-
Nonparalleltraining for voice conversion based on a parameter adaptation approach
-
A. Mouchtaris, J. V. der Spiegel, and P. Mueller, "Nonparalleltraining for voice conversion based on a parameter adaptation approach, "Audio, Speech, and Language Processing, IEEE Transactionson 14 (3), pp. 952-963, 2006.
-
(2006)
Audio, Speech, and Language Processing, IEEE Transactionson
, vol.14
, Issue.3
, pp. 952-963
-
-
Mouchtaris, A.1
Der Spiegel, J.V.2
Mueller, P.3
-
22
-
-
34547512822
-
Eigenvoice conversion basedon Gaussian mixture model
-
T. Toda, Y. Ohtani, and K. Shikano, "Eigenvoice conversion basedon Gaussian mixture model, " in Proc. Interspeech, pp. 2446-2449, 2006.
-
(2006)
Proc. Interspeech
, pp. 2446-2449
-
-
Toda, T.1
Ohtani, Y.2
Shikano, K.3
-
23
-
-
70450194389
-
Many-tomanyeigenvoice conversion with reference voice
-
Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Many-tomanyeigenvoice conversion with reference voice, " in Proc. Interspeech, pp. 1623-1626, 2009.
-
(2009)
Proc. Interspeech
, pp. 1623-1626
-
-
Ohtani, Y.1
Toda, T.2
Saruwatari, H.3
Shikano, K.4
-
24
-
-
84865798483
-
One-tomanyvoice conversion based on tensor representation of speakerspace
-
D. Saito, K. Yamamoto, N. Minematsu, and K. Hirose, "One-tomanyvoice conversion based on tensor representation of speakerspace, " in Proc. INTERSPEECH, pp. 653-656, 2011.
-
(2011)
Proc. INTERSPEECH
, pp. 653-656
-
-
Saito, D.1
Yamamoto, K.2
Minematsu, N.3
Hirose, K.4
-
25
-
-
0025475528
-
ATR Japanese speech database as a tool ofspeech recognition and synthesis
-
A. Kurematsu, K. Takeda, Y. Sagisaka, S. Katagiri, H. Kuwabara, and K. Shikano, "ATR Japanese speech database as a tool ofspeech recognition and synthesis, " Speech Communication, vol. 9, pp. 357-363, 1990.
-
(1990)
Speech Communication
, vol.9
, pp. 357-363
-
-
Kurematsu, A.1
Takeda, K.2
Sagisaka, Y.3
Katagiri, S.4
Kuwabara, H.5
Shikano, K.6
-
26
-
-
33750915991
-
STRAIGHT, exploitation of the other aspectof vocoder: Perceptually isomorphic decomposition of speechsounds
-
H. Kawahara, "STRAIGHT, exploitation of the other aspectof vocoder: Perceptually isomorphic decomposition of speechsounds, " Acoustical Science and Technology, pp. 349-353, 2006.
-
(2006)
Acoustical Science and Technology
, pp. 349-353
-
-
Kawahara, H.1
|