-
1
-
-
0032026483
-
Continuous probabilistic transform for voice conversion
-
Y. Stylianou, O. Cappe, and E. Moulines, Continuous probabilistic transform for voice conversion, IEEE Trans. Speech and Audio Processing, vol. 6, no. 2, pp. 131-142, 1998
-
(1998)
IEEE Trans. Speech and Audio Processing
, vol.6
, Issue.2
, pp. 131-142
-
-
Stylianou, Y.1
Cappe, O.2
Moulines, E.3
-
2
-
-
80052698826
-
Speaking-aid systems using GMM-based voice conversion for electrolaryngeal speech
-
K. Nakamura, T. Toda, H. Saruwatari, and K. Shikano, Speaking-aid systems using GMM-based voice conversion for electrolaryngeal speech, Speech Communication, vol. 54, no. 1, pp. 134-146, 2012
-
(2012)
Speech Communication
, vol.54
, Issue.1
, pp. 134-146
-
-
Nakamura, K.1
Toda, T.2
Saruwatari, H.3
Shikano, K.4
-
3
-
-
0031623661
-
Spectral voice conversion for text-to-speech synthesis
-
A. Kain and M.W. Macon, Spectral voice conversion for text-to-speech synthesis, in ICASSP, vol. 1, pp. 285-288, 1998
-
(1998)
ICASSP
, vol.1
, pp. 285-288
-
-
Kain, A.1
Macon, M.W.2
-
4
-
-
84910069658
-
A mel-cepstral analysis technique restoring high frequency components from low-sampling-rate speech
-
K. Nakamura, K. Hashimoto, K. Oura, Y. Nankaku, and K. Tokuda, A mel-cepstral analysis technique restoring high frequency components from low-sampling-rate speech, in Interspeech, pp. 2494-2498, 2014
-
(2014)
Interspeech
, pp. 2494-2498
-
-
Nakamura, K.1
Hashimoto, K.2
Oura, K.3
Nankaku, Y.4
Tokuda, K.5
-
5
-
-
84910024857
-
GMM-based bandwidth extension using sub-band basis spectrum model
-
Y. Ohtani, M. Tamura, M. Morita, and M. Akamine, GMM-based bandwidth extension using sub-band basis spectrum model, in Interspeech, pp. 2489-2493, 2014
-
(2014)
Interspeech
, pp. 2489-2493
-
-
Ohtani, Y.1
Tamura, M.2
Morita, M.3
Akamine, M.4
-
6
-
-
0023739214
-
Voice conversion through vector quantization
-
M. Abe, S. Nakamura, K. Shikano, and H. Kuwabara, Voice conversion through vector quantization, in ICASSP, pp. 655-658, 1988
-
(1988)
ICASSP
, pp. 655-658
-
-
Abe, M.1
Nakamura, S.2
Shikano, K.3
Kuwabara, H.4
-
7
-
-
0026880275
-
Voice transformation using PSOLA technique
-
H. Valbret, E. Moulines, and J. P. Tubach, Voice transformation using PSOLA technique, Speech Communication, vol. 11, no. 2-3, pp. 175-187, 1992
-
(1992)
Speech Communication
, vol.11
, Issue.2-3
, pp. 175-187
-
-
Valbret, H.1
Moulines, E.2
Tubach, J.P.3
-
8
-
-
57749193836
-
Voice conversion based on maximum likelihood estimation of spectral parameter trajectory
-
T. Toda, A. Black, and K. Tokuda, Voice conversion based on maximum likelihood estimation of spectral parameter trajectory, IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 8, pp. 2222-2235, 2007
-
(2007)
IEEE Trans. Audio, Speech, Lang. Process
, vol.15
, Issue.8
, pp. 2222-2235
-
-
Toda, T.1
Black, A.2
Tokuda, K.3
-
9
-
-
77953712499
-
Voice conversion using partial least squares regression
-
E. Helander, T. Virtanen, J. Nurminen, and M. Gabbouj, Voice conversion using partial least squares regression, IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 5, pp. 912-921, 2010
-
(2010)
IEEE Trans. Audio, Speech, Lang. Process
, vol.18
, Issue.5
, pp. 912-921
-
-
Helander, E.1
Virtanen, T.2
Nurminen, J.3
Gabbouj, M.4
-
10
-
-
44949210554
-
MAP-based adaptation for speech conversion using adaptation data selection and non-parallel training
-
C. H. Lee and C. H. Wu, MAP-based adaptation for speech conversion using adaptation data selection and non-parallel training, in Interspeech, pp. 2254-2257, 2006
-
(2006)
Interspeech
, pp. 2254-2257
-
-
Lee, C.H.1
Wu, C.H.2
-
11
-
-
34547512822
-
Eigenvoice conversion based on Gaussian mixture model
-
T. Toda, Y. Ohtani, and K. Shikano, Eigenvoice conversion based on Gaussian mixture model, in Interspeech, pp. 2446-2449, 2006
-
(2006)
Interspeech
, pp. 2446-2449
-
-
Toda, T.1
Ohtani, Y.2
Shikano, K.3
-
12
-
-
84865798483
-
One-to-many voice conversion based on tensor representation of speaker space
-
D. Saito, K. Yamamoto, N. Minematsu, and K. Hirose, One-to-many voice conversion based on tensor representation of speaker space, in Interspeech, pp. 653-656, 2011
-
(2011)
Interspeech
, pp. 653-656
-
-
Saito, D.1
Yamamoto, K.2
Minematsu, N.3
Hirose, K.4
-
13
-
-
84874248255
-
Exemplar-based voice conversion in noisy environment
-
R. Takashima, T. Takiguchi, and Y. Ariki, Exemplar-based voice conversion in noisy environment, in SLT, IEEE Workshop on Spoken Language Technology, pp. 313-317, 2012
-
(2012)
SLT, IEEE Workshop on Spoken Language Technology
, pp. 313-317
-
-
Takashima, R.1
Takiguchi, T.2
Ariki, Y.3
-
14
-
-
84885055553
-
Exemplar-based voice conversion using sparse representation in noisy environments
-
R. Takashima, T. Takiguchi, and Y. Ariki, Exemplar-based voice conversion using sparse representation in noisy environments, IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, vol. E96-A, no. 10, pp. 1946-1953, 2013
-
(2013)
IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
, vol.E96-A
, Issue.10
, pp. 1946-1953
-
-
Takashima, R.1
Takiguchi, T.2
Ariki, Y.3
-
16
-
-
44949110218
-
Single-channel speech separation using sparse non-negative matrix factorization
-
M. N. Schmidt and R. K. Olsson, Single-channel speech separation using sparse non-negative matrix factorization, in Interspeech, 2006
-
(2006)
Interspeech
-
-
Schmidt, M.N.1
Olsson, R.K.2
-
17
-
-
50249152311
-
Monaural sound source separation by non-negative matrix factorization with temporal continuity and sparseness criteria
-
T. Virtanen, Monaural sound source separation by non-negative matrix factorization with temporal continuity and sparseness criteria, IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 1066-1074, 2007
-
(2007)
IEEE Trans. Audio, Speech, Lang. Process
, vol.15
, Issue.3
, pp. 1066-1074
-
-
Virtanen, T.1
-
18
-
-
79960657803
-
Exemplar-based sparse representations for noise robust automatic speech recognition
-
J. F. Gemmeke, T. Virtanen, and A. Hurmalainen, Exemplar-based sparse representations for noise robust automatic speech recognition, IEEE Trans. Audio, Speech and Language Processing, vol. 19, no. 7, pp. 2067-2080, 2011
-
(2011)
IEEE Trans. Audio, Speech and Language Processing
, vol.19
, Issue.7
, pp. 2067-2080
-
-
Gemmeke, J.F.1
Virtanen, T.2
Hurmalainen, A.3
-
19
-
-
84890519936
-
Individuality-preserving voice conversion for articulation disorders based on non-negative matrix factorization
-
R. Aihara, R. Takashima, T. Takiguchi, and Y. Ariki, Individuality-preserving voice conversion for articulation disorders based on non-negative matrix factorization, in ICASSP, pp. 8037-8040, 2013
-
(2013)
ICASSP
, pp. 8037-8040
-
-
Aihara, R.1
Takashima, R.2
Takiguchi, T.3
Ariki, Y.4
-
20
-
-
84905269973
-
Multi-modal voice conversion using non-negative matrix factorization in noisy environments
-
K. Masaka, R. Aihara, T. Takiguchi, and Y. Ariki, Multi-modal voice conversion using non-negative matrix factorization in noisy environments, in ICASSP, pp. 1561-1565, 2014
-
(2014)
ICASSP
, pp. 1561-1565
-
-
Masaka, K.1
Aihara, R.2
Takiguchi, T.3
Ariki, Y.4
-
21
-
-
84911369131
-
Exemplar-based sparse representation with residual compensation for voice conversion
-
Z. Wu, T. Virtanen, E. S. Chng, and H. Li, Exemplar-based sparse representation with residual compensation for voice conversion, IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 22, no. 10, pp. 1506-1521, 2014
-
(2014)
IEEE/ACM Transactions on Audio, Speech and Language Processing
, vol.22
, Issue.10
, pp. 1506-1521
-
-
Wu, Z.1
Virtanen, T.2
Chng, E.S.3
Li, H.4
-
22
-
-
0025475528
-
ATR Japanese speech database as a tool of speech recognition and synthesis
-
A. Kurematsu, K. Takeda, Y. Sagisaka, S. Katagiri, H. Kuwabara, and K. Shikano, ATR Japanese speech database as a tool of speech recognition and synthesis, Speech Communication, vol. 9, pp. 357-363, 1990
-
(1990)
Speech Communication
, vol.9
, pp. 357-363
-
-
Kurematsu, A.1
Takeda, K.2
Sagisaka, Y.3
Katagiri, S.4
Kuwabara, H.5
Shikano, K.6
-
23
-
-
0032673049
-
Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
-
H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigne, Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds, Speech Communication, vol. 27, no. 3-4, pp. 187-207, 1999
-
(1999)
Speech Communication
, vol.27
, Issue.3-4
, pp. 187-207
-
-
Kawahara, H.1
Masuda-Katsuse, I.2
De Cheveigne, A.3
-
24
-
-
84905227265
-
Voice conversion based on non-negative matrix factorization using phoneme-categorized dictionary
-
R. Aihara, T. Nakashika, T. Takiguchi, and Y. Ariki, Voice conversion based on non-negative matrix factorization using phoneme-categorized dictionary, in ICASSP, pp. 7944-7948, 2014
-
(2014)
ICASSP
, pp. 7944-7948
-
-
Aihara, R.1
Nakashika, T.2
Takiguchi, T.3
Ariki, Y.4
-
25
-
-
84901806271
-
Noise-robust voice conversion based on sparse spectral mapping using non-negative matrix factorization
-
R. Aihara, R. Takashima, T. Takiguchi, and Y. Ariki, Noise-robust voice conversion based on sparse spectral mapping using non-negative matrix factorization, IEICE Transactions on Information and Systems, vol. E97-D, no. 6, pp. 1411-1418, 2014
-
(2014)
IEICE Transactions on Information and Systems
, vol.E97-D
, Issue.6
, pp. 1411-1418
-
-
Aihara, R.1
Takashima, R.2
Takiguchi, T.3
Ariki, Y.4