-
1
-
-
70349197715
-
Voice transformation: a survey
-
Taipei, Taiwan
-
Y. Stylianou, "Voice transformation: a survey," in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Taipei, Taiwan, 2009, pp. 3585-3588.
-
(2009)
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, pp. 3585-3588
-
-
Stylianou, Y.1
-
2
-
-
85009084358
-
A first step towards text-independent voice conversion
-
Jeju Island, South Korea
-
D. Sündermann, A. Bonafonte, H. Ney, and H. Höge, "A first step towards text-independent voice conversion," in Proc. of the International Conference on Spoken Language Processing (ICSLP), Jeju Island, South Korea, 2004.
-
(2004)
Proc. of the International Conference on Spoken Language Processing (ICSLP)
-
-
Sündermann, D.1
Bonafonte, A.2
Ney, H.3
Höge, H.4
-
3
-
-
84905248180
-
Effectiveness of PLP-based phonetic segmentation for speech synthesis
-
Florence, Italy: IEEE
-
N. J. Shah, B. B. Vachhani, H. B. Sailor, and H. A. Patil, "Effectiveness of PLP-based phonetic segmentation for speech synthesis," in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Florence, Italy: IEEE, 2014, pp. 270-274.
-
(2014)
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, pp. 270-274
-
-
Shah, N. J.1
Vachhani, B. B.2
Sailor, H. B.3
Patil, H. A.4
-
4
-
-
84941033901
-
Effectiveness of multiscale fractal dimension-based phonetic segmentation in speech synthesis for low resource language
-
Kuching, Borneo Malaysia
-
M. Zaki, J. N. Shah, and H. A. Patil, "Effectiveness of multiscale fractal dimension-based phonetic segmentation in speech synthesis for low resource language," in International Conference on Asian Language Processing (IALP), Kuching, Borneo Malaysia, 2014, pp. 103-106.
-
(2014)
International Conference on Asian Language Processing (IALP)
, pp. 103-106
-
-
Zaki, M.1
Shah, J. N.2
Patil, H. A.3
-
5
-
-
84867198185
-
On the impact of alignment on voice conversion performance
-
Brisbane, Australia
-
E. Helander, J. Schwarz, J. Nurminen, H. Silen, and M. Gabbouj, "On the impact of alignment on voice conversion performance," in INTERSPEECH, Brisbane, Australia, 2008, pp. 1-5.
-
(2008)
INTERSPEECH
, pp. 1-5
-
-
Helander, E.1
Schwarz, J.2
Nurminen, J.3
Silen, H.4
Gabbouj, M.5
-
6
-
-
7544223741
-
A survey of outlier detection methodologies
-
V. J. Hodge and J. Austin, "A survey of outlier detection methodologies," Artificial Intelligence Review, vol. 22, no. 2, pp. 85-126, 2004.
-
(2004)
Artificial Intelligence Review
, vol.22
, Issue.2
, pp. 85-126
-
-
Hodge, V. J.1
Austin, J.2
-
7
-
-
13444287831
-
ROBPCA: a new approach to robust principal component analysis
-
M. Hubert, P. J. Rousseeuw, and K. Vanden Branden, "ROBPCA: a new approach to robust principal component analysis," Technometrics, vol. 47, no. 1, pp. 64-79, 2005.
-
(2005)
Technometrics
, vol.47
, Issue.1
, pp. 64-79
-
-
Hubert, M.1
Rousseeuw, P. J.2
Vanden Branden, K.3
-
8
-
-
0023739214
-
Voice conversion through vector quantization
-
New York, NY, USA: IEEE
-
M. Abe, S. Nakamura, K. Shikano, and H. Kuwabara, "Voice conversion through vector quantization," in International Conference on Acoustics, Speech, and Signal Processing (ICASSP). New York, NY, USA: IEEE, 1988, pp. 655-658.
-
(1988)
International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
, pp. 655-658
-
-
Abe, M.1
Nakamura, S.2
Shikano, K.3
Kuwabara, H.4
-
9
-
-
0031623661
-
Spectral voice conversion for text-tospeech synthesis
-
Seattle, WA
-
A. Kain and M.W. Macon, "Spectral voice conversion for text-tospeech synthesis," in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Seattle, WA, 1998, pp. 285-288.
-
(1998)
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, pp. 285-288
-
-
Kain, A.1
Macon, M.W.2
-
10
-
-
0032026483
-
Continuous probabilistic transform for voice conversion
-
Y. Stylianou, O. Cappé, and E. Moulines, "Continuous probabilistic transform for voice conversion," IEEE Trans. on Speech and Audio Processing, vol. 6, no. 2, pp. 131-142, 1998.
-
(1998)
IEEE Trans. on Speech and Audio Processing
, vol.6
, Issue.2
, pp. 131-142
-
-
Stylianou, Y.1
Cappé, O.2
Moulines, E.3
-
11
-
-
57749193836
-
Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
-
T. Toda, A.W. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory," IEEE Trans. on Audio, Speech and Language Processing, vol. 15, no. 8, pp. 2222-2235, 2007.
-
(2007)
IEEE Trans. on Audio, Speech and Language Processing
, vol.15
, Issue.8
, pp. 2222-2235
-
-
Toda, T.1
Black, A.W.2
Tokuda, K.3
-
12
-
-
77953712499
-
Voice conversion using partial least squares regression
-
E. Helander, T. Virtanen, J. Nurminen, and M. Gabbouj, "Voice conversion using partial least squares regression," IEEE Transactions on Audio, Speech, and Language Processing, vol. 18, no. 5, pp. 912-921, 2010.
-
(2010)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.18
, Issue.5
, pp. 912-921
-
-
Helander, E.1
Virtanen, T.2
Nurminen, J.3
Gabbouj, M.4
-
13
-
-
84856141218
-
Voice conversion using dynamic kernel partial least squares regression
-
E. Helander, H. Silén, T. Virtanen, and M. Gabbouj, "Voice conversion using dynamic kernel partial least squares regression," IEEE Transactions on Audio, Speech, and Language processing, vol. 20, no. 3, pp. 806-817, 2012.
-
(2012)
IEEE Transactions on Audio, Speech, and Language processing
, vol.20
, Issue.3
, pp. 806-817
-
-
Helander, E.1
Silén, H.2
Virtanen, T.3
Gabbouj, M.4
-
14
-
-
84921735339
-
Voice conversion using deep neural networks with layer-wise generative training
-
L.-H. Chen, Z.-H. Ling, L.-J. Liu, and L.-R. Dai, "Voice conversion using deep neural networks with layer-wise generative training," IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 22, no. 12, pp. 1859-1872, 2014.
-
(2014)
IEEE/ACM Transactions on Audio, Speech, and Language Processing
, vol.22
, Issue.12
, pp. 1859-1872
-
-
Chen, L.-H.1
Ling, Z.-H.2
Liu, L.-J.3
Dai, L.-R.4
-
15
-
-
84946685887
-
Voice conversion using deep neural networks with speaker-independent pre-training
-
Nevada, USA
-
S. H. Mohammadi and A. Kain, "Voice conversion using deep neural networks with speaker-independent pre-training," in IEEE Spoken Language Technology Workshop (SLT), Nevada, USA, 2014, pp. 19-23.
-
(2014)
IEEE Spoken Language Technology Workshop (SLT)
, pp. 19-23
-
-
Mohammadi, S. H.1
Kain, A.2
-
16
-
-
84959173289
-
Semi-supervised training of a voice conversion mapping function using a joint-autoencoder
-
Dresden, Germany
-
S. H. Mohammadi and A. Kain, "Semi-supervised training of a voice conversion mapping function using a joint-autoencoder," in INTERSPEECH, Dresden, Germany, 2015, pp. 1-5.
-
(2015)
INTERSPEECH
, pp. 1-5
-
-
Mohammadi, S. H.1
Kain, A.2
-
17
-
-
84901803470
-
Exemplar-based voice conversion using non-negative spectrogram deconvolution
-
Barcelona, Spain
-
Z. Wu, T. Virtanen, T. Kinnunen, E. S. Chng, and H. Li, "Exemplar-based voice conversion using non-negative spectrogram deconvolution," in Proc. 8th ISCA Speech Synthesis Workshop, Barcelona, Spain, 2013, pp. 201-206.
-
(2013)
Proc. 8th ISCA Speech Synthesis Workshop
, pp. 201-206
-
-
Wu, Z.1
Virtanen, T.2
Kinnunen, T.3
Chng, E. S.4
Li, H.5
-
18
-
-
84911369131
-
Exemplar-based sparse representation with residual compensation for voice conversion
-
Z. Wu, T. Virtanen, E. S. Chng, and H. Li, "Exemplar-based sparse representation with residual compensation for voice conversion," IEEE/ACM Trans. on Audio, Speech, and Language Processing, vol. 22, no. 10, pp. 1506-1521, 2014.
-
(2014)
IEEE/ACM Trans. on Audio, Speech, and Language Processing
, vol.22
, Issue.10
, pp. 1506-1521
-
-
Wu, Z.1
Virtanen, T.2
Chng, E. S.3
Li, H.4
-
19
-
-
84973345217
-
Semi-nonnegative matrix factorization using alternating direction method of multipliers for voice conversion
-
Shanghai, China
-
R. AIHARA, T. TAKIGUCHI, and Y. ARIKI, "Semi-nonnegative matrix factorization using alternating direction method of multipliers for voice conversion," in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, China, 2016, pp. 5170-5174.
-
(2016)
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, pp. 5170-5174
-
-
AIHARA, R.1
TAKIGUCHI, T.2
ARIKI, Y.3
-
20
-
-
0034842552
-
Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum
-
Salt Lake City, UT, USA
-
T. Toda, H. Saruwatari, and K. Shikano, "Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum," in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Salt Lake City, UT, USA, 2001, pp. 841-844.
-
(2001)
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, pp. 841-844
-
-
Toda, T.1
Saruwatari, H.2
Shikano, K.3
-
22
-
-
0032680362
-
A fast algorithm for the minimum covariance determinant estimator
-
P. J. Rousseeuw and K. V. Driessen, "A fast algorithm for the minimum covariance determinant estimator," Technometrics, vol. 41, no. 3, pp. 212-223, 1999.
-
(1999)
Technometrics
, vol.41
, Issue.3
, pp. 212-223
-
-
Rousseeuw, P. J.1
Driessen, K. V.2
-
23
-
-
0002629270
-
Maximum likelihood from incomplete data via the em algorithm
-
A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the em algorithm," Journal of the royal statistical society. Series B (methodological), vol. 39, no. 1, pp. 1-38, 1977.
-
(1977)
Journal of the royal statistical society. Series B (methodological)
, vol.39
, Issue.1
, pp. 1-38
-
-
Dempster, A. P.1
Laird, N. M.2
Rubin, D. B.3
-
25
-
-
84865795787
-
Improved hnm-based vocoder for statistical synthesizers
-
Florence, Italy
-
D. Erro, I. Sainz, E. Navas, and I. Hernáez, "Improved hnm-based vocoder for statistical synthesizers." in INTERSPEECH, Florence, Italy, 2011, pp. 1809-1812.
-
(2011)
INTERSPEECH
, pp. 1809-1812
-
-
Erro, D.1
Sainz, I.2
Navas, E.3
Hernáez, I.4
-
26
-
-
0012392720
-
P. 85. a method for subjective performance assessment of the quality of speech voice output devices
-
International Telecommunication Union (ITU), Geneva., Last Accessed {July 26, 2016
-
I. Rec, "P. 85. a method for subjective performance assessment of the quality of speech voice output devices," International Telecommunication Union (ITU), Geneva., Available Online: {https://www.itu.int/rec/T-REC-P.85-199406-I/en} Last Accessed {July 26, 2016}.
-
-
-
Rec, I.1
|