-
1
-
-
84905234183
-
Non-parallel voice conversion using joint optimization of alignment by temporal context and spectral distortion
-
Hadas Benisty, David Malah, and Koby Crammer, "Non-parallel voice conversion using joint optimization of alignment by temporal context and spectral distortion," in ICASSP, 2014
-
(2014)
ICASSP
-
-
Benisty, H.1
Malah, D.2
Crammer, K.3
-
2
-
-
84930664922
-
Vocaine the vocoder and applications in speech synthesis
-
Yannis Agiomyrgiannakis, "Vocaine the vocoder and applications in speech synthesis," in ICASSP, 2015
-
(2015)
ICASSP
-
-
Agiomyrgiannakis, Y.1
-
3
-
-
0026880275
-
Voice transformation using PSOLA technique
-
H Valbret, Eric Moulines, and Jean-Pierre Tubach, "Voice transformation using PSOLA technique," Speech Communication, vol. 11, no. 2, pp. 175-187, 1992
-
(1992)
Speech Communication
, vol.11
, Issue.2
, pp. 175-187
-
-
Valbret, H.1
Moulines, E.2
Tubach, J.3
-
5
-
-
0031623661
-
Spectral voice conversion for text-to-speech synthesis
-
Alexander Kain and Michael W Macon, "Spectral voice conversion for text-to-speech synthesis," in ICASSP. IEEE, 1998, vol. 1, pp. 285-288
-
(1998)
ICASSP. IEEE
, vol.1
, pp. 285-288
-
-
Kain, A.1
Macon, M.W.2
-
6
-
-
85009224898
-
Perceptually weighted linear transformations for voice conversion
-
Hui Ye and Steve Young, "Perceptually weighted linear transformations for voice conversion," in Proc. of the Eurospeech'03, 2003
-
(2003)
Proc. of the Eurospeech'03
-
-
Ye, H.1
Young, S.2
-
7
-
-
85009084358
-
A first step towards text-independent voice conversion
-
David Sündermann, A. Bonafonte, Hermann Ney, and Harald Höge, "A first step towards text-independent voice conversion," in Proc. of the ICSLP'04, 2004
-
(2004)
Proc. of the ICSLP'04
-
-
Sündermann, D.1
Bonafonte, A.2
Ney, H.3
Höge, H.4
-
8
-
-
84908466787
-
Using phone and diphone based acoustic models for voice conversion: A step towards creating voice fonts
-
Arun Kumar and Ashish Verma, "Using phone and diphone based acoustic models for voice conversion: A step towards creating voice fonts," in Multimedia and Expo, 2003. ICME'03. Proceedings. 2003 International Conference on. IEEE, 2003, vol. 1, pp. I-393
-
(2003)
Multimedia and Expo, 2003. ICME'03. Proceedings. 2003 International Conference on. IEEE
, vol.1
, pp. I-393
-
-
Kumar, A.1
Verma, A.2
-
9
-
-
0029375590
-
Speaker adaptation using constrained estimation of Gaussian mixtures
-
Vassilios V Digalakis, Dimitry Rtischev, and Leonardo G Neumeyer, "Speaker adaptation using constrained estimation of Gaussian mixtures," Speech and Audio Processing, IEEE Transactions on, vol. 3, no. 5, pp. 357-366, 1995
-
(1995)
Speech and Audio Processing, IEEE Transactions on
, vol.3
, Issue.5
, pp. 357-366
-
-
Digalakis, V.V.1
Rtischev, D.2
Neumeyer, L.G.3
-
10
-
-
0033100038
-
Maximum-likelihood stochastic-transformation adaptation of hidden Markov models
-
Vassilis D Diakoloukas and Vassilios V Digalakis, "Maximum-likelihood stochastic-transformation adaptation of hidden Markov models," Speech and Audio Processing, IEEE Transactions on, vol. 7, no. 2, pp. 177-187, 1999
-
(1999)
Speech and Audio Processing, IEEE Transactions on
, vol.7
, Issue.2
, pp. 177-187
-
-
Diakoloukas, V.D.1
Digalakis, V.V.2
-
11
-
-
4544297119
-
Nonparallel training for voice conversion by maximum likelihood constrained adaptation
-
Athanasios Mouchtaris, Jan Van der Spiegel, and Paul Mueller, "Nonparallel training for voice conversion by maximum likelihood constrained adaptation," in ICASSP. IEEE, 2004, vol. 1, pp. I-1
-
(2004)
ICASSP. IEEE
, vol.1
, pp. I-1
-
-
Mouchtaris, A.1
Van Der Spiegel, J.2
Mueller, P.3
-
12
-
-
0034842740
-
Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR
-
Masatsune Tamura, Takashi Masuko, Keiichi Tokuda, and Takao Kobayashi, "Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR," in ICASSP. IEEE, 2001, vol. 2, pp. 805-808
-
(2001)
ICASSP. IEEE
, vol.2
, pp. 805-808
-
-
Tamura, M.1
Masuko, T.2
Tokuda, K.3
Kobayashi, T.4
-
13
-
-
84966348891
-
An HMM-based speech synthesis system applied to English
-
IEEE
-
Keiichi Tokuda, Heiga Zen, and Alan W Black, "An HMM-based speech synthesis system applied to English," in Speech Synthesis, 2002. Proceedings of 2002 IEEE Workshop on. IEEE, 2002, pp. 227-230
-
(2002)
Speech Synthesis, 2002. Proceedings of 2002 IEEE Workshop on
, pp. 227-230
-
-
Tokuda, K.1
Zen, H.2
Black, A.W.3
-
14
-
-
85009139544
-
Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
-
Takayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, and Tadashi Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," in Proc. Eurospeech, 1999, pp. 2347-2350
-
(1999)
Proc. Eurospeech
, pp. 2347-2350
-
-
Yoshimura, T.1
Tokuda, K.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
15
-
-
84973338782
-
Speech synthesis with neural networks
-
cs.NE/9811031
-
Orhan Karaali, Gerald Corrigan, and Ira A. Gerson, "Speech synthesis with neural networks," CoRR, vol. cs.NE/9811031, 1998
-
(1998)
CoRR
-
-
Karaali, O.1
Corrigan, G.2
Gerson, I.A.3
-
16
-
-
84890490547
-
Statistical parametric speech synthesis using deep neural networks
-
Heiga Zen, Andrew Senior, and Mike Schuster, "Statistical parametric speech synthesis using deep neural networks," in ICASSP. IEEE, 2013, pp. 7962-7966
-
(2013)
ICASSP. IEEE
, pp. 7962-7966
-
-
Zen, H.1
Senior, A.2
Schuster, M.3
-
17
-
-
84959112868
-
A study of speaker adaptation for DNN-based speech synthesis
-
Date of Acceptance: 01/06/2015
-
Zhizheng Wu, Pawel Swietojanski, Christophe Veaux, Stephen Renals, and Simon King, "A study of speaker adaptation for DNN-based speech synthesis," International Speech Communication Association, 2015, Date of Acceptance: 01/06/2015
-
(2015)
International Speech Communication Association
-
-
Wu, Z.1
Swietojanski, P.2
Veaux, C.3
Renals, S.4
King, S.5
-
18
-
-
84906280857
-
Voice conversion in high-order eigen space using deep belief nets
-
ISCA
-
Toru Nakashika, Ryoichi Takashima, Tetsuya Takiguchi, and Yasuo Ariki, "Voice conversion in high-order eigen space using deep belief nets," in Interspeech, 2013, pp. 369-372, ISCA
-
(2013)
Interspeech
, pp. 369-372
-
-
Nakashika, T.1
Takashima, R.2
Takiguchi, T.3
Ariki, Y.4
-
19
-
-
84946051934
-
Multi-speaker modeling and speaker adaptation for DNN-based TTS synthesis
-
April
-
Yuchen Fan, Yao Qian, F. K. Soong, and Lei He, "Multi-speaker modeling and speaker adaptation for DNN-based TTS synthesis," in ICASSP, April 2015, pp. 4475-4479
-
(2015)
ICASSP
, pp. 4475-4479
-
-
Fan, Y.1
Qian, Y.2
Soong, F.K.3
He, L.4
-
20
-
-
77953725318
-
INCA algorithm for training voice conversion systems from nonparallel corpora
-
Daniel Erro, Asunción Moreno, and A. Bonafonte, "INCA algorithm for training voice conversion systems from nonparallel corpora," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 18, no. 5, pp. 944-953, 2010
-
(2010)
Audio, Speech, and Language Processing, IEEE Transactions on
, vol.18
, Issue.5
, pp. 944-953
-
-
Erro, D.1
Moreno, A.2
Bonafonte, A.3
-
21
-
-
56149106209
-
Frame alignment method for cross-lingual voice conversion
-
Daniel Erro and Asunción Moreno, "Frame alignment method for cross-lingual voice conversion," in Interspeech, 2007
-
(2007)
Interspeech
-
-
Erro, D.1
Moreno, A.2
-
22
-
-
29144534131
-
Convergence theorems for generalized alternating minimization procedures
-
December
-
Asela Gunawardana and William Byrne, "Convergence theorems for generalized alternating minimization procedures," Journal of Machine Learning Research, December 2005
-
(2005)
Journal of Machine Learning Research
-
-
Gunawardana, A.1
Byrne, W.2
-
23
-
-
0032202775
-
Deterministic annealing for clustering, compression, classification, regression, and related optimization problems
-
Kenneth Rose, "Deterministic annealing for clustering, compression, classification, regression, and related optimization problems," Proceedings of the IEEE, vol. 86, no. 11, pp. 2210-2239, 1998
-
(1998)
Proceedings of the IEEE
, vol.86
, Issue.11
, pp. 2210-2239
-
-
Rose, K.1
-
24
-
-
84906257669
-
Voice conversion for non-parallel datasets using dynamic kernel partial least squares regression
-
Hanna Silén, Jani Nurminen, Elina Helander, and Moncef Gabbouj, "Voice conversion for non-parallel datasets using dynamic kernel partial least squares regression," in Interspeech, 2013
-
(2013)
Interspeech
-
-
Silén, H.1
Nurminen, J.2
Helander, E.3
Gabbouj, M.4
-
25
-
-
84973359606
-
Voice morphing that improves TTS quality using an optimal dynamic frequency warping-and-weighting transform
-
Yannis Agiomyrgiannakis, "Voice morphing that improves TTS quality using an optimal dynamic frequency warping-and-weighting transform," in ICASSP, 2016
-
(2016)
ICASSP
-
-
Agiomyrgiannakis, Y.1