-
1
-
-
0033708106
-
Speech parameter generation algorithms for HMM-based speech synthesis
-
K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMM-based speech synthesis," in ICASSP, 2000, pp. 1315-1318.
-
(2000)
ICASSP
, pp. 1315-1318
-
-
Tokuda, K.1
Yoshimura, T.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
2
-
-
84890522099
-
F0 contour prediction with a deep belief network-Gaussian process hybrid model
-
R. Fernandez, R. Rendel, B. Ramabhadran, and R. Hoory, "F0 contour prediction with a Deep Belief Network-Gaussian Process hybrid model," in ICASSP, 2013, pp. 6885-6889.
-
(2013)
ICASSP
, pp. 6885-6889
-
-
Fernandez, R.1
Rendel, R.2
Ramabhadran, B.3
Hoory, R.4
-
3
-
-
84890490547
-
Statistical parametric speech synthesis using deep neural networks
-
H. Zen, A. Senior, and M. Schuster, "Statistical parametric speech synthesis using Deep Neural Networks," in ICASSP, 2013, pp. 7962-7966.
-
(2013)
ICASSP
, pp. 7962-7966
-
-
Zen, H.1
Senior, A.2
Schuster, M.3
-
4
-
-
84901237776
-
Modeling spectral envelopes using restricted Boltzmann machines and deep belief networks for statistical parametric speech synthesis
-
Z.-H. Ling, L. Deng, and D. Yu, "Modeling spectral envelopes using Restricted Boltzmann Machines and Deep Belief Networks for statistical parametric speech synthesis," IEEE Trans. Audio, Speech, and Lang. Proc., vol. 21, no. 10, pp. 2129-2139, 2013.
-
(2013)
IEEE Trans. Audio, Speech, and Lang. Proc.
, vol.21
, Issue.10
, pp. 2129-2139
-
-
Ling, Z.-H.1
Deng, L.2
Yu, D.3
-
5
-
-
84890527090
-
Multi-distribution deep belief networks for speech synthesis
-
S. Kang, X. Qian, and H. Meng, "Multi-distribution Deep Belief Networks for speech synthesis," in ICASSP, 2013, pp. 8012-8016.
-
(2013)
ICASSP
, pp. 8012-8016
-
-
Kang, S.1
Qian, X.2
Meng, H.3
-
6
-
-
84890545600
-
Multi-task learning in deep neural networks for improved phoneme recognition
-
M. L. Seltzer and J. Droppo, "Multi-task learning in Deep Neural Networks for improved phoneme recognition," in Proc. ICASSP, 2013, pp. 6965-6969.
-
(2013)
Proc. ICASSP
, pp. 6965-6969
-
-
Seltzer, M.L.1
Droppo, J.2
-
7
-
-
71249112130
-
Offline handwriting recognition with multidimensional recurrent neural networks
-
A. Graves and J. Schmidhuber, "Offline handwriting recognition with multidimensional Recurrent Neural Networks," in NIPS, 2009.
-
(2009)
NIPS
-
-
Graves, A.1
Schmidhuber, J.2
-
8
-
-
84890543083
-
Speech recognition with deep recurrent neural networks
-
A. Graves, A. Mohamed, and G. Hinton, "Speech recognition with Deep Recurrent Neural Networks," in ICASSP, 2013, pp. 6645-6649.
-
(2013)
ICASSP
, pp. 6645-6649
-
-
Graves, A.1
Mohamed, A.2
Hinton, G.3
-
9
-
-
56449118171
-
Phrase-level phonology in speech production planning: Evidence for the role of prosodic structure
-
S. Shattuck-Hufnagel, "Phrase-level phonology in speech production planning: Evidence for the role of prosodic structure," in Prosody: Theory and Experiment. Studies Presented to Gösta Bruce, ser. Text, Speech and Language Technology, G. Bruce and M. Horne, Eds. Netherlands: Springer, 2000, vol. 14, pp. 201-229.
-
(2000)
Prosody: Theory and Experiment. Studies Presented to Gösta Bruce, Ser. Text, Speech and Language Technology
, vol.14
, pp. 201-229
-
-
Shattuck-Hufnagel, S.1
-
10
-
-
0031573117
-
Long short-term memory
-
S. Hochreiter and J. Schmidhuber, "Long short-term memory," Neural Computation, vol. 9, no. 8, pp. 1735-1780, 1997.
-
(1997)
Neural Computation
, vol.9
, Issue.8
, pp. 1735-1780
-
-
Hochreiter, S.1
Schmidhuber, J.2
-
11
-
-
0034293152
-
Learning to forget: Continual prediction with LSTM
-
F. A. Gers, J. Schmidhuber, and F. Cummins, "Learning to forget: Continual prediction with LSTM," Neural Computation, vol. 12, no. 10, pp. 2451-2471, 2000.
-
(2000)
Neural Computation
, vol.12
, Issue.10
, pp. 2451-2471
-
-
Gers, F.A.1
Schmidhuber, J.2
Cummins, F.3
-
12
-
-
0041965934
-
Learning precise timing with LSTM recurrent networks
-
F. A. Gers, N. N. Schraudolph, and J. Schmidhuber, "Learning precise timing with LSTM Recurrent Networks," J. of Machine Learning Research, vol. 3, pp. 115-143, 2002.
-
(2002)
J. of Machine Learning Research
, vol.3
, pp. 115-143
-
-
Gers, F.A.1
Schraudolph, N.N.2
Schmidhuber, J.3
-
13
-
-
84943274699
-
A direct adaptive method for faster back propagation learning: The RPROP algorithm
-
M. Riedmiller and H. Braun, "A direct adaptive method for faster back propagation learning: The RPROP algorithm," in Proc. IEEE Intnl. Conf. on Neural Networks, 1993, pp. 586-591.
-
(1993)
Proc. IEEE Intnl. Conf. on Neural Networks
, pp. 586-591
-
-
Riedmiller, M.1
Braun, H.2
-
15
-
-
33745200051
-
Speech parameter generation algorithm considering global variance for HMM-based speech synthesis
-
T. Toda and K. Tokuda, "Speech parameter generation algorithm considering global variance for HMM-based speech synthesis," in Interspeech, 2005, pp. 2801-2804.
-
(2005)
Interspeech
, pp. 2801-2804
-
-
Toda, T.1
Tokuda, K.2
-
16
-
-
80051607565
-
CROWDMOS: An approach for crowd sourcing mean opinion score studies
-
F. Ribeiro, D. Florêncio, C. Zhang, and M. Seltzer, "CROWDMOS: An approach for crowd sourcing Mean Opinion Score studies," in ICASSP, 2011, pp. 2416-2419.
-
(2011)
ICASSP
, pp. 2416-2419
-
-
Ribeiro, F.1
Florêncio, D.2
Zhang, C.3
Seltzer, M.4