-
1
-
-
67651002140
-
Statistical parametric speech synthesis
-
Heiga Zen, Keiichi Tokuda, and Alan W Black, Statistical parametric speech synthesis, Speech Communication, vol. 51, no. 11, pp. 1039-1064, 2009
-
(2009)
Speech Communication
, vol.51
, Issue.11
, pp. 1039-1064
-
-
Zen, H.1
Tokuda, K.2
Black, A.W.3
-
2
-
-
84910105608
-
Measuring a decade of progress in text-to-speech
-
Simon King, Measuring a decade of progress in text-to-speech, Loquens, vol. 1, no. 1, 2014
-
(2014)
Loquens
, vol.1
, Issue.1
-
-
King, S.1
-
5
-
-
38549096029
-
A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
-
Tomoki Toda and Keiichi Tokuda, A speech parameter generation algorithm considering global variance for HMM-based speech synthesis, IEICE Transactions on Information and Systems, vol. 90, no. 5, pp. 816-824, 2007
-
(2007)
IEICE Transactions on Information and Systems
, vol.90
, Issue.5
, pp. 816-824
-
-
Toda, T.1
Tokuda, K.2
-
6
-
-
33749573927
-
Reformulating the HMM as a trajectory model by imposing explicit relationships between static and dynamic feature vector sequences
-
Heiga Zen, Keiichi Tokuda, and Tadashi Kitamura, Reformulating the HMM as a trajectory model by imposing explicit relationships between static and dynamic feature vector sequences, Computer Speech & Language, vol. 21, no. 1, pp. 153-173, 2007
-
(2007)
Computer Speech & Language
, vol.21
, Issue.1
, pp. 153-173
-
-
Zen, H.1
Tokuda, K.2
Kitamura, T.3
-
7
-
-
84890490547
-
Statistical parametric speech synthesis using deep neural networks
-
Heiga Zen, Andrew Senior, and Mike Schuster, Statistical parametric speech synthesis using deep neural networks, in Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2013
-
(2013)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Zen, H.1
Senior, A.2
Schuster, M.3
-
8
-
-
84890527090
-
Multidistribution deep belief network for speech synthesis
-
Shiyin Kang, Xiaojun Qian, and Helen Meng, Multidistribution deep belief network for speech synthesis, in Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2013
-
(2013)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Kang, S.1
Qian, X.2
Meng, H.3
-
9
-
-
84901237776
-
Modeling spectral envelopes using Restricted Boltzmann Machines and Deep Belief Networks for statistical parametric speech synthesis
-
Zhen-Hua Ling, Li Deng, and Dong Yu, Modeling spectral envelopes using Restricted Boltzmann Machines and Deep Belief Networks for statistical parametric speech synthesis, IEEE Transactions on Audio, Speech, and Language Processing, vol. 21, no. 10, pp. 2129-2139, 2013
-
(2013)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.21
, Issue.10
, pp. 2129-2139
-
-
Ling, Z.-H.1
Deng, L.2
Yu, D.3
-
10
-
-
84929157442
-
Combining a vector space representation of linguistic context with a deep neural network for text-to-speech synthesis
-
Heng Lu, Simon King, and Oliver Watts, Combining a vector space representation of linguistic context with a deep neural network for text-to-speech synthesis, Proc. the 8th ISCA Speech Synthesis Workshop (SSW), 2013
-
(2013)
Proc. the 8th ISCA Speech Synthesis Workshop (SSW)
-
-
Lu, H.1
King, S.2
Watts, O.3
-
11
-
-
84905251808
-
On the training aspects of deep neural network (DNN) for parametric TTS synthesis
-
Yao Qian, Yuchen Fan, Wenping Hu, and Frank K Soong, On the training aspects of deep neural network (DNN) for parametric TTS synthesis, in Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2014
-
(2014)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Qian, Y.1
Fan, Y.2
Hu, W.3
Soong, F.K.4
-
12
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
-
G. Hinton, Li Deng, Dong Yu, G.E. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T.N. Sainath, and B. Kingsbury, Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups, IEEE Signal Processing Magazine, vol. 29, no. 6, pp. 82-97, 2012
-
(2012)
IEEE Signal Processing Magazine
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.E.4
Mohamed, A.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.N.10
Kingsbury, B.11
-
14
-
-
84910047819
-
TTS synthesis with bidirectional LSTM based recurrent neural networks
-
Yuchen Fan, Yao Qian, Fenglong Xie, and Frank K. Soong, TTS synthesis with bidirectional LSTM based recurrent neural networks, in Proc. Interspeech, 2014
-
(2014)
Proc. Interspeech
-
-
Fan, Y.1
Qian, Y.2
Xie, F.3
Soong, F.K.4
-
16
-
-
56449095373
-
A unified architecture for natural language processing: Deep neural networks with multitask learning
-
Ronan Collobert and Jason Weston, A unified architecture for natural language processing: Deep neural networks with multitask learning, in Proc. Int. Conf. on Machine Learning (ICML), 2008
-
(2008)
Proc. Int. Conf. on Machine Learning (ICML)
-
-
Collobert, R.1
Weston, J.2
-
17
-
-
0033708106
-
Speech parameter generation algorithms for HMM-based speech synthesis
-
Keiichi Tokuda, Takayoshi Yoshimura, Takashi Masuko, Takao Kobayashi, and Tadashi Kitamura, Speech parameter generation algorithms for HMM-based speech synthesis, in Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2000
-
(2000)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Tokuda, K.1
Yoshimura, T.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
18
-
-
84865785753
-
Improved bottleneck features using pretrained deep neural networks
-
Dong Yu and Michael L Seltzer, Improved bottleneck features using pretrained deep neural networks, in Proc. Interspeech, 2011
-
(2011)
Proc. Interspeech
-
-
Yu, D.1
Seltzer, M.L.2
-
19
-
-
84867593213
-
Auto-encoder bottleneck features using deep belief networks
-
Tara N Sainath, Brian Kingsbury, and Bhuvana Ramabhadran, Auto-encoder bottleneck features using deep belief networks, in Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2012
-
(2012)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Sainath, T.N.1
Kingsbury, B.2
Ramabhadran, B.3
-
20
-
-
84890482429
-
Extracting deep bottleneck features using stacked autoencoders
-
Jonas Gehring, Yajie Miao, Florian Metze, and Alex Waibel, Extracting deep bottleneck features using stacked autoencoders, in Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2013
-
(2013)
Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Gehring, J.1
Miao, Y.2
Metze, F.3
Waibel, A.4
-
21
-
-
0032673049
-
Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
-
Hideki Kawahara, Ikuyo Masuda-Katsuse, and Alain de Cheveigné, Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds, Speech Communication, vol. 27, no. 3, pp. 187-207, 1999
-
(1999)
Speech Communication
, vol.27
, Issue.3
, pp. 187-207
-
-
Kawahara, H.1
Masuda-Katsuse, I.2
de Cheveigné, A.3
-
23
-
-
0345443172
-
Glimpsing speech
-
Martin Cooke, Glimpsing speech, Journal of Phonetics, vol. 31, pp. 579-584, 2003
-
(2003)
Journal of Phonetics
, vol.31
, pp. 579-584
-
-
Cooke, M.1
-
24
-
-
84857819132
-
Theano: A CPU and GPU math expression compiler
-
James Bergstra, Olivier Breuleux, Frédéric Bastien, Pascal Lamblin, Razvan Pascanu, Guillaume Desjardins, Joseph Turian, David Warde-Farley, and Yoshua Bengio, Theano: a CPU and GPU math expression compiler, in Proceedings of the Python for Scientific Computing Conference (SciPy), June 2010
-
(2010)
Proceedings of the Python for Scientific Computing Conference (SciPy)
-
-
Bergstra, J.1
Breuleux, O.2
Bastien, F.3
Lamblin, P.4
Pascanu, R.5
Desjardins, G.6
Turian, J.7
Warde-Farley, D.8
Bengio, Y.9