-
4
-
-
84910105608
-
Measuring a decade of progress in text-to-speech
-
Simon King, "Measuring a decade of progress in text-to-speech, " Loquens, vol. 1, no. 1, 2014.
-
(2014)
Loquens
, vol.1
, Issue.1
-
-
King, S.1
-
6
-
-
84910070288
-
Investigating source and filter contributions, and their interaction, to statistical parametric speech synthesis
-
Thomas Merritt, Tuomo Raitio, and Simon King, "Investigating source and filter contributions, and their interaction, to statistical parametric speech synthesis, " in Proc. Interspeech, 2014, pp. 1509-1513.
-
(2014)
Proc. Interspeech
, pp. 1509-1513
-
-
Merritt, T.1
Raitio, T.2
King, S.3
-
7
-
-
84910028520
-
Measuring the perceptual effects of modelling assumptions in speech synthesis using stimuli constructed from repeated natural speech
-
Gustav Eje Henter, Thomas Merritt, Matt Shannon, Catherine Mayo, and Simon King, "Measuring the perceptual effects of modelling assumptions in speech synthesis using stimuli constructed from repeated natural speech, " in Proc. Interspeech, 2014, pp. 1504-1508.
-
(2014)
Proc. Interspeech
, pp. 1504-1508
-
-
Eje Henter, G.1
Merritt, T.2
Shannon, M.3
Mayo, C.4
King, S.5
-
8
-
-
84946042252
-
Attributing modelling errors in HMM synthesis by stepping gradually from natural to modelled speech
-
Thomas Merritt, Javier Latorre, and Simon King, "Attributing modelling errors in HMM synthesis by stepping gradually from natural to modelled speech, " in Proc. ICASSP, 2015.
-
(2015)
Proc. ICASSP
-
-
Merritt, T.1
Latorre, J.2
King, S.3
-
9
-
-
84959122693
-
Deep neural network context embeddings for model selection in rich-context HMM synthesis
-
Thomas Merritt, Junichi Yamagishi, Zhizheng Wu, Oliver Watts, and Simon King, "Deep neural network context embeddings for model selection in rich-context HMM synthesis, " in Proc. Interspeech, 2015.
-
(2015)
Proc. Interspeech
-
-
Merritt, T.1
Yamagishi, J.2
Wu, Z.3
Watts, O.4
King, S.5
-
10
-
-
0029765811
-
Unit selection in a concatenative speech synthesis system using a large speech database
-
Andrew J Hunt and Alan W Black, "Unit selection in a concatenative speech synthesis system using a large speech database, " in Proc. ICASSP, 1996, pp. 373-376.
-
(1996)
Proc. ICASSP
, pp. 373-376
-
-
Hunt, A.J.1
Black, A.W.2
-
11
-
-
85133526552
-
Automatically clustering similar units for unit selection in speech synthesis
-
Alan W Black and Paul A Taylor, "Automatically clustering similar units for unit selection in speech synthesis., " in Proc. Eurospeech, 1997.
-
(1997)
Proc. Eurospeech
-
-
Black, A.W.1
Taylor, P.A.2
-
13
-
-
44949153641
-
The target cost formulation in unit selection speech synthesis
-
Paul Taylor, "The target cost formulation in unit selection speech synthesis., " in Proc. Interspeech, 2006, pp. 2038-2041.
-
(2006)
Proc. Interspeech
, pp. 2038-2041
-
-
Taylor, P.1
-
14
-
-
84871382567
-
A unified trajectory tiling approach to high quality speech rendering
-
Yao Qian, Frank K Soong, and Zhi-Jie Yan, "A unified trajectory tiling approach to high quality speech rendering, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 21, no. 2, pp. 280-290, 2013.
-
(2013)
Audio, Speech, and Language Processing, IEEE Transactions on
, vol.21
, Issue.2
, pp. 280-290
-
-
Qian, Y.1
Soong, F.K.2
Yan, Z.3
-
15
-
-
84901015944
-
The USTC system for Blizzard Challenge 2008
-
Zhen-Hua Ling, Heng Lu, Guo-Ping Hu, Li-Rong Dai, and Ren-Hua Wang, "The USTC system for Blizzard Challenge 2008, " in Proc. Blizzard Challenge, 2008.
-
(2008)
Proc. Blizzard Challenge
-
-
Ling, Z.1
Lu, H.2
Hu, G.3
Dai, L.4
Wang, R.5
-
16
-
-
78049399368
-
Rich-context unit selection (RUS) approach to high quality TTS
-
Zhi-Jie Yan, Yao Qian, and Frank K Soong, "Rich-context unit selection (RUS) approach to high quality TTS, " in Proc. ICASSP, 2010, pp. 4798-4801.
-
(2010)
Proc. ICASSP
, pp. 4798-4801
-
-
Yan, Z.1
Qian, Y.2
Soong, F.K.3
-
17
-
-
84867217260
-
Synthesis by generation and concatenation of multiform segments
-
Vincent Pollet and Andrew Breen, "Synthesis by generation and concatenation of multiform segments., " in Proc. Interspeech, 2008, pp. 1825-1828.
-
(2008)
Proc. Interspeech
, pp. 1825-1828
-
-
Pollet, V.1
Breen, A.2
-
18
-
-
84865718211
-
Uniform speech parameterization for multi-form segment synthesis
-
Alexander Sorin, Slava Shechtman, and Vincent Pollet, "Uniform Speech Parameterization for Multi-form Segment Synthesis, " in Proc. Interspeech, 2011, pp. 337-340.
-
(2011)
Proc. Interspeech
, pp. 337-340
-
-
Sorin, A.1
Shechtman, S.2
Pollet, V.3
-
19
-
-
84878557723
-
Psychoacoustic segment scoring for multi-form speech synthesis
-
Alexander Sorin, Slava Shechtman, and Vincent Pollet, "Psychoacoustic Segment Scoring for Multi-Form Speech Synthesis., " in Proc. Interspeech, 2012, pp. 2214-2217.
-
(2012)
Proc. Interspeech
, pp. 2214-2217
-
-
Sorin, A.1
Shechtman, S.2
Pollet, V.3
-
20
-
-
84910091105
-
Refined intersegment joining in multi-form speech synthesis
-
Alexander Sorin, Slava Shechtman, and Vincent Pollet, "Refined Intersegment Joining in Multi-Form Speech Synthesis, " in Proc. Interspeech, 2014, pp. 790-794.
-
(2014)
Proc. Interspeech
, pp. 790-794
-
-
Sorin, A.1
Shechtman, S.2
Pollet, V.3
-
21
-
-
84959124410
-
Using deep bidirectional recurrent neural networks for prosodic-target prediction in a unit-selection text-to-speech system
-
Raul Fernandez, Asaf Rendel, Bhuvana Ramabhadran, and Ron Hoory, "Using Deep Bidirectional Recurrent Neural Networks for Prosodic-Target Prediction in a Unit-Selection Text-to-Speech System, " in Proc. Interspeech, 2015.
-
(2015)
Proc. Interspeech
-
-
Fernandez, R.1
Rendel, A.2
Ramabhadran, B.3
Hoory, R.4
-
22
-
-
84946033275
-
Deep neural networks employing multi-task learning and stacked bottleneck features for speech synthesis
-
Zhizheng Wu, Cassia Valentini-Botinhao, Oliver Watts, and Simon King, "Deep neural networks employing multi-task learning and stacked bottleneck features for speech synthesis, " in Proc. ICASSP, 2015.
-
(2015)
Proc. ICASSP
-
-
Wu, Z.1
Valentini-Botinhao, C.2
Watts, O.3
King, S.4
-
23
-
-
85032750981
-
Deep learning for acoustic modeling in parametric speech generation: A systematic review of existing techniques and future trends
-
Zhen-Hua Ling, Shi-Yin Kang, Heiga Zen, Andrew Senior, Mike Schuster, Xiao-Jun Qian, Helen M Meng, and Li Deng, "Deep learning for acoustic modeling in parametric speech generation: A systematic review of existing techniques and future trends, " IEEE Signal Processing Magazine, vol. 32, no. 3, pp. 35-52, 2015.
-
(2015)
IEEE Signal Processing Magazine
, vol.32
, Issue.3
, pp. 35-52
-
-
Ling, Z.1
Kang, S.2
Zen, H.3
Senior, A.4
Schuster, M.5
Qian, X.6
Meng, H.M.7
Deng, L.8
-
24
-
-
84973282956
-
Acoustic modeling in statistical parametric speech synthesis-from hmm to lstm-rnn
-
Heiga Zen, "Acoustic Modeling in Statistical Parametric Speech Synthesis-From HMM to LSTM-RNN, " in Proc. MLSLP, 2015.
-
(2015)
Proc. MLSLP
-
-
Zen, H.1
-
25
-
-
0032073761
-
An RNNbased prosodic information synthesizer for Mandarin text-to-speech
-
Sin-Horng Chen, Shaw-Hwa Hwang, and Yih-Ru Wang, "An RNNbased prosodic information synthesizer for Mandarin text-to-speech, " IEEE Transactions on Speech and Audio Processing, vol. 6, no. 3, pp. 226-239, 1998.
-
(1998)
IEEE Transactions on Speech and Audio Processing
, vol.6
, Issue.3
, pp. 226-239
-
-
Chen, S.1
Hwang, S.2
Wang, Y.3
-
26
-
-
70450161678
-
Rich context modeling for high quality HMM-based TTS
-
Zhi-Jie Yan, Yao Qian, and Frank K Soong, "Rich context modeling for high quality HMM-based TTS, " in Proc. Interspeech, 2009, pp. 1755-1758.
-
(2009)
Proc. Interspeech
, pp. 1755-1758
-
-
Yan, Z.1
Qian, Y.2
Soong, F.K.3
-
27
-
-
84973381105
-
Festival 2-build your own general purpose unit selection speech synthesiser
-
Robert AJ Clark, Korin Richmond, and Simon King, "Festival 2-build your own general purpose unit selection speech synthesiser, " in Proc. SSW5, 2004.
-
(2004)
Proc. SSW5
-
-
Clark, R.A.J.1
Richmond, K.2
King, S.3
-
28
-
-
34047123652
-
Multisyn: Opendomain unit selection for the festival speech synthesis system
-
Robert AJ Clark, Korin Richmond, and Simon King, "Multisyn: Opendomain unit selection for the festival speech synthesis system, " Speech Communication, vol. 49, no. 4, pp. 317-330, 2007.
-
(2007)
Speech Communication
, vol.49
, Issue.4
, pp. 317-330
-
-
Clark, R.A.J.1
Richmond, K.2
King, S.3
-
29
-
-
34547516258
-
Approximating the Kullback-Leibler divergence between Gaussian mixture models
-
John R. Hershey and Peder A. Olsen, "Approximating the Kullback-Leibler divergence between Gaussian mixture models, " in Proc. ICASSP, 2007.
-
(2007)
Proc. ICASSP
-
-
Hershey, J.R.1
Olsen, P.A.2
-
30
-
-
85133720638
-
The HMM-based speech synthesis system (HTS) version 2. 0
-
Heiga Zen, Takashi Nose, Junichi Yamagishi, Shinji Sako, Takashi Masuko, Alan W. Black, and Keiichi Tokuda, "The HMM-based speech synthesis system (HTS) version 2. 0, " in Proc. SSW6, 2007, pp. 294-299.
-
(2007)
Proc. SSW6
, pp. 294-299
-
-
Zen, H.1
Nose, T.2
Yamagishi, J.3
Sako, S.4
Masuko, T.5
Black, A.W.6
Tokuda, K.7
-
31
-
-
84959135757
-
Minimum trajectory error training for deep neural networks, combined with stacked bottleneck features
-
Zhizheng Wu and Simon King, "Minimum trajectory error training for deep neural networks, combined with stacked bottleneck features, " in Proc. Interspeech, 2015.
-
(2015)
Proc. Interspeech
-
-
Wu, Z.1
King, S.2
-
32
-
-
84994377328
-
Hurricane natural speech corpus, [sound]
-
Martin Cooke, Catherine Mayo, and Cassia Valentini-Botinhao, "Hurricane natural speech corpus, [sound], " LISTA Consortium, doi: 10. 7488/ds/140, 2013.
-
(2013)
LISTA Consortium
-
-
Cooke, M.1
Mayo, C.2
Valentini-Botinhao, C.3
-
35
-
-
84973403758
-
Listening test materials for
-
[dataset] University of Edinburgh, The Centre for Speech Technology Research (CSTR)
-
Thomas Merritt, Robert A. J. Clark, Zhizheng Wu, Junichi Yamagishi, and Simon King, "Listening test materials for "Deep neural network-guided unit selection synthesis", 2016 [dataset], " University of Edinburgh, The Centre for Speech Technology Research (CSTR), doi: 10. 7488/ds/1313.
-
(2016)
Deep Neural Network-guided Unit Selection Synthesis
-
-
Merritt, T.1
Clark, J.R.A.2
Wu, Z.3
Yamagishi, J.4
King, S.5
-
36
-
-
0002609530
-
Optimal coupling of diphones
-
Springer
-
Alistair D. Conkie and Stephen Isard, "Optimal coupling of diphones, " in Progress in speech synthesis, pp. 293-304. Springer, 1997.
-
(1997)
Progress in Speech Synthesis
, pp. 293-304
-
-
Conkie, A.D.1
Isard, S.2
|