-
4
-
-
84910105608
-
Measuring a decade of progress in text-to-speech
-
S. King, "Measuring a decade of progress in text-to-speech, " Loquens, vol. 1, no. 1, 2014.
-
(2014)
Loquens
, vol.1
, Issue.1
-
-
King, S.1
-
5
-
-
38549096029
-
A Speech parameter generation algorithmconsidering global variance for HMM-based speechsynthesis
-
May
-
T. Toda and K. Tokuda, "A Speech Parameter Generation AlgorithmConsidering Global Variance for HMM-Based SpeechSynthesis, " IEICE Transactions on Information and Systems, vol. E90-D, no. 5, pp. 816-824, May 2007.
-
(2007)
IEICE Transactions on Information and Systems
, vol.E90-D
, Issue.5
, pp. 816-824
-
-
Toda, T.1
Tokuda, K.2
-
6
-
-
84856237844
-
An introduction to statistical parametric speech synthesis
-
S. King, "An introduction to statistical parametric speech synthesis, "Sadhana, vol. 36, pp. 837-852, 2011.
-
(2011)
Sadhana
, vol.36
, pp. 837-852
-
-
King, S.1
-
7
-
-
0033708106
-
Speech parameter generation algorithms for HMM-basedspeech synthesis
-
K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMM-basedspeech synthesis, " Proc. ICASSP, 2000.
-
(2000)
Proc. ICASSP
-
-
Tokuda, K.1
Yoshimura, T.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
8
-
-
67651002140
-
Statistical parametricspeech synthesis
-
Nov.
-
H. Zen, K. Tokuda, and A. W. Black, "Statistical parametricspeech synthesis, " Speech Communication, vol. 51, no. 11, pp. 1039-1064, Nov. 2009.
-
(2009)
Speech Communication
, vol.51
, Issue.11
, pp. 1039-1064
-
-
Zen, H.1
Tokuda, K.2
Black, A.W.3
-
9
-
-
84946042252
-
Attributing modelling errorsinHMMsynthesis by stepping gradually from natural to modelledspeech
-
T. Merritt, J. Latorre, and S. King, "Attributing modelling errorsinHMMsynthesis by stepping gradually from natural to modelledspeech, " in Proc. ICASSP, 2015.
-
(2015)
Proc. ICASSP
-
-
Merritt, T.1
Latorre, J.2
King, S.3
-
11
-
-
84910070288
-
Investigating source and filtercontributions, and their interaction, to statistical parametricspeech synthesis
-
T. Merritt, T. Raitio, and S. King, "Investigating source and filtercontributions, and their interaction, to statistical parametricspeech synthesis, " in Proc. Interspeech, 2014, pp. 1509-1513.
-
(2014)
Proc. Interspeech
, pp. 1509-1513
-
-
Merritt, T.1
Raitio, T.2
King, S.3
-
12
-
-
84910028520
-
Measuring the perceptual effects of modelling assumptions inspeech synthesis using stimuli constructed from repeated naturalspeech
-
G. E. Henter, T. Merritt, M. Shannon, C. Mayo, and S. King, "Measuring the perceptual effects of modelling assumptions inspeech synthesis using stimuli constructed from repeated naturalspeech, " in Proc. Interspeech, 2014, pp. 1504-1508.
-
(2014)
Proc. Interspeech
, pp. 1504-1508
-
-
Henter, G.E.1
Merritt, T.2
Shannon, M.3
Mayo, C.4
King, S.5
-
13
-
-
70450161678
-
Rich context modeling forhigh quality HMM-based TTS
-
Z.-J. Yan, Y. Qian, and F. K. Soong, "Rich context modeling forhigh quality HMM-based TTS, " in Proc. Interspeech, 2009, pp. 1755-1758.
-
(2009)
Proc. Interspeech
, pp. 1755-1758
-
-
Yan, Z.-J.1
Qian, Y.2
Soong, F.K.3
-
14
-
-
78049399368
-
Rich-context unit selection ( RUS) approach to high qualityTTS
-
-, "Rich-context unit selection ( RUS) approach to high qualityTTS, " in Proc. ICASSP, 2010, pp. 4798-4801.
-
(2010)
Proc. ICASSP
, pp. 4798-4801
-
-
Yan, Z.-J.1
Qian, Y.2
Soong, F.K.3
-
15
-
-
84878421733
-
An evaluation of parameter generation methods withrich context models in HMM-based speech synthesis
-
S. Takamichi, T. Toda, Y. Shiga, H. Kawai, S. Sakti, and S. Nakamura, "An Evaluation of Parameter Generation Methods withRich Context Models in HMM-Based Speech Synthesis, " in Proc. Interspeech, 2012, pp. 1139-1142.
-
(2012)
Proc. Interspeech
, pp. 1139-1142
-
-
Takamichi, S.1
Toda, T.2
Shiga, Y.3
Kawai, H.4
Sakti, S.5
Nakamura, S.6
-
16
-
-
84897862522
-
Parameter generation methodswith richcontext models for high-quality and flexible text-to-speechsynthesis
-
S. Takamichi, T. Toda, Y. Shiga, S. Sakti, G. Neubig, S. Nakamura, and S. Member, "Parameter Generation MethodsWith RichContext Models for High-Quality and Flexible Text-To-SpeechSynthesis, " Selected Topics in Signal Processing, IEEE Journalof, vol. 8, no. 2, pp. 239-250, 2014.
-
(2014)
Selected Topics in Signal Processing, IEEE Journalof
, vol.8
, Issue.2
, pp. 239-250
-
-
Takamichi, S.1
Toda, T.2
Shiga, Y.3
Sakti, S.4
Neubig, G.5
Nakamura, S.6
Member, S.7
-
17
-
-
51449111086
-
A cross-languagestate mapping approach to bilingual (Mand arin-English) TTS
-
H. Liang, Y. Qian, F. K. Soong, and G. Liu, "A cross-languagestate mapping approach to bilingual (Mand arin-English) TTS, " inProc. ICASSP, 2008, pp. 4641-4644.
-
(2008)
Proc. ICASSP
, pp. 4641-4644
-
-
Liang, H.1
Qian, Y.2
Soong, F.K.3
Liu, G.4
-
18
-
-
84946033275
-
Deep neuralnetworks employing multi-task learning and stacked bottleneckfeatures for speech synthesis
-
Z. Wu, C. Valentini-Botinhao, O. Watts, and S. King, "Deep neuralnetworks employing multi-task learning and stacked bottleneckfeatures for speech synthesis, " in ICASSP, 2015.
-
(2015)
ICASSP
-
-
Wu, Z.1
Valentini-Botinhao, C.2
Watts, O.3
King, S.4
-
19
-
-
84910030525
-
Word embeddings for speech recognition
-
S. Bengio and G. Heigold, "Word Embeddings for Speech Recognition, "in Proc. Interspeech, 2014, pp. 1053-1057.
-
(2014)
Proc. Interspeech
, pp. 1053-1057
-
-
Bengio, S.1
Heigold, G.2
-
20
-
-
44949153641
-
The target cost formulation in unit selection speechsynthesis
-
P. Taylor, "The target cost formulation in unit selection speechsynthesis. " in Proc. Interspeech, 2006, pp. 2038-2041.
-
(2006)
Proc. Interspeech
, pp. 2038-2041
-
-
Taylor, P.1
-
21
-
-
34547516258
-
Approximating the kullback-leibler divergence between Gaussian mixture models
-
J. R. Hershey and P. a. Olsen, "Approximating the Kullback-Leibler divergence between Gaussian mixture models, " in Proc. ICASSP, 2007.
-
(2007)
Proc. ICASSP
-
-
Hershey, J.R.1
Olsen P, A.2
-
22
-
-
84994377328
-
Hurricanenatural speech corpus, [sound]
-
M. Cooke, C. Mayo, and C. Valentini-Botinhao, "Hurricanenatural speech corpus, [sound], " LISTA Consortium, doi: 10. 7488/ds/140, 2013.
-
(2013)
LISTA Consortium
-
-
Cooke, M.1
Mayo, C.2
Valentini-Botinhao, C.3
-
23
-
-
33750915991
-
STRAIGHT, exploitation of the other aspect ofVOCODER: Perceptually isomorphic decomposition of speechsounds
-
H. Kawahara, "STRAIGHT, exploitation of the other aspect ofVOCODER: Perceptually isomorphic decomposition of speechsounds, " Acoust. Sci. Technol., vol. 27, no. 6, pp. 349-353, 2006.
-
(2006)
Acoust. Sci. Technol
, vol.27
, Issue.6
, pp. 349-353
-
-
Kawahara, H.1
-
24
-
-
84883051736
-
-
Objective measurement of active speech level, ITU RecommendationITU-T P. 56, Geneva, Switzerland, March
-
Objective measurement of active speech level, ITU RecommendationITU-T P. 56, International Telecommunication Union, Telecommunication Stand ardization Sector, Geneva, Switzerland, March 2011.
-
(2011)
International Telecommunication Union, Telecommunication Stand Ardization Sector
-
-
-
25
-
-
84959114033
-
-
Method for the subjective assessment of intermediate quality levelof coding systems, ITU Recommendation ITU-R BS. 1534-1, Geneva, Switzerland, March
-
Method for the subjective assessment of intermediate quality levelof coding systems, ITU Recommendation ITU-R BS. 1534-1, InternationalTelecommunication Union Radiocommunication Assembly, Geneva, Switzerland, March 2003.
-
(2003)
InternationalTelecommunication Union Radiocommunication Assembly
-
-
-
26
-
-
84959110971
-
-
[dataset] university of Edinburgh, The Centre for Speech Technology Research(CSTR)
-
T. Merritt, J. Yamagishi, Z. Wu, O. Watts, and S. King, "Listeningtest materials for "Deep neural network context embeddings formodel selection in rich-context HMM synthesis", 2015 [dataset], "university of Edinburgh, The Centre for Speech Technology Research(CSTR), doi: 10. 7488/ds/256.
-
(2015)
Listeningtest Materials For, Deep Neural Network Context Embeddings Formodel Selection in Rich-context HMM Synthesis
-
-
Merritt, T.1
Yamagishi, J.2
Wu, Z.3
Watts, O.4
King, S.5
-
27
-
-
84959127221
-
Are we usingenough listeners No! an empirically-supported critique ofInterspeech 2014 TTS evaluations
-
M. Wester, C. Valentini-Botinhao, and G. E. Henter, "Are we usingenough listeners No! an empirically-supported critique ofInterspeech 2014 TTS evaluations, " in Proc. Interspeech, 2015.
-
(2015)
Proc. Interspeech
-
-
Wester, M.1
Valentini-Botinhao, C.2
Henter, G.E.3
|