-
1
-
-
85009139544
-
Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
-
T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," in Eurospeech, 1999, pp. 2347-2350.
-
(1999)
Eurospeech
, pp. 2347-2350
-
-
Yoshimura, T.1
Tokuda, K.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
2
-
-
0033708106
-
Speech parameter generation algorithms for HMM-based speech synthesis
-
K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMM-based speech synthesis," in ICASSP, vol. 3, 2000, pp. 1315-1318.
-
(2000)
ICASSP
, vol.3
, pp. 1315-1318
-
-
Tokuda, K.1
Yoshimura, T.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
3
-
-
33645758767
-
HMM-based approach to multilingual speech synthesis
-
S. Narayanan and A. Alwan, Eds. Prentice Hall
-
K. Tokuda, H. Zen, and A. W. Black, "HMM-based approach to multilingual speech synthesis," in Text to speech synthesis: New paradigms and advances, S. Narayanan and A. Alwan, Eds. Prentice Hall, 2004.
-
(2004)
Text to Speech Synthesis: New Paradigms and Advances
-
-
Tokuda, K.1
Zen, H.2
Black, A.W.3
-
4
-
-
33846405723
-
Details of the Nitech HMM-based speech synthesis system for the Blizzard Challenge 2005
-
DOI 10.1093/ietisy/e90-d.1.325
-
H. Zen, T. Toda, M. Nakamura, and K. Tokuda, "Details of Nitech HMM-based speech synthesis system for the Blizzard Challenge 2005," IEICE Trans. Inf. & Syst., vol. E90-D, no. 1, pp. 325-333, 2007. (Pubitemid 46145336)
-
(2007)
IEICE Transactions on Information and Systems
, vol.E90-D
, Issue.1
, pp. 325-333
-
-
Zen, H.1
Toda, T.2
Nakamura, M.3
Tokuda, K.4
-
5
-
-
34547496747
-
USTC system for Blizzard Challenge 2006: An improved HMM-based speech synthesis method
-
Z. Ling, Y. Wu, Y. Wang, L. Qin, and R. Wang, "USTC system for Blizzard Challenge 2006: an improved HMM-based speech synthesis method," in Blizzard Challenge Workshop, 2006.
-
(2006)
Blizzard Challenge Workshop
-
-
Ling, Z.1
Wu, Y.2
Wang, Y.3
Qin, L.4
Wang, R.5
-
6
-
-
85009097254
-
Mixed excitation for HMM-based speech synthesis
-
T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Mixed excitation for HMM-based speech synthesis," in Eurospeech, 2001, pp. 2263-2266.
-
(2001)
Eurospeech
, pp. 2263-2266
-
-
Yoshimura, T.1
Tokuda, K.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
7
-
-
70450161678
-
Rich context modeling for high quality HMM-based TTS
-
Z.-J. Yan, Y. Qian, and F. K. Soong, "Rich context modeling for high quality HMM-based TTS," in Interspeech, 2009, pp. 1755-1758.
-
(2009)
Interspeech
, pp. 1755-1758
-
-
Yan, Z.-J.1
Qian, Y.2
Soong, F.K.3
-
8
-
-
34547503417
-
HMM-based unit selection using frame sized speech segments
-
Z.-H. Ling and R.-H. Wang, "HMM-based unit selection using frame sized speech segments," in Interspeech, 2006, pp. 2034-2037.
-
(2006)
Interspeech
, pp. 2034-2037
-
-
Ling, Z.-H.1
Wang, R.-H.2
-
9
-
-
33745200051
-
Speech parameter generation algorithm considering global variance for HMM-based speech synthesis
-
T. Toda and K. Tokuda, "Speech parameter generation algorithm considering global variance for HMM-based speech synthesis," in Interspeech, 2005, pp. 2801-2804.
-
(2005)
Interspeech
, pp. 2801-2804
-
-
Toda, T.1
Tokuda, K.2
-
10
-
-
51449106803
-
Minimum generation error criterion considering global/local variance for HMM-based speech synthesis
-
Y.-J. Wu, H. Zen, Y. Nankaku, and K. Tokuda, "Minimum generation error criterion considering global/local variance for HMM-based speech synthesis," in ICASSP, 2008, pp. 4621-4624.
-
(2008)
ICASSP
, pp. 4621-4624
-
-
Wu, Y.-J.1
Zen, H.2
Nankaku, Y.3
Tokuda, K.4
-
11
-
-
0001810975
-
Line spectrum representation of linear predictive coefficients of speech signals
-
F. Itakura, "Line spectrum representation of linear predictive coefficients of speech signals," J. Acoust. Soc. Am., vol. 57, p. S35, 1975.
-
(1975)
J. Acoust. Soc. Am.
, vol.57
-
-
Itakura, F.1
-
12
-
-
0032673049
-
Restructuring speech representations using pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
-
H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigne, "Restructuring speech representations using pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds," Speech Communication, vol. 27, pp. 187-207, 1999.
-
(1999)
Speech Communication
, vol.27
, pp. 187-207
-
-
Kawahara, H.1
Masuda-Katsuse, I.2
De Cheveigne, A.3