-
1
-
-
85009139544
-
Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
-
Sept
-
T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," in Proc. EUROSPEECH-99, Sept. 1999, pp. 2374-2350.
-
(1999)
Proc. EUROSPEECH-99
, pp. 2374-2350
-
-
Yoshimura, T.1
Tokuda, K.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
2
-
-
51449121285
-
-
K. Tokuda, H. Zen, J. Yamagishi, T. Masuko, S. Sako, A.B. Black, and T. Nose, The HMM-based speech synthesis system (HTS) Version 2.0.1
-
K. Tokuda, H. Zen, J. Yamagishi, T. Masuko, S. Sako, A.B. Black, and T. Nose, The HMM-based speech synthesis system (HTS) Version 2.0.1, http://hts.sp.nitech.ac.jp/.
-
-
-
-
3
-
-
33846405723
-
Details of Nitech HMM-based speech synthesis system for the Blizzard Challenge 2005
-
Jan
-
H. Zen, T. Toda, M. Nakamura, and K. Tokuda, "Details of Nitech HMM-based speech synthesis system for the Blizzard Challenge 2005," IEICE Trans. Inf. & Syst., vol. E90-D, no. 1, pp. 325-333, Jan. 2007.
-
(2007)
IEICE Trans. Inf. & Syst
, vol.E90-D
, Issue.1
, pp. 325-333
-
-
Zen, H.1
Toda, T.2
Nakamura, M.3
Tokuda, K.4
-
4
-
-
51449114385
-
The Nitech-NAIST HMM-based speech synthesis system for the Blizzard Challenge 2006
-
Sept
-
H. Zen, T. Toda, and K. Tokuda, "The Nitech-NAIST HMM-based speech synthesis system for the Blizzard Challenge 2006," in Proc. Blizzard Challenge 2006, Sept. 2006.
-
(2006)
Proc. Blizzard Challenge 2006
-
-
Zen, H.1
Toda, T.2
Tokuda, K.3
-
5
-
-
77953693469
-
Speaker-independent HMM-based speech synthesis system - HTS-2007 system for the Blizzard Challenge 2007
-
Aug
-
J. Yamagishi, H. Zen, T. Toda, and K. Tokuda, "Speaker-independent HMM-based speech synthesis system - HTS-2007 system for the Blizzard Challenge 2007," in Proc. BLZ3-2007 (in Proc. SSW6), Aug. 2007.
-
(2007)
Proc. BLZ3-2007 (in Proc. SSW6)
-
-
Yamagishi, J.1
Zen, H.2
Toda, T.3
Tokuda, K.4
-
6
-
-
0032673049
-
Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
-
H. Kawahara, I. Masuda-Katsuse, and A. Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds," Speech Communication, vol. 27, pp. 187-207, 1999.
-
(1999)
Speech Communication
, vol.27
, pp. 187-207
-
-
Kawahara, H.1
Masuda-Katsuse, I.2
Cheveigné, A.3
-
7
-
-
44449177634
-
A hidden semi-Markov model-based speech synthesis system
-
May
-
H. Zen, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "A hidden semi-Markov model-based speech synthesis system," IEICE Trans. Inf. & Syst., vol. E90-D, no. 5, pp. 825-834, May 2007.
-
(2007)
IEICE Trans. Inf. & Syst
, vol.E90-D
, Issue.5
, pp. 825-834
-
-
Zen, H.1
Tokuda, K.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
8
-
-
38549096029
-
A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
-
May
-
T. Toda and K. Tokuda, "A speech parameter generation algorithm considering global variance for HMM-based speech synthesis," IEICE Trans. Inf. & Syst., vol. E90-D, no. 5, pp. 816-824, May 2007.
-
(2007)
IEICE Trans. Inf. & Syst
, vol.E90-D
, Issue.5
, pp. 816-824
-
-
Toda, T.1
Tokuda, K.2
-
9
-
-
0032638856
-
Semi-tied covariance matrices for hidden Markov models
-
Mar
-
M.J.F. Gales, "Semi-tied covariance matrices for hidden Markov models," IEEE Trans. Speech Audio Process., vol. 7, pp. 272-281, Mar. 1999.
-
(1999)
IEEE Trans. Speech Audio Process
, vol.7
, pp. 272-281
-
-
Gales, M.J.F.1
-
10
-
-
33847129573
-
Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training
-
Feb
-
J. Yamagishi and T. Kobayashi, "Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training," IEICE Trans. Inf. & Syst., vol. E90-D, no. 2, pp. 533-543, Feb. 2007.
-
(2007)
IEICE Trans. Inf. & Syst
, vol.E90-D
, Issue.2
, pp. 533-543
-
-
Yamagishi, J.1
Kobayashi, T.2
-
11
-
-
34547529978
-
Model adaptation approach to speech synthesis with diverse voices and styles
-
Apr
-
J. Yamagishi, T. Kobayashi, M. Tachibana, K. Ogata, and Y. Nakano, "Model adaptation approach to speech synthesis with diverse voices and styles," in Proc. ICASSP 2007, Apr. 2007, pp. 1233-1236.
-
(2007)
Proc. ICASSP 2007
, pp. 1233-1236
-
-
Yamagishi, J.1
Kobayashi, T.2
Tachibana, M.3
Ogata, K.4
Nakano, Y.5
-
12
-
-
34547525896
-
Acoustic model training based on linear transformation and MAP modification for HSMM-based speech synthesis
-
Sept
-
K. Ogata, M. Tachibana, J. Yamagishi, and T. Kobayashi, "Acoustic model training based on linear transformation and MAP modification for HSMM-based speech synthesis," in Proc. ICSLP 2006, Sept. 2006, pp. 1328-1331.
-
(2006)
Proc. ICSLP 2006
, pp. 1328-1331
-
-
Ogata, K.1
Tachibana, M.2
Yamagishi, J.3
Kobayashi, T.4
-
13
-
-
85133674021
-
Improved average-voice-based speech synthesis using gender-mixed modeling and a parameter generation algorithm considering GV
-
Aug
-
J. Yamagishi, T. Kobayashi, S. Renals, S. King, H. Zen, T. Toda, and K. Tokuda, "Improved average-voice-based speech synthesis using gender-mixed modeling and a parameter generation algorithm considering GV," in Proc. of 6th ISCA Workshop on Speech Synthesis, Aug. 2007.
-
(2007)
Proc. of 6th ISCA Workshop on Speech Synthesis
-
-
Yamagishi, J.1
Kobayashi, T.2
Renals, S.3
King, S.4
Zen, H.5
Toda, T.6
Tokuda, K.7
-
14
-
-
0035279111
-
A structural Bayes approach to speaker adaptation
-
Mar
-
K. Shinoda and C.H. Lee, "A structural Bayes approach to speaker adaptation," IEEE Trans. Speech Audio Process., vol. 9, pp. 276-287, Mar. 2001.
-
(2001)
IEEE Trans. Speech Audio Process
, vol.9
, pp. 276-287
-
-
Shinoda, K.1
Lee, C.H.2
-
15
-
-
51449120657
-
-
J. Ni, T. Hirai, H. Kawai, T. Toda, K. Tokuda, M. Tsuzaki, R.M∼ aia S.S∼akai, and S. Nakamura, Atrecss - atr english speech corpus for speech synthesis, in Proc. BLZ3-2007 (in Proc. SSW6), Aug. 2007.
-
J. Ni, T. Hirai, H. Kawai, T. Toda, K. Tokuda, M. Tsuzaki, R.M∼ aia S.S∼akai, and S. Nakamura, "Atrecss - atr english speech corpus for speech synthesis," in Proc. BLZ3-2007 (in Proc. SSW6), Aug. 2007.
-
-
-
-
16
-
-
51449101140
-
Festival Multisyn voices for the 2007 Blizzard Challenge
-
Aug
-
K. Richmond, V. Strom, R. Clark, J. Yamagishi, and S. Fitt, "Festival Multisyn voices for the 2007 Blizzard Challenge," in Proc. BLZ3-2007 (in Proc. SSW6), Aug. 2007.
-
(2007)
Proc. BLZ3-2007 (in Proc. SSW6)
-
-
Richmond, K.1
Strom, V.2
Clark, R.3
Yamagishi, J.4
Fitt, S.5
|