-
1
-
-
85009139544
-
Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
-
T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, “Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis,” in Proc. of EUROSPEECH, 1999.
-
(1999)
Proc. of EUROSPEECH
-
-
Yoshimura, T.1
Tokuda, K.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
2
-
-
4544291748
-
Speaking style adaptation using context clustering decision tree for HMM-based speech synthesis
-
J. Yamagishi, M. Tachibana, T. Masuko, and T. Kobayashi, “Speaking style adaptation using context clustering decision tree for HMM-based speech synthesis,” in Proc. of ICASSP, 2004.
-
(2004)
Proc. of ICASSP
-
-
Yamagishi, J.1
Tachibana, M.2
Masuko, T.3
Kobayashi, T.4
-
4
-
-
85009097254
-
Mixed-excitation for HMM-based speech synthesis
-
T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, “Mixed-excitation for HMM-based speech synthesis,” in Proc. of EUROSPEECH, 2001.
-
(2001)
Proc. of EUROSPEECH
-
-
Yoshimura, T.1
Tokuda, K.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
5
-
-
85133716335
-
A 2.4 kbits/s MELP candidate for the U.S. Fdereal Standard
-
A. McCree, K. Truong, E. George, T. Barnwell, and V. Viswanathan, “A 2.4 kbits/s MELP candidate for the U.S. Fdereal Standard,” in Proc. of ICASSP, 2006.
-
(2006)
Proc. of ICASSP
-
-
McCree, A.1
Truong, K.2
George, E.3
Barnwell, T.4
Viswanathan, V.5
-
6
-
-
0032673049
-
Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds
-
Apr
-
H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, “Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds,” Speech Communication, vol. 27, Apr. 1999.
-
(1999)
Speech Communication
, vol.27
-
-
Kawahara, H.1
Masuda-Katsuse, I.2
de Cheveigné, A.3
-
7
-
-
33846405723
-
Details of the Nitech HMM-based speech synthesis for Blizzard Challenge 2005
-
H. Zen, T. Toda, M. Nakamura, and K. Tokuda, “Details of the Nitech HMM-based speech synthesis for Blizzard Challenge 2005,” IEICE Trans. on Inf. and Systems, vol. E90-D, no. 1, 2007.
-
(2007)
IEICE Trans. on Inf. and Systems
, vol.E90-D
, Issue.1
-
-
Zen, H.1
Toda, T.2
Nakamura, M.3
Tokuda, K.4
-
8
-
-
33846406459
-
Two-band excitation for HMM-based speech synthesis
-
S. J. Kim and M. Hahn, “Two-band excitation for HMM-based speech synthesis,” IEICE Trans. Inf. & Syst., vol. E90-D, 2007.
-
(2007)
IEICE Trans. Inf. & Syst
, vol.E90-D
-
-
Kim, S. J.1
Hahn, M.2
-
11
-
-
85089837272
-
Analytic generation of synthesis units by closed loop training for totally speaker driven text to speech system (TOS drive TTS)
-
M. Akamine and T. Kagoshima, “Analytic generation of synthesis units by closed loop training for totally speaker driven text to speech system (TOS drive TTS),” in Proc. ICSLP, 1998.
-
(1998)
Proc. ICSLP
-
-
Akamine, M.1
Kagoshima, T.2
-
12
-
-
0025543906
-
Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
-
Dec
-
E. Moulines and F. Charpentier, “Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones,” Speech Communication, vol. 9, Dec. 1990.
-
(1990)
Speech Communication
, vol.9
-
-
Moulines, E.1
Charpentier, F.2
-
13
-
-
85133663448
-
Mixed excitation for HMM-based speech synthesis based on state-dependent filtering
-
R. Maia, T. Toda, H. Zen, Y. Nankaku, and K. Tokuda, “Mixed excitation for HMM-based speech synthesis based on state-dependent filtering,” in Proc. of Spring Meeting of the Acoust. Society of Japan, 2007.
-
(2007)
Proc. of Spring Meeting of the Acoust. Society of Japan
-
-
Maia, R.1
Toda, T.2
Zen, H.3
Nankaku, Y.4
Tokuda, K.5
-
16
-
-
84908144695
-
The use of Fast Fourier Transform for the estimation of power spectra: a method based on time averaging over short, modified periodograms
-
June
-
P. Welch, “The use of Fast Fourier Transform for the estimation of power spectra: a method based on time averaging over short, modified periodograms,” IEEE Trans. Audio and Electroacoustics, vol. 15, June 1967.
-
(1967)
IEEE Trans. Audio and Electroacoustics
, vol.15
-
-
Welch, P.1
-
17
-
-
85133653256
-
-
http://festvox.org/cmu arctic.
-
-
-
-
18
-
-
85016140477
-
An adaptive algorithm for mel-cepstral analysis of speech
-
T. Fukada, K. Tokuda, T. Kobayashi, and S. Imai, “An adaptive algorithm for mel-cepstral analysis of speech,” in Proc. of ICASSP, 1992.
-
(1992)
Proc. of ICASSP
-
-
Fukada, T.1
Tokuda, K.2
Kobayashi, T.3
Imai, S.4
-
19
-
-
33947629275
-
Residual conversion versus prediction on voice morphing systems
-
H. Duxans and A. Bonafonte, “Residual conversion versus prediction on voice morphing systems,” in Proc. of ICASSP, 2006.
-
(2006)
Proc. of ICASSP
-
-
Duxans, H.1
Bonafonte, A.2
|