-
1
-
-
67651002140
-
Statistical parametric speech synthesis
-
H. Zen, K. Tokuda, and A. Black, "Statistical parametric speech synthesis, " Speech Communication, vol. 51, no. 11, pp. 1039-1064, 2009.
-
(2009)
Speech Communication
, vol.51
, Issue.11
, pp. 1039-1064
-
-
Zen, H.1
Tokuda, K.2
Black, A.3
-
2
-
-
84866846705
-
Recent development of HMM-based expressive speech synthesis and its applications
-
T. Nose and T. Kobayashi, "Recent development of HMM-based expressive speech synthesis and its applications, " in Proc. APSIPA ASC 2011, 2011, http://www.apsipa.org/proceedings2011/pdf/APSIPA189.pdf.
-
(2011)
Proc. APSIPA ASC 2011
-
-
Nose, T.1
Kobayashi, T.2
-
3
-
-
85009097254
-
Mixed excitation for HMM-based speech synthesis
-
T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Mixed excitation for HMM-based speech synthesis, " in Proc. EUROSPEECH 2001, vol. 3, 2001, pp. 2263-2266.
-
(2001)
Proc. EUROSPEECH 2001
, vol.3
, pp. 2263-2266
-
-
Yoshimura, T.1
Tokuda, K.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
4
-
-
38549096029
-
A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
-
T. Toda and K. Tokuda, "A speech parameter generation algorithm considering global variance for HMM-based speech synthesis, " IEICE Trans. Inf. & Syst., vol. E90-D, no. 5, pp. 816-824, 2007.
-
(2007)
IEICE Trans. Inf. & Syst.
, vol.E90-D
, Issue.5
, pp. 816-824
-
-
Toda, T.1
Tokuda, K.2
-
5
-
-
77953715694
-
Statistical textto-speech synthesis based on segment-wise representation with a norm constraint
-
S. Tiomkin, D. Malah, and S. Shechtman, "Statistical textto-speech synthesis based on segment-wise representation with a norm constraint, " IEEE Trans. Audio, Speech, and Language Process., vol. 18, no. 5, pp. 1077-1082, 2010.
-
(2010)
IEEE Trans. Audio, Speech, and Language Process.
, vol.18
, Issue.5
, pp. 1077-1082
-
-
Tiomkin, S.1
Malah, D.2
Shechtman, S.3
-
6
-
-
84878387899
-
Histogram-based spectral equalization for HMM-based speech synthesis using MEL-LSP
-
Y. Ohtani, M. Tamura, M. Morita, T. Kagoshima, and M. Akamine, "Histogram-based spectral equalization for HMM-based speech synthesis using mel-LSP, " in Proc. INTERSPEECH 2012, 2012, pp. 1155-1158.
-
(2012)
Proc. INTERSPEECH 2012
, pp. 1155-1158
-
-
Ohtani, Y.1
Tamura, M.2
Morita, M.3
Kagoshima, T.4
Akamine, M.5
-
7
-
-
51449106803
-
Minimum generation error criterion considering global/local variance for HMM-based speech synthesis
-
Y. Wu, H. Zen, Y. Nankaku, and K. Tokuda, "Minimum generation error criterion considering global/local variance for HMM-based speech synthesis, " in Proc. ICASSP 2008, 2008, pp. 4621-4624.
-
(2008)
Proc. ICASSP 2008
, pp. 4621-4624
-
-
Wu, Y.1
Zen, H.2
Nankaku, Y.3
Tokuda, K.4
-
8
-
-
67650826181
-
Trajectory training considering global variance for HMM-based speech synthesis
-
T. Toda and S. Young, "Trajectory training considering global variance for HMM-based speech synthesis, " in Proc. ICASSP 2009, 2009, pp. 4025-4028.
-
(2009)
Proc. ICASSP 2009
, pp. 4025-4028
-
-
Toda, T.1
Young, S.2
-
9
-
-
79959847301
-
Global variance modeling on the log power spectrum of LSPS for HMM-based speech synthesis
-
Z. Ling, Y. Hu, and L. Dai, "Global variance modeling on the log power spectrum of LSPs for HMM-based speech synthesis, " in Proc. INTERSPEECH 2010, 2010, pp. 825-828.
-
(2010)
Proc. INTERSPEECH 2010
, pp. 825-828
-
-
Ling, Z.1
Hu, Y.2
Dai, L.3
-
10
-
-
80051648616
-
Global variance modeling on frequency domain delta LSP for HMM based speech synthesis
-
S. Pan, Y. Nankaku, K. Tokuda, and J. Tao, "Global variance modeling on frequency domain delta LSP for HMMbased speech synthesis, " in Proc. ICASSP 2011, 2011, pp. 4716-4719.
-
(2011)
Proc. ICASSP 2011
, pp. 4716-4719
-
-
Pan, S.1
Nankaku, Y.2
Tokuda, K.3
Tao, J.4
-
11
-
-
85008525798
-
Product of experts for statistical parametric speech synthesis
-
H. Zen, M. Gales, Y. Nankaku, and K. Tokuda, "Product of experts for statistical parametric speech synthesis, " IEEE Trans. Audio, Speech, and Language Process., vol. 20, no. 3, pp. 794-805, 2012.
-
(2012)
IEEE Trans. Audio, Speech, and Language Process.
, vol.20
, Issue.3
, pp. 794-805
-
-
Zen, H.1
Gales, M.2
Nankaku, Y.3
Tokuda, K.4
-
12
-
-
77957917902
-
Minimum generation error training for HMM-based speech synthesis
-
Y. Wu and R. Wang, "Minimum generation error training for HMM-based speech synthesis, " in Proc. ICASSP 2006, 2006, pp. 889-892.
-
(2006)
Proc. ICASSP 2006
, pp. 889-892
-
-
Wu, Y.1
Wang, R.2
-
13
-
-
33749573927
-
Reformulating the HMM as a trajectory model by imposing explicit relationships between static and dynamic feature vector sequences
-
H. Zen, K. Tokuda, and T. Kitamura, "Reformulating the HMM as a trajectory model by imposing explicit relationships between static and dynamic feature vector sequences, " Computer Speech & Language, vol. 21, no. 1, pp. 153-173, 2007.
-
(2007)
Computer Speech & Language
, vol.21
, Issue.1
, pp. 153-173
-
-
Zen, H.1
Tokuda, K.2
Kitamura, T.3
-
14
-
-
84897832343
-
A parameter generation algorithm using local variance for HMM-based speech synthesis
-
T. Nose, V. Chunwijitra, and T. Kobayashi, "A parameter generation algorithm using local variance for HMM-based speech synthesis, " IEEE Trans. Audio, Speech, and Language Process., pp. 221-228, 2014.
-
(2014)
IEEE Trans. Audio, Speech, and Language Process.
, pp. 221-228
-
-
Nose, T.1
Chunwijitra, V.2
Kobayashi, T.3
-
15
-
-
0028996993
-
Speech parameter generation from HMM using dynamic features
-
K. Tokuda, T. Kobayashi, and S. Imai, "Speech parameter generation from HMM using dynamic features, " in Proc. ICASSP-95, 1995, pp. 660-663.
-
(1995)
Proc. ICASSP-95
, pp. 660-663
-
-
Tokuda, K.1
Kobayashi, T.2
Imai, S.3
-
16
-
-
84890495160
-
Fast, low-artifact speech synthesis considering global variance
-
M. Shannon and W. Byrne, "Fast, low-artifact speech synthesis considering global variance, " in Proc. ICASSP 2013, 2013, pp. 7869-7873.
-
(2013)
Proc. ICASSP 2013
, pp. 7869-7873
-
-
Shannon, M.1
Byrne, W.2
-
17
-
-
84865754815
-
Voice conversion using GMM with enhanced global variance
-
H. Benisty and D. Malah, "Voice conversion using GMM with enhanced global variance, " in INTERSPEECH 2011, 2011, pp. 669-672.
-
(2011)
INTERSPEECH 2011
, pp. 669-672
-
-
Benisty, H.1
Malah, D.2
-
18
-
-
84901793334
-
Minimum kullback-leibler divergence parameter generation for HMM-based speech synthesis
-
Z.-H. Ling and L.-R. Dai, "Minimum Kullback-Leibler divergence parameter generation for HMM-based speech synthesis, " IEEE Trans. Audio, Speech, and Language Process., vol. 20, no. 5, pp. 1492-1502, 2012.
-
(2012)
IEEE Trans. Audio, Speech, and Language Process.
, vol.20
, Issue.5
, pp. 1492-1502
-
-
Ling, Z.-H.1
Dai, L.-R.2
-
19
-
-
0025475528
-
ATR Japanese speech database as a tool of speech recognition and synthesis
-
A. Kurematsu, K. Takeda, Y. Sagisaka, S. Katagiri, H. Kuwabara, and K. Shikano, "ATR Japanese speech database as a tool of speech recognition and synthesis, " Speech Communication, vol. 9, no. 4, pp. 357-363, 1990.
-
(1990)
Speech Communication
, vol.9
, Issue.4
, pp. 357-363
-
-
Kurematsu, A.1
Takeda, K.2
Sagisaka, Y.3
Katagiri, S.4
Kuwabara, H.5
Shikano, K.6
-
20
-
-
0032673049
-
Restructuring speech representations using a pitchadaptive time-frequency smoothing and an instantaneousfrequency-based F0 extraction: Possible role of a repetitive structure in sounds
-
H. Kawahara, I. Masuda-Katsuse, and A. deCheveigne, "Restructuring speech representations using a pitchadaptive time-frequency smoothing and an instantaneousfrequency-based F0 extraction: Possible role of a repetitive structure in sounds, " Speech Communication, vol. 27, no. 3-4, pp. 187-207, 1999.
-
(1999)
Speech Communication
, vol.27
, Issue.3-4
, pp. 187-207
-
-
Kawahara, H.1
Masuda-Katsuse, I.2
Decheveigne, A.3
-
21
-
-
44449177634
-
A hidden semi-markov model-based speech synthesis system
-
H. Zen, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "A hidden semi-Markov model-based speech synthesis system, " IEICE Trans. Inf. & Syst., vol. E90-D, no. 5, pp. 825-834, 2007.
-
(2007)
IEICE Trans. Inf. & Syst.
, vol.E90-D
, Issue.5
, pp. 825-834
-
-
Zen, H.1
Tokuda, K.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
22
-
-
0033906251
-
MDL-based contextdependent subword modeling for speech recognition
-
K. Shinoda and T. Watanabe, "MDL-based contextdependent subword modeling for speech recognition, " J. Acoust. Soc. Jpn. (E), vol. 21, no. 2, pp. 79-86, 2000.
-
(2000)
J. Acoust. Soc. Jpn. (E)
, vol.21
, Issue.2
, pp. 79-86
-
-
Shinoda, K.1
Watanabe, T.2
-
23
-
-
84890490547
-
Statistical parametric speech synthesis using deep neural networks
-
H. Zen, A. Senior, and M. Schuster, "Statistical parametric speech synthesis using deep neural networks, " in Proc. ICASSP 2013, 2013, pp. 7962-7966.
-
(2013)
Proc. ICASSP 2013
, pp. 7962-7966
-
-
Zen, H.1
Senior, A.2
Schuster, M.3
-
24
-
-
84897902941
-
Statistical parametric speech synthesis based on gaussian process regression
-
T. Koriyama, T. Nose, and T. Kobayashi, "Statistical parametric speech synthesis based on Gaussian process regression, " IEEE Trans. Audio, Speech, and Language Process., pp. 173-183, 2013.
-
(2013)
IEEE Trans. Audio, Speech, and Language Process.
, pp. 173-183
-
-
Koriyama, T.1
Nose, T.2
Kobayashi, T.3
|