-
1
-
-
0023756465
-
Speech synthesis by rule using an optimal selection of non-uniform synthesis units
-
Y. Sagisaka, "Speech synthesis by rule using an optimal selection of non-uniform synthesis units," in Proc. ICASSP, New York, NY, USA, Apr. 1988, pp. 679-682 (Pubitemid 18666106)
-
(1988)
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
, pp. 679-682
-
-
Sagisaka Yoshinori1
-
2
-
-
0027699809
-
Speech segment selection for concatenative synthesis based on spectral distortion minimization
-
N. Iwahashi, N. Kaiki, and Y. Sagisaka, "Speech segment selection for concatenative synthesis based on spectral distortion minimization," IEICE Trans., Fundamentals, vol. E76-A, no. 11, pp. 1942-1948, 1993
-
(1993)
IEICE Trans., Fundamentals
, vol.76
, Issue.11
, pp. 1942-1948
-
-
Iwahashi, N.1
Kaiki, N.2
Sagisaka, Y.3
-
3
-
-
0029765811
-
Unit selection in a concatenative speech synthesis system using a large speech database
-
May
-
A. J.Hunt and A. Black, "Unit selection in a concatenative speech synthesis system using a large speech database," in Proc. ICASSP, Atlanta, GA, USA, May 1996, pp. 373-376
-
(1996)
Proc. ICASSP, Atlanta, GA, USA
, pp. 373-376
-
-
Hunt, A.J.1
Black, A.2
-
4
-
-
85001632375
-
Corpus-based techniques in the AT&T NextGen synthesis system
-
Oct
-
A. K. Syrdal, C.W.Wightman, A. Conkie, Y. Stylianou, M. Beutnagel, J. Schroeter, V. Strom, K.-S. Lee, and M. Makashay, "Corpus-based techniques in the AT&T NextGen synthesis system," in Proc. ICSLP, Beijing, China, Oct. 2000, pp. 410-415
-
(2000)
Proc. ICSLP, Beijing, China
, pp. 410-415
-
-
Syrdal, A.K.1
Wightman, C.W.2
Conkie, A.3
Stylianou, Y.4
Beutnagel, M.5
Schroeter, J.6
Strom, V.7
Lee, K.-S.8
Makashay, M.9
-
5
-
-
67651002140
-
Statistical parametric speech synthesis
-
H. Zen, K. Tokuda, and A. Black, "Statistical parametric speech synthesis," Speech Commun., vol. 51, no. 11, pp. 1039-1064, 2009
-
(2009)
Speech Commun
, vol.51
, Issue.11
, pp. 1039-1064
-
-
Zen, H.1
Tokuda, K.2
Black, A.3
-
6
-
-
0034230270
-
Speaker interpolation for HMM-based speech synthesis system
-
T. Yoshimura, T. Masuko, K. Tokuda, T. Kobayashi, and T. Kitamura, "Speaker interpolation for HMM-based speech synthesis system," J. Acoust. Soc. Jpn. (E), vol. 21, no. 4, pp. 199-206, 2000
-
(2000)
J. Acoust. Soc. Jpn. (E)
, vol.21
, Issue.4
, pp. 199-206
-
-
Yoshimura, T.1
Masuko, T.2
Tokuda, K.3
Kobayashi, T.4
Kitamura, T.5
-
7
-
-
33847129573
-
Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training
-
DOI 10.1093/ietisy/e90-d.2.533
-
J. Yamagishi and T. Kobayashi, "Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training," IEICE Trans., Inf. Syst., vol. E90-D, no. 2, pp. 533-543, 2007 (Pubitemid 46279829)
-
(2007)
IEICE Transactions on Information and Systems
, vol.E90-D
, Issue.2
, pp. 533-543
-
-
Yamagishi, J.1
Kobayashi, T.2
-
8
-
-
51449114529
-
A style control technique forHMM-based expressive speech synthesis
-
T. Nose, J. Yamagishi, T. Masuko, and T. Kobayashi, "A style control technique forHMM-based expressive speech synthesis," IEICE Trans., Inf. Syst., vol. E90-D, no. 9, pp. 1406-1413, 2007
-
(2007)
IEICE Trans., Inf. Syst
, vol.90
, Issue.9
, pp. 1406-1413
-
-
Nose, T.1
Yamagishi, J.2
Masuko, T.3
Kobayashi, T.4
-
9
-
-
38549096029
-
A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
-
T. Toda and K. Tokuda, "A speech parameter generation algorithm considering global variance for HMM-based speech synthesis," IEICE Trans., vol. E90-D, no. 5, pp. 816-824, 2007
-
(2007)
IEICE Trans
, vol.90
, Issue.5
, pp. 816-824
-
-
Toda, T.1
Tokuda, K.2
-
11
-
-
67650816595
-
The USTC and iflytek speech synthesis systems for blizzard challenge 2007
-
Aug
-
Z. Ling, L. Qin, H. Lu, Y. Gao, L. Dai, R. Wang, Y. Jiang, Z. Zhao, J. Yang, J. Chen, and G. Hu, "The USTC and iflytek speech synthesis systems for blizzard challenge 2007," in Proc. Blizzard Challenge Workshop, Bonn, Germany, Aug. 2007
-
(2007)
Proc. Blizzard Challenge Workshop, Bonn, Germany
-
-
Ling, Z.1
Qin, L.2
Lu, H.3
Gao, Y.4
Dai, L.5
Wang, R.6
Jiang, Y.7
Zhao, Z.8
Yang, J.9
Chen, J.10
Hu, G.11
-
12
-
-
70450161678
-
Rich context modeling for high quality HMM-based TTS
-
Sep
-
Z. Yan, Q. Yao, and S. K. Frank, "Rich context modeling for high quality HMM-based TTS," in Proc. INTERSPEECH, Brighton, U.K., Sep. 2009, pp. 1755-1758
-
(2009)
Proc. INTERSPEECH, Brighton, U.K
, pp. 1755-1758
-
-
Yan, Z.1
Yao, Q.2
Frank, S.K.3
-
13
-
-
79959852154
-
An HMM trajectory tiling (HTT) approach to high quality TTS
-
Y. Qian, Z. Yan, Y. Wu, and F. K. Soong, "An HMM trajectory tiling (HTT) approach to high quality TTS," in Proc. INTERSPEECH, Chiba, Japan, Sept. 2010, pp. 422-425
-
(2010)
Proc. INTERSPEECH, Chiba, Japan, Sept
, pp. 422-425
-
-
Qian, Y.1
Yan, Z.2
Wu, Y.3
Soong, F.K.4
-
14
-
-
4544270859
-
Optimizing sub-cost functions for segment selection based on perceptual evaluations in concatenative speech synthesis
-
May
-
T. Toda, H. Kawai, and M. Tsuzaki, "Optimizing sub-cost functions for segment selection based on perceptual evaluations in concatenative speech synthesis," in Proc. ICASSP,Montreal,QC, Canada, May 2004, pp. 657-660
-
(2004)
Proc. ICASSP,Montreal,QC, Canada
, pp. 657-660
-
-
Toda, T.1
Kawai, H.2
Tsuzaki, M.3
-
15
-
-
85009139544
-
Simultaneous modeling of spectrum, pitch and duration in HMMbased speech synthesis
-
Apr
-
T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMMbased speech synthesis," in Proc. EUROSPEECH, Budapest, Hungary, Apr. 1999, pp. 2347-2350
-
(1999)
Proc. EUROSPEECH, Budapest, Hungary
, pp. 2347-2350
-
-
Yoshimura, T.1
Tokuda, K.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
17
-
-
0036522887
-
Multi-space probability distribution HMM
-
K. Tokuda, T. Masuko, B. Miyazaki, and T. Kobayashi, "Multi-space probability distribution HMM," IEICE Trans., Inf. Syst., vol. E85-D, no. 3, pp. 455-464, 2002 (Pubitemid 35353984)
-
(2002)
IEICE Transactions on Information and Systems
, vol.E85-D
, Issue.3
, pp. 455-464
-
-
Tokuda, K.1
Masuko, T.2
Miyazaki, N.3
Kobayashi, T.4
-
18
-
-
0033708106
-
Speech parameter generation algorithms for HMM-based speech synthesis
-
Jun
-
K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMM-based speech synthesis," in Proc. ICASSP, Istanbul, Turkey, Jun. 2000, pp. 1315-1318
-
(2000)
Proc. ICASSP, Istanbul, Turkey
, pp. 1315-1318
-
-
Tokuda, K.1
Yoshimura, T.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
19
-
-
85009069251
-
Decision tree backing-off in HMM-based speech synthesis
-
Oct
-
S. Kataoka, N. Mizutani, K. Tokuda, and T. Kitamura, "Decision tree backing-off in HMM-based speech synthesis," in Proc. INTERSPEECH, Jeju, Korea, Oct. 2004, vol. 2, pp. 1205-1208
-
(2004)
Proc. INTERSPEECH, Jeju, Korea
, vol.2
, pp. 1205-1208
-
-
Kataoka, S.1
Mizutani, N.2
Tokuda, K.3
Kitamura, T.4
-
20
-
-
34547503417
-
HMM-based unit selection using frame sized speech segments
-
Sep
-
Z. Ling and R. Wang, "HMM-based unit selection using frame sized speech segments," in Proc. INTERSPEECH, Pittsburgh, PA, USA, Sep. 2013
-
(2013)
Proc. INTERSPEECH, Pittsburgh, PA, USA
-
-
Ling, Z.1
Wang, R.2
-
21
-
-
29144484191
-
Concatenative speech synthesis based on the plural unit selection and fusion method
-
DOI 10.1093/ietisy/e88-d.11.2565
-
T.Mizutani and T. Kagoshima, "Concatenative speech synthesis based on the plural unit selection and fusion method," IEICE Trans. Inf. Syst., vol. E88-D, no. 11, pp. 2565-2572, 2005 (Pubitemid 41816802)
-
(2005)
IEICE Transactions on Information and Systems
, vol.E88-D
, Issue.11
, pp. 2565-2572
-
-
Mizutani, T.1
Kagoshima, T.2
-
22
-
-
0029288633
-
Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
-
C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 9, pp. 171-185, 1995
-
(1995)
Comput. Speech Lang
, vol.9
, pp. 171-185
-
-
Leggetter, C.J.1
Woodland, P.C.2
-
23
-
-
35549000218
-
Cross-validation and aggregated EM training for robust parameter estimation
-
DOI 10.1016/j.csl.2007.07.005, PII S0885230807000472
-
T. Shinozaki and M. Ostendorf, "Cross-validation and aggregated EM training for robust parameter estimation," Comput. Speech Lang., vol. 22, pp. 185-195, 2008 (Pubitemid 350016715)
-
(2008)
Computer Speech and Language
, vol.22
, Issue.2
, pp. 185-195
-
-
Shinozaki, T.1
Ostendorf, M.2
-
24
-
-
44449177634
-
Hidden semimarkovmodel based speech synthesis system
-
H. Zen, K. Tokuda, T. K. T. Masuko, and T. Kitamura, "Hidden semimarkovmodel based speech synthesis system," IEICE Trans., Inf. Syst., vol. E90-D, no. 5, pp. 825-834, 2007
-
(2007)
IEICE Trans., Inf. Syst
, vol.90
, Issue.5
, pp. 825-834
-
-
Zen, H.1
Tokuda, K.2
Masuko, T.K.T.3
Kitamura, T.4
-
25
-
-
6644226630
-
A large-scale Japanese speech database
-
Nov
-
Y. Sagisaka, K. Takeda, M. Abe, S. Katagiri, T. Umeda, and H. Kuawhara, "A large-scale Japanese speech database," in Proc. ICSLP'90, Kobe, Japan, Nov. 1990, pp. 1089-1092
-
(1990)
Proc. ICSLP'90, Kobe, Japan
, pp. 1089-1092
-
-
Sagisaka, Y.1
Takeda, K.2
Abe, M.3
Katagiri, S.4
Umeda, T.5
Kuawhara, H.6
-
26
-
-
84874199000
-
Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT
-
Sep
-
H. Kawahara, J. Estill, and O. Fujimura, "Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT"," in Proc. MAVEBA ' 01, Florence, Italy, Sep. 2001, pp. 1-6
-
(2001)
Proc. MAVEBA ' 01, Florence, Italy
, pp. 1-6
-
-
Kawahara, H.1
Estill, J.2
Fujimura, O.3
-
27
-
-
44949143155
-
Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation
-
Sep
-
Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation," in Proc. INTERSPEECH, Pittsburgh, PA, USA, Sep. 2006, pp. 2266-2269
-
(2006)
Proc. INTERSPEECH, Pittsburgh, PA, USA
, pp. 2266-2269
-
-
Ohtani, Y.1
Toda, T.2
Saruwatari, H.3
Shikano, K.4
-
28
-
-
0032673049
-
Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
-
H. Kawahara, I. Masuda-Katsuse, and A. D. Cheveigne, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Commun., vol. 27, no. 3-4, pp. 187-207, 1999.
-
(1999)
Speech Commun
, vol.27
, Issue.3-4
, pp. 187-207
-
-
Kawahara, H.1
Masuda-Katsuse, I.2
Cheveigne, A.D.3
|