-
1
-
-
0030362995
-
A compact model for speaker adaptive training
-
Anastasakos, T.; McDonough, J.; Schwartz, R.; Makhoul, J.; 1996. A compact model for speaker adaptive training. In: Proc. ICSLP-96, pp. 1137-1140.
-
(1996)
Proc. ICSLP-96
, pp. 1137-1140
-
-
Anastasakos, T.1
McDonough, J.2
Schwartz, R.3
Makhoul, J.4
-
2
-
-
0037382510
-
Describing the emotional states that are expressed in speech
-
R. Cowie, and R.R. Cornelius Describing the emotional states that are expressed in speech Speech Comm. 40 1-2 2003 5 32
-
(2003)
Speech Comm.
, vol.40
, Issue.1-2
, pp. 5-32
-
-
Cowie, R.1
Cornelius, R.R.2
-
3
-
-
23144458652
-
Expressive speech: Production, perception and application to speech synthesis
-
D. Erickson Expressive speech: production, perception and application to speech synthesis Acoust. Sci. Tech. 26 4 2005 317 325
-
(2005)
Acoust. Sci. Tech.
, vol.26
, Issue.4
, pp. 317-325
-
-
Erickson, D.1
-
4
-
-
0034855363
-
Multiple-regression hidden Markov model
-
Fujinaga, K.; Nakai, M.; Shimodaira, H.; Sagayama, S.; 2001. Multiple-regression hidden Markov model. In: Proc. ICASSP 2001, pp. 513-516.
-
(2001)
Proc. ICASSP 2001
, pp. 513-516
-
-
Fujinaga, K.1
Nakai, M.2
Shimodaira, H.3
Sagayama, S.4
-
5
-
-
0032050110
-
Maximum likelihood linear transformations for HMM-based speech recognition
-
M. Gales Maximum likelihood linear transformations for HMM-based speech recognition Comput. Speech Language 12 1998 75 98
-
(1998)
Comput. Speech Language
, vol.12
, pp. 75-98
-
-
Gales, M.1
-
6
-
-
0034227757
-
Cluster adaptive training of hidden Markov models
-
M. Gales Cluster adaptive training of hidden Markov models IEEE Trans. Speech Audio Process. 8 4 2000 417 428
-
(2000)
IEEE Trans. Speech Audio Process.
, vol.8
, Issue.4
, pp. 417-428
-
-
Gales, M.1
-
7
-
-
0037380318
-
A corpus-based speech synthesis system with emotion
-
A. Iida, N. Campbell, F. Higuchi, and M. Yasumura A corpus-based speech synthesis system with emotion Speech Comm. 40 1-2 2003 161 187
-
(2003)
Speech Comm.
, vol.40
, Issue.1-2
, pp. 161-187
-
-
Iida, A.1
Campbell, N.2
Higuchi, F.3
Yasumura, M.4
-
8
-
-
0032673049
-
Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
-
H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigne Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds Speech Comm. 27 3-4 1999 187 207
-
(1999)
Speech Comm.
, vol.27
, Issue.3-4
, pp. 187-207
-
-
Kawahara, H.1
Masuda-Katsuse, I.2
De Cheveigne, A.3
-
9
-
-
84865794815
-
On the use of extended context for HMM-based spontaneous conversational speech synthesis
-
Koriyama, T.; Nose, T.; Kobayashi, T.; 2011. On the use of extended context for HMM-based spontaneous conversational speech synthesis. In: Proc. INTERSPEECH 2011, pp. 2657-2660.
-
(2011)
Proc. INTERSPEECH 2011
, pp. 2657-2660
-
-
Koriyama, T.1
Nose, T.2
Kobayashi, T.3
-
10
-
-
0025475528
-
ATR Japanese speech database as a tool of speech recognition and synthesis
-
A. Kurematsu, K. Takeda, Y. Sagisaka, S. Katagiri, H. Kuwabara, and K. Shikano ATR Japanese speech database as a tool of speech recognition and synthesis Speech Comm. 9 4 1990 357 363
-
(1990)
Speech Comm.
, vol.9
, Issue.4
, pp. 357-363
-
-
Kurematsu, A.1
Takeda, K.2
Sagisaka, Y.3
Katagiri, S.4
Kuwabara, H.5
Shikano, K.6
-
12
-
-
29144493408
-
Human walking motion synthesis with desired pace and stride length based on HSMM
-
N. Niwase, J. Yamagishi, and T. Kobayashi Human walking motion synthesis with desired pace and stride length based on HSMM IEICE Trans. Inf. Syst. E88-D 11 2005 2492 2499
-
(2005)
IEICE Trans. Inf. Syst.
, vol.E88-D
, Issue.11
, pp. 2492-2499
-
-
Niwase, N.1
Yamagishi, J.2
Kobayashi, T.3
-
13
-
-
67650793657
-
HMM-based style control for expressive speech synthesis with arbitrary speaker's voice using model adaptation
-
T. Nose, M. Tachibana, and T. Kobayashi HMM-based style control for expressive speech synthesis with arbitrary speaker's voice using model adaptation IEICE Trans. Inf. Syst. E92-D 3 2009 489 497
-
(2009)
IEICE Trans. Inf. Syst.
, vol.E92-D
, Issue.3
, pp. 489-497
-
-
Nose, T.1
Tachibana, M.2
Kobayashi, T.3
-
14
-
-
51449114529
-
A style control technique for HMM-based expressive speech synthesis
-
T. Nose, J. Yamagishi, T. Masuko, and T. Kobayashi A style control technique for HMM-based expressive speech synthesis IEICE Trans. Inf. Syst. E90-D 9 2007 1406 1413
-
(2007)
IEICE Trans. Inf. Syst.
, vol.E90-D
, Issue.9
, pp. 1406-1413
-
-
Nose, T.1
Yamagishi, J.2
Masuko, T.3
Kobayashi, T.4
-
15
-
-
34047275265
-
The IBM expressive text-to-speech synthesis system for American English
-
J.F. Pitrelli, R. Bakis, E.M. Eide, R. Fernandez, W. Hamza, and M.A. Picheny The IBM expressive text-to-speech synthesis system for American English IEEE Trans. Audio Speech Language Process. 14 4 2006 1099 1108
-
(2006)
IEEE Trans. Audio Speech Language Process.
, vol.14
, Issue.4
, pp. 1099-1108
-
-
Pitrelli, J.F.1
Bakis, R.2
Eide, E.M.3
Fernandez, R.4
Hamza, W.5
Picheny, M.A.6
-
16
-
-
0001309343
-
Cue utilization in emotion attribution from auditory stimuli
-
K. Scherer, and J. Oshinsky Cue utilization in emotion attribution from auditory stimuli Motivat. Emot. 1 4 1977 331 346
-
(1977)
Motivat. Emot.
, vol.1
, Issue.4
, pp. 331-346
-
-
Scherer, K.1
Oshinsky, J.2
-
17
-
-
84971539709
-
Emotional speech synthesis: A review
-
Schröder, M.; 2001. Emotional speech synthesis: a review. In: Proc. EUROSPEECH 2001, pp. 561-564.
-
(2001)
Proc. EUROSPEECH 2001
, pp. 561-564
-
-
Schröder, M.1
-
18
-
-
84908477401
-
Hidden Markov model-based speech emotion recognition
-
Schuller, B.; Rigoll, G.; Lang, M.; 2003. Hidden Markov model-based speech emotion recognition. In: Proc. ICASSP 2003, vol. 1, pp. 401-404.
-
(2003)
Proc. ICASSP 2003
, vol.1
, pp. 401-404
-
-
Schuller, B.1
Rigoll, G.2
Lang, M.3
-
19
-
-
0033906251
-
MDL-based context-dependent subword modeling for speech recognition
-
K. Shinoda, and T. Watanabe MDL-based context-dependent subword modeling for speech recognition J. Acoust. Soc. Jpn. (E) 21 2 2000 79 86
-
(2000)
J. Acoust. Soc. Jpn. (E)
, vol.21
, Issue.2
, pp. 79-86
-
-
Shinoda, K.1
Watanabe, T.2
-
20
-
-
29144475179
-
Speech synthesis with various emotional expressions and speaking styles by style interpolation and morphing
-
M. Tachibana, J. Yamagishi, T. Masuko, and T. Kobayashi Speech synthesis with various emotional expressions and speaking styles by style interpolation and morphing IEICE Trans. Inf. Syst. E88-D 11 2005 2484 2491
-
(2005)
IEICE Trans. Inf. Syst.
, vol.E88-D
, Issue.11
, pp. 2484-2491
-
-
Tachibana, M.1
Yamagishi, J.2
Masuko, T.3
Kobayashi, T.4
-
21
-
-
38549096029
-
A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
-
T. Toda, and K. Tokuda A speech parameter generation algorithm considering global variance for HMM-based speech synthesis IEICE Trans. Inf. Syst. E90-D 5 2007 816 824
-
(2007)
IEICE Trans. Inf. Syst.
, vol.E90-D
, Issue.5
, pp. 816-824
-
-
Toda, T.1
Tokuda, K.2
-
22
-
-
84982961818
-
Constructing emotional speech synthesizers with limited speech database
-
Tsuzuki, R.; Zen, H.; Tokuda, K.; Kitamura, T.; Bulut, M.; Narayanan, S.; 2004. Constructing emotional speech synthesizers with limited speech database. In: Proc. INTERSPEECH 2004-ICSLP, pp. 1185-1188.
-
(2004)
Proc. INTERSPEECH 2004-ICSLP
, pp. 1185-1188
-
-
Tsuzuki, R.1
Zen, H.2
Tokuda, K.3
Kitamura, T.4
Bulut, M.5
Narayanan, S.6
-
23
-
-
85009177437
-
Modeling of various speaking styles and emotions for HMM-based speech synthesis
-
Yamagishi, J.; Onishi, K.; Masuko, T.; Kobayashi, T.; 2003a. Modeling of various speaking styles and emotions for HMM-based speech synthesis. In: Proc. INTERSPEECH 2003-EUROSPEECH, pp. 2461-2464.
-
(2003)
Proc. INTERSPEECH 2003-EUROSPEECH
, pp. 2461-2464
-
-
Yamagishi, J.1
Onishi, K.2
Masuko, T.3
Kobayashi, T.4
-
24
-
-
0038042801
-
A context clustering technique for average voice models
-
J. Yamagishi, M. Tamura, T. Masuko, K. Tokuda, and T. Kobayashi A context clustering technique for average voice models IEICE Trans. Inf. Syst. E86-D 3 2003 534 542
-
(2003)
IEICE Trans. Inf. Syst.
, vol.E86-D
, Issue.3
, pp. 534-542
-
-
Yamagishi, J.1
Tamura, M.2
Masuko, T.3
Tokuda, K.4
Kobayashi, T.5
-
25
-
-
0142007308
-
A training method of average voice model for HMM-based speech synthesis
-
J. Yamagishi, M. Tamura, T. Masuko, K. Tokuda, and T. Kobayashi A training method of average voice model for HMM-based speech synthesis IEICE Trans. Fundamentals E86-A 8 2003 1956 1963
-
(2003)
IEICE Trans. Fundamentals
, vol.E86-A
, Issue.8
, pp. 1956-1963
-
-
Yamagishi, J.1
Tamura, M.2
Masuko, T.3
Tokuda, K.4
Kobayashi, T.5
-
26
-
-
67650819492
-
The HTS-2008 system: Yet another evaluation of the speaker-adaptive HMM-based speech synthesis system in the 2008 Blizzard Challenge
-
Yamagishi, J.; Zen, H.; Wu, Y.; Toda, T.; Tokuda, K.; 2008. The HTS-2008 system: yet another evaluation of the speaker-adaptive HMM-based speech synthesis system in the 2008 blizzard challenge.
-
(2008)
The HTS-2008 System: Yet Another Evaluation of the Speaker-adaptive HMM-based Speech Synthesis System in the 2008 Blizzard Challenge.
-
-
Yamagishi, J.1
Zen, H.2
Wu, Y.3
Toda, T.4
Tokuda, K.5
-
27
-
-
85009139544
-
Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
-
Yoshimura, T.; Tokuda, K.; Masuko, T.; Kobayashi, T.; Kitamura, T.; 1999. Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis. In: Proc. EUROSPEECH, pp. 2347-2350.
-
(1999)
Proc. EUROSPEECH
, pp. 2347-2350
-
-
Yoshimura, T.1
Tokuda, K.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
28
-
-
79955538498
-
Context adaptive training with factorized decision trees for HMM-based statistical parametric speech synthesis
-
K. Yu, H. Zen, F. Mairesse, and S. Young Context adaptive training with factorized decision trees for HMM-based statistical parametric speech synthesis Speech Comm. 53 6 2011 914 923
-
(2011)
Speech Comm.
, vol.53
, Issue.6
, pp. 914-923
-
-
Yu, K.1
Zen, H.2
Mairesse, F.3
Young, S.4
-
29
-
-
67651002140
-
Statistical parametric speech synthesis
-
H. Zen, K. Tokuda, and A. Black Statistical parametric speech synthesis Speech Comm. 51 11 2009 1039 1064
-
(2009)
Speech Comm.
, vol.51
, Issue.11
, pp. 1039-1064
-
-
Zen, H.1
Tokuda, K.2
Black, A.3