-
1
-
-
85009139544
-
Simultaneous Modeling of Spectrum, Pitch and Duration in HMM-Based Speech Synthesis
-
T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, “Simultaneous Modeling of Spectrum, Pitch and Duration in HMM-Based Speech Synthesis,” Proc. of Eurospeech, pp. 2347-2350, 1999.
-
(1999)
Proc. of Eurospeech
, pp. 2347-2350
-
-
Yoshimura, T.1
Tokuda, K.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
3
-
-
85135145847
-
Speaker Interpolation in HMM-Based Speech Synthesis System
-
T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, “Speaker Interpolation in HMM-Based Speech Synthesis System,” Proc. of Eurospeech, pp. 2523-2526, 1997.
-
(1997)
Proc. of Eurospeech
, pp. 2523-2526
-
-
Yoshimura, T.1
Tokuda, K.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
4
-
-
85009257840
-
Eigenvoices for HMM-Based Speech Synthesis
-
K. Shichiri, A. Sawabe, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, “Eigenvoices for HMM-Based Speech Synthesis,” Proc. of ICSLP, pp. 1269-1272, 2002.
-
(2002)
Proc. of ICSLP
, pp. 1269-1272
-
-
Shichiri, K.1
Sawabe, A.2
Tokuda, K.3
Masuko, T.4
Kobayashi, T.5
Kitamura, T.6
-
5
-
-
50249141145
-
An HMM-Based Singing Voice Synthesis System
-
K. Saino, H. Zen, Y. Nankaku, A. Lee, and K. Tokuda, “An HMM-Based Singing Voice Synthesis System,” Proc. of ICSLP, pp. 1141-1144, 2006.
-
(2006)
Proc. of ICSLP
, pp. 1141-1144
-
-
Saino, K.1
Zen, H.2
Nankaku, Y.3
Lee, A.4
Tokuda, K.5
-
13
-
-
0038000318
-
Spectral Estimation of Speech by Mel-Generalized Cepstral Analysis
-
K. Tokuda, T. Kobayashi, T. Chiba, and S. Imai, “Spectral Estimation of Speech by Mel-Generalized Cepstral Analysis,” IEICE Trans., vol. 75-A, no. 7, pp. 1124-1134, 1992.
-
(1992)
IEICE Trans
, vol.75-A
, Issue.7
, pp. 1124-1134
-
-
Tokuda, K.1
Kobayashi, T.2
Chiba, T.3
Imai, S.4
-
14
-
-
0033708106
-
Speech Parameter Generation Algorithms for HMM-Based Speech Synthesis
-
K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, “Speech Parameter Generation Algorithms for HMM-Based Speech Synthesis,” Proc. of ICASSP, pp. 1315-1318, 2000.
-
(2000)
Proc. of ICASSP
, pp. 1315-1318
-
-
Tokuda, K.1
Yoshimura, T.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
15
-
-
0020596154
-
Cepstral Analysis Synthesis on the Mel Frequency Scale
-
S. Imai, “Cepstral Analysis Synthesis on the Mel Frequency Scale,” Proc. of ICASSP, pp. 93-96, 1983.
-
(1983)
Proc. of ICASSP
, pp. 93-96
-
-
Imai, S.1
-
16
-
-
44449177634
-
A Hidden Semi-Markov Model-Based Speech Synthesis System
-
H. Zen, T. Masuko, K. Tokuda, T. Kobayashi, and T. Kitamura, “A Hidden Semi-Markov Model-Based Speech Synthesis System,” IEICE Trans. Inf. & Syst., vol. E90-D, no. 5, pp. 825-834, 2007.
-
(2007)
IEICE Trans. Inf. & Syst
, vol.E90-D
, Issue.5
, pp. 825-834
-
-
Zen, H.1
Masuko, T.2
Tokuda, K.3
Kobayashi, T.4
Kitamura, T.5
-
17
-
-
68749108220
-
A Fully Consistent Hidden Semi-Markov Model-Based Speech Recognition System
-
K. Oura, H. Zen, Y. Nankaku, A. Lee, and K. Tokuda, “A Fully Consistent Hidden Semi-Markov Model-Based Speech Recognition System,” IEICE Trans. Inf. and Syst., vol. E91-D, no. 11, pp. 2693-2700, 2008.
-
(2008)
IEICE Trans. Inf. and Syst
, vol.E91-D
, Issue.11
, pp. 2693-2700
-
-
Oura, K.1
Zen, H.2
Nankaku, Y.3
Lee, A.4
Tokuda, K.5
-
18
-
-
0025475528
-
ATR Japanese Speech Database as a Tool of Speech Recognition and Synthesis
-
A. Kurematsu, K. Takeda, Y. Sagisaka, S. Katagiri, H. Kuwabara, and K. Shikano, “ATR Japanese Speech Database as a Tool of Speech Recognition and Synthesis,” Speech Communication, vol. 9, pp. 357-363, 1990.
-
(1990)
Speech Communication
, vol.9
, pp. 357-363
-
-
Kurematsu, A.1
Takeda, K.2
Sagisaka, Y.3
Katagiri, S.4
Kuwabara, H.5
Shikano, K.6
-
19
-
-
79959831939
-
HMM-Based Singing Voice Synthesis System Using Pitch-Shifted Pseudo Training Data
-
(to be published)
-
A. Mase, K. Oura, Y. Nankaku, and K. Tokuda, “HMM-Based Singing Voice Synthesis System Using Pitch-Shifted Pseudo Training Data,” Proc. of Interspeech, 2010 (to be published).
-
(2010)
Proc. of Interspeech
-
-
Mase, A.1
Oura, K.2
Nankaku, Y.3
Tokuda, K.4
-
20
-
-
0033906251
-
MDL-Based Context-Dependent Subword Modeling for Speech Recognition
-
K. Shinoda and T. Watanabe, “MDL-Based Context-Dependent Subword Modeling for Speech Recognition,” J. Acoust. Soc. Jpn. (E), vol. 21, no. 2, pp. 79-86, 2000.
-
(2000)
J. Acoust. Soc. Jpn. (E)
, vol.21
, Issue.2
, pp. 79-86
-
-
Shinoda, K.1
Watanabe, T.2
-
21
-
-
85063493294
-
Vibrato Modeling for HMM-Based Singing Voice Synthesis
-
(in Japanese)
-
T. Yamada, S. Muto, Y. Nankaku, S. Sako, and K. Tokuda, “Vibrato Modeling for HMM-Based Singing Voice Synthesis,” Proc. of Information Processing Society of Japan, vol. 2009-MUS-80, no. 5, pp. 1-6, 2009 (in Japanese).
-
(2009)
Proc. of Information Processing Society of Japan
, vol.2009-MUS-80
, Issue.5
, pp. 1-6
-
-
Yamada, T.1
Muto, S.2
Nankaku, Y.3
Sako, S.4
Tokuda, K.5
-
22
-
-
44949192112
-
An Automatic Singing Skill Evaluation Method for Unknown Melodies Using Pitch Interval Accuracy and Vibrato Features
-
T. Nakano, M. Goto, and Y. Hiraga, “An Automatic Singing Skill Evaluation Method for Unknown Melodies Using Pitch Interval Accuracy and Vibrato Features,” Proc. of Interspeech, pp. 1706-1709, 2006.
-
(2006)
Proc. of Interspeech
, pp. 1706-1709
-
-
Nakano, T.1
Goto, M.2
Hiraga, Y.3
-
24
-
-
44949247517
-
A Musical Ornament, the Vibrato
-
McGraw-Hill Book Company
-
C. E. Seashore, “A Musical Ornament, the Vibrato,” in Psychology of Music, McGraw-Hill Book Company, pp. 33-52, 1938.
-
(1938)
Psychology of Music
, pp. 33-52
-
-
Seashore, C. E.1
-
25
-
-
85133408098
-
Reducing Computational Cost of Training for HMM-Based Singing Voice Synthesis Using Note Boundaries
-
2-7-8, (in Japanese)
-
S. Muto, K. Oura, Y. Nankaku, and K. Tokuda, “Reducing Computational Cost of Training for HMM-Based Singing Voice Synthesis Using Note Boundaries,” Proc. of Acoustical Society of Japan Spring Meeting, vol. I, 2-7-8, pp. 347-348, 2009 (in Japanese).
-
(2009)
Proc. of Acoustical Society of Japan Spring Meeting
, vol.I
, pp. 347-348
-
-
Muto, S.1
Oura, K.2
Nankaku, Y.3
Tokuda, K.4
-
27
-
-
0032673049
-
Restructuring Speech Representations Using a Pitch-Adaptive Time-Frequency Smoothing and an Instantaneous-Frequency-Based F0 Extraction: Possible Role of a Repetitive Structure in Sounds
-
H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, “Restructuring Speech Representations Using a Pitch-Adaptive Time-Frequency Smoothing and an Instantaneous-Frequency-Based F0 Extraction: Possible Role of a Repetitive Structure in Sounds,” Speech Communication, vol. 27, pp. 187-207, 1999.
-
(1999)
Speech Communication
, vol.27
, pp. 187-207
-
-
Kawahara, H.1
Masuda-Katsuse, I.2
de Cheveigné, A.3
-
28
-
-
77950574571
-
Recent Development of the HMM-Based Speech Synthesis System (HTS)
-
H. Zen, K. Oura, T. Nose, J. Yamagishi, S. Sako, T. Toda, T. Masuko, A. W. Black, and K. Tokuda, “Recent Development of the HMM-Based Speech Synthesis System (HTS),” Proc. of APSIPA, pp. 121-130, 2009.
-
(2009)
Proc. of APSIPA
, pp. 121-130
-
-
Zen, H.1
Oura, K.2
Nose, T.3
Yamagishi, J.4
Sako, S.5
Toda, T.6
Masuko, T.7
Black, A. W.8
Tokuda, K.9
-
30
-
-
85133460481
-
On CrestMuseXML (CMX) Toolkit Ver. 0.40
-
(in Japanese)
-
T. Kitahara and H. Katayose, “On CrestMuseXML (CMX) Toolkit Ver. 0.40,” IPSJ SIG Technical Report, vol. 2008-MUS-75, no. 17, pp. 95-100, 2008 (in Japanese).
-
(2008)
IPSJ SIG Technical Report
, vol.2008-MUS-75
, Issue.17
, pp. 95-100
-
-
Kitahara, T.1
Katayose, H.2
-
31
-
-
0032678076
-
Hidden Markov Models Based on Multi-Space Probability Distribution for Pitch Pattern Modeling
-
K. Tokuda, T. Masuko, N. Miyazaki, and T. Kobayashi, “Hidden Markov Models Based on Multi-Space Probability Distribution for Pitch Pattern Modeling,” Proc. of ICASSP, vol. I, pp. 229-232, 1999.
-
(1999)
Proc. of ICASSP
, vol.I
, pp. 229-232
-
-
Tokuda, K.1
Masuko, T.2
Miyazaki, N.3
Kobayashi, T.4
-
32
-
-
33846429403
-
Minimum Generation Error Training for HMM-Based Speech Synthesis
-
Y. J. Wu and R. H. Wang, “Minimum Generation Error Training for HMM-Based Speech Synthesis,” Proc. of ICASSP, vol. I, pp. 89-92, 2006.
-
(2006)
Proc. of ICASSP
, vol.I
, pp. 89-92
-
-
Wu, Y. J.1
Wang, R. H.2
-
33
-
-
33745200051
-
Speech Parameter Generation Algorithm Considering Global Variance for HMM-Based Speech Synthesis
-
T. Toda and K. Tokuda, “Speech Parameter Generation Algorithm Considering Global Variance for HMM-Based Speech Synthesis,” Proc. of Interspeech, pp. 2801-2804, 2005.
-
(2005)
Proc. of Interspeech
, pp. 2801-2804
-
-
Toda, T.1
Tokuda, K.2