-
1
-
-
84966398940
-
Optimising selection of units from speech database for concatenative synthesis
-
Sep.
-
A. Black and N. Cambpbell, "Optimising selection of units from speech database for concatenative synthesis," in Proc. EUROSPEECH'95, Sep. 1995, pp. 581-584.
-
(1995)
Proc. EUROSPEECH'95
, pp. 581-584
-
-
Black, A.1
Cambpbell, N.2
-
2
-
-
0029765811
-
Unit selection in a concatenative speech synthesis system using a large speech database
-
May
-
A. Hunt and A. Black, "Unit selection in a concatenative speech synthesis system using a large speech database," in Proc. ICASSP'96, May 1996, pp. 373-376.
-
(1996)
Proc. ICASSP'96
, pp. 373-376
-
-
Hunt, A.1
Black, A.2
-
3
-
-
0032651722
-
A hidden Markov-model-based trainable speech synthesizer
-
R. Donovan and P. Woodland, "A hidden Markov-model-based trainable speech synthesizer," Comput. Speech Lang., vol.13, no.3, pp. 223-241, 1999.
-
(1999)
Comput. Speech Lang.
, vol.13
, Issue.3
, pp. 223-241
-
-
Donovan, R.1
Woodland, P.2
-
4
-
-
85001632375
-
Corpus-based techniques in the AT&T NEXTGEN synthesis system
-
Oct.
-
A. Syrdal, C. Wightman, A. Conkie, Y. Stylianou, M. Beutnagel, J. Schroeter, V. Storm, K. Lee, and M. Makashay, "Corpus-based techniques in the AT&T NEXTGEN synthesis system," in Proc. ICSLP'00, Oct. 2000, pp. 411-416.
-
(2000)
Proc. ICSLP'00
, pp. 411-416
-
-
Syrdal, A.1
Wightman, C.2
Conkie, A.3
Stylianou, Y.4
Beutnagel, M.5
Schroeter, J.6
Storm, V.7
Lee, K.8
Makashay, M.9
-
5
-
-
85006631929
-
Unit selection and emotional speech
-
Sep.
-
A. Black, "Unit selection and emotional speech," in Proc. Eurospeech' 03, Sep. 2003, pp. 1649-1652.
-
(2003)
Proc. Eurospeech'03
, pp. 1649-1652
-
-
Black, A.1
-
6
-
-
0028996993
-
Speech parameter generation from HMM using dynamic features
-
May
-
K. Tokuda, T. Kobayashi, and S. Imai, "Speech parameter generation from HMM using dynamic features," in Proc. ICASSP'95, May 1995, pp. 660-663.
-
(1995)
Proc. ICASSP'95
, pp. 660-663
-
-
Tokuda, K.1
Kobayashi, T.2
Imai, S.3
-
7
-
-
0038582234
-
An algorithm for speech parameter generation from HMM using dynamic features
-
Mar.
-
K. Tokuda, T. Masuko, T. Kobayashi, and S. Imai, "An algorithm for speech parameter generation from HMM using dynamic features," (in Japanese) J. Acoust. Soc. Jpn., vol.53, no.3, pp. 192-200, Mar. 1997.
-
(1997)
J. Acoust. Soc. Jpn. (in Japanese)
, vol.53
, Issue.3
, pp. 192-200
-
-
Tokuda, K.1
Masuko, T.2
Kobayashi, T.3
Imai, S.4
-
8
-
-
0029725605
-
Speech synthesis using HMMs with dynamic features
-
May
-
T. Masuko, K. Tokuda, T. Kobayashi, and S. Imai, "Speech synthesis using HMMs with dynamic features," in Proc. ICASSP'96, May 1996, pp. 389-392.
-
(1996)
Proc. ICASSP'96
, pp. 389-392
-
-
Masuko, T.1
Tokuda, K.2
Kobayashi, T.3
Imai, S.4
-
9
-
-
0002025578
-
HMM-based speech synthesis using dynamic features
-
Dec.
-
T. Masuko, K. Tokuda, T. Kobayashi, and S. Imai, "HMM-based speech synthesis using dynamic features," (in Japanese) IEICE Trans., vol.J79-D-II, no.12, pp. 2184-2190, Dec. 1996.
-
(1996)
IEICE Trans. (in Japanese)
, vol.J79-D-II
, Issue.12
, pp. 2184-2190
-
-
Masuko, T.1
Tokuda, K.2
Kobayashi, T.3
Imai, S.4
-
10
-
-
85009139544
-
Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
-
Sep.
-
T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," in Proc. Eurospeech'99, Sep. 1999, pp. 2374-12350
-
(1999)
Proc. Eurospeech'99
, pp. 2374-12350
-
-
Yoshimura, T.1
Tokuda, K.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
11
-
-
7044242284
-
Simultaneous modeling of spectrum, pitch and duration in HMMbased speech synthesis
-
Nov.
-
T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMMbased speech synthesis," (in Japanese) IEICE Trans., vol.J83-D-II, no.11, pp. 2099-2107, Nov. 2000.
-
(2000)
IEICE Trans. (in Japanese)
, vol.J83-D-II
, Issue.11
, pp. 2099-2107
-
-
Yoshimura, T.1
Tokuda, K.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
12
-
-
0033708106
-
Speech parameter generation algorithms for HMM-based speech synthesis
-
K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMM-based speech synthesis," in Proc. ICASSP'00, Jun. 2000, pp. 1315-1318. (Pubitemid 30956411)
-
(2000)
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
, vol.3
, pp. 1315-1318
-
-
Tokuda Keiichi1
Yoshimura Takayoshi2
Masuko Takashi3
Kobayashi Takao4
Kitamura Tadashi5
-
13
-
-
24144497811
-
Acoustic modeling of speaking styles and emotional expressions in HMM-based speech synthesis
-
Mar.
-
J.Yamagishi, K. Onishi, T. Masuko, and T.Kobayashi, "Acoustic modeling of speaking styles and emotional expressions in HMM-based speech synthesis," IEICE Trans. Inf. Syst., vol.E88-D, no.3, pp. 503-509, Mar. 2005.
-
(2005)
IEICE Trans. Inf. Syst.
, vol.E88-D
, Issue.3
, pp. 503-509
-
-
Yamagishi, J.1
Onishi, K.2
Masuko, T.3
Kobayashi, T.4
-
14
-
-
33645768204
-
A style adaptation technique for speech synthesis using HSMM and suprasegmental features
-
Mar.
-
M. Tachibana, J. Yamagishi, T. Masuko, and T. Kobayashi, "A style adaptation technique for speech synthesis using HSMM and suprasegmental features," IEICE Trans. Inf. Syst., vol.E89-D, no.3, pp. 1092-1099, Mar. 2006.
-
(2006)
IEICE Trans. Inf. Syst.
, vol.E89-D
, Issue.3
, pp. 1092-1099
-
-
Tachibana, M.1
Yamagishi, J.2
Masuko, T.3
Kobayashi, T.4
-
15
-
-
29144475179
-
Speech synthesis with various emotional expressions and speaking styles by style interpolation and morphing
-
Nov.
-
M. Tachibana, J. Yamagishi, T. Masuko, and T. Kobayashi, "Speech synthesis with various emotional expressions and speaking styles by style interpolation and morphing," IEICE Trans. Inf. Syst., vol.E88-D, no.11, pp. 2484-2491, Nov. 2005.
-
(2005)
IEICE Trans. Inf. Syst.
, vol.E88-D
, Issue.11
, pp. 2484-2491
-
-
Tachibana, M.1
Yamagishi, J.2
Masuko, T.3
Kobayashi, T.4
-
16
-
-
51449114529
-
A style control technique for HMM-based expressive speech synthesis
-
Sep.
-
T. Nose, J. Yamagishi, and T. Kobayashi, "A style control technique for HMM-based expressive speech synthesis," IEICE Trans. Inf. Syst., vol.E90-D, no.9, pp. 1406-1413, Sep. 2007.
-
(2007)
IEICE Trans. Inf. Syst.
, vol.E90-D
, Issue.9
, pp. 1406-1413
-
-
Nose, T.1
Yamagishi, J.2
Kobayashi, T.3
-
17
-
-
0030696416
-
Voice characteristics conversion for HMM-based speech synthesis system
-
Apr.
-
T. Masuko, K. Tokuda, T. Kobayashi, and S. Imai, "Voice characteristics conversion for HMM-based speech synthesis system," in Proc. ICASSP'97, Apr. 1997, pp. 1611-1614.
-
(1997)
Proc. ICASSP'97
, pp. 1611-1614
-
-
Masuko, T.1
Tokuda, K.2
Kobayashi, T.3
Imai, S.4
-
18
-
-
0034842740
-
Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR
-
May
-
M. Tamura, T. Masuko, K. Tokuda, and T. Kobayashi, "Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR," in Proc. ICASSP'01, May 2001, pp. 805-808.
-
(2001)
Proc. ICASSP'01
, pp. 805-808
-
-
Tamura, M.1
Masuko, T.2
Tokuda, K.3
Kobayashi, T.4
-
19
-
-
0142007308
-
A training method of average voice model for HMM-based speech synthesis
-
Aug.
-
J. Yamagishi, M. Tamura, T. Masuko, K. Tokuda, and T.Kobayashi, "A training method of average voice model for HMM-based speech synthesis," IEICE Trans. Fundamentals, vol.E86-A, no.8, pp. 1956-1963, Aug. 2003.
-
(2003)
IEICE Trans. Fundamentals
, vol.E86-A
, Issue.8
, pp. 1956-1963
-
-
Yamagishi, J.1
Tamura, M.2
Masuko, T.3
Tokuda, K.4
Kobayashi, T.5
-
20
-
-
33847129573
-
Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training
-
Feb.
-
J.Yamagishi and T.Kobayashi, "Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training," IEICE Trans. Inf. Syst., vol.E90-D, no.2, pp. 533-543, Feb. 2007.
-
(2007)
IEICE Trans. Inf. Syst.
, vol.E90-D
, Issue.2
, pp. 533-543
-
-
Yamagishi, J.1
Kobayashi, T.2
-
21
-
-
0007985533
-
Speaker adaptation for HMM-based speech synthesis system using MLLR
-
Nov.
-
M. Tamura, T. Masuko, K. Tokuda, and T. Kobayashi, "Speaker adaptation for HMM-based speech synthesis system using MLLR," in Proc. 3rd ESCA/COCOSDA Workshop Speech Synth., Nov. 1998, pp. 273-276.
-
(1998)
Proc. 3rd ESCA/COCOSDA Workshop Speech Synth.
, pp. 273-276
-
-
Tamura, M.1
Masuko, T.2
Tokuda, K.3
Kobayashi, T.4
-
22
-
-
1842604575
-
Voice characteristics conversion for HMM-based speech synthesis system using MAP-VFS
-
Dec.
-
T. Masuko, K. Tokuda, T. Kobayashi, and S. Imai, "Voice characteristics conversion for HMM-based speech synthesis system using MAP-VFS," (in Japanese) IEICE Trans., vol.J83-D-II, no.12, pp. 2509-2516, Dec. 2000.
-
(2000)
IEICE Trans. (in Japanese)
, vol.J83-D-II
, Issue.12
, pp. 2509-2516
-
-
Masuko, T.1
Tokuda, K.2
Kobayashi, T.3
Imai, S.4
-
23
-
-
0029288633
-
Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
-
C. Leggetter and P.Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol.9, no.2, pp. 171-185, 1995.
-
(1995)
Comput. Speech Lang.
, vol.9
, Issue.2
, pp. 171-185
-
-
Leggetter, C.1
Woodland, P.2
-
24
-
-
0028419019
-
Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
-
Apr.
-
J. Gauvain and C. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol.2, no.2, pp. 291-298, Apr. 1994.
-
(1994)
IEEE Trans. Speech Audio Process
, vol.2
, Issue.2
, pp. 291-298
-
-
Gauvain, J.1
Lee, C.2
-
25
-
-
0030124675
-
Speaker adaptation based on transfer vector field smoothing using maximum a posteriori probability estimation
-
M. Tonomura, T. Kosaka, and S. Matsunaga, "Speaker adaptation based on transfer vector field smoothing using maximum a posteriori probability estimation," Comput. Speech Lang., vol.10, no.2, pp. 117-132, 1995.
-
(1995)
Comput. Speech Lang.
, vol.10
, Issue.2
, pp. 117-132
-
-
Tonomura, M.1
Kosaka, T.2
Matsunaga, S.3
-
26
-
-
0031118076
-
Vector-field-smoothed bayesian learning for fast and incremental speaker/telephone-channel adaptation
-
J. Takahashi and S. Sagayama, "Vector-field-smoothed bayesian learning for fast and incremental speaker/telephone-channel adaptation," Comput. Speech Lang., vol.11, no.2, pp. 127-146, 1997.
-
(1997)
Comput. Speech Lang.
, vol.11
, Issue.2
, pp. 127-146
-
-
Takahashi, J.1
Sagayama, S.2
-
27
-
-
0036522887
-
Multi-space probability distribution HMM
-
Mar.
-
K. Tokuda, T. Masuko, N. Miyazaki, and T. Kobayashi, "Multi-space probability distribution HMM," IEICE Trans. Inf. Syst., vol.E85-D, no.3, pp. 455-464, Mar. 2002.
-
(2002)
IEICE Trans. Inf. Syst.
, vol.E85-D
, Issue.3
, pp. 455-464
-
-
Tokuda, K.1
Masuko, T.2
Miyazaki, N.3
Kobayashi, T.4
-
28
-
-
85008066911
-
Speaker adaptation of pitch and spectrum for HMM-based speech synthesis
-
Apr.
-
M. Tamura, T. Masuko, K. Tokuda, and T. Kobayashi, "Speaker adaptation of pitch and spectrum for HMM-based speech synthesis," (in Japanese) IEICE Trans., vol.J85-D-II, no.4, pp. 545-553, Apr. 2002.
-
(2002)
IEICE Trans. (in Japanese)
, vol.J85-D-II
, Issue.4
, pp. 545-553
-
-
Tamura, M.1
Masuko, T.2
Tokuda, K.3
Kobayashi, T.4
-
29
-
-
44449177634
-
A hidden semi-Markov model-based speech synthesis system
-
May
-
H. Zen, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "A hidden semi-Markov model-based speech synthesis system," IEICE Trans. Inf. & Syst., vol.E90-D, no.5, pp. 825-834, May 2007.
-
(2007)
IEICE Trans. Inf. & Syst.
, vol.E90-D
, Issue.5
, pp. 825-834
-
-
Zen, H.1
Tokuda, K.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
31
-
-
0022234383
-
Explicit modelling of state occupancy in hidden Markov models for automatic speech recognition
-
Mar.
-
M. Russell and R. Moore, "Explicit modelling of state occupancy in hidden Markov models for automatic speech recognition," in Proc. ICASSP'85, Mar. 1985, pp. 5-8.
-
(1985)
Proc. ICASSP'85
, pp. 5-8
-
-
Russell, M.1
Moore, R.2
-
32
-
-
0022685753
-
CONTINUOUSLY VARIABLE DURATION HIDDEN MARKOV MODELS FOR AUTOMATIC SPEECH RECOGNITION.
-
S. Levinson, "Continuously variable duration hidden Markov models for automatic speech recognition," Comput. Speech Lang., vol.1, no.1, pp. 29-45, 1986. (Pubitemid 17552445)
-
(1986)
Computer Speech and Language
, vol.1
, Issue.1
, pp. 29-45
-
-
Levinson, S.E.1
-
33
-
-
0030362995
-
A compact model for speaker-adaptive training
-
Oct.
-
T. Anastasakos, J. McDonough, R. Schwartz, and J. Makhoul, "A compact model for speaker-adaptive training," in Proc. ICSLP'96, Oct. 1996, pp. 1137-1140.
-
(1996)
Proc. ICSLP'96
, pp. 1137-1140
-
-
Anastasakos, T.1
McDonough, J.2
Schwartz, R.3
Makhoul, J.4
-
34
-
-
0038042801
-
A context clustering technique for average voice models
-
Mar.
-
J.Yamagishi, M. Tamura, T. Masuko, K. Tokuda, and T. Kobayashi, "A context clustering technique for average voice models," IEICE Trans. Inf. Syst., vol.E86-D, no.3, pp. 534-542, Mar. 2003.
-
(2003)
IEICE Trans. Inf. Syst.
, vol.E86-D
, Issue.3
, pp. 534-542
-
-
Yamagishi, J.1
Tamura, M.2
Masuko, T.3
Tokuda, K.4
Kobayashi, T.5
-
35
-
-
70350485779
-
HMM-based emotional speech synthesis using average emotion model
-
Dec.
-
L. Qin, Z. Ling, Y. Wu, B. Zhang, and R. Wang, "HMM-based emotional speech synthesis using average emotion model," in Proc. ISCSLP'06 (Springer LNAI Book), Dec. 2006, pp. 233-240.
-
(2006)
Proc. ISCSLP'06 (Springer LNAI Book)
, pp. 233-240
-
-
Qin, L.1
Ling, Z.2
Wu, Y.3
Zhang, B.4
Wang, R.5
-
36
-
-
33748468338
-
New approach to the polyglot speech generation by means of an HMM-based speaker adaptable synthesizer
-
J. Latorre, K. Iwano, and S. Furui, "New approach to the polyglot speech generation by means of an HMM-based speaker adaptable synthesizer," Speech Commun., vol.48, no.10, pp. 1227-1242, 2006.
-
(2006)
Speech Commun.
, vol.48
, Issue.10
, pp. 1227-1242
-
-
Latorre, J.1
Iwano, K.2
Furui, S.3
-
37
-
-
0029375590
-
Speaker adaptation using constrained reestimation of Gaussian mixtures
-
Sep.
-
V. Digalakis, D. Rtischev, and L. Neumeyer, "Speaker adaptation using constrained reestimation of Gaussian mixtures," IEEE Trans. Speech Audio Process., vol.3, no.5, pp. 357-366, Sep. 1995.
-
(1995)
IEEE Trans. Speech Audio Process
, vol.3
, Issue.5
, pp. 357-366
-
-
Digalakis, V.1
Rtischev, D.2
Neumeyer, L.3
-
38
-
-
0032050110
-
Maximum likelihood linear transformations for HMM-based speech recognition
-
M. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Comput. Speech Lang., vol.12, no.2, pp. 75-98, 1998.
-
(1998)
Comput. Speech Lang.
, vol.12
, Issue.2
, pp. 75-98
-
-
Gales, M.1
-
39
-
-
0035279111
-
A structural Bayes approach to speaker adaptation
-
Mar.
-
K. Shinoda and C. Lee, "A structural Bayes approach to speaker adaptation," IEEE Trans. Speech Audio Process., vol.9, pp. 276-287, Mar. 2001.
-
(2001)
IEEE Trans. Speech Audio Process
, vol.9
, pp. 276-287
-
-
Shinoda, K.1
Lee, C.2
-
40
-
-
0036461005
-
Structural maximum a posteriori linear regression for fast HMM adaptation
-
O. Shiohan, T. Myrvoll, and C. Lee, "Structural maximum a posteriori linear regression for fast HMM adaptation," Comput. Speech Lang., vol.16, no.3, pp. 5-24, 2002.
-
(2002)
Comput. Speech Lang.
, vol.16
, Issue.3
, pp. 5-24
-
-
Shiohan, O.1
Myrvoll, T.2
Lee, C.3
-
41
-
-
0030189744
-
Speaker adaptation using combined transformation and Bayesian methods
-
Jul.
-
V. Digalakis and L. Neumeyer, "Speaker adaptation using combined transformation and Bayesian methods," IEEE Trans. Speech Audio Process., vol.4, no.3, pp. 294-300, Jul. 1996.
-
(1996)
IEEE Trans. Speech Audio Process
, vol.4
, Issue.3
, pp. 294-300
-
-
Digalakis, V.1
Neumeyer, L.2
-
42
-
-
33745214429
-
Model adaptation and adaptive training using ESAT algorithm for HMM-based speech synthesis
-
Sep.
-
J. Isogai, J. Yamagishi, and T. Kobayashi, "Model adaptation and adaptive training using ESAT algorithm for HMM-based speech synthesis," in Proc. Eurospeech'05, Sep. 2005, pp. 2597-2600.
-
(2005)
Proc. Eurospeech'05
, pp. 2597-2600
-
-
Isogai, J.1
Yamagishi, J.2
Kobayashi, T.3
-
43
-
-
33947669452
-
HSMM-based model adaptation algorithms for average-voice-based speech synthesis
-
May
-
J. Yamagishi, K. Ogata, Y. Nakano, J. Isogai, and T. Kobayashi, "HSMM-based model adaptation algorithms for average-voice-based speech synthesis," in Proc. ICASSP'06, May 2006, pp. 77-80.
-
(2006)
Proc. ICASSP'06
, pp. 77-80
-
-
Yamagishi, J.1
Ogata, K.2
Nakano, Y.3
Isogai, J.4
Kobayashi, T.5
-
44
-
-
34547496746
-
Constrained structural maximum a posteriori linear regression for average-voice-based speech synthesis
-
Sep.
-
Y. Nakano, M. Tachibana, J. Yamagishi, and T. Kobayashi, "Constrained structural maximum a posteriori linear regression for average-voice-based speech synthesis," in Proc. ICSLP'06, Sep. 2006, pp. 2286-2289.
-
(2006)
Proc. ICSLP'06
, pp. 2286-2289
-
-
Nakano, Y.1
Tachibana, M.2
Yamagishi, J.3
Kobayashi, T.4
-
45
-
-
34547525896
-
Acoustic model training based on linear transformation and MAP modification for HSMM-based speech synthesis
-
Sep.
-
K. Ogata, M. Tachibana, J. Yamagishi, and T. Kobayashi, "Acoustic model training based on linear transformation and MAP modification for HSMM-based speech synthesis," in Proc. ICSLP'06, Sep. 2006, pp. 1328-1331.
-
(2006)
Proc. ICSLP'06
, pp. 1328-1331
-
-
Ogata, K.1
Tachibana, M.2
Yamagishi, J.3
Kobayashi, T.4
-
46
-
-
34547529978
-
Model adaptation approach to speech synthesis with diverse voices and styles
-
Apr.
-
J. Yamagishi, T. Kobayashi, M. Tachibana, K. Ogata, and Y. Nakano, "Model adaptation approach to speech synthesis with diverse voices and styles," in Proc. ICASSP'07, Apr. 2007, pp. 1233-1236.
-
(2007)
Proc. ICASSP'07
, pp. 1233-1236
-
-
Yamagishi, J.1
Kobayashi, T.2
Tachibana, M.3
Ogata, K.4
Nakano, Y.5
-
47
-
-
0020596154
-
Cepstral analysis synthesis on the Mel frequency scale
-
Apr.
-
S. Imai, "Cepstral analysis synthesis on the Mel frequency scale," in Proc. ICASSP'83, Apr. 1983, pp. 93-96.
-
(1983)
Proc. ICASSP'83
, pp. 93-96
-
-
Imai, S.1
-
48
-
-
0001310760
-
Spectral estimation of speech based on Mel-cepstral representation
-
Aug.
-
K. Tokuda, T. Kobayashi, T. Fukada, H. Saito, and S. Imai, "Spectral estimation of speech based on Mel-cepstral representation," (in Japanese) IEICE Trans. Fundamentals, vol.J74-A, no.8, pp. 1240-1248, Aug. 1991.
-
(1991)
IEICE Trans. Fundamentals (in Japanese)
, vol.J74-A
, Issue.8
, pp. 1240-1248
-
-
Tokuda, K.1
Kobayashi, T.2
Fukada, T.3
Saito, H.4
Imai, S.5
-
50
-
-
11144317887
-
Robust F0 estimation of speech signal using harmonicity measure based on instantaneous frequency
-
Dec.
-
D. Arifianto, T. Tanaka, T. Masuko, and T. Kobayashi, "Robust F0 estimation of speech signal using harmonicity measure based on instantaneous frequency," IEICE Trans. Inf. Syst., vol.E87-D, no.12, pp. 2812-2820, Dec. 2004.
-
(2004)
IEICE Trans. Inf. Syst.
, vol.E87-D
, Issue.12
, pp. 2812-2820
-
-
Arifianto, D.1
Tanaka, T.2
Masuko, T.3
Kobayashi, T.4
-
51
-
-
38549096029
-
A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
-
May
-
T. Toda and K. Tokuda, "A speech parameter generation algorithm considering global variance for HMM-based speech synthesis," IEICE Trans. Inf. Syst., vol.E90-D, no.5, pp. 816-824, May 2007.
-
(2007)
IEICE Trans. Inf. Syst.
, vol.E90-D
, Issue.5
, pp. 816-824
-
-
Toda, T.1
Tokuda, K.2
-
52
-
-
85133674021
-
Improved average-voice-based speech synthesis using gender-mixed modeling and a parameter generation algorithm considering GV
-
Aug.
-
J. Yamagishi, T. Kobayashi, S. Renals, S. King, H. Zen, T. Toda, and K. Tokuda, "Improved average-voice-based speech synthesis using gender-mixed modeling and a parameter generation algorithm considering GV," in Proc. 6th ISCA Workshop Speech Synth., Aug. 2007, pp. 125-130.
-
(2007)
Proc. 6th ISCA Workshop Speech Synth.
, pp. 125-130
-
-
Yamagishi, J.1
Kobayashi, T.2
Renals, S.3
King, S.4
Zen, H.5
Toda, T.6
Tokuda, K.7
-
53
-
-
70350480131
-
A speaker-adaptive HMM-based speech synthesis for the Blizzard Challenge 2007
-
submitted for publication
-
J. Yamagishi, T. Nose, H. Zen, T. Toda, K. Tokuda, S. King, and S. Renals, "A speaker-adaptive HMM-based speech synthesis for the Blizzard Challenge 2007," IEEE Audio, Speech, Lang. Process., 2008, submitted for publication.
-
(2008)
IEEE Audio, Speech, Lang. Process
-
-
Yamagishi, J.1
Nose, T.2
Zen, H.3
Toda, T.4
Tokuda, K.5
King, S.6
Renals, S.7
-
54
-
-
0030263447
-
Mean and variance adaptation within the MLLRframework
-
M. Gales and P. Woodland, "Mean and variance adaptation within the MLLRframework," Comput. Speech Lang., vol.10, no.4, pp. 249-264, 1996.
-
(1996)
Comput. Speech Lang.
, vol.10
, Issue.4
, pp. 249-264
-
-
Gales, M.1
Woodland, P.2
-
55
-
-
0002629270
-
Maximum likelihood from incomplete data via the EM algorithm
-
Series B
-
A. Dempster, N. Laird, and D. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc., Series B, vol.39, no.1, pp. 1-38, 1977.
-
(1977)
J. R. Statist. Soc.
, vol.39
, Issue.1
, pp. 1-38
-
-
Dempster, A.1
Laird, N.2
Rubin, D.3
-
56
-
-
68249104241
-
The Nitech-NAIST HMM-based speech synthesis system for the Blizzard Challenge 2006
-
Jun.
-
H. Zen, T. Toda, and K. Tokuda, "The Nitech-NAIST HMM-based speech synthesis system for the Blizzard Challenge 2006," IEICE Trans. Inf. Syst., vol.E91-D, no.6, pp. 1764-1773, Jun. 2008.
-
(2008)
IEICE Trans. Inf. Syst.
, vol.E91-D
, Issue.6
, pp. 1764-1773
-
-
Zen, H.1
Toda, T.2
Tokuda, K.3
-
57
-
-
0032638856
-
Semi-tied covariance matrices for hidden Markov models
-
Mar.
-
M. Gales, "Semi-tied covariance matrices for hidden Markov models," IEEE Trans. Speech Audio Process., vol.7, no.2, pp. 272-281, Mar. 1999.
-
(1999)
IEEE Trans. Speech Audio Process
, vol.7
, Issue.2
, pp. 272-281
-
-
Gales, M.1
-
58
-
-
84892187452
-
Maximum likelihood modeling with Gaussian distributions for classification
-
May
-
R. Gopinath, "Maximum likelihood modeling with Gaussian distributions for classification," in Proc. ICASSP'98, May 1998, pp. 661-664.
-
(1998)
Proc. ICASSP'98
, pp. 661-664
-
-
Gopinath, R.1
-
59
-
-
0029769867
-
Signal bias removal by maximum likelihood estimation for robust telephone speech recognition
-
Jan.
-
M. Rahim and B. Juang, "Signal bias removal by maximum likelihood estimation for robust telephone speech recognition," IEEE Trans. Speech Audio Process., vol.4, no.1, pp. 19-30, Jan. 1996.
-
(1996)
IEEE Trans. Speech Audio Process
, vol.4
, Issue.1
, pp. 19-30
-
-
Rahim, M.1
Juang, B.2
-
60
-
-
0034853390
-
Multiple-cluster adaptive training schemes
-
May
-
M. Gales, "Multiple-cluster adaptive training schemes," in Proc. ICASSP'01, May 2001, pp. 361-364.
-
(2001)
Proc. ICASSP'01
, pp. 361-364
-
-
Gales, M.1
-
62
-
-
0030643678
-
Improved Bayesian learning of hidden Markov models for speaker adaptation
-
Apr.
-
J. Chien, H.Wang, and C. Lee, "Improved Bayesian learning of hidden Markov models for speaker adaptation," in Proc. ICASSP'97, Apr. 1997, pp. 1027-1030.
-
(1997)
Proc. ICASSP'97
, pp. 1027-1030
-
-
Chien, J.1
Wang, H.2
Lee, C.3
-
63
-
-
85016140477
-
An adaptive algorithm for Mel-cepstral analysis of speech
-
Mar.
-
T. Fukada, K. Tokuda, T. Kobayashi, and S. Imai, "An adaptive algorithm for Mel-cepstral analysis of speech," in Proc. ICASSP'92, Mar. 1992, pp. 137-140.
-
(1992)
Proc. ICASSP'92
, pp. 137-140
-
-
Fukada, T.1
Tokuda, K.2
Kobayashi, T.3
Imai, S.4
-
64
-
-
0033906251
-
MDL-based context-dependent subword modeling for speech recognition
-
Mar.
-
K. Shinoda and T.Watanabe, "MDL-based context-dependent subword modeling for speech recognition," J. Acoust. Soc. Japan (E), vol.21, pp. 79-86, Mar. 2000.
-
(2000)
J. Acoust. Soc. Japan (E)
, vol.21
, pp. 79-86
-
-
Shinoda, K.1
Watanabe, T.2
-
65
-
-
6644226630
-
A large-scale Japanese speech database
-
Nov.
-
Y. Sagisaka, K. Takeda, M. Abel, S. Katagiri, T. Umeda, and H. Kuwabara, "A large-scale Japanese speech database," in Proc. ICSLP'96, Nov. 1990, pp. 1089-1092.
-
(1990)
Proc. ICSLP'96
, pp. 1089-1092
-
-
Sagisaka, Y.1
Takeda, K.2
Abel, M.3
Katagiri, S.4
Umeda, T.5
Kuwabara, H.6
-
66
-
-
79952258981
-
-
[Online]. Available:
-
K. Tokuda, H. Zen, J. Yamagishi, T. Masuko, S. Sako, A. Black, and T. Nose, The HMM-Based Speech Synthesis System (HTS). [Online]. Available: http://www.hts.sp.nitech.ac.jp/
-
The HMM-Based Speech Synthesis System (HTS)
-
-
Tokuda, K.1
Zen, H.2
Yamagishi, J.3
Masuko, T.4
Sako, S.5
Black, A.6
Nose, T.7
-
67
-
-
85133720638
-
The HMMbased speech synthesis system (HTS) version 2.0
-
Aug.
-
H. Zen, T. Nose, J. Yamagishi, S. Sako, and K. Tokuda, "The HMMbased speech synthesis system (HTS) version 2.0," in Proc. 6th ISCA Workshop Speech Synth., Aug. 2007, pp. 294-299.
-
(2007)
Proc. 6th ISCA Workshop Speech Synth.
, pp. 294-299
-
-
Zen, H.1
Nose, T.2
Yamagishi, J.3
Sako, S.4
Tokuda, K.5
|