-
1
-
-
60849122466
-
-
EMIME project
-
EMIME project: http://www.emime.org
-
-
-
-
2
-
-
60849118010
-
-
TC-Star project
-
TC-Star project: http://www.tc-star.org
-
-
-
-
3
-
-
60849126020
-
TC-Star: Cross-language voice conversion revisited
-
Spain
-
D. Sundermann, H. Hoge, A. Bonafonte, H. Ney and J. Hirschberg, "TC-Star: Cross-language voice conversion revisited," in Proc. of the TC-Star Workshop 2006, Spain, 2006.
-
(2006)
Proc. of the TC-Star Workshop 2006
-
-
Sundermann, D.1
Hoge, H.2
Bonafonte, A.3
Ney, H.4
Hirschberg, J.5
-
5
-
-
0029725605
-
Speech synthesis from HMMs using dynamic features
-
T. Masuko, K. Tokuda, T. Kobayashi and S. Imai, "Speech synthesis from HMMs using dynamic features," in Proc. of ICASSP, pp. 389-392, 1996.
-
(1996)
Proc. of ICASSP
, pp. 389-392
-
-
Masuko, T.1
Tokuda, K.2
Kobayashi, T.3
Imai, S.4
-
6
-
-
85009139544
-
Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
-
T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," in Proc. of ICASSP, vol. 5, pp. 2347-2350, 1999.
-
(1999)
Proc. of ICASSP
, vol.5
, pp. 2347-2350
-
-
Yoshimura, T.1
Tokuda, K.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
7
-
-
0029288633
-
Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
-
C.J. Leggetter and P.C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," in Computer Speech and Language, vol. 9, no. 2, pp. 171-185, 1995.
-
(1995)
Computer Speech and Language
, vol.9
, Issue.2
, pp. 171-185
-
-
Leggetter, C.J.1
Woodland, P.C.2
-
8
-
-
0032050110
-
Maximum likelihood linear transformations for HMM-based speech recognition
-
M.J.F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," in Computer Speech and Language, vol. 12, no. 2, pp. 75-98, 1998.
-
(1998)
Computer Speech and Language
, vol.12
, Issue.2
, pp. 75-98
-
-
Gales, M.J.F.1
-
9
-
-
0007985533
-
Speaker adaptation for HMM-based speech synthesis system using MLLR
-
T. Masuko, K. Tokuda, T. Kobayashi and S. Imai, "Speaker adaptation for HMM-based speech synthesis system using MLLR," in The Third ESCA/COCOSDA Workshop on Speech Synthesis, pp. 273-276, 1998.
-
(1998)
The Third ESCA/COCOSDA Workshop on Speech Synthesis
, pp. 273-276
-
-
Masuko, T.1
Tokuda, K.2
Kobayashi, T.3
Imai, S.4
-
10
-
-
33947669452
-
HSMM-based model adaptation algorithms for average-voice-based speech synthesis
-
May
-
J. Yamagishi, K. Ogata, Y. Nakano, J. Isogai and T. Kobayashi, "HSMM-based model adaptation algorithms for average-voice-based speech synthesis," in Proc. of ICASSP, pp. 77-80, May 2006.
-
(2006)
Proc. of ICASSP
, pp. 77-80
-
-
Yamagishi, J.1
Ogata, K.2
Nakano, Y.3
Isogai, J.4
Kobayashi, T.5
-
11
-
-
0142007308
-
A training method of average voice model for HMM-based speech synthesis
-
J. Yamagishi, M. Tamura, T. Masuko, K. Tokuda and T. Kobayashi, "A training method of average voice model for HMM-based speech synthesis," in IEICE Trans. of Fundamentals, vol. E86-A, no. 8, pp. 1956-1963, 2003.
-
(2003)
IEICE Trans. of Fundamentals
, vol.E86-A
, Issue.8
, pp. 1956-1963
-
-
Yamagishi, J.1
Tamura, M.2
Masuko, T.3
Tokuda, K.4
Kobayashi, T.5
-
12
-
-
60849136241
-
-
Alphabet
-
http://en.wikipedia.org/wiki/International Phonetic Alphabet
-
Phonetic
-
-
-
13
-
-
51449098031
-
Minimum generation error lineal regression based model adaptation for HMM-based speech synthesis
-
Mar
-
L. Qin, Y.-J. Wu, Z.-H. Ling, R.-H. Wang and L.-R. Dai, "Minimum generation error lineal regression based model adaptation for HMM-based speech synthesis," in Proc. of ICASSP, pp. 3953-3956, Mar. 2008.
-
(2008)
Proc. of ICASSP
, pp. 3953-3956
-
-
Qin, L.1
Wu, Y.-J.2
Ling, Z.-H.3
Wang, R.-H.4
Dai, L.-R.5
-
14
-
-
0141479047
-
A Training Method for Average Voice Model Based on Shared Decision Tree Context Clustering and Speaker Adaptive Training
-
J. Yamagishi, M. Tamura, T. Masuko, K. Tokuda and T. Kobayashi, "A Training Method for Average Voice Model Based on Shared Decision Tree Context Clustering and Speaker Adaptive Training," in Proc. ICASSP 2003, vol. 1, pp. 716-719, 2003.
-
(2003)
Proc. ICASSP 2003
, vol.1
, pp. 716-719
-
-
Yamagishi, J.1
Tamura, M.2
Masuko, T.3
Tokuda, K.4
Kobayashi, T.5
-
16
-
-
60849132933
-
-
J. Kominek and A. Black, The CMU ARCTIC speech databases for speech synthesis research, Language Technologies Institute, Carnegie Mellon University, Pittsburgh, PA, Tech. Rep. CMULTI-03-177, http://festvox.org/cmu arctic/, 2003.
-
J. Kominek and A. Black, "The CMU ARCTIC speech databases for speech synthesis research," Language Technologies Institute, Carnegie Mellon University, Pittsburgh, PA, Tech. Rep. CMULTI-03-177, http://festvox.org/cmu arctic/, 2003.
-
-
-
-
17
-
-
60849119188
-
-
http://www.synsig.org/index.php/Blizzard Challenge 2008
-
(2008)
-
-
-
18
-
-
0032678076
-
Hidden markov models based on multi-space probability distribution for pitch pattern modeling
-
K. Tokuda, T. Masuko, N. Miyazaki and T. Kobayashi, "Hidden markov models based on multi-space probability distribution for pitch pattern modeling," in Proc. of ICASSP, pp. 229-232, 1999.
-
(1999)
Proc. of ICASSP
, pp. 229-232
-
-
Tokuda, K.1
Masuko, T.2
Miyazaki, N.3
Kobayashi, T.4
-
19
-
-
60849139326
-
-
http://hts.sp.nitech.ac.jp/
-
-
-
-
20
-
-
0020596154
-
Cepstral analysis synthesis on the mel frequency scale
-
S. Imai, "Cepstral analysis synthesis on the mel frequency scale," in Proc. of ICASSP, pp. 93-96, 1983.
-
(1983)
Proc. of ICASSP
, pp. 93-96
-
-
Imai, S.1
-
21
-
-
33745200051
-
Speech parameter generation algorithm considering global variance for HMM-based speech synthesis
-
T. Toda and K. Tokuda, "Speech parameter generation algorithm considering global variance for HMM-based speech synthesis," in Proc. of Interspeech, pp. 2801-2804, 2005.
-
(2005)
Proc. of Interspeech
, pp. 2801-2804
-
-
Toda, T.1
Tokuda, K.2
|