-
2
-
-
0030362995
-
A compact model for speaker adaptive training
-
T. Anastasakos, J. McDonough, R. Schwartz, and J. Makhoul, "A compact model for speaker adaptive training," in Proc. ICSLP, 1996, pp. 1137-1140.
-
(1996)
Proc. ICSLP
, pp. 1137-1140
-
-
Anastasakos, T.1
McDonough, J.2
Schwartz, R.3
Makhoul, J.4
-
3
-
-
84865713971
-
Crowdsourcing preference tests, and how to detect cheating
-
S. Buchholz and J. Latorre, "Crowdsourcing preference tests, and how to detect cheating," in Proc. Interspeech, 2011, pp. 3053-3056.
-
(2011)
Proc. Interspeech
, pp. 3053-3056
-
-
Buchholz, S.1
Latorre, J.2
-
4
-
-
0032050110
-
Maximum likelihood linear transformations for HMM-based speech recognition
-
M. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Comput. Speech Lang., vol. 12, no. 2, pp. 75-98, 1998. (Pubitemid 128383747)
-
(1998)
Computer Speech and Language
, vol.12
, Issue.2
, pp. 75-98
-
-
Gales, M.J.F.1
-
5
-
-
0034227757
-
Cluster adaptive training of hidden markov models
-
Jul
-
M. Gales, "Cluster adaptive training of hidden Markov models," IEEE Trans. Speech Audio Process., vol. 8, no. 4, pp. 417-428, Jul. 2000.
-
(2000)
IEEE Trans. Speech Audio Process
, vol.8
, Issue.4
, pp. 417-428
-
-
Gales, M.1
-
6
-
-
84962787636
-
Acoustic factorisation
-
M. Gales, "Acoustic factorisation," in Proc. ASRU, 2001, pp. 77-80.
-
(2001)
Proc. ASRU
, pp. 77-80
-
-
Gales, M.1
-
7
-
-
0034320005
-
Rapid speaker adaptation in eigenvoice space
-
DOI 10.1109/89.876308
-
R. Kuhn, J. Junqua, P. Nguyen, and N. Niedzielski, "Rapid speaker adaptation in eigenvoice space," IEEE Trans. Speech Audio Process., vol. 8, no. 6, pp. 695-707, Nov. 2000. (Pubitemid 32025317)
-
(2000)
IEEE Transactions on Speech and Audio Processing
, vol.8
, Issue.6
, pp. 695-707
-
-
Kuhn, R.1
Junqua, J.-C.2
Nguyen, P.3
Niedzielski, N.4
-
8
-
-
33748468338
-
New approach to the polyglot speech generation by means of an HMM-based speaker adaptable synthesizer
-
DOI 10.1016/j.specom.2006.05.003, PII S0167639306000483
-
J. Latorre, K. Iwano, and S. Furui, "New approach to the polyglot speech generation by means of an HMM-based speaker adaptable synthesizer," Speech Commun., vol. 48, no. 10, pp. 1227-1242, 2006. (Pubitemid 44353817)
-
(2006)
Speech Communication
, vol.48
, Issue.10
, pp. 1227-1242
-
-
Latorre, J.1
Iwano, K.2
Furui, S.3
-
9
-
-
79959843446
-
An analysis of language mismatch in HMM state mapping-based cross-lingual speaker adaptation
-
H. Liang and J. Dines, "An analysis of language mismatch in HMM state mapping-based cross-lingual speaker adaptation," in Proc. Interspeech, 2010, pp. 622-625.
-
(2010)
Proc. Interspeech
, pp. 622-625
-
-
Liang, H.1
Dines, J.2
-
10
-
-
51449118125
-
Acoustic modeling with contextual additive structure for HMM-based speech recognition
-
Y. Nankaku, K. Nakamura, H. Zen, and K. Tokuda, "Acoustic modeling with contextual additive structure for HMM-based speech recognition," in Proc. ICASSP, 2008, pp. 4469-4472.
-
(2008)
Proc. ICASSP
, pp. 4469-4472
-
-
Nankaku, Y.1
Nakamura, K.2
Zen, H.3
Tokuda, K.4
-
11
-
-
0003805597
-
-
Ph.D. dissertation, Cambridge Univ., Cambridge, U.K
-
J. Odell, "The use of context in large vocabulary speech recognition," Ph.D. dissertation, Cambridge Univ., Cambridge, U.K., 1995.
-
(1995)
The Use of Context in Large Vocabulary Speech Recognition
-
-
Odell, J.1
-
12
-
-
78651062051
-
Cross-lingual speaker adaptation for HMM-based speech synthesis considering differences between language-dependent average voices
-
X. Peng, K. Oura, Y. Nankaku, and K. Tokuda, "Cross-lingual speaker adaptation for HMM-based speech synthesis considering differences between language-dependent average voices," in Proc. ICSP, 2010, pp. 605-608.
-
(2010)
Proc. ICSP
, pp. 605-608
-
-
Peng, X.1
Oura, K.2
Nankaku, Y.3
Tokuda, K.4
-
13
-
-
85008020260
-
A cross-language state sharing and mapping approach to bilingual (Mandarin-English) TTS
-
Aug
-
Y. Qian, H. Liang, and F. Soong, "A cross-language state sharing and mapping approach to bilingual (Mandarin-English) TTS," IEEE Trans. Audio Speech Lang. Process., vol. 17, no. 6, pp. 1231-1239, Aug. 2009.
-
(2009)
IEEE Trans. Audio Speech Lang. Process
, vol.17
, Issue.6
, pp. 1231-1239
-
-
Qian, Y.1
Liang, H.2
Soong, F.3
-
14
-
-
70450153447
-
-
Japanese M.S. thesis, Nagoya Inst. of Technol., Nagoya, Japan
-
K. Saino, "A clustering technique for factor analyzed voice models," (in Japanese) M.S. thesis, Nagoya Inst. of Technol., Nagoya, Japan, 2008.
-
(2008)
A Clustering Technique for Factor Analyzed Voice Models
-
-
Saino, K.1
-
15
-
-
1642370513
-
Solving unsymmetric sparse systems of linear equations with PARDISO
-
O. Schenk and K. Gärtner, "Solving unsymmetric sparse systems of linear equations with PARDISO," J. Future Gen. Comput. Syst., vol. 20, no. 3, pp. 475-487, 2004.
-
(2004)
J. Future Gen. Comput. Syst
, vol.20
, Issue.3
, pp. 475-487
-
-
Schenk, O.1
Gärtner, K.2
-
16
-
-
85009274666
-
Globalphone: A multilingual speech and text database developed at Karlsruhe University
-
T. Schultz, "Globalphone: A multilingual speech and text database developed at Karlsruhe University," in Proc. ICSLP, 2002, pp. 345-348.
-
(2002)
Proc. ICSLP
, pp. 345-348
-
-
Schultz, T.1
-
17
-
-
84865783757
-
Separating speaker and environmental variability using factored transforms
-
M. Seltzer and A. Acero, "Separating speaker and environmental variability using factored transforms," in Proc. Interspeech, 2011, pp. 1097-1100.
-
(2011)
Proc. Interspeech
, pp. 1097-1100
-
-
Seltzer, M.1
Acero, A.2
-
18
-
-
85009257840
-
-
Eigenvoices for HMM-based speech synthesis
-
K. Shichiri, A. Sawabe, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Eigenvoices for HMM-based speech synthesis," in Proc. ICSLP, 2002, pp. 1269-1272.
-
(2002)
Proc. ICSLP
, pp. 1269-1272
-
-
Shichiri, K.1
Sawabe, A.2
Tokuda, K.3
Masuko, T.4
Kobayashi, T.5
Kitamura, T.6
-
19
-
-
85135145174
-
Acoustic modeling based on the MDL criterion for speech recognition
-
K. Shinoda and T. Watanabe, "Acoustic modeling based on the MDL criterion for speech recognition," in Proc. Eurospeech, 1997, pp. 99-102.
-
(1997)
Proc. Eurospeech
, pp. 99-102
-
-
Shinoda, K.1
Watanabe, T.2
-
20
-
-
33947650089
-
HMM state clustering based on efficient cross-validation
-
T. Shinozaki, "HMM state clustering based on efficient cross-validation," in Proc. ICASSP, 2006, pp. 1157-1160.
-
(2006)
Proc. ICASSP
, pp. 1157-1160
-
-
Shinozaki, T.1
-
21
-
-
33646806075
-
Adaptation of precision matrix models on large vocabulary continuous speech recognition
-
K. Sim and M. Gales, "Adaptation of precision matrix models on large vocabulary continuous speech recognition," in Proc. ICASSP, 2005, pp. 97-100.
-
(2005)
Proc. ICASSP
, pp. 97-100
-
-
Sim, K.1
Gales, M.2
-
23
-
-
38549096029
-
A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
-
T. Toda and K. Tokuda, "A speech parameter generation algorithm considering global variance for HMM-based speech synthesis," IEICE Trans. Inf. Syst., vol. E90-D, no. 5, pp. 816-824, 2007.
-
(2007)
IEICE Trans. Inf. Syst., Vol. E90-D
, Issue.5
, pp. 816-824
-
-
Toda, T.1
Tokuda, K.2
-
24
-
-
0036522887
-
Multi-space probability distribution HMM
-
K. Tokuda, T. Masuko, N. Miyazaki, and T. Kobayashi, "Multi-space probability distribution HMM," IEICE Trans. Inf. Syst., vol. E85-D, no. 3, pp. 455-464, 2002. (Pubitemid 35353984)
-
(2002)
IEICE Transactions on Information and Systems
, vol.E85-D
, Issue.3
, pp. 455-464
-
-
Tokuda, K.1
Masuko, T.2
Miyazaki, N.3
Kobayashi, T.4
-
25
-
-
0033708106
-
Speech parameter generation algorithms for HMM-based speech synthesis
-
K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMM-based speech synthesis," in Proc. ICASSP, 2000, pp. 1315-1318.
-
(2000)
Proc. ICASSP
, pp. 1315-1318
-
-
Tokuda, K.1
Yoshimura, T.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
26
-
-
84966348891
-
An HMM-based speech synthesis system applied to English
-
Workshop, CD-ROM Proceeding
-
K. Tokuda, H. Zen, and A. Black, "An HMM-based speech synthesis system applied to English," in Proc. IEEE Speech Synth. Workshop, 2002, CD-ROM Proceeding.
-
(2002)
Proc. IEEE Speech Synth
-
-
Tokuda, K.1
Zen, H.2
Black, A.3
-
27
-
-
84856249636
-
From multilingual to polyglot speech synthesis
-
C. Traber, K. Huber, K. Nedir, B. Pfister, E. Keller, and B. Zellner, "From multilingual to polyglot speech synthesis," in Proc. Eurospeech, 1999, pp. 835-838.
-
(1999)
Proc. Eurospeech
, pp. 835-838
-
-
Traber, C.1
Huber, K.2
Nedir, K.3
Pfister, B.4
Keller, E.5
Zellner, B.6
-
28
-
-
80051617808
-
Speaker and noise factorisation on AURORA4 task
-
Y. Wang and M. Gales, "Speaker and noise factorisation on AURORA4 task," in Proc. ICASSP, 2011, pp. 4584-4587.
-
(2011)
Proc. ICASSP
, pp. 4584-4587
-
-
Wang, Y.1
Gales, M.2
-
29
-
-
84859768642
-
-
The EMIME Bilingual Database, Tech. Rep. EDI-INF-RR-1388
-
M. Wester, "The EMIME Bilingual Database," Univ. of Edinburgh, 2010, Tech. Rep. EDI-INF-RR-1388.
-
(2010)
Univ. of Edinburgh
-
-
Wester, M.1
-
30
-
-
70450192740
-
State mapping based method for cross-lingual speaker adaptation in HMM-based speech synthesis
-
Y. Wu, Y. Nankaku, and K. Tokuda, "State mapping based method for cross-lingual speaker adaptation in HMM-based speech synthesis," in Proc. Interspeech, 2009, pp. 528-531.
-
(2009)
Proc. Interspeech
, pp. 528-531
-
-
Wu, Y.1
Nankaku, Y.2
Tokuda, K.3
-
31
-
-
33846463597
-
-
Ph.D. dissertation, Tokyo Inst. of Technol., Yokohama, Japan
-
J. Yamagishi, "Average-voice-based speech synthesis," Ph.D. dissertation, Tokyo Inst. of Technol., Yokohama, Japan, 2006.
-
(2006)
Average-voice-based Speech Synthesis
-
-
Yamagishi, J.1
-
32
-
-
78049403515
-
Simple methods for improving speaker-similarity of HMM-based speech synthesis
-
J. Yamagishi and S. King, "Simple methods for improving speaker-similarity of HMM-based speech synthesis," in Proc. ICASSP, 2010, pp. 4610-4613.
-
(2010)
Proc. ICASSP
, pp. 4610-4613
-
-
Yamagishi, J.1
King, S.2
-
33
-
-
4544291748
-
Speaking style adaptation using context clustering decision tree for HMM-based speech synthesis
-
J. Yamagishi, M. Tachibana, T. Masuko, and T. Kobayashi, "Speaking style adaptation using context clustering decision tree for HMM-based speech synthesis," in Proc. ICASSP, 2004, pp. 5-8.
-
(2004)
Proc. ICASSP
, pp. 5-8
-
-
Yamagishi, J.1
Tachibana, M.2
Masuko, T.3
Kobayashi, T.4
-
35
-
-
67650819492
-
The HTS2007' system: Yet another evaluation of the speaker-adaptive HMM-based speech synthesis system in the 2008 Blizzard Challenge
-
J. Yamagishi, H. Zen, Y. Wu, T. Toda, and K. Tokuda, "The HTS2007' system: Yet another evaluation of the speaker-adaptive HMM-based speech synthesis system in the 2008 Blizzard Challenge," in Proc. Blizzard Challenge Workshop, 2008.
-
(2008)
Proc. Blizzard Challenge Workshop
-
-
Yamagishi, J.1
Zen, H.2
Wu, Y.3
Toda, T.4
Tokuda, K.5
-
36
-
-
85009139544
-
Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
-
T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," in Proc. Eurospeech, 1999, pp. 2347-2350.
-
(1999)
Proc. Eurospeech
, pp. 2347-2350
-
-
Yoshimura, T.1
Tokuda, K.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
37
-
-
4544253619
-
Adaptive training using structured transforms
-
K. Yu and M. Gales, "Adaptive training using structured transforms," in Proc. ICASSP, 2004, pp. 317-320.
-
(2004)
Proc. ICASSP
, pp. 317-320
-
-
Yu, K.1
Gales, M.2
-
38
-
-
79955538498
-
Context adaptive training with factorized decision trees for HMM-based statistical parametric speech synthesis
-
K. Yu, H. Zen, F. Mairesse, and S. Young, "Context adaptive training with factorized decision trees for HMM-based statistical parametric speech synthesis," Speech Commun., vol. 53, no. 6, pp. 914-923, 2011.
-
(2011)
Speech Commun
, vol.53
, Issue.6
, pp. 914-923
-
-
Yu, K.1
Zen, H.2
Mairesse, F.3
Young, S.4
-
39
-
-
79959813917
-
Speaker and language adaptive training for HMM-based polyglot speech synthesis
-
H. Zen, "Speaker and language adaptive training for HMM-based polyglot speech synthesis," in Proc. Interspeech, 2010, pp. 410-413.
-
(2010)
Proc. Interspeech
, pp. 410-413
-
-
Zen, H.1
-
41
-
-
84921798247
-
HMM-based polyglot speech synthesis by speaker and language adaptive training
-
H. Zen, N. Braunschweiler, S. Buchholz, K. Knill, S. Krstulović, and J. Latorre, "HMM-based polyglot speech synthesis by speaker and language adaptive training," in Proc. ISCA SSW7, 2010, pp. 186-191.
-
(2010)
Proc. ISCA SSW7
, pp. 186-191
-
-
Zen, H.1
Braunschweiler, N.2
Buchholz, S.3
Knill, K.4
Krstulovic, S.5
Latorre, J.6
-
42
-
-
67651002140
-
Statistical parametric speech synthesis
-
H. Zen, K. Tokuda, and A. Black, "Statistical parametric speech synthesis," Speech Commun., vol. 51, no. 11, pp. 1039-1064, 2009.
-
(2009)
Speech Commun
, vol.51
, Issue.11
, pp. 1039-1064
-
-
Zen, H.1
Tokuda, K.2
Black, A.3