-
2
-
-
84865801900
-
The effect of using normalized models in statistical speech synthesis
-
M. Shannon, H. Zen, and W. Byrne, "The effect of using normalized models in statistical speech synthesis," in Proc. Interspeech, 2011.
-
(2011)
Proc. Interspeech
-
-
Shannon, M.1
Zen, H.2
Byrne, W.3
-
3
-
-
0033708106
-
Speech parameter generation algorithms for HMM-based speech synthesis
-
K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMM-based speech synthesis," in Proc. ICASSP, vol. 3, 2000, pp. 1315-1318.
-
(2000)
Proc. ICASSP
, vol.3
, pp. 1315-1318
-
-
Tokuda, K.1
Yoshimura, T.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
4
-
-
84905253193
-
An experimental comparison of multiple vocoder types
-
Q. Hu, K. Richmond, J. Yamagishi, and J. Latorre, "An experimental comparison of multiple vocoder types," in Proc. SSW8, 2013, pp. 155-160.
-
(2013)
Proc. SSW8
, pp. 155-160
-
-
Hu, Q.1
Richmond, K.2
Yamagishi, J.3
Latorre, J.4
-
5
-
-
0032638856
-
Semi-tied covariance matrices for hidden Markov models
-
M. J. F. Gales, "Semi-tied covariance matrices for hidden Markov models," IEEE T. Speech Audi. P., vol. 7, no. 3, pp. 272-281, 1999.
-
(1999)
IEEE T. Speech Audi. P.
, vol.7
, Issue.3
, pp. 272-281
-
-
Gales, M.J.F.1
-
6
-
-
85133720638
-
The HMM-based speech synthesis system (HTS) version 2.0
-
H. Zen, T. Nose, J. Yamagishi, S. Sako, T. Masuko, A. Black, and K. Tokuda, "The HMM-based speech synthesis system (HTS) version 2.0," in Proc. SSW 6, 2007, pp. 294-299.
-
(2007)
Proc. SSW
, vol.6
, pp. 294-299
-
-
Zen, H.1
Nose, T.2
Yamagishi, J.3
Sako, S.4
Masuko, T.5
Black, A.6
Tokuda, K.7
-
7
-
-
84910063941
-
Investigating the shortcomings of HMM synthesis
-
T. Merritt and S. King, "Investigating the shortcomings of HMM synthesis," in Proc. SSW8, 2013, pp. 185-190.
-
(2013)
Proc. SSW8
, pp. 185-190
-
-
Merritt, T.1
King, S.2
-
8
-
-
0004056285
-
-
Upper Saddle River, NJ: Prentice Hall
-
X. Huang, A. Acero, and H.-W. Hon, Spoken Language Processing. Upper Saddle River, NJ: Prentice Hall, 2001, p. 12.
-
(2001)
Spoken Language Processing
-
-
Huang, X.1
Acero, A.2
Hon, H.-W.3
-
9
-
-
70450184166
-
An assessment of automatic recognition techniques for spontaneous speech in comparison with human performance
-
T. Shinozaki and S. Furui, "An assessment of automatic recognition techniques for spontaneous speech in comparison with human performance," in Proc. SSPR, 2003.
-
(2003)
Proc. SSPR
-
-
Shinozaki, T.1
Furui, S.2
-
10
-
-
84858986605
-
A comparison of automatic and human speech recognition in null grammar
-
A. Juneja, "A comparison of automatic and human speech recognition in null grammar," J. Acoust. Soc. Am., vol. 131, no. 3, pp. EL256-EL261, 2012.
-
(2012)
J. Acoust. Soc. Am.
, vol.131
, Issue.3
, pp. EL256-EL261
-
-
Juneja, A.1
-
11
-
-
84943154470
-
Fabricating conversational speech data with acoustic models: A program to examine model-data mismatch
-
D. McAllaster, L. Gillick, F. Scattone, and M. Newman, "Fabricating conversational speech data with acoustic models: A program to examine model-data mismatch," in Proc. ICSLP, 1998.
-
(1998)
Proc. ICSLP
-
-
McAllaster, D.1
Gillick, L.2
Scattone, F.3
Newman, M.4
-
12
-
-
84858952478
-
Don't multiply lightly: Quantifying problems with the acoustic model assumptions in speech recognition
-
D. Gillick, L. Gillick, and S. Wegmann, "Don't multiply lightly: Quantifying problems with the acoustic model assumptions in speech recognition," in Proc. ASRU, 2011, pp. 71-76.
-
(2011)
Proc. ASRU
, pp. 71-76
-
-
Gillick, D.1
Gillick, L.2
Wegmann, S.3
-
13
-
-
84856237844
-
An introduction to statistical parametric speech synthesis
-
S. King, "An introduction to statistical parametric speech synthesis," Sadhana, vol. 36, no. 5, pp. 837-852, 2011.
-
(2011)
Sadhana
, vol.36
, Issue.5
, pp. 837-852
-
-
King, S.1
-
14
-
-
67651002140
-
Statistical parametric speech synthesis
-
H. Zen, K. Tokuda, and A. W. Black, "Statistical parametric speech synthesis," Speech Commun., vol. 51, no. 11, pp. 1039-1064, 2009.
-
(2009)
Speech Commun.
, vol.51
, Issue.11
, pp. 1039-1064
-
-
Zen, H.1
Tokuda, K.2
Black, A.W.3
-
16
-
-
33749573927
-
Reformulating the HMM as a trajectory model by imposing explicit relationships between static and dynamic feature vector sequences
-
H. Zen, K. Tokuda, and T. Kitamura, "Reformulating the HMM as a trajectory model by imposing explicit relationships between static and dynamic feature vector sequences," Comput. Speech Lang., vol. 21, no. 1, pp. 153-173, 2007.
-
(2007)
Comput. Speech Lang.
, vol.21
, Issue.1
, pp. 153-173
-
-
Zen, H.1
Tokuda, K.2
Kitamura, T.3
-
17
-
-
84872190545
-
Autoregressive models for statistical parametric speech synthesis
-
M. Shannon, H. Zen, and W. Byrne, "Autoregressive models for statistical parametric speech synthesis," IEEE T. Audio Speech, vol. 21, no. 3, pp. 587-597, 2013.
-
(2013)
IEEE T. Audio Speech
, vol.21
, Issue.3
, pp. 587-597
-
-
Shannon, M.1
Zen, H.2
Byrne, W.3
-
18
-
-
0014568991
-
IEEE recommended practice for speech quality measurements
-
E. H. Rothauser, W. D. Chapman, N. Guttman, K. S. Nordby, H. R. Silbiger, G. E. Urbanek, and M. Weinstock, "IEEE recommended practice for speech quality measurements," IEEE T. Acoust. Speech, vol. 17, no. 3, pp. 225-246, 1969.
-
(1969)
IEEE T. Acoust. Speech
, vol.17
, Issue.3
, pp. 225-246
-
-
Rothauser, E.H.1
Chapman, W.D.2
Guttman, N.3
Nordby, K.S.4
Silbiger, H.R.5
Urbanek, G.E.6
Weinstock, M.7
-
19
-
-
84910047268
-
-
Objective measurement of active speech level, Telecommunication Standardization Sector, Geneva, Switzerland, March
-
Objective measurement of active speech level, ITU Recommendation ITU-T P.56, International Telecommunication Union, Telecommunication Standardization Sector, Geneva, Switzerland, March 2011.
-
(2011)
ITU Recommendation ITU-T P.56, International Telecommunication Union
-
-
-
20
-
-
33750915991
-
STRAIGHT, exploitation of the other aspect of VOCODER: Perceptually isomorphic decomposition of speech sounds
-
H. Kawahara, "STRAIGHT, exploitation of the other aspect of VOCODER: Perceptually isomorphic decomposition of speech sounds," Acoust. Sci. Technol., vol. 27, no. 6, pp. 349-353, 2006.
-
(2006)
Acoust. Sci. Technol.
, vol.27
, Issue.6
, pp. 349-353
-
-
Kawahara, H.1
-
21
-
-
84910053549
-
-
Method for the subjective assessment of intermediate quality level of coding systems, International Telecommunication Union Radiocommunication Assembly, Geneva, Switzerland, March
-
Method for the subjective assessment of intermediate quality level of coding systems, ITU Recommendation ITU-R BS.1534-1, International Telecommunication Union Radiocommunication Assembly, Geneva, Switzerland, March 2003.
-
(2003)
ITU Recommendation ITU-R BS.1534-1
-
-
-
22
-
-
38549096029
-
A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
-
T. Toda and K. Tokuda, "A speech parameter generation algorithm considering global variance for HMM-based speech synthesis," IEICE Trans. Inf. Syst., vol. E90-D, no. 5, pp. 816-824, 2007.
-
(2007)
IEICE Trans. Inf. Syst.
, vol.E90-D
, Issue.5
, pp. 816-824
-
-
Toda, T.1
Tokuda, K.2
-
23
-
-
84878384520
-
Ways to implement global variance in statistical speech synthesis
-
H. Silén, E. Helander, J. Nurminen, and M. Gabbouj, "Ways to implement global variance in statistical speech synthesis," in Proc. Interspeech, 2012.
-
(2012)
Proc. Interspeech
-
-
Silén, H.1
Helander, E.2
Nurminen, J.3
Gabbouj, M.4
-
24
-
-
84890495160
-
Fast, low-artifact speech synthesis considering global variance
-
M. Shannon and W. Byrne, "Fast, low-artifact speech synthesis considering global variance," in Proc. ICASSP, 2013, pp. 7869-7873.
-
(2013)
Proc. ICASSP
, pp. 7869-7873
-
-
Shannon, M.1
Byrne, W.2