-
1
-
-
67651002140
-
Statistical parametric speech synthesis
-
Heiga Zen, Keiichi Tokuda, and Alan W. Black, "Statistical parametric speech synthesis, " Speech Communication, vol. 51, no. 11, pp. 1039-1064, 2009.
-
(2009)
Speech Communication
, vol.51
, Issue.11
, pp. 1039-1064
-
-
Zen, H.1
Tokuda, K.2
Black, A.W.3
-
2
-
-
67650854725
-
Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm
-
1
-
J. Yamagishi, T. Kobayashi, Y. Nakano, K. Ogata, and J. Isogai, "Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm, " IEEE Trans. Speech, Audio & Language Process., vol. 17, no. 1, pp. 66-83, 1 2009.
-
(2009)
IEEE Trans. Speech, Audio & Language Process.
, vol.17
, Issue.1
, pp. 66-83
-
-
Yamagishi, J.1
Kobayashi, T.2
Nakano, Y.3
Ogata, K.4
Isogai, J.5
-
3
-
-
77953708096
-
Thousands of voices for HMM-based speech synthesis-Analysis and application of TTS systems built on various ASR corpora
-
Jul
-
J. Yamagishi, B. Usabaev, S. King, O. Watts, J. Dines, J. Tian, R. Hu, K. Oura, K. Tokuda, R. Karhila, and M. Kurimo, "Thousands of voices for HMM-based speech synthesis-Analysis and application of TTS systems built on various ASR corpora, " IEEE Trans. Speech, Audio & Language Process., vol. 18, pp. 984-1004, Jul. 2010.
-
(2010)
IEEE Trans. Speech, Audio & Language Process.
, vol.18
, pp. 984-1004
-
-
Yamagishi, J.1
Usabaev, B.2
King, S.3
Watts, O.4
Dines, J.5
Tian, J.6
Hu, R.7
Oura, K.8
Tokuda, K.9
Karhila, R.10
Kurimo, M.11
-
4
-
-
80051625997
-
Vocal attractiveness of statistical speech synthesisers
-
May
-
S. Andraszewicz, J. Yamagishi, and S. King, "Vocal attractiveness of statistical speech synthesisers, " in Proc. ICASSP 2011, May 2011, pp. 5368-5371.
-
(2011)
Proc. ICASSP 2011
, pp. 5368-5371
-
-
Andraszewicz, S.1
Yamagishi, J.2
King, S.3
-
5
-
-
0018172553
-
Multidimensional classification of normal voice qualities
-
S. Singh and T. Murry, "Multidimensional classification of normal voice qualities, " Journal of the Acoustical Society of America, vol. 64, no. 1, pp. 81-87, 1978.
-
(1978)
Journal of the Acoustical Society of America
, vol.64
, Issue.1
, pp. 81-87
-
-
Singh, S.1
Murry, T.2
-
6
-
-
0032673049
-
Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
-
H. Kawahara, I. Masuda-Katsuse, and A. Cheveigne, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds, " Speech Commun., vol. 27, pp. 187-207, 1999.
-
(1999)
Speech Commun.
, vol.27
, pp. 187-207
-
-
Kawahara, H.1
Masuda-Katsuse, I.2
Cheveigne, A.3
-
7
-
-
84874199000
-
Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT
-
H. Kawahara, J. Estill, and O. Fujimura, "Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT, " Proc. Models and Analysis of Vocal Emissions for Biomedical Applications (MAVEBA), pp. 1-6, 2001.
-
(2001)
Proc. Models and Analysis of Vocal Emissions for Biomedical Applications (MAVEBA)
, pp. 1-6
-
-
Kawahara, H.1
Estill, J.2
Fujimura, O.3
-
8
-
-
51449117929
-
Modelling and synthesising F0 contours with the discrete cosine transform
-
J. Teutenberg, C. Wason, and P. Riddle, "Modelling and synthesising F0 contours with the discrete cosine transform, " in Proc ICASSP 2008, 2008, vol. 2008, pp. 3973-3976.
-
(2008)
Proc ICASSP 2008
, vol.2008
, pp. 3973-3976
-
-
Teutenberg, J.1
Wason, C.2
Riddle, P.3
-
9
-
-
0035208673
-
Vocal intensity characteristics in normal and elderly speakers
-
F. Hodge, R. Colton, and R. Kelley, "Vocal intensity characteristics in normal and elderly speakers, " Journal of Voice, vol. 15, no. 4, pp. 503-511, 2001.
-
(2001)
Journal of Voice
, vol.15
, Issue.4
, pp. 503-511
-
-
Hodge, F.1
Colton, R.2
Kelley, R.3
-
10
-
-
52949129401
-
Voice loudness and gender effects on jitter and shimmer in healthy adults
-
M. Brockmann, C. Storck, P. N. Carding, and M. J. Drinnan, "Voice loudness and gender effects on jitter and shimmer in healthy adults, " Journal of Speech, Language, and Hearing Research, vol. 51, pp. 1152-1160, 2008.
-
(2008)
Journal of Speech, Language, and Hearing Research
, vol.51
, pp. 1152-1160
-
-
Brockmann, M.1
Storck, C.2
Carding, P.N.3
Drinnan, M.J.4
-
11
-
-
0029836816
-
Differentiated perceptual evaluation of pathological voice quality: Reliability and correlations with acoustic measurements
-
P. Dejonckere, M. Remacle, E. Fresnel-Elbaz, V. Woisard, L. Crevier-Buchmann, and B. Millet, "Differentiated perceptual evaluation of pathological voice quality: Reliability and correlations with acoustic measurements, " Revue de laryngologie, dotologie et de rhinologie, vol. 117, no. 3, pp. 219-224, 1996.
-
(1996)
Revue de Laryngologie, Dotologie et de Rhinologie
, vol.117
, Issue.3
, pp. 219-224
-
-
Dejonckere, P.1
Remacle, M.2
Fresnel-Elbaz, E.3
Woisard, V.4
Crevier-Buchmann, L.5
Millet, B.6
-
12
-
-
0036985308
-
Harmonics-to-noise ratio: An index of vocal aging
-
C. T. Ferrand, "Harmonics-to-noise ratio: An index of vocal aging, " Journal of Voice, vol. 16, no. 4, pp. 480-487, 2002.
-
(2002)
Journal of Voice
, vol.16
, Issue.4
, pp. 480-487
-
-
Ferrand, C.T.1
-
13
-
-
74549192033
-
Vocal attractiveness increases by averaging
-
Laetitia Bruckert, Patricia Bestelmeyer, Marianne Latinus, Julien Rouger, Ian Charest, Guillaume A. Rousselet, Hideki Kawahara, and Pascal Belin, "Vocal attractiveness increases by averaging, " Current Biology, vol. 20, no. 2, pp. 116-120, 2010.
-
(2010)
Current Biology
, vol.20
, Issue.2
, pp. 116-120
-
-
Bruckert, L.1
Bestelmeyer, P.2
Latinus, M.3
Rouger, J.4
Charest, I.5
Rousselet, G.A.6
Kawahara, H.7
Belin, P.8
-
14
-
-
7944221980
-
Spectral tilt as a cue to word segmentation in infancy and adulthood
-
E. D. Thiessen and J. R. Saffran, "Spectral tilt as a cue to word segmentation in infancy and adulthood, " Perception and Psychophysics, vol. 66, no. 5, pp. 779-791, 2004.
-
(2004)
Perception and Psychophysics
, vol.66
, Issue.5
, pp. 779-791
-
-
Thiessen, E.D.1
Saffran, J.R.2
-
15
-
-
69849091637
-
The contribution of changes in F0 and spectral tilt to increased intelligibility of speech produced in noise
-
Y. Lu and M. Cooke, "The contribution of changes in F0 and spectral tilt to increased intelligibility of speech produced in noise, " Speech Communication, vol. 51, pp. 1253-1262, 2009.
-
(2009)
Speech Communication
, vol.51
, pp. 1253-1262
-
-
Lu, Y.1
Cooke, M.2
-
16
-
-
84865787148
-
Correlation analysis of acoustic features with perceptual voice quality similarity for similar speaker selection
-
Y. Ijima, M. Isogai, and H. Mizuno, "Correlation analysis of acoustic features with perceptual voice quality similarity for similar speaker selection, " in Proc. Interspeech 2011, 2011, vol. 2011, pp. 2237-2240.
-
(2011)
Proc. Interspeech 2011
, vol.2011
, pp. 2237-2240
-
-
Ijima, Y.1
Isogai, M.2
Mizuno, H.3
|