메뉴 건너뛰기




Volumn 2, Issue , 2012, Pages 994-997

Analysis of speaker clustering strategies for HMM-based speech synthesis

Author keywords

Hidden Markov models; Speaker adaptation; Statistical parametric speech synthesis

Indexed keywords

AVERAGE VOICE MODELS; BETTER PERFORMANCE; HMM-BASED SPEECH SYNTHESIS; LISTENING TESTS; MULTIPLE LINEAR REGRESSIONS; SPEAKER ADAPTATION; SPEAKER CLUSTERING; STATISTICAL PARAMETRIC SPEECH SYNTHESIS;

EID: 84878404882     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (18)

References (17)
  • 1
    • 67651002140 scopus 로고    scopus 로고
    • Statistical parametric speech synthesis
    • Heiga Zen, Keiichi Tokuda, and Alan W. Black, "Statistical parametric speech synthesis, " Speech Communication, vol. 51, no. 11, pp. 1039-1064, 2009.
    • (2009) Speech Communication , vol.51 , Issue.11 , pp. 1039-1064
    • Zen, H.1    Tokuda, K.2    Black, A.W.3
  • 2
    • 67650854725 scopus 로고    scopus 로고
    • Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm
    • 1
    • J. Yamagishi, T. Kobayashi, Y. Nakano, K. Ogata, and J. Isogai, "Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm, " IEEE Trans. Speech, Audio & Language Process., vol. 17, no. 1, pp. 66-83, 1 2009.
    • (2009) IEEE Trans. Speech, Audio & Language Process. , vol.17 , Issue.1 , pp. 66-83
    • Yamagishi, J.1    Kobayashi, T.2    Nakano, Y.3    Ogata, K.4    Isogai, J.5
  • 4
    • 80051625997 scopus 로고    scopus 로고
    • Vocal attractiveness of statistical speech synthesisers
    • May
    • S. Andraszewicz, J. Yamagishi, and S. King, "Vocal attractiveness of statistical speech synthesisers, " in Proc. ICASSP 2011, May 2011, pp. 5368-5371.
    • (2011) Proc. ICASSP 2011 , pp. 5368-5371
    • Andraszewicz, S.1    Yamagishi, J.2    King, S.3
  • 5
    • 0018172553 scopus 로고
    • Multidimensional classification of normal voice qualities
    • S. Singh and T. Murry, "Multidimensional classification of normal voice qualities, " Journal of the Acoustical Society of America, vol. 64, no. 1, pp. 81-87, 1978.
    • (1978) Journal of the Acoustical Society of America , vol.64 , Issue.1 , pp. 81-87
    • Singh, S.1    Murry, T.2
  • 6
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. Cheveigne, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds, " Speech Commun., vol. 27, pp. 187-207, 1999.
    • (1999) Speech Commun. , vol.27 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    Cheveigne, A.3
  • 7
    • 84874199000 scopus 로고    scopus 로고
    • Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT
    • H. Kawahara, J. Estill, and O. Fujimura, "Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT, " Proc. Models and Analysis of Vocal Emissions for Biomedical Applications (MAVEBA), pp. 1-6, 2001.
    • (2001) Proc. Models and Analysis of Vocal Emissions for Biomedical Applications (MAVEBA) , pp. 1-6
    • Kawahara, H.1    Estill, J.2    Fujimura, O.3
  • 8
    • 51449117929 scopus 로고    scopus 로고
    • Modelling and synthesising F0 contours with the discrete cosine transform
    • J. Teutenberg, C. Wason, and P. Riddle, "Modelling and synthesising F0 contours with the discrete cosine transform, " in Proc ICASSP 2008, 2008, vol. 2008, pp. 3973-3976.
    • (2008) Proc ICASSP 2008 , vol.2008 , pp. 3973-3976
    • Teutenberg, J.1    Wason, C.2    Riddle, P.3
  • 9
    • 0035208673 scopus 로고    scopus 로고
    • Vocal intensity characteristics in normal and elderly speakers
    • F. Hodge, R. Colton, and R. Kelley, "Vocal intensity characteristics in normal and elderly speakers, " Journal of Voice, vol. 15, no. 4, pp. 503-511, 2001.
    • (2001) Journal of Voice , vol.15 , Issue.4 , pp. 503-511
    • Hodge, F.1    Colton, R.2    Kelley, R.3
  • 12
    • 0036985308 scopus 로고    scopus 로고
    • Harmonics-to-noise ratio: An index of vocal aging
    • C. T. Ferrand, "Harmonics-to-noise ratio: An index of vocal aging, " Journal of Voice, vol. 16, no. 4, pp. 480-487, 2002.
    • (2002) Journal of Voice , vol.16 , Issue.4 , pp. 480-487
    • Ferrand, C.T.1
  • 14
    • 7944221980 scopus 로고    scopus 로고
    • Spectral tilt as a cue to word segmentation in infancy and adulthood
    • E. D. Thiessen and J. R. Saffran, "Spectral tilt as a cue to word segmentation in infancy and adulthood, " Perception and Psychophysics, vol. 66, no. 5, pp. 779-791, 2004.
    • (2004) Perception and Psychophysics , vol.66 , Issue.5 , pp. 779-791
    • Thiessen, E.D.1    Saffran, J.R.2
  • 15
    • 69849091637 scopus 로고    scopus 로고
    • The contribution of changes in F0 and spectral tilt to increased intelligibility of speech produced in noise
    • Y. Lu and M. Cooke, "The contribution of changes in F0 and spectral tilt to increased intelligibility of speech produced in noise, " Speech Communication, vol. 51, pp. 1253-1262, 2009.
    • (2009) Speech Communication , vol.51 , pp. 1253-1262
    • Lu, Y.1    Cooke, M.2
  • 16
    • 84865787148 scopus 로고    scopus 로고
    • Correlation analysis of acoustic features with perceptual voice quality similarity for similar speaker selection
    • Y. Ijima, M. Isogai, and H. Mizuno, "Correlation analysis of acoustic features with perceptual voice quality similarity for similar speaker selection, " in Proc. Interspeech 2011, 2011, vol. 2011, pp. 2237-2240.
    • (2011) Proc. Interspeech 2011 , vol.2011 , pp. 2237-2240
    • Ijima, Y.1    Isogai, M.2    Mizuno, H.3
  • 17
    • 84865777002 scopus 로고    scopus 로고
    • The CSTR/EMIME HTS system for Blizzard Challenge 2010
    • Junichi Yamagishi and Oliver Watts, "The CSTR/EMIME HTS system for Blizzard Challenge 2010, " in Proc. Blizzard Challenge 2010, 2010.
    • (2010) Proc. Blizzard Challenge 2010
    • Yamagishi, J.1    Watts, O.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.