메뉴 건너뛰기




Volumn , Issue , 2014, Pages 1504-1508

Measuring the perceptual effects of modelling assumptions in speech synthesis using stimuli constructed from repeated natural speech

Author keywords

Acoustic modelling; Diagonal covariance matrices; Repeated speech; Speech synthesis; Stream independence

Indexed keywords

COVARIANCE MATRIX; SPEECH SYNTHESIS;

EID: 84910028520     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (35)

References (25)
  • 2
    • 84865801900 scopus 로고    scopus 로고
    • The effect of using normalized models in statistical speech synthesis
    • M. Shannon, H. Zen, and W. Byrne, "The effect of using normalized models in statistical speech synthesis, " in Proc. Inter Speech, 2011.
    • (2011) Proc. Inter Speech
    • Shannon, M.1    Zen, H.2    Byrne, W.3
  • 3
    • 0033708106 scopus 로고    scopus 로고
    • Speech parameter generation algorithms for HMM-based speech synthesis
    • K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMM-based speech synthesis, " in Proc. ICASSP, vol. 3, 2000, pp. 1315-1318.
    • (2000) Proc. ICASSP , vol.3 , pp. 1315-1318
    • Tokuda, K.1    Yoshimura, T.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 4
    • 84905253193 scopus 로고    scopus 로고
    • An experimental comparison of multiple vocoder types
    • Q. Hu, K. Richmond, J. Yamagishi, and J. Latorre, "An experimental comparison of multiple vocoder types, " in Proc. SSW8, 2013, pp. 155-160.
    • (2013) Proc. SSW8 , pp. 155-160
    • Hu, Q.1    Richmond, K.2    Yamagishi, J.3    Latorre, J.4
  • 5
    • 0032638856 scopus 로고    scopus 로고
    • Semi-tied covariance matrices for hidden Markov models
    • M. J. F. Gales, "Semi-tied covariance matrices for hidden Markov models, " IEEE T. Speech Audi. P., vol. 7, no. 3, pp. 272-281, 1999.
    • (1999) IEEE T. Speech Audi. P. , vol.7 , Issue.3 , pp. 272-281
    • Gales, M.J.F.1
  • 7
    • 84910063941 scopus 로고    scopus 로고
    • Investigating the shortcomings of HMM synthesis
    • T. Merritt and S. King, "Investigating the shortcomings of HMM synthesis, " in Proc. SSW8, 2013, pp. 185-190.
    • (2013) Proc. SSW8 , pp. 185-190
    • Merritt, T.1    King, S.2
  • 9
    • 70450184166 scopus 로고    scopus 로고
    • An assessment of automatic recognition techniques for spontaneous speech in comparison with human performance
    • T. Shinozaki and S. Furui, "An assessment of automatic recognition techniques for spontaneous speech in comparison with human performance, " in Proc. SSPR, 2003.
    • (2003) Proc. SSPR
    • Shinozaki, T.1    Furui, S.2
  • 10
    • 84858986605 scopus 로고    scopus 로고
    • A comparison of automatic and human speech recognition in null grammar
    • A. Juneja, "A comparison of automatic and human speech recognition in null grammar, " J. Acoust. Soc. Am., vol. 131, no. 3, pp. EL256-EL261, 2012.
    • (2012) J. Acoust. Soc. Am. , vol.131 , Issue.3 , pp. EL256-EL261
    • Juneja, A.1
  • 11
    • 84943154470 scopus 로고    scopus 로고
    • Fabricating conversational speech data with acoustic models: A program to examine model-data mismatch
    • D. McAllaster, L. Gillick, F. Scattone, and M. Newman, "Fabricating conversational speech data with acoustic models: A program to examine model-data mismatch, " in Proc. ICSLP, 1998.
    • (1998) Proc. ICSLP
    • McAllaster, D.1    Gillick, L.2    Scattone, F.3    Newman, M.4
  • 12
    • 84858952478 scopus 로고    scopus 로고
    • Don't multiply lightly: Quantifying problems with the acoustic model assumptions in speech recognition
    • D. Gillick, L. Gillick, and S. Wegmann, "Don't multiply lightly: quantifying problems with the acoustic model assumptions in speech recognition, " in Proc. ASRU, 2011, pp. 71-76.
    • (2011) Proc. ASRU , pp. 71-76
    • Gillick, D.1    Gillick, L.2    Wegmann, S.3
  • 13
    • 84856237844 scopus 로고    scopus 로고
    • An introduction to statistical parametric speech synthesis
    • S. King, "An introduction to statistical parametric speech synthesis, " Sadhana, vol. 36, no. 5, pp. 837-852, 2011.
    • (2011) Sadhana , vol.36 , Issue.5 , pp. 837-852
    • King, S.1
  • 14
    • 67651002140 scopus 로고    scopus 로고
    • Statistical parametric speech synthesis
    • H. Zen, K. Tokuda, and A. W. Black, "Statistical parametric speech synthesis, " Speech Commun., vol. 51, no. 11, pp. 1039- 1064, 2009.
    • (2009) Speech Commun. , vol.51 , Issue.11 , pp. 1039-1064
    • Zen, H.1    Tokuda, K.2    Black, A.W.3
  • 16
    • 33749573927 scopus 로고    scopus 로고
    • Reformulating the HMM as a trajectory model by imposing explicit relationships between static and dynamic feature vector sequences
    • H. Zen, K. Tokuda, and T. Kitamura, "Reformulating the HMM as a trajectory model by imposing explicit relationships between static and dynamic feature vector sequences, " Comput. Speech Lang., vol. 21, no. 1, pp. 153-173, 2007.
    • (2007) Comput. Speech Lang. , vol.21 , Issue.1 , pp. 153-173
    • Zen, H.1    Tokuda, K.2    Kitamura, T.3
  • 17
    • 84872190545 scopus 로고    scopus 로고
    • Autoregressive models for statistical parametric speech synthesis
    • M. Shannon, H. Zen, and W. Byrne, "Autoregressive models for statistical parametric speech synthesis, " IEEE T. Audio Speech, vol. 21, no. 3, pp. 587-597, 2013.
    • (2013) IEEE T. Audio Speech , vol.21 , Issue.3 , pp. 587-597
    • Shannon, M.1    Zen, H.2    Byrne, W.3
  • 19
    • 84910047268 scopus 로고    scopus 로고
    • Objective measurement of active speech level, Telecommunication Standardization Sector, Geneva, Switzerland, March
    • Objective measurement of active speech level, ITU Recommendation ITU-T P.56, International Telecommunication Union, Telecommunication Standardization Sector, Geneva, Switzerland, March 2011.
    • (2011) ITU Recommendation ITU-T P.56, International Telecommunication Union
  • 20
    • 33750915991 scopus 로고    scopus 로고
    • STRAIGHT, exploitation of the other aspect of VOCODER: Perceptually isomorphic decomposition of speech sounds
    • H. Kawahara, "STRAIGHT, exploitation of the other aspect of VOCODER: Perceptually isomorphic decomposition of speech sounds, " Acoust. Sci. Technol., vol. 27, no. 6, pp. 349-353, 2006.
    • (2006) Acoust. Sci. Technol. , vol.27 , Issue.6 , pp. 349-353
    • Kawahara, H.1
  • 21
    • 84910053549 scopus 로고    scopus 로고
    • Method for the subjective assessment of intermediate quality level of coding systems, International Telecommunication Union Radiocommunication Assembly, Geneva, Switzerland, March
    • Method for the subjective assessment of intermediate quality level of coding systems, ITU Recommendation ITU-R BS.1534-1, International Telecommunication Union Radiocommunication Assembly, Geneva, Switzerland, March 2003.
    • (2003) ITU Recommendation ITU-R BS.1534-1
  • 22
    • 38549096029 scopus 로고    scopus 로고
    • A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
    • T. Tomoki and K. Tokuda, "A speech parameter generation algorithm considering global variance for HMM-based speech synthesis, " IEICE Trans. Inf. Syst., vol. E90-D, no. 5, pp. 816-824, 2007.
    • (2007) IEICE Trans. Inf. Syst. , vol.E90-D , Issue.5 , pp. 816-824
    • Tomoki, T.1    Tokuda, K.2
  • 24
    • 84890495160 scopus 로고    scopus 로고
    • Fast, low-artifact speech synthesis considering global variance
    • M. Shannon and W. Byrne, "Fast, low-artifact speech synthesis considering global variance, " in Proc. ICASSP, 2013, pp. 7869- 7873.
    • (2013) Proc. ICASSP , pp. 7869-7873
    • Shannon, M.1    Byrne, W.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.