메뉴 건너뛰기




Volumn , Issue , 2010, Pages 334-339

Comparison of Formant Enhancement Methods for HMM-Based Speech Synthesis

Author keywords

formant enhancement; hidden Markov model; over smoothing; speech synthesis

Indexed keywords

SPEECH SYNTHESIS;

EID: 84865718521     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (21)

References (26)
  • 1
    • 85031628788 scopus 로고
    • An algorithm for speech parameter generation from continuous mixture HMMs with dynamic features
    • Tokuda, K., Masuko, T., Yamada, T., Kobayashi, T. and Imai, S., “An algorithm for speech parameter generation from continuous mixture HMMs with dynamic features”, in Proc. Eurospeech, 1:757-760, 1995.
    • (1995) Proc. Eurospeech , vol.1 , pp. 757-760
    • Tokuda, K.1    Masuko, T.2    Yamada, T.3    Kobayashi, T.4    Imai, S.5
  • 2
    • 85009139544 scopus 로고    scopus 로고
    • Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
    • Sep
    • Yoshimura, T., Tokuda, K., Masuko, T., Kobayashi, T. and Kitamura, T., “Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis”, in Proc. Eurospeech, 2374-2350, Sep. 1999.
    • (1999) Proc. Eurospeech , pp. 2374-2350
    • Yoshimura, T.1    Tokuda, K.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 4
    • 67651002140 scopus 로고    scopus 로고
    • Statistical parametric speech synthesis
    • Zen, H., Tokuda, K. and Black, A. W., “Statistical parametric speech synthesis”, Speech Commun., 51(11):1039-1064, 2009.
    • (2009) Speech Commun , vol.51 , Issue.11 , pp. 1039-1064
    • Zen, H.1    Tokuda, K.2    Black, A. W.3
  • 5
    • 0016495091 scopus 로고
    • Linear prediction: A tutorial review
    • Apr
    • Makhoul, J., “Linear prediction: A tutorial review”, in Proc. of the IEEE, 63(4):561-580, Apr. 1975.
    • (1975) Proc. of the IEEE , vol.63 , Issue.4 , pp. 561-580
    • Makhoul, J.1
  • 6
    • 85016140477 scopus 로고
    • An adaptive algorithm for mel-cepstral analysis of speech
    • Fukada, T., Tokuda, K., Kobayashi, T., Imai, S., “An adaptive algorithm for mel-cepstral analysis of speech”, in Proc. ICASSP, 137-140, 1992.
    • (1992) Proc. ICASSP , pp. 137-140
    • Fukada, T.1    Tokuda, K.2    Kobayashi, T.3    Imai, S.4
  • 7
    • 85009231267 scopus 로고    scopus 로고
    • Trajectory modeling based on HMMs with the explicit relationship between static and dynamic features
    • Sep
    • Tokuda, K., Zen, H. and Kitamura, T., “Trajectory modeling based on HMMs with the explicit relationship between static and dynamic features”, In Proc. Eurospeech, 865-868. Sep. 2003.
    • (2003) Proc. Eurospeech , pp. 865-868
    • Tokuda, K.1    Zen, H.2    Kitamura, T.3
  • 8
    • 33846429403 scopus 로고    scopus 로고
    • Minimum generation error training for HMM-based speech synthesis
    • Wu, Y.-J. and Wang, R.-H., “Minimum generation error training for HMM-based speech synthesis”, in Proc. ICASSP, 89-92, 2006.
    • (2006) Proc. ICASSP , pp. 89-92
    • Wu, Y.-J.1    Wang, R.-H.2
  • 9
    • 38549096029 scopus 로고    scopus 로고
    • A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
    • May
    • Toda, T. and Tokuda, K., “A speech parameter generation algorithm considering global variance for HMM-based speech synthesis”, IEICE Trans. Inf. & Syst., E90-D(5):816-824, May 2007.
    • (2007) IEICE Trans. Inf. & Syst , vol.E90-D , Issue.5 , pp. 816-824
    • Toda, T.1    Tokuda, K.2
  • 10
    • 34547553049 scopus 로고    scopus 로고
    • A study on conditional parameter generation from HMM based on maximum likelihood criterion
    • [in Japanese]
    • Masuko, T., Tokuda, K. and Kobayashi, T., “A study on conditional parameter generation from HMM based on maximum likelihood criterion”, in Proc. Autumn Meeting of ASJ, 209-210, 2003. [in Japanese]
    • (2003) Proc. Autumn Meeting of ASJ , pp. 209-210
    • Masuko, T.1    Tokuda, K.2    Kobayashi, T.3
  • 12
    • 0029219433 scopus 로고
    • Adaptive postfiltering for quality enhancement of coded speech
    • Jan
    • Chen, J.-H. and Gersho, A., “Adaptive postfiltering for quality enhancement of coded speech”, IEEE Trans. on Speech and Audio Processing, 3(1):59-71, Jan. 1995.
    • (1995) IEEE Trans. on Speech and Audio Processing , vol.3 , Issue.1 , pp. 59-71
    • Chen, J.-H.1    Gersho, A.2
  • 15
    • 34547505386 scopus 로고    scopus 로고
    • Ph.D Thesis, University of Science and Technology of China, [in Chinese]
    • Wu, Y.-J., “Research on HMM-based Speech Synthesis”, Ph.D Thesis, University of Science and Technology of China, 2006. [in Chinese]
    • (2006) Research on HMM-based Speech Synthesis
    • Wu, Y.-J.1
  • 16
    • 67650797364 scopus 로고    scopus 로고
    • Postfiltering for HMM-based speech synthesis using mel-LSPs
    • [in Japanese]
    • Oura, K., Zen, H., Nankaku, Y., Lee, A. and Tokuda, K., “Postfiltering for HMM-based speech synthesis using mel-LSPs”, Proc. Autumn Meeting of ASJ, pp. 367-368, 2007. [in Japanese]
    • (2007) Proc. Autumn Meeting of ASJ , pp. 367-368
    • Oura, K.1    Zen, H.2    Nankaku, Y.3    Lee, A.4    Tokuda, K.5
  • 17
    • 0002557614 scopus 로고
    • Line spectrum pair (LSP) and speech data compression
    • Soong, F. K. and Juang, B.-H., “Line spectrum pair (LSP) and speech data compression”, Proc. ICASSP, 9:37-40, 1984.
    • (1984) Proc. ICASSP , vol.9 , pp. 37-40
    • Soong, F. K.1    Juang, B.-H.2
  • 19
    • 84867209230 scopus 로고    scopus 로고
    • HMM-based Finnish text-to-speech system utilizing glottal inverse filtering
    • Raitio, T., Suni, A., Pulakka, H., Vainio, M. and Alku, P., “HMM-based Finnish text-to-speech system utilizing glottal inverse filtering”, Proc. Interspeech, 2008.
    • (2008) Proc. Interspeech
    • Raitio, T.1    Suni, A.2    Pulakka, H.3    Vainio, M.4    Alku, P.5
  • 21
    • 0032875050 scopus 로고    scopus 로고
    • A method for generating natural-sounding speech stimuli for cognitive brain research
    • Alku, P., Tiitinen, H. and Näätänen, R., “A method for generating natural-sounding speech stimuli for cognitive brain research”, Clinical Neurophysiology, 110:1329-1333, 1999.
    • (1999) Clinical Neurophysiology , vol.110 , pp. 1329-1333
    • Alku, P.1    Tiitinen, H.2    Näätänen, R.3
  • 22
    • 0026881384 scopus 로고
    • Glottal wave analysis with pitch synchronous iterative adaptive inverse filtering
    • Jun
    • Alku, P., “Glottal wave analysis with pitch synchronous iterative adaptive inverse filtering”, Speech Commun. 11(2-3):109-118, Jun. 1992.
    • (1992) Speech Commun , vol.11 , Issue.2-3 , pp. 109-118
    • Alku, P.1
  • 23
    • 0030101058 scopus 로고    scopus 로고
    • A revision of Zwicker's loudness model
    • Moore, B. C. J. and Glasberg, B. R., “A revision of Zwicker's loudness model”, ACTA Acustica, 82:335-345, 1996.
    • (1996) ACTA Acustica , vol.82 , pp. 335-345
    • Moore, B. C. J.1    Glasberg, B. R.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.