메뉴 건너뛰기




Volumn , Issue , 2009, Pages 408-411

A minimum v/u error approach to F0 generation in HMM-based TTS

Author keywords

HMM based TTS; Speech synthesis; v u decision

Indexed keywords

FEATURE VECTORS; HMM MODELS; HMM-BASED TTS; KEY FACTORS; NEW APPROACHES; PITCH-TRACKING; POSTERIOR PROBABILITY; PREDICTION PERFORMANCE; PRIOR KNOWLEDGE; SWITCHING POINTS; VOICE QUALITY;

EID: 70450169782     PISSN: None     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (6)

References (12)
  • 3
    • 77249139677 scopus 로고    scopus 로고
    • An HMM-based Mandarin Chinese Text-To-Speech System
    • Proc. of ISCSLP, Springer
    • Y. Qian, F. K. Soong, Y. N. Chen, and M. Chu, "An HMM-based Mandarin Chinese Text-To-Speech System", Proc. of ISCSLP , Springer LNAI Vol. 4274, pp.223-232, 2006.
    • (2006) LNAI , vol.4274 , pp. 223-232
    • Qian, Y.1    Soong, F.K.2    Chen, Y.N.3    Chu, M.4
  • 4
    • 33847129573 scopus 로고    scopus 로고
    • Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training
    • Feb
    • J. Yamagishi and T. Kobayashi, "Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training", IEICE Trans. Inf. & Syst., vol. E90-D, no. 2, pp. 533-543, Feb. 2007.
    • (2007) IEICE Trans. Inf. & Syst , vol.E90-D , Issue.2 , pp. 533-543
    • Yamagishi, J.1    Kobayashi, T.2
  • 5
    • 51449114529 scopus 로고    scopus 로고
    • A style control technique for HMM-based expressive speech synthesis
    • Sep
    • T. Nose, J. Yamagishi, and T. Kobayashi, "A style control technique for HMM-based expressive speech synthesis", IEICE Trans. Inf. & Syst., vol. E90-D, no. 9, pp. 1406-1413, Sep. 2007.
    • (2007) IEICE Trans. Inf. & Syst , vol.E90-D , Issue.9 , pp. 1406-1413
    • Nose, T.1    Yamagishi, J.2    Kobayashi, T.3
  • 6
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive timefrequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. M. Katsuse, and A. D. Cheveigne, "Restructuring speech representations using a pitch-adaptive timefrequency smoothing and an instantaneous-frequency-based f0 extraction: possible role of a repetitive structure in sounds", Speech Communication, vol. 27, no. 3-4, pp. 187-207, 1999.
    • (1999) Speech Communication , vol.27 , Issue.3-4 , pp. 187-207
    • Kawahara, H.1    Katsuse, I.M.2    Cheveigne, A.D.3
  • 8
    • 11144317887 scopus 로고    scopus 로고
    • Robust F0 estimation of speech signal using harmonicity measure based on instantaneous frequency
    • Dec
    • D. Arifianto, T. Tanaka, T. Masuko, and T. Kobayashi, "Robust F0 estimation of speech signal using harmonicity measure based on instantaneous frequency", IEICE Trans. Inf. & Syst., vol. E87-D, no. 12, pp. 2812-2820, Dec. 2004.
    • (2004) IEICE Trans. Inf. & Syst , vol.E87-D , Issue.12 , pp. 2812-2820
    • Arifianto, D.1    Tanaka, T.2    Masuko, T.3    Kobayashi, T.4
  • 9
    • 84928118106 scopus 로고    scopus 로고
    • Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of F0 and periodicity
    • H. Kawahara, H. Katayose, A. Cheveigné, and R. Patterson, "Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of F0 and periodicity", Proc .of EuroSpeech, 1999.
    • (1999) Proc .of EuroSpeech
    • Kawahara, H.1    Katayose, H.2    Cheveigné, A.3    Patterson, R.4
  • 10
    • 0001455934 scopus 로고
    • A robust algorithm for pitch tracking (RAPT)
    • W. Kleijn and K. Paliwal, Eds. Elsevier
    • D. Talkin, "A robust algorithm for pitch tracking (RAPT)", in Speech Coding and Synthesis,W. Kleijn and K. Paliwal, Eds. Elsevier, 1995, pp. 495-518.
    • (1995) Speech Coding and Synthesis , pp. 495-518
    • Talkin, D.1
  • 12
    • 0033906251 scopus 로고    scopus 로고
    • MDL-based Context-Dependent Sub-word Modeling for Speech Recognition
    • K. Shinoda, and T. Watanable, "MDL-based Context-Dependent Sub-word Modeling for Speech Recognition", J. Acoust. Soc. Jpn(E), vol.21, no.2, pp.79-86, 2000.
    • (2000) J. Acoust. Soc. Jpn(E) , vol.21 , Issue.2 , pp. 79-86
    • Shinoda, K.1    Watanable, T.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.