메뉴 건너뛰기




Volumn , Issue , 2010, Pages 4606-4609

Improved modeling for F0 generation and V/U decision in HMM-based TTS

Author keywords

F0 generation; HMM based TTS; V U decision model; Voicing strength

Indexed keywords

EXTRACTION; PROBABILITY DENSITY FUNCTION; SIGNAL PROCESSING;

EID: 78049409326     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2010.5495561     Document Type: Conference Paper
Times cited : (25)

References (15)
  • 2
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive timefrequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. M. Katsuse, and A. D. Cheveigne, "Restructuring speech representations using a pitch-adaptive timefrequency smoothing and an instantaneous-frequency-based f0 extraction: possible role of a repetitive structure in sounds", Speech Communication, vol. 27, no. 3-4, pp. 187-207, 1999.
    • (1999) Speech Communication , vol.27 , Issue.3-4 , pp. 187-207
    • Kawahara, H.1    Katsuse, I.M.2    Cheveigne, A.D.3
  • 4
    • 11144317887 scopus 로고    scopus 로고
    • Robust F0 estimation of speech signal using harmonicity measure based on instantaneous frequency
    • Dec.
    • D. Arifianto, T. Tanaka, T. Masuko, and T. Kobayashi, "Robust F0 estimation of speech signal using harmonicity measure based on instantaneous frequency", IEICE Trans. Inf. & Syst., vol. E87-D, no. 12, pp. 2812-2820, Dec. 2004.
    • (2004) IEICE Trans. Inf. & Syst. , vol.E87-D , Issue.12 , pp. 2812-2820
    • Arifianto, D.1    Tanaka, T.2    Masuko, T.3    Kobayashi, T.4
  • 5
    • 84928118106 scopus 로고    scopus 로고
    • Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of F0 and periodicity
    • H. Kawahara, H. Katayose, A. Cheveigńe, and R. Patterson, "Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of F0 and periodicity", Proc .of EuroSpeech, 1999.
    • Proc.of EuroSpeech, 1999
    • Kawahara, H.1    Katayose, H.2    Cheveigńe, A.3    Patterson, R.4
  • 6
    • 0001455934 scopus 로고
    • A robust algorithm for pitch tracking (RAPT)
    • W. Kleijn and K. Paliwal, Eds. Elsevier
    • D. Talkin, "A robust algorithm for pitch tracking (RAPT)", in Speech Coding and Synthesis,W. Kleijn and K. Paliwal, Eds. Elsevier, 1995, pp. 495-518.
    • (1995) Speech Coding and Synthesis , pp. 495-518
    • Talkin, D.1
  • 7
    • 33749573927 scopus 로고    scopus 로고
    • Reformulating the HMM as a Trajectory Model by Imposing Explicit Relationships between static and Dynamic Feature Vector Sequences
    • H. Zen, K. Tokuda, and T. Kitamura, "Reformulating the HMM as a Trajectory Model by Imposing Explicit Relationships between static and Dynamic Feature Vector Sequences," Computer Speech & Language, vol. 21, no. 1, pp. 153-173, 2007.
    • (2007) Computer Speech & Language , vol.21 , Issue.1 , pp. 153-173
    • Zen, H.1    Tokuda, K.2    Kitamura, T.3
  • 8
    • 34547517493 scopus 로고    scopus 로고
    • Full HMM Training for Minimizing Generation Error in Synthesis
    • Y. Wu, R. Wang, and F. Soong, "Full HMM Training for Minimizing Generation Error in Synthesis," in Proc. ICASSP,2007.
    • Proc. ICASSP,2007
    • Wu, Y.1    Wang, R.2    Soong, F.3
  • 10
    • 0037567970 scopus 로고    scopus 로고
    • Pitch pattern generation using multi-space probability distribution hmm
    • T. Masuko, K. Tokuda, N. Miyazaki, and T. Kobayashi, "Pitch pattern generation using multi-space probability distribution hmm," IEICE Trans., vol. J83-D-II, no. 7, pp. 1600-1609, 2000
    • (2000) IEICE Trans. , vol.J83-D-II , Issue.7 , pp. 1600-1609
    • Masuko, T.1    Tokuda, K.2    Miyazaki, N.3    Kobayashi, T.4
  • 14
    • 0033906251 scopus 로고    scopus 로고
    • MDL-based Context-Dependent Sub-word Modeling for Speech Recognition
    • K. Shinoda, and T. Watanable, "MDL-based Context-Dependent Sub-word Modeling for Speech Recognition", J. Acoust. Soc. Jpn(E), vol.21, no.2, pp.79-86, 2000.
    • (2000) J. Acoust. Soc. Jpn(E) , vol.21 , Issue.2 , pp. 79-86
    • Shinoda, K.1    Watanable, T.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.