메뉴 건너뛰기




Volumn , Issue , 2009, Pages 1775-1778

Parameterization of vocal fry in HMM-based speech synthesis

Author keywords

Hidden markov models; Mixed excitation; Speech synthesis; STRAIGHT; Vocal fry

Indexed keywords

APERIODICITY; HIGH QUALITY; HMM-BASED SPEECH SYNTHESIS; MIXED EXCITATION; SPEECH QUALITY; SYNTHESIS EXPERIMENT; VOICE QUALITY;

EID: 70450183498     PISSN: None     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (22)

References (15)
  • 1
    • 85009139544 scopus 로고    scopus 로고
    • Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
    • T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," in Eurospeech, 1999, pp. 2347-2350.
    • (1999) Eurospeech , pp. 2347-2350
    • Yoshimura, T.1    Tokuda, K.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 2
    • 0025786649 scopus 로고
    • Vocal quality factors: Analysis, synthesis, and perception
    • D. Childers and C. Lee, "Vocal quality factors: Analysis, synthesis, and perception," J. Acoust. Soc. Am., vol. 90, no. 5, pp. 2394-2410, 1991.
    • (1991) J. Acoust. Soc. Am , vol.90 , Issue.5 , pp. 2394-2410
    • Childers, D.1    Lee, C.2
  • 3
    • 70450168688 scopus 로고    scopus 로고
    • H. Zen, T. Nose, J. Yamagishi, S. Sako, T. Masuko, A. Black, and K. Tokuda, The HMM-based speech synthesis system (HTS) version 2.0, in 6th ISCA Workshop on Speech Synthesis, 2007, pp. 294-299
    • H. Zen, T. Nose, J. Yamagishi, S. Sako, T. Masuko, A. Black, and K. Tokuda, "The HMM-based speech synthesis system (HTS) version 2.0," in 6th ISCA Workshop on Speech Synthesis, 2007, pp. 294-299.
  • 5
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive timefrequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using a pitch-adaptive timefrequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Communication, vol. 27, pp. 187-207, 1999.
    • (1999) Speech Communication , vol.27 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    de Cheveigné, A.3
  • 6
    • 84867209230 scopus 로고    scopus 로고
    • HMM-based Finnish text-to-speech system utilizing glottal inverse filtering
    • T. Raitio, A. Suni, H. Pulakka, M. Vainio, and P. Alku, "HMM-based Finnish text-to-speech system utilizing glottal inverse filtering," in Interspeech, 2008, pp. 1881-1884.
    • (2008) Interspeech , pp. 1881-1884
    • Raitio, T.1    Suni, A.2    Pulakka, H.3    Vainio, M.4    Alku, P.5
  • 8
    • 0014262782 scopus 로고
    • Perceptual study of vocal fry
    • H. Hollien and R. Wendahl, "Perceptual study of vocal fry," J. Acoust. Soc. Am., vol. 43, no. 3, pp. 506-509, 1968.
    • (1968) J. Acoust. Soc. Am , vol.43 , Issue.3 , pp. 506-509
    • Hollien, H.1    Wendahl, R.2
  • 9
    • 70450192283 scopus 로고    scopus 로고
    • Creaky voice as a prosodic feature in Finnish
    • A. Iivonen, "Creaky voice as a prosodic feature in Finnish," in Nordic Prosody, IX Conference, 2004, pp. 137-146.
    • (2004) Nordic Prosody, IX Conference , pp. 137-146
    • Iivonen, A.1
  • 10
    • 33750915991 scopus 로고    scopus 로고
    • STRAIGHT, exploitation of the other aspect of VOCODER: Perceptually isomorphic decomposition of speech sounds
    • H. Kawahara, "STRAIGHT, exploitation of the other aspect of VOCODER: Perceptually isomorphic decomposition of speech sounds," Acoust. Sci. & Tech., vol. 27, no. 6, pp. 349-353, 2006.
    • (2006) Acoust. Sci. & Tech , vol.27 , Issue.6 , pp. 349-353
    • Kawahara, H.1
  • 12
    • 33745184239 scopus 로고    scopus 로고
    • Nearly defect-free F0 trajectory extraction for expressive speech modifications based on STRAIGHT
    • H. Kawahara, A. de Cheveigné, H. Banno, T. Takahashi, and T. Irino, "Nearly defect-free F0 trajectory extraction for expressive speech modifications based on STRAIGHT," in Interspeech, 2005, pp. 537-540.
    • (2005) Interspeech , pp. 537-540
    • Kawahara, H.1    de Cheveigné, A.2    Banno, H.3    Takahashi, T.4    Irino, T.5
  • 13
    • 0025629485 scopus 로고
    • Pitch estimation and voicing detection based on a sinusoidal speech model
    • R. McAulay and T. Quatieri, "Pitch estimation and voicing detection based on a sinusoidal speech model," in ICASSP, 1990, pp. 249-252.
    • (1990) ICASSP , pp. 249-252
    • McAulay, R.1    Quatieri, T.2
  • 14
    • 84867227302 scopus 로고    scopus 로고
    • Evaluation of Finnish unit selection and HMM-based speech synthesis
    • H. Silén, E. Helander, J. Nurminen, and M. Gabbouj, "Evaluation of Finnish unit selection and HMM-based speech synthesis," in Interspeech, 2008, pp. 1853-1856.
    • (2008) Interspeech , pp. 1853-1856
    • Silén, H.1    Helander, E.2    Nurminen, J.3    Gabbouj, M.4
  • 15
    • 33745200051 scopus 로고    scopus 로고
    • Speech parameter generation algorithm considering global variance for HMM-based speech synthesis
    • T. Toda and K. Tokuda, "Speech parameter generation algorithm considering global variance for HMM-based speech synthesis," in Interspeech, 2005, pp. 2801-2804.
    • (2005) Interspeech , pp. 2801-2804
    • Toda, T.1    Tokuda, K.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.