메뉴 건너뛰기




Volumn , Issue , 2014, Pages 2559-2563

Integration of speaker and pitch adaptive training for HMM-based singing voice synthesis

Author keywords

hidden Markov model; pitch adaptive training; singing voice synthesis; speaker adaptive training

Indexed keywords

SIGNAL PROCESSING;

EID: 84905234613     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2014.6854062     Document Type: Conference Paper
Times cited : (17)

References (16)
  • 1
    • 85009139544 scopus 로고    scopus 로고
    • Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
    • T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," in Proc. of Eurospeech, pp. 2347-2350, 1999.
    • (1999) Proc. of Eurospeech , pp. 2347-2350
    • Yoshimura, T.1    Tokuda, K.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 2
    • 0034842740 scopus 로고    scopus 로고
    • Adaptation of pitch and spectrum for HMM-based speech synthesis using mllr
    • M. Tamura, T. Masuko, K. Tokuda, and T. Kobayashi, "Adaptation of pitch and spectrum for HMM-based speech synthesis using mllr," in Proc. of ICASSP, pp. 805-808, 2001.
    • (2001) Proc. of ICASSP , pp. 805-808
    • Tamura, M.1    Masuko, T.2    Tokuda, K.3    Kobayashi, T.4
  • 7
    • 33847129573 scopus 로고    scopus 로고
    • Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training
    • J. Yamagishi and T. Kobayashi, "Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training," IEICE Trans. Inf. & Syst., vol. E-90D, no. 2, pp. 533-543, 2007.
    • (2007) IEICE Trans. Inf. & Syst. , vol.E-90D , Issue.2 , pp. 533-543
    • Yamagishi, J.1    Kobayashi, T.2
  • 8
    • 84867200235 scopus 로고    scopus 로고
    • Generating natural F0 trajectory with additive trees
    • Y. Qian, H. Liang, and F. K. Soong, "Generating natural F0 trajectory with additive trees," in Proc. of Interspeech, pp. 2126-2129, 2008.
    • (2008) Proc. of Interspeech , pp. 2126-2129
    • Qian, Y.1    Liang, H.2    Soong, F.K.3
  • 9
    • 70450161503 scopus 로고    scopus 로고
    • Context-dependent additive log F0 model for HMM-based speech synthesis
    • H. Zen and N. Braunschweiler, "Context-dependent additive log F0 model for HMM-based speech synthesis," in Proc. of Interspeech, pp. 2091-2094, 2094.
    • Proc. of Interspeech , pp. 2091-2094
    • Zen, H.1    Braunschweiler, N.2
  • 10
    • 84867584634 scopus 로고    scopus 로고
    • Pitch adaptive training for HMM-based singing voice synthesis
    • K. Oura, A. Mase, Y. Nankaku, and K. Tokuda, "Pitch adaptive training for HMM-based singing voice synthesis," in Proc. of ICASSP, pp. 5377-5380, 2012.
    • (2012) Proc. of ICASSP , pp. 5377-5380
    • Oura, K.1    Mase, A.2    Nankaku, Y.3    Tokuda, K.4
  • 11
    • 0028996993 scopus 로고
    • Speech parameter generation from HMM using dynamic features
    • K. Tokuda, T. Kobayashi, and S. Imai, "Speech parameter generation from HMM using dynamic features," in Proc. of ICASSP, pp. 660-663, 1995.
    • (1995) Proc. of ICASSP , pp. 660-663
    • Tokuda, K.1    Kobayashi, T.2    Imai, S.3
  • 12
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Computer Speech and Language, vol. 12, no. 2, pp. 75-98, 1998.
    • (1998) Computer Speech and Language , vol.12 , Issue.2 , pp. 75-98
    • Gales, M.J.F.1
  • 13
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, M. K. Ikuyo, and A. Cheneigne, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Communication, vol. 27, pp. 187-207, 1999.
    • (1999) Speech Communication , vol.27 , pp. 187-207
    • Kawahara, H.1    Ikuyo, M.K.2    Cheneigne, A.3
  • 15
    • 0032678076 scopus 로고    scopus 로고
    • Hidden markov models based on multi-space probability distribution for pitch pattern modeling
    • K. Tokuda, T. Masuko, N. Miyazaki, and T. Kobayashi, "Hidden markov models based on multi-space probability distribution for pitch pattern modeling," in Proc. of ICASSP, vol. 1, pp. 229-232, 1999.
    • (1999) Proc. of ICASSP , vol.1 , pp. 229-232
    • Tokuda, K.1    Masuko, T.2    Miyazaki, N.3    Kobayashi, T.4
  • 16
    • 0033906251 scopus 로고    scopus 로고
    • MDL-based contextdependent subword modeling for speech recognition
    • K. Shinoda and T. Watanabe, "MDL-based contextdependent subword modeling for speech recognition," J. Acoust. Soc. Jpn. (E), vol. 21, no. 2, pp. 76-86, 2000.
    • (2000) J. Acoust. Soc. Jpn. (E) , vol.21 , Issue.2 , pp. 76-86
    • Shinoda, K.1    Watanabe, T.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.