메뉴 건너뛰기




Volumn , Issue , 2010, Pages 211-216

Recent Development of the HMM-based Singing Voice Synthesis System - Sinsy

Author keywords

HMM based speech synthesis; singing voice synthesis

Indexed keywords

MUSIC; SPEECH SYNTHESIS;

EID: 84876667508     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (97)

References (33)
  • 1
    • 85009139544 scopus 로고    scopus 로고
    • Simultaneous Modeling of Spectrum, Pitch and Duration in HMM-Based Speech Synthesis
    • T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, “Simultaneous Modeling of Spectrum, Pitch and Duration in HMM-Based Speech Synthesis,” Proc. of Eurospeech, pp. 2347-2350, 1999.
    • (1999) Proc. of Eurospeech , pp. 2347-2350
    • Yoshimura, T.1    Tokuda, K.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 13
    • 0038000318 scopus 로고
    • Spectral Estimation of Speech by Mel-Generalized Cepstral Analysis
    • K. Tokuda, T. Kobayashi, T. Chiba, and S. Imai, “Spectral Estimation of Speech by Mel-Generalized Cepstral Analysis,” IEICE Trans. vol. 75-A, no. 7, pp. 1124-1134, 1992.
    • (1992) IEICE Trans , vol.75-A , Issue.7 , pp. 1124-1134
    • Tokuda, K.1    Kobayashi, T.2    Chiba, T.3    Imai, S.4
  • 14
  • 15
    • 0020596154 scopus 로고
    • Cepstral Analysis Synthesis on the Mel Frequency Scale
    • S. Imai, “Cepstral Analysis Synthesis on the Mel Frequency Scale,” Proc. of ICASSP, pp. 93-96, 1983.
    • (1983) Proc. of ICASSP , pp. 93-96
    • Imai, S.1
  • 17
    • 68749108220 scopus 로고    scopus 로고
    • A Fully Consistent Hidden Semi-Markov Model-Based Speech Recognition System
    • 208
    • K. Oura, H. Zen, Y. Nankaku, A. Lee, and K. Tokuda, “A Fully Consistent Hidden Semi-Markov Model-Based Speech Recognition System,” Proc. of IEICE Trans. Inf. and Syst., vol. E91-D, no. 11, pp. 2693-2700, 208.
    • Proc. of IEICE Trans. Inf. and Syst , vol.E91-D , Issue.11 , pp. 2693-2700
    • Oura, K.1    Zen, H.2    Nankaku, Y.3    Lee, A.4    Tokuda, K.5
  • 19
    • 79959831939 scopus 로고    scopus 로고
    • HMM-Based Singing Voice Synthesis System Using Pitch-Shifted Pseudo Training Data
    • (to be published)
    • A. Mase, K. Oura, Y. Nankaku, and K. Tokuda, “HMM-Based Singing Voice Synthesis System Using Pitch-Shifted Pseudo Training Data,” Proc. of Interspeech, 2010 (to be published).
    • (2010) Proc. of Interspeech
    • Mase, A.1    Oura, K.2    Nankaku, Y.3    Tokuda, K.4
  • 20
    • 0033906251 scopus 로고    scopus 로고
    • MDL-Based Context-Dependent Subword Modeling for Speech Recognition
    • K. Shinoda and T. Watanabe, “MDL-Based Context-Dependent Subword Modeling for Speech Recognition,” J. Acoust. Soc. Jpn.(E), vol.21, no. 2, pp. 79-86, 2000.
    • (2000) J. Acoust. Soc. Jpn.(E) , vol.21 , Issue.2 , pp. 79-86
    • Shinoda, K.1    Watanabe, T.2
  • 22
    • 44949192112 scopus 로고    scopus 로고
    • An Automatic Singing Skill Evaluation Method for Unknown Melodies Using Pitch Interval Accuracy and Vibrato Features
    • T. Nakano, M. Goto, and Y. Hiraga, “An Automatic Singing Skill Evaluation Method for Unknown Melodies Using Pitch Interval Accuracy and Vibrato Features”, Proc. of Interspeech, pp. 1706-1709, 2006.
    • (2006) Proc. of Interspeech , pp. 1706-1709
    • Nakano, T.1    Goto, M.2    Hiraga, Y.3
  • 24
    • 44949247517 scopus 로고
    • A Musical Ornament, the Vibrato
    • McGraw-Hill Book Company
    • C. E. Seashore, “A Musical Ornament, the Vibrato,” Proc. of Psychology of Music, McGraw-Hill Book Company, pp. 33-52, 1938.
    • (1938) Proc. of Psychology of Music , pp. 33-52
    • Seashore, C. E.1
  • 25
    • 85133408098 scopus 로고    scopus 로고
    • Reducing Computational Cost of Training for HMM-Based Singing Voice Synthesis Using Note Boundaries
    • 2-7-8, (in Japanese)
    • S. Muto, K. Oura, Y. Nankaku, and K. Tokuda, “Reducing Computational Cost of Training for HMM-Based Singing Voice Synthesis Using Note Boundaries,” Proc. of Acoustic Society of Japan Spring Meeting, vol. I, 2-7-8, pp. 347-348, 2009 (in Japanese).
    • (2009) Proc. of Acoustic Society of Japan Spring Meeting , vol.I , pp. 347-348
    • Muto, S.1    Oura, K.2    Nankaku, Y.3    Tokuda, K.4
  • 27
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring Speech Representations Using a Pitch-Adaptive Time-Frequency Smoothing and an Instantaneous-Frequency-Based F0 Extraction: Possible Role of a Repetitive Structure in Sounds
    • H. Kawahara, M. K. Ikuyo, and A. Cheneigne, “Restructuring Speech Representations Using a Pitch-Adaptive Time-Frequency Smoothing and an Instantaneous-Frequency-Based F0 Extraction: Possible Role of a Repetitive Structure in Sounds,” Proc. of Speech Communication, 27, pp. 187-207, 1999.
    • (1999) Proc. of Speech Communication , vol.27 , pp. 187-207
    • Kawahara, H.1    Ikuyo, M. K.2    Cheneigne, A.3
  • 30
    • 85133460481 scopus 로고    scopus 로고
    • On CrestMuseXML (CMX) Toolkit Ver. 0.40
    • (in Japanese)
    • T. Kitahara and H. Katayose, “On CrestMuseXML (CMX) Toolkit Ver. 0.40,” IPSJ SIG Technical Report, vol. 2008-MUS-75, no. 17, pp. 95-100, 2008 (in Japanese).
    • (2008) IPSJ SIG Technical Report , vol.2008-MUS-75 , Issue.17 , pp. 95-100
    • Kitahara, T.1    Katayose, H.2
  • 31
    • 0032678076 scopus 로고    scopus 로고
    • Hidden Markov Models Based on Multi-Space Probability Distribution for Pitch Pattern Modeling
    • K. Tokuda, T. Masuko, N. Miyazaki, and T. Kobayashi, “Hidden Markov Models Based on Multi-Space Probability Distribution for Pitch Pattern Modeling,” Proc. of ICASSP, vol. I, pp. 229-232, 1999.
    • (1999) Proc. of ICASSP , vol.I , pp. 229-232
    • Tokuda, K.1    Masuko, T.2    Miyazaki, N.3    Kobayashi, T.4
  • 32
    • 33846429403 scopus 로고    scopus 로고
    • Minimum Generation Error Training for HMM-Based Speech Synthesis
    • Y. J. Wu, and R. H. Wang, “Minimum Generation Error Training for HMM-Based Speech Synthesis,” Proc. of ICASSP, vol. I, pp. 89-92, 2006.
    • (2006) Proc. of ICASSP , vol.I , pp. 89-92
    • Wu, Y. J.1    Wang, R. H.2
  • 33
    • 33745200051 scopus 로고    scopus 로고
    • Speech Parameter Generation Algorithm Considering Global Variance for HMM-Based Speech Synthesis
    • T. Toda and K. Tokuda, “Speech Parameter Generation Algorithm Considering Global Variance for HMM-Based Speech Synthesis,” Proc. of Interspeech, pp. 2801-2804, 2005.
    • (2005) Proc. of Interspeech , pp. 2801-2804
    • Toda, T.1    Tokuda, K.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.