메뉴 건너뛰기




Volumn , Issue , 2014, Pages 2499-2503

A comparative study of spectral transformation techniques for singing voice synthesis

Author keywords

Adaptation; Singing synthesis; Spectral transformation; Speech to singing; Voice conversion

Indexed keywords

COMPUTER MUSIC; MAXIMUM LIKELIHOOD ESTIMATION; SPECTRUM ANALYSIS; SPEECH COMMUNICATION;

EID: 84910071971     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (8)

References (28)
  • 1
    • 84910024916 scopus 로고    scopus 로고
    • Synthesis of singing challenge
    • Special Session, Aug
    • Synthesis of Singing Challenge (Special Session), Proc. Interspeech, Aug. 2007.
    • (2007) Proc. Interspeech
  • 2
    • 84865801323 scopus 로고    scopus 로고
    • Rule-based voice conversion derived from expressive speech perception model: How do computers sing a song joyfully?
    • Tutorial 01, Nov
    • M. Akagi, "Rule-based voice conversion derived from expressive speech perception model: How do computers sing a song joyfully?" in Proc. ISCSLP. Tutorial 01, Nov. 2010.
    • (2010) Proc. ISCSLP
    • Akagi, M.1
  • 3
    • 85032751318 scopus 로고    scopus 로고
    • Synthesis of the singing voice by performance sampling and spectral models
    • J. Bonada and X. Serra, "Synthesis of the singing voice by performance sampling and spectral models, " IEEE Signal Processing Magazine, vol. 24, pp. 67-79, 2007.
    • (2007) IEEE Signal Processing Magazine , vol.24 , pp. 67-79
    • Bonada, J.1    Serra, X.2
  • 5
    • 84867623442 scopus 로고    scopus 로고
    • Generalized F0 modeling with absolute and relative pitch features for singing voice synthesis
    • Mar
    • S. W. Lee, S. T. Ang, M. Dong, and H. Li, "Generalized F0 modeling with absolute and relative pitch features for singing voice synthesis, " in Proc. ICASSP, Mar. 2012, pp. 429-432.
    • (2012) Proc. ICASSP , pp. 429-432
    • Lee, S.W.1    Ang, S.T.2    Dong, M.3    Li, H.4
  • 6
    • 84865746117 scopus 로고    scopus 로고
    • Singing voice synthesis: Singerdependent vibrato modeling and coherent processing of spectral envelope
    • Aug
    • S. W. Lee and M. Dong, "Singing voice synthesis: Singerdependent vibrato modeling and coherent processing of spectral envelope, " in Proc. Interspeech, Aug. 2011, pp. 2001-2004.
    • (2011) Proc. Interspeech , pp. 2001-2004
    • Lee, S.W.1    Dong, M.2
  • 7
    • 76249125282 scopus 로고    scopus 로고
    • VOCALID - Commercial singing synthesizer based on sample concatenation
    • Aug
    • H. Kenmochi and H. Ohshita, "VOCALID - Commercial singing synthesizer based on sample concatenation, " in Proc. Interspeech, Aug. 2007.
    • (2007) Proc. Interspeech
    • Kenmochi, H.1    Ohshita, H.2
  • 9
    • 84910057227 scopus 로고    scopus 로고
    • Mar
    • "An app with speech-to-singing utility. NDP 2013 Mobile App [Online], " Mar. 2014, available: Https://itunes.apple.com/sg/app/ndp-2013-mobileapp/id524388683?mt=8.
    • (2014) An App with Speech-to-singing Utility
  • 10
    • 84867619250 scopus 로고    scopus 로고
    • Vocalistener and vocawatcher: Imitating a human singer by using signal processing
    • Mar
    • M. Goto, T. Nakano, S. Kajita, Y. Matsusaka, S. Nakaoka, and K. Yokoi, "Vocalistener and vocawatcher: Imitating a human singer by using signal processing, " in Proc. ICASSP, Mar. 2012, pp. 5393-5396.
    • (2012) Proc. ICASSP , pp. 5393-5396
    • Goto, M.1    Nakano, T.2    Kajita, S.3    Matsusaka, Y.4    Nakaoka, S.5    Yokoi, K.6
  • 11
    • 65549092601 scopus 로고    scopus 로고
    • Vocal tract resonances in speech, singing and playing music instruments
    • J. Wolfe, M. Garnier, and J. Smith, "Vocal tract resonances in speech, singing and playing music instruments, " Human Frontier Science Program Journal, vol. 3, pp. 6-23, 2009.
    • (2009) Human Frontier Science Program Journal , vol.3 , pp. 6-23
    • Wolfe, J.1    Garnier, M.2    Smith, J.3
  • 12
    • 0347087547 scopus 로고    scopus 로고
    • Tuning of vocal tract resonance by sopranos
    • Jan
    • E. Joliveau, J. Smith, and J. Wolfe, "Tuning of vocal tract resonance by sopranos, " Nature, vol. 427, p. 116, Jan. 2004.
    • (2004) Nature , vol.427
    • Joliveau, E.1    Smith, J.2    Wolfe, J.3
  • 13
    • 0017466904 scopus 로고
    • The acoustics of the singing voice
    • Mar
    • J. Sundberg, "The acoustics of the singing voice, " Scientific American, vol. 236, pp. 82-91, Mar. 1977.
    • (1977) Scientific American , vol.236 , pp. 82-91
    • Sundberg, J.1
  • 14
  • 15
    • 4444251929 scopus 로고
    • Voice conversion: State of the art and perspective
    • E. Moulines and Y. Sagisaka, "Voice conversion: State of the art and perspective, " Special Iss. Speech Commun., vol. 16, no. 2, 1995.
    • (1995) Special Iss. Speech Commun , vol.16 , Issue.2
    • Moulines, E.1    Sagisaka, Y.2
  • 16
    • 0032026483 scopus 로고    scopus 로고
    • Continuous probabilistic transform for voice conversion
    • Mar
    • Y. Stylianou, O. Cappe, and E. Moulines, "Continuous probabilistic transform for voice conversion, " IEEE Trans. Speech & Audio Proc., vol. 6, pp. 131-142, Mar. 1998.
    • (1998) IEEE Trans. Speech & Audio Proc. , vol.6 , pp. 131-142
    • Stylianou, Y.1    Cappe, O.2    Moulines, E.3
  • 17
    • 0034842552 scopus 로고    scopus 로고
    • Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum
    • May
    • T. Toda, H. Saruwatari, and K. Shikano, "Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum, " in Proc. ICASSP, May 2001, pp. 841-844.
    • (2001) Proc. ICASSP , pp. 841-844
    • Toda, T.1    Saruwatari, H.2    Shikano, K.3
  • 18
    • 4444285698 scopus 로고    scopus 로고
    • Ph.D. dissertation, OGI School of Science & Engineering, Oct
    • A. B. Kain, "High resolution voice transformation, " Ph.D. dissertation, OGI School of Science & Engineering, Oct. 2001.
    • (2001) High Resolution Voice Transformation
    • Kain, A.B.1
  • 19
    • 57749193836 scopus 로고    scopus 로고
    • Voice conversion based on maximum-likihood estimation of spectral parameter trajectory
    • Nov
    • T. Toda, A.W. Black, and K. Tokuda, "Voice conversion based on maximum-likihood estimation of spectral parameter trajectory, " IEEE Trans. Audio, Speech, & Lang. Proc., vol. 15, pp. 2222- 2235, Nov. 2007.
    • (2007) IEEE Trans. Audio, Speech, & Lang. Proc , vol.15 , pp. 2222-2235
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 21
    • 0033708106 scopus 로고    scopus 로고
    • Speech parameter generation algorithms for HMM-based speech synthesis
    • Jun
    • K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMM-based speech synthesis, " in Proc. ICASSP, Jun. 2000, pp. 1315-1318.
    • (2000) Proc. ICASSP , pp. 1315-1318
    • Tokuda, K.1    Yoshimura, T.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 22
    • 0034842740 scopus 로고    scopus 로고
    • Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR
    • May
    • M. Tamura, T. Masuko, K. Tokuda, and T. Kobayashi, "Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR, " in Proc. ICASSP, May 2011, pp. 805-808.
    • (2011) Proc. ICASSP , pp. 805-808
    • Tamura, M.1    Masuko, T.2    Tokuda, K.3    Kobayashi, T.4
  • 23
    • 84865767835 scopus 로고    scopus 로고
    • HMM-based expressive speech synthesis - Towards TTS with arbitrary speaking styles and emotions
    • Jan
    • J. Yamagishi, T. Masuko, and T. Kobayashi, "HMM-based expressive speech synthesis - Towards TTS with arbitrary speaking styles and emotions, " in Proc. Special Workshop in Maui (SWIM), Jan. 2004.
    • (2004) Proc. Special Workshop in Maui (SWIM)
    • Yamagishi, J.1    Masuko, T.2    Kobayashi, T.3
  • 24
    • 33847129573 scopus 로고    scopus 로고
    • Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training
    • Feb
    • J. Yamagishi and T. Kobayashi, "Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training, " IEICE Trans. Inf. & Syst., vol. E90-D, pp. 533-543, Feb. 2007.
    • (2007) IEICE Trans. Inf. & Syst , vol.E90-D , pp. 533-543
    • Yamagishi, J.1    Kobayashi, T.2
  • 25
    • 84865698185 scopus 로고    scopus 로고
    • Statistical voice conversion techniques for body-conducted unvoiced speech enhancement
    • Sep
    • T. Toda, M. Nakagiri, and K. Shikano, "Statistical voice conversion techniques for body-conducted unvoiced speech enhancement, " IEEE Trans. Audio, Speech, & Lang. Proc., vol. 20, pp. 2505-2517, Sep. 2012.
    • (2012) IEEE Trans. Audio, Speech, & Lang. Proc , vol.20 , pp. 2505-2517
    • Toda, T.1    Nakagiri, M.2    Shikano, K.3
  • 26
    • 51449108867 scopus 로고    scopus 로고
    • Tandem-STRAIGHT: A temporally stable power spectral representation for periodic signals and applications to interference-free spectrum, F0 and aperiodicity estimation
    • Mar
    • H. Kawahara, M. Morise, T. Takahashi, R. Nisimura, T. Irino, and H. Banno, "Tandem-STRAIGHT: A temporally stable power spectral representation for periodic signals and applications to interference-free spectrum, F0 and aperiodicity estimation, " in Proc. ICASSP, Mar. 2008, pp. 3933-3936.
    • (2008) Proc. ICASSP , pp. 3933-3936
    • Kawahara, H.1    Morise, M.2    Takahashi, T.3    Nisimura, R.4    Irino, T.5    Banno, H.6
  • 27
    • 67650854725 scopus 로고    scopus 로고
    • Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm
    • Jan
    • J. Yamagishi, T. Kobayashi, Y. Nakano, K. Ogata, and J. Isogai, "Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm, " IEEE Trans. Audio, Speech, & Lang. Proc., vol. 17, pp. 66-83, Jan. 2009.
    • (2009) IEEE Trans. Audio, Speech, & Lang. Proc , vol.17 , pp. 66-83
    • Yamagishi, J.1    Kobayashi, T.2    Nakano, Y.3    Ogata, K.4    Isogai, J.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.