메뉴 건너뛰기




Volumn , Issue , 2012, Pages

Singing voice conversion method based on many-to-many eigenvoice conversion and training data generation using a singing-to-singing synthesis system

Author keywords

[No Author keywords available]

Indexed keywords

EIGENVOICES; MANY-TO-MANY; PARALLEL DATA; SINGING VOICES; TRAINING DATA; VOICE CONVERSION; VOICE QUALITY;

EID: 84874403435     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (39)

References (20)
  • 1
    • 76249125282 scopus 로고    scopus 로고
    • VOCALOID - Commericial singing synthesizer based on sample concatenation
    • Aug.
    • H. Kenmochi and H. Ohshita, "VOCALOID - Commericial singing synthesizer based on sample concatenation," Proc. INTERSPEECH, pp. 4011-4012, Aug. 2007.
    • (2007) Proc. INTERSPEECH , pp. 4011-4012
    • Kenmochi, H.1    Ohshita, H.2
  • 2
    • 84876667508 scopus 로고    scopus 로고
    • Recent development of the HMM-based singing voice synthesis system - Sinsy
    • Sept.
    • K. Oura, A. Mase, T. Yamada, S. Muto, Y. Nankaku, and K. Tokuda, "Recent development of the HMM-based singing voice synthesis system - Sinsy," SSW7, pp. 211-216, Sept. 2010.
    • (2010) SSW7 , pp. 211-216
    • Oura, K.1    Mase, A.2    Yamada, T.3    Muto, S.4    Nankaku, Y.5    Tokuda, K.6
  • 3
    • 84867588982 scopus 로고    scopus 로고
    • VOCALOID and Hatsune Miku phenomenon in Japan
    • Oct.
    • H. Kenmochi, "VOCALOID and Hatsune Miku phenomenon in Japan,"Proc. InterSinging, pp. 1-4, Oct. 2010.
    • (2010) Proc. InterSinging , pp. 1-4
    • Kenmochi, H.1
  • 4
    • 70349200811 scopus 로고    scopus 로고
    • Temporally variable multi-aspect auditory morphing enabling extrapolation without objective and perceptual breakdown
    • Apr.
    • H. Kawahara, R. Nisimura, T. Irino, M. Morise, T. Takahashi, and H. Banno, "Temporally variable multi-aspect auditory morphing enabling extrapolation without objective and perceptual breakdown," Proc. ICASSP, pp. 3905-3908, Apr. 2009.
    • (2009) Proc. ICASSP , pp. 3905-3908
    • Kawahara, H.1    Nisimura, R.2    Irino, T.3    Morise, M.4    Takahashi, T.5    Banno, H.6
  • 5
    • 79959827418 scopus 로고    scopus 로고
    • Applying voice conversion to concate-native singing-voice synthesis
    • Sept.
    • F. Villavicencio and J. Bonada, "Applying voice conversion to concate-native singing-voice synthesis," INTERSPEECH, pp. 2162-2165, Sept. 2010.
    • (2010) INTERSPEECH , pp. 2162-2165
    • Villavicencio, F.1    Bonada, J.2
  • 6
    • 80051619989 scopus 로고    scopus 로고
    • Vocalistener2: A singing synthesis sytem able to mimic a user's singing in terms of voice timbre changes as well as pitch and dynamics
    • May
    • T. Nakano and M. Goto, "Vocalistener2: A singing synthesis sytem able to mimic a user's singing in terms of voice timbre changes as well as pitch and dynamics," Proc. ICASSP, pp. 453-456, May 2012.
    • (2012) Proc. ICASSP , pp. 453-456
    • Nakano, T.1    Goto, M.2
  • 8
    • 84874432462 scopus 로고    scopus 로고
    • GMM voice conversion of singing voice using vocal tract area function
    • (Japanese edition), Nov.
    • Y. Kawakami, H. Banno, and F. Itakura, "GMM voice conversion of singing voice using vocal tract area function," IEICE technical report. Speech 110(297) (Japanese edition), pp. 71-76, Nov. 2010.
    • (2010) IEICE Technical Report. Speech , vol.110 , Issue.297 , pp. 71-76
    • Kawakami, Y.1    Banno, H.2    Itakura, F.3
  • 9
    • 0032026483 scopus 로고    scopus 로고
    • Continuous probabilistic transform for voice conversion
    • Mar.
    • Y. Stylianou, O. Cappe, and E. Moulines, "Continuous probabilistic transform for voice conversion," IEEE Trans. SAP, vol. 6, no. 2, pp. 131-142, Mar. 1998.
    • (1998) IEEE Trans. SAP , vol.6 , Issue.2 , pp. 131-142
    • Stylianou, Y.1    Cappe, O.2    Moulines, E.3
  • 10
    • 0031623661 scopus 로고    scopus 로고
    • Spectral voice conversion for text-to-speech synthesis
    • May
    • A. Kain and M. W. Macon, "Spectral voice conversion for text-to-speech synthesis," Proc. ICASSP, pp. 285-288, May 1998.
    • (1998) Proc. ICASSP , pp. 285-288
    • Kain, A.1    Macon, M.W.2
  • 11
    • 57749193836 scopus 로고    scopus 로고
    • Voice conversion based on maximum likelihood estimation of spectral parameter trajectory
    • Nov.
    • T. Toda, A. W. Black, and K. Tokuda, "Voice conversion based on maximum likelihood estimation of spectral parameter trajectory," IEEE Trans. ASLP, vol. 15, no. 8, pp. 2222-2235, Nov. 2007.
    • (2007) IEEE Trans. ASLP , vol.15 , Issue.8 , pp. 2222-2235
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 12
    • 70450194389 scopus 로고    scopus 로고
    • Many-to-many eigenvoice conversion with reference voice
    • Sept.
    • Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Many-to-many eigenvoice conversion with reference voice," INTERSPEECH, pp. 1623-1626, Sept. 2009.
    • (2009) INTERSPEECH , pp. 1623-1626
    • Ohtani, Y.1    Toda, T.2    Saruwatari, H.3    Shikano, K.4
  • 13
    • 78049360766 scopus 로고    scopus 로고
    • VocaListener: A singing-to-singing synthesis system based on iterative parameter estimation
    • May
    • T. Nakano and M. Goto, "VocaListener: A singing-to-singing synthesis system based on iterative parameter estimation," Proc. SMC 2009, pp. 343-348, May 2009.
    • (2009) Proc. SMC 2009 , pp. 343-348
    • Nakano, T.1    Goto, M.2
  • 14
    • 34547496175 scopus 로고    scopus 로고
    • One-to-many and many-to-one voice conversion based on eigenvoices
    • Apr.
    • T. Toda, Y. Ohtani, and K. Shikano, "One-to-many and many-to-one voice conversion based on eigenvoices," Proc. ICASSP, pp. 1249-1252, Apr. 2007.
    • (2007) Proc. ICASSP , pp. 1249-1252
    • Toda, T.1    Ohtani, Y.2    Shikano, K.3
  • 15
    • 77952978184 scopus 로고    scopus 로고
    • Adaptive training for voice conversion based on eigenvoices
    • June
    • Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Adaptive training for voice conversion based on eigenvoices," IEICE Trans. Inf. and Syst., vol. E93-D, no. 6, pp. 1589-1598, June 2010.
    • (2010) IEICE Trans. Inf. and Syst. , vol.E93-D , Issue.6 , pp. 1589-1598
    • Ohtani, Y.1    Toda, T.2    Saruwatari, H.3    Shikano, K.4
  • 16
    • 50249180273 scopus 로고    scopus 로고
    • Speech-to-singing synthesis: Converting speaking voices to singing voices by controlling acoustic features unique to singing voice
    • Oct.
    • T. Saitou, M. Goto, M. Unoki, and M. Akagi, "Speech-to-singing synthesis: Converting speaking voices to singing voices by controlling acoustic features unique to singing voice," Proc. WASPAA, pp. 215-218, Oct. 2007.
    • (2007) Proc. WASPAA , pp. 215-218
    • Saitou, T.1    Goto, M.2    Unoki, M.3    Akagi, M.4
  • 17
    • 84867211725 scopus 로고    scopus 로고
    • Low-delay voice conversion based on maximum likelihood estimation of spectral parameter trajectory
    • Sept.
    • T. Muramatsu, Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Low-delay voice conversion based on maximum likelihood estimation of spectral parameter trajectory," Proc. INTERSPEECH, pp. 1076-1079, Sept. 2008.
    • (2008) Proc. INTERSPEECH , pp. 1076-1079
    • Muramatsu, T.1    Ohtani, Y.2    Toda, T.3    Saruwatari, H.4    Shikano, K.5
  • 18
    • 2442437071 scopus 로고    scopus 로고
    • RWC Music Database: Music genre database and musical instrument sound database
    • Oct.
    • M. Goto, T. Nishimura, H. Hashiguchi, and R. Oka, "RWC Music Database: Music genre database and musical instrument sound database," Proc. ISMIR, pp. 229-230, Oct. 2003.
    • (2003) Proc. ISMIR , pp. 229-230
    • Goto, M.1    Nishimura, T.2    Hashiguchi, H.3    Oka, R.4
  • 19
    • 84874420831 scopus 로고    scopus 로고
    • What is the
    • Crypton Future Media, Available
    • Crypton Future Media, "What is the "HATSUNE MIKU movement"?"2012. [Online]. Available: http://www.crypton.co.jp/miku- eng
    • HATSUNE MIKU Movement"?"2012. [Online]
  • 20
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds
    • Apr.
    • H. Kawahara, I. Masuda-Katsuse, and A. Cheveigne, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds," Speech Communication, vol. 27, no. 3-4, pp. 187-207, Apr. 1999.
    • (1999) Speech Communication , vol.27 , Issue.3-4 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    Cheveigne, A.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.