SCOPUS Information Retrieval Platform

2012 Conference Handbook - Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2012

Volume , Issue , 2012, Pages

Singing voice conversion method based on many-to-many eigenvoice conversion and training data generation using a singing-to-singing synthesis system

(5) Doi, Hironori a; Toda, Tomoki a; Nakano, Tomoyasu b; Goto, Masataka b; Nakamura, Satoshi a

a NARA INSTITUTE OF SCIENCE AND TECHNOLOGY (Japan)

b NATIONAL INSTITUTE OF ADVANCED INDUSTRIAL SCIENCE AND TECHNOLOGY AIST (Japan)

Author keywords

[No Author keywords available]

Indexed keywords

EIGENVOICES; MANY-TO-MANY; PARALLEL DATA; SINGING VOICES; TRAINING DATA; VOICE CONVERSION; VOICE QUALITY;

DATA HANDLING; SIGNAL PROCESSING;

SPEECH PROCESSING;

EID: 84874403435 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (39)

References (20)

1
- 76249125282
- VOCALOID - Commercial singing synthesizer based on sample concatenation
- Aug.
- H. Kenmochi and H. Ohshita, "VOCALOID - Commercial singing synthesizer based on sample concatenation," Proc. INTERSPEECH, pp. 4011-4012, Aug. 2007.
- (2007) Proc. INTERSPEECH , pp. 4011-4012
- Kenmochi, H.¹ Ohshita, H.²

2
- 84876667508
- Recent development of the HMM-based singing voice synthesis system - Sinsy
- Sept.
- K. Oura, A. Mase, T. Yamada, S. Muto, Y. Nankaku, and K. Tokuda, "Recent development of the HMM-based singing voice synthesis system - Sinsy," SSW7, pp. 211-216, Sept. 2010.
- (2010) SSW7 , pp. 211-216
- Oura, K.¹ Mase, A.² Yamada, T.³ Muto, S.⁴ Nankaku, Y.⁵ Tokuda, K.⁶

3
- 84867588982
- VOCALOID and Hatsune Miku phenomenon in Japan
- Oct.
- H. Kenmochi, "VOCALOID and Hatsune Miku phenomenon in Japan," Proc. InterSinging, pp. 1-4, Oct. 2010.
- (2010) Proc. InterSinging , pp. 1-4
- Kenmochi, H.¹

4
- 70349200811
- Temporally variable multi-aspect auditory morphing enabling extrapolation without objective and perceptual breakdown
- Apr.
- H. Kawahara, R. Nisimura, T. Irino, M. Morise, T. Takahashi, and H. Banno, "Temporally variable multi-aspect auditory morphing enabling extrapolation without objective and perceptual breakdown," Proc. ICASSP, pp. 3905-3908, Apr. 2009.
- (2009) Proc. ICASSP , pp. 3905-3908
- Kawahara, H.¹ Nisimura, R.² Irino, T.³ Morise, M.⁴ Takahashi, T.⁵ Banno, H.⁶

5
- 79959827418
- Applying voice conversion to concatenative singing-voice synthesis
- Sept.
- F. Villavicencio and J. Bonada, "Applying voice conversion to concatenative singing-voice synthesis," INTERSPEECH, pp. 2162-2165, Sept. 2010.
- (2010) INTERSPEECH , pp. 2162-2165
- Villavicencio, F.¹ Bonada, J.²

6
- 80051619989
- Vocalistener2: A singing synthesis system able to mimic a user's singing in terms of voice timbre changes as well as pitch and dynamics
- May
- T. Nakano and M. Goto, "Vocalistener2: A singing synthesis system able to mimic a user's singing in terms of voice timbre changes as well as pitch and dynamics," Proc. ICASSP, pp. 453-456, May 2012.
- (2012) Proc. ICASSP , pp. 453-456
- Nakano, T.¹ Goto, M.²

7
- 77951033715
- Vowel-based voice conversion and its application to singing-voice manipulation
- Feb.
- Y. Yoshida, R. Nishimura, T. Irino, and H. Kawahara, "Vowel-based voice conversion and its application to singing-voice manipulation," Proc. AES 35th International Conference: Audio for Games, no. 6, Feb. 2009.
- (2009) Proc. AES 35th International Conference: Audio for Games , Issue.6
- Yoshida, Y.¹ Nishimura, R.² Irino, T.³ Kawahara, H.⁴

8
- 84874432462
- GMM voice conversion of singing voice using vocal tract area function
- (Japanese edition), Nov.
- Y. Kawakami, H. Banno, and F. Itakura, "GMM voice conversion of singing voice using vocal tract area function," IEICE technical report. Speech 110(297) (Japanese edition), pp. 71-76, Nov. 2010.
- (2010) IEICE Technical Report. Speech , vol.110 , Issue.297 , pp. 71-76
- Kawakami, Y.¹ Banno, H.² Itakura, F.³

9
- 0032026483
- Continuous probabilistic transform for voice conversion
- Mar.
- Y. Stylianou, O. Cappe, and E. Moulines, "Continuous probabilistic transform for voice conversion," IEEE Trans. SAP, vol. 6, no. 2, pp. 131-142, Mar. 1998.
- (1998) IEEE Trans. SAP , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappe, O.² Moulines, E.³

10
- 0031623661
- Spectral voice conversion for text-to-speech synthesis
- May
- A. Kain and M. W. Macon, "Spectral voice conversion for text-to-speech synthesis," Proc. ICASSP, pp. 285-288, May 1998.
- (1998) Proc. ICASSP , pp. 285-288
- Kain, A.¹ Macon, M.W.²

11
- 57749193836
- Voice conversion based on maximum likelihood estimation of spectral parameter trajectory
- Nov.
- T. Toda, A. W. Black, and K. Tokuda, "Voice conversion based on maximum likelihood estimation of spectral parameter trajectory," IEEE Trans. ASLP, vol. 15, no. 8, pp. 2222-2235, Nov. 2007.
- (2007) IEEE Trans. ASLP , vol.15 , Issue.8 , pp. 2222-2235
- Toda, T.¹ Black, A.W.² Tokuda, K.³

12
- 70450194389
- Many-to-many eigenvoice conversion with reference voice
- Sept.
- Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Many-to-many eigenvoice conversion with reference voice," INTERSPEECH, pp. 1623-1626, Sept. 2009.
- (2009) INTERSPEECH , pp. 1623-1626
- Ohtani, Y.¹ Toda, T.² Saruwatari, H.³ Shikano, K.⁴

13
- 78049360766
- VocaListener: A singing-to-singing synthesis system based on iterative parameter estimation
- May
- T. Nakano and M. Goto, "VocaListener: A singing-to-singing synthesis system based on iterative parameter estimation," Proc. SMC 2009, pp. 343-348, May 2009.
- (2009) Proc. SMC 2009 , pp. 343-348
- Nakano, T.¹ Goto, M.²

14
- 34547496175
- One-to-many and many-to-one voice conversion based on eigenvoices
- Apr.
- T. Toda, Y. Ohtani, and K. Shikano, "One-to-many and many-to-one voice conversion based on eigenvoices," Proc. ICASSP, pp. 1249-1252, Apr. 2007.
- (2007) Proc. ICASSP , pp. 1249-1252
- Toda, T.¹ Ohtani, Y.² Shikano, K.³

15
- 77952978184
- Adaptive training for voice conversion based on eigenvoices
- June
- Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Adaptive training for voice conversion based on eigenvoices," IEICE Trans. Inf. and Syst., vol. E93-D, no. 6, pp. 1589-1598, June 2010.
- (2010) IEICE Trans. Inf. and Syst. , vol.E93-D , Issue.6 , pp. 1589-1598
- Ohtani, Y.¹ Toda, T.² Saruwatari, H.³ Shikano, K.⁴

16
- 50249180273
- Speech-to-singing synthesis: Converting speaking voices to singing voices by controlling acoustic features unique to singing voice
- Oct.
- T. Saitou, M. Goto, M. Unoki, and M. Akagi, "Speech-to-singing synthesis: Converting speaking voices to singing voices by controlling acoustic features unique to singing voice," Proc. WASPAA, pp. 215-218, Oct. 2007.
- (2007) Proc. WASPAA , pp. 215-218
- Saitou, T.¹ Goto, M.² Unoki, M.³ Akagi, M.⁴

17
- 84867211725
- Low-delay voice conversion based on maximum likelihood estimation of spectral parameter trajectory
- Sept.
- T. Muramatsu, Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Low-delay voice conversion based on maximum likelihood estimation of spectral parameter trajectory," Proc. INTERSPEECH, pp. 1076-1079, Sept. 2008.
- (2008) Proc. INTERSPEECH , pp. 1076-1079
- Muramatsu, T.¹ Ohtani, Y.² Toda, T.³ Saruwatari, H.⁴ Shikano, K.⁵

18
- 2442437071
- RWC Music Database: Music genre database and musical instrument sound database
- Oct.
- M. Goto, T. Nishimura, H. Hashiguchi, and R. Oka, "RWC Music Database: Music genre database and musical instrument sound database," Proc. ISMIR, pp. 229-230, Oct. 2003.
- (2003) Proc. ISMIR , pp. 229-230
- Goto, M.¹ Nishimura, T.² Hashiguchi, H.³ Oka, R.⁴

19
- 84874420831
- What is the "HATSUNE MIKU movement"?
- Crypton Future Media, "What is the "HATSUNE MIKU movement"?" 2012. [Online]. Available: http://www.crypton.co.jp/miku- eng
- (2012) [Online]
- Crypton Future Media

20
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds
- Apr.
- H. Kawahara, I. Masuda-Katsuse, and A. Cheveigne, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds," Speech Communication, vol. 27, no. 3-4, pp. 187-207, Apr. 1999.
- (1999) Speech Communication , vol.27 , Issue.3-4 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² Cheveigne, A.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.