SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn , Issue , 2014, Pages 2499-2503

A comparative study of spectral transformation techniques for singing voice synthesis

(5) Lee, S W a Wu, Zhizheng b Dong, Minghui a Tian, Xiaohai b Li, Haizhou a,b

a INSTITUTE FOR INFOCOMM RESEARCH (Singapore)

b NANYANG TECHNOLOGICAL UNIVERSITY (Singapore)

Author keywords

Adaptation; Singing synthesis; Spectral transformation; Speech to singing; Voice conversion

Indexed keywords

COMPUTER MUSIC; MAXIMUM LIKELIHOOD ESTIMATION; SPECTRUM ANALYSIS; SPEECH COMMUNICATION;

ADAPTATION; COMPARATIVE STUDIES; CONTEXT-DEPENDENT MODELS; GAUSSIAN MIXTURE MODEL; SINGING SYNTHESIS; SINGING-VOICE SYNTHESIS; SPECTRAL TRANSFORMATIONS; VOICE CONVERSION;

SPEECH PROCESSING;

EID: 84910071971 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (8)

References (28)

1
- 84910024916
- Synthesis of singing challenge
- Special Session, Aug
- Synthesis of Singing Challenge (Special Session), Proc. Interspeech, Aug. 2007.
- (2007) Proc. Interspeech

2
- 84865801323
- Rule-based voice conversion derived from expressive speech perception model: How do computers sing a song joyfully?
- Tutorial 01, Nov
- M. Akagi, "Rule-based voice conversion derived from expressive speech perception model: How do computers sing a song joyfully?" in Proc. ISCSLP. Tutorial 01, Nov. 2010.
- (2010) Proc. ISCSLP
- Akagi, M.¹

3
- 85032751318
- Synthesis of the singing voice by performance sampling and spectral models
- J. Bonada and X. Serra, "Synthesis of the singing voice by performance sampling and spectral models, " IEEE Signal Processing Magazine, vol. 24, pp. 67-79, 2007.
- (2007) IEEE Signal Processing Magazine , vol.24 , pp. 67-79
- Bonada, J.¹ Serra, X.²

4
- 44949085633
- An HMMbased singing voice synthesis system
- Sep
- K. Saino, H. Zen, Y. Nankaku, A. Lee, and K. Tokuda, "An HMMbased singing voice synthesis system, " in Proc. Interspeech, Sep. 2006, pp. 2274-2277.
- (2006) Proc. Interspeech , pp. 2274-2277
- Saino, K.¹ Zen, H.² Nankaku, Y.³ Lee, A.⁴ Tokuda, K.⁵

5
- 84867623442
- Generalized F0 modeling with absolute and relative pitch features for singing voice synthesis
- Mar
- S. W. Lee, S. T. Ang, M. Dong, and H. Li, "Generalized F0 modeling with absolute and relative pitch features for singing voice synthesis, " in Proc. ICASSP, Mar. 2012, pp. 429-432.
- (2012) Proc. ICASSP , pp. 429-432
- Lee, S.W.¹ Ang, S.T.² Dong, M.³ Li, H.⁴

6
- 84865746117
- Singing voice synthesis: Singerdependent vibrato modeling and coherent processing of spectral envelope
- Aug
- S. W. Lee and M. Dong, "Singing voice synthesis: Singerdependent vibrato modeling and coherent processing of spectral envelope, " in Proc. Interspeech, Aug. 2011, pp. 2001-2004.
- (2011) Proc. Interspeech , pp. 2001-2004
- Lee, S.W.¹ Dong, M.²

7
- 76249125282
- VOCALID - Commercial singing synthesizer based on sample concatenation
- Aug
- H. Kenmochi and H. Ohshita, "VOCALID - Commercial singing synthesizer based on sample concatenation, " in Proc. Interspeech, Aug. 2007.
- (2007) Proc. Interspeech
- Kenmochi, H.¹ Ohshita, H.²

8
- 84865766404
- Mar.
- P. Kirn, "iPhone Day: LaDiDa's Reverse Karaoke Composes Accompaniment to Singing [Online], " Mar. 2014, available: Http://createdigitalmusic.com/2009/10/iphone-day-ladidasreverse-karaoke-composes-accompaniment-to-singing/.
- (2014) IPhone Day: LaDiDa's Reverse Karaoke Composes Accompaniment to Singing
- Kirn, P.¹

9
- 84910057227
- Mar
- "An app with speech-to-singing utility. NDP 2013 Mobile App [Online], " Mar. 2014, available: Https://itunes.apple.com/sg/app/ndp-2013-mobileapp/id524388683?mt=8.
- (2014) An App with Speech-to-singing Utility

10
- 84867619250
- Vocalistener and vocawatcher: Imitating a human singer by using signal processing
- Mar
- M. Goto, T. Nakano, S. Kajita, Y. Matsusaka, S. Nakaoka, and K. Yokoi, "Vocalistener and vocawatcher: Imitating a human singer by using signal processing, " in Proc. ICASSP, Mar. 2012, pp. 5393-5396.
- (2012) Proc. ICASSP , pp. 5393-5396
- Goto, M.¹ Nakano, T.² Kajita, S.³ Matsusaka, Y.⁴ Nakaoka, S.⁵ Yokoi, K.⁶

11
- 65549092601
- Vocal tract resonances in speech, singing and playing music instruments
- J. Wolfe, M. Garnier, and J. Smith, "Vocal tract resonances in speech, singing and playing music instruments, " Human Frontier Science Program Journal, vol. 3, pp. 6-23, 2009.
- (2009) Human Frontier Science Program Journal , vol.3 , pp. 6-23
- Wolfe, J.¹ Garnier, M.² Smith, J.³

12
- 0347087547
- Tuning of vocal tract resonance by sopranos
- Jan
- E. Joliveau, J. Smith, and J. Wolfe, "Tuning of vocal tract resonance by sopranos, " Nature, vol. 427, p. 116, Jan. 2004.
- (2004) Nature , vol.427
- Joliveau, E.¹ Smith, J.² Wolfe, J.³

13
- 0017466904
- The acoustics of the singing voice
- Mar
- J. Sundberg, "The acoustics of the singing voice, " Scientific American, vol. 236, pp. 82-91, Mar. 1977.
- (1977) Scientific American , vol.236 , pp. 82-91
- Sundberg, J.¹

14
- 50249180273
- Speech-to-singing synthesis: Converting speaking voices to singing voices by controlling acoustic features unique to singing voices
- Oct
- T. Saitou, M. Goto, M. Unoki, and M. Akagi, "Speech-to-singing synthesis: Converting speaking voices to singing voices by controlling acoustic features unique to singing voices, " in Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Oct. 2007, pp. 215-218.
- (2007) Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics , pp. 215-218
- Saitou, T.¹ Goto, M.² Unoki, M.³ Akagi, M.⁴

15
- 4444251929
- Voice conversion: State of the art and perspective
- E. Moulines and Y. Sagisaka, "Voice conversion: State of the art and perspective, " Special Iss. Speech Commun., vol. 16, no. 2, 1995.
- (1995) Special Iss. Speech Commun , vol.16 , Issue.2
- Moulines, E.¹ Sagisaka, Y.²

16
- 0032026483
- Continuous probabilistic transform for voice conversion
- Mar
- Y. Stylianou, O. Cappe, and E. Moulines, "Continuous probabilistic transform for voice conversion, " IEEE Trans. Speech & Audio Proc., vol. 6, pp. 131-142, Mar. 1998.
- (1998) IEEE Trans. Speech & Audio Proc. , vol.6 , pp. 131-142
- Stylianou, Y.¹ Cappe, O.² Moulines, E.³

17
- 0034842552
- Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum
- May
- T. Toda, H. Saruwatari, and K. Shikano, "Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum, " in Proc. ICASSP, May 2001, pp. 841-844.
- (2001) Proc. ICASSP , pp. 841-844
- Toda, T.¹ Saruwatari, H.² Shikano, K.³

18
- 4444285698
- Ph.D. dissertation, OGI School of Science & Engineering, Oct
- A. B. Kain, "High resolution voice transformation, " Ph.D. dissertation, OGI School of Science & Engineering, Oct. 2001.
- (2001) High Resolution Voice Transformation
- Kain, A.B.¹

19
- 57749193836
- Voice conversion based on maximum-likihood estimation of spectral parameter trajectory
- Nov
- T. Toda, A.W. Black, and K. Tokuda, "Voice conversion based on maximum-likihood estimation of spectral parameter trajectory, " IEEE Trans. Audio, Speech, & Lang. Proc., vol. 15, pp. 2222- 2235, Nov. 2007.
- (2007) IEEE Trans. Audio, Speech, & Lang. Proc , vol.15 , pp. 2222-2235
- Toda, T.¹ Black, A.W.² Tokuda, K.³

20
- 77953727123
- Voice conversion based on weighted frequency warping
- Jul
- D. Erro, A. Moreno, and A. Bonafonte, "Voice conversion based on weighted frequency warping, " IEEE Trans. Audio, Speech, & Lang. Proc., vol. 18, pp. 922-931, Jul. 2010.
- (2010) IEEE Trans. Audio, Speech, & Lang. Proc. , vol.18 , pp. 922-931
- Erro, D.¹ Moreno, A.² Bonafonte, A.³

21
- 0033708106
- Speech parameter generation algorithms for HMM-based speech synthesis
- Jun
- K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMM-based speech synthesis, " in Proc. ICASSP, Jun. 2000, pp. 1315-1318.
- (2000) Proc. ICASSP , pp. 1315-1318
- Tokuda, K.¹ Yoshimura, T.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

22
- 0034842740
- Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR
- May
- M. Tamura, T. Masuko, K. Tokuda, and T. Kobayashi, "Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR, " in Proc. ICASSP, May 2011, pp. 805-808.
- (2011) Proc. ICASSP , pp. 805-808
- Tamura, M.¹ Masuko, T.² Tokuda, K.³ Kobayashi, T.⁴

23
- 84865767835
- HMM-based expressive speech synthesis - Towards TTS with arbitrary speaking styles and emotions
- Jan
- J. Yamagishi, T. Masuko, and T. Kobayashi, "HMM-based expressive speech synthesis - Towards TTS with arbitrary speaking styles and emotions, " in Proc. Special Workshop in Maui (SWIM), Jan. 2004.
- (2004) Proc. Special Workshop in Maui (SWIM)
- Yamagishi, J.¹ Masuko, T.² Kobayashi, T.³

24
- 33847129573
- Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training
- Feb
- J. Yamagishi and T. Kobayashi, "Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training, " IEICE Trans. Inf. & Syst., vol. E90-D, pp. 533-543, Feb. 2007.
- (2007) IEICE Trans. Inf. & Syst , vol.E90-D , pp. 533-543
- Yamagishi, J.¹ Kobayashi, T.²

25
- 84865698185
- Statistical voice conversion techniques for body-conducted unvoiced speech enhancement
- Sep
- T. Toda, M. Nakagiri, and K. Shikano, "Statistical voice conversion techniques for body-conducted unvoiced speech enhancement, " IEEE Trans. Audio, Speech, & Lang. Proc., vol. 20, pp. 2505-2517, Sep. 2012.
- (2012) IEEE Trans. Audio, Speech, & Lang. Proc , vol.20 , pp. 2505-2517
- Toda, T.¹ Nakagiri, M.² Shikano, K.³

26
- 51449108867
- Tandem-STRAIGHT: A temporally stable power spectral representation for periodic signals and applications to interference-free spectrum, F0 and aperiodicity estimation
- Mar
- H. Kawahara, M. Morise, T. Takahashi, R. Nisimura, T. Irino, and H. Banno, "Tandem-STRAIGHT: A temporally stable power spectral representation for periodic signals and applications to interference-free spectrum, F0 and aperiodicity estimation, " in Proc. ICASSP, Mar. 2008, pp. 3933-3936.
- (2008) Proc. ICASSP , pp. 3933-3936
- Kawahara, H.¹ Morise, M.² Takahashi, T.³ Nisimura, R.⁴ Irino, T.⁵ Banno, H.⁶

27
- 67650854725
- Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm
- Jan
- J. Yamagishi, T. Kobayashi, Y. Nakano, K. Ogata, and J. Isogai, "Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm, " IEEE Trans. Audio, Speech, & Lang. Proc., vol. 17, pp. 66-83, Jan. 2009.
- (2009) IEEE Trans. Audio, Speech, & Lang. Proc , vol.17 , pp. 66-83
- Yamagishi, J.¹ Kobayashi, T.² Nakano, Y.³ Ogata, K.⁴ Isogai, J.⁵

28
- 77950574571
- Recent development of the HMM-based speech synthesis system (HTS)
- Oct
- H. Zen, K. Oura, T. Nose, J. Yamagishi, S. Sako, T. Toda, T. Masuko, A. W. Black, and K. Tokuda, "Recent development of the HMM-based speech synthesis system (HTS), " in Proc. APSIPA ASC, Oct. 2009, pp. 121-130.
- (2009) Proc. APSIPA ASC , pp. 121-130
- Zen, H.¹ Oura, K.² Nose, T.³ Yamagishi, J.⁴ Sako, S.⁵ Toda, T.⁶ Masuko, T.⁷ Black, A.W.⁸ Tokuda, K.⁹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.