SCOPUS Information Search Platform

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

2014, Pages 2559-2563

Integration of speaker and pitch adaptive training for HMM-based singing voice synthesis

(6) Shirota, Kanako (a); Nakamura, Kazuhiro (a); Hashimoto, Kei (a); Oura, Keiichiro (a); Nankaku, Yoshihiko (a); Tokuda, Keiichi (a)

(a) NAGOYA INSTITUTE OF TECHNOLOGY (Japan)

Author keywords

hidden Markov model; pitch adaptive training; singing voice synthesis; speaker adaptive training

Indexed keywords

SIGNAL PROCESSING;

ADAPTIVE TRAINING; CONTEXT DEPENDENT; CONTEXTUAL FACTORS; HIDDEN MARKOV MODELS (HMMS); PARAMETRIC APPROACH; PITCH ADAPTIVE TRAININGS; SINGING-VOICE SYNTHESIS; SPEAKER ADAPTIVE TRAININGS;

HIDDEN MARKOV MODELS;

EID: 84905234613
PISSN: 15206149
EISSN: None
Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2014.6854062
Document Type: Conference Paper

Times cited : (17)

References (16)

1
- 85009139544
- Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
- T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," in Proc. of Eurospeech, pp. 2347-2350, 1999.
- (1999) Proc. of Eurospeech , pp. 2347-2350
- Yoshimura, T.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

2
- 0034842740
- Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR
- M. Tamura, T. Masuko, K. Tokuda, and T. Kobayashi, "Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR," in Proc. of ICASSP, pp. 805-808, 2001.
- (2001) Proc. of ICASSP , pp. 805-808
- Tamura, M.¹ Masuko, T.² Tokuda, K.³ Kobayashi, T.⁴

3
- 85135145847
- Speaker interpolation in HMM-based speech synthesis system
- T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Speaker interpolation in HMM-based speech synthesis system," in Proc. of Eurospeech, vol. 5, pp. 2523-2526, 1997.
- (1997) Proc. of Eurospeech , vol.5 , pp. 2523-2526
- Yoshimura, T.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

4
- 85009257840
- Eigenvoice for HMM-based speech synthesis
- K. Shichiri, A. Sawabe, T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Eigenvoice for HMM-based speech synthesis," in Proc. of ICSLP, vol. 1, pp. 1269-1272, 2002.
- (2002) Proc. of ICSLP , vol.1 , pp. 1269-1272
- Shichiri, K.¹ Sawabe, A.² Yoshimura, T.³ Tokuda, K.⁴ Masuko, T.⁵ Kobayashi, T.⁶ Kitamura, T.⁷

5
- 50249141145
- An HMM-based singing voice synthesis system
- K. Saino, H. Zen, Y. Nankaku, A. Lee, and K. Tokuda, "An HMM-based singing voice synthesis system," in Proc. of Interspeech, pp. 1141-1144, 2006.
- (2006) Proc. of Interspeech , pp. 1141-1144
- Saino, K.¹ Zen, H.² Nankaku, Y.³ Lee, A.⁴ Tokuda, K.⁵

6
- 84876667508
- Recent development of the HMM-based singing voice synthesis system - Sinsy
- K. Oura, A. Mase, T. Yamada, S. Muto, Y. Nankaku, and K. Tokuda, "Recent development of the HMM-based singing voice synthesis system - Sinsy," in Proc. The 7th ISCA Tutorial and Research Workshop on Speech Synthesis, pp. 211-216, 2010.
- (2010) Proc. The 7th ISCA Tutorial and Research Workshop on Speech Synthesis , pp. 211-216
- Oura, K.¹ Mase, A.² Yamada, T.³ Muto, S.⁴ Nankaku, Y.⁵ Tokuda, K.⁶

7
- 33847129573
- Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training
- J. Yamagishi and T. Kobayashi, "Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training," IEICE Trans. Inf. & Syst., vol. E90-D, no. 2, pp. 533-543, 2007.
- (2007) IEICE Trans. Inf. & Syst. , vol.E90-D , Issue.2 , pp. 533-543
- Yamagishi, J.¹ Kobayashi, T.²

8
- 84867200235
- Generating natural F0 trajectory with additive trees
- Y. Qian, H. Liang, and F. K. Soong, "Generating natural F0 trajectory with additive trees," in Proc. of Interspeech, pp. 2126-2129, 2008.
- (2008) Proc. of Interspeech , pp. 2126-2129
- Qian, Y.¹ Liang, H.² Soong, F.K.³

9
- 70450161503
- Context-dependent additive log F0 model for HMM-based speech synthesis
- H. Zen and N. Braunschweiler, "Context-dependent additive log F0 model for HMM-based speech synthesis," in Proc. of Interspeech, pp. 2091-2094, 2009.
- (2009) Proc. of Interspeech , pp. 2091-2094
- Zen, H.¹ Braunschweiler, N.²

10
- 84867584634
- Pitch adaptive training for HMM-based singing voice synthesis
- K. Oura, A. Mase, Y. Nankaku, and K. Tokuda, "Pitch adaptive training for HMM-based singing voice synthesis," in Proc. of ICASSP, pp. 5377-5380, 2012.
- (2012) Proc. of ICASSP , pp. 5377-5380
- Oura, K.¹ Mase, A.² Nankaku, Y.³ Tokuda, K.⁴

11
- 0028996993
- Speech parameter generation from HMM using dynamic features
- K. Tokuda, T. Kobayashi, and S. Imai, "Speech parameter generation from HMM using dynamic features," in Proc. of ICASSP, pp. 660-663, 1995.
- (1995) Proc. of ICASSP , pp. 660-663
- Tokuda, K.¹ Kobayashi, T.² Imai, S.³

12
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Computer Speech and Language, vol. 12, no. 2, pp. 75-98, 1998.
- (1998) Computer Speech and Language , vol.12 , Issue.2 , pp. 75-98
- Gales, M.J.F.¹

13
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Communication, vol. 27, pp. 187-207, 1999.
- (1999) Speech Communication , vol.27 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² de Cheveigné, A.³

14
- 44449177634
- A hidden semi-Markov model-based speech synthesis system
- H. Zen, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "A hidden semi-Markov model-based speech synthesis system," IEICE Trans. Inf. & Syst., vol. E90-D, no. 5, pp. 825-834, 2007.
- (2007) IEICE Trans. Inf. & Syst. , vol.E90-D , Issue.5 , pp. 825-834
- Zen, H.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

15
- 0032678076
- Hidden Markov models based on multi-space probability distribution for pitch pattern modeling
- K. Tokuda, T. Masuko, N. Miyazaki, and T. Kobayashi, "Hidden Markov models based on multi-space probability distribution for pitch pattern modeling," in Proc. of ICASSP, vol. 1, pp. 229-232, 1999.
- (1999) Proc. of ICASSP , vol.1 , pp. 229-232
- Tokuda, K.¹ Masuko, T.² Miyazaki, N.³ Kobayashi, T.⁴

16
- 0033906251
- MDL-based context-dependent subword modeling for speech recognition
- K. Shinoda and T. Watanabe, "MDL-based context-dependent subword modeling for speech recognition," J. Acoust. Soc. Jpn. (E), vol. 21, no. 2, pp. 76-86, 2000.
- (2000) J. Acoust. Soc. Jpn. (E) , vol.21 , Issue.2 , pp. 76-86
- Shinoda, K.¹ Watanabe, T.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.